Curiosity and Procrastination in Reinforcement Learning

Curiosity and Procrastination in Reinforcement Learning

  • October 25, 2018
Table of Contents

Curiosity and Procrastination in Reinforcement Learning

Episodic Curiosity through Reachability: Observations are added to memory, reward is computed based on how far the current observation is from the most similar observation in memory. The agent receives more reward for seeing observations which are not yet represented in memory.

Source: googleblog.com

Share :
comments powered by Disqus

Related Posts

The What-If Tool: Code-Free Probing of Machine Learning Models

The What-If Tool: Code-Free Probing of Machine Learning Models

How would changes to a datapoint affect my model’s prediction? Does it perform differently for various groups–for example, historically marginalized people? How diverse is the dataset I am testing my model on?

Read More