Curiosity and Procrastination in Reinforcement Learning

October 25, 2018

Table of Contents

Episodic Curiosity through Reachability: Observations are added to memory, reward is computed based on how far the current observation is from the most similar observation in memory. The agent receives more reward for seeing observations which are not yet represented in memory.

Source: googleblog.com

AI Company Accused of Using Humans to Fake Its AI

Preview 7 Open Source Projects from the Uber Open Summit

The What-If Tool: Code-Free Probing of Machine Learning Models

How would changes to a datapoint affect my model’s prediction? Does it perform differently for various groups–for example, historically marginalized people? How diverse is the dataset I am testing my model on?

Curiosity and Procrastination in Reinforcement Learning

Tags :

Share :

Related Posts

AI Company Accused of Using Humans to Fake Its AI

Preview 7 Open Source Projects from the Uber Open Summit

The What-If Tool: Code-Free Probing of Machine Learning Models