Accelerating Deep Neuroevolution: Train Atari in Hours on a Single Personal Computer

Accelerating Deep Neuroevolution: Train Atari in Hours on a Single Personal Computer

  • April 23, 2018
Table of Contents

Accelerating Deep Neuroevolution: Train Atari in Hours on a Single Personal Computer

We recently published an article announcing five papers on deep neuroevolution, including the discovery that genetic algorithms can solve deep reinforcement learning problems as well as popular alternatives, such as deep Q-learning and policy gradients. That work follows on Salimans et al. 2017, which showed the same for evolution strategies (ES), another neuroevolution algorithm. We further described how ES can be improved by adding exploration in the form of a pressure for agents to be novel, and how ES relates to gradient descent.

All of that research was computationally expensive: It was conducted on between 720 and 3000 CPUs distributed across a large, high-performance computing cluster, seemingly putting deep neuroevolution out of reach for most researchers, students, companies, and hobbyists.

Source: uber.com

Tags :
Share :
comments powered by Disqus

Related Posts

A Face-Detection Library in 200 Lines of JavaScript

A Face-Detection Library in 200 Lines of JavaScript

The pico.js library is a JavaScript implementation of the method described in 2013 by Markuš et al. in a technical report. The reference implementation is written in C and available on GitHub: https://github.com/nenadmarkus/pico.

Read More
The AI Revolution Hasn’t Happened Yet

The AI Revolution Hasn’t Happened Yet

Artificial Intelligence (AI) is the mantra of the current era. The phrase is intoned by technologists, academicians, journalists and venture capitalists alike. As with many phrases that cross over from technical academic fields into general circulation, there is significant misunderstanding accompanying the use of the phrase.

Read More
This AI Will Turn Your Dog Into a Cat

This AI Will Turn Your Dog Into a Cat

As detailed in a paper published to arXiv, the neural net is actually a generative adversarial network (GAN), which is a way of training a machine learning algorithm without human supervision. In GANs, two neural nets are pitted against one another: One neural net generates new images and tries to trick the other neural net into thinking the images are real. If the other neural net is able to tell the generated images are false

Read More