Accelerating Deep Neuroevolution: Train Atari in Hours on a Single Personal Computer

Accelerating Deep Neuroevolution: Train Atari in Hours on a Single Personal Computer

  • April 23, 2018
Table of Contents

Accelerating Deep Neuroevolution: Train Atari in Hours on a Single Personal Computer

We recently published an article announcing five papers on deep neuroevolution, including the discovery that genetic algorithms can solve deep reinforcement learning problems as well as popular alternatives, such as deep Q-learning and policy gradients. That work follows on Salimans et al. 2017, which showed the same for evolution strategies (ES), another neuroevolution algorithm. We further described how ES can be improved by adding exploration in the form of a pressure for agents to be novel, and how ES relates to gradient descent.

All of that research was computationally expensive: It was conducted on between 720 and 3000 CPUs distributed across a large, high-performance computing cluster, seemingly putting deep neuroevolution out of reach for most researchers, students, companies, and hobbyists.

Source: uber.com

Tags :
Share :
comments powered by Disqus

Related Posts

This AI Will Turn Your Dog Into a Cat

This AI Will Turn Your Dog Into a Cat

As detailed in a paper published to arXiv, the neural net is actually a generative adversarial network (GAN), which is a way of training a machine learning algorithm without human supervision. In GANs, two neural nets are pitted against one another: One neural net generates new images and tries to trick the other neural net into thinking the images are real. If the other neural net is able to tell the generated images are false

Read More
Machine Learning’s ‘Amazing’ Ability to Predict Chaos

Machine Learning’s ‘Amazing’ Ability to Predict Chaos

Half a century ago, the pioneers of chaos theory discovered that the “butterfly effect” makes long-term prediction impossible. Even the smallest perturbation to a complex system (like the weather, the economy or just about anything else) can touch off a concatenation of events that leads to a dramatically divergent future. Unable to pin down the state of these systems precisely enough to predict how they’ll play out, we live under a veil of uncertainty.

Read More