HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Asynchronous Methods for Deep Reinforcement Learning

Volodymyr Mnih; Adrià Puigdomènech Badia; Mehdi Mirza; Alex Graves; Timothy P. Lillicrap; Tim Harley; David Silver; Koray Kavukcuoglu

Asynchronous Methods for Deep Reinforcement Learning

Abstract

We propose a conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers. We present asynchronous variants of four standard reinforcement learning algorithms and show that parallel actor-learners have a stabilizing effect on training allowing all four methods to successfully train neural network controllers. The best performing method, an asynchronous variant of actor-critic, surpasses the current state-of-the-art on the Atari domain while training for half the time on a single multi-core CPU instead of a GPU. Furthermore, we show that asynchronous actor-critic succeeds on a wide variety of continuous motor control problems as well as on a new task of navigating random 3D mazes using a visual input.

Code Repositories

wtingda/DeepRLBreakout
tf
Mentioned in GitHub
Kaixhin/ACER
pytorch
Mentioned in GitHub
hulanwin/A3C-DRL
tf
Mentioned in GitHub
nvlabs/gbrl_sb3
pytorch
Mentioned in GitHub
miyosuda/async_deep_reinforce
tf
Mentioned in GitHub
AI-RG/rl-experiments
tf
Mentioned in GitHub
muupan/async-rl
Mentioned in GitHub
bkhmsi/meta-rl-harlow
pytorch
Mentioned in GitHub
amanda-lambda/hack-flappy-bird-drl
pytorch
Mentioned in GitHub
ofekluis/sonic_project_ss19
Mentioned in GitHub
dickreuter/neuron_poker
Mentioned in GitHub
deepsense-ai/Distributed-BA3C
tf
Mentioned in GitHub
avillemin/Minecraft-AI
pytorch
Mentioned in GitHub
marload/deep-rl-tf2
tf
Mentioned in GitHub
alexmlamb/blocks_rl_gru_setup
pytorch
Mentioned in GitHub
Kaixhin/NoisyNet-A3C
pytorch
Mentioned in GitHub
joshiatul/game_playing
Mentioned in GitHub
Khrylx/PyTorch-RL
pytorch
Mentioned in GitHub
uvipen/Super-mario-bros-A3C-pytorch
pytorch
Mentioned in GitHub
mavischer/DRRL
pytorch
Mentioned in GitHub
gungui98/deeprl-a3c-ai2thor
tf
Mentioned in GitHub
Nasdin/ReinforcementLearning-AtariGame
pytorch
Mentioned in GitHub
amanda-lambda/drl-experiments
pytorch
Mentioned in GitHub
yukezhu/tensorflow-reinforce
tf
Mentioned in GitHub
chainer/chainerrl
pytorch
Mentioned in GitHub
JulT1/RL_SS19
tf
Mentioned in GitHub
Zartris/TD3_continuous_control
pytorch
Mentioned in GitHub
Jzar/Space-Invaders-DQN
tf
Mentioned in GitHub
Sheepsody/Batched-Impala-PyTorch
pytorch
Mentioned in GitHub
ikostrikov/pytorch-rl
pytorch
Mentioned in GitHub
vladfi1/universe-starter-agent
tf
Mentioned in GitHub
PaulCharnay/Projet_AIF
Mentioned in GitHub
ikostrikov/pytorch-a3c
pytorch
Mentioned in GitHub
tensorlayer/RLzoo
tf
Mentioned in GitHub
liuyuezhang/pyrl
pytorch
Mentioned in GitHub
sainijagjit/A3C-Pytorch
pytorch
Mentioned in GitHub
dsinghnegi/atari_RL_agent
pytorch
Mentioned in GitHub
brett-daley/fast-dqn
tf
Mentioned in GitHub
qihongl/demo-advantage-actor-critic
pytorch
Mentioned in GitHub
4rChon/NL-FuN
tf
Mentioned in GitHub
lcswillems/torch-ac
pytorch
Mentioned in GitHub
InSpaceAI/RL-Zoo
tf
Mentioned in GitHub
khanhptnk/bandit-nmt
pytorch
Mentioned in GitHub
openai/universe-starter-agent
tf
Mentioned in GitHub
amaudruz/RL_openaigym
pytorch
Mentioned in GitHub
bentrevett/pytorch-rl
pytorch
Mentioned in GitHub
qihongl/dlstm-demo
pytorch
Mentioned in GitHub
sharan-dce/A3C
tf
Mentioned in GitHub
NervanaSystems/coach
tf
Mentioned in GitHub
natsumeS/analysis
Mentioned in GitHub
traai/async-deep-rl
tf
Mentioned in GitHub
marload/DeepRL-TensorFlow2
tf
Mentioned in GitHub
cdesilv1/sc2_ai_cdes
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
atari-games-on-atari-2600-alienA3C LSTM hs
Score: 945.3
atari-games-on-atari-2600-alienA3C FF hs
Score: 518.4
atari-games-on-atari-2600-alienA3C FF (1 day) hs
Score: 182.1
atari-games-on-atari-2600-amidarA3C FF (1 day) hs
Score: 283.9
atari-games-on-atari-2600-amidarA3C LSTM hs
Score: 173.0
atari-games-on-atari-2600-amidarA3C FF hs
Score: 263.9
atari-games-on-atari-2600-assaultA3C LSTM hs
Score: 14497.9
atari-games-on-atari-2600-assaultA3C FF hs
Score: 5474.9
atari-games-on-atari-2600-assaultA3C FF (1 day) hs
Score: 3746.1
atari-games-on-atari-2600-asterixA3C FF hs
Score: 22140.5
atari-games-on-atari-2600-asterixA3C LSTM hs
Score: 17244.5
atari-games-on-atari-2600-asterixA3C FF (1 day) hs
Score: 6723
atari-games-on-atari-2600-asteroidsA3C LSTM hs
Score: 5093.1
atari-games-on-atari-2600-asteroidsA3C FF (1 day) hs
Score: 3009.4
atari-games-on-atari-2600-asteroidsA3C FF hs
Score: 4474.5
atari-games-on-atari-2600-atlantisA3C LSTM hs
Score: 875822.0
atari-games-on-atari-2600-atlantisA3C FF hs
Score: 911091.0
atari-games-on-atari-2600-atlantisA3C FF (1 day) hs
Score: 772392.0
atari-games-on-atari-2600-bank-heistA3C LSTM hs
Score: 932.8
atari-games-on-atari-2600-bank-heistA3C FF (1 day) hs
Score: 946.0
atari-games-on-atari-2600-bank-heistA3C FF hs
Score: 970.1
atari-games-on-atari-2600-battle-zoneA3C FF hs
Score: 12950.0
atari-games-on-atari-2600-battle-zoneA3C FF (1 day) hs
Score: 11340.0
atari-games-on-atari-2600-battle-zoneA3C LSTM hs
Score: 20760.0
atari-games-on-atari-2600-beam-riderA3C LSTM hs
Score: 24622.2
atari-games-on-atari-2600-beam-riderA3C FF (1 day) hs
Score: 13235.9
atari-games-on-atari-2600-beam-riderA3C FF hs
Score: 22707.9
atari-games-on-atari-2600-berzerkA3C FF (1 day) hs
Score: 1433.4
atari-games-on-atari-2600-berzerkA3C FF hs
Score: 817.9
atari-games-on-atari-2600-berzerkA3C LSTM hs
Score: 862.2
atari-games-on-atari-2600-bowlingA3C LSTM hs
Score: 41.8
atari-games-on-atari-2600-bowlingA3C FF hs
Score: 35.1
atari-games-on-atari-2600-bowlingA3C FF (1 day) hs
Score: 36.2
atari-games-on-atari-2600-boxingA3C LSTM hs
Score: 37.3
atari-games-on-atari-2600-boxingA3C FF hs
Score: 59.8
atari-games-on-atari-2600-boxingA3C FF (1 day) hs
Score: 33.7
atari-games-on-atari-2600-breakoutA3C FF (1 day) hs
Score: 551.6
atari-games-on-atari-2600-breakoutA3C LSTM hs
Score: 766.8
atari-games-on-atari-2600-breakoutA3C FF hs
Score: 681.9
atari-games-on-atari-2600-centipedeA3C FF (1 day) hs
Score: 3306.5
atari-games-on-atari-2600-centipedeA3C LSTM hs
Score: 1997.0
atari-games-on-atari-2600-centipedeA3C FF hs
Score: 3755.8
atari-games-on-atari-2600-chopper-commandA3C LSTM hs
Score: 10150.0
atari-games-on-atari-2600-chopper-commandA3C FF (1 day) hs
Score: 4669.0
atari-games-on-atari-2600-chopper-commandA3C FF hs
Score: 7021.0
atari-games-on-atari-2600-crazy-climberA3C FF (1 day) hs
Score: 101624.0
atari-games-on-atari-2600-crazy-climberA3C FF hs
Score: 112646.0
atari-games-on-atari-2600-crazy-climberA3C LSTM hs
Score: 138518.0
atari-games-on-atari-2600-demon-attackA3C FF (1 day) hs
Score: 84997.5
atari-games-on-atari-2600-demon-attackA3C LSTM hs
Score: 115201.9
atari-games-on-atari-2600-demon-attackA3C FF hs
Score: 113308.4
atari-games-on-atari-2600-double-dunkA3C FF (1 day) hs
Score: 0.1
atari-games-on-atari-2600-double-dunkA3C FF hs
Score: -0.1
atari-games-on-atari-2600-double-dunkA3C LSTM hs
Score: 0.1
atari-games-on-atari-2600-enduroA3C FF hs
Score: -82.5
atari-games-on-atari-2600-enduroA3C LSTM hs
Score: -82.5
atari-games-on-atari-2600-enduroA3C FF (1 day) hs
Score: -82.2
atari-games-on-atari-2600-fishing-derbyA3C FF hs
Score: 18.8
atari-games-on-atari-2600-fishing-derbyA3C LSTM hs
Score: 22.6
atari-games-on-atari-2600-fishing-derbyA3C FF (1 day) hs
Score: 13.6
atari-games-on-atari-2600-freewayA3C FF (1 day) hs
Score: 0.1
atari-games-on-atari-2600-freewayA3C FF hs
Score: 0.1
atari-games-on-atari-2600-freewayA3C LSTM hs
Score: 0.1
atari-games-on-atari-2600-frostbiteA3C LSTM hs
Score: 197.6
atari-games-on-atari-2600-frostbiteA3C FF hs
Score: 190.5
atari-games-on-atari-2600-frostbiteA3C FF (1 day) hs
Score: 180.1
atari-games-on-atari-2600-gopherA3C FF hs
Score: 10022.8
atari-games-on-atari-2600-gopherA3C LSTM hs
Score: 17106.8
atari-games-on-atari-2600-gopherA3C FF (1 day) hs
Score: 8442.8
atari-games-on-atari-2600-gravitarA3C LSTM hs
Score: 320.0
atari-games-on-atari-2600-gravitarA3C FF hs
Score: 303.5
atari-games-on-atari-2600-gravitarA3C FF (1 day) hs
Score: 269.5
atari-games-on-atari-2600-heroA3C FF hs
Score: 32464.1
atari-games-on-atari-2600-heroA3C LSTM hs
Score: 28889.5
atari-games-on-atari-2600-heroA3C FF (1 day) hs
Score: 28765.8
atari-games-on-atari-2600-ice-hockeyA3C LSTM hs
Score: -1.7
atari-games-on-atari-2600-ice-hockeyA3C FF (1 day) hs
Score: -4.7
atari-games-on-atari-2600-ice-hockeyA3C FF hs
Score: -2.8
atari-games-on-atari-2600-james-bondA3C FF (1 day) hs
Score: 351.5
atari-games-on-atari-2600-james-bondA3C LSTM hs
Score: 613.0
atari-games-on-atari-2600-james-bondA3C FF hs
Score: 541.0
atari-games-on-atari-2600-kangarooA3C FF hs
Score: 94.0
atari-games-on-atari-2600-kangarooA3C FF (1 day) hs
Score: 106.0
atari-games-on-atari-2600-kangarooA3C LSTM hs
Score: 125.0
atari-games-on-atari-2600-krullA3C FF hs
Score: 5560.0
atari-games-on-atari-2600-krullA3C LSTM hs
Score: 5911.4
atari-games-on-atari-2600-krullA3C FF (1 day) hs
Score: 8066.6
atari-games-on-atari-2600-kung-fu-masterA3C LSTM hs
Score: 40835.0
atari-games-on-atari-2600-kung-fu-masterA3C FF (1 day) hs
Score: 3046.0
atari-games-on-atari-2600-kung-fu-masterA3C FF hs
Score: 28819.0
atari-games-on-atari-2600-montezumas-revengeA3C FF (1 day) hs
Score: 53
atari-games-on-atari-2600-montezumas-revengeA3C FF hs
Score: 67
atari-games-on-atari-2600-montezumas-revengeA3C LSTM hs
Score: 41
atari-games-on-atari-2600-ms-pacmanA3C FF hs
Score: 653.7
atari-games-on-atari-2600-ms-pacmanA3C LSTM hs
Score: 850.7
atari-games-on-atari-2600-ms-pacmanA3C FF (1 day) hs
Score: 594.4
atari-games-on-atari-2600-name-this-gameA3C LSTM hs
Score: 12093.7
atari-games-on-atari-2600-name-this-gameA3C FF hs
Score: 10476.1
atari-games-on-atari-2600-name-this-gameA3C FF (1 day) hs
Score: 5614.0
atari-games-on-atari-2600-pongA3C FF (1 day) hs
Score: 11.4
atari-games-on-atari-2600-pongA3C LSTM hs
Score: 10.7
atari-games-on-atari-2600-pongA3C FF hs
Score: 5.6
atari-games-on-atari-2600-private-eyeA3C FF hs
Score: 206.9
atari-games-on-atari-2600-private-eyeA3C LSTM hs
Score: 421.1
atari-games-on-atari-2600-private-eyeA3C FF (1 day) hs
Score: 194.4
atari-games-on-atari-2600-qbertA3C LSTM hs
Score: 21307.5
atari-games-on-atari-2600-qbertA3C FF hs
Score: 15148.8
atari-games-on-atari-2600-qbertA3C FF (1 day) hs
Score: 13752.3
atari-games-on-atari-2600-river-raidA3C LSTM hs
Score: 6591.9
atari-games-on-atari-2600-river-raidA3C FF hs
Score: 12201.8
atari-games-on-atari-2600-river-raidA3C FF (1 day) hs
Score: 10001.2
atari-games-on-atari-2600-road-runnerA3C LSTM hs
Score: 73949.0
atari-games-on-atari-2600-road-runnerA3C FF hs
Score: 34216.0
atari-games-on-atari-2600-road-runnerA3C FF (1 day) hs
Score: 31769.0
atari-games-on-atari-2600-robotankA3C LSTM hs
Score: 2.6
atari-games-on-atari-2600-robotankA3C FF hs
Score: 32.8
atari-games-on-atari-2600-robotankA3C FF (1 day) hs
Score: 2.3
atari-games-on-atari-2600-seaquestA3C FF (1 day) hs
Score: 2300.2
atari-games-on-atari-2600-seaquestA3C LSTM hs
Score: 1326.1
atari-games-on-atari-2600-seaquestA3C FF hs
Score: 2355.4
atari-games-on-atari-2600-space-invadersA3C FF (1 day) hs
Score: 2214.7
atari-games-on-atari-2600-space-invadersA3C FF hs
Score: 15730.5
atari-games-on-atari-2600-space-invadersA3C LSTM hs
Score: 23846.0
atari-games-on-atari-2600-star-gunnerA3C FF (1 day) hs
Score: 64393.0
atari-games-on-atari-2600-star-gunnerA3C LSTM hs
Score: 164766.0
atari-games-on-atari-2600-star-gunnerA3C FF hs
Score: 138218.0
atari-games-on-atari-2600-tennisA3C LSTM hs
Score: -6.4
atari-games-on-atari-2600-tennisA3C FF hs
Score: -6.3
atari-games-on-atari-2600-tennisA3C FF (1 day) hs
Score: -10.2
atari-games-on-atari-2600-time-pilotA3C FF hs
Score: 12679.0
atari-games-on-atari-2600-time-pilotA3C LSTM hs
Score: 27202.0
atari-games-on-atari-2600-time-pilotA3C FF (1 day) hs
Score: 5825.0
atari-games-on-atari-2600-tutankhamA3C LSTM hs
Score: 144.2
atari-games-on-atari-2600-tutankhamA3C FF hs
Score: 156.3
atari-games-on-atari-2600-tutankhamA3C FF (1 day) hs
Score: 26.1
atari-games-on-atari-2600-up-and-downA3C FF hs
Score: 74705.7
atari-games-on-atari-2600-up-and-downA3C FF (1 day) hs
Score: 54525.4
atari-games-on-atari-2600-up-and-downA3C LSTM hs
Score: 105728.7
atari-games-on-atari-2600-ventureA3C LSTM hs
Score: 25.0
atari-games-on-atari-2600-ventureA3C FF (1 day) hs
Score: 19.0
atari-games-on-atari-2600-ventureA3C FF hs
Score: 23.0
atari-games-on-atari-2600-video-pinballA3C FF (1 day) hs
Score: 185852.6
atari-games-on-atari-2600-video-pinballA3C FF hs
Score: 331628.1
atari-games-on-atari-2600-video-pinballA3C LSTM hs
Score: 470310.5
atari-games-on-atari-2600-wizard-of-worA3C FF (1 day) hs
Score: 5278.0
atari-games-on-atari-2600-wizard-of-worA3C LSTM hs
Score: 18082.0
atari-games-on-atari-2600-wizard-of-worA3C FF hs
Score: 17244.0
atari-games-on-atari-2600-zaxxonA3C FF (1 day) hs
Score: 2659.0
atari-games-on-atari-2600-zaxxonA3C FF hs
Score: 24622.0
atari-games-on-atari-2600-zaxxonA3C LSTM hs
Score: 23519.0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp