HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Lasse Espeholt; Hubert Soyer; Remi Munos; Karen Simonyan; Volodymir Mnih; Tom Ward; Yotam Doron; Vlad Firoiu; Tim Harley; Iain Dunning; Shane Legg; Koray Kavukcuoglu

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Abstract

In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time. We have developed a new distributed agent IMPALA (Importance Weighted Actor-Learner Architecture) that not only uses resources more efficiently in single-machine training but also scales to thousands of machines without sacrificing data efficiency or resource utilisation. We achieve stable learning at high throughput by combining decoupled acting and learning with a novel off-policy correction method called V-trace. We demonstrate the effectiveness of IMPALA for multi-task reinforcement learning on DMLab-30 (a set of 30 tasks from the DeepMind Lab environment (Beattie et al., 2016)) and Atari-57 (all available Atari games in Arcade Learning Environment (Bellemare et al., 2013a)). Our results show that IMPALA is able to achieve better performance than previous agents with less data, and crucially exhibits positive transfer between tasks as a result of its multi-task approach.

Code Repositories

google-deepmind/streetlearn
tf
Mentioned in GitHub
deepmind/streetlearn
tf
Mentioned in GitHub
facebookresearch/torchbeast
pytorch
Mentioned in GitHub
deepmind/scalable_agent
Official
tf
Mentioned in GitHub
villinvic/Georges
Mentioned in GitHub
windstrip/DeepMind-StreetLearn
tf
Mentioned in GitHub
Sheepsody/Batched-Impala-PyTorch
pytorch
Mentioned in GitHub
heiner/scalable_agent
tf
Mentioned in GitHub
google-research/valan
tf
Mentioned in GitHub
facebookresearch/gala
pytorch
Mentioned in GitHub
michaelnny/deep_rl_zoo
pytorch
Mentioned in GitHub
google-deepmind/scalable_agent
tf
Mentioned in GitHub
haje01/impala
pytorch
Mentioned in GitHub
threewisemonkeys-as/torched_impala
pytorch
Mentioned in GitHub
urw7rs/spiralpp
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
atari-games-on-atari-2600-alienIMPALA (deep)
Score: 15962.10
atari-games-on-atari-2600-amidarIMPALA (deep)
Score: 1554.79
atari-games-on-atari-2600-assaultIMPALA (deep)
Score: 19148.47
atari-games-on-atari-2600-asterixIMPALA (deep)
Score: 300732.00
atari-games-on-atari-2600-asteroidsIMPALA (deep)
Score: 108590.05
atari-games-on-atari-2600-atlantisIMPALA (deep)
Score: 849967.50
atari-games-on-atari-2600-bank-heistIMPALA (deep)
Score: 1223.15
atari-games-on-atari-2600-battle-zoneIMPALA (deep)
Score: 20885.00
atari-games-on-atari-2600-beam-riderIMPALA (deep)
Score: 32463.47
atari-games-on-atari-2600-berzerkIMPALA (deep)
Score: 1852.70
atari-games-on-atari-2600-bowlingIMPALA (deep)
Score: 59.92
atari-games-on-atari-2600-boxingIMPALA (deep)
Score: 99.96
atari-games-on-atari-2600-breakoutIMPALA (deep)
Score: 787.34
atari-games-on-atari-2600-centipedeIMPALA (deep)
Score: 11049.75
atari-games-on-atari-2600-chopper-commandIMPALA (deep)
Score: 28255.00
atari-games-on-atari-2600-crazy-climberIMPALA (deep)
Score: 136950.00
atari-games-on-atari-2600-defenderIMPALA (deep)
Score: 185203.00
atari-games-on-atari-2600-demon-attackIMPALA (deep)
Score: 132826.98
atari-games-on-atari-2600-double-dunkIMPALA (deep)
Score: -0.33
atari-games-on-atari-2600-enduroIMPALA (deep)
Score: 0.00
atari-games-on-atari-2600-fishing-derbyIMPALA (deep)
Score: 44.85
atari-games-on-atari-2600-freewayIMPALA (deep)
Score: 0.00
atari-games-on-atari-2600-frostbiteIMPALA (deep)
Score: 317.75
atari-games-on-atari-2600-gopherIMPALA (deep)
Score: 66782.30
atari-games-on-atari-2600-gravitarIMPALA (deep)
Score: 359.50
atari-games-on-atari-2600-heroIMPALA (deep)
Score: 33730.55
atari-games-on-atari-2600-ice-hockeyIMPALA (deep)
Score: 3.48
atari-games-on-atari-2600-james-bondIMPALA (deep)
Score: 601.50
atari-games-on-atari-2600-kangarooIMPALA (deep)
Score: 1632.00
atari-games-on-atari-2600-krullIMPALA (deep)
Score: 8147.40
atari-games-on-atari-2600-kung-fu-masterIMPALA (deep)
Score: 43375.50
atari-games-on-atari-2600-montezumas-revengeIMPALA (deep)
Score: 0.00
atari-games-on-atari-2600-ms-pacmanIMPALA (deep)
Score: 7342.32
atari-games-on-atari-2600-name-this-gameIMPALA (deep)
Score: 21537.20
atari-games-on-atari-2600-phoenixIMPALA (deep)
Score: 210996.45
atari-games-on-atari-2600-pitfallIMPALA (deep)
Score: -1.66
atari-games-on-atari-2600-pongIMPALA (deep)
Score: 20.98
atari-games-on-atari-2600-private-eyeIMPALA (deep)
Score: 98.50
atari-games-on-atari-2600-qbertIMPALA (deep)
Score: 351200.12
atari-games-on-atari-2600-river-raidIMPALA (deep)
Score: 29608.05
atari-games-on-atari-2600-road-runnerIMPALA (deep)
Score: 57121.00
atari-games-on-atari-2600-robotankIMPALA (deep)
Score: 12.96
atari-games-on-atari-2600-seaquestIMPALA (deep)
Score: 1753.20
atari-games-on-atari-2600-skiingIMPALA (deep)
Score: -10180.38
atari-games-on-atari-2600-solarisIMPALA (deep)
Score: 2365.00
atari-games-on-atari-2600-space-invadersIMPALA (deep)
Score: 43595.78
atari-games-on-atari-2600-star-gunnerIMPALA (deep)
Score: 200625.00
atari-games-on-atari-2600-surroundIMPALA (deep)
Score: 7.56
atari-games-on-atari-2600-tennisIMPALA (deep)
Score: 0.55
atari-games-on-atari-2600-time-pilotIMPALA (deep)
Score: 48481.50
atari-games-on-atari-2600-tutankhamIMPALA (deep)
Score: 292.11
atari-games-on-atari-2600-up-and-downIMPALA (deep)
Score: 332546.75
atari-games-on-atari-2600-ventureIMPALA (deep)
Score: 0.00
atari-games-on-atari-2600-video-pinballIMPALA (deep)
Score: 572898.27
atari-games-on-atari-2600-wizard-of-worIMPALA (deep)
Score: 9157.50
atari-games-on-atari-2600-yars-revengeIMPALA (deep)
Score: 84231.14
atari-games-on-atari-2600-zaxxonIMPALA (deep)
Score: 32935.50
atari-games-on-atari-57IMPALA, deep
Human World Record Breakthrough: 3
Mean Human Normalized Score: 957.34%
atari-games-on-atari-gamesIMPALA, deep
Mean Human Normalized Score: 957.34%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures | Papers | HyperAI