HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Massively Parallel Methods for Deep Reinforcement Learning

Arun Nair; Praveen Srinivasan; Sam Blackwell; Cagdas Alcicek; Rory Fearon; Alessandro De Maria; Vedavyas Panneershelvam; Mustafa Suleyman; Charles Beattie; Stig Petersen; Shane Legg; Volodymyr Mnih; Koray Kavukcuoglu; David Silver

Massively Parallel Methods for Deep Reinforcement Learning

Abstract

We present the first massively distributed architecture for deep reinforcement learning. This architecture uses four main components: parallel actors that generate new behaviour; parallel learners that are trained from stored experience; a distributed neural network to represent the value function or behaviour policy; and a distributed store of experience. We used our architecture to implement the Deep Q-Network algorithm (DQN). Our distributed algorithm was applied to 49 games from Atari 2600 games from the Arcade Learning Environment, using identical hyperparameters. Our performance surpassed non-distributed DQN in 41 of the 49 games and also reduced the wall-time required to achieve these results by an order of magnitude on most games.

Code Repositories

londoed/Kortex
tf
Mentioned in GitHub
nandomp/AICollaboratory
Mentioned in GitHub
londoed/Gorila
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
atari-games-on-atari-2600-alienGorila
Score: 813.5
atari-games-on-atari-2600-amidarGorila
Score: 189.2
atari-games-on-atari-2600-assaultGorila
Score: 1195.8
atari-games-on-atari-2600-asterixGorila
Score: 3324.7
atari-games-on-atari-2600-asteroidsGorila
Score: 933.6
atari-games-on-atari-2600-atlantisGorila
Score: 629166.5
atari-games-on-atari-2600-bank-heistGorila
Score: 399.4
atari-games-on-atari-2600-battle-zoneGorila
Score: 19938.0
atari-games-on-atari-2600-beam-riderGorila
Score: 3822.1
atari-games-on-atari-2600-bowlingGorila
Score: 54
atari-games-on-atari-2600-boxingGorila
Score: 74.2
atari-games-on-atari-2600-breakoutGorila
Score: 313.0
atari-games-on-atari-2600-centipedeGorila
Score: 6296.9
atari-games-on-atari-2600-chopper-commandGorila
Score: 3191.8
atari-games-on-atari-2600-crazy-climberGorila
Score: 65451.0
atari-games-on-atari-2600-demon-attackGorila
Score: 14880.1
atari-games-on-atari-2600-double-dunkGorila
Score: -11.3
atari-games-on-atari-2600-enduroGorila
Score: 71.0
atari-games-on-atari-2600-fishing-derbyGorila
Score: 4.6
atari-games-on-atari-2600-freewayGorila
Score: 10.2
atari-games-on-atari-2600-frostbiteGorila
Score: 426.6
atari-games-on-atari-2600-gopherGorila
Score: 4373.0
atari-games-on-atari-2600-gravitarGorila
Score: 538.4
atari-games-on-atari-2600-heroGorila
Score: 8963.4
atari-games-on-atari-2600-ice-hockeyGorila
Score: -1.7
atari-games-on-atari-2600-james-bondGorila
Score: 444.0
atari-games-on-atari-2600-kangarooGorila
Score: 1431.0
atari-games-on-atari-2600-krullGorila
Score: 6363.1
atari-games-on-atari-2600-kung-fu-masterGorila
Score: 20620.0
atari-games-on-atari-2600-montezumas-revengeGorila
Score: 84
atari-games-on-atari-2600-ms-pacmanGorila
Score: 1263.0
atari-games-on-atari-2600-name-this-gameGorila
Score: 9238.5
atari-games-on-atari-2600-pongGorila
Score: 16.7
atari-games-on-atari-2600-private-eyeGorila
Score: 2598.6
atari-games-on-atari-2600-qbertGorila
Score: 7089.8
atari-games-on-atari-2600-river-raidGorila
Score: 5310.3
atari-games-on-atari-2600-road-runnerGorila
Score: 43079.8
atari-games-on-atari-2600-robotankGorila
Score: 61.8
atari-games-on-atari-2600-seaquestGorila
Score: 10145.9
atari-games-on-atari-2600-space-invadersGorila
Score: 1183.3
atari-games-on-atari-2600-star-gunnerGorila
Score: 14919.2
atari-games-on-atari-2600-tennisGorila
Score: -0.7
atari-games-on-atari-2600-time-pilotGorila
Score: 8267.8
atari-games-on-atari-2600-tutankhamGorila
Score: 118.5
atari-games-on-atari-2600-up-and-downGorila
Score: 8747.7
atari-games-on-atari-2600-ventureGorila
Score: 523.4
atari-games-on-atari-2600-video-pinballGorila
Score: 112093.4
atari-games-on-atari-2600-wizard-of-worGorila
Score: 10431.0
atari-games-on-atari-2600-zaxxonGorila
Score: 6159.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Massively Parallel Methods for Deep Reinforcement Learning | Papers | HyperAI