Distributional Reinforcement Learning with Quantile Regression

Will Dabney; Mark Rowland; Marc G. Bellemare; Rémi Munos

Abstract

In reinforcement learning, an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in the observed long-term return. Traditionally, reinforcement learning algorithms average over this randomness to estimate the value function. In this paper, we build on recent work advocating a distributional approach to reinforcement learning in which the distribution over returns is modeled explicitly instead of only estimating the mean. That is, we examine methods of learning the value distribution instead of the value function. We give results that close a number of gaps between the theoretical and algorithmic results given by Bellemare, Dabney, and Munos (2017). First, we extend existing results to the approximate distribution setting. Second, we present a novel distributional reinforcement learning algorithm consistent with our theoretical formulation. Finally, we evaluate this new algorithm on the Atari 2600 games, observing that it significantly outperforms many of the recent improvements on DQN, including the related distributional algorithm C51.
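To make the idea of modeling the return distribution rather than only its mean more concrete, here is a minimal sketch of generic quantile regression on a fixed set of sampled returns. It is not the paper's algorithm: the abstract gives no implementation details, so the function names (`quantile_midpoints`, `pinball_loss`, `fit_quantiles`), the choice of midpoint quantile levels, the step size, and the toy data are all assumptions made for illustration. There is no neural network and no Bellman update here, only the quantile regression step itself.

```python
def quantile_midpoints(n):
    """Hypothetical helper: midpoint quantile levels (2i + 1) / (2n), i = 0..n-1."""
    return [(2 * i + 1) / (2 * n) for i in range(n)]


def pinball_loss(theta, sample, tau):
    """Quantile regression (pinball) loss of estimate theta for a single sample."""
    u = sample - theta
    # Under-estimates (u >= 0) are weighted by tau, over-estimates by (1 - tau).
    return u * (tau - (1.0 if u < 0 else 0.0))


def fit_quantiles(returns, n_quantiles=5, lr=0.5, steps=200):
    """Fit n_quantiles locations by full-batch subgradient descent on the pinball loss.

    The subgradient of the average loss with respect to theta is
    (fraction of returns below theta) - tau, so each step moves theta
    by lr * (tau - fraction_below).
    """
    taus = quantile_midpoints(n_quantiles)
    thetas = [0.0] * n_quantiles
    for _ in range(steps):
        for i, tau in enumerate(taus):
            below = sum(1 for z in returns if z < thetas[i]) / len(returns)
            thetas[i] += lr * (tau - below)
    return taus, thetas


# Toy data (an assumption): returns spread evenly over [0, 1].
returns = [i / 100 for i in range(101)]
taus, thetas = fit_quantiles(returns)

# The fitted locations summarize the whole distribution; averaging them
# gives back an estimate of the mean return, i.e. the usual value estimate.
value_estimate = sum(thetas) / len(thetas)
```

In this sketch, each fitted location settles near the return level below which roughly a fraction `tau` of the samples fall, so the set of locations describes the shape of the distribution while their average still approximates the expected return.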

Code Repositories

| Repository | Framework | Source |
| --- | --- | --- |
| facebookresearch/Horizon | pytorch | Mentioned in GitHub |
| ku2482/rljax | jax | Mentioned in GitHub |
| sannebh/riskaverse | pytorch | Mentioned in GitHub |
| marload/dist-rl-tf2 | tf | Mentioned in GitHub |
| sannebh/riskaverse_rl | pytorch | Mentioned in GitHub |
| Abdelhamid-bouzid/Distributional-RL | pytorch | Mentioned in GitHub |
| ku2482/fqf-iqn-qrdqn.pytorch | pytorch | Mentioned in GitHub |
| facebookresearch/ReAgent | pytorch | Mentioned in GitHub |
| Kchu/DeepRL_CK | pytorch | Mentioned in GitHub |
| robinzixuan/IQN_Agent | pytorch | Mentioned in GitHub |
| marload/DistRL-TensorFlow2 | tf | Mentioned in GitHub |

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| atari-games-on-atari-2600-alien | QR-DQN-1 | Score: 4871 |
| atari-games-on-atari-2600-amidar | QR-DQN-1 | Score: 1641 |
| atari-games-on-atari-2600-assault | QR-DQN-1 | Score: 22012 |
| atari-games-on-atari-2600-asterix | QR-DQN-1 | Score: 261025 |
| atari-games-on-atari-2600-asteroids | QR-DQN-1 | Score: 4226 |
| atari-games-on-atari-2600-atlantis | QR-DQN-1 | Score: 971850 |
| atari-games-on-atari-2600-bank-heist | QR-DQN-1 | Score: 1249 |
| atari-games-on-atari-2600-battle-zone | QR-DQN-1 | Score: 39268 |
| atari-games-on-atari-2600-beam-rider | QR-DQN-1 | Score: 34821 |
| atari-games-on-atari-2600-berzerk | QR-DQN-1 | Score: 3117 |
| atari-games-on-atari-2600-bowling | QR-DQN-1 | Score: 77.2 |
| atari-games-on-atari-2600-boxing | QR-DQN-1 | Score: 99.9 |
| atari-games-on-atari-2600-breakout | QR-DQN-1 | Score: 742 |
| atari-games-on-atari-2600-centipede | QR-DQN-1 | Score: 12447 |
| atari-games-on-atari-2600-chopper-command | QR-DQN-1 | Score: 14667 |
| atari-games-on-atari-2600-crazy-climber | QR-DQN-1 | Score: 161196 |
| atari-games-on-atari-2600-defender | QR-DQN-1 | Score: 47887 |
| atari-games-on-atari-2600-demon-attack | QR-DQN-1 | Score: 121551 |
| atari-games-on-atari-2600-double-dunk | QR-DQN-1 | Score: 21.9 |
| atari-games-on-atari-2600-enduro | QR-DQN-1 | Score: 2355 |
| atari-games-on-atari-2600-fishing-derby | QR-DQN-1 | Score: 39 |
| atari-games-on-atari-2600-freeway | QR-DQN-1 | Score: 34 |
| atari-games-on-atari-2600-frostbite | QR-DQN-1 | Score: 4384 |
| atari-games-on-atari-2600-gopher | QR-DQN-1 | Score: 113585 |
| atari-games-on-atari-2600-gravitar | QR-DQN-1 | Score: 995 |
| atari-games-on-atari-2600-hero | QR-DQN-1 | Score: 21395 |
| atari-games-on-atari-2600-ice-hockey | QR-DQN-1 | Score: -1.7 |
| atari-games-on-atari-2600-james-bond | QR-DQN-1 | Score: 4703 |
| atari-games-on-atari-2600-kangaroo | QR-DQN-1 | Score: 15356 |
| atari-games-on-atari-2600-krull | QR-DQN-1 | Score: 11447 |
| atari-games-on-atari-2600-kung-fu-master | QR-DQN-1 | Score: 76642 |
| atari-games-on-atari-2600-montezumas-revenge | QR-DQN-1 | Score: 0 |
| atari-games-on-atari-2600-ms-pacman | QR-DQN-1 | Score: 5821 |
| atari-games-on-atari-2600-name-this-game | QR-DQN-1 | Score: 21890 |
| atari-games-on-atari-2600-phoenix | QR-DQN-1 | Score: 16585 |
| atari-games-on-atari-2600-pitfall | QR-DQN-1 | Score: 0 |
| atari-games-on-atari-2600-pong | QR-DQN-1 | Score: 21 |
| atari-games-on-atari-2600-private-eye | QR-DQN-1 | Score: 350 |
| atari-games-on-atari-2600-qbert | QR-DQN-1 | Score: 572510 |
| atari-games-on-atari-2600-river-raid | QR-DQN-1 | Score: 17571 |
| atari-games-on-atari-2600-road-runner | QR-DQN-1 | Score: 64262 |
| atari-games-on-atari-2600-robotank | QR-DQN-1 | Score: 59.4 |
| atari-games-on-atari-2600-seaquest | QR-DQN-1 | Score: 8268 |
| atari-games-on-atari-2600-skiing | QR-DQN-1 | Score: -9324 |
| atari-games-on-atari-2600-solaris | QR-DQN-1 | Score: 6740 |
| atari-games-on-atari-2600-space-invaders | QR-DQN-1 | Score: 20972 |
| atari-games-on-atari-2600-star-gunner | QR-DQN-1 | Score: 77495 |
| atari-games-on-atari-2600-surround | QR-DQN-1 | Score: 8.2 |
| atari-games-on-atari-2600-tennis | QR-DQN-1 | Score: 23.6 |
| atari-games-on-atari-2600-time-pilot | QR-DQN-1 | Score: 10345 |
| atari-games-on-atari-2600-tutankham | QR-DQN-1 | Score: 297 |
| atari-games-on-atari-2600-up-and-down | QR-DQN-1 | Score: 71260 |
| atari-games-on-atari-2600-venture | QR-DQN-1 | Score: 43.9 |
| atari-games-on-atari-2600-video-pinball | QR-DQN-1 | Score: 705662 |
| atari-games-on-atari-2600-wizard-of-wor | QR-DQN-1 | Score: 25061 |
| atari-games-on-atari-2600-yars-revenge | QR-DQN-1 | Score: 26447 |
| atari-games-on-atari-2600-zaxxon | QR-DQN-1 | Score: 13112 |
