4 months ago

Continuous control with deep reinforcement learning

Timothy P. Lillicrap; Jonathan J. Hunt; Alexander Pritzel; Nicolas Heess; Tom Erez; Yuval Tassa; David Silver; Daan Wierstra

Abstract

We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic problems such as cartpole swing-up, dexterous manipulation, legged locomotion and car driving. Our algorithm is able to find policies whose performance is competitive with those found by a planning algorithm with full access to the dynamics of the domain and its derivatives. We further demonstrate that for many of the tasks the algorithm can learn policies end-to-end: directly from raw pixel inputs.
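
As an illustration of the actor-critic, deterministic-policy-gradient idea named in the abstract, the following is a minimal sketch in plain Python, not the authors' implementation. The abstract does not spell out the update rules; the bootstrapped critic target, the slowly tracking target parameters, and the default values `gamma=0.99` and `tau=0.001` are assumptions based on how this algorithm (commonly called DDPG) is usually described. The toy critic `Q(s, a) = -(a - 2s)^2`, the linear actor `mu(s) = theta * s`, and all function names are hypothetical and chosen only so the example runs without any deep learning library.

```python
def td_target(reward, next_q, done, gamma=0.99):
    """Critic regression target: y = r + gamma * Q'(s', mu'(s')).

    next_q stands for the target critic evaluated at the target actor's
    action in the next state; terminal transitions do not bootstrap.
    """
    return reward + (0.0 if done else gamma * next_q)


def soft_update(target_params, online_params, tau=0.001):
    """Move target parameters a small step toward the online parameters:
    target <- tau * online + (1 - tau) * target."""
    return [tau * w + (1.0 - tau) * t
            for t, w in zip(target_params, online_params)]


def actor_gradient(theta, states):
    """Deterministic policy gradient for a toy problem (hypothetical).

    Actor:  mu(s) = theta * s
    Critic: Q(s, a) = -(a - 2 * s) ** 2, assumed known here for clarity.
    The gradient is the batch mean of dQ/da (at a = mu(s)) times dmu/dtheta.
    """
    total = 0.0
    for s in states:
        a = theta * s                   # action chosen by the actor
        dq_da = -2.0 * (a - 2.0 * s)    # critic's gradient w.r.t. the action
        dmu_dtheta = s                  # actor's gradient w.r.t. its parameter
        total += dq_da * dmu_dtheta
    return total / len(states)


def train_actor(states, steps=50, lr=0.1, theta=0.0):
    """Gradient ascent on Q through the actor's parameter."""
    for _ in range(steps):
        theta += lr * actor_gradient(theta, states)
    return theta


if __name__ == "__main__":
    print(train_actor([1.0, -0.5, 2.0]))
```

In this toy setting the critic is maximized at `a = 2s`, so the actor parameter moves toward 2. A full agent would replace the fixed critic with a learned network trained on `td_target` values and would apply `soft_update` to both target networks after each step.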

Code Repositories

IvanVigor/MADDPG-Unity
pytorch
Mentioned in GitHub
facebookresearch/Horizon
pytorch
Mentioned in GitHub
b06b01073/continuous-control
pytorch
Mentioned in GitHub
MathPhysSim/PER-NAF
tf
Mentioned in GitHub
majercakdavid/gym-virtual-quant-trading
pytorch
Mentioned in GitHub
s-sd/task-amenability
tf
Mentioned in GitHub
nav74neet/ddpg_biped
tf
Mentioned in GitHub
YangRui2015/Modular_HER
tf
Mentioned in GitHub
nav74neet/rl4biped
tf
Mentioned in GitHub
T3chy/DDPG
pytorch
Mentioned in GitHub
SaminYeasar/off_policy_ac
pytorch
Mentioned in GitHub
tegg89/magnn
tf
Mentioned in GitHub
siyuofzhou/DDPG
tf
Mentioned in GitHub
abbadka/quadcopter
Mentioned in GitHub
gwthomas/gtml
tf
Mentioned in GitHub
krasing/DRLearningCollaboration
pytorch
Mentioned in GitHub
Sheldonmao/Learning-To-walk
Mentioned in GitHub
Dekki-Aero/DDPG
tf
Mentioned in GitHub
VasaKiDD/TD3-deep-rl-research
pytorch
Mentioned in GitHub
AgrawalAmey/safe-explorer
pytorch
Mentioned in GitHub
dchetelat/acer
pytorch
Mentioned in GitHub
MrSyee/pg-is-all-you-need
Mentioned in GitHub
JonasRSV/PGTensorflow
tf
Mentioned in GitHub
floodsung/DDPG
tf
Mentioned in GitHub
parilo/rl-server
tf
Mentioned in GitHub
thainv0212/re-ddpg
tf
Mentioned in GitHub
Brook1711/RIS_components
tf
Mentioned in GitHub
feruxhi/thoughts
tf
Mentioned in GitHub
JL321/mujo-2DWalker
tf
Mentioned in GitHub
yukezhu/tensorflow-reinforce
tf
Mentioned in GitHub
songrotek/DDPG
tf
Mentioned in GitHub
wwydmanski/rl_tennis
pytorch
Mentioned in GitHub
dpoulopoulos/drl_continuous_control
pytorch
Mentioned in GitHub
fiberleif/nc_ddpg
tf
Mentioned in GitHub
chainer/chainerrl
pytorch
Mentioned in GitHub
dpoulopoulos/drl_collaborate_compete
pytorch
Mentioned in GitHub
rikluost/RL_DQN_Pong
tf
Mentioned in GitHub
shahin-01/vqa-ad
pytorch
Mentioned in GitHub
tegg89/magnet
tf
Mentioned in GitHub
xuyuandong/simple-ddpg
tf
Mentioned in GitHub
nav74neet/ddpg4biped
tf
Mentioned in GitHub
massquantity/DBRL
pytorch
Mentioned in GitHub
MLCS-Yonsei/ddpg-control
tf
Mentioned in GitHub
fhbzc/FishAgentSimulation
tf
Mentioned in GitHub
bitterbloom/Reinforcement-Learning
pytorch
Mentioned in GitHub
stevenpjg/ddpg-aigym
tf
Mentioned in GitHub
FlyienSHaDOw/continuous_control
pytorch
Mentioned in GitHub
Souphis/mobile_robot_rl
tf
Mentioned in GitHub
tegg89/DLCamp_Jeju2018
tf
Mentioned in GitHub
samuelmat19/DDPG-tf2
tf
Mentioned in GitHub
fdcl-gwu/gym-rotor
pytorch
Mentioned in GitHub
krasing/DRLearningContinuousControl
pytorch
Mentioned in GitHub
fshamshirdar/pytorch-rdpg
pytorch
Mentioned in GitHub
ailab-pku/rl-framework
pytorch
Mentioned in GitHub
TheInfamousWayne/ddpg
pytorch
Mentioned in GitHub
Gouet/DDPG_PendulumV1
tf
Mentioned in GitHub
liuyuezhang/pyrl
pytorch
Mentioned in GitHub
Medabid1/RL_Project
pytorch
Mentioned in GitHub
liampetti/DDPG
tf
Mentioned in GitHub
tensorlayer/RLzoo
tf
Mentioned in GitHub
rk1998/robot-sac
tf
Mentioned in GitHub
bacdavid/HER
tf
Mentioned in GitHub
KelvinYang0320/deepbots-panda
pytorch
Mentioned in GitHub
Pechckin/MountainCar
Mentioned in GitHub
hamishs/JAX-RL
jax
Mentioned in GitHub
xyshadow/baseline_ddpg
tf
Mentioned in GitHub
saoudh/Reinforcement-Learning
tf
Mentioned in GitHub
Crevass/Hybrid-Agent
tf
Mentioned in GitHub
flowersteam/curious
Mentioned in GitHub
tegg89/mann
tf
Mentioned in GitHub
ZainRaza14/deepRL
pytorch
Mentioned in GitHub
wpiszlogin/driver_critic
tf
Mentioned in GitHub
shehrum/RL_Continous-Control
pytorch
Mentioned in GitHub
PeterJochem/Deep_RL
tf
Mentioned in GitHub
yusme/DDPG
tf
Mentioned in GitHub
LM095/DDPG-implementation
pytorch
Mentioned in GitHub
dyth/doublegum
jax
Mentioned in GitHub
alhabk/SGEE--pytorch
pytorch
Mentioned in GitHub
madvn/DDPG
tf
Mentioned in GitHub
InSpaceAI/RL-Zoo
tf
Mentioned in GitHub
DanielLSM/safe-rl-tutorial
tf
Mentioned in GitHub
anita-hu/TF2-RL
tf
Mentioned in GitHub
darkrush/meta_learn
tf
Mentioned in GitHub
guillaumeboniface/reacher
pytorch
Mentioned in GitHub
samiranrl/ODSC_2019_RL
pytorch
Mentioned in GitHub
SergiPonsa/Reinforcement-Learning-Sergi
pytorch
Mentioned in GitHub
J93T/TP4-DDPG
pytorch
Mentioned in GitHub
ghliu/pytorch-ddpg
pytorch
Mentioned in GitHub
shakedzy/warehouse
Mentioned in GitHub
andreidi/AC_DDPG_walker
tf
Mentioned in GitHub
SimonRamstedt/ddpg
tf
Mentioned in GitHub
h-aboutalebi/SparceReward
pytorch
Mentioned in GitHub
soumik12345/DDPG
pytorch
Mentioned in GitHub
xtma/simple-pytorch-rl
pytorch
Mentioned in GitHub
NervanaSystems/coach
tf
Mentioned in GitHub
taku-y/20181125-pybullet
tf
Mentioned in GitHub
kushagra06/DDPG
tf
Mentioned in GitHub
ZiyangY/IndProject-RL-in-Supply-chain
pytorch
Mentioned in GitHub
georgkruse/cleanqrl
pytorch
Mentioned in GitHub
Philori22/DDPG-aigym
pytorch
Mentioned in GitHub
JonasRSV/DDPG
tf
Mentioned in GitHub
JL321/mujo-2DHalf-Cheetah
tf
Mentioned in GitHub
baturaysaglam/dase
pytorch
Mentioned in GitHub
schatty/D4PG-pytorch
pytorch
Mentioned in GitHub
baturaysaglam/ac-off-poc
pytorch
Mentioned in GitHub
marload/DeepRL-TensorFlow2
tf
Mentioned in GitHub
claudeHifly/BipedalWalker-v3
pytorch
Mentioned in GitHub
tilkb/thermoai
tf
Mentioned in GitHub
YoUNG824/DDPG
tf
Mentioned in GitHub
bidimensional/Tennis
Mentioned in GitHub
HJDQN/HJQ
pytorch
Mentioned in GitHub
madhur-tandon/RL-Project
pytorch
Mentioned in GitHub
biemann/Continuous-Control
pytorch
Mentioned in GitHub
denizmguen/IANNWTF2019-Project
tf
Mentioned in GitHub
susan-amin/SparseBaseline1
pytorch
Mentioned in GitHub

Benchmarks

Benchmark | Methodology | Metrics
continuous-control-on-lunar-lander-openai-gym | DDPG | Score: 256.98±14.38
openai-gym-on-ant-v4 | DDPG | Average Return: 1712.12
openai-gym-on-halfcheetah-v4 | DDPG | Average Return: 14934.86
openai-gym-on-hopper-v4 | DDPG | Average Return: 1290.24
openai-gym-on-humanoid-v4 | DDPG | Average Return: 139.14
openai-gym-on-walker2d-v4 | DDPG | Average Return: 2994.54
