Timothy P. Lillicrap; Jonathan J. Hunt; Alexander Pritzel; Nicolas Heess; Tom Erez; Yuval Tassa; David Silver; Daan Wierstra

Abstract
We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic problems such as cartpole swing-up, dexterous manipulation, legged locomotion and car driving. Our algorithm is able to find policies whose performance is competitive with those found by a planning algorithm with full access to the dynamics of the domain and its derivatives. We further demonstrate that for many of the tasks the algorithm can learn policies end-to-end: directly from raw pixel inputs.
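The abstract describes an actor-critic, model-free algorithm built on the deterministic policy gradient, with ideas borrowed from Deep Q-Learning (target networks, learning from mini-batches). The sketch below illustrates a single DDPG update step under those ingredients; it is an assumption-laden illustration in PyTorch, not the authors' code. Network sizes, learning rates, the `obs_dim`/`act_dim` dimensions, and the `update` helper are illustrative, and the replay buffer and exploration noise used in practice are omitted.

```python
# Minimal sketch of one DDPG update step: critic regression toward a
# bootstrapped target, deterministic policy gradient for the actor, and
# soft ("Polyak") updates of the target networks. Hyper-parameters are
# illustrative, not the paper's exact settings.
import torch
import torch.nn as nn

obs_dim, act_dim, gamma, tau = 8, 2, 0.99, 0.005

def mlp(in_dim, out_dim, out_act=nn.Identity):
    return nn.Sequential(nn.Linear(in_dim, 64), nn.ReLU(),
                         nn.Linear(64, out_dim), out_act())

actor = mlp(obs_dim, act_dim, nn.Tanh)      # deterministic policy mu(s)
critic = mlp(obs_dim + act_dim, 1)          # action-value Q(s, a)
actor_targ = mlp(obs_dim, act_dim, nn.Tanh)
critic_targ = mlp(obs_dim + act_dim, 1)
actor_targ.load_state_dict(actor.state_dict())
critic_targ.load_state_dict(critic.state_dict())

actor_opt = torch.optim.Adam(actor.parameters(), lr=1e-4)
critic_opt = torch.optim.Adam(critic.parameters(), lr=1e-3)

def update(s, a, r, s2, done):
    # Critic: regress Q(s, a) toward the target computed with the
    # *target* networks, as in Deep Q-Learning.
    with torch.no_grad():
        q_next = critic_targ(torch.cat([s2, actor_targ(s2)], dim=-1))
        y = r + gamma * (1 - done) * q_next
    q = critic(torch.cat([s, a], dim=-1))
    critic_loss = nn.functional.mse_loss(q, y)
    critic_opt.zero_grad(); critic_loss.backward(); critic_opt.step()

    # Actor: deterministic policy gradient -- ascend Q(s, mu(s)).
    actor_loss = -critic(torch.cat([s, actor(s)], dim=-1)).mean()
    actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()

    # Slowly track the learned networks with the target networks.
    for net, targ in ((actor, actor_targ), (critic, critic_targ)):
        for p, p_t in zip(net.parameters(), targ.parameters()):
            p_t.data.mul_(1 - tau).add_(tau * p.data)

# Dummy mini-batch just to show the call signature.
B = 32
update(torch.randn(B, obs_dim), torch.rand(B, act_dim) * 2 - 1,
       torch.randn(B, 1), torch.randn(B, obs_dim), torch.zeros(B, 1))
```

In a full implementation this update would be called on mini-batches sampled from a replay buffer, with exploration noise added to the actor's output when acting in the environment.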
Benchmarks
| Benchmark | Method | Metric |
|---|---|---|
| LunarLander (continuous control, OpenAI Gym) | DDPG | Score: 256.98 ± 14.38 |
| Ant-v4 (OpenAI Gym) | DDPG | Average Return: 1712.12 |
| HalfCheetah-v4 (OpenAI Gym) | DDPG | Average Return: 14934.86 |
| Hopper-v4 (OpenAI Gym) | DDPG | Average Return: 1290.24 |
| Humanoid-v4 (OpenAI Gym) | DDPG | Average Return: 139.14 |
| Walker2d-v4 (OpenAI Gym) | DDPG | Average Return: 2994.54 |