4 个月前

使用深度强化学习玩Atari游戏

Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Alex Graves; Ioannis Antonoglou; Daan Wierstra; Martin Riedmiller

摘要

我们提出了首个成功利用强化学习从高维感官输入中直接学习控制策略的深度学习模型。该模型是一个卷积神经网络，采用Q-learning的一种变体进行训练，其输入为原始像素，输出为估计未来奖励的价值函数。我们将该方法应用于Arcade Learning Environment中的七款Atari 2600游戏，且未对架构或学习算法进行任何调整。研究结果表明，该模型在六款游戏中超越了所有先前的方法，并在其中三款游戏中超过了人类专家的表现。

代码仓库

alfredvc/paac

GitHub 中提及

harvitronix/reinforcement-learning-car

GitHub 中提及

filippogiruzzi/deep_q_learning

GitHub 中提及

FauzaanQureshi/deep-Q-learning

GitHub 中提及

xiuyu0000/new_papers_codes/tree/main/dqn

mindspore

saha0073/Deep-Reinforcement-Learning-to-play-Cartpole

GitHub 中提及

proroklab/popgym

pytorch

GitHub 中提及

Rabrg/dqn

pytorch

GitHub 中提及

rishavb123/MineRL

GitHub 中提及

toni-sm/skrl

jax

han-won/PlayingAtariWithMindSpore

mindspore

Anshu1245/RL-CourseProject

GitHub 中提及

ray-project/ray/tree/master/rllib

marload/deep-rl-tf2

GitHub 中提及

avillemin/Minecraft-AI

pytorch

GitHub 中提及

daviddcho/supermario

pytorch

GitHub 中提及

JackFurby/Breakout

GitHub 中提及

JuliaPOMDP/DeepQLearning.jl

GitHub 中提及

spragunr/deep_q_rl

GitHub 中提及

parilo/rl-server

GitHub 中提及

MaximeVandegar/Papers-in-100-Lines-of-Code/tree/main/Playing_Atari_with_Deep_Reinforcement_Learning

pytorch

joshiatul/game_playing

GitHub 中提及

BH4/Deep-Reinforcement-Learning

GitHub 中提及

markusdutschke/yahtzee

GitHub 中提及

bjotho/Zelda1AI

GitHub 中提及

mindspore-courses/Deep-Reinforcement-Learning-Algorithms-with-MindSpore

mindspore

GitHub 中提及

facebookresearch/rl/blob/main/examples/dqn/dqn.py

jax

hill-a/stable-baselines

xValentim/Steering_Behaviors_with_pygame

GitHub 中提及

nandomp/AICollaboratory

GitHub 中提及

JonasRSV/DQN

GitHub 中提及

niklasschmitz/DeepQLearning

jax

GitHub 中提及

GitHub 中提及

GitHub 中提及

GitHub 中提及

mindspore

GitHub 中提及

2023-MindSpore-1/ms-code-52

mindspore

GitHub 中提及

MateuszJanda/netris-ai-robot

GitHub 中提及

borhanreo/Obstacle-Avoid-Car

GitHub 中提及

near32/regym

pytorch

GitHub 中提及

Sheepsody/Batched-Impala-PyTorch

pytorch

GitHub 中提及

K-tang-mkv/baseRLAlgorithm

pytorch

GitHub 中提及

vsquareg/RL_ERA

GitHub 中提及

epignatelli/human-level-control-through-deep-reinforcement-learning

jax

GitHub 中提及

Gary-Shi/Tank

GitHub 中提及

MehmetBarutcu/Streaming-Algorithm-for-Monotone-k-Submodular-Maximization-with-Cardinality-Constraints

GitHub 中提及

omkarv/pong-from-pixels

GitHub 中提及

tlohr/nfsu2-ai

GitHub 中提及

Wentworth1996/Summer_Intern_Progress

GitHub 中提及

eddynelson/dqn

GitHub 中提及

bay3s/dqn

pytorch

pavitrakumar78/Playing-custom-games-using-Deep-Learning

GitHub 中提及

RLeike/connect-four

jax

GitHub 中提及

paintception/Deep-Quality-Value-Family-

GitHub 中提及

kshitij-ingale/Reinforcement-Learning

GitHub 中提及

lvyufeng/DQN-MindSpore

mindspore

KavindaKottege/DeepQ-Pong

GitHub 中提及

behzaad/Deep_QLearning

GitHub 中提及

michaelnny/deep_rl_zoo

pytorch

sygi/deep_q_rl

GitHub 中提及

subhadip-maiti/tinydqn

GitHub 中提及

RandyDeng/gym_connect4

GitHub 中提及

sunjeet95/Deep-Q-Network-using-Tensorflow

GitHub 中提及

ShivamShrirao/deep_Q_learning_from_scratch

GitHub 中提及

KatyNTsachi/Hierarchical-RL

GitHub 中提及

invictos/InsacarDQN

GitHub 中提及

igoracmorais/inteligencia_artificial

GitHub 中提及

tensorpack/tensorpack/tree/master/examples/DeepQNetwork

kmdanielduan/DQN_Family_PyTorch

pytorch

GitHub 中提及

InSpaceAI/RL-Zoo

GitHub 中提及

komejisatori/ReinforcementCar

pytorch

GitHub 中提及

anita-hu/TF2-RL

ShivamShrirao/deep_Q_learning

GitHub 中提及

blakeMilner/DeepQLearning

pytorch

GitHub 中提及

LukasGardberg/cartpole

GitHub 中提及

pytorch/rl/tree/main/examples/dqn

jax

labmlai/annotated_deep_learning_paper_implementations

pytorch

ktkachuk/Atari-with-Q-Learning

GitHub 中提及

paintception/Deep-Quality-Value-Family

GitHub 中提及

dsgiitr/rl_2048

GitHub 中提及

Curt-Park/rainbow-is-all-you-need

gordicaleksa/pytorch-learn-reinforcement-learning

pytorch

jonaths/tf-dqn

GitHub 中提及

eublefar/dqn

GitHub 中提及

RobotMobile/rl-paper-review

GitHub 中提及

nathanin/pad

GitHub 中提及

JonasRSV/DQNTensorflow

GitHub 中提及

OscarHuangWind/Preference-Guided-DQN-Atari

pytorch

sourenaKhanzadeh/snakeAi

pytorch

GitHub 中提及

geeky-wizard/Atari-Deep-Reinforcement-Learning

GitHub 中提及

esmeralday/MARL

GitHub 中提及

CankayaUniversity/ceng-407-408-License-Plate-Recognition-Using-Deep-Learning

GitHub 中提及

Linging/Traffic-Signal-Control

GitHub 中提及

Ishan-Kumar2/Reinforcement-Learning-on-2048

GitHub 中提及

GitHub 中提及

GitHub 中提及

GitHub 中提及

marload/DeepRL-TensorFlow2

GitHub 中提及

qiankun214/DQN-FlappyBird-python3

pytorch

GitHub 中提及

drforester/Q-learning-Intersection-Crossing

GitHub 中提及

natsumeS/analysis

GitHub 中提及

TheFebrin/DeepRL-Pong

pytorch

GitHub 中提及

chandar-lab/RLHive

pytorch

vincentpalma/DQN-for-CaRL

pytorch

GitHub 中提及

mfregeau/DeepLearning

GitHub 中提及

SayhoKim/tetrisRL

GitHub 中提及

DLR-RM/stable-baselines3

pytorch

ugo-nama-kun/DQN-chainer

GitHub 中提及

AndrewJWashington/protodriver

GitHub 中提及

yaxinchen666/dce_pricingRL

GitHub 中提及

MOVzeroOne/DQN

pytorch

GitHub 中提及

基准测试

基准	方法	指标
atari-games-on-atari-2600-beam-rider	DQN Best	Score: 5184
atari-games-on-atari-2600-breakout	DQN Best	Score: 225
atari-games-on-atari-2600-enduro	DQN Best	Score: 661
atari-games-on-atari-2600-pong	DQN Best	Score: 21
atari-games-on-atari-2600-qbert	DQN Best	Score: 4500
atari-games-on-atari-2600-seaquest	DQN Best	Score: 1740
atari-games-on-atari-2600-space-invaders	DQN Best	Score: 1075

用 AI 构建 AI

从想法到上线——通过免费 AI 协同编程、开箱即用的环境和市场最优价格的 GPU 加速您的 AI 开发

AI 协同编程

即用型 GPU

最优价格

立即开始

Hyper Newsletters

订阅我们的最新资讯

我们会在北京时间 每周一的上午九点 向您的邮箱投递本周内的最新更新

邮件发送服务由 MailChimp 提供