Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

David Silver; Thomas Hubert; Julian Schrittwieser; Ioannis Antonoglou; Matthew Lai; Arthur Guez; Marc Lanctot; Laurent Sifre; Dharshan Kumaran; Thore Graepel; Timothy Lillicrap; Karen Simonyan; Demis Hassabis

Abstract

The game of chess is the most widely-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. In contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go, by tabula rasa reinforcement learning from games of self-play. In this paper, we generalise this approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains. Starting from random play, and given no domain knowledge except the game rules, AlphaZero achieved within 24 hours a superhuman level of play in the games of chess and shogi (Japanese chess) as well as Go, and convincingly defeated a world-champion program in each case.

Code Repositories

Neo-The1/ThinkingTicTacToe (TensorFlow)
saikrishna-1996/deep_pepper_chess (PyTorch)
fantianwen/leela13_training (TensorFlow)
intenseG/BSK (TensorFlow)
k-lombard/CS4641_Project (TensorFlow)
pathway/alphaxos
MerceaOtniel/HybridAlpha (TensorFlow)
goodls-cs/leela-zero (TensorFlow)
QueensGambit/CrazyAra-Engine
alreadydone/Leela_Phoenix (TensorFlow)
mengyangbai/CchessGo
leela-zero/leela-zero (TensorFlow)
helpstonex/crazyara (MXNet)
leelachesszero/lczero (TensorFlow)
cjohnchen/mirror (TensorFlow)
Zeta36/chess-alpha-zero (TensorFlow)
MingzhenY/Projects
Sidus-smg/ataxx-ai (PyTorch)
BertilBraun/ChessBot (PyTorch)
John-Yu/leelaApplication
ryanp73/ChessAI (TensorFlow)
cjohnchen/sai (TensorFlow)
zhoujianxing123/minigo_v17 (TensorFlow)
svikramank/chess-deepRL (TensorFlow)
jonahs99/baby-zero (PyTorch)
fantianwen/laalaz13E
sai-dev/sai
gcp/leela-zero (TensorFlow)
wcarvalho/jaxneurorl (JAX)
plkmo/AlphaZero_Connect4 (PyTorch)
qinwang/leela-zero (TensorFlow)
AIRLegend/ChessRL (TensorFlow)
samiamkhan/4641Project
jasonrobwebster/alphazero-clone (PyTorch)
Nkluge-correa/teeny-tiny_castle (TensorFlow)
sagpant/leela-zero (TensorFlow)
jonah-chen/alphazero-guerzhoy (TensorFlow)
cjohnchen/changebale (TensorFlow)
aduerig/burrus (TensorFlow)
samhippie/q-game-agent (TensorFlow)
simoll/apo (TensorFlow)
l1t1/lz19 (TensorFlow)
langorou/vamperouge (PyTorch)
tensorflow/minigo (TensorFlow)
glinscott/leela-chess (TensorFlow)
gtagency/beta-zero
wangml999/chess_zero (TensorFlow)
cosmobobak/ttt-ml (TensorFlow)
QueensGambit/CrazyAra (MXNet)

Benchmarks

Benchmark | Methodology | Metrics
game-of-go-on-elo-ratings | AlphaGo Zero | ELO Rating: 5185
game-of-shogi-on-elo-ratings | AlphaZero | ELO Rating: 4650
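
For readers unfamiliar with Elo ratings, the sketch below shows how a rating gap maps to an expected score under the standard logistic Elo model. The function name `elo_expected_score` and the 4450-rated opponent are illustrative assumptions and do not come from the paper or this page. Ratings are only meaningful within a single rating pool, so the Go and shogi figures above should not be compared with each other.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score (win = 1, draw = 0.5, loss = 0) of player A against
    player B under the standard logistic Elo model with a 400-point scale."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


if __name__ == "__main__":
    # Hypothetical opponent rated 200 points below the listed shogi rating;
    # the 4450 figure is an assumption chosen only for illustration.
    listed_rating = 4650
    hypothetical_opponent = 4450
    score = elo_expected_score(listed_rating, hypothetical_opponent)
    print(f"Expected score with a 200-point advantage: {score:.3f}")
```

Under this model a 200-point advantage corresponds to an expected score of roughly 0.76, and equal ratings give exactly 0.5.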
