5 months ago

The StarCraft Multi-Agent Challenge

Mikayel Samvelyan; Tabish Rashid; Christian Schroeder de Witt; Gregory Farquhar; Nantas Nardelli; Tim G. J. Rudner; Chia-Man Hung; Philip H. S. Torr; Jakob Foerster; Shimon Whiteson

Abstract

In the last few years, deep multi-agent reinforcement learning (RL) has become a highly active area of research. A particularly challenging class of problems in this area is partially observable, cooperative, multi-agent learning, in which teams of agents must learn to coordinate their behaviour while conditioning only on their private observations. This is an attractive research area since such problems are relevant to a large number of real-world systems and are also more amenable to evaluation than general-sum problems. Standardised environments such as the ALE and MuJoCo have allowed single-agent RL to move beyond toy domains, such as grid worlds. However, there is no comparable benchmark for cooperative multi-agent RL. As a result, most papers in this field use one-off toy problems, making it difficult to measure real progress. In this paper, we propose the StarCraft Multi-Agent Challenge (SMAC) as a benchmark problem to fill this gap. SMAC is based on the popular real-time strategy game StarCraft II and focuses on micromanagement challenges where each unit is controlled by an independent agent that must act based on local observations. We offer a diverse set of challenge maps and recommendations for best practices in benchmarking and evaluations. We also open-source a deep multi-agent RL learning framework including state-of-the-art algorithms. We believe that SMAC can provide a standard benchmark environment for years to come. Videos of our best agents for several SMAC scenarios are available at: https://youtu.be/VZ7zmQ_obZ0.

Code Repositories

uoe-agents/epymarl

pytorch

Mentioned in GitHub

mahi97/XQMIX

pytorch

Mentioned in GitHub

starry-sky6688/marl-algorithms

pytorch

simsimiSION/pymarl-algorithm-extension-via-starcraft

pytorch

Mentioned in GitHub

oxwhirl/smac

Official

pytorch

Mentioned in GitHub

dtabas/epymarl

pytorch

Mentioned in GitHub

osilab-kaist/smac_exp

pytorch

Mentioned in GitHub

kinalmehta/epymarl

pytorch

Mentioned in GitHub

Denys88/rl_games

Mentioned in GitHub

hahayonghuming/VDACs

pytorch

Mentioned in GitHub

gingkg/smac

pytorch

Mentioned in GitHub

jk96491/SMAC

pytorch

Mentioned in GitHub

jk96491/C-COMA

pytorch

Mentioned in GitHub

ling-pan/res

pytorch

Mentioned in GitHub

kcorder/qmix_variants

pytorch

Mentioned in GitHub

jugg1er/air

pytorch

Mentioned in GitHub

wendelinboehmer/dcg

pytorch

Mentioned in GitHub

ailabdsunipi/pymarlzooplus

pytorch

Mentioned in GitHub

oxwhirl/facmac

pytorch

Mentioned in GitHub

oxwhirl/smacv2

Mentioned in GitHub

osilab-kaist/smac_plus

pytorch

Mentioned in GitHub

oxwhirl/pymarl

Official

pytorch

Mentioned in GitHub

Lamperougeyxy/GHQ

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
smac-on-smac-27m-vs-30m	Heuristic	Median Win Rate: 0
smac-on-smac-27m-vs-30m	QMIX	Median Win Rate: 49
smac-on-smac-3s5z-vs-3s6z-1	Heuristic	Median Win Rate: 0
smac-on-smac-3s5z-vs-3s6z-1	VDN	Median Win Rate: 2
smac-on-smac-3s5z-vs-3s6z-1	IQL	Median Win Rate: 0
smac-on-smac-6h-vs-8z-1	IQL	Median Win Rate: 0
smac-on-smac-6h-vs-8z-1	Heuristic	Median Win Rate: 0
smac-on-smac-6h-vs-8z-1	VDN	Median Win Rate: 0
smac-on-smac-6h-vs-8z-1	QMIX	Median Win Rate: 3
smac-on-smac-corridor	IQL	Median Win Rate: 0
smac-on-smac-corridor	Heuristic	Median Win Rate: 0
smac-on-smac-corridor	QMIX	Median Win Rate: 1
smac-on-smac-mmm2-1	QMIX	Median Win Rate: 69
smac-on-smac-mmm2-1	IQL	Median Win Rate: 0
smac-on-smac-mmm2-1	VDN	Median Win Rate: 1
smac-on-smac-mmm2-1	Heuristic	Median Win Rate: 0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

The StarCraft Multi-Agent Challenge

Mikayel Samvelyan; Tabish Rashid; Christian Schroeder de Witt; Gregory Farquhar; Nantas Nardelli; Tim G. J. Rudner; Chia-Man Hung; Philip H. S. Torr; Jakob Foerster; Shimon Whiteson

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters