2048 On 2048
评估指标
Average Score
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| AlphaZero (With Simulator) | 500000 | Planning in Stochastic Environments with a Learned Model | - |
| Stochastic Muzero | 500000 | Planning in Stochastic Environments with a Learned Model | - |
| MuZero | 300000 | Planning in Stochastic Environments with a Learned Model | - |
| Beam Search | 1024 | Playing 2048 With Reinforcement Learning | |
| DQN (1000 episodes) | 256 | Playing 2048 With Reinforcement Learning |
0 of 5 row(s) selected.