Continuous Control On Cartpole Swingup 2
评估指标
Return
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| SMuZero | 868.87 | Learning and Planning in Complex Action Spaces | |
| MuZero Unplugged | 594.3 | Online and Offline Reinforcement Learning by Planning with a Learned Model |
0 of 2 row(s) selected.