Question Answering On Stepgame

评估指标

1-of-100 Accuracy

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
TP-MANN52.99StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts
0 of 1 row(s) selected.
Question Answering On Stepgame | SOTA | HyperAI超神经