Math Word Problem Solving On Svamp 1 N

评估指标

Execution Accuracy

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
ATHENA (roberta-large)67.8ATHENA: Mathematical Reasoning with Thought Expansion
ATHENA (roberta-base)52.5ATHENA: Mathematical Reasoning with Thought Expansion
0 of 2 row(s) selected.
Math Word Problem Solving On Svamp 1 N | SOTA | HyperAI超神经