Command Palette
Search for a command to run...
Math Word Problem Solving On Math Minival
Metrics
Accuracy
Results
Performance results of various models on this benchmark
| Paper Title | Repository | ||
|---|---|---|---|
| Process Supervision (GPT-4) | 78.2 | Let's Verify Step by Step |
0 of 1 row(s) selected.