| ST-MoE-32B 269B (fine-tuned) | 96.1 | ST-MoE: Designing Stable and Transferable Sparse Expert Models | |
| Claude 3 Opus (5-shot) | 88.5 | The Claude 3 Model Family: Opus, Sonnet, Haiku | - |
| RoBERTa-Winogrande 355M (fine-tuned) | 79.1 | WinoGrande: An Adversarial Winograd Schema Challenge at Scale | |