Question Answering On Race
评估指标
RACE
RACE-h
RACE-m
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||||
|---|---|---|---|---|---|
| XLNet | 81.75 | - | 85.45 | XLNet: Generalized Autoregressive Pretraining for Language Understanding | |
| OCN_large | 71.7 | 69.6 | 76.7 | Option Comparison Network for Multiple-choice Reading Comprehension | - |
| DCMN_large | 69.7 | 68.1 | 73.4 | Dual Co-Matching Network for Multi-choice Reading Comprehension | - |
| Finetuned Transformer LM | 59.0 | 57.4 | 62.9 | Improving Language Understanding by Generative Pre-Training | - |
| BiAttention MRU | 53.3 | 50.3 | 60.2 | Multi-range Reasoning for Machine Comprehension | - |
| GPT-3 175B (few-shot, k=32) | - | - | 58.1 | Language Models are Few-Shot Learners | |
| GPT-3 175B (Few-Shot) | - | 46.8 | - | Language Models are Few-Shot Learners |
0 of 7 row(s) selected.