Coreference Resolution on GAP
Evaluation Metrics
Bias (F/M)
Feminine F1 (F)
Masculine F1 (M)
Overall F1
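The Bias (F/M) metric can be read as the ratio of the feminine F1 score to the masculine F1 score, which is how the GAP benchmark defines it. The sketch below illustrates that arithmetic using the ProBERT and Full Ensemble scores from the results table on this page. The function name `bias_ratio` is hypothetical and not part of any official evaluation script, and rounding to two decimals is an assumption about how the leaderboard displays the value.

```python
def bias_ratio(feminine_f1: float, masculine_f1: float) -> float:
    """Return the F/M ratio; values near 1.0 indicate balanced performance."""
    if masculine_f1 <= 0:
        raise ValueError("masculine_f1 must be positive")
    return feminine_f1 / masculine_f1


# Scores copied from the leaderboard rows (Feminine F1, Masculine F1).
probert_bias = round(bias_ratio(91.1, 94.0), 2)
full_ensemble_bias = round(bias_ratio(89.5, 90.9), 2)

print(probert_bias, full_ensemble_bias)  # prints: 0.97 0.98
```

A ratio below 1.0 means the model scores lower on feminine pronouns than on masculine ones.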
Evaluation Results
Performance of each model on this benchmark
| Model | Bias (F/M) | Feminine F1 (F) | Masculine F1 (M) | Overall F1 | Paper Title | Repository |
|---|---|---|---|---|---|---|
| Coref-MTL | 0.99 | 92.45 | 92.65 | 92.72 | - | - |
| ProBERT | 0.97 | 91.1 | 94.0 | 92.5 | Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence Pooling | - |
| Maverick_incr | - | - | - | 91.2 | Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends | |
| Full Ensemble | 0.98 | 89.5 | 90.9 | 90.2 | Gendered Pronoun Resolution using BERT and an extractive question answering formulation | |
| PeTra | - | - | - | - | PeTra: A Sparsely Supervised Memory Model for People Tracking | |