Text Summarization On Reddit Tifu
评估指标
ROUGE-1
ROUGE-2
ROUGE-L
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||||
|---|---|---|---|---|---|
| PEGASUS 2B + SLiC | 32.03 | 11.13 | 25.51 | Calibrating Sequence likelihood Improves Conditional Language Generation | - |
| BART+R3F | 30.31 | 10.98 | 24.74 | Better Fine-Tuning by Reducing Representational Collapse | |
| MUPPET BART Large | 30.3 | 11.25 | 24.92 | Muppet: Massive Multi-task Representations with Pre-Finetuning | |
| PEGASUS + SummaReranker | 29.83 | 9.5 | 23.47 | SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization | |
| MatchSum | 25.09 | 6.17 | 20.13 | Extractive Summarization as Text Matching |
0 of 5 row(s) selected.