Question Answering On Quality
评估指标
Accuracy
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| Claude 1.3 (5-shot) | 84.1 | Model Card and Evaluations for Claude Models | - |
| Claude 2 (5-shot) | 83.2 | Model Card and Evaluations for Claude Models | - |
| RAPTOR + GPT-4 (June 2023) | 82.6 | RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval | |
| Claude Instant 1.1 (5-shot) | 80.5 | Model Card and Evaluations for Claude Models | - |
0 of 4 row(s) selected.