Question Answering On Blurb
评估指标
Accuracy
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| BioLinkBERT (large) | 83.5 | LinkBERT: Pretraining Language Models with Document Links | |
| BioLinkBERT (base) | 80.81 | LinkBERT: Pretraining Language Models with Document Links | |
| GPT-4 | 80.56 | Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark | - |
| PubMedBERT (uncased; abstracts) | 71.7 | Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing |
0 of 4 row(s) selected.