Visual Question Answering Vqa On Ai2D
评估指标
EM
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| SMoLA-PaLI-X Specialist Model | 82.5 | Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts | - |
| SMoLA-PaLI-X Generalist Model | 81.4 | Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts | - |
| Gemini Ultra | 79.5 | Gemini: A Family of Highly Capable Multimodal Models | |
| DUBLIN | 51.11 | DUBLIN -- Document Understanding By Language-Image Network | - |
0 of 4 row(s) selected.