Visual Question Answering (VQA) on 5
Evaluation Metric
Overall Accuracy
Evaluation Results
Performance of each model on this benchmark
| Model | Overall Accuracy | Paper Title | Repository |
|---|---|---|---|
| GPT-4V | 66.0 | AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models | |
| Gemini Pro Vision | 51.4 | - | - |
| miniGPT4 | 51.0 | MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models | |
| LLaVA-1.5 | 44.5 | Improved Baselines with Visual Instruction Tuning | |
| Claude 3 | 37.1 | - | - |