Visual Question Answering Vqa On Activitynet 1
评估指标
ClipMatch@1
ClipMatch@5
Contains
ExactMatch
Follow-up ClipMatch@1
Follow-up ClipMatch@5
Follow-up Contains
Follow-up ExactMatch
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| BLIP-2 T5 | 53.39 | 74.71 | 15.70 | 7.07 | 62.02 | 75.13 | 18.09 | 8.84 | Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy | 
0 of 1 row(s) selected.