Visual Question Answering (VQA) on ActivityNet 1
Metrics
ClipMatch@1
ClipMatch@5
Contains
ExactMatch
Follow-up ClipMatch@1
Follow-up ClipMatch@5
Follow-up Contains
Follow-up ExactMatch
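The page lists the metric names without defining them. As a rough illustration only, the two text-based metrics could be computed along the lines below. This is a minimal sketch under stated assumptions: `normalize`, `exact_match`, `contains`, and `score` are hypothetical helper names, and the normalization (lowercasing, stripping punctuation, collapsing whitespace) is assumed rather than taken from the benchmark. ClipMatch is not sketched because it depends on an embedding model that this page does not describe.

```python
import string


def normalize(text: str) -> str:
    """Lowercase, drop ASCII punctuation, collapse whitespace (assumed normalization)."""
    text = text.lower().translate(str.maketrans("", "", string.punctuation))
    return " ".join(text.split())


def exact_match(prediction: str, label: str) -> bool:
    """True when the normalized prediction equals the normalized label."""
    return normalize(prediction) == normalize(label)


def contains(prediction: str, label: str) -> bool:
    """True when the normalized label occurs as a substring of the normalized prediction."""
    return normalize(label) in normalize(prediction)


def score(predictions, labels, metric) -> float:
    """Percentage of (prediction, label) pairs for which `metric` returns True."""
    if len(predictions) != len(labels) or not labels:
        raise ValueError("predictions and labels must be non-empty and of equal length")
    hits = sum(metric(p, l) for p, l in zip(predictions, labels))
    return 100.0 * hits / len(labels)


if __name__ == "__main__":
    # Hypothetical model outputs and ground-truth labels, for illustration only.
    preds = ["Playing guitar.", "A man is surfing"]
    golds = ["playing guitar", "surfing"]
    print(score(preds, golds, exact_match))
    print(score(preds, golds, contains))
```

Under these assumptions a Contains score can never be lower than the ExactMatch score on the same predictions, since any exact match also satisfies the substring check.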
Results
Performance results of various models on this benchmark
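| Model | ClipMatch@1 | ClipMatch@5 | Contains | ExactMatch | Follow-up ClipMatch@1 | Follow-up ClipMatch@5 | Follow-up Contains | Follow-up ExactMatch | Paper Title | Repository |
|---|---|---|---|---|---|---|---|---|---|---|
| BLIP-2 T5 | 53.39 | 74.71 | 15.70 | 7.07 | 62.02 | 75.13 | 18.09 | 8.84 | Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy | |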