Visual Question Answering Vqa On Activitynet 1

ClipMatch@1

ClipMatch@5

Contains

ExactMatch

Follow-up ClipMatch@1

Follow-up ClipMatch@5

Follow-up Contains

Follow-up ExactMatch

评测结果

各个模型在此基准测试上的表现结果

									Paper Title	Repository
BLIP-2 T5	53.39	74.71	15.70	7.07	62.02	75.13	18.09	8.84	Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy

0 of 1 row(s) selected.