Visual Question Answering On Msrvtt Qa 2
评估指标
Accuracy
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| FrozenBiLM | 0.470 | Zero-Shot Video Question Answering via Frozen Bidirectional Language Models | |
| Just Ask | 0.415 | Just Ask: Learning to Answer Questions from Millions of Narrated Videos | |
| SSML | 0.35 | Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning | |
| Aurora (ours, r=64) Aurora (ours, r=64) | - | - | - |
0 of 4 row(s) selected.