Temporal Casual Qa On Next Qa
评估指标
WUPS
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| PaLI-X | 38.3 | PaLI-X: On Scaling up a Multilingual Vision and Language Model | |
| PaLI-3 | 37.7 | PaLI-3 Vision Language Models: Smaller, Faster, Stronger | |
| R2A | 34.7 | Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models | - |
| Flamingo(32-shot) | 33.5 | Flamingo: a Visual Language Model for Few-Shot Learning | |
| Gemini Ultra (zero-shot) | 29.9 | Gemini: A Family of Highly Capable Multimodal Models | |
| Gemini Pro (zero-shot) | 28.0 | Gemini: A Family of Highly Capable Multimodal Models | |
| Flamingo(0-shot) | 26.7 | Flamingo: a Visual Language Model for Few-Shot Learning | |
| Emu(0-shot) | 23.4 | Emu: Generative Pretraining in Multimodality |
0 of 8 row(s) selected.