Object Counting On Tallyqa Simple
评估指标
Accuracy
RMSE
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | |||
|---|---|---|---|---|
| SMoLA-PaLI-X Specialist | 86.3 | - | Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts | - |
| PaLI-X-VPD | 86.2 | - | Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models | - |
| SMoLA-PaLI-X Generalist (0 shot) | 83.3 | - | Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts | - |
| MoVie-ResNeXt | 74.9 | 1 | MoVie: Revisiting Modulated Convolutions for Visual Counting and Beyond | |
| RCN | 71.8 | 1.13 | TallyQA: Answering Complex Counting Questions | |
| MoVie | 70.8 | 1.09 | MoVie: Revisiting Modulated Convolutions for Visual Counting and Beyond |
0 of 6 row(s) selected.