Image Captioning On Whoops
评估指标
BLEU-4
CIDEr
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | |||
|---|---|---|---|---|
| BLIP2 FlanT5-XXL (Fine-tuned) | 42 | 177 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
| BLIP2 FlanT5-XL (Fine-tuned) | 41 | 174 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
| BLIP2 FlanT5-XXL (Zero-Shot) | 31 | 120 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
| CoCa ViT-L-14 MSCOCO | 25 | 102 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
| BLIP Large | 13 | 65 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
| OFA Large | 0 | 0 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
0 of 6 row(s) selected.