Visual Reasoning on WinoGAViL
Evaluation Metric
Jaccard Index
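
For reference, the Jaccard index of two sets is the size of their intersection divided by the size of their union. The sketch below is a minimal illustration and not the benchmark's official scoring code. The function name `jaccard_index`, the image identifiers, and the convention of returning 1.0 when both sets are empty are assumptions made for this example. The scores in the table appear to be reported on a 0 to 100 scale, so the sketch's 0 to 1 value would be multiplied by 100 to compare.

```python
def jaccard_index(predicted, gold):
    """Return |predicted ∩ gold| / |predicted ∪ gold| as a float in [0, 1]."""
    predicted, gold = set(predicted), set(gold)
    union = predicted | gold
    if not union:
        # Assumed convention: two empty selections count as a perfect match.
        return 1.0
    return len(predicted & gold) / len(union)


# Hypothetical example: a model picks two candidate images for a cue,
# and one of them matches the two gold associations.
score = jaccard_index({"img_1", "img_3"}, {"img_1", "img_2"})
print(f"{score:.2f}")
```

Here one image is shared and three distinct images appear across both selections, so the score is 1/3.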
Results
Performance of each model on this benchmark.
| Model | Jaccard Index | Paper Title | Repository |
|---|---|---|---|
| Humans | 90 | WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | |
| ViLT (Zero-Shot) | 52 | WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | |
| X-VLM (Zero-Shot) | 46 | WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | |
| CLIP-ViT-B/32 (Zero-Shot) | 41 | WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | |
| CLIP-ViT-L/14 (Zero-Shot) | 40 | WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | |
| CLIP-RN50x64/14 (Zero-Shot) | 38 | WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | |
| CLIP-RN50 (Zero-Shot) | 35 | WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | |
| CLIP-ViL (Zero-Shot) | 15 | WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | |