Phrase Grounding On Referit
评估指标
Pointing Game Accuracy
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| VG_BiLSTM_VGG | 62.76 | Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding | |
| GbS Ensemble MS-COCO | 58.21 | Detector-Free Weakly Supervised Grounding by Separation | |
| MCB | - | Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding |
0 of 3 row(s) selected.