Visual Question Answering on VQA v1 test-std
Evaluation Metric
Accuracy
Results
Performance of each model on this benchmark
| Model | Accuracy | Paper Title | Repository |
|---|---|---|---|
| SAAA (ResNet) | 64.6 | Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering | |
| RAU (ResNet) | 63.2 | Training Recurrent Answering Units with Joint Loss Minimization for VQA | - |
| HieCoAtt (ResNet) | 62.1 | Hierarchical Question-Image Co-Attention for Visual Question Answering | |
| DMN+ | 60.4 | Dynamic Memory Networks for Visual and Textual Question Answering | |
| SAN (VGG) | 58.9 | Stacked Attention Networks for Image Question Answering | |
| NMN+LSTM+FT | 58.7 | Neural Module Networks | |