HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering

Vahid Kazemi; Ali Elqursh

Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering

Abstract

This paper presents a new baseline for visual question answering task. Given an image and a question in natural language, our model produces accurate answers according to the content of the image. Our model, while being architecturally simple and relatively small in terms of trainable parameters, sets a new state of the art on both unbalanced and balanced VQA benchmark. On VQA 1.0 open ended challenge, our model achieves 64.6% accuracy on the test-standard set without using additional data, an improvement of 0.4% over state of the art, and on newly released VQA 2.0, our model scores 59.7% on validation set outperforming best previously reported results by 0.5%. The results presented in this paper are especially interesting because very similar models have been tried before but significantly lower performance were reported. In light of the new results we hope to see more meaningful research on visual question answering in the future.

Code Repositories

mkhalil1998/EC601_Group_Project
pytorch
Mentioned in GitHub
dukelin95/vqa_pytorch
pytorch
Mentioned in GitHub
guoyang9/vqa-prior
pytorch
Mentioned in GitHub
snagiri/ECE285_Jarvis_ProjectA
pytorch
Mentioned in GitHub
deshanadesai/VQA-DataAugmentation
pytorch
Mentioned in GitHub
abhigoyal1997/CS-763-Project
pytorch
Mentioned in GitHub
Gunnika/Visual-Question-Answering
pytorch
Mentioned in GitHub
Cyanogenoid/pytorch-vqa
pytorch
Mentioned in GitHub
pramodkaushik/visual_qa_analysis
pytorch
Mentioned in GitHub
myaoo18/EC601-Visual-Question-Answering
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
visual-question-answering-on-vqa-v1-test-devSAAA (ResNet)
Accuracy: 64.5
visual-question-answering-on-vqa-v1-test-stdSAAA (ResNet)
Accuracy: 64.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering | Papers | HyperAI