HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Zero-shot Visual Question Answering using Knowledge Graph

Zhuo Chen Jiaoyan Chen Yuxia Geng Jeff Z. Pan Zonggang Yuan Huajun Chen

Zero-shot Visual Question Answering using Knowledge Graph

Abstract

Incorporating external knowledge to Visual Question Answering (VQA) has become a vital practical need. Existing methods mostly adopt pipeline approaches with different components for knowledge matching and extraction, feature learning, etc.However, such pipeline approaches suffer when some component does not perform well, which leads to error propagation and poor overall performance. Furthermore, the majority of existing approaches ignore the answer bias issue -- many answers may have never appeared during training (i.e., unseen answers) in real-word application. To bridge these gaps, in this paper, we propose a Zero-shot VQA algorithm using knowledge graphs and a mask-based learning mechanism for better incorporating external knowledge, and present new answer-based Zero-shot VQA splits for the F-VQA dataset. Experiments show that our method can achieve state-of-the-art performance in Zero-shot VQA with unseen answers, meanwhile dramatically augment existing end-to-end models on the normal F-VQA task.

Code Repositories

Fangyin1994/KCL
pytorch
Mentioned in GitHub
China-UK-ZSL/ZS-F-VQA
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
visual-question-answering-on-f-vqaZS-F-VQA
Accuracy: 88.49
MR: 9.17
MRR: 0.685
Top-1 Accuracy: 58.27
Top-3 Accuracy: 76.51
visual-question-answering-on-zs-f-vqaSAN † - hard mask
Top-1 Accuracy: 29.39

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Zero-shot Visual Question Answering using Knowledge Graph | Papers | HyperAI