Neural Self Talk: Image Understanding via Continuous Questioning and Answering

Yezhou Yang; Yi Li; Cornelia Fermuller; Yiannis Aloimonos

Abstract

In this paper we consider the problem of continuously discovering image contents by actively asking image-based questions and subsequently answering the questions being asked. The key components are a Visual Question Generation (VQG) module and a Visual Question Answering (VQA) module, both of which use Recurrent Neural Networks (RNNs) and Convolutional Neural Networks (CNNs). Given a dataset that contains images, questions, and their answers, both modules are trained at the same time. They differ in their inputs and outputs: VQG takes images as input and produces the corresponding questions, while VQA takes images and questions as input and produces the corresponding answers. We evaluate the self-talk process subjectively using Amazon Mechanical Turk, and the results show the effectiveness of the proposed method.
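The alternation between the two modules described in the abstract can be sketched as a simple loop. This is a minimal illustration only, not the authors' implementation: the function names `self_talk`, `make_toy_vqg`, and `toy_vqa`, the `num_rounds` parameter, and the dictionary stand-in for an image are all hypothetical, and the toy modules replace the trained CNN/RNN models with lookups.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical interfaces: a VQG module maps an image to a question,
# a VQA module maps an (image, question) pair to an answer.
QuestionGenerator = Callable[[object], str]
QuestionAnswerer = Callable[[object, str], str]


def self_talk(
    image: object,
    generate_question: QuestionGenerator,
    answer_question: QuestionAnswerer,
    num_rounds: int = 3,
) -> List[Tuple[str, str]]:
    """Alternate question generation and answering for a single image."""
    dialogue = []
    for _ in range(num_rounds):
        question = generate_question(image)        # VQG: image -> question
        answer = answer_question(image, question)  # VQA: (image, question) -> answer
        dialogue.append((question, answer))
    return dialogue


def make_toy_vqg(questions: List[str]) -> QuestionGenerator:
    """Toy stand-in for a trained VQG model: emits preset questions in order."""
    remaining = iter(questions)

    def generate(image: object) -> str:
        return next(remaining)

    return generate


def toy_vqa(image: Dict[str, str], question: str) -> str:
    """Toy stand-in for a trained VQA model: looks the answer up in the 'image'."""
    return image.get(question, "unknown")


if __name__ == "__main__":
    # The "image" here is just a dict of question -> answer pairs (an assumption
    # made so the sketch runs without any model weights).
    toy_image = {"what color is the cat?": "black", "how many people?": "two"}
    vqg = make_toy_vqg(list(toy_image))
    for q, a in self_talk(toy_image, vqg, toy_vqa, num_rounds=2):
        print(f"Q: {q}  A: {a}")
```

In a real system the two callables would wrap trained models. The loop itself only fixes the interface stated in the abstract: VQG sees the image, and VQA sees the image plus the generated question.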

Benchmarks

Benchmark | Methodology | Metrics
question-generation-on-coco-visual-question | Sample (Yang, 2015) | BLEU-1: 38.8
question-generation-on-coco-visual-question | Max (Yang, 2015) | BLEU-1: 59.4
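
For readers unfamiliar with the metric, BLEU-1 is the clipped unigram precision of a generated sentence against its references, multiplied by a brevity penalty. The sketch below shows the standard sentence-level definition. It assumes whitespace tokenization and lowercasing, and the function name `bleu1` is hypothetical. The exact evaluation settings behind the scores above (tokenization, number of references, corpus-level versus sentence-level aggregation) are not stated on this page, so this is not a reproduction of them.

```python
import math
from collections import Counter
from typing import List


def bleu1(candidate: str, references: List[str]) -> float:
    """Sentence-level BLEU-1: clipped unigram precision times brevity penalty."""
    cand = candidate.lower().split()
    refs = [r.lower().split() for r in references]
    if not cand or not refs:
        return 0.0

    # For each token, the largest count observed in any single reference.
    max_ref_counts = Counter()
    for ref in refs:
        for token, count in Counter(ref).items():
            max_ref_counts[token] = max(max_ref_counts[token], count)

    # Clip candidate counts so repeated tokens are not over-rewarded.
    cand_counts = Counter(cand)
    clipped = sum(min(count, max_ref_counts[token]) for token, count in cand_counts.items())
    precision = clipped / len(cand)

    # Brevity penalty uses the reference length closest to the candidate length
    # (ties resolved toward the shorter reference).
    ref_len = min((len(r) for r in refs), key=lambda n: (abs(n - len(cand)), n))
    brevity_penalty = 1.0 if len(cand) > ref_len else math.exp(1 - ref_len / len(cand))

    return brevity_penalty * precision


if __name__ == "__main__":
    score = bleu1("what color is the cat", ["what color is the dog"])
    print(f"BLEU-1: {100 * score:.1f}")  # scores are usually reported on a 0-100 scale
```

Four of the five candidate tokens in the usage example appear in the reference and the lengths match, so the unscaled score is 0.8.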