A Read-Write Memory Network for Movie Story Understanding

Seil Na; Sangho Lee; Jisung Kim; Gunhee Kim

Abstract

We propose a novel memory network model named Read-Write Memory Network (RWMN) to perform question answering tasks for large-scale, multimodal movie story understanding. The key focus of our RWMN model is the design of a read network and a write network, each consisting of multiple convolutional layers, which give memory read and write operations high capacity and flexibility. While existing memory-augmented network models treat each memory slot as an independent block, our use of multi-layered CNNs allows the model to read and write sequential memory cells as chunks, which is better suited to representing a sequential story because adjacent memory blocks often have strong correlations. For evaluation, we apply our model to all six tasks of the MovieQA benchmark and achieve the best accuracies on several tasks, especially on the visual QA task. Our model shows potential to better understand not only the content of the story but also more abstract information, such as relationships between characters and the reasons for their actions.
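
The chunked read and write idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes NumPy, a single convolutional layer per network, a ReLU nonlinearity, and plain dot-product attention, and the names `conv_over_slots` and `attend` as well as all sizes are hypothetical choices made for illustration only.

```python
import numpy as np


def conv_over_slots(memory, kernel, stride=1):
    """Slide a filter along the slot axis of a memory matrix.

    memory: (n_slots, d_in) array, one row per memory slot in story order.
    kernel: (width, d_in, d_out) array (illustrative, randomly initialised below).
    Each output row summarises `width` adjacent slots, so slots are
    processed as chunks rather than independently.
    """
    n_slots, d_in = memory.shape
    width, k_in, d_out = kernel.shape
    assert k_in == d_in, "kernel input size must match the slot dimension"
    n_out = (n_slots - width) // stride + 1
    out = np.empty((n_out, d_out))
    for i in range(n_out):
        chunk = memory[i * stride : i * stride + width]  # (width, d_in)
        # Contract over both the slot-offset axis and the feature axis.
        out[i] = np.tensordot(chunk, kernel, axes=([0, 1], [0, 1]))
    return np.maximum(out, 0.0)  # ReLU (an assumption of this sketch)


def attend(memory, query):
    """Softmax attention of a query vector over memory rows."""
    scores = memory @ query
    weights = np.exp(scores - scores.max())  # subtract max for stability
    weights /= weights.sum()
    return weights @ memory, weights


rng = np.random.default_rng(0)
story = rng.normal(size=(40, 8))  # 40 story slots, 8-dim embeddings (made up)

# "Write": compress adjacent slots into fewer, chunk-level memory cells.
write_kernel = rng.normal(size=(5, 8, 8)) * 0.1
written = conv_over_slots(story, write_kernel, stride=2)

# "Read": convolve again over the written memory before attending to it.
read_kernel = rng.normal(size=(3, 8, 8)) * 0.1
read = conv_over_slots(written, read_kernel, stride=1)

question = rng.normal(size=8)
summary, weights = attend(read, question)
print(written.shape, read.shape, summary.shape)
```

With a filter width of 5 and a stride of 2, each written cell covers five consecutive story slots, so correlations between neighbouring slots can be captured before any attention is computed. A model that treats slots independently would correspond to a width of 1.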

Code Repositories

seilna/RWMN (official, TensorFlow)

Benchmarks

Benchmark | Methodology | Metrics
video-story-qa-on-movieqa | RWMN | Accuracy: 36.25
