Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes

Caglar Gulcehre; Sarath Chandar; Kyunghyun Cho; Yoshua Bengio

Abstract

We extend the neural Turing machine (NTM) model into a dynamic neural Turing machine (D-NTM) by introducing a trainable memory addressing scheme. This addressing scheme maintains two separate vectors for each memory cell: a content vector and an address vector. This allows the D-NTM to learn a wide variety of location-based addressing strategies, including both linear and nonlinear ones. We implement the D-NTM with both continuous, differentiable and discrete, non-differentiable read/write mechanisms. We investigate the mechanisms and effects of learning to read and write into a memory through experiments on the Facebook bAbI tasks, using both a feedforward and a GRU controller. The D-NTM is evaluated on a set of Facebook bAbI tasks and shown to outperform NTM and LSTM baselines. We provide an extensive analysis of our model and of different variations of the NTM on the bAbI tasks. We also provide further experimental results on sequential pMNIST, Stanford Natural Language Inference, associative recall, and copy tasks.
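
The addressing scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the cosine similarity, the sharpening factor `beta`, the concatenation of address and content vectors, and all function names are assumptions made for illustration. In particular, the hard (discrete) variant here takes a deterministic argmax, which is a simplification; a discrete mechanism trained end to end would typically sample an index instead. The address vectors would be learned parameters in a real model and are fixed constants here.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0

def soft_weights(key, addresses, contents, beta=1.0):
    """Soft addressing: a softmax over key/cell similarities.

    Each cell is represented by its address vector concatenated with its
    content vector (an assumption of this sketch).
    """
    scores = [beta * cosine(key, a + c) for a, c in zip(addresses, contents)]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def hard_weights(weights):
    """Hard addressing (simplified): a one-hot vector at the argmax."""
    idx = max(range(len(weights)), key=weights.__getitem__)
    return [1.0 if i == idx else 0.0 for i in range(len(weights))]

def read(weights, contents):
    """Read vector: the weighted sum of the content vectors only."""
    dim = len(contents[0])
    return [sum(w * c[d] for w, c in zip(weights, contents)) for d in range(dim)]

# Hypothetical memory with three cells.
addresses = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # learned in a real model
contents = [[0.5, 0.0], [0.0, 2.0], [1.0, 1.0]]
key = [0.0, 1.0, 0.0, 1.0]  # would be emitted by the controller

w_soft = soft_weights(key, addresses, contents, beta=5.0)
w_hard = hard_weights(w_soft)
r_soft = read(w_soft, contents)  # blend of all cells
r_hard = read(w_hard, contents)  # content of exactly one cell
```

The soft read stays differentiable with respect to the key and the memory, whereas the hard read selects a single cell, which is why the discrete variant is described as non-differentiable.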

Benchmarks

Benchmark: question-answering-on-babi
Methodology: DMN+
Metrics:
Accuracy (trained on 10k): 97.2%
Accuracy (trained on 1k): 66.8%
