HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Learning to Remember Rare Events

Łukasz Kaiser; Ofir Nachum; Aurko Roy; Samy Bengio

Learning to Remember Rare Events

Abstract

Despite recent advances, memory-augmented deep neural networks are still limited when it comes to life-long and one-shot learning, especially in remembering rare events. We present a large-scale life-long memory module for use in deep learning. The module exploits fast nearest-neighbor algorithms for efficiency and thus scales to large memory sizes. Except for the nearest-neighbor query, the module is fully differentiable and trained end-to-end with no extra supervision. It operates in a life-long manner, i.e., without the need to reset it during training. Our memory module can be easily added to any part of a supervised neural network. To show its versatility we add it to a number of networks, from simple convolutional ones tested on image classification to deep sequence-to-sequence and recurrent-convolutional models. In all cases, the enhanced network gains the ability to remember and do life-long one-shot learning. Our module remembers training examples shown many thousands of steps in the past and it can successfully generalize from them. We set new state-of-the-art for one-shot learning on the Omniglot dataset and demonstrate, for the first time, life-long one-shot learning in recurrent neural networks on a large-scale machine translation task.

Code Repositories

rdspring1/lsh_deeplearning
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
few-shot-image-classification-on-omniglot-1-1ConvNet with Memory Module
Accuracy: 95%
few-shot-image-classification-on-omniglot-1-2ConvNet with Memory Module
Accuracy: 98.4
few-shot-image-classification-on-omniglot-5-1ConvNet with Memory Module
Accuracy: 98.6%
few-shot-image-classification-on-omniglot-5-2ConvNet with Memory Module
Accuracy: 99.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Learning to Remember Rare Events | Papers | HyperAI