HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Atlas: Few-shot Learning with Retrieval Augmented Language Models

Gautier Izacard; Patrick Lewis; Maria Lomeli; Lucas Hosseini; Fabio Petroni; Timo Schick; Jane Dwivedi-Yu; Armand Joulin; Sebastian Riedel; Edouard Grave

Atlas: Few-shot Learning with Retrieval Augmented Language Models

Abstract

Large language models have shown impressive few-shot results on a wide range of tasks. However, when knowledge is key for such results, as is the case for tasks such as question answering and fact checking, massive parameter counts to store knowledge seem to be needed. Retrieval augmented models are known to excel at knowledge intensive tasks without the need for as many parameters, but it is unclear whether they work in few-shot settings. In this work we present Atlas, a carefully designed and pre-trained retrieval augmented language model able to learn knowledge intensive tasks with very few training examples. We perform evaluations on a wide range of tasks, including MMLU, KILT and NaturalQuestions, and study the impact of the content of the document index, showing that it can easily be updated. Notably, Atlas reaches over 42% accuracy on Natural Questions using only 64 examples, outperforming a 540B parameters model by 3% despite having 50x fewer parameters.

Code Repositories

thunlp/clueanchor
pytorch
Mentioned in GitHub
facebookresearch/atlas
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
multi-task-language-understanding-on-mmluAtlas (5-shot)
Average (%): 47.9
question-answering-on-natural-questionsAtlas (few-shot, k=64, Wiki-dec-2021+CC index)
EM: 42.4
question-answering-on-natural-questionsAtlas (full, Wiki-dec-2018 index)
EM: 64.0
question-answering-on-natural-questionsAtlas (full, Wiki-dec-2021+CC index)
EM: 60.4
question-answering-on-natural-questionsAtlas (few-shot, k=64, Wiki-Dec-2018 index)
EM: 45.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Atlas: Few-shot Learning with Retrieval Augmented Language Models | Papers | HyperAI