Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, Hannaneh Hajishirzi

Abstract

Despite their remarkable capabilities, large language models (LLMs) often produce responses containing factual inaccuracies due to their sole reliance on the parametric knowledge they encapsulate. Retrieval-Augmented Generation (RAG), an ad hoc approach that augments LMs with retrieval of relevant knowledge, reduces such issues. However, indiscriminately retrieving and incorporating a fixed number of retrieved passages, regardless of whether retrieval is necessary or the passages are relevant, diminishes LM versatility or can lead to unhelpful response generation. We introduce a new framework called Self-Reflective Retrieval-Augmented Generation (Self-RAG) that enhances an LM's quality and factuality through retrieval and self-reflection. Our framework trains a single arbitrary LM that adaptively retrieves passages on-demand, and generates and reflects on retrieved passages and its own generations using special tokens, called reflection tokens. Generating reflection tokens makes the LM controllable during the inference phase, enabling it to tailor its behavior to diverse task requirements. Experiments show that Self-RAG (7B and 13B parameters) significantly outperforms state-of-the-art LLMs and retrieval-augmented models on a diverse set of tasks. Specifically, Self-RAG outperforms ChatGPT and retrieval-augmented Llama2-chat on open-domain QA, reasoning, and fact verification tasks, and it shows significant gains in improving factuality and citation accuracy for long-form generations relative to these models.
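
The inference-time control described in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the abstract or from the official code: the `Candidate` fields, the `should_retrieve` threshold, and the weighted `score` are hypothetical stand-ins for whatever probabilities a trained model assigns to its reflection tokens. The sketch shows only the general idea: retrieval is triggered on demand, and changing the weights on the critique signals changes which candidate output is preferred.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One candidate continuation with hypothetical critique scores in [0, 1].

    The three fields are illustrative stand-ins for the probabilities a
    model might assign to its reflection tokens; the names are assumptions.
    """
    text: str
    relevance: float   # is the retrieved passage relevant to the query?
    support: float     # is the generation supported by the passage?
    usefulness: float  # is the generation a useful answer overall?


def should_retrieve(p_retrieve: float, threshold: float = 0.5) -> bool:
    """Retrieve only when the (assumed) retrieval signal clears a threshold."""
    return p_retrieve >= threshold


def score(candidate: Candidate, weights: dict) -> float:
    """Weighted sum of critique scores; the weights are the user's control knob."""
    return (
        weights["relevance"] * candidate.relevance
        + weights["support"] * candidate.support
        + weights["usefulness"] * candidate.usefulness
    )


def pick_best(candidates: list, weights: dict) -> Candidate:
    """Return the candidate with the highest weighted critique score."""
    return max(candidates, key=lambda c: score(c, weights))


if __name__ == "__main__":
    candidates = [
        Candidate("A", relevance=0.9, support=0.2, usefulness=0.5),
        Candidate("B", relevance=0.4, support=0.9, usefulness=0.5),
    ]
    # A task that prioritises evidence support (for example, cited long-form answers).
    support_heavy = {"relevance": 0.2, "support": 1.0, "usefulness": 0.2}
    # A task that prioritises topical relevance.
    relevance_heavy = {"relevance": 1.0, "support": 0.2, "usefulness": 0.2}

    print(should_retrieve(0.7))                          # retrieval triggered
    print(pick_best(candidates, support_heavy).text)     # prefers B
    print(pick_best(candidates, relevance_heavy).text)   # prefers A
```

Raising the threshold makes the toy system retrieve less often, and shifting weight toward `support` favors outputs that are better grounded in the retrieved passage; both are adjustments made at inference time, without retraining.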

Code Repositories

fate-ubw/raglab (pytorch, mentioned in GitHub)
ruc-nlpir/flashrag (pytorch, mentioned in GitHub)
llm-lab-org/CLASP (mentioned in GitHub)
AkariAsai/self-rag (official, pytorch, mentioned in GitHub)
ShayekhBinIslam/openrag (pytorch, mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
question-answering-on-popqa | SelfRAG-13b | Accuracy: 55.8
question-answering-on-popqa | SelfRAG-7b | Accuracy: 54.9
