HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation

Guanting Dong Yutao Zhu Chenghao Zhang Zechen Wang Zhicheng Dou Ji-Rong Wen

Understand What LLM Needs: Dual Preference Alignment for
  Retrieval-Augmented Generation

Abstract

Retrieval-augmented generation (RAG) has demonstrated effectiveness inmitigating the hallucination problem of large language models (LLMs). However,the difficulty of aligning the retriever with the diverse LLMs' knowledgepreferences inevitably poses an inevitable challenge in developing a reliableRAG system. To address this issue, we propose DPA-RAG, a universal frameworkdesigned to align diverse knowledge preferences within RAG systems.Specifically, we initially introduce a preference knowledge constructionpipline and incorporate five novel query augmentation strategies to alleviatepreference data scarcity. Based on preference data, DPA-RAG accomplishes bothexternal and internal preference alignment: 1) It jointly integrate pair-wise,point-wise, and contrastive preference alignment abilities into the reranker,achieving external preference alignment among RAG components. 2) It furtherintroduces a pre-aligned stage before vanilla Supervised Fine-tuning (SFT),enabling LLMs to implicitly capture knowledge aligned with their reasoningpreferences, achieving LLMs' internal alignment. Experimental results acrossfour knowledge-intensive QA datasets demonstrate that DPA-RAG outperforms allbaselines and seamlessly integrates both black-box and open-sourced LLMreaders. Further qualitative analysis and discussions also provide empiricalguidance for achieving reliable RAG systems. Our code is publicly available athttps://github.com/dongguanting/DPA-RAG.

Code Repositories

dongguanting/dpa-rag
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
knowledge-base-question-answering-on-1DPA-RAG
F1: 81.74
Hits@1: 80.28
question-answering-on-natural-questionsDPA-RAG
EM: 59.19
question-answering-on-triviaqaDPA-RAG
F1: 80.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation | Papers | HyperAI