3 months ago

Dense Passage Retrieval for Open-Domain Question Answering

Vladimir Karpukhin Barlas Oğuz Sewon Min Patrick Lewis Ledell Wu Sergey Edunov Danqi Chen Wen-tau Yih

Abstract

Open-domain question answering relies on efficient passage retrieval to select candidate contexts, where traditional sparse vector space models, such as TF-IDF or BM25, are the de facto method. In this work, we show that retrieval can be practically implemented using dense representations alone, where embeddings are learned from a small number of questions and passages by a simple dual-encoder framework. When evaluated on a wide range of open-domain QA datasets, our dense retriever outperforms a strong Lucene-BM25 system largely by 9%-19% absolute in terms of top-20 passage retrieval accuracy, and helps our end-to-end QA system establish new state-of-the-art on multiple open-domain QA benchmarks.

Code Repositories

alexlimh/DPR_MUF

pytorch

Mentioned in GitHub

oriram/spider

pytorch

Mentioned in GitHub

DevSinghSachan/unsupervised-passage-reranking

pytorch

Mentioned in GitHub

efficientqa/retrieval-based-baselines

Mentioned in GitHub

openmatch/ance-tele

jax

Mentioned in GitHub

Ankur3107/dpr-tf

Mentioned in GitHub

hongyuntw/DPR

pytorch

Mentioned in GitHub

luyug/GC-DPR

pytorch

Mentioned in GitHub

texttron/tevatron

jax

huggingface/transformers

pytorch

Mentioned in GitHub

deepset-ai/haystack

pytorch

Mentioned in GitHub

facebookresearch/DPR

Official

pytorch

Mentioned in GitHub

AhmedHussKhalifa/Dense_Passage_Retrieval_in_Conversational_Search

pytorch

Mentioned in GitHub

hongyuntw/DPR_BESS

pytorch

Mentioned in GitHub

junnyu/dpr_paddle

paddle

Mentioned in GitHub

Hannibal046/nanoDPR

pytorch

Mentioned in GitHub

AkariAsai/XORQA

pytorch

Mentioned in GitHub

amzn/refuel-open-domain-qa

pytorch

Mentioned in GitHub

nidhikamal-emb/DPR_repo

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
passage-retrieval-on-natural-questions	DPR	Precision@100: 86 Precision@20: 79.4
question-answering-on-natural-questions	DPR	EM: 41.5
question-answering-on-naturalqa	DPR	EM: 41.5
question-answering-on-triviaqa	DPR	EM: 56.8
question-answering-on-webquestions	DPR	EM: 42.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Dense Passage Retrieval for Open-Domain Question Answering

Vladimir Karpukhin Barlas Oğuz Sewon Min Patrick Lewis Ledell Wu Sergey Edunov Danqi Chen Wen-tau Yih

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters