End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering

Devendra Singh Sachan, Siva Reddy, William Hamilton, Chris Dyer, Dani Yogatama

Abstract

We present an end-to-end differentiable training method for retrieval-augmented open-domain question answering systems that combine information from multiple retrieved documents when generating answers. We model retrieval decisions as latent variables over sets of relevant documents. Since marginalizing over sets of retrieved documents is computationally hard, we approximate this using an expectation-maximization algorithm. We iteratively estimate the value of our latent variable (the set of relevant documents for a given question) and then use this estimate to update the retriever and reader parameters. We hypothesize that such end-to-end training allows training signals to flow to the reader and then to the retriever better than stage-wise training. This results in a retriever that is able to select more relevant documents for a question and a reader that is trained on more accurate documents to generate an answer. Experiments on three benchmark datasets demonstrate that our proposed method outperforms all existing approaches of comparable size by 2-3% absolute exact match points, achieving new state-of-the-art results. Our results also demonstrate the feasibility of learning to retrieve to improve answer generation without explicit supervision of retrieval decisions.
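The iterative scheme described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes the latent document set is estimated as the retriever's current top-k documents, it replaces the reader with fixed, made-up per-document answer likelihoods, and it updates only raw retriever scores by gradient ascent on log sum_k p(answer | doc_k) p(doc_k | question). The names `em_step`, `reader_likelihood`, and `lr` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_indices(scores, k):
    """Indices of the k highest-scoring documents."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]


def em_step(scores, reader_likelihood, k, lr=1.0):
    """One toy iteration (assumption: top-k stands in for the latent set).

    scores            -- retriever score per document (the trainable part here)
    reader_likelihood -- hypothetical p(answer | question, doc), held fixed
    Returns the updated scores and the log-evidence before the update.
    """
    # Estimate the latent set of relevant documents with the current retriever.
    top = top_k_indices(scores, k)
    prior = softmax([scores[i] for i in top])       # p(doc | question, top-k)
    lik = [reader_likelihood[i] for i in top]       # p(answer | question, doc)

    evidence = sum(p * l for p, l in zip(prior, lik))
    posterior = [p * l / evidence for p, l in zip(prior, lik)]

    # The gradient of log(evidence) w.r.t. each score is posterior - prior,
    # so documents that help the reader are pushed up.
    new_scores = list(scores)
    for i, post, pri in zip(top, posterior, prior):
        new_scores[i] += lr * (post - pri)
    return new_scores, math.log(evidence)


if __name__ == "__main__":
    scores = [2.0, 1.0, 0.5, -1.0]      # made-up initial retriever scores
    likelihoods = [0.05, 0.6, 0.3, 0.9]  # made-up reader likelihoods
    for step in range(5):
        scores, log_ev = em_step(scores, likelihoods, k=3)
        print(step, round(log_ev, 4), [round(s, 3) for s in scores])
```

In this toy setting the retriever learns to rank the document that best supports the answer first, without any direct retrieval labels. The last document is never updated because it never enters the top-k set, which hints at why the quality of the initial retriever matters in such a scheme.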

Code Repositories

DevSinghSachan/emdr2 (official, PyTorch)
DevSinghSachan/art (PyTorch)

Benchmarks

Benchmark | Methodology | Metrics
open-domain-question-answering-on | EMDR2 | Exact Match: 48.7
open-domain-question-answering-on-natural-1 | EMDR2 | Exact Match: 52.5
question-answering-on-natural-questions | EMDR2 | EM: 52.5
question-answering-on-triviaqa | EMDR2 | EM: 71.4
