Command Palette
Search for a command to run...
Michael Glass; Gaetano Rossiello; Md Faisal Mahbub Chowdhury; Ankita Rajaram Naik; Pengshan Cai; Alfio Gliozzo

Abstract
As demonstrated by GPT-3 and T5, transformers grow in capability as parameter spaces become larger and larger. However, for tasks that require a large amount of knowledge, non-parametric memory allows models to grow dramatically with a sub-linear increase in computational cost and GPU memory requirements. Recent models such as RAG and REALM have introduced retrieval into conditional generation. These models incorporate neural initial retrieval from a corpus of passages. We build on this line of research, proposing Re2G, which combines both neural initial retrieval and reranking into a BART-based sequence-to-sequence generation. Our reranking approach also permits merging retrieval results from sources with incomparable scores, enabling an ensemble of BM25 and neural initial retrieval. To train our system end-to-end, we introduce a novel variation of knowledge distillation to train the initial retrieval, reranker, and generation using only ground truth on the target sequence output. We find large gains in four diverse tasks: zero-shot slot filling, question answering, fact-checking, and dialog, with relative gains of 9% to 34% over the previous state-of-the-art on the KILT leaderboard. We make our code available as open source at https://github.com/IBM/kgi-slot-filling/tree/re2g.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| fact-verification-on-kilt-fever | Re2G | Accuracy: 89.55 KILT-AC: 78.53 R-Prec: 88.92 Recall@5: 92.52 |
| open-domain-dialog-on-kilt-wizard-of | Re2G | F1: 18.9 KILT-F1: 12.98 KILT-RL: 11.39 R-Prec: 60.1 ROUGE-L: 16.76 Recall@5: 79.98 |
| open-domain-question-answering-on-kilt | Re2G | EM: 51.73 F1: 60.97 KILT-EM: 43.56 KILT-F1: 49.8 R-Prec: 70.78 Recall@5: 76.63 |
| open-domain-question-answering-on-kilt-2 | Re2G | EM: 76.27 F1: 81.4 KILT-EM: 57.91 KILT-F1: 61.78 R-Prec: 72.68 Recall@5: 74.23 |
| slot-filling-on-kilt-t-rex | Re2G | Accuracy: 87.68 F1: 89.93 KILT-AC: 75.84 KILT-F1: 77.05 R-Prec: 80.7 Recall@5: 89.0 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.