HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Document Expansion by Query Prediction

Rodrigo Nogueira; Wei Yang; Jimmy Lin; Kyunghyun Cho

Document Expansion by Query Prediction

Abstract

One technique to improve the retrieval effectiveness of a search engine is to expand documents with terms that are related or representative of the documents' content.From the perspective of a question answering system, this might comprise questions the document can potentially answer. Following this observation, we propose a simple method that predicts which queries will be issued for a given document and then expands it with those predictions with a vanilla sequence-to-sequence model, trained using datasets consisting of pairs of query and relevant documents. By combining our method with a highly-effective re-ranking component, we achieve the state of the art in two retrieval tasks. In a latency-critical regime, retrieval results alone (without re-ranking) approach the effectiveness of more computationally expensive neural re-rankers but are much faster.

Code Repositories

castorini/Anserini
Mentioned in GitHub
castorini/docTTTTTquery
pytorch
Mentioned in GitHub
kasys-lab/anserini-kasys
Mentioned in GitHub
nyu-dl/dl4ir-doc2query
Official
tf
Mentioned in GitHub
irgroup/clef2023-longeval-irc
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
passage-re-ranking-on-ms-marcoBERT + Doc2query
MRR: 0.368
passage-re-ranking-on-trec-pmBERT + Doc2query
mAP: 36.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Document Expansion by Query Prediction | Papers | HyperAI