5 months ago

CEDR: Contextualized Embeddings for Document Ranking

Sean MacAvaney; Andrew Yates; Arman Cohan; Nazli Goharian

Abstract

Although considerable attention has been given to neural ranking architectures recently, far less attention has been paid to the term representations that are used as input to these models. In this work, we investigate how two pretrained contextualized language models (ELMo and BERT) can be utilized for ad-hoc document ranking. Through experiments on TREC benchmarks, we find that several existing neural ranking architectures can benefit from the additional context provided by contextualized language models. Furthermore, we propose a joint approach that incorporates BERT's classification vector into existing neural models and show that it outperforms state-of-the-art ad-hoc ranking baselines. We call this joint approach CEDR (Contextualized Embeddings for Document Ranking). We also address practical challenges in using these models for ranking, including the maximum input length imposed by BERT and runtime performance impacts of contextualized language models.
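The abstract mentions the maximum input length imposed by BERT but does not say here how it is handled. One common workaround, sketched below as an assumption and not as the authors' confirmed procedure, is to split a long document into the fewest roughly equal-sized chunks that each fit alongside the query. The function name `split_document`, the default limit of 512 positions, and the allowance of 3 special tokens ([CLS] and two [SEP]) are illustrative choices.

```python
import math


def split_document(doc_tokens, query_len, max_len=512, num_special=3):
    """Split doc_tokens into the fewest near-equal chunks that fit the model.

    Hypothetical helper: each chunk must fit in max_len positions together
    with the query tokens and the special tokens ([CLS], [SEP], [SEP]).
    """
    budget = max_len - query_len - num_special
    if budget <= 0:
        raise ValueError("query leaves no room for document tokens")
    if not doc_tokens:
        return [[]]
    # Fewest chunks that respect the budget, then balance their sizes.
    n_chunks = math.ceil(len(doc_tokens) / budget)
    chunk_size = math.ceil(len(doc_tokens) / n_chunks)
    return [
        doc_tokens[i:i + chunk_size]
        for i in range(0, len(doc_tokens), chunk_size)
    ]
```

Each chunk would then be encoded together with the query in its own pass. How the per-chunk outputs are combined afterwards (for example by averaging the classification vectors) is a separate design choice that this sketch leaves open.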

Code Repositories

ssun32/clirmatrix_cedr (PyTorch, mentioned in GitHub)
Georgetown-IR-Lab/contextualized-reps-for-ranking (official, PyTorch, mentioned in GitHub)
crystina-z/CEDR_tpu (PyTorch, mentioned in GitHub)
Georgetown-IR-Lab/cedr (official, PyTorch, mentioned in GitHub)
TikaToka/CapstoneSpring (PyTorch, mentioned in GitHub)
Crysitna/CEDR_tpu (PyTorch, mentioned in GitHub)

Benchmarks

Benchmark                                        Methodology    P@20     nDCG@20
ad-hoc-information-retrieval-on-trec-robust04    CEDR-KNRM      0.4667   0.5381
ad-hoc-information-retrieval-on-trec-robust04    Vanilla BERT   0.4042   0.4541
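
For readers unfamiliar with the two metrics reported above, the following is a minimal sketch of how P@20 and nDCG@20 can be computed for a single query. It assumes a list of graded relevance labels in ranked order and uses a linear-gain DCG with a log2 rank discount. The function names are illustrative, and official evaluation tools may differ in details such as the gain function or how unjudged documents are treated.

```python
import math


def precision_at_k(relevances, k=20):
    """Fraction of the top-k ranked documents that are relevant (grade > 0)."""
    return sum(1 for r in relevances[:k] if r > 0) / k


def dcg_at_k(relevances, k=20):
    """Discounted cumulative gain with linear gain and log2 rank discount."""
    return sum(r / math.log2(i + 2) for i, r in enumerate(relevances[:k]))


def ndcg_at_k(relevances, k=20, all_judged=None):
    """DCG of the ranking divided by the DCG of an ideal ordering.

    all_judged: optional relevance grades of every judged document for the
    query; if omitted, the ideal ordering is built from `relevances` alone.
    """
    pool = all_judged if all_judged is not None else relevances
    ideal_dcg = dcg_at_k(sorted(pool, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal_dcg if ideal_dcg > 0 else 0.0
```

A benchmark score such as those in the table would be the mean of these per-query values over all topics in the collection.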
