HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval

Wei Zhong; Jheng-Hong Yang; Yuqing Xie; Jimmy Lin

Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval

Abstract

With the recent success of dense retrieval methods based on bi-encoders, studies have applied this approach to various interesting downstream retrieval tasks with good efficiency and in-domain effectiveness. Recently, we have also seen the presence of dense retrieval models in Math Information Retrieval (MIR) tasks, but the most effective systems remain classic retrieval methods that consider hand-crafted structure features. In this work, we try to combine the best of both worlds:\ a well-defined structure search method for effective formula search and efficient bi-encoder dense retrieval models to capture contextual similarities. Specifically, we have evaluated two representative bi-encoder models for token-level and passage-level dense retrieval on recent MIR tasks. Our results show that bi-encoder models are highly complementary to existing structure search methods, and we are able to advance the state-of-the-art on MIR datasets.

Code Repositories

approach0/math-dense-retrievers
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
math-information-retrieval-on-arqmath2Approach0+ColBERT (reranking)
P@10: 0.276
math-information-retrieval-on-arqmath2Approach0+ColBERT (fusion)
MAP: 0.215
NDCG: 0.447
P@10: 0.252
bpref: 0.202

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval | Papers | HyperAI