HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Denoising Table-Text Retrieval for Open-Domain Question Answering

Deokhyung Kang Baikjin Jung Yunsu Kim Gary Geunbae Lee

Denoising Table-Text Retrieval for Open-Domain Question Answering

Abstract

In table-text open-domain question answering, a retriever system retrieves relevant evidence from tables and text to answer questions. Previous studies in table-text open-domain question answering have two common challenges: firstly, their retrievers can be affected by false-positive labels in training datasets; secondly, they may struggle to provide appropriate evidence for questions that require reasoning across the table. To address these issues, we propose Denoised Table-Text Retriever (DoTTeR). Our approach involves utilizing a denoised training dataset with fewer false positive labels by discarding instances with lower question-relevance scores measured through a false positive detection model. Subsequently, we integrate table-level ranking information into the retriever to assist in finding evidence for questions that demand reasoning across the table. To encode this ranking information, we fine-tune a rank-aware column encoder to identify minimum and maximum values within a column. Experimental results demonstrate that DoTTeR significantly outperforms strong baselines on both retrieval recall and downstream QA tasks. Our code is available at https://github.com/deokhk/DoTTeR.

Code Repositories

deokhk/dotter
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
question-answering-on-ott-qaDoTTeR
ANS-EM: 35.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Denoising Table-Text Retrieval for Open-Domain Question Answering | Papers | HyperAI