Denoising Distantly Supervised Open-Domain Question Answering

Yankai Lin, Maosong Sun, Zhiyuan Liu, Haozhe Ji

Abstract

Distantly supervised open-domain question answering (DS-QA) aims to find answers in collections of unlabeled text. Existing DS-QA models usually retrieve related paragraphs from a large-scale corpus and apply reading comprehension techniques to extract answers from the most relevant paragraph, ignoring the rich information contained in the other paragraphs. Moreover, distant supervision data is inevitably accompanied by the wrong-labeling problem, and this noisy data substantially degrades the performance of DS-QA. To address these issues, we propose a novel DS-QA model that employs a paragraph selector to filter out noisy paragraphs and a paragraph reader to extract the correct answer from the denoised paragraphs. Experimental results on real-world datasets show that our model can capture useful information from noisy data and achieve significant improvements on DS-QA compared to all baselines.
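
The abstract names two components, a paragraph selector and a paragraph reader, but this page does not say how their outputs are combined. The following is a minimal sketch of one plausible combination, assuming the selector emits one raw score per paragraph and the reader emits a probability for each candidate answer in each paragraph. The function names (`softmax`, `aggregate_answers`, `best_answer`) and the weighted-sum aggregation are illustrative assumptions, not the authors' implementation.

```python
import math
from collections import defaultdict


def softmax(scores):
    """Turn raw selector scores into a distribution over paragraphs."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_answers(selector_scores, reader_outputs):
    """Combine per-paragraph answer probabilities (assumed scheme).

    selector_scores: one raw score per retrieved paragraph (hypothetical input).
    reader_outputs:  one dict per paragraph, mapping candidate answer -> probability.
    Each answer's total is the selector-weighted sum of its reader probabilities,
    so paragraphs the selector considers noisy contribute little.
    """
    weights = softmax(selector_scores)
    totals = defaultdict(float)
    for weight, answers in zip(weights, reader_outputs):
        for answer, prob in answers.items():
            totals[answer] += weight * prob
    return dict(totals)


def best_answer(selector_scores, reader_outputs):
    """Return the candidate answer with the highest aggregated probability."""
    totals = aggregate_answers(selector_scores, reader_outputs)
    return max(totals, key=totals.get)


if __name__ == "__main__":
    # Toy data: the first paragraph is scored as most relevant.
    scores = [2.0, 0.0, 0.0]
    outputs = [
        {"Paris": 0.9, "Lyon": 0.1},
        {"Lyon": 0.8, "Paris": 0.2},
        {"Lyon": 1.0},
    ]
    print(best_answer(scores, outputs))
```

In this toy run, two of the three paragraphs favor "Lyon", yet the selector weights let the single high-confidence paragraph dominate. That is the intuition behind down-weighting noisy paragraphs rather than reading only the top-ranked one.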

Benchmarks

Benchmark | Methodology | Metrics
open-domain-question-answering-on-quasar | Denoising QA | EM (Quasar-T): 42.2; F1 (Quasar-T): 49.3
open-domain-question-answering-on-searchqa | Denoising QA | EM: 58.8; F1: 64.5; N-gram F1: -; Unigram Acc: -
question-answering-on-quasart-t | Denoising QA | EM: 42.2
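
The benchmarks report EM (exact match) and F1, but this page does not describe how they are computed. The sketch below assumes the common SQuAD-style convention: lowercase the text, strip punctuation and English articles, then compare strings for EM or count token overlap for F1. The helper names are hypothetical, and the evaluation scripts behind the numbers above may normalize differently.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and articles, collapse whitespace (assumed convention)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, gold):
    """1.0 if the normalized strings are identical, otherwise 0.0."""
    return float(normalize(prediction) == normalize(gold))


def f1_score(prediction, gold):
    """Token-level F1 between the normalized prediction and gold answer."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    # Multiset intersection counts each shared token at most as often as it appears in both.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower!", "eiffel tower"))
    print(f1_score("eiffel tower paris", "eiffel tower"))
```

Reported scores are typically these per-question values averaged over the test set and multiplied by 100.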
