HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension

Lingyong Yan; Xianpei Han; Le Sun; Fangchao Liu; Ning Bian

From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension

Abstract

Distant supervision (DS) is a promising approach for relation extraction but often suffers from the noisy label problem. Traditional DS methods usually represent an entity pair as a bag of sentences and denoise labels using multi-instance learning techniques. The bag-based paradigm, however, fails to leverage the inter-sentence-level and the entity-level evidence for relation extraction, and their denoising algorithms are often specialized and complicated. In this paper, we propose a new DS paradigm--document-based distant supervision, which models relation extraction as a document-based machine reading comprehension (MRC) task. By re-organizing all sentences about an entity as a document and extracting relations via querying the document with relation-specific questions, the document-based DS paradigm can simultaneously encode and exploit all sentence-level, inter-sentence-level, and entity-level evidence. Furthermore, we design a new loss function--DSLoss (distant supervision loss), which can effectively train MRC models using only $\langle$document, question, answer$\rangle$ tuples, therefore noisy label problem can be inherently resolved. Experiments show that our method achieves new state-of-the-art DS performance.

Code Repositories

lingyongyan/docds
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
relationship-extraction-distant-supervised-on-2DocDS
P@100: 0.939
P@200: 0.889
P@300: 0.873
PR AUC: 0.595

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension | Papers | HyperAI