DocOIE: A Document-level Context-Aware Dataset for OpenIE

Kuicai Dong Yilin Zhao Aixin Sun Jung-Jae Kim Xiaoli Li

Abstract

Open Information Extraction (OpenIE) aims to extract structured relational tuples (subject, relation, object) from sentences and plays a critical role in many downstream NLP applications. Existing solutions perform extraction at the sentence level, without referring to any additional contextual information. In reality, however, a sentence typically exists as part of a document rather than standalone; we often need to access relevant contextual information around the sentence before we can accurately interpret it. As there is no document-level context-aware OpenIE dataset available, we manually annotate 800 sentences from 80 documents in two domains (Healthcare and Transportation) to form the DocOIE dataset for evaluation. In addition, we propose DocIE, a novel document-level context-aware OpenIE model. Our experimental results based on DocIE demonstrate that incorporating document-level context improves OpenIE performance. Both the DocOIE dataset and the DocIE model are publicly released.
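To make the (subject, relation, object) structure concrete, the following is a minimal sketch of how one extraction and its document context could be represented. The class names, field names, and the example sentences are all hypothetical and chosen for illustration only; they are not the DocOIE annotation format or data.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class RelationTuple:
    """One OpenIE extraction: (subject, relation, object). Hypothetical type."""
    subject: str
    relation: str
    object: str


@dataclass
class SentenceInContext:
    """A target sentence plus surrounding sentences from the same document.

    Hypothetical container, not the DocOIE file schema.
    """
    doc_id: str
    sentence: str
    context: List[str] = field(default_factory=list)
    tuples: List[RelationTuple] = field(default_factory=list)


# Invented example: "The device" and "the reading" are only fully
# interpretable once the preceding sentence of the document is available.
example = SentenceInContext(
    doc_id="demo-doc-1",
    sentence="The device transmits the reading to a remote server.",
    context=["A wearable sensor measures the heart rate of a patient."],
    tuples=[
        RelationTuple(
            subject="The device",
            relation="transmits",
            object="the reading to a remote server",
        )
    ],
)

for t in example.tuples:
    print((t.subject, t.relation, t.object))
```

A sentence-level extractor would see only `sentence`; a document-level one could also consult `context` when deciding how to read ambiguous phrases.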

Code Repositories

daviddongkc/DocOIE (Official, PyTorch, mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
open-information-extraction-on-docoie | Reverb | F1: 55.8
open-information-extraction-on-docoie | DocIE w transformer | F1: 60.8
open-information-extraction-on-docoie-1 | DocIE w transformer | F1: 56.9
open-information-extraction-on-docoie-1 | Reverb | F1: 49.7
