HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Document-level Entity-based Extraction as Template Generation

Kung-Hsiang Huang; Sam Tang; Nanyun Peng

Document-level Entity-based Extraction as Template Generation

Abstract

Document-level entity-based extraction (EE), aiming at extracting entity-centric information such as entity roles and entity relations, is key to automatic knowledge acquisition from text corpora for various domains. Most document-level EE systems build extractive models, which struggle to model long-term dependencies among entities at the document level. To address this issue, we propose a generative framework for two document-level EE tasks: role-filler entity extraction (REE) and relation extraction (RE). We first formulate them as a template generation problem, allowing models to efficiently capture cross-entity dependencies, exploit label semantics, and avoid the exponential computation complexity of identifying N-ary relations. A novel cross-attention guided copy mechanism, TopK Copy, is incorporated into a pre-trained sequence-to-sequence model to enhance the capabilities of identifying key information in the input document. Experiments done on the MUC-4 and SciREX dataset show new state-of-the-art results on REE (+3.26%), binary RE (+4.8%), and 4-ary RE (+2.7%) in F1 score.

Code Repositories

PlusLabNLP/TempGen
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
4-ary-relation-extraction-on-scirexTempGen
Avg. F1: 3.55
binary-relation-extraction-on-scirexTempGen
Avg. F1: 14.47
role-filler-entity-extraction-on-muc-4TempGen
Avg. F1: 57.76

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Document-level Entity-based Extraction as Template Generation | Papers | HyperAI