5 months ago

KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents

Tobias Deußer; Syed Musharraf Ali; Lars Hillebrand; Desiana Nurchalifah; Basil Jacob; Christian Bauckhage; Rafet Sifa

Abstract

We introduce KPI-EDGAR, a novel dataset for Joint Named Entity Recognition and Relation Extraction building on financial reports uploaded to the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system, where the main objective is to extract Key Performance Indicators (KPIs) from financial documents and link them to their numerical values and other attributes. We further provide four accompanying baselines for benchmarking potential future research. Additionally, we propose a new way of measuring the success of said extraction process by incorporating a word-level weighting scheme into the conventional F1 score to better model the inherently fuzzy borders of the entity pairs of a relation in this domain.
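The idea of giving partial credit for fuzzy entity borders can be illustrated with a small sketch. The abstract does not give the authors' exact formula, so the code below is not their metric. It is a minimal illustration under stated assumptions: spans are token offsets with an exclusive end, partial credit is the product of the Jaccard overlaps of the two entity spans, and each relation is scored against its best-matching counterpart. The function names and the "kpi-value" label are hypothetical.

```python
from typing import List, Tuple

Span = Tuple[int, int]              # (start, end) token offsets, end exclusive
Relation = Tuple[Span, Span, str]   # (head span, tail span, relation label)


def span_overlap(a: Span, b: Span) -> float:
    """Jaccard overlap of two token spans, a value in [0, 1]."""
    inter = max(0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def relation_credit(pred: Relation, gold: Relation) -> float:
    """Partial credit for one predicted relation against one gold relation.

    Assumption: labels must match exactly, and credit is the product of
    the head-span and tail-span overlaps.
    """
    if pred[2] != gold[2]:
        return 0.0
    return span_overlap(pred[0], gold[0]) * span_overlap(pred[1], gold[1])


def soft_relation_f1(preds: List[Relation], golds: List[Relation]) -> float:
    """F1 in which each relation contributes its best partial credit
    instead of a hard 0/1 match (best-match scoring, not one-to-one)."""
    if not preds or not golds:
        return 0.0
    precision = sum(max(relation_credit(p, g) for g in golds) for p in preds) / len(preds)
    recall = sum(max(relation_credit(p, g) for p in preds) for g in golds) / len(golds)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: the predicted KPI span misses the first gold token.
gold = [((0, 3), (5, 6), "kpi-value")]
pred = [((1, 3), (5, 6), "kpi-value")]
score = soft_relation_f1(pred, gold)
```

Under a conventional exact-match F1 the prediction above would score 0, whereas this sketch credits the two overlapping tokens of the three-token gold span. Other weighting choices, such as per-word weights that depend on a token's role in the entity, would fit the same structure.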

Code Repositories

tobideusser/kpi-edgar (official, PyTorch; mentioned in GitHub)

Benchmarks

Benchmark: joint-entity-and-relation-extraction-on-kpi
Methodology: KPI-BERT
Metrics: Relation F1: 43.76
