HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

WikiGUM: Exhaustive Entity Linking for Wikification in 12 Genres

Jessica Lin Amir Zeldes

WikiGUM: Exhaustive Entity Linking for Wikification in 12 Genres

Abstract

Previous work on Entity Linking has focused on resources targeting non-nested proper named entity mentions, often in data from Wikipedia, i.e. Wikification. In this paper, we present and evaluate WikiGUM, a fully wikified dataset, covering all mentions of named entities, including their non-named and pronominal mentions, as well as mentions nested within other mentions. The dataset covers a broad range of 12 written and spoken genres, most of which have not been included in Entity Linking efforts to date, leading to poor performance by a pretrained SOTA system in our evaluation. The availability of a variety of other annotations for the same data also enables further research on entities in context.

Benchmarks

BenchmarkMethodologyMetrics
entity-linking-on-gumbaseline
F1: 26.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
WikiGUM: Exhaustive Entity Linking for Wikification in 12 Genres | Papers | HyperAI