HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Highly Parallel Autoregressive Entity Linking with Discriminative Correction

Nicola De Cao; Wilker Aziz; Ivan Titov

Highly Parallel Autoregressive Entity Linking with Discriminative Correction

Abstract

Generative approaches have been recently shown to be effective for both Entity Disambiguation and Entity Linking (i.e., joint mention detection and disambiguation). However, the previously proposed autoregressive formulation for EL suffers from i) high computational cost due to a complex (deep) decoder, ii) non-parallelizable decoding that scales with the source sequence length, and iii) the need for training on a large amount of data. In this work, we propose a very efficient approach that parallelizes autoregressive linking across all potential mentions and relies on a shallow and efficient decoder. Moreover, we augment the generative objective with an extra discriminative component, i.e., a correction term which lets us directly optimize the generator's ranking. When taken together, these techniques tackle all the above issues: our model is >70 times faster and more accurate than the previous generative method, outperforming state-of-the-art approaches on the standard English dataset AIDA-CoNLL. Source code available at https://github.com/nicola-decao/efficient-autoregressive-EL

Code Repositories

nicola-decao/efficient-autoregressive-EL
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
entity-linking-on-aida-conllDe Cao et al. (2021b)
Micro-F1 strong: 85.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Highly Parallel Autoregressive Entity Linking with Discriminative Correction | Papers | HyperAI