4 months ago

Global Entity Disambiguation with BERT

Ikuya Yamada; Koki Washio; Hiroyuki Shindo; Yuji Matsumoto

Abstract

We propose a global entity disambiguation (ED) model based on BERT. To capture global contextual information for ED, our model treats not only words but also entities as input tokens, and solves the task by sequentially resolving mentions to their referent entities and using resolved entities as inputs at each step. We train the model using a large entity-annotated corpus obtained from Wikipedia. We achieve new state-of-the-art results on five standard ED datasets: AIDA-CoNLL, MSNBC, AQUAINT, ACE2004, and WNED-WIKI. The source code and model checkpoint are available at https://github.com/studio-ousia/luke.

Code Repositories

studio-ousia/luke

Official

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
entity-disambiguation-on-ace2004	confidence-order	Micro-F1: 91.9
entity-disambiguation-on-aida-conll	confidence-order	In-KB Accuracy: 95.0
entity-disambiguation-on-aquaint	confidence-order	Micro-F1: 93.5
entity-disambiguation-on-msnbc	confidence-order	Micro-F1: 96.3
entity-disambiguation-on-wned-cweb	MEP	Micro-F1: 76.2
entity-disambiguation-on-wned-cweb	confidence-order	Micro-F1: 78.9
entity-disambiguation-on-wned-wiki	MEP	Micro-F1: 86.2
entity-disambiguation-on-wned-wiki	confidence-order	Micro-F1: 89.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette