Command Palette
Search for a command to run...
Robust Disambiguation of Named Entities in Text
Robust Disambiguation of Named Entities in Text
Gerhard Weikum Stefan Thater Bilyana Taneva Marc Spaniol Manfred Pinkal Hagen Fürstenau Ilaria Bordino Mohamed Amir Yosef Johannes Hoffart
Abstract
Disambiguating named entities in naturallanguage text maps mentions of ambiguous names onto canonical entities like people or places, registered in a knowledge base such as DBpedia or YAGO. This paper presents a robust method for collective disambiguation, by harnessing context from knowledge bases and using a new form of coherence graph. It unifies prior approaches into a comprehensive framework that combines three measures: the prior probability of an entity being mentioned, the similarity between the contexts of a mention and a candidate entity, as well as the coherence among candidate entities for all mentions together. The method builds a weighted graph of mentions and candidate entities, and computes a dense subgraph that approximates the best joint mention-entity mapping. Experiments show that the new method significantly outperforms prior methods in terms of accuracy, with robust behavior across a variety of inputs.