HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Representation Learning of Entities and Documents from Knowledge Base Descriptions

Ikuya Yamada; Hiroyuki Shindo; Yoshiyasu Takefuji

Representation Learning of Entities and Documents from Knowledge Base Descriptions

Abstract

In this paper, we describe TextEnt, a neural network model that learns distributed representations of entities and documents directly from a knowledge base (KB). Given a document in a KB consisting of words and entity annotations, we train our model to predict the entity that the document describes and map the document and its target entity close to each other in a continuous vector space. Our model is trained using a large number of documents extracted from Wikipedia. The performance of the proposed model is evaluated using two tasks, namely fine-grained entity typing and multiclass text classification. The results demonstrate that our model achieves state-of-the-art performance on both tasks. The code and the trained representations are made available online for further academic research.

Code Repositories

wikipedia2vec/wikipedia2vec
Official
Mentioned in GitHub
studio-ousia/textent
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
entity-typing-on-freebase-figerTextEnt-full
Accuracy: 37.4
BEP: 94.8
Macro F1: 84.2
Micro F1: 85.7
P@1: 93.2
text-classification-on-20newsTextEnt-full
Accuracy: 84.5
F-measure: 83.9
text-classification-on-r8TextEnt-full
Accuracy: 96.7
F-measure: 91

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Representation Learning of Entities and Documents from Knowledge Base Descriptions | Papers | HyperAI