Phraseformer: Multimodal Key-phrase Extraction using Transformer and Graph Embedding

Narjes Nikzad-Khasmakhi, Mohammad-Reza Feizi-Derakhshi, Meysam Asgari-Chenaghlu, Mohammad-Ali Balafar, Ali-Reza Feizi-Derakhshi, Taymaz Rahkar-Farshi, Majid Ramezani, Zoleikha Jahanbakhsh-Nagadeh, Elnaz Zafarani-Moattar, Mehrdad Ranjbar-Khadivi

Abstract

Background: Keyword extraction is a popular research topic in natural language processing. Keywords are terms that describe the most relevant information in a document. The main problem researchers face is how to extract the core keywords from a document efficiently and accurately. Although previous keyword extraction approaches have used text and graph features, there is a lack of models that can properly learn and combine these features.

Methods: In this paper, we develop a multimodal key-phrase extraction approach, named Phraseformer, using transformer and graph embedding techniques. In Phraseformer, each keyword candidate is represented by a vector that is the concatenation of its text and structure learning representations. Phraseformer takes advantage of recent work such as BERT and ExEm to preserve both representations. Phraseformer also treats key-phrase extraction as a sequence labeling problem, which it solves as a classification task.

Results: We evaluate Phraseformer by F1-score on three datasets: Inspec, SemEval2010, and SemEval2017. We also investigate the performance of different classifiers with Phraseformer on the Inspec dataset. Experimental results demonstrate the effectiveness of Phraseformer on all three datasets. Additionally, the Random Forest classifier achieves the highest F1-score among all classifiers.

Conclusions: The combination of BERT and ExEm is more meaningful and better represents the semantics of words. Hence, Phraseformer significantly outperforms single-modality methods.
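The two ideas in the Methods paragraph, concatenating a text representation with a graph representation and then labeling each token, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the vector values are made up, the B/I/O label scheme is one common choice for sequence labeling (the abstract does not name the scheme), and the helper names `concat_representations` and `decode_bio` are hypothetical. In the paper the vectors would come from BERT and a graph embedding such as ExEm, and the labels from a trained classifier.

```python
from typing import List, Sequence


def concat_representations(text_vec: Sequence[float],
                           graph_vec: Sequence[float]) -> List[float]:
    """Join a token's text embedding and graph embedding into one vector."""
    return list(text_vec) + list(graph_vec)


def decode_bio(tokens: Sequence[str], labels: Sequence[str]) -> List[str]:
    """Collect key-phrases from per-token B/I/O labels (assumed scheme).

    "B" starts a phrase, "I" continues the open phrase, anything else
    closes it. An "I" with no open phrase is ignored.
    """
    phrases: List[str] = []
    current: List[str] = []
    for token, label in zip(tokens, labels):
        if label == "B":
            if current:
                phrases.append(" ".join(current))
            current = [token]
        elif label == "I" and current:
            current.append(token)
        else:
            if current:
                phrases.append(" ".join(current))
            current = []
    if current:
        phrases.append(" ".join(current))
    return phrases


# Toy 3-dimensional "text" vectors and 2-dimensional "graph" vectors.
# The values are made up; real vectors would be much larger.
tokens = ["graph", "embedding", "improves", "keyword", "extraction"]
text_vecs = [[0.1, 0.2, 0.3]] * len(tokens)
graph_vecs = [[0.9, 0.8]] * len(tokens)
features = [concat_representations(t, g) for t, g in zip(text_vecs, graph_vecs)]

# Labels that a trained classifier might assign (hypothetical output).
labels = ["B", "I", "O", "B", "I"]
phrases = decode_bio(tokens, labels)
print(len(features[0]), phrases)
```

Each row of `features` is what a classifier would receive as input for one token; the decoding step only turns its per-token predictions back into phrases.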

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| keyword-extraction-on-inspec | Phraseformer(BERT, DeepWalk) | F1 score: 68.44 |
| keyword-extraction-on-inspec | Phraseformer(BERT, Node2vec) | F1 score: 68.68 |
| keyword-extraction-on-inspec | Phraseformer(BERT, ExEm(w2v)) | F1 score: 69.70 |
| keyword-extraction-on-inspec | Phraseformer(BERT, ExEm(ft)) | F1 score: 69.87 |
| keyword-extraction-on-semeval-2010-task-8 | Phraseformer(BERT, ExEm(ft)) | F1 score: 48.65 |
| keyword-extraction-on-semeval-2010-task-8 | Phraseformer(BERT, ExEm(w2v)) | F1 score: 48.48 |
| keyword-extraction-on-semeval-2010-task-8 | Phraseformer(BERT, Node2vec) | F1 score: 47.46 |
| keyword-extraction-on-semeval-2010-task-8 | Phraseformer(BERT, DeepWalk) | F1 score: 47.22 |
| keyword-extraction-on-semeval2017 | Phraseformer(BERT, ExEm(ft)) | F1 score: 67.13 |
| keyword-extraction-on-semeval2017 | Phraseformer(BERT, ExEm(w2v)) | F1 score: 66.96 |
| keyword-extraction-on-semeval2017 | Phraseformer(BERT, Node2vec) | F1 score: 65.94 |
| keyword-extraction-on-semeval2017 | Phraseformer(BERT, DeepWalk) | F1 score: 65.70 |
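
For readers unfamiliar with the metric reported above, the following is a minimal sketch of how an F1 score for key-phrase extraction might be computed. It assumes exact matching of phrases after lowercasing and trimming, which is a common convention but is not confirmed by this page; the function name `f1_score` and the example phrases are hypothetical. The result is scaled by 100 to match the scale used in the benchmark listing.

```python
from typing import Iterable, Tuple


def f1_score(predicted: Iterable[str],
             gold: Iterable[str]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for exact-match key-phrases.

    Matching is case-insensitive and ignores surrounding whitespace
    (an assumed protocol, not taken from the paper).
    """
    pred_set = {p.lower().strip() for p in predicted}
    gold_set = {g.lower().strip() for g in gold}
    true_pos = len(pred_set & gold_set)
    precision = true_pos / len(pred_set) if pred_set else 0.0
    recall = true_pos / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical predictions and gold annotations for one document.
predicted = ["graph embedding", "keyword extraction", "transformer"]
gold = ["graph embedding", "keyword extraction", "sequence labeling", "BERT"]
p, r, f1 = f1_score(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f} F1={100 * f1:.2f}")
```

F1 is the harmonic mean of precision and recall, so a method scores well only if it both avoids spurious phrases and recovers most of the annotated ones.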
