HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

vONTSS: vMF based semi-supervised neural topic modeling with optimal transport

Weijie Xu Xiaoyu Jiang Srinivasan H. Sengamedu Francis Iannacci Jinjin Zhao

vONTSS: vMF based semi-supervised neural topic modeling with optimal transport

Abstract

Recently, Neural Topic Models (NTM), inspired by variational autoencoders, have attracted a lot of research interest; however, these methods have limited applications in the real world due to the challenge of incorporating human knowledge. This work presents a semi-supervised neural topic modeling method, vONTSS, which uses von Mises-Fisher (vMF) based variational autoencoders and optimal transport. When a few keywords per topic are provided, vONTSS in the semi-supervised setting generates potential topics and optimizes topic-keyword quality and topic classification. Experiments show that vONTSS outperforms existing semi-supervised topic modeling methods in classification accuracy and diversity. vONTSS also supports unsupervised topic modeling. Quantitative and qualitative experiments show that vONTSS in the unsupervised setting outperforms recent NTMs on multiple aspects: vONTSS discovers highly clustered and coherent topics on benchmark datasets. It is also much faster than the state-of-the-art weakly supervised text classification method while achieving similar classification performance. We further prove the equivalence of optimal transport loss and cross-entropy loss at the global minimum.

Benchmarks

BenchmarkMethodologyMetrics
topic-models-on-20newsgroupsvONTSS
C_v: 0.69
topic-models-on-ag-newsvONTSS
C_v: 0.49
NPMI: 0.054
topic-models-on-agnewsvONTSS
C_v: 0.49

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
vONTSS: vMF based semi-supervised neural topic modeling with optimal transport | Papers | HyperAI