3 months ago

BertGCN: Transductive Text Classification by Combining GCN and BERT

Yuxiao Lin Yuxian Meng Xiaofei Sun Qinghong Han Kun Kuang Jiwei Li Fei Wu

Abstract

In this work, we propose BertGCN, a model that combines large scale pretraining and transductive learning for text classification. BertGCN constructs a heterogeneous graph over the dataset and represents documents as nodes using BERT representations. By jointly training the BERT and GCN modules within BertGCN, the proposed model is able to leverage the advantages of both worlds: large-scale pretraining which takes the advantage of the massive amount of raw data and transductive learning which jointly learns representations for both training data and unlabeled test data by propagating label influence through graph convolution. Experiments show that BertGCN achieves SOTA performances on a wide range of text classification datasets. Code is available at https://github.com/ZeroRin/BertGCN.

Code Repositories

ZeroRin/BertGCN

Official

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
text-classification-on-20-newsgroups	RoBERTaGCN	Accuracy: 89.5
text-classification-on-20news	RoBERTaGCN	Accuracy: 89.5
text-classification-on-mr	RoBERTaGCN	Accuracy: 89.7
text-classification-on-ohsumed	RoBERTaGCN	Accuracy: 72.8
text-classification-on-r52	1-6 BertGCN	Accuracy: 96.6
text-classification-on-r8	RoBERTaGCN	Accuracy: 98.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette