Command Palette
Search for a command to run...
Ruiying Geng; Binhua Li; Yongbin Li; Xiaodan Zhu; Ping Jian; Jian Sun

Abstract
Text classification tends to struggle when data is deficient or when it needs to adapt to unseen classes. In such challenging scenarios, recent studies have used meta-learning to simulate the few-shot task, in which new queries are compared to a small support set at the sample-wise level. However, this sample-wise comparison may be severely disturbed by the various expressions in the same class. Therefore, we should be able to learn a general representation of each class in the support set and then compare it to new queries. In this paper, we propose a novel Induction Network to learn such a generalized class-wise representation, by innovatively leveraging the dynamic routing algorithm in meta-learning. In this way, we find the model is able to induce and generalize better. We evaluate the proposed model on a well-studied sentiment classification dataset (English) and a real-world dialogue intent classification dataset (Chinese). Experiment results show that on both datasets, the proposed model significantly outperforms the existing state-of-the-art approaches, proving the effectiveness of class-wise generalization in few-shot text classification.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| few-shot-text-classification-on-odic-10-way | Induction Networks | Accuracy: 81.64 |
| few-shot-text-classification-on-odic-10-way-5 | Induction Networks | Accuracy: 78.27 |
| few-shot-text-classification-on-odic-5-way-10 | Induction Networks | Accuracy: 88.49 |
| few-shot-text-classification-on-odic-5-way-5 | Induction Networks | Accuracy: 87.16 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.