Command Palette
Search for a command to run...
Bowen Zhao Xi Xiao Guojun Gan Bin Zhang Shutao Xia

Abstract
Deep neural networks (DNNs) have been applied in class incremental learning, which aims to solve common real-world problems of learning new classes continually. One drawback of standard DNNs is that they are prone to catastrophic forgetting. Knowledge distillation (KD) is a commonly used technique to alleviate this problem. In this paper, we demonstrate it can indeed help the model to output more discriminative results within old classes. However, it cannot alleviate the problem that the model tends to classify objects into new classes, causing the positive effect of KD to be hidden and limited. We observed that an important factor causing catastrophic forgetting is that the weights in the last fully connected (FC) layer are highly biased in class incremental learning. In this paper, we propose a simple and effective solution motivated by the aforementioned observations to address catastrophic forgetting. Firstly, we utilize KD to maintain the discrimination within old classes. Then, to further maintain the fairness between old classes and new classes, we propose Weight Aligning (WA) that corrects the biased weights in the FC layer after normal training process. Unlike previous work, WA does not require any extra parameters or a validation set in advance, as it utilizes the information provided by the biased weights themselves. The proposed method is evaluated on ImageNet-1000, ImageNet-100, and CIFAR-100 under various settings. Experimental results show that the proposed method can effectively alleviate catastrophic forgetting and significantly outperform state-of-the-art methods.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| incremental-learning-on-cifar-100-b0-5steps | WA | Average Incremental Accuracy: 72.81 |
| incremental-learning-on-imagenet-10-steps | WA | # M Params: 11.68 Average Incremental Accuracy: 65.67 Average Incremental Accuracy Top-5: 86.60 Final Accuracy: 55.60 Final Accuracy Top-5: 81.10 |
| incremental-learning-on-imagenet100-10-steps | WA | # M Params: 11.22 Average Incremental Accuracy Top-5: 91.00 Final Accuracy Top-5: 84.10 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.