Command Palette
Search for a command to run...
Xudong Wang Zhirong Wu Long Lian Stella X. Yu

Abstract
Pseudo-labels are confident predictions made on unlabeled target data by a classifier trained on labeled source data. They are widely used for adapting a model to unlabeled data, e.g., in a semi-supervised learning setting. Our key insight is that pseudo-labels are naturally imbalanced due to intrinsic data similarity, even when a model is trained on balanced source data and evaluated on balanced target data. If we address this previously unknown imbalanced classification problem arising from pseudo-labels instead of ground-truth training labels, we could remove model biases towards false majorities created by pseudo-labels. We propose a novel and effective debiased learning method with pseudo-labels, based on counterfactual reasoning and adaptive margins: The former removes the classifier response bias, whereas the latter adjusts the margin of each class according to the imbalance of pseudo-labels. Validated by extensive experimentation, our simple debiased learning delivers significant accuracy gains over the state-of-the-art on ImageNet-1K: 26% for semi-supervised learning with 0.2% annotations and 9% for zero-shot learning. Our code is available at: https://github.com/frank-xwang/debiased-pseudo-labeling.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| few-shot-image-classification-on-imagenet-0 | DebiasPL (ResNet50) | Accuracy: 68.3% |
| semi-supervised-image-classification-on-1 | DebiasPL (ResNet-50) | Top 1 Accuracy: 71.3% |
| semi-supervised-image-classification-on-16 | DebiasPL (ResNet-50) | ImageNet Top-1 Accuracy: 69.6% |
| semi-supervised-image-classification-on-cifar-6 | DebiasPL (w/ FixMatch) | Percentage error: 4.6 |
| semi-supervised-image-classification-on-cifar-7 | DebiasPL (w/ FixMatch) | Percentage error: 5.4 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.