3 months ago

Debiased Learning from Naturally Imbalanced Pseudo-Labels

Xudong Wang Zhirong Wu Long Lian Stella X. Yu

Abstract

Pseudo-labels are confident predictions made on unlabeled target data by a classifier trained on labeled source data. They are widely used for adapting a model to unlabeled data, e.g., in a semi-supervised learning setting. Our key insight is that pseudo-labels are naturally imbalanced due to intrinsic data similarity, even when a model is trained on balanced source data and evaluated on balanced target data. If we address this previously unknown imbalanced classification problem arising from pseudo-labels instead of ground-truth training labels, we could remove model biases towards false majorities created by pseudo-labels. We propose a novel and effective debiased learning method with pseudo-labels, based on counterfactual reasoning and adaptive margins: The former removes the classifier response bias, whereas the latter adjusts the margin of each class according to the imbalance of pseudo-labels. Validated by extensive experimentation, our simple debiased learning delivers significant accuracy gains over the state-of-the-art on ImageNet-1K: 26% for semi-supervised learning with 0.2% annotations and 9% for zero-shot learning. Our code is available at: https://github.com/frank-xwang/debiased-pseudo-labeling.

Code Repositories

frank-xwang/debiased-pseudo-labeling

Official

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
few-shot-image-classification-on-imagenet-0	DebiasPL (ResNet50)	Accuracy: 68.3%
semi-supervised-image-classification-on-1	DebiasPL (ResNet-50)	Top 1 Accuracy: 71.3%
semi-supervised-image-classification-on-16	DebiasPL (ResNet-50)	ImageNet Top-1 Accuracy: 69.6%
semi-supervised-image-classification-on-cifar-6	DebiasPL (w/ FixMatch)	Percentage error: 4.6
semi-supervised-image-classification-on-cifar-7	DebiasPL (w/ FixMatch)	Percentage error: 5.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette