Command Palette
Search for a command to run...
Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization
Kaidi Cao Yining Chen Junwei Lu Nikos Arechiga Adrien Gaidon Tengyu Ma

Abstract
Real-world large-scale datasets are heteroskedastic and imbalanced -- labels have varying levels of uncertainty and label distributions are long-tailed. Heteroskedasticity and imbalance challenge deep learning algorithms due to the difficulty of distinguishing among mislabeled, ambiguous, and rare examples. Addressing heteroskedasticity and imbalance simultaneously is under-explored. We propose a data-dependent regularization technique for heteroskedastic datasets that regularizes different regions of the input space differently. Inspired by the theoretical derivation of the optimal regularization strength in a one-dimensional nonparametric classification setting, our approach adaptively regularizes the data points in higher-uncertainty, lower-density regions more heavily. We test our method on several benchmark tasks, including a real-world heteroskedastic and imbalanced dataset, WebVision. Our experiments corroborate our theory and demonstrate a significant improvement over other methods in noise-robust deep learning.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-webvision-1000 | HAR (InceptionResNet-v2) | ImageNet Top-1 Accuracy: 67.1% ImageNet Top-5 Accuracy: 86.7% Top-1 Accuracy: 75.0% Top-5 Accuracy: 90.6% |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.