Command Palette
Search for a command to run...
Junnan Li Richard Socher Steven C.H. Hoi

Abstract
Deep neural networks are known to be annotation-hungry. Numerous efforts have been devoted to reducing the annotation cost when learning with deep networks. Two prominent directions include learning with noisy labels and semi-supervised learning by exploiting unlabeled data. In this work, we propose DivideMix, a novel framework for learning with noisy labels by leveraging semi-supervised learning techniques. In particular, DivideMix models the per-sample loss distribution with a mixture model to dynamically divide the training data into a labeled set with clean samples and an unlabeled set with noisy samples, and trains the model on both the labeled and unlabeled data in a semi-supervised manner. To avoid confirmation bias, we simultaneously train two diverged networks where each network uses the dataset division from the other network. During the semi-supervised training phase, we improve the MixMatch strategy by performing label co-refinement and label co-guessing on labeled and unlabeled samples, respectively. Experiments on multiple benchmark datasets demonstrate substantial improvements over state-of-the-art methods. Code is available at https://github.com/LiJunnan1992/DivideMix .
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-clothing1m | DivideMix | Accuracy: 74.76% |
| image-classification-on-mini-webvision-1-0 | DivideMix (ResNet-50) | ImageNet Top-1 Accuracy: 74.42 ±0.29 ImageNet Top-5 Accuracy: 91.21 ±0.12 Top-1 Accuracy: 76.32 ±0.36 Top-5 Accuracy: 90.65 ±0.16 |
| image-classification-on-mini-webvision-1-0 | DivideMix (ResNet-18) | Top-1 Accuracy: 76.08 |
| image-classification-on-mini-webvision-1-0 | DivideMix (Inception-ResNet-v2) | ImageNet Top-1 Accuracy: 75.20 ImageNet Top-5 Accuracy: 91.64 Top-1 Accuracy: 77.32 Top-5 Accuracy: 91.64 |
| learning-with-noisy-labels-on-cifar-100n | Divide-Mix | Accuracy (mean): 71.13 |
| learning-with-noisy-labels-on-cifar-10n | Divide-Mix | Accuracy (mean): 95.01 |
| learning-with-noisy-labels-on-cifar-10n-1 | Divide-Mix | Accuracy (mean): 90.18 |
| learning-with-noisy-labels-on-cifar-10n-2 | Divide-Mix | Accuracy (mean): 90.90 |
| learning-with-noisy-labels-on-cifar-10n-3 | Divide-Mix | Accuracy (mean): 89.97 |
| learning-with-noisy-labels-on-cifar-10n-worst | Divide-Mix | Accuracy (mean): 92.56 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.