Command Palette
Search for a command to run...
Expeditious Saliency-guided Mix-up through Random Gradient Thresholding
Luu Minh-Long ; Huang Zeyi ; Xing Eric P. ; Lee Yong Jae ; Wang Haohan

Abstract
Mix-up training approaches have proven to be effective in improving thegeneralization ability of Deep Neural Networks. Over the years, the researchcommunity expands mix-up methods into two directions, with extensive efforts toimprove saliency-guided procedures but minimal focus on the arbitrary path,leaving the randomization domain unexplored. In this paper, inspired by thesuperior qualities of each direction over one another, we introduce a novelmethod that lies at the junction of the two routes. By combining the bestelements of randomness and saliency utilization, our method balances speed,simplicity, and accuracy. We name our method R-Mix following the concept of"Random Mix-up". We demonstrate its effectiveness in generalization, weaklysupervised object localization, calibration, and robustness to adversarialattacks. Finally, in order to address the question of whether there exists abetter decision protocol, we train a Reinforcement Learning agent that decidesthe mix-up policies based on the classifier's performance, reducing dependencyon human-designed objectives and hyperparameter tuning. Extensive experimentsfurther show that the agent is capable of performing at the cutting-edge level,laying the foundation for a fully automatic mix-up. Our code is released at[https://github.com/minhlong94/Random-Mixup].
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| classifier-calibration-on-cifar-100 | R-Mix (PreActResNet-18) | Expected Calibration Error: 3.73 |
| image-classification-on-cifar-100 | WideResNet 28-10 + CutMix (OneCycleLR scheduler) | Percentage correct: 83.97 |
| image-classification-on-cifar-100 | RL-Mix (PreActResNet-18) | Percentage correct: 80.75 |
| image-classification-on-cifar-100 | R-Mix (WideResNet 28-10) | Percentage correct: 85 |
| image-classification-on-cifar-100 | RL-Mix (WideResNet 16-8) | Percentage correct: 82.16 |
| image-classification-on-cifar-100 | RL-Mix (ResNeXt 29-4-24) | Percentage correct: 82.43 |
| image-classification-on-cifar-100 | WideResNet 16-8 + CutMix (OneCycleLR scheduler) | Percentage correct: 81.79 |
| image-classification-on-cifar-100 | ResNeXt 29-4-24 + CutMix (OneCycleLR scheduler) | Percentage correct: 82.3 |
| image-classification-on-cifar-100 | RL-Mix (WideResNet 28-10) | Percentage correct: 84.9 |
| image-classification-on-cifar-100 | R-Mix (ResNeXt 29-4-24) | Percentage correct: 83.02 |
| image-classification-on-cifar-100 | R-Mix (WideResNet 16-8) | Percentage correct: 82.32 |
| image-classification-on-cifar-100 | R-Mix (PreActResNet-18) | Percentage correct: 81.49 |
| image-classification-on-cifar-100 | PreActResNet-18 + CutMix (OneCycleLR scheduler) | Percentage correct: 80.6 |
| image-classification-on-imagenet | R-Mix (ResNet-50) | Top 1 Accuracy: 77.39% |
| weakly-supervised-object-localization-on-2 | R-Mix (ResNet-50) | Top-1 Localization Accuracy: 55.58 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.