5 months ago

Expeditious Saliency-guided Mix-up through Random Gradient Thresholding

Luu Minh-Long ; Huang Zeyi ; Xing Eric P. ; Lee Yong Jae ; Wang Haohan

Abstract

Mix-up training approaches have proven to be effective in improving thegeneralization ability of Deep Neural Networks. Over the years, the researchcommunity expands mix-up methods into two directions, with extensive efforts toimprove saliency-guided procedures but minimal focus on the arbitrary path,leaving the randomization domain unexplored. In this paper, inspired by thesuperior qualities of each direction over one another, we introduce a novelmethod that lies at the junction of the two routes. By combining the bestelements of randomness and saliency utilization, our method balances speed,simplicity, and accuracy. We name our method R-Mix following the concept of"Random Mix-up". We demonstrate its effectiveness in generalization, weaklysupervised object localization, calibration, and robustness to adversarialattacks. Finally, in order to address the question of whether there exists abetter decision protocol, we train a Reinforcement Learning agent that decidesthe mix-up policies based on the classifier's performance, reducing dependencyon human-designed objectives and hyperparameter tuning. Extensive experimentsfurther show that the agent is capable of performing at the cutting-edge level,laying the foundation for a fully automatic mix-up. Our code is released at[https://github.com/minhlong94/Random-Mixup].

Code Repositories

minhlong94/random-mixup

Official

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
classifier-calibration-on-cifar-100	R-Mix (PreActResNet-18)	Expected Calibration Error: 3.73
image-classification-on-cifar-100	WideResNet 28-10 + CutMix (OneCycleLR scheduler)	Percentage correct: 83.97
image-classification-on-cifar-100	RL-Mix (PreActResNet-18)	Percentage correct: 80.75
image-classification-on-cifar-100	R-Mix (WideResNet 28-10)	Percentage correct: 85
image-classification-on-cifar-100	RL-Mix (WideResNet 16-8)	Percentage correct: 82.16
image-classification-on-cifar-100	RL-Mix (ResNeXt 29-4-24)	Percentage correct: 82.43
image-classification-on-cifar-100	WideResNet 16-8 + CutMix (OneCycleLR scheduler)	Percentage correct: 81.79
image-classification-on-cifar-100	ResNeXt 29-4-24 + CutMix (OneCycleLR scheduler)	Percentage correct: 82.3
image-classification-on-cifar-100	RL-Mix (WideResNet 28-10)	Percentage correct: 84.9
image-classification-on-cifar-100	R-Mix (ResNeXt 29-4-24)	Percentage correct: 83.02
image-classification-on-cifar-100	R-Mix (WideResNet 16-8)	Percentage correct: 82.32
image-classification-on-cifar-100	R-Mix (PreActResNet-18)	Percentage correct: 81.49
image-classification-on-cifar-100	PreActResNet-18 + CutMix (OneCycleLR scheduler)	Percentage correct: 80.6
image-classification-on-imagenet	R-Mix (ResNet-50)	Top 1 Accuracy: 77.39%
weakly-supervised-object-localization-on-2	R-Mix (ResNet-50)	Top-1 Localization Accuracy: 55.58

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette