HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition

Shuang Li Kaixiong Gong Chi Harold Liu Yulin Wang Feng Qiao Xinjing Cheng

MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition

Abstract

Real-world training data usually exhibits long-tailed distribution, where several majority classes have a significantly larger number of samples than the remaining minority classes. This imbalance degrades the performance of typical supervised learning algorithms designed for balanced training sets. In this paper, we address this issue by augmenting minority classes with a recently proposed implicit semantic data augmentation (ISDA) algorithm, which produces diversified augmented samples by translating deep features along many semantically meaningful directions. Importantly, given that ISDA estimates the class-conditional statistics to obtain semantic directions, we find it ineffective to do this on minority classes due to the insufficient training data. To this end, we propose a novel approach to learn transformed semantic directions with meta-learning automatically. In specific, the augmentation strategy during training is dynamically optimized, aiming to minimize the loss on a small balanced validation set, which is approximated via a meta update step. Extensive empirical results on CIFAR-LT-10/100, ImageNet-LT, and iNaturalist 2017/2018 validate the effectiveness of our method.

Code Repositories

BIT-DA/MetaSAug
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-classification-on-inaturalistMetaSAug
Top 1 Accuracy: 63.28%
image-classification-on-inaturalist-2018MetaSAug
Top-1 Accuracy: 68.75%
long-tail-learning-on-cifar-10-lt-r-10MetaSAug-LDAM
Error Rate: 10.32
long-tail-learning-on-cifar-10-lt-r-100MetaSAug-LDAM
Error Rate: 19.34
long-tail-learning-on-cifar-10-lt-r-200MetaSAug-LDAM
Error Rate: 22.65
long-tail-learning-on-cifar-10-lt-r-50MetaSAug-LDAM
Error Rate: 15.66
long-tail-learning-on-cifar-100-lt-r-10MetaSAug-LDAM
Error Rate: 38.72
long-tail-learning-on-cifar-100-lt-r-100MetaSAug-LDAM
Error Rate: 51.99
long-tail-learning-on-cifar-100-lt-r-200MetaSAug-LDAM
Error Rate: 56.91
long-tail-learning-on-cifar-100-lt-r-50MetaSAug-LDAM
Error Rate: 47.73
long-tail-learning-on-imagenet-ltMetaSAug with CE loss
Top-1 Accuracy: 47.39
long-tail-learning-on-imagenet-ltMetaSAug (ResNet-152)
Top-1 Accuracy: 50.03
long-tail-learning-on-inaturalist-2018MetaSAug
Top-1 Accuracy: 68.75%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition | Papers | HyperAI