HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark

A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark

Abstract

Representation learning promises to unlock deep learning for the long tail of vision tasks without expensive labelled datasets. Yet, the absence of a unified evaluation for general visual representations hinders progress. Popular protocols are often too constrained (linear classification), limited in diversity (ImageNet, CIFAR, Pascal-VOC), or only weakly related to representation quality (ELBO, reconstruction error). We present the Visual Task Adaptation Benchmark (VTAB), which defines good representations as those that adapt to diverse, unseen tasks with few examples. With VTAB, we conduct a large-scale study of many popular publicly-available representation learning algorithms. We carefully control confounders such as architecture and tuning budget. We address questions like: How effective are ImageNet representations beyond standard natural datasets? How do representations trained via generative and discriminative models compare? To what extent can self-supervision replace labels? And, how close are we to general visual representations?

Code Repositories

google-research/task_adaptation
Official
tf
Mentioned in GitHub
facebookresearch/vissl
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-classification-on-vtab-1k-1SelfSup-RelativePatchLoc-ResNet50
Top-1 Accuracy: 50.8
image-classification-on-vtab-1k-1BigBiGAN-ResNet50
Top-1 Accuracy: 59.1
image-classification-on-vtab-1k-1ResNet50-LargeHyperSweep
Top-1 Accuracy: 59.2
image-classification-on-vtab-1k-1SelfSup-Rotation-ResNet50
Top-1 Accuracy: 59.5
image-classification-on-vtab-1k-1Conditional-BigGAN
Top-1 Accuracy: 35.3
image-classification-on-vtab-1k-1SelfSup-Jigsaw-ResNet50
Top-1 Accuracy: 51.1
image-classification-on-vtab-1k-1ImageNet-ResNet50-LargeHyperSweep
Top-1 Accuracy: 71.2
image-classification-on-vtab-1k-1ResNet50
Top-1 Accuracy: 42.1
image-classification-on-vtab-1k-1S4L-10%-Exemplar-ResNet50
Top-1 Accuracy: 63.9
image-classification-on-vtab-1k-1SelfSup-Exemplar-ResNet50
Top-1 Accuracy: 57.5
image-classification-on-vtab-1k-1VAE
Top-1 Accuracy: 37.5
image-classification-on-vtab-1k-1ImageNet-10%-ResNet50
Top-1 Accuracy: 61.6
image-classification-on-vtab-1k-1S4L-Rotation-ResNet50-LargeHyperSweep
Top-1 Accuracy: 71.5
image-classification-on-vtab-1k-1WAE-UKL
Top-1 Accuracy: 31.0
image-classification-on-vtab-1k-1WAE-GAN
Top-1 Accuracy: 32.0
image-classification-on-vtab-1k-1ImageNet-ResNet50
Top-1 Accuracy: 65.6
image-classification-on-vtab-1k-1S4L-Exemplar-ResNet50
Top-1 Accuracy: 67.0
image-classification-on-vtab-1k-1WAE-MMD
Top-1 Accuracy: 37.3
image-classification-on-vtab-1k-1S4L-Exemplar-ResNet50-LargeHyperSweep
Top-1 Accuracy: 72.7
image-classification-on-vtab-1k-1Unconditional-BigGAN-ResNet50
Top-1 Accuracy: 44.0
image-classification-on-vtab-1k-1S4L-10%-Rotation-ResNet50
Top-1 Accuracy: 64.8
image-classification-on-vtab-1k-1S4L-Rotation-ResNet50
Top-1 Accuracy: 67.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark | Papers | HyperAI