Command Palette
Search for a command to run...
Olivier J. Hénaff; Aravind Srinivas; Jeffrey De Fauw; Ali Razavi; Carl Doersch; S. M. Ali Eslami; Aaron van den Oord

Abstract
Human observers can learn to recognize new categories of images from a handful of examples, yet doing so with artificial ones remains an open challenge. We hypothesize that data-efficient recognition is enabled by representations which make the variability in natural signals more predictable. We therefore revisit and improve Contrastive Predictive Coding, an unsupervised objective for learning such representations. This new implementation produces features which support state-of-the-art linear classification accuracy on the ImageNet dataset. When used as input for non-linear classification with deep neural networks, this representation allows us to use 2-5x less labels than classifiers trained directly on image pixels. Finally, this unsupervised representation substantially improves transfer learning to object detection on the PASCAL VOC dataset, surpassing fully supervised pre-trained ImageNet classifiers.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| contrastive-learning-on-imagenet-1k | ResNet v2 101 | ImageNet Top-1 Accuracy: 48.7 |
| contrastive-learning-on-imagenet-1k | ResNet50 (v2) | ImageNet Top-1 Accuracy: 63.8 |
| self-supervised-image-classification-on | CPC v2 (ResNet-50) (arxiv v2) | Number of Params: 24M Top 1 Accuracy: 63.8% Top 5 Accuracy: 85.3% |
| self-supervised-image-classification-on | CPC v2 (ResNet-161) (arxiv v2) | Number of Params: 305M Top 1 Accuracy: 71.5% Top 5 Accuracy: 90.1% |
| self-supervised-image-classification-on | CPC v2 (ResNet-161) (arxiv v1) | Number of Params: 305M Top 1 Accuracy: 61.0% Top 5 Accuracy: 83.0% |
| semi-supervised-image-classification-on-2 | CPC v2 (ResNet-161) | Top 1 Accuracy: 73.1% Top 5 Accuracy: 91.2% |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.