Command Palette
Search for a command to run...
Maxim Berman; Hervé Jégou; Andrea Vedaldi; Iasonas Kokkinos; Matthijs Douze

Abstract
MultiGrain is a network architecture producing compact vector representations that are suited both for image classification and particular object retrieval. It builds on a standard classification trunk. The top of the network produces an embedding containing coarse and fine-grained information, so that images can be recognized based on the object class, particular object, or if they are distorted copies. Our joint training is simple: we minimize a cross-entropy loss for classification and a ranking loss that determines if two images are identical up to data augmentation, with no need for additional labels. A key component of MultiGrain is a pooling layer that takes advantage of high-resolution images with a network trained at a lower resolution. When fed to a linear classifier, the learned embeddings provide state-of-the-art classification accuracy. For instance, we obtain 79.4% top-1 accuracy with a ResNet-50 learned on Imagenet, which is a +1.8% absolute improvement over the AutoAugment method. When compared with the cosine similarity, the same embeddings perform on par with the state-of-the-art for image retrieval at moderate resolutions.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-imagenet | MultiGrain PNASNet (450px) | Top 1 Accuracy: 83.2% |
| image-classification-on-imagenet | MultiGrain PNASNet (500px) | Top 1 Accuracy: 83.6% |
| image-classification-on-imagenet | MultiGrain SENet154 (400px) | Top 1 Accuracy: 83.0% |
| image-classification-on-imagenet | MultiGrain R50-AA-500 | Top 1 Accuracy: 79.4% |
| image-classification-on-imagenet | MultiGrain SENet154 (450px) | Top 1 Accuracy: 83.1% |
| image-classification-on-imagenet | MultiGrain PNASNet (400px) | Top 1 Accuracy: 82.6% |
| image-classification-on-imagenet | MultiGrain NASNet-A-Mobile (350px) | Top 1 Accuracy: 75.1% |
| image-classification-on-imagenet | MultiGrain R50-AA-224 | Top 1 Accuracy: 78.2% |
| image-classification-on-imagenet | MultiGrain PNASNet (300px) | Top 1 Accuracy: 81.3% |
| image-classification-on-imagenet | MultiGrain SENet154 (500px) | Top 1 Accuracy: 82.7% |
| image-retrieval-on-inria-holidays | MultiGrain R50 @ 800 | Mean mAP: 92.5% |
| image-retrieval-on-inria-holidays | MultiGrain R50 @ 500 | Mean mAP: 91.8% |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.