On the Unreasonable Effectiveness of Centroids in Image Retrieval

Mikolaj Wieczorek Barbara Rychalska Jacek Dabrowski

Abstract

The image retrieval task consists of finding images similar to a query image in a set of gallery (database) images. Such systems are used in various applications, e.g. person re-identification (ReID) or visual product search. Despite active development of retrieval models, it remains a challenging task, mainly due to large intra-class variance caused by changes in view angle, lighting, background clutter or occlusion, while inter-class variance may be relatively low. A large portion of current research focuses on creating more robust features and modifying objective functions, usually based on Triplet Loss. Some works experiment with using a centroid/proxy representation of a class to alleviate problems with computing speed and hard sample mining used with Triplet Loss. However, these approaches are used for training alone and discarded during the retrieval stage. In this paper we propose to use the mean centroid representation during both training and retrieval. Such an aggregated representation is more robust to outliers and ensures more stable features. As each class is represented by a single embedding, the class centroid, both retrieval time and storage requirements are reduced significantly. Aggregating multiple embeddings significantly reduces the search space by lowering the number of candidate target vectors, which makes the method especially suitable for production deployments. Comprehensive experiments conducted on two ReID and Fashion Retrieval datasets demonstrate the effectiveness of our method, which outperforms the current state-of-the-art. We propose centroid training and retrieval as a viable method for both Fashion Retrieval and ReID applications.
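The retrieval side of the idea in the abstract can be illustrated with a minimal sketch: average the gallery embeddings of each class into one centroid, then compare a query against the centroids only. This is not the authors' implementation. The function names, the toy 2-D embeddings, the class labels, and the choice of cosine similarity on L2-normalized vectors are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def class_centroids(embeddings, labels):
    """Average the embeddings of each class into one normalized centroid."""
    grouped = defaultdict(list)
    for vec, label in zip(embeddings, labels):
        grouped[label].append(vec)
    centroids = {}
    for label, vecs in grouped.items():
        dim = len(vecs[0])
        mean = [sum(v[i] for v in vecs) / len(vecs) for i in range(dim)]
        centroids[label] = l2_normalize(mean)
    return centroids


def retrieve(query, centroids, top_k=1):
    """Rank classes by cosine similarity between the query and each centroid."""
    q = l2_normalize(query)
    scored = [
        (sum(a * b for a, b in zip(q, c)), label)
        for label, c in centroids.items()
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [label for _, label in scored[:top_k]]


# Toy gallery (hypothetical data): five 2-D embeddings for two classes.
gallery = [[1.0, 0.1], [0.9, 0.0], [0.8, 0.3], [0.0, 1.0], [0.2, 0.9]]
gallery_labels = ["shirt", "shirt", "shirt", "shoe", "shoe"]

centroids = class_centroids(gallery, gallery_labels)
ranking = retrieve([0.95, 0.2], centroids, top_k=2)
print(ranking)  # ['shirt', 'shoe']
```

In this toy case the query is scored against two centroids instead of five gallery vectors, which is the search-space reduction the abstract describes: the number of candidates scales with the number of classes, not the number of gallery images.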

Code Repositories

lannguyen0910/deep-efficient-reid (PyTorch, mentioned in GitHub)

Benchmarks

Benchmark                                   Methodology                          Rank-1  Rank-5  Rank-10  Rank-20  Rank-50  mAP
image-retrieval-on-deepfashion-consumer-to  CTL Model (ResNet50, 256x128)        29.4    -       61.3     68.9     77.4     40.4
image-retrieval-on-deepfashion-consumer-to  CTL Model (ResNet50-IBN-A, 320x320)  37.3    -       71.2     77.7     85.0     49.2
image-retrieval-on-exact-street2shop        CTL Model (ResNet50, 256x128)        43.2    -       61.9     66.0     72.1     49.8
image-retrieval-on-exact-street2shop        CTL Model (ResNet50-IBN-A, 320x320)  53.7    -       70.9     75.0     79.2     59.8
person-re-identification-on-dukemtmc-reid   CTL Model (ResNet50, 256x128)        95.6    96.2    97.9     -        -        96.1

A dash marks a metric that is not listed for that entry.
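For readers unfamiliar with the Rank-k and mAP figures listed above, the following sketch shows one common way such metrics are computed from ranked retrieval results. It is an illustration under stated assumptions, not the evaluation protocol of these benchmarks: the function names and toy rankings are hypothetical, average precision is normalized by the number of relevant items, and the results are fractions in [0, 1] where the listing appears to use percentages.

```python
def rank_k_accuracy(rankings, ground_truth, k):
    """Fraction of queries with at least one relevant item in the top k."""
    hits = sum(
        1
        for ranked, relevant in zip(rankings, ground_truth)
        if any(item in relevant for item in ranked[:k])
    )
    return hits / len(rankings)


def average_precision(ranked, relevant):
    """Mean of the precision values taken at each relevant item's position."""
    hits = 0
    precision_sum = 0.0
    for position, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            precision_sum += hits / position
    return precision_sum / len(relevant) if relevant else 0.0


def mean_average_precision(rankings, ground_truth):
    """Average the per-query average precision over all queries."""
    total = sum(average_precision(r, g) for r, g in zip(rankings, ground_truth))
    return total / len(rankings)


# Toy example (hypothetical): two queries, each with one relevant item "a".
rankings = [["a", "b", "c"], ["b", "a", "c"]]
ground_truth = [{"a"}, {"a"}]

rank1 = rank_k_accuracy(rankings, ground_truth, k=1)
rank2 = rank_k_accuracy(rankings, ground_truth, k=2)
m_ap = mean_average_precision(rankings, ground_truth)
print(rank1, rank2, m_ap)  # 0.5 1.0 0.75
```

Rank-k only asks whether any correct match appears in the top k results, while mAP also rewards placing every correct match early, which is why the two can diverge for the same model.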
