HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Generalized Contrastive Optimization of Siamese Networks for Place Recognition

María Leyva-Vallina Nicola Strisciuglio Nicolai Petkov

Generalized Contrastive Optimization of Siamese Networks for Place Recognition

Abstract

Visual place recognition is a challenging task in computer vision and a key component of camera-based localization and navigation systems. Recently, Convolutional Neural Networks (CNNs) achieved high results and good generalization capabilities. They are usually trained using pairs or triplets of images labeled as either similar or dissimilar, in a binary fashion. In practice, the similarity between two images is not binary, but continuous. Furthermore, training these CNNs is computationally complex and involves costly pair and triplet mining strategies. We propose a Generalized Contrastive loss (GCL) function that relies on image similarity as a continuous measure, and use it to train a siamese CNN. Furthermore, we present three techniques for automatic annotation of image pairs with labels indicating their degree of similarity, and deploy them to re-annotate the MSLS, TB-Places, and 7Scenes datasets. We demonstrate that siamese CNNs trained using the GCL function and the improved annotations consistently outperform their binary counterparts. Our models trained on MSLS outperform the state-of-the-art methods, including NetVLAD, NetVLAD-SARE, AP-GeM and Patch-NetVLAD, and generalize well on the Pittsburgh30k, Tokyo 24/7, RobotCar Seasons v2 and Extended CMU Seasons datasets. Furthermore, training a siamese network using the GCL function does not require complex pair mining. We release the source code at https://github.com/marialeyvallina/generalized_contrastive_loss.

Code Repositories

marialeyvallina/generalized_contrastive_loss
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
visual-place-recognition-on-mapillary-testRexNeXt-GCL
Recall@1: 56.0
Recall@10: 75.1
Recall@5: 70.8
visual-place-recognition-on-mapillary-testResNeXt-GCL-PCA
Recall@1: 62.3
Recall@10: 81.1
Recall@5: 76.2
visual-place-recognition-on-mapillary-valResNeXt GCL + PCA
Recall@1: 80.9
Recall@10: 92.6
Recall@5: 90.7
visual-place-recognition-on-mapillary-valResNeXt GCL
Recall@1: 75.5
Recall@10: 88.5
Recall@5: 86.1
visual-place-recognition-on-pittsburgh-30kGCL [trained only on MSLS]
Recall@1: 81.94
Recall@5: 94.56
visual-place-recognition-on-tokyo247GCL [trained only on MSLS]
Recall@1: 69.84
Recall@10: 80.63
Recall@5: 84.76

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Generalized Contrastive Optimization of Siamese Networks for Place Recognition | Papers | HyperAI