HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

BoQ: A Place is Worth a Bag of Learnable Queries

Ali-Bey Amar ; Chaib-draa Brahim ; Giguère Philippe

BoQ: A Place is Worth a Bag of Learnable Queries

Abstract

In visual place recognition, accurately identifying and matching images oflocations under varying environmental conditions and viewpoints remains asignificant challenge. In this paper, we introduce a new technique, calledBag-of-Queries (BoQ), which learns a set of global queries designed to captureuniversal place-specific attributes. Unlike existing methods that employself-attention and generate the queries directly from the input features, BoQemploys distinct learnable global queries, which probe the input features viacross-attention, ensuring consistent information aggregation. In addition, ourtechnique provides an interpretable attention mechanism and integrates withboth CNN and Vision Transformer backbones. The performance of BoQ isdemonstrated through extensive experiments on 14 large-scale benchmarks. Itconsistently outperforms current state-of-the-art techniques including NetVLAD,MixVPR and EigenPlaces. Moreover, as a global retrieval technique (one-stage),BoQ surpasses two-stage retrieval methods, such as Patch-NetVLAD, TransVPR andR2Former, all while being orders of magnitude faster and more efficient. Thecode and model weights are publicly available athttps://github.com/amaralibey/Bag-of-Queries.

Code Repositories

amaralibey/bag-of-queries
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
visual-place-recognition-on-amstertimeBoQ (ResNet-50)
Recall@1: 52.2
visual-place-recognition-on-amstertimeBoQ
Recall@1: 63.0
Recall@10: 85.1
Recall@5: 81.6
visual-place-recognition-on-eynshamBoQ
Recall@1: 92.2
Recall@10: 96.4
Recall@5: 95.6
visual-place-recognition-on-eynshamBoQ (ResNet-50)
Recall@1: 91.3
visual-place-recognition-on-mapillary-testBoQ
Recall@1: 79
Recall@10: 92
Recall@5: 90.3
visual-place-recognition-on-mapillary-valBoQ
Recall@1: 93.8
Recall@10: 97
Recall@5: 96.8
visual-place-recognition-on-mapillary-valBoQ (ResNet-50)
Recall@1: 91.2
Recall@10: 96.1
Recall@5: 95.3
visual-place-recognition-on-nordlandBoQ
Recall@1: 90.6
Recall@10: 97.5
Recall@5: 96.0
visual-place-recognition-on-nordlandBoQ (ResNet-50)
Recall@1: 83.1
visual-place-recognition-on-pittsburgh-250kBoQ
Recall@1: 96.6
Recall@10: 99.5
Recall@5: 99.1
visual-place-recognition-on-pittsburgh-250kBoQ (ResNet-50)
Recall@1: 95
Recall@10: 99.1
Recall@5: 98.5
visual-place-recognition-on-pittsburgh-30kBoQ
Recall@1: 93.7
Recall@10: 97.9
Recall@5: 97.1
visual-place-recognition-on-pittsburgh-30kBoQ (ResNet-50)
Recall@1: 92.4
visual-place-recognition-on-san-franciscoBoQ
Recall@1: 93.6
Recall@10: 96.5
Recall@5: 95.8
visual-place-recognition-on-spedBoQ
Recall@1: 92.5
Recall@10: 96.7
Recall@5: 95.9
visual-place-recognition-on-spedBoQ (ResNet-50)
Recall@1: 86.5
Recall@10: 95.7
Recall@5: 93.4
visual-place-recognition-on-st-luciaBoQ
Recall@10: 100
Recall@5: 100
visual-place-recognition-on-st-luciaBoQ (DINOv2)
Recall@1: 100.0
Recall@5: 100
visual-place-recognition-on-svox-nightBoQ (ResNet-50)
Recall@1: 87.1
visual-place-recognition-on-svox-overcastBoQ (ResNet-50)
Recall@1: 97.8
visual-place-recognition-on-svox-rainBoQ (ResNet-50)
Recall@1: 96.2
visual-place-recognition-on-svox-snowBoQ (ResNet-50)
Recall@1: 98.7
visual-place-recognition-on-svox-sunBoQ (ResNet-50)
Recall@1: 95.9
visual-place-recognition-on-tokyo247BoQ
Recall@1: 98.1
Recall@10: 98.7
Recall@5: 98.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
BoQ: A Place is Worth a Bag of Learnable Queries | Papers | HyperAI