AnyLoc: Towards Universal Visual Place Recognition

Nikhil Keetha, Avneesh Mishra, Jay Karhade, Krishna Murthy Jatavallabhula, Sebastian Scherer, Madhava Krishna, Sourav Garg

Abstract

Visual Place Recognition (VPR) is vital for robot localization. To date, the most performant VPR approaches are environment- and task-specific: while they exhibit strong performance in structured environments (predominantly urban driving), their performance degrades severely in unstructured environments, leaving most approaches too brittle for robust real-world deployment. In this work, we develop a universal solution to VPR: a technique that works across a broad range of structured and unstructured environments (urban, outdoor, indoor, aerial, underwater, and subterranean) without any re-training or fine-tuning. We demonstrate that general-purpose feature representations derived from off-the-shelf self-supervised models with no VPR-specific training are the right substrate upon which to build such a universal VPR solution. Combining these derived features with unsupervised feature aggregation enables our suite of methods, AnyLoc, to achieve up to 4X higher performance than existing approaches. We further obtain a 6% improvement in performance by characterizing the semantic properties of these features, uncovering unique domains which encapsulate datasets from similar environments. Our detailed experiments and analysis lay a foundation for building VPR solutions that may be deployed anywhere, anytime, and across anyview. We encourage readers to explore our project page and interactive demos: https://anyloc.github.io/.
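The abstract does not spell out the aggregation step, but the method names in the benchmark listing (AnyLoc-VLAD-DINOv2) point to VLAD, so a minimal sketch of textbook VLAD may help make "unsupervised feature aggregation" concrete. Everything here is an assumption for illustration: the function names, the hard nearest-center assignment, and the intra-normalization step are standard VLAD choices, not details confirmed by this page, and the cluster centers are assumed to come from a separate unsupervised clustering (for example k-means) of local descriptors. Plain Python lists stand in for the per-patch features that a self-supervised model would produce.

```python
import math


def l2_normalize(vec):
    """Scale vec to unit L2 norm; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def nearest_center(desc, centers):
    """Index of the cluster center closest to desc (squared Euclidean distance)."""
    return min(
        range(len(centers)),
        key=lambda k: sum((d - c) ** 2 for d, c in zip(desc, centers[k])),
    )


def vlad(descriptors, centers):
    """Aggregate local descriptors into one global VLAD vector.

    descriptors: list of D-dimensional local features for one image (assumed input).
    centers:     list of K D-dimensional cluster centers (assumed precomputed).
    Returns a unit-norm vector of length K * D.
    """
    dim = len(centers[0])
    residuals = [[0.0] * dim for _ in centers]
    for desc in descriptors:
        k = nearest_center(desc, centers)
        # Accumulate the residual between the descriptor and its assigned center.
        for i in range(dim):
            residuals[k][i] += desc[i] - centers[k][i]
    # Intra-normalize each cluster's residual, concatenate, then normalize globally.
    flat = [x for r in residuals for x in l2_normalize(r)]
    return l2_normalize(flat)


def cosine_similarity(a, b):
    """Dot product of two unit-norm global descriptors."""
    return sum(x * y for x, y in zip(a, b))


# Toy example: two 2-D cluster centers and three local descriptors.
toy_centers = [[0.0, 0.0], [10.0, 10.0]]
toy_descriptors = [[1.0, 0.0], [0.0, 1.0], [10.0, 12.0]]
toy_vlad = vlad(toy_descriptors, toy_centers)
```

In a retrieval setting, one such global vector would be computed per database image and per query image, and candidates ranked by `cosine_similarity`. A practical implementation would use array libraries rather than Python lists; the list version is kept only so the sketch has no dependencies.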

Code Repositories

AnyLoc/AnyLoc (official, PyTorch)

Benchmarks

| Benchmark | Methodology | Recall@1 |
|---|---|---|
| visual-place-recognition-on-17-places | AnyLoc-VLAD-DINOv2 | 65.02 |
| visual-place-recognition-on-17-places | CLIP | 59.36 |
| visual-place-recognition-on-baidu-mall | CLIP | 56.02 |
| visual-place-recognition-on-baidu-mall | AnyLoc-VLAD-DINOv2 | 75.22 |
| visual-place-recognition-on-gardens-point | AnyLoc-VLAD-DINOv2 | 95.5 |
| visual-place-recognition-on-gardens-point | CLIP | 42.5 |
| visual-place-recognition-on-hawkins | AnyLoc-VLAD-DINOv2 | 65.25 |
| visual-place-recognition-on-hawkins | CLIP | 33.05 |
| visual-place-recognition-on-laurel-caverns | CLIP | 36.61 |
| visual-place-recognition-on-laurel-caverns | AnyLoc-VLAD-DINOv2 | 61.61 |
| visual-place-recognition-on-mid-atlantic | CLIP | 25.74 |
| visual-place-recognition-on-mid-atlantic | AnyLoc-VLAD-DINOv2 | 34.65 |
| visual-place-recognition-on-nardo-air | AnyLoc-VLAD-DINOv2 | 76.06 |
| visual-place-recognition-on-nardo-air | CLIP | 42.25 |
| visual-place-recognition-on-nardo-air-r | AnyLoc-VLAD-DINO | 94.37 |
| visual-place-recognition-on-nardo-air-r | AnyLoc-VLAD-DINOv2 | 85.92 |
| visual-place-recognition-on-nardo-air-r | CLIP | 61.97 |
| visual-place-recognition-on-oxford-robotcar-4 | AnyLoc-VLAD-DINOv2 | 98.95 |
| visual-place-recognition-on-oxford-robotcar-4 | CLIP | 34.55 |
| visual-place-recognition-on-pittsburgh-30k | AnyLoc-VLAD-DINOv2 | 87.66 |
| visual-place-recognition-on-pittsburgh-30k | CLIP | 54.97 |
| visual-place-recognition-on-st-lucia | AnyLoc-VLAD-DINOv2 | 96.17 |
| visual-place-recognition-on-st-lucia | CLIP | 62.7 |
| visual-place-recognition-on-vp-air | CLIP | 36.59 |
| visual-place-recognition-on-vp-air | AnyLoc-VLAD-DINOv2 | 66.74 |
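
This page does not define how the Recall@1 values above are computed. As a rough guide, the sketch below uses the usual retrieval definition: the percentage of queries for which at least one of the top-K retrieved database images is a ground-truth match. The function name, the input layout, and the toy data are hypothetical, and the rule for deciding which database images count as matches (for instance a distance or frame tolerance) is dataset-specific and assumed to be given.

```python
def recall_at_k(ranked_retrievals, positives, k=1):
    """Percentage of queries with a correct match among the top-k retrievals.

    ranked_retrievals: one list per query of database indices, best match first.
    positives:         one set per query of database indices that count as correct.
    """
    if len(ranked_retrievals) != len(positives):
        raise ValueError("need exactly one positives set per query")
    if not ranked_retrievals:
        raise ValueError("need at least one query")
    hits = sum(
        1
        for ranked, pos in zip(ranked_retrievals, positives)
        if any(idx in pos for idx in ranked[:k])
    )
    return 100.0 * hits / len(ranked_retrievals)


# Toy data: four queries, each with a ranked list of database indices.
toy_ranked = [[3, 1, 2], [0, 4, 5], [7, 8, 9], [2, 6, 1]]
toy_positives = [{3}, {4}, {1}, {2, 6}]
toy_r1 = recall_at_k(toy_ranked, toy_positives, k=1)
toy_r2 = recall_at_k(toy_ranked, toy_positives, k=2)
```

With K = 1 only the single best retrieval per query is checked, which is why Recall@1 is the strictest of the Recall@K family.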
