Command Palette
Search for a command to run...
Huang Gaoshuang ; Zhou Yang ; Zhao Luying ; Gan Wenjian

Abstract
Cross-view geo-localization (CVGL), which involves matching and retrievingsatellite images to determine the geographic location of a ground image, iscrucial in GNSS-constrained scenarios. However, this task faces significantchallenges due to substantial viewpoint discrepancies, the complexity oflocalization scenarios, and the need for global localization. To address theseissues, we propose a novel CVGL framework that integrates the visionfoundational model DINOv2 with an advanced feature mixer. Our frameworkintroduces the symmetric InfoNCE loss and incorporates near-neighbor samplingand dynamic similarity sampling strategies, significantly enhancinglocalization accuracy. Experimental results show that our framework surpassesexisting methods across multiple public and self-built datasets. To furtherimprove globalscale performance, we have developed CV-Cities, a novel datasetfor global CVGL. CV-Cities includes 223,736 ground-satellite image pairs withgeolocation data, spanning sixteen cities across six continents and covering awide range of complex scenarios, providing a challenging benchmark for CVGL.The framework trained with CV-Cities demonstrates high localization accuracy invarious test cities, highlighting its strong globalization and generalizationcapabilities. Our datasets and codes are available athttps://github.com/GaoShuang98/CVCities.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| drone-view-target-localization-on-university-1 | CV-Cities | AP: 95.01 Recall@1: 97.43 |
| image-based-localization-on-cvact | CV-Cities | Recall@1: 92.59 Recall@1 (%): 98.72 Recall@10: 97.82 Recall@5: 97.16 |
| image-based-localization-on-cvusa-1 | CV-Cities | Recall@1: 99.19 Recall@10: 99.85 Recall@5: 99.80 Recall@top1%: 99.92 |
| image-based-localization-on-vigor-cross-area | CV-Cities | Hit Rate: 75.97 Recall@1: 64.61 Recall@1%: 98.63 Recall@10: 91.20 Recall@5: 87.48 |
| image-based-localization-on-vigor-same-area | CV-Cities | Hit Rate: 90.76 Recall@1: 78.27 Recall@1%: 99.67 Recall@10: 97.52 Recall@5: 96.10 |
| visual-place-recognition-on-cv-cities | CV-Cities | Recall@1: 82.91 Recall@5: 90.14 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.