HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

OpenStreetView-5M: The Many Roads to Global Visual Geolocation

Guillaume Astruc; Nicolas Dufour; Ioannis Siglidis; Constantin Aronssohn; Nacim Bouia; Stephanie Fu; Romain Loiseau; Van Nguyen Nguyen; Charles Raude; Elliot Vincent; Lintao XU; Hongyu Zhou; Loic Landrieu

OpenStreetView-5M: The Many Roads to Global Visual Geolocation

Abstract

Determining the location of an image anywhere on Earth is a complex visual task, which makes it particularly relevant for evaluating computer vision algorithms. Yet, the absence of standard, large-scale, open-access datasets with reliably localizable images has limited its potential. To address this issue, we introduce OpenStreetView-5M, a large-scale, open-access dataset comprising over 5.1 million geo-referenced street view images, covering 225 countries and territories. In contrast to existing benchmarks, we enforce a strict train/test separation, allowing us to evaluate the relevance of learned geographical features beyond mere memorization. To demonstrate the utility of our dataset, we conduct an extensive benchmark of various state-of-the-art image encoders, spatial representations, and training strategies. All associated codes and models can be found at https://github.com/gastruc/osv5m.

Code Repositories

gastruc/osv5m
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
photo-geolocation-estimation-onOSV-5M
Geoscore: 3361
photo-geolocation-estimation-onPlonk
Average Distance: 1814
Geoscore: 3361

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
OpenStreetView-5M: The Many Roads to Global Visual Geolocation | Papers | HyperAI