HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Nicolas Dufour David Picard Vicky Kalogeiton Loic Landrieu

Around the World in 80 Timesteps: A Generative Approach to Global Visual
  Geolocation

Abstract

Global visual geolocation predicts where an image was captured on Earth.Since images vary in how precisely they can be localized, this task inherentlyinvolves a significant degree of ambiguity. However, existing approaches aredeterministic and overlook this aspect. In this paper, we aim to close the gapbetween traditional geolocalization and modern generative methods. We proposethe first generative geolocation approach based on diffusion and Riemannianflow matching, where the denoising process operates directly on the Earth'ssurface. Our model achieves state-of-the-art performance on three visualgeolocation benchmarks: OpenStreetView-5M, YFCC-100M, and iNat21. In addition,we introduce the task of probabilistic visual geolocation, where the modelpredicts a probability distribution over all possible locations instead of asingle point. We introduce new metrics and baselines for this task,demonstrating the advantages of our diffusion-based approach. Codes and modelswill be made available.

Code Repositories

nicolas-dufour/plonk
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
photo-geolocation-estimation-onPlonk
Geoscore: 3767

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation | Papers | HyperAI