HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification

{Kader Pustu-Iren Eric Muller-Budack Ralph Ewerth}

Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification

Abstract

While the successful estimation of a photo's geolocation enables a number of interesting applications, it is also a very challenging task. Due to the complexity of the problem, most existing approaches are restricted to specific areas, imagery, or worldwide landmarks. Only a few proposals predict GPS coordinates without any limitations. In this paper, we introduce several deep learning methods, which pursue the latter approach and treat geolocalization as a classification problem where the earth is subdivided into geographical cells. We propose to exploit hierarchical knowledge of multiple partitionings and additionally extract and take the photo's scene content into account, i.e., indoor, natural, or urban setting etc. As a result, contextual information at different spatial resolutions as well as more specific features for various environmental settings are incorporated in the learning process of the convolutional neural network. Experimental results on two benchmarks demonstrate the effectiveness of our approach outperforming the state of the art while using a significant lower number of training images and without relying on retrieval methods that require an appropriate reference dataset.

Benchmarks

BenchmarkMethodologyMetrics
photo-geolocation-estimation-on-gws15kISNs (M, f*, S3)
City level (25 km): 0.6
Continent level (2500 km): 38.5
Country level (750 km): 15.5
Region level (200 km): 4.2
Street level (1 km): 0.05
photo-geolocation-estimation-on-im2gpsbase (L, m)
City level (25 km): 35.0
Continent level (2500 km): 79.7
Country level (750 km): 64.1
Reference images: 0
Region level (200 km): 49.8
Street level (1 km): 13.5
Training images: 4.7M
photo-geolocation-estimation-on-im2gpsISNs (M, f*, S3)
City level (25 km): 43.0
Continent level (2500 km): 80.2
Country level (750 km): 66.7
Reference images: 0
Region level (200 km): 51.9
Street level (1 km): 16.9
Training images: 4.7M
photo-geolocation-estimation-on-im2gpsbase (M, f*)
City level (25 km): 40.9
Continent level (2500 km): 78.5
Country level (750 km): 65.4
Reference images: 0
Region level (200 km): 51.5
Street level (1 km): 15.2
Training images: 4.7M
photo-geolocation-estimation-on-im2gps3kISNs (M, f*, S3)
City level (25 km): 28.0
Continent level (2500 km): 66.0
Country level (750 km): 49.7
Region level (200 km): 36.6
Street level (1 km): 10.5
Training Images: 4.7M
photo-geolocation-estimation-on-im2gps3kbase (M, f*)
City level (25 km): 27.0
Continent level (2500 km): 66.0
Country level (750 km): 49.2
Region level (200 km): 35.6
Street level (1 km): 9.7
Training Images: 4.7M
photo-geolocation-estimation-on-im2gps3kbase (L, m)
City level (25 km): 24.9
Continent level (2500 km): 65.8
Country level (750 km): 48.8
Region level (200 km): 34.0
Street level (1 km): 8.3
Training Images: 4.7M
photo-geolocation-estimation-on-yfcc26kISNs (M, f*, S3)
City level (25 km): 12.3
Continent level (2500 km): 50.7
Country level (750 km): 31.9
Region level (200 km): 19.0
Street level (1 km): 5.3
Training Images: 4.7M

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification | Papers | HyperAI