8 months ago

Abstract

Predicting the geographic location (geo-localization) of a single ground-level RGB image taken anywhere in the world is a very challenging problem. The challenges include the huge diversity of images arising from different environmental scenarios and the drastic variation in the appearance of the same location depending on the time of day, weather, and season. More importantly, the prediction must be made from a single image that may contain only a few geo-locating cues. For these reasons, most existing works are restricted to specific cities, imagery, or worldwide landmarks. In this work, we focus on developing an efficient solution to planet-scale single-image geo-localization. To this end, we propose TransLocator, a unified dual-branch transformer network that attends to tiny details over the entire image and produces robust feature representations under extreme appearance variations. TransLocator takes an RGB image and its semantic segmentation map as inputs, exchanges information between its two parallel branches after each transformer layer, and simultaneously performs geo-localization and scene recognition in a multi-task fashion. We evaluate TransLocator on four benchmark datasets (Im2GPS, Im2GPS3k, YFCC4k, and YFCC26k) and obtain continent-level accuracy improvements of 5.5%, 14.1%, 4.9%, and 9.9%, respectively, over the state of the art. TransLocator is also validated on real-world test images and found to be more effective than previous methods.
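To make the "continent-level accuracy" figure concrete, the following is a minimal sketch of how such a metric is commonly computed: the fraction of test images whose predicted coordinates fall within a distance threshold of the ground truth, measured along the great circle. The 2500 km threshold is an assumption based on the convention commonly used for Im2GPS-style benchmarks; the abstract does not state it. The function names `great_circle_km` and `accuracy_within` are hypothetical and are not taken from the paper.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, a standard approximation


def great_circle_km(lat1, lon1, lat2, lon2):
    """Haversine distance in kilometres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot near antipodal points.
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(a)))


def accuracy_within(preds, truths, threshold_km=2500.0):
    """Fraction of predictions within threshold_km of the ground truth.

    preds and truths are equal-length sequences of (lat, lon) pairs in degrees.
    The default of 2500 km is the assumed continent-level threshold.
    """
    if len(preds) != len(truths):
        raise ValueError("preds and truths must have the same length")
    if not preds:
        raise ValueError("at least one prediction is required")
    hits = sum(
        great_circle_km(p[0], p[1], t[0], t[1]) <= threshold_km
        for p, t in zip(preds, truths)
    )
    return hits / len(preds)


if __name__ == "__main__":
    # Illustrative coordinates only, not data from the paper.
    preds = [(48.85, 2.35), (40.71, -74.00)]    # roughly Paris, New York
    truths = [(51.50, -0.13), (34.05, -118.24)]  # roughly London, Los Angeles
    print(accuracy_within(preds, truths))
```

Passing a smaller `threshold_km` gives the same metric at finer scales, so one function covers every granularity a benchmark reports.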

Computer Vision

Geographic Information

Shraman Pramanick Ewa M. Nowara Joshua Gleason Carlos D. Castillo Rama Chellappa

Where in the World is this Image? Transformer-based Geo-localization in the Wild
