HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Text2Loc: 3D Point Cloud Localization from Natural Language

Yan Xia Letian Shi Zifeng Ding João F. Henriques Daniel Cremers

Text2Loc: 3D Point Cloud Localization from Natural Language

Abstract

We tackle the problem of 3D point cloud localization based on a few natural linguistic descriptions and introduce a novel neural network, Text2Loc, that fully interprets the semantic relationship between points and text. Text2Loc follows a coarse-to-fine localization pipeline: text-submap global place recognition, followed by fine localization. In global place recognition, relational dynamics among each textual hint are captured in a hierarchical transformer with max-pooling (HTM), whereas a balance between positive and negative pairs is maintained using text-submap contrastive learning. Moreover, we propose a novel matching-free fine localization method to further refine the location predictions, which completely removes the need for complicated text-instance matching and is lighter, faster, and more accurate than previous methods. Extensive experiments show that Text2Loc improves the localization accuracy by up to $2\times$ over the state-of-the-art on the KITTI360Pose dataset. Our project page is publicly available at \url{https://yan-xia.github.io/projects/text2loc/}.

Code Repositories

kevin301342/cmmloc
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
visual-place-recognition-on-kitti360poseText2Loc
Localization Recall@1 : 0.37

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Text2Loc: 3D Point Cloud Localization from Natural Language | Papers | HyperAI