Command Palette
Search for a command to run...
Sarkar Sayan Deb ; Miksik Ondrej ; Pollefeys Marc ; Barath Daniel ; Armeni Iro

Abstract
Building 3D scene graphs has recently emerged as a topic in scenerepresentation for several embodied AI applications to represent the world in astructured and rich manner. With their increased use in solving downstreamtasks (eg, navigation and room rearrangement), can we leverage and recycle themfor creating 3D maps of environments, a pivotal step in agent operation? Wefocus on the fundamental problem of aligning pairs of 3D scene graphs whoseoverlap can range from zero to partial and can contain arbitrary changes. Wepropose SGAligner, the first method for aligning pairs of 3D scene graphs thatis robust to in-the-wild scenarios (ie, unknown overlap -- if any -- andchanges in the environment). We get inspired by multi-modality knowledge graphsand use contrastive learning to learn a joint, multi-modal embedding space. Weevaluate on the 3RScan dataset and further showcase that our method can be usedfor estimating the transformation between pairs of 3D scenes. Since benchmarksfor these tasks are missing, we create them on this dataset. The code,benchmark, and trained models are available on the project website.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-scene-graph-alignment-on-3dssg | SGAligner | F1: 89.3 Hits@1: 94.3 MRR: 96.3 |
| point-cloud-registration-on-3rscan | SGAligner | CD: 0.0111 RRE: 1.012 RTE: 1.67 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.