HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

3D-LFM: Lifting Foundation Model

Dabhi Mosam ; Jeni Laszlo A. ; Lucey Simon

3D-LFM: Lifting Foundation Model

Abstract

The lifting of 3D structure and camera from 2D landmarks is at thecornerstone of the entire discipline of computer vision. Traditional methodshave been confined to specific rigid objects, such as those inPerspective-n-Point (PnP) problems, but deep learning has expanded ourcapability to reconstruct a wide range of object classes (e.g. C3DPO and PAUL)with resilience to noise, occlusions, and perspective distortions. All thesetechniques, however, have been limited by the fundamental need to establishcorrespondences across the 3D training data -- significantly limiting theirutility to applications where one has an abundance of "in-correspondence" 3Ddata. Our approach harnesses the inherent permutation equivariance oftransformers to manage varying number of points per 3D data instance,withstands occlusions, and generalizes to unseen categories. We demonstratestate of the art performance across 2D-3D lifting task benchmarks. Since ourapproach can be trained across such a broad class of structures we refer to itsimply as a 3D Lifting Foundation Model (3D-LFM) -- the first of its kind.

Code Repositories

mosamdabhi/3dlfm
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-facial-landmark-localization-on-h3wb3D-LFM
Average MPJPE (mm): 10.44
3d-hand-pose-estimation-on-h3wb3D-LFM
Average MPJPE (mm): 28.22
3d-human-pose-estimation-on-h3wb3D-LFM
MPJPE: 60.83
3d-human-pose-estimation-on-human36m3D-LFM
Average MPJPE (mm): 31.89
Multi-View or Monocular: Monocular
Using 2D ground-truth joints: Yes

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
3D-LFM: Lifting Foundation Model | Papers | HyperAI