HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

End-to-End Human Pose and Mesh Reconstruction with Transformers

Lin Kevin ; Wang Lijuan ; Liu Zicheng

End-to-End Human Pose and Mesh Reconstruction with Transformers

Abstract

We present a new method, called MEsh TRansfOrmer (METRO), to reconstruct 3Dhuman pose and mesh vertices from a single image. Our method uses a transformerencoder to jointly model vertex-vertex and vertex-joint interactions, andoutputs 3D joint coordinates and mesh vertices simultaneously. Compared toexisting techniques that regress pose and shape parameters, METRO does not relyon any parametric mesh models like SMPL, thus it can be easily extended toother objects such as hands. We further relax the mesh topology and allow thetransformer self-attention mechanism to freely attend between any two vertices,making it possible to learn non-local relationships among mesh vertices andjoints. With the proposed masked vertex modeling, our method is more robust andeffective in handling challenging situations like partial occlusions. METROgenerates new state-of-the-art results for human mesh reconstruction on thepublic Human3.6M and 3DPW datasets. Moreover, we demonstrate thegeneralizability of METRO to 3D hand reconstruction in the wild, outperformingexisting state-of-the-art methods on FreiHAND dataset. Code and pre-trainedmodels are available at https://github.com/microsoft/MeshTransformer.

Code Repositories

microsoft/MeshTransformer
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-hand-pose-estimation-on-dexycbMETRO
Average MPJPE (mm): 15.2
MPVPE: -
PA-MPVPE: -
PA-VAUC: -
Procrustes-Aligned MPJPE: 6.99
VAUC: -
3d-hand-pose-estimation-on-freihandMETRO
PA-F@15mm: 0.984
PA-F@5mm: 0.731
PA-MPJPE: 6.5
PA-MPVPE: 6.3
3d-hand-pose-estimation-on-hint-handMETRO
PCK@0.05 (Ego4D) All: 13.2
PCK@0.05 (Ego4D) Occ: 8.1
PCK@0.05 (Ego4D) Visible: 15.8
PCK@0.05 (New Days) All: 14.7
PCK@0.05 (NewDays) Occ: 7.0
PCK@0.05 (NewDays) Visible: 19.2
PCK@0.05 (VISOR) All: 16.8
PCK@0.05 (VISOR) Occ: 10.2
PCK@0.05 (VISOR) Visible: 19.7
3d-hand-pose-estimation-on-ho-3dMETRO
AUC_J: 0.792
AUC_V: 0.779
F@15mm: 0.946
F@5mm: 0.484
PA-MPJPE (mm): 10.4
PA-MPVPE: 11.1
3d-human-pose-estimation-on-3dpwMETRO
MPJPE: 77.1
MPVPE: 88.2
PA-MPJPE: 47.9
3d-human-pose-estimation-on-human36mMETRO
Average MPJPE (mm): 54
Multi-View or Monocular: Monocular
PA-MPJPE: 36.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
End-to-End Human Pose and Mesh Reconstruction with Transformers | Papers | HyperAI