HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

MUG: Multi-human Graph Network for 3D Mesh Reconstruction from 2D Pose

Chenyan Wu Yandong Li Xianfeng Tang James Wang

MUG: Multi-human Graph Network for 3D Mesh Reconstruction from 2D Pose

Abstract

Reconstructing multi-human body mesh from a single monocular image is an important but challenging computer vision problem. In addition to the individual body mesh models, we need to estimate relative 3D positions among subjects to generate a coherent representation. In this work, through a single graph neural network, named MUG (Multi-hUman Graph network), we construct coherent multi-human meshes using only multi-human 2D pose as input. Compared with existing methods, which adopt a detection-style pipeline (i.e., extracting image features and then locating human instances and recovering body meshes from that) and suffer from the significant domain gap between lab-collected training datasets and in-the-wild testing datasets, our method benefits from the 2D pose which has a relatively consistent geometric property across datasets. Our method works like the following: First, to model the multi-human environment, it processes multi-human 2D poses and builds a novel heterogeneous graph, where nodes from different people and within one person are connected to capture inter-human interactions and draw the body geometry (i.e., skeleton and mesh structure). Second, it employs a dual-branch graph neural network structure -- one for predicting inter-human depth relation and the other one for predicting root-joint-relative mesh coordinates. Finally, the entire multi-human 3D meshes are constructed by combining the output from both branches. Extensive experiments demonstrate that MUG outperforms previous multi-human mesh estimation methods on standard 3D human benchmarks -- Panoptic, MuPoTS-3D and 3DPW.

Benchmarks

BenchmarkMethodologyMetrics
3d-human-pose-estimation-on-3dpwMUG
MPJPE: 87
MPVPE: 106.2
PA-MPJPE: 60.5
3d-human-pose-estimation-on-cmu-panopticMUG
Average MPJPE (mm): 127.8
3d-human-pose-estimation-on-human36mMUG
Average MPJPE (mm): 61.9
PA-MPJPE: 48.5
3d-human-pose-estimation-on-human36mMUG (GTi)
Average MPJPE (mm): 50.3
PA-MPJPE: 38.5
3d-multi-person-human-pose-estimation-onMUG
3DPCK: 76.27

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MUG: Multi-human Graph Network for 3D Mesh Reconstruction from 2D Pose | Papers | HyperAI