Command Palette
Search for a command to run...
MUG: Multi-human Graph Network for 3D Mesh Reconstruction from 2D Pose
Chenyan Wu Yandong Li Xianfeng Tang James Wang

Abstract
Reconstructing multi-human body mesh from a single monocular image is an important but challenging computer vision problem. In addition to the individual body mesh models, we need to estimate relative 3D positions among subjects to generate a coherent representation. In this work, through a single graph neural network, named MUG (Multi-hUman Graph network), we construct coherent multi-human meshes using only multi-human 2D pose as input. Compared with existing methods, which adopt a detection-style pipeline (i.e., extracting image features and then locating human instances and recovering body meshes from that) and suffer from the significant domain gap between lab-collected training datasets and in-the-wild testing datasets, our method benefits from the 2D pose which has a relatively consistent geometric property across datasets. Our method works like the following: First, to model the multi-human environment, it processes multi-human 2D poses and builds a novel heterogeneous graph, where nodes from different people and within one person are connected to capture inter-human interactions and draw the body geometry (i.e., skeleton and mesh structure). Second, it employs a dual-branch graph neural network structure -- one for predicting inter-human depth relation and the other one for predicting root-joint-relative mesh coordinates. Finally, the entire multi-human 3D meshes are constructed by combining the output from both branches. Extensive experiments demonstrate that MUG outperforms previous multi-human mesh estimation methods on standard 3D human benchmarks -- Panoptic, MuPoTS-3D and 3DPW.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-human-pose-estimation-on-3dpw | MUG | MPJPE: 87 MPVPE: 106.2 PA-MPJPE: 60.5 |
| 3d-human-pose-estimation-on-cmu-panoptic | MUG | Average MPJPE (mm): 127.8 |
| 3d-human-pose-estimation-on-human36m | MUG | Average MPJPE (mm): 61.9 PA-MPJPE: 48.5 |
| 3d-human-pose-estimation-on-human36m | MUG (GTi) | Average MPJPE (mm): 50.3 PA-MPJPE: 38.5 |
| 3d-multi-person-human-pose-estimation-on | MUG | 3DPCK: 76.27 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.