Command Palette
Search for a command to run...
Sun Yu ; Bao Qian ; Liu Wu ; Fu Yili ; Black Michael J. ; Mei Tao

Abstract
This paper focuses on the regression of multiple 3D people from a single RGBimage. Existing approaches predominantly follow a multi-stage pipeline thatfirst detects people in bounding boxes and then independently regresses their3D body meshes. In contrast, we propose to Regress all meshes in a One-stagefashion for Multiple 3D People (termed ROMP). The approach is conceptuallysimple, bounding box-free, and able to learn a per-pixel representation in anend-to-end manner. Our method simultaneously predicts a Body Center heatmap anda Mesh Parameter map, which can jointly describe the 3D body mesh on the pixellevel. Through a body-center-guided sampling process, the body mesh parametersof all people in the image are easily extracted from the Mesh Parameter map.Equipped with such a fine-grained representation, our one-stage framework isfree of the complex multi-stage process and more robust to occlusion. Comparedwith state-of-the-art methods, ROMP achieves superior performance on thechallenging multi-person benchmarks, including 3DPW and CMU Panoptic.Experiments on crowded/occluded datasets demonstrate the robustness undervarious types of occlusion. The released code is the first real-timeimplementation of monocular multi-person 3D mesh regression.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-depth-estimation-on-relative-human | ROMP | PCDR: 54.84 PCDR-Adult: 55.34 PCDR-Baby: 30.08 PCDR-Kid: 48.41 PCDR-Teen: 51.12 mPCDK: 0.866 |
| 3d-human-pose-estimation-on-3d-poses-in-the | ROMP | MPJPE: 81.76 |
| 3d-human-pose-estimation-on-3dpw | ROMP | MPJPE: 76.7 MPVPE: 93.4 PA-MPJPE: 47.3 |
| 3d-human-pose-estimation-on-cmu-panoptic | ROMP (ResNet-50) | Average MPJPE (mm): 127.6 |
| 3d-human-pose-estimation-on-emdb | ROMP | Average MPJAE (deg): 26.5975 Average MPJAE-PA (deg): 23.9901 Average MPJPE (mm): 112.652 Average MPJPE-PA (mm): 75.1869 Average MVE (mm): 134.863 Average MVE-PA (mm): 90.648 Jitter (10m/s^3): 71.2556 |
| 3d-multi-person-mesh-recovery-on-relative | ROMP | PCDR: 68.27 |
| multi-person-pose-estimation-on-crowdpose | ROMP+CAR | mAP @0.5:0.95: 58.6 |
| multi-person-pose-estimation-on-crowdpose | ROMP | mAP @0.5:0.95: 55.6 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.