HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Monocular, One-stage, Regression of Multiple 3D People

Sun Yu ; Bao Qian ; Liu Wu ; Fu Yili ; Black Michael J. ; Mei Tao

Monocular, One-stage, Regression of Multiple 3D People

Abstract

This paper focuses on the regression of multiple 3D people from a single RGBimage. Existing approaches predominantly follow a multi-stage pipeline thatfirst detects people in bounding boxes and then independently regresses their3D body meshes. In contrast, we propose to Regress all meshes in a One-stagefashion for Multiple 3D People (termed ROMP). The approach is conceptuallysimple, bounding box-free, and able to learn a per-pixel representation in anend-to-end manner. Our method simultaneously predicts a Body Center heatmap anda Mesh Parameter map, which can jointly describe the 3D body mesh on the pixellevel. Through a body-center-guided sampling process, the body mesh parametersof all people in the image are easily extracted from the Mesh Parameter map.Equipped with such a fine-grained representation, our one-stage framework isfree of the complex multi-stage process and more robust to occlusion. Comparedwith state-of-the-art methods, ROMP achieves superior performance on thechallenging multi-person benchmarks, including 3DPW and CMU Panoptic.Experiments on crowded/occluded datasets demonstrate the robustness undervarious types of occlusion. The released code is the first real-timeimplementation of monocular multi-person 3D mesh regression.

Code Repositories

Arthur151/ROMP
Official
pytorch
Mentioned in GitHub
cai-jianfeng/ROMP_mindspore
mindspore
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-depth-estimation-on-relative-humanROMP
PCDR: 54.84
PCDR-Adult: 55.34
PCDR-Baby: 30.08
PCDR-Kid: 48.41
PCDR-Teen: 51.12
mPCDK: 0.866
3d-human-pose-estimation-on-3d-poses-in-theROMP
MPJPE: 81.76
3d-human-pose-estimation-on-3dpwROMP
MPJPE: 76.7
MPVPE: 93.4
PA-MPJPE: 47.3
3d-human-pose-estimation-on-cmu-panopticROMP (ResNet-50)
Average MPJPE (mm): 127.6
3d-human-pose-estimation-on-emdbROMP
Average MPJAE (deg): 26.5975
Average MPJAE-PA (deg): 23.9901
Average MPJPE (mm): 112.652
Average MPJPE-PA (mm): 75.1869
Average MVE (mm): 134.863
Average MVE-PA (mm): 90.648
Jitter (10m/s^3): 71.2556
3d-multi-person-mesh-recovery-on-relativeROMP
PCDR: 68.27
multi-person-pose-estimation-on-crowdposeROMP+CAR
mAP @0.5:0.95: 58.6
multi-person-pose-estimation-on-crowdposeROMP
mAP @0.5:0.95: 55.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Monocular, One-stage, Regression of Multiple 3D People | Papers | HyperAI