4 months ago

A simple yet effective baseline for 3d human pose estimation

Julieta Martinez; Rayat Hossain; Javier Romero; James J. Little

Abstract

Following the success of deep convolutional networks, state-of-the-art methods for 3d human pose estimation have focused on deep end-to-end systems that predict 3d joint locations given raw image pixels. Despite their excellent performance, it is often not easy to understand whether their remaining error stems from a limited 2d pose (visual) understanding, or from a failure to map 2d poses into 3-dimensional positions. With the goal of understanding these sources of error, we set out to build a system that given 2d joint locations predicts 3d positions. Much to our surprise, we have found that, with current technology, "lifting" ground truth 2d joint locations to 3d space is a task that can be solved with a remarkably low error rate: a relatively simple deep feed-forward network outperforms the best reported result by about 30\% on Human3.6M, the largest publicly available 3d pose estimation benchmark. Furthermore, training our system on the output of an off-the-shelf state-of-the-art 2d detector (\ie, using images as input) yields state of the art results -- this includes an array of systems that have been trained end-to-end specifically for this task. Our results indicate that a large portion of the error of modern deep 3d pose estimation systems stems from their visual analysis, and suggests directions to further advance the state of the art in 3d human pose estimation.

Code Repositories

open-mmlab/mmpose

pytorch

sidify/3d-pose-estimation

Mentioned in GitHub

sidsunny/human-pose-estimation-mlsp

Mentioned in GitHub

una-dinosauria/3d-pose-baseline

Official

Mentioned in GitHub

garyzhao/SemGCN

pytorch

Mentioned in GitHub

jaroslaw1007/Pose_Baseline_3D_PyTorch

pytorch

Mentioned in GitHub

zhimingzo/modulated-gcn

pytorch

Mentioned in GitHub

ZHONGCHUYUN/3d_pose_baseline_tf

Mentioned in GitHub

happyvictor008/High-order-GNN-LF-iter

pytorch

Mentioned in GitHub

llSourcell/3D_Pose_Estimation

Mentioned in GitHub

SJTU-DL-lab/3d-pose-baseline

Mentioned in GitHub

ailingzengzzz/Split-and-Recombine-Net

pytorch

Mentioned in GitHub

serviceberry3/3d-pose-baseline_android

Mentioned in GitHub

denilson020898/baseline_3d_pose

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
3d-human-pose-estimation-on-3dpw	Simple-baseline	PA-MPJPE: 157.0
3d-human-pose-estimation-on-human36m	SIM (GT detections) (MA)	Average MPJPE (mm): 45.5 Using 2D ground-truth joints: Yes
3d-human-pose-estimation-on-human36m	SIM (SH detections FT) (MA)	Average MPJPE (mm): 62.9 Multi-View or Monocular: Monocular Using 2D ground-truth joints: No
3d-human-pose-estimation-on-human36m	SIM (SH detections) (MA)	Average MPJPE (mm): 67.5
3d-human-pose-estimation-on-humaneva-i	SIM (SH detections)	Mean Reconstruction Error (mm): 24.6
monocular-3d-human-pose-estimation-on-human3	SIM (SH detections FT) (MA)	Average MPJPE (mm): 62.9 Frames Needed: 1 Need Ground Truth 2D Pose: No Use Video Sequence: No

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

A simple yet effective baseline for 3d human pose estimation

Julieta Martinez; Rayat Hossain; Javier Romero; James J. Little

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters