Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video

Xiaowei Zhou; Menglong Zhu; Spyridon Leonardos; Kosta Derpanis; Kostas Daniilidis


Abstract

This paper addresses the challenge of 3D full-body human pose estimation from a monocular image sequence. Two cases are considered: (i) the image locations of the human joints are provided and (ii) the image locations of the joints are unknown. For the first case, a novel approach is introduced that integrates a sparsity-driven 3D geometric prior with temporal smoothness. For the second case, this approach is extended by treating the image locations of the joints as latent variables. A deep fully convolutional network is trained to predict uncertainty maps of the 2D joint locations. The 3D pose estimates are obtained via an Expectation-Maximization algorithm over the entire sequence, where it is shown that the 2D joint location uncertainties can be conveniently marginalized out during inference. Empirical evaluation on the Human3.6M dataset shows that the proposed approaches achieve greater 3D pose estimation accuracy than state-of-the-art baselines. Further, the proposed approach outperforms a publicly available 2D pose estimation baseline on the challenging PennAction dataset.

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| 3d-human-pose-estimation-on-human36m | Sparseness Meets Deepness | Average MPJPE (mm): 113.01 |
| 3d-human-pose-estimation-on-human36m | Sparseness Meets Deepness | PA-MPJPE: 106.7 |
| monocular-3d-human-pose-estimation-on-human3 | Sparseness Meets Deepness | Average MPJPE (mm): 113.01; Frames Needed: 300; Need Ground Truth 2D Pose: No; Use Video Sequence: Yes |
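
For readers unfamiliar with the two metrics listed above, the following is a minimal sketch of how MPJPE (mean per-joint position error) and PA-MPJPE (the same error after a per-frame Procrustes alignment) are commonly computed. It follows the standard definitions of these metrics, not necessarily the exact evaluation protocol behind the numbers on this page; the function names, the array layout `(frames, joints, 3)`, and the choice to align with rotation, uniform scale, and translation are assumptions made for illustration.

```python
import numpy as np


def mpjpe(pred, gt):
    """Mean Euclidean distance between predicted and ground-truth joints.

    pred, gt: arrays of shape (frames, joints, 3), e.g. in millimetres.
    """
    return float(np.mean(np.linalg.norm(pred - gt, axis=-1)))


def procrustes_align(pred, gt):
    """Align a single pose pred (joints, 3) to gt (joints, 3).

    Finds the rotation, uniform scale, and translation that minimise the
    squared distance between the two point sets (assumed alignment model).
    """
    mu_pred, mu_gt = pred.mean(axis=0), gt.mean(axis=0)
    p, g = pred - mu_pred, gt - mu_gt

    # SVD of the 3x3 cross-covariance gives the optimal rotation.
    u, s, vt = np.linalg.svd(p.T @ g)

    # Flip the last axis if needed so the result is a proper rotation.
    d = np.sign(np.linalg.det(u @ vt))
    signs = np.array([1.0, 1.0, d])
    rot = u @ np.diag(signs) @ vt

    scale = (s * signs).sum() / (p ** 2).sum()
    return scale * p @ rot + mu_gt


def pa_mpjpe(pred, gt):
    """MPJPE after aligning every predicted frame to its ground truth."""
    aligned = np.stack([procrustes_align(p, g) for p, g in zip(pred, gt)])
    return mpjpe(aligned, gt)
```

Because the alignment removes global rotation, scale, and translation, PA-MPJPE is never meaningfully larger than MPJPE for the same predictions, which is consistent with the two Human3.6M values listed in the table.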
