HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates

Diogo C Luvizon Hedi Tabia David Picard

Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates

Abstract

3D human pose estimation is frequently seen as the task of estimating 3D poses relative to the root body joint. Alternatively, we propose a 3D human pose estimation method in camera coordinates, which allows effective combination of 2D annotated data and 3D poses and a straightforward multi-view generalization. To that end, we cast the problem as a view frustum space pose estimation, where absolute depth prediction and joint relative depth estimations are disentangled. Final 3D predictions are obtained in camera coordinates by the inverse camera projection. Based on this, we also present a consensus-based optimization algorithm for multi-view predictions from uncalibrated images, which requires a single monocular training procedure. Although our method is indirectly tied to the training camera intrinsics, it still converges for cameras with different intrinsic parameters, resulting in coherent estimations up to a scale factor. Our method improves the state of the art on well known 3D human pose datasets, reducing the prediction error by 32% in the most common benchmark. We also reported our results in absolute pose position error, achieving 80~mm for monocular estimations and 51~mm for multi-view, on average.

Code Repositories

dluvizon/3d-pose-consensus
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-human-pose-estimation-on-human36mPose Consensus (monocular)
Average MPJPE (mm): 52
Multi-View or Monocular: Monocular
Using 2D ground-truth joints: No
3d-human-pose-estimation-on-human36mPose Consensus (multi-view, est. calib.)
Average MPJPE (mm): 45
Multi-View or Monocular: Multi-View
Using 2D ground-truth joints: No
3d-human-pose-estimation-on-human36mPose Consensus (multi-view, GT calib.)
Average MPJPE (mm): 39
Multi-View or Monocular: Multi-View
Using 2D ground-truth joints: No
3d-human-pose-estimation-on-mpi-inf-3dhpPose Consensus (monocular)
AUC: 42.1
MPJPE: 112.1
PCK: 80.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates | Papers | HyperAI