Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose

Georgios Pavlakos; Xiaowei Zhou; Konstantinos G. Derpanis; Kostas Daniilidis

Abstract

This paper addresses the challenge of 3D human pose estimation from a single color image. Despite the general success of the end-to-end learning paradigm, top-performing approaches employ a two-step solution consisting of a Convolutional Network (ConvNet) for 2D joint localization and a subsequent optimization step to recover 3D pose. In this paper, we identify the representation of 3D pose as a critical issue with current ConvNet approaches and make two important contributions towards validating the value of end-to-end learning for this task. First, we propose a fine discretization of the 3D space around the subject and train a ConvNet to predict per-voxel likelihoods for each joint. This creates a natural representation for 3D pose and greatly improves performance over the direct regression of joint coordinates. Second, to further improve upon initial estimates, we employ a coarse-to-fine prediction scheme. This step addresses the large dimensionality increase and enables iterative refinement and repeated processing of the image features. The proposed approach outperforms all state-of-the-art methods on standard benchmarks, achieving a relative error reduction greater than 30% on average. Additionally, we investigate using our volumetric representation in a related architecture that is suboptimal compared to our end-to-end approach but is of practical interest, since it enables training when no image with corresponding 3D ground truth is available and allows us to present compelling results for in-the-wild images.
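To make the volumetric representation concrete, the following is a minimal sketch of how a single joint could be encoded as per-voxel likelihoods and decoded back to a 3D position. It is not the authors' implementation: the Gaussian target, the cubic bounds, the 16x16x16 resolution, and all function names (`make_target_volume`, `argmax_voxel`, `voxel_center`) are assumptions chosen for illustration.

```python
import math

def continuous_index(value, lo, hi, n):
    """Map a metric coordinate to a fractional voxel index.

    Voxel centers sit at integer indices 0 .. n-1.
    """
    return (value - lo) / (hi - lo) * n - 0.5

def make_target_volume(joint, bounds, resolution, sigma=1.0):
    """Build a dense volume of per-voxel likelihoods for one joint.

    Assumption: the target is an unnormalized 3D Gaussian (sigma in voxel
    units) centered on the joint position.
    """
    cx, cy, cz = (continuous_index(v, lo, hi, n)
                  for v, (lo, hi), n in zip(joint, bounds, resolution))
    nx, ny, nz = resolution
    return [[[math.exp(-((ix - cx) ** 2 + (iy - cy) ** 2 + (iz - cz) ** 2)
                       / (2.0 * sigma ** 2))
              for iz in range(nz)]
             for iy in range(ny)]
            for ix in range(nx)]

def argmax_voxel(volume):
    """Return the (ix, iy, iz) index of the most likely voxel."""
    best, best_idx = -1.0, None
    for ix, plane in enumerate(volume):
        for iy, row in enumerate(plane):
            for iz, p in enumerate(row):
                if p > best:
                    best, best_idx = p, (ix, iy, iz)
    return best_idx

def voxel_center(index, bounds, resolution):
    """Convert a voxel index back to the metric coordinate of its center."""
    return tuple(lo + (i + 0.5) * (hi - lo) / n
                 for i, (lo, hi), n in zip(index, bounds, resolution))

# Hypothetical setup: a 2 m cube around the subject, 16 voxels per axis.
bounds = ((-1.0, 1.0), (-1.0, 1.0), (-1.0, 1.0))
resolution = (16, 16, 16)
joint = (0.1, -0.3, 0.2)  # made-up joint position in meters

volume = make_target_volume(joint, bounds, resolution)
index = argmax_voxel(volume)
decoded = voxel_center(index, bounds, resolution)
```

With this encoding, the decoded position can differ from the true joint by at most half a voxel per axis, which is one way to see why a finer discretization matters. A coarse-to-fine scheme could, under the same assumptions, reuse `make_target_volume` with a resolution that grows from stage to stage.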

Code Repositories

thuml/ContextWM (pytorch; mentioned in GitHub)
strawberryfg/c2f-3dhm-human-caffe (pytorch; mentioned in GitHub)
geopavlakos/c2f-vol-train (pytorch; mentioned in GitHub)
geopavlakos/c2f-vol-demo (pytorch; mentioned in GitHub)

Benchmarks

Benchmark                               Methodology  Metrics
3d-human-pose-estimation-on-human36m    c2f-vol      Average MPJPE (mm): 71.9; PA-MPJPE: 51.9
3d-human-pose-estimation-on-humaneva-i  c2f-vol      Mean Reconstruction Error (mm): 24.3
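
For readers unfamiliar with the metric, MPJPE (mean per-joint position error) is the average Euclidean distance between predicted and ground-truth joint positions. The sketch below shows that computation on made-up coordinates. The function name `mpjpe` and the sample values are illustrative assumptions, and benchmark protocols typically apply an alignment step (for example root-joint alignment, or a Procrustes alignment for PA-MPJPE) that is omitted here.

```python
import math

def mpjpe(predicted, ground_truth):
    """Mean per-joint position error: average Euclidean distance per joint.

    Both arguments are sequences of (x, y, z) tuples in the same units
    (millimeters here) and in the same joint order.
    """
    if not predicted or len(predicted) != len(ground_truth):
        raise ValueError("pose lists must be non-empty and of equal length")
    total = sum(math.dist(p, g) for p, g in zip(predicted, ground_truth))
    return total / len(predicted)

# Made-up three-joint example, coordinates in millimeters.
gt = [(0.0, 0.0, 0.0), (100.0, 0.0, 0.0), (100.0, 200.0, 0.0)]
pred = [(0.0, 0.0, 0.0), (103.0, 4.0, 0.0), (100.0, 200.0, 12.0)]

error = mpjpe(pred, gt)  # per-joint errors are 0, 5, and 12 mm
```

The per-joint errors in this toy example average to 17/3 mm, about 5.67 mm.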
