HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation

Gong Kehong ; Zhang Jianfeng ; Feng Jiashi

PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose
  Estimation

Abstract

Existing 3D human pose estimators suffer poor generalization performance tonew datasets, largely due to the limited diversity of 2D-3D pose pairs in thetraining data. To address this problem, we present PoseAug, a newauto-augmentation framework that learns to augment the available training posestowards a greater diversity and thus improve generalization of the trained2D-to-3D pose estimator. Specifically, PoseAug introduces a novel poseaugmentor that learns to adjust various geometry factors (e.g., posture, bodysize, view point and position) of a pose through differentiable operations.With such differentiable capacity, the augmentor can be jointly optimized withthe 3D pose estimator and take the estimation error as feedback to generatemore diverse and harder poses in an online manner. Moreover, PoseAug introducesa novel part-aware Kinematic Chain Space for evaluating local joint-angleplausibility and develops a discriminative module accordingly to ensure theplausibility of the augmented poses. These elaborate designs enable PoseAug togenerate more diverse yet plausible poses than existing offline augmentationmethods, and thus yield better generalization of the pose estimator. PoseAug isgeneric and easy to be applied to various 3D pose estimators. Extensiveexperiments demonstrate that PoseAug brings clear improvements on bothintra-scenario and cross-scenario datasets. Notably, it achieves 88.6% 3D PCKon MPI-INF-3DHP under cross-dataset evaluation setup, improving upon theprevious best data augmentation based method by 9.1%. Code can be found at:https://github.com/jfzhang95/PoseAug.

Code Repositories

jfzhang95/PoseAug
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-human-pose-estimation-on-3dpwHR-Net+ST-GCN+PoseAug
PA-MPJPE: 73.2
3d-human-pose-estimation-on-human36mHR-Net+ST-GCN+PoseAug
Average MPJPE (mm): 50.8
Multi-View or Monocular: Monocular
Using 2D ground-truth joints: No
3d-human-pose-estimation-on-human36m2DGT+VPose+PoseAug (GTi)
Average MPJPE (mm): 38.2
Multi-View or Monocular: Monocular
Using 2D ground-truth joints: Yes
3d-human-pose-estimation-on-human36m2DGT+ST-GCN+PoseAug (GTi)
Average MPJPE (mm): 36.9
Multi-View or Monocular: Monocular
Using 2D ground-truth joints: Yes
3d-human-pose-estimation-on-human36mHR-Net+VPose+PoseAug
Average MPJPE (mm): 50.2
Multi-View or Monocular: Monocular
Using 2D ground-truth joints: No
3d-human-pose-estimation-on-mpi-inf-3dhpPoseAug (+Extra2D)
AUC: 57.9
MPJPE: 71.1
PCK: 89.2
3d-human-pose-estimation-on-mpi-inf-3dhpHR-Net+ST-GCN+PoseAug
MPJPE: 76.6
3d-human-pose-estimation-on-mpi-inf-3dhpVPose+PoseAug
AUC: 57.3
MPJPE: 73
PCK: 88.6
3d-human-pose-estimation-on-mpi-inf-3dhpHR-Net+VPose+PoseAug
MPJPE: 73.2
monocular-3d-human-pose-estimation-on-human3HR-Net+VPose+PoseAug
Average MPJPE (mm): 50.2
PA-MPJPE: 39.1
monocular-3d-human-pose-estimation-on-human3PoseAug
Frames Needed: 1
Need Ground Truth 2D Pose: No
Use Video Sequence: No
weakly-supervised-3d-human-pose-estimation-onPoseAug
3D Annotations: S1
Average MPJPE (mm): 56.7
Number of Frames Per View: 1
Number of Views: 1
weakly-supervised-3d-human-pose-estimation-onPoseAug
3D Annotations: S1
Number of Frames Per View: 1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation | Papers | HyperAI