Command Palette
Search for a command to run...
PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation
Gong Kehong ; Zhang Jianfeng ; Feng Jiashi

Abstract
Existing 3D human pose estimators suffer poor generalization performance tonew datasets, largely due to the limited diversity of 2D-3D pose pairs in thetraining data. To address this problem, we present PoseAug, a newauto-augmentation framework that learns to augment the available training posestowards a greater diversity and thus improve generalization of the trained2D-to-3D pose estimator. Specifically, PoseAug introduces a novel poseaugmentor that learns to adjust various geometry factors (e.g., posture, bodysize, view point and position) of a pose through differentiable operations.With such differentiable capacity, the augmentor can be jointly optimized withthe 3D pose estimator and take the estimation error as feedback to generatemore diverse and harder poses in an online manner. Moreover, PoseAug introducesa novel part-aware Kinematic Chain Space for evaluating local joint-angleplausibility and develops a discriminative module accordingly to ensure theplausibility of the augmented poses. These elaborate designs enable PoseAug togenerate more diverse yet plausible poses than existing offline augmentationmethods, and thus yield better generalization of the pose estimator. PoseAug isgeneric and easy to be applied to various 3D pose estimators. Extensiveexperiments demonstrate that PoseAug brings clear improvements on bothintra-scenario and cross-scenario datasets. Notably, it achieves 88.6% 3D PCKon MPI-INF-3DHP under cross-dataset evaluation setup, improving upon theprevious best data augmentation based method by 9.1%. Code can be found at:https://github.com/jfzhang95/PoseAug.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-human-pose-estimation-on-3dpw | HR-Net+ST-GCN+PoseAug | PA-MPJPE: 73.2 |
| 3d-human-pose-estimation-on-human36m | HR-Net+ST-GCN+PoseAug | Average MPJPE (mm): 50.8 Multi-View or Monocular: Monocular Using 2D ground-truth joints: No |
| 3d-human-pose-estimation-on-human36m | 2DGT+VPose+PoseAug (GTi) | Average MPJPE (mm): 38.2 Multi-View or Monocular: Monocular Using 2D ground-truth joints: Yes |
| 3d-human-pose-estimation-on-human36m | 2DGT+ST-GCN+PoseAug (GTi) | Average MPJPE (mm): 36.9 Multi-View or Monocular: Monocular Using 2D ground-truth joints: Yes |
| 3d-human-pose-estimation-on-human36m | HR-Net+VPose+PoseAug | Average MPJPE (mm): 50.2 Multi-View or Monocular: Monocular Using 2D ground-truth joints: No |
| 3d-human-pose-estimation-on-mpi-inf-3dhp | PoseAug (+Extra2D) | AUC: 57.9 MPJPE: 71.1 PCK: 89.2 |
| 3d-human-pose-estimation-on-mpi-inf-3dhp | HR-Net+ST-GCN+PoseAug | MPJPE: 76.6 |
| 3d-human-pose-estimation-on-mpi-inf-3dhp | VPose+PoseAug | AUC: 57.3 MPJPE: 73 PCK: 88.6 |
| 3d-human-pose-estimation-on-mpi-inf-3dhp | HR-Net+VPose+PoseAug | MPJPE: 73.2 |
| monocular-3d-human-pose-estimation-on-human3 | HR-Net+VPose+PoseAug | Average MPJPE (mm): 50.2 PA-MPJPE: 39.1 |
| monocular-3d-human-pose-estimation-on-human3 | PoseAug | Frames Needed: 1 Need Ground Truth 2D Pose: No Use Video Sequence: No |
| weakly-supervised-3d-human-pose-estimation-on | PoseAug | 3D Annotations: S1 Average MPJPE (mm): 56.7 Number of Frames Per View: 1 Number of Views: 1 |
| weakly-supervised-3d-human-pose-estimation-on | PoseAug | 3D Annotations: S1 Number of Frames Per View: 1 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.