HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation

Zhe Wang Daeyun Shin Charless C. Fowlkes

Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation

Abstract

Monocular estimation of 3d human pose has attracted increased attention with the availability of large ground-truth motion capture datasets. However, the diversity of training data available is limited and it is not clear to what extent methods generalize outside the specific datasets they are trained on. In this work we carry out a systematic study of the diversity and biases present in specific datasets and its effect on cross-dataset generalization across a compendium of 5 pose datasets. We specifically focus on systematic differences in the distribution of camera viewpoints relative to a body-centered coordinate frame. Based on this observation, we propose an auxiliary task of predicting the camera viewpoint in addition to pose. We find that models trained to jointly predict viewpoint and pose systematically show significantly improved cross-dataset generalization.

Benchmarks

BenchmarkMethodologyMetrics
3d-human-pose-estimation-on-3dpwCross Dataset Generalization
MPJPE: 89.7
PA-MPJPE: 65.2
3d-human-pose-estimation-on-geometric-pose-1Cross Dataset Generalization
MPJPE: 53.3
3d-human-pose-estimation-on-human36mCross Dataset Generalization
Average MPJPE (mm): 52
Multi-View or Monocular: Monocular
PA-MPJPE: 42.5
Using 2D ground-truth joints: Yes
3d-human-pose-estimation-on-mpi-inf-3dhpCross Dataset Generalization
MPJPE: 90.3
PCK: 84.3
3d-human-pose-estimation-on-surreal-1Cross Dataset Generalization
MPJPE: 37.1
PCK: 97.3
monocular-3d-human-pose-estimation-on-human3cross-dataset-evaluation
Average MPJPE (mm): 52.0
Frames Needed: 1
Need Ground Truth 2D Pose: No
Use Video Sequence: No

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation | Papers | HyperAI