HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Learning Pose Grammar for Monocular 3D Pose Estimation

{Song-Chun Zhu Yuanlu Xu Wenguan Wang Jianwen Xie Xiaobai Liu}

Abstract

In this paper, we propose a pose grammar to tackle the problem of 3D human pose estimation from a monocular RGB image. Our model takes estimated 2D pose as the input and learns a generalized 2D-3D mapping function to leverage into 3D pose. The proposed model consists of a base network which efficiently captures pose-aligned features and a hierarchy of Bi-directional RNNs (BRNNs) on the top to explicitly incorporate a set of knowledge regarding human body configuration (i.e., kinematics, symmetry,motor coordination). The proposed model thus enforces high-level constraints over human poses. In learning, we develop a data augmentation algorithm to further improve model robustness against appearance variations and cross-view generalization ability. We validate our method on public 3D human pose benchmarks and propose a new evaluation protocol working on cross-view setting to verify the generalization capability of different methods. We empirically observe that most state-of-the-art methods encounter difficulty under such setting while our method can well handle such challenges.

Benchmarks

BenchmarkMethodologyMetrics
3d-human-pose-estimation-on-humaneva-i3D Pose Grammar Network
Mean Reconstruction Error (mm): 22.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Learning Pose Grammar for Monocular 3D Pose Estimation | Papers | HyperAI