Command Palette
Search for a command to run...
GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras
Yuan Ye ; Iqbal Umar ; Molchanov Pavlo ; Kitani Kris ; Kautz Jan

Abstract
We present an approach for 3D global human mesh recovery from monocularvideos recorded with dynamic cameras. Our approach is robust to severe andlong-term occlusions and tracks human bodies even when they go outside thecamera's field of view. To achieve this, we first propose a deep generativemotion infiller, which autoregressively infills the body motions of occludedhumans based on visible motions. Additionally, in contrast to prior work, ourapproach reconstructs human meshes in consistent global coordinates even withdynamic cameras. Since the joint reconstruction of human motions and cameraposes is underconstrained, we propose a global trajectory predictor thatgenerates global human trajectories based on local body movements. Using thepredicted trajectories as anchors, we present a global optimization frameworkthat refines the predicted trajectories and optimizes the camera poses to matchthe video evidence such as 2D keypoints. Experiments on challenging indoor andin-the-wild datasets with dynamic cameras demonstrate that the proposedapproach outperforms prior methods significantly in terms of motion infillingand global mesh recovery.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-human-pose-estimation-on-emdb | GLAMR | Average MPJAE (deg): 25.5058 Average MPJAE-PA (deg): 23.5484 Average MPJPE (mm): 107.797 Average MPJPE-PA (mm): 71.0091 Average MVE (mm): 128.237 Average MVE-PA (mm): 85.4586 Jitter (10m/s^3): 67.3522 |
| global-3d-human-pose-estimation-on-emdb | GLAMR | Average G-MPJPE (mm): 3193 Average G-MVE (mm): 3203 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.