Command Palette
Search for a command to run...
I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image
Gyeongsik Moon Kyoung Mu Lee

Abstract
Most of the previous image-based 3D human pose and mesh estimation methods estimate parameters of the human mesh model from an input image. However, directly regressing the parameters from the input image is a highly non-linear mapping because it breaks the spatial relationship between pixels in the input image. In addition, it cannot model the prediction uncertainty, which can make training harder. To resolve the above issues, we propose I2L-MeshNet, an image-to-lixel (line+pixel) prediction network. The proposed I2L-MeshNet predicts the per-lixel likelihood on 1D heatmaps for each mesh vertex coordinate instead of directly regressing the parameters. Our lixel-based 1D heatmap preserves the spatial relationship in the input image and models the prediction uncertainty. We demonstrate the benefit of the image-to-lixel prediction and show that the proposed I2L-MeshNet outperforms previous methods. The code is publicly available https://github.com/mks0601/I2L-MeshNet_RELEASE.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-hand-pose-estimation-on-freihand | I2L-MeshNet | PA-F@15mm: 0.973 PA-F@5mm: 0.681 PA-MPJPE: 7.4 PA-MPVPE: 7.6 |
| 3d-hand-pose-estimation-on-ho-3d | I2L-MeshNet | AUC_J: 0.775 AUC_V: 0.722 F@15mm: 0.932 F@5mm: 0.409 PA-MPJPE (mm): 11.2 PA-MPVPE: 13.9 |
| 3d-human-pose-estimation-on-3dpw | I2L-MeshNet | Acceleration Error: 30.9 MPJPE: 93.2 MPVPE: 110.1 PA-MPJPE: 58.6 |
| 3d-human-pose-estimation-on-human36m | I2L-MeshNet | Average MPJPE (mm): 55.7 PA-MPJPE: 41.7 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.