Command Palette
Search for a command to run...
Deep High-Resolution Representation Learning for Human Pose Estimation
Ke Sun; Bin Xiao; Dong Liu; Jingdong Wang

Abstract
This is an official pytorch implementation of Deep High-Resolution Representation Learning for Human Pose Estimation. In this work, we are interested in the human pose estimation problem with a focus on learning reliable high-resolution representations. Most existing methods recover high-resolution representations from low-resolution representations produced by a high-to-low resolution network. Instead, our proposed network maintains high-resolution representations through the whole process. We start from a high-resolution subnetwork as the first stage, gradually add high-to-low resolution subnetworks one by one to form more stages, and connect the mutli-resolution subnetworks in parallel. We conduct repeated multi-scale fusions such that each of the high-to-low resolution representations receives information from other parallel representations over and over, leading to rich high-resolution representations. As a result, the predicted keypoint heatmap is potentially more accurate and spatially more precise. We empirically demonstrate the effectiveness of our network through the superior pose estimation results over two benchmark datasets: the COCO keypoint detection dataset and the MPII Human Pose dataset. The code and models have been publicly available at \url{https://github.com/leoxiaobin/deep-high-resolution-net.pytorch}.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 2d-human-pose-estimation-on-coco-wholebody-1 | HRNet | WB: 43.2 body: 65.9 face: 52.3 foot: 31.4 hand: 30.0 |
| 2d-human-pose-estimation-on-human-art | HRNet-w48 | AP: 0.417 AP (gt bbox): 0.769 |
| 2d-human-pose-estimation-on-human-art | HRNet-w32 | AP: 0.399 AP (gt bbox): 0.754 |
| 3d-pose-estimation-on-harper | HRNet + Depth | Average MPJPE (mm): 151 |
| instance-segmentation-on-coco-minival | HTC (HRNetV2p-W48) | mask AP: 41.0 |
| keypoint-detection-on-coco | HRNet-48(384x288) | Test AP: 75.5 Validation AP: 76.3 |
| keypoint-detection-on-coco | HRNet-32 | Validation AP: 75.8 |
| keypoint-detection-on-coco-test-dev | HRNet | AP50: 92.5 AP75: 83.3 APL: 81.5 APM: 71.9 AR: 80.5 |
| keypoint-detection-on-coco-test-dev | HRNet* | AP50: 92.7 AP75: 84.5 APL: 83.1 APM: 73.4 AR: 82.0 |
| pose-estimation-on-aic | HRNet (HRNet-w32) | AP: 32.3 AP50: 76.2 AP75: 21.9 AR: 36.6 AR50: 78.9 |
| pose-estimation-on-aic | HRNet (HRNet-w48 ) | AP: 33.5 AP50: 78.0 AP75: 23.6 AR: 37.9 AR50: 80.0 |
| pose-estimation-on-brace | HRNet pre-trained on COCO | Average Precision: 0.158 Average Recall: 0.202 |
| pose-estimation-on-brace | HRNet fine-tuned on BRACE | Average Precision: 0.357 Average Recall: 0.445 |
| pose-estimation-on-coco-test-dev | HRNet-W48 + extra data | AP: 77 AP50: 92.7 AP75: 84.5 APL: 83.1 APM: 73.4 AR: 82 |
| pose-estimation-on-coco-val2017 | HRNet (256x192) | AP: 75.3 AP50: - AP75: - AR: - |
| pose-estimation-on-mpii-human-pose | HRNet-W32 | PCKh-0.5: 92.3 |
| pose-tracking-on-posetrack2017 | HRNet-W48 COCO | MOTA: 57.93 mAP: 74.95 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.