HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation

William McNally Kanav Vats Alexander Wong John McPhee

Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation

Abstract

In keypoint estimation tasks such as human pose estimation, heatmap-based regression is the dominant approach despite possessing notable drawbacks: heatmaps intrinsically suffer from quantization error and require excessive computation to generate and post-process. Motivated to find a more efficient solution, we propose to model individual keypoints and sets of spatially related keypoints (i.e., poses) as objects within a dense single-stage anchor-based detection framework. Hence, we call our method KAPAO (pronounced "Ka-Pow"), for Keypoints And Poses As Objects. KAPAO is applied to the problem of single-stage multi-person human pose estimation by simultaneously detecting human pose and keypoint objects and fusing the detections to exploit the strengths of both object representations. In experiments, we observe that KAPAO is faster and more accurate than previous methods, which suffer greatly from heatmap post-processing. The accuracy-speed trade-off is especially favourable in the practical setting when not using test-time augmentation. Source code: https://github.com/wmcnally/kapao.

Code Repositories

wmcnally/kapao
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
pose-estimation-on-coco-test-devKAPAO-L
AP: 70.3
AP50: 91.2
AP75: 77.8
APL: 76.8
APM: 66.3
AR: 77.7
pose-estimation-on-coco-test-devKAPAO-S
AP: 63.8
AP50: 88.4
AP75: 70.4
APL: 71.7
APM: 58.6
AR: 71.2
pose-estimation-on-coco-test-devKAPAO-M
AP: 68.8
AP50: 90.5
AP75: 76.5
APL: 76
APM: 64.3
AR: 76.3
pose-estimation-on-crowdposeKAPAO-S
AP: 63.8
AP50: 87.7
AP75: 69.4
APM: 64.8
Test: 72.1
pose-estimation-on-crowdposeKAPAO-M
AP: 67.1
AP50: 88.8
AP75: 73.4
APM: 68.1
Test: 75.2
pose-estimation-on-crowdposeKAPAO-L
AP: 68.9
AP50: 89.4
AP75: 75.6
APM: 69.9
Test: 76.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation | Papers | HyperAI