Poseur: Direct Human Pose Regression with Transformers

Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang, Anton van den Hengel

Abstract

We propose a direct, regression-based approach to 2D human pose estimation from single images. We formulate the problem as a sequence prediction task, which we solve using a Transformer network. This network directly learns a regression mapping from images to the keypoint coordinates, without resorting to intermediate representations such as heatmaps. This approach avoids much of the complexity associated with heatmap-based approaches. To overcome the feature misalignment issues of previous regression-based methods, we propose an attention mechanism that adaptively attends to the features that are most relevant to the target keypoints, considerably improving the accuracy. Importantly, our framework is end-to-end differentiable, and naturally learns to exploit the dependencies between keypoints. Experiments on MS-COCO and MPII, two predominant pose-estimation datasets, demonstrate that our method significantly improves upon the state-of-the-art in regression-based pose estimation. More notably, ours is the first regression-based approach to perform favorably compared to the best heatmap-based pose estimation methods.
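The idea of regressing keypoint coordinates through attention, rather than decoding them from heatmaps, can be illustrated with a small sketch. This is not the Poseur architecture: it assumes one query vector per keypoint, a single attention head, and a linear head with a sigmoid to produce normalized coordinates. All names (`attend`, `regress_keypoints`, `head`) are hypothetical, and the weights are random and untrained, so the output coordinates carry no meaning beyond showing the data flow.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attend(query, features):
    """Scaled dot-product attention of one keypoint query over image features.

    Returns the attention-weighted feature vector and the attention weights.
    """
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, f) / scale for f in features])
    dim = len(features[0])
    pooled = [sum(w * f[j] for w, f in zip(weights, features)) for j in range(dim)]
    return pooled, weights


def regress_keypoints(queries, features, head):
    """Map each keypoint query directly to normalized (x, y) in [0, 1].

    `head` is a hypothetical 2 x dim linear layer (one row for x, one for y);
    no heatmap is produced at any point.
    """
    coords = []
    for q in queries:
        pooled, _ = attend(q, features)
        x, y = (1.0 / (1.0 + math.exp(-dot(row, pooled))) for row in head)
        coords.append((x, y))
    return coords


# Assumed toy sizes; 17 matches the number of COCO body keypoints.
random.seed(0)
DIM, NUM_FEATURES, NUM_KEYPOINTS = 8, 16, 17
features = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_FEATURES)]
queries = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_KEYPOINTS)]
head = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(2)]

coords = regress_keypoints(queries, features, head)
```

Each query selects its own mix of image features, which is one simple way to picture how attention could reduce the feature misalignment the abstract mentions. Because every step is a differentiable operation, a trained version of such a pipeline could be optimized end to end on the coordinate loss.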

Code Repositories

aim-uofa/poseur (official, PyTorch)

Benchmarks

Benchmark: keypoint-detection-on-coco
Methodology: Poseur (384x288)
Metrics: Test AP: 78.3; Validation AP: 79.6
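For context on the AP figures: COCO keypoint AP is built on object keypoint similarity (OKS), which scores a predicted pose against the ground truth, and AP is then averaged over a range of OKS thresholds. The sketch below computes OKS for a single person under stated assumptions. The function name `oks`, the toy coordinates, and the per-keypoint constants in `k` are illustrative and are not taken from this page or from the COCO constants table.

```python
import math


def oks(pred, gt, visible, area, k):
    """Object keypoint similarity for one person.

    pred, gt: lists of (x, y) pixel coordinates
    visible:  per-keypoint visibility flags (counted only when > 0)
    area:     object area in pixels (the squared object scale)
    k:        per-keypoint falloff constants (illustrative values here)
    """
    total, count = 0.0, 0
    for (px, py), (gx, gy), v, ki in zip(pred, gt, visible, k):
        if v <= 0:
            continue  # unlabeled keypoints do not contribute
        d2 = (px - gx) ** 2 + (py - gy) ** 2
        total += math.exp(-d2 / (2.0 * area * ki ** 2))
        count += 1
    return total / count if count else 0.0


# Toy example with three keypoints.
gt = [(100.0, 50.0), (120.0, 80.0), (90.0, 150.0)]
pred = [(102.0, 51.0), (118.0, 83.0), (95.0, 149.0)]
visible = [2, 2, 1]
k = [0.05, 0.07, 0.10]  # hypothetical constants
area = 80.0 * 160.0

perfect = oks(gt, gt, visible, area, k)
shifted = oks(pred, gt, visible, area, k)
```

A prediction identical to the ground truth scores 1, and the score decays smoothly as keypoints drift, with the tolerance scaled by object size and by how precisely each keypoint type can be localized.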
