DirectPose: Direct End-to-End Multi-Person Pose Estimation

Zhi Tian Hao Chen Chunhua Shen

Abstract

We propose the first direct end-to-end multi-person pose estimation framework, termed DirectPose. Inspired by recent anchor-free object detectors, which directly regress the two corners of target bounding boxes, the proposed framework directly predicts instance-aware keypoints for all instances from a raw input image. This eliminates the need for heuristic grouping in bottom-up methods, and for bounding-box detection and RoI operations in top-down ones. We also propose a novel Keypoint Alignment (KPAlign) mechanism, which overcomes the main difficulty in this end-to-end framework: the lack of alignment between the convolutional features and the predictions. KPAlign improves the framework's performance by a large margin while keeping it end-to-end trainable. With non-maximum suppression (NMS) as the only post-processing step, the proposed framework can detect multi-person keypoints with or without bounding boxes in a single shot. Experiments demonstrate that the end-to-end paradigm achieves competitive or better performance than previous strong baselines among both bottom-up and top-down methods. We hope that our end-to-end approach can provide a new perspective on the human pose estimation task.
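The abstract names NMS as the only post-processing step but does not say how it is applied. The sketch below is one plausible reading, not the authors' implementation: it assumes each predicted instance is a list of (x, y) keypoints with a confidence score, derives a box from the keypoint extents when no box is predicted, and runs standard greedy IoU-based NMS. The function names and the 0.5 threshold are hypothetical choices made for illustration.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)
Keypoints = Sequence[Tuple[float, float]]  # one (x, y) pair per joint


def keypoints_to_box(keypoints: Keypoints) -> Box:
    """Tightest axis-aligned box around an instance's keypoints (assumption)."""
    xs = [x for x, _ in keypoints]
    ys = [y for _, y in keypoints]
    return (min(xs), min(ys), max(xs), max(ys))


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms_keypoint_instances(
    instances: Sequence[Keypoints],
    scores: Sequence[float],
    iou_threshold: float = 0.5,  # hypothetical default, not from the paper
) -> List[int]:
    """Greedy NMS: return indices of kept instances, highest score first."""
    boxes = [keypoints_to_box(kps) for kps in instances]
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep an instance only if it does not overlap a kept one too much.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    # Two near-duplicate detections of one person and one separate person.
    people = [
        [(0.0, 0.0), (10.0, 10.0)],
        [(1.0, 1.0), (11.0, 11.0)],
        [(50.0, 50.0), (60.0, 60.0)],
    ]
    print(nms_keypoint_instances(people, [0.9, 0.8, 0.7]))
```

Because every instance already carries its own keypoints, suppression only has to remove duplicate instances; no grouping of joints across people is needed afterwards.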

Code Repositories

aim-uofa/adet (pytorch, mentioned in GitHub)
zhubinQAQ/Ins (pytorch, mentioned in GitHub)
blueardour/AdelaiDet (pytorch, mentioned in GitHub)
Pxtri2156/AdelaiDet_v2 (pytorch, mentioned in GitHub)
aim-uofa/AdelaiDet (pytorch, mentioned in GitHub)
quangvy2703/ABCNet-ESRGAN-SRTEXT (pytorch, mentioned in GitHub)
idea-research/x-pose (pytorch, mentioned in GitHub)
IDEA-Research/UniPose (pytorch, mentioned in GitHub)

Benchmarks

Benchmark                            Methodology              AP    AP50  AP75  APL   APM
keypoint-detection-on-coco-test-dev  DirectPose (ResNet-101)  64.8  87.8  71.1  71.5  60.4
pose-estimation-on-coco-test-dev     DirectPose (ResNet-101)  63.3  86.7  69.4  71.2  57.8
