HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Cascaded Pyramid Network for Multi-Person Pose Estimation

Yilun Chen; Zhicheng Wang; Yuxiang Peng; Zhiqiang Zhang; Gang Yu; Jian Sun

Cascaded Pyramid Network for Multi-Person Pose Estimation

Abstract

The topic of multi-person pose estimation has been largely improved recently, especially with the development of convolutional neural network. However, there still exist a lot of challenging cases, such as occluded keypoints, invisible keypoints and complex background, which cannot be well addressed. In this paper, we present a novel network structure called Cascaded Pyramid Network (CPN) which targets to relieve the problem from these "hard" keypoints. More specifically, our algorithm includes two stages: GlobalNet and RefineNet. GlobalNet is a feature pyramid network which can successfully localize the "simple" keypoints like eyes and hands but may fail to precisely recognize the occluded or invisible keypoints. Our RefineNet tries explicitly handling the "hard" keypoints by integrating all levels of feature representations from the GlobalNet together with an online hard keypoint mining loss. In general, to address the multi-person pose estimation problem, a top-down pipeline is adopted to first generate a set of human bounding boxes based on a detector, followed by our CPN for keypoint localization in each human bounding box. Based on the proposed algorithm, we achieve state-of-art results on the COCO keypoint benchmark, with average precision at 73.0 on the COCO test-dev dataset and 72.1 on the COCO test-challenge dataset, which is a 19% relative improvement compared with 60.5 from the COCO 2016 keypoint challenge.Code (https://github.com/chenyilun95/tf-cpn.git) and the detection results are publicly available for further research.

Code Repositories

chenyilun95/tf-cpn
Official
tf
Mentioned in GitHub
tuvovan/CPN_KR
tf
Mentioned in GitHub
Cli98/pytorch-cpn
pytorch
Mentioned in GitHub
caiyuanhao1998/RSN
pytorch
Mentioned in GitHub
megvii-detection/MSPN
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
keypoint-detection-on-cocoCPN+
Test AP: 73.0
keypoint-detection-on-coco-test-challengeCPN+
AP: 72.1
AP50: 90.5
AP75: 78.9
APL: 84.7
AR: 78.7
AR50: 94.7
AR75: 84.8
ARL: 78.1
ARM: 74.3
keypoint-detection-on-coco-test-devCPN
AP50: 91.4
AP75: 80.0
APL: 77.2
APM: 68.7
AR: 78.5
AR50: 95.1
AR75: 85.3
ARL: 84.3
ARM: 74.2
keypoint-detection-on-coco-test-devCPN+
AP50: 91.7
AP75: 80.9
APL: 78.1
APM: 69.5
AR: 79.0
AR50: 95.1
AR75: 85.9
ARL: 84.6
ARM: 74.8
multi-person-pose-estimation-on-cocoCPN+
AP: 0.730
pose-estimation-on-coco-test-devCPN
AP: 72.1
AP50: 91.4
AP75: 80.0
APL: 77.2
AR: 78.5
pose-estimation-on-coco-test-devCPN+ [6, 9]
AP: 73.0
AP50: 91.7
AP75: 80.9
APL: 78.1
AR: 79.0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Cascaded Pyramid Network for Multi-Person Pose Estimation | Papers | HyperAI