Command Palette
Search for a command to run...
Zhang Song-Hai ; Li Ruilong ; Dong Xin ; Rosin Paul L. ; Cai Zixi ; Han Xi ; Yang Dingcheng ; Huang Hao-Zhi ; Hu Shi-Min

Abstract
The standard approach to image instance segmentation is to perform the objectdetection first, and then segment the object from the detection bounding-box.More recently, deep learning methods like Mask R-CNN perform them jointly.However, little research takes into account the uniqueness of the "human"category, which can be well defined by the pose skeleton. Moreover, the humanpose skeleton can be used to better distinguish instances with heavy occlusionthan using bounding-boxes. In this paper, we present a brand new pose-basedinstance segmentation framework for humans which separates instances based onhuman pose, rather than proposal region detection. We demonstrate that ourpose-based framework can achieve better accuracy than the state-of-artdetection-based approach on the human instance segmentation problem, and canmoreover better handle occlusion. Furthermore, there are few public datasetscontaining many heavily occluded humans along with comprehensive annotations,which makes this a challenging problem seldom noticed by researchers.Therefore, in this paper we introduce a new benchmark "Occluded Human(OCHuman)", which focuses on occluded humans with comprehensive annotationsincluding bounding-box, human pose and instance masks. This dataset contains8110 detailed annotated human instances within 4731 images. With an average0.67 MaxIoU for each person, OCHuman is the most complex and challengingdataset related to human instance segmentation. Through this dataset, we wantto emphasize occlusion as a challenging problem for researchers to study.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 2d-human-pose-estimation-on-ochuman | Pose2Seg | Test AP: 23.8 |
| human-instance-segmentation-on-ochuman | Pose2Seg | AP: 23.8 |
| keypoint-detection-on-ochuman | Pose2Seg | Test AP: 23.8 |
| pose-based-human-instance-segmentation-on | Pose2Seg (plus ground-truth keypoints) | AP: 55.2 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.