Tianheng Cheng; Xinggang Wang; Shaoyu Chen; Wenqiang Zhang; Qian Zhang; Chang Huang; Zhaoxiang Zhang; Wenyu Liu

Abstract
In this paper, we propose a conceptually novel, efficient, and fully convolutional framework for real-time instance segmentation. Most previous instance segmentation methods rely heavily on object detection and perform mask prediction based on bounding boxes or dense centers. In contrast, we propose a sparse set of instance activation maps as a new object representation that highlights informative regions for each foreground object. Instance-level features are then obtained by aggregating features according to the highlighted regions for recognition and segmentation. Moreover, based on bipartite matching, the instance activation maps can predict objects in a one-to-one style, thus avoiding non-maximum suppression (NMS) in post-processing. Owing to the simple yet effective designs with instance activation maps, SparseInst has extremely fast inference speed and achieves 40 FPS and 37.9 AP on the COCO benchmark, which significantly outperforms its counterparts in terms of speed and accuracy. Code and models are available at https://github.com/hustvl/SparseInst.
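The two ideas in the abstract, aggregating features under instance activation maps and assigning predictions to objects one-to-one, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, tensor shapes, sigmoid activation, per-map normalization, and the cost matrix are all assumptions chosen for illustration. The matching step uses a brute-force search that is only practical for a handful of objects; an efficient solver such as the Hungarian algorithm would normally be used instead.

```python
import itertools
import numpy as np


def aggregate_instance_features(features, iam_logits, eps=1e-6):
    """Pool one feature vector per instance activation map (hypothetical helper).

    features:   (C, H, W) image feature map
    iam_logits: (N, H, W) raw scores for N instance activation maps
    returns:    (N, C) instance-level features
    """
    c, h, w = features.shape
    n = iam_logits.shape[0]
    # Assumption: a sigmoid turns raw scores into activations in (0, 1).
    iam = 1.0 / (1.0 + np.exp(-iam_logits))
    iam = iam.reshape(n, h * w)
    # Normalize each map so its weights sum to (almost exactly) 1.
    iam = iam / (iam.sum(axis=1, keepdims=True) + eps)
    # Weighted sum of pixel features under each map.
    return iam @ features.reshape(c, h * w).T


def match_one_to_one(cost):
    """Minimum-cost one-to-one assignment by exhaustive search (hypothetical helper).

    cost: (num_gt, num_pred) matrix with num_gt <= num_pred
    returns: list where entry g is the prediction index assigned to object g
    """
    num_gt, num_pred = cost.shape
    best, best_cost = None, np.inf
    for perm in itertools.permutations(range(num_pred), num_gt):
        total = sum(cost[g, p] for g, p in enumerate(perm))
        if total < best_cost:
            best, best_cost = perm, total
    return list(best)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.normal(size=(8, 4, 4))    # C=8 channels on a 4x4 grid
    logits = rng.normal(size=(3, 4, 4))   # N=3 activation maps
    print(aggregate_instance_features(feats, logits).shape)  # (3, 8)

    # Illustrative costs: 2 ground-truth objects, 3 predictions.
    cost = np.array([[0.9, 0.1, 0.5],
                     [0.2, 0.8, 0.3]])
    print(match_one_to_one(cost))  # [1, 0]
```

Because each object is assigned to exactly one prediction, duplicate predictions for the same object are not produced, which is why no NMS step is needed afterwards.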
Benchmarks
| Benchmark | Model | mask AP | AP50 | AP75 | APS | APM | APL | FPS (2080 Ti) |
|---|---|---|---|---|---|---|---|---|
| real-time-instance-segmentation-on-mscoco | SparseInst-608 (ResNet-50-vd) | 37.9 | 59.2 | 40.2 | 15.7 | 39.4 | 56.9 | 40 |
| real-time-instance-segmentation-on-mscoco | SparseInst-448 (ResNet-50-vd) | 35.9 | 56.5 | 37.7 | 12.3 | 37.1 | 57.0 | 58.5 |