Command Palette
Search for a command to run...

Abstract
We present Sparse R-CNN, a purely sparse method for object detection in images. Existing works on object detection heavily rely on dense object candidates, such as $k$ anchor boxes pre-defined on all grids of image feature map of size $H\times W$. In our method, however, a fixed sparse set of learned object proposals, total length of $N$, are provided to object recognition head to perform classification and location. By eliminating $HWk$ (up to hundreds of thousands) hand-designed object candidates to $N$ (e.g. 100) learnable proposals, Sparse R-CNN completely avoids all efforts related to object candidates design and many-to-one label assignment. More importantly, final predictions are directly output without non-maximum suppression post-procedure. Sparse R-CNN demonstrates accuracy, run-time and training convergence performance on par with the well-established detector baselines on the challenging COCO dataset, e.g., achieving 45.0 AP in standard $3\times$ training schedule and running at 22 fps using ResNet-50 FPN model. We hope our work could inspire re-thinking the convention of dense prior in object detectors. The code is available at: https://github.com/PeizeSun/SparseR-CNN.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 2d-object-detection-on-ceymo | Sparse R-CNN | mAP: 47.3 |
| 2d-object-detection-on-sardet-100k | Sparse R-CNN | box mAP: 38.1 |
| object-detection-on-coco-minival | Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN) | AP50: 64.6 AP75: 49.5 APL: 61.6 APM: 48.3 APS: 28.3 box AP: 45.6 |
| object-detection-on-coco-minival | Sparse R-CNN (ResNet-101, FPN) | AP50: 62.1 AP75: 47.2 APL: 59.7 APM: 46.3 APS: 26.1 box AP: 43.5 |
| object-detection-on-coco-minival | Sparse R-CNN (ResNet-50, FPN) | AP50: 61.2 AP75: 45.7 APL: 57.6 APM: 44.6 APS: 26.7 box AP: 42.3 |
| object-detection-on-coco-minival | Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN) | AP50: 63.4 AP75: 48.2 APL: 59.5 APM: 47.2 APS: 26.9 box AP: 44.5 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.