Command Palette
Search for a command to run...
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren; Kaiming He; Ross Girshick; Jian Sun

Abstract
State-of-the-art object detection networks depend on region proposal algorithms to hypothesize object locations. Advances like SPPnet and Fast R-CNN have reduced the running time of these detection networks, exposing region proposal computation as a bottleneck. In this work, we introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals. An RPN is a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position. The RPN is trained end-to-end to generate high-quality region proposals, which are used by Fast R-CNN for detection. We further merge RPN and Fast R-CNN into a single network by sharing their convolutional features---using the recently popular terminology of neural networks with 'attention' mechanisms, the RPN component tells the unified network where to look. For the very deep VGG-16 model, our detection system has a frame rate of 5fps (including all steps) on a GPU, while achieving state-of-the-art object detection accuracy on PASCAL VOC 2007, 2012, and MS COCO datasets with only 300 proposals per image. In ILSVRC and COCO 2015 competitions, Faster R-CNN and RPN are the foundations of the 1st-place winning entries in several tracks. Code has been made publicly available.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 2d-object-detection-on-sardet-100k | F-RCNN | box mAP: 49.0 |
| object-counting-on-carpk | Faster R-CNN (2015) | MAE: 39.88 RMSE: 47.67 |
| object-detection-on-coco-o | Faster R-CNN (ResNet-50-FPN) | Average mAP: 16.4 Effective Robustness: -0.41 |
| object-detection-on-pascal-voc-2007-15-5 | Faster R-CNN | MAP: 73.2% |
| object-detection-on-pku-ddd17-car-1 | Faster-RCNN | mAP50: 80.2 |
| object-detection-on-ua-detrac | Faster R-CNN | mAP: 58.45 |
| real-time-object-detection-on-pascal-voc-2007-1 | Faster R-CNN | FPS: 7.0 MAP: 73.2 |
| robust-object-detection-on-cityscapes-1 | Baseline | mPC [AP]: 15.4 |
| vessel-detection-on-vessel-detection-dateset | Faster RCNN | AP: 64.3% |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.