HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Cascade R-CNN: Delving into High Quality Object Detection

Zhaowei Cai; Nuno Vasconcelos

Cascade R-CNN: Delving into High Quality Object Detection

Abstract

In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An object detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection performance tends to degrade with increasing the IoU thresholds. Two main factors are responsible for this: 1) overfitting during training, due to exponentially vanishing positive samples, and 2) inference-time mismatch between the IoUs for which the detector is optimal and those of the input hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, is proposed to address these problems. It consists of a sequence of detectors trained with increasing IoU thresholds, to be sequentially more selective against close false positives. The detectors are trained stage by stage, leveraging the observation that the output of a detector is a good distribution for training the next higher quality detector. The resampling of progressively improved hypotheses guarantees that all detectors have a positive set of examples of equivalent size, reducing the overfitting problem. The same cascade procedure is applied at inference, enabling a closer match between the hypotheses and the detector quality of each stage. A simple implementation of the Cascade R-CNN is shown to surpass all single-model object detectors on the challenging COCO dataset. Experiments also show that the Cascade R-CNN is widely applicable across detector architectures, achieving consistent gains independently of the baseline detector strength. The code will be made available at https://github.com/zhaoweicai/cascade-rcnn.

Benchmarks

BenchmarkMethodologyMetrics
2d-object-detection-on-sardet-100kCascade R-CNN
box mAP: 51.1
object-detection-on-ai-todCascade R-CNN (ResNet-50-FPN)
AP: 13.8
AP50: 30.8
AP75: 10.5
APm: 26.6
APs: 25.5
APt: 10.6
APvt: 0.0
object-detection-on-cocoCascade R-CNN (ResNet-101-FPN+)
AP50: 61.1
AP75: 41.9
APL: 49.8
APM: 41.8
APS: 21.3
Hardware Burden: 3G
Operations per network pass:
box mAP: 38.8
object-detection-on-cocoCascade R-CNN (ResNet-50-FPN+)
AP50: 59
AP75: 39.2
APL: 46.4
APM: 38.8
APS: 20.3
Hardware Burden: 3G
Operations per network pass:
box mAP: 36.5
object-detection-on-cocoCascade R-CNN (ResNet-50-FPN+, cascade)
AP50: 59.9
AP75: 44
APL: 52.1
APM: 42.7
APS: 22.6
Hardware Burden: 12G
Operations per network pass:
box mAP: 40.6
object-detection-on-cocoCascade R-CNN (ResNet-101-FPN+, cascade)
AP50: 62.1
AP75: 46.3
APL: 55.2
APM: 45.5
APS: 23.7
Hardware Burden:
Operations per network pass:
box mAP: 42.8
object-detection-on-coco-minivalCascade R-CNN (ResNet-101-FPN+, cascade)
AP50: 61.6
AP75: 46.6
APL: 57.4
APM: 46.2
APS: 23.8
box AP: 42.7
object-detection-on-coco-minivalCascade R-CNN (ResNet-50-FPN+)
AP50: 59.4
AP75: 43.7
APL: 54.1
APM: 43.7
APS: 22.9
box AP: 40.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Cascade R-CNN: Delving into High Quality Object Detection | Papers | HyperAI