HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation

Kemal Oksuz; Baris Can Cam; Fehmi Kahraman; Zeynep Sonat Baltaci; Sinan Kalkan; Emre Akbas

Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation

Abstract

This paper presents Mask-aware Intersection-over-Union (maIoU) for assigning anchor boxes as positives and negatives during training of instance segmentation methods. Unlike conventional IoU or its variants, which only considers the proximity of two boxes; maIoU consistently measures the proximity of an anchor box with not only a ground truth box but also its associated ground truth mask. Thus, additionally considering the mask, which, in fact, represents the shape of the object, maIoU enables a more accurate supervision during training. We present the effectiveness of maIoU on a state-of-the-art (SOTA) assigner, ATSS, by replacing IoU operation by our maIoU and training YOLACT, a SOTA real-time instance segmentation method. Using ATSS with maIoU consistently outperforms (i) ATSS with IoU by $\sim 1$ mask AP, (ii) baseline YOLACT with fixed IoU threshold assigner by $\sim 2$ mask AP over different image sizes and (iii) decreases the inference time by $25 \%$ owing to using less anchors. Then, exploiting this efficiency, we devise maYOLACT, a faster and $+6$ AP more accurate detector than YOLACT. Our best model achieves $37.7$ mask AP at $25$ fps on COCO test-dev establishing a new state-of-the-art for real-time instance segmentation. Code is available at https://github.com/kemaloksuz/Mask-aware-IoU

Code Repositories

kemaloksuz/mask-aware-iou
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
real-time-instance-segmentation-on-mscocomaYOLACT-700 (ResNet-50)
AP50: 59.4
AP75: 39.9
APL: 52.5
APM: 40.8
APS: 18.1
Frame (fps): 25 (Tesla V100)
mask AP: 37.7
real-time-instance-segmentation-on-mscocomaYOLACT-550 (ResNet-50)
AP50: 56.2
AP75: 37.1
APL: 51.4
APM: 38.0
APS: 14.7
Frame (fps): 30 (Tesla V100)
mask AP: 35.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation | Papers | HyperAI