HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

A Tri-Layer Plugin to Improve Occluded Detection

Guanqi Zhan Weidi Xie Andrew Zisserman

A Tri-Layer Plugin to Improve Occluded Detection

Abstract

Detecting occluded objects still remains a challenge for state-of-the-art object detectors. The objective of this work is to improve the detection for such objects, and thereby improve the overall performance of a modern object detector. To this end we make the following four contributions: (1) We propose a simple 'plugin' module for the detection head of two-stage object detectors to improve the recall of partially occluded objects. The module predicts a tri-layer of segmentation masks for the target object, the occluder and the occludee, and by doing so is able to better predict the mask of the target object. (2) We propose a scalable pipeline for generating training data for the module by using amodal completion of existing object detection and instance segmentation training datasets to establish occlusion relationships. (3) We also establish a COCO evaluation dataset to measure the recall performance of partially occluded and separated objects. (4) We show that the plugin module inserted into a two-stage detector can boost the performance significantly, by only fine-tuning the detection head, and with additional improvements if the entire architecture is fine-tuned. COCO results are reported for Mask R-CNN with Swin-T or Swin-S backbones, and Cascade Mask R-CNN with a Swin-B backbone.

Code Repositories

Championchess/Tri-Layer_Plugin_Occluded_Detection
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
instance-segmentation-on-cocoSwin-B + Cascade Mask R-CNN (tri-layer modelling)
mask AP: 45.9
instance-segmentation-on-occluded-cocoSwin-T + Mask R-CNN (tri-layer plugin)
Mean Recall: 62.00
instance-segmentation-on-occluded-cocoSwin-S + Mask R-CNN (tri-layer plugin)
Mean Recall: 62.58
instance-segmentation-on-occluded-cocoSwin-B + Cascade Mask R-CNN (tri-layer modelling)
Mean Recall: 63.64
instance-segmentation-on-separated-cocoSwin-B + Cascade Mask R-CNN (tri-layer modelling)
Mean Recall: 36.88
instance-segmentation-on-separated-cocoSwin-S + Mask R-CNN (tri-layer plugin)
Mean Recall: 35.80
instance-segmentation-on-separated-cocoSwin-T + Mask R-CNN (tri-layer plugin)
Mean Recall: 34.72

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
A Tri-Layer Plugin to Improve Occluded Detection | Papers | HyperAI