Command Palette
Search for a command to run...
Semi-Supervised Object Detection with Adaptive Class-Rebalancing Self-Training
Fangyuan Zhang Tianxiang Pan Bin Wang

Abstract
This study delves into semi-supervised object detection (SSOD) to improve detector performance with additional unlabeled data. State-of-the-art SSOD performance has been achieved recently by self-training, in which training supervision consists of ground truths and pseudo-labels. In current studies, we observe that class imbalance in SSOD severely impedes the effectiveness of self-training. To address the class imbalance, we propose adaptive class-rebalancing self-training (ACRST) with a novel memory module called CropBank. ACRST adaptively rebalances the training data with foreground instances extracted from the CropBank, thereby alleviating the class imbalance. Owing to the high complexity of detection tasks, we observe that both self-training and data-rebalancing suffer from noisy pseudo-labels in SSOD. Therefore, we propose a novel two-stage filtering algorithm to generate accurate pseudo-labels. Our method achieves satisfactory improvements on MS-COCO and VOC benchmarks. When using only 1\% labeled data in MS-COCO, our method achieves 17.02 mAP improvement over supervised baselines, and 5.32 mAP improvement compared with state-of-the-art methods.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| semi-supervised-object-detection-on-coco-0-5 | Adaptive Rebalancing | mAP: 19.62±0.37 |
| semi-supervised-object-detection-on-coco-1 | Adaptive Class-Rebalancing | mAP: 26.07±0.46 |
| semi-supervised-object-detection-on-coco-10 | Adaptive Class-Rebalancing | mAP: 34.92±0.22 |
| semi-supervised-object-detection-on-coco-100 | Adaptive Class-Rebalancing | mAP: 42.79 |
| semi-supervised-object-detection-on-coco-2 | Adaptive Class-Rebalancing | mAP: 28.69±0.17 |
| semi-supervised-object-detection-on-coco-5 | Adaptive Class-Rebalancing | mAP: 31.35±0.13 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.