Command Palette
Search for a command to run...
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
Hao Zhang Feng Li Shilong Liu Lei Zhang Hang Su Jun Zhu Lionel M. Ni Heung-Yeung Shum

Abstract
We present DINO (\textbf{D}ETR with \textbf{I}mproved de\textbf{N}oising anch\textbf{O}r boxes), a state-of-the-art end-to-end object detector. % in this paper. DINO improves over previous DETR-like models in performance and efficiency by using a contrastive way for denoising training, a mixed query selection method for anchor initialization, and a look forward twice scheme for box prediction. DINO achieves $49.4$AP in $12$ epochs and $51.3$AP in $24$ epochs on COCO with a ResNet-50 backbone and multi-scale features, yielding a significant improvement of $\textbf{+6.0}$\textbf{AP} and $\textbf{+2.7}$\textbf{AP}, respectively, compared to DN-DETR, the previous best DETR-like model. DINO scales well in both model size and data size. Without bells and whistles, after pre-training on the Objects365 dataset with a SwinL backbone, DINO obtains the best results on both COCO \texttt{val2017} ($\textbf{63.2}$\textbf{AP}) and \texttt{test-dev} (\textbf{$\textbf{63.3}$AP}). Compared to other models on the leaderboard, DINO significantly reduces its model size and pre-training data size while achieving better results. Our code will be available at \url{https://github.com/IDEACVR/DINO}.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| object-detection-on-coco | DINO (Swin-L,multi-scale, TTA) | box mAP: 63.3 |
| object-detection-on-coco-minival | DINO-5scale (24 epoch) | AP50: 69.1 AP75: 56 APL: 65.8 APM: 54.2 APS: 34.5 box AP: 51.3 |
| object-detection-on-coco-minival | DINO-5scale (36 epoch) | AP50: 69 AP75: 55.8 APL: 65.3 APM: 54.3 APS: 35 box AP: 51.2 |
| object-detection-on-coco-minival | DINO (Swin-L) | box AP: 63.2 |
| object-detection-on-coco-o | DINO (Swin-L) | Average mAP: 42.1 Effective Robustness: 15.76 |
| object-detection-on-sa-det-100k | DINO (ResNet50 1x VFL) | AP: 43.7 AP50: 52.0 AP75: 47.7 APL: 61.5 APM: 43.0 APS: 5.8 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.