Command Palette
Search for a command to run...
Tianhe Ren Jianwei Yang Shilong Liu Ailing Zeng Feng Li Hao Zhang Hongyang Li Zhaoyang Zeng Lei Zhang

Abstract
This work presents Focal-Stable-DINO, a strong and reproducible object detection model which achieves 64.6 AP on COCO val2017 and 64.8 AP on COCO test-dev using only 700M parameters without any test time augmentation. It explores the combination of the powerful FocalNet-Huge backbone with the effective Stable-DINO detector. Different from existing SOTA models that utilize an extensive number of parameters and complex training techniques on large-scale private data or merged data, our model is exclusively trained on the publicly available dataset Objects365, which ensures the reproducibility of our approach.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| object-detection-on-coco | Focal-Stable-DINO (Focal-Huge, no TTA) | AP50: 81.7 AP75: 71.5 APL: 78 APM: 67.6 APS: 48.6 Params (M): 689 box mAP: 64.8 |
| object-detection-on-coco-minival | Focal-Stable-DINO (Focal-Huge, no TTA) | AP50: 81.5 AP75: 71.4 APL: 78.5 APM: 68.5 APS: 50.4 Params (M): 689 box AP: 64.6 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.