HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

A Strong and Reproducible Object Detector with Only Public Datasets

Tianhe Ren Jianwei Yang Shilong Liu Ailing Zeng Feng Li Hao Zhang Hongyang Li Zhaoyang Zeng Lei Zhang

A Strong and Reproducible Object Detector with Only Public Datasets

Abstract

This work presents Focal-Stable-DINO, a strong and reproducible object detection model which achieves 64.6 AP on COCO val2017 and 64.8 AP on COCO test-dev using only 700M parameters without any test time augmentation. It explores the combination of the powerful FocalNet-Huge backbone with the effective Stable-DINO detector. Different from existing SOTA models that utilize an extensive number of parameters and complex training techniques on large-scale private data or merged data, our model is exclusively trained on the publicly available dataset Objects365, which ensures the reproducibility of our approach.

Code Repositories

idea-research/stabledino
pytorch
Mentioned in GitHub
microsoft/FocalNet
Official
pytorch
Mentioned in GitHub
idea-research/stable-dino
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
object-detection-on-cocoFocal-Stable-DINO (Focal-Huge, no TTA)
AP50: 81.7
AP75: 71.5
APL: 78
APM: 67.6
APS: 48.6
Params (M): 689
box mAP: 64.8
object-detection-on-coco-minivalFocal-Stable-DINO (Focal-Huge, no TTA)
AP50: 81.5
AP75: 71.4
APL: 78.5
APM: 68.5
APS: 50.4
Params (M): 689
box AP: 64.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
A Strong and Reproducible Object Detector with Only Public Datasets | Papers | HyperAI