HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism

Nan Zhixiong Li Xianghong Dai Jifeng Xiang Tao

MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism

Abstract

Based on analyzing the character of cascaded decoder architecture commonlyadopted in existing DETR-like models, this paper proposes a new decoderarchitecture. The cascaded decoder architecture constrains object queries toupdate in the cascaded direction, only enabling object queries to learnrelatively-limited information from image features. However, the challenges forobject detection in natural scenes (e.g., extremely-small, heavily-occluded,and confusingly mixed with the background) require an object detection model tofully utilize image features, which motivates us to propose a new decoderarchitecture with the parallel Multi-time Inquiries (MI) mechanism. MI enablesobject queries to learn more comprehensive information, and our MI based model,MI-DETR, outperforms all existing DETR-like models on COCO benchmark underdifferent backbones and training epochs, achieving +2.3 AP and +0.6 APimprovements compared to the most representative model DINO and SOTA modelRelation-DETR under ResNet-50 backbone. In addition, a series of diagnostic andvisualization experiments demonstrate the effectiveness, rationality, andinterpretability of MI.

Code Repositories

CQU-ADHRI-Lab/MI-DETR
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
object-detection-on-coco-2017-valMI-DETR (Swin-L 1x)
AP: 58.2
AP50: 76.5
AP75: 63.4
APL: 74.6
APM: 62.8
APS: 42.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism | Papers | HyperAI