Zero-Shot Object Detection on LVIS v1.0
Evaluation Metric
AP
Evaluation Results
Performance of each model on this benchmark
| Model | AP | Paper Title | Repository |
|---|---|---|---|
| CP-DETR-Pro (without LVIS data) | 58.2 | CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection | - |
| Grounding DINO 1.6 Pro (without LVIS data) | 57.7 | Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection | |
| Grounding DINO 1.5 Pro (without LVIS data) | 55.7 | Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection | |
| OWLv2 (OWL-ST+FT) | 51.3 | Scaling Open-Vocabulary Object Detection | |
| MQ-GLIP-L | 43.4 | Multi-modal Queried Object Detection in the Wild | |
| OV-DINO-T (without LVIS data, swin tiny) | 40.1 | OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion | |
| GLIP-L | 37.3 | Grounded Language-Image Pre-training | |
| YOLO-World-L | 35.4 | YOLO-World: Real-Time Open-Vocabulary Object Detection | |
| GroundingDINO-L | 33.9 | Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | |
| MQ-GLIP-T | 30.4 | Multi-modal Queried Object Detection in the Wild | |
| MQ-GroundingDINO-T | 30.2 | Multi-modal Queried Object Detection in the Wild | |