HyperAI
Zero-Shot Composed Image Retrieval (ZS-CIR)
Zero Shot Composed Image Retrieval Zs Cir On
Evaluation metric
mAP@10
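As a rough illustration of the metric, the sketch below computes mAP@10 from ranked retrieval lists. It assumes one common definition: for each query, precision is accumulated at every rank (up to k) that holds a relevant image, and the sum is divided by min(k, number of relevant images). The normalization, the function names, and the scaling to a percentage are assumptions made for illustration; the benchmark's own evaluation script may differ.

```python
from typing import Hashable, Sequence, Set


def average_precision_at_k(
    ranked: Sequence[Hashable], relevant: Set[Hashable], k: int = 10
) -> float:
    """AP@k for a single query.

    `ranked` is the retrieved list, best match first, assumed free of duplicates.
    `relevant` is the set of ground-truth items for the query.
    Normalizing by min(k, len(relevant)) is an assumption, not taken from the page.
    """
    if not relevant or k <= 0:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked[:k], start=1):
        if item in relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    return precision_sum / min(k, len(relevant))


def mean_average_precision_at_k(
    rankings: Sequence[Sequence[Hashable]],
    ground_truths: Sequence[Set[Hashable]],
    k: int = 10,
) -> float:
    """Mean of AP@k over all queries, scaled to a percentage."""
    if len(rankings) != len(ground_truths):
        raise ValueError("rankings and ground_truths must have the same length")
    if not rankings:
        return 0.0
    scores = [
        average_precision_at_k(ranked, relevant, k)
        for ranked, relevant in zip(rankings, ground_truths)
    ]
    return 100.0 * sum(scores) / len(scores)


if __name__ == "__main__":
    # Two hypothetical queries with made-up image ids.
    rankings = [["a", "x", "b"], ["x", "y"]]
    ground_truths = [{"a", "b"}, {"z"}]
    print(mean_average_precision_at_k(rankings, ground_truths, k=10))
```

Under this definition a query only reaches an AP of 1 when its relevant images (or k of them, if there are more than k) fill the top ranks without gaps.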
Results
Performance of each model on this benchmark
| Model | mAP@10 | Paper Title | Repository |
| --- | --- | --- | --- |
| MMRet-MLLM | 43.4 | MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval | |
| MMRet-Large (CLIP L/14) | 40.2 | MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval | |
| SEIZE (CLIP G/14 & GPT-4o) | 37.23 | Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval | - |
| MagicLens (CoCa L) | 35.4 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | |
| MMRet-Base (CLIP B/16) | 35.0 | MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval | |
| IP-CIR + LDRE (CLIP G/14) | 34.26 | Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy | - |
| SEIZE (CLIP G/14) | 33.77 | Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval | - |
| LDRE (CLIP G/14) | 32.24 | LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval | - |
| MagicLens (CoCa B) | 32.0 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | |
| OSrCIR (CLIP G/14) | 31.14 | Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval | |
| MagicLens (CLIP L) | 30.8 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | |
| CoVR-BLIP-2 | 29.55 | CoVR-2: Automatic Data Construction for Composed Video Retrieval | |
| ImageScope (CLIP-ViT-L/14) | 28.36 | ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning | |
| CIReVL (CLIP G/14) | 27.59 | Vision-by-Language for Training-Free Compositional Image Retrieval | |
| IP-CIR + LDRE (CLIP L/14) | 27.41 | Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy | - |
| SEIZE (CLIP L/14) | 25.82 | Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval | - |
| OSrCIR (CLIP L/14) | 25.33 | Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval | |
| LDRE (CLIP L/14) | 24.03 | LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval | - |
| MagicLens (CLIP B) | 23.8 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | |
| RTD + LinCIR (CLIP G/14) | 22.29 | An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval | |