HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

NAC: Mitigating Noisy Correspondence in Cross-Modal Matching Via Neighbor Auxiliary Corrector

{Shao-Lun Huang Jian Xu Haoming Huang Yuqing Li}

Abstract

The presence of noisy correspondence within cross-modal matching has significantly undermined the performance of existing matching methods. In this paper, we introduce a robust framework named Neighbor Auxiliary Corrector (NAC) for alleviating noise by utilizing the neighbors, which are indicative of similar textual targets. NAC is inspired by an observation that similar texts tend to correspond to similar images. Leveraging the zero-shot capabilities of Pre-trained Language Models (PLMs), we identify the top-k nearest neighbors for each positive image-text pair. Subsequently, the side information provided by these neighbors is harnessed for both sample verification and sample rectification. Extensive experiments on benchmark datasets demonstrate that our framework can significantly boost the performance and is more robust to various levels of noisy correspondence.

Benchmarks

BenchmarkMethodologyMetrics
cross-modal-retrieval-with-noisy-1NAC
Image-to-text R@1: 41.8
Image-to-text R@10: 77.3
Image-to-text R@5: 68.6
R-Sum: 373.5
Text-to-image R@1: 40.5
Text-to-image R@10: 77.0
Text-to-image R@5: 68.3
cross-modal-retrieval-with-noisy-2NAC
Image-to-text R@1: 79.3
Image-to-text R@10: 97.8
Image-to-text R@5: 94.6
R-Sum: 507.1
Text-to-image R@1: 60.8
Text-to-image R@10: 90.1
Text-to-image R@5: 84.5
cross-modal-retrieval-with-noisy-3NAC
Image-to-text R@1: 80.3
Image-to-text R@10: 98.5
Image-to-text R@5: 96.2
R-Sum: 524.5
Text-to-image R@1: 63.2
Text-to-image R@10: 96.0
Text-to-image R@5: 90.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
NAC: Mitigating Noisy Correspondence in Cross-Modal Matching Via Neighbor Auxiliary Corrector | Papers | HyperAI