HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Semantically Self-Aligned Network for Text-to-Image Part-aware Person Re-identification

Ding Zefeng ; Ding Changxing ; Shao Zhiyin ; Tao Dacheng

Semantically Self-Aligned Network for Text-to-Image Part-aware Person
  Re-identification

Abstract

Text-to-image person re-identification (ReID) aims to search for imagescontaining a person of interest using textual descriptions. However, due to thesignificant modality gap and the large intra-class variance in textualdescriptions, text-to-image ReID remains a challenging problem. Accordingly, inthis paper, we propose a Semantically Self-Aligned Network (SSAN) to handle theabove problems. First, we propose a novel method that automatically extractssemantically aligned part-level features from the two modalities. Second, wedesign a multi-view non-local network that captures the relationships betweenbody parts, thereby establishing better correspondences between body parts andnoun phrases. Third, we introduce a Compound Ranking (CR) loss that makes useof textual descriptions for other images of the same identity to provide extrasupervision, thereby effectively reducing the intra-class variance in textualfeatures. Finally, to expedite future research in text-to-image ReID, we builda new database named ICFG-PEDES. Extensive experiments demonstrate that SSANoutperforms state-of-the-art approaches by significant margins. Both the newICFG-PEDES database and the SSAN code are available athttps://github.com/zifyloo/SSAN.

Code Repositories

zifyloo/SSAN
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-retrieval-on-icfg-pedesSSAN
rank-1: 54.23
nlp-based-person-retrival-on-cuhk-pedesSSAN
R@1: 61.37
R@10: 86.73
R@5: 80.15
text-based-person-retrieval-on-icfg-pedesSSAN
R@1: 54.23
text-based-person-retrieval-with-noisySSAN
Rank 10: 77.42
Rank-1: 46.52
Rank-5: 68.36
mAP: 42.49
mINP: 28.13
text-based-person-retrieval-with-noisy-1SSAN
Rank 1: 40.57
Rank-10: 71.53
Rank-5: 62.58
mAP: 20.93
mINP: 2.22
text-based-person-retrieval-with-noisy-2SSAN
Rank 1: 35.10
Rank 10: 71.45
Rank 5: 60.00
mAP: 28.90
mINP: 12.08

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Semantically Self-Aligned Network for Text-to-Image Part-aware Person Re-identification | Papers | HyperAI