5 months ago

TransMatcher: Deep Image Matching Through Transformers for Generalizable Person Re-identification

Liao Shengcai ; Shao Ling


Abstract

Transformers have recently gained increasing attention in computer vision. However, existing studies mostly use Transformers for feature representation learning, e.g. for image classification and dense predictions, and the generalizability of Transformers is unknown. In this work, we further investigate the possibility of applying Transformers for image matching and metric learning given pairs of images. We find that the Vision Transformer (ViT) and the vanilla Transformer with decoders are not adequate for image matching due to their lack of image-to-image attention. Thus, we further design two naive solutions, i.e. query-gallery concatenation in ViT, and query-gallery cross-attention in the vanilla Transformer. The latter improves the performance, but it is still limited. This implies that the attention mechanism in Transformers is primarily designed for global feature aggregation, which is not naturally suitable for image matching. Accordingly, we propose a new simplified decoder, which drops the full attention implementation with the softmax weighting, keeping only the query-key similarity computation. Additionally, global max pooling and a multilayer perceptron (MLP) head are applied to decode the matching result. This way, the simplified decoder is computationally more efficient, while at the same time more effective for image matching. The proposed method, called TransMatcher, achieves state-of-the-art performance in generalizable person re-identification, with up to 6.1% and 5.7% performance gains in Rank-1 and mAP, respectively, on several popular datasets. Code is available at https://github.com/ShengcaiLiao/QAConv.
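The simplified decoder described in the abstract (query-key similarity, then global max pooling, then an MLP head) can be illustrated with a minimal pure-Python sketch. Everything here is an assumption made for illustration: the function names, the use of an unnormalized dot product as the similarity, the one-hidden-layer ReLU MLP, and the toy weights are all hypothetical. The official repositories are PyTorch-based and may differ in every detail.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def similarity_matrix(query_tokens, gallery_tokens):
    """Query-key similarity only: no softmax weighting and no value aggregation.

    Returns one row per query token and one column per gallery token.
    Assumption: plain dot product as the similarity measure.
    """
    return [[dot(q, g) for g in gallery_tokens] for q in query_tokens]


def global_max_pool(sim):
    """For each query token, keep the score of its best-matching gallery token."""
    return [max(row) for row in sim]


def mlp_head(x, w1, b1, w2, b2):
    """Hypothetical one-hidden-layer MLP with ReLU, producing a single matching score."""
    hidden = [max(0.0, dot(row, x) + b) for row, b in zip(w1, b1)]
    return dot(w2, hidden) + b2


# Toy inputs: 2 query tokens and 3 gallery tokens, each a 2-dimensional feature.
query_tokens = [[1.0, 0.0], [0.0, 1.0]]
gallery_tokens = [[1.0, 0.0], [0.5, 0.5], [0.0, 2.0]]

# Hand-picked illustrative weights (a real model would learn these).
w1, b1 = [[1.0, 1.0], [-1.0, 0.0]], [0.0, 0.0]
w2, b2 = [0.5, 1.0], 0.1

sim = similarity_matrix(query_tokens, gallery_tokens)
pooled = global_max_pool(sim)
score = mlp_head(pooled, w1, b1, w2, b2)
print(sim, pooled, score)
```

Because no softmax or value aggregation is applied, the per-pair cost in this sketch is a single similarity matrix plus a max over its rows, which is consistent with the abstract's claim that the simplified decoder is computationally lighter than full attention.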

Code Repositories

ShengcaiLiao/TransMatcher (official, PyTorch; mentioned in GitHub)
shengcailiao/QAConv (official, PyTorch; mentioned in GitHub)

Benchmarks

Benchmark: generalizable-person-re-identification-on-20
Methodology: TransMatcher
Metrics:
  ClonedPerson->Rank-1: 51.6
  ClonedPerson->mAP: 20.8
  Market-1501->Rank-1: 47.3
  Market-1501->mAP: 18.4
  RandPerson->Rank-1: 48.3
  RandPerson->mAP: 17.7

Benchmark: generalizable-person-re-identification-on-21
Methodology: TransMatcher
Metrics:
  ClonedPerson->Rank-1: 84.8
  ClonedPerson->mAP: 62.3
  MSMT17->Rank-1: 80.1
  MSMT17->mAP: 52.0
  MSMT17-All->Rank-1: 82.6
  MSMT17-All->mAP: 58.4
  RandPerson->Rank-1: 77.3
  RandPerson->mAP: 49.1

Benchmark: generalizable-person-re-identification-on-22
Methodology: TransMatcher
Metrics:
  ClonedPerson->Rank-1: 25.4
  ClonedPerson->mAP: 24.4
  MSMT17->Rank-1: 23.7
  MSMT17->mAP: 22.5
  MSMT17-All->Rank-1: 31.9
  MSMT17-All->mAP: 30.7
  Market-1501->Rank-1: 22.2
  Market-1501->mAP: 21.4
  RandPerson->Rank-1: 17.1
  RandPerson->mAP: 16.0
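
For readers unfamiliar with the two metrics reported above, the following is a rough sketch of how Rank-1 and mAP are commonly computed from a query-by-gallery score matrix. The function name and toy data are hypothetical, and the sketch deliberately omits the camera-based filtering that standard person re-identification protocols apply, so it is not the evaluation code behind the numbers listed here.

```python
def rank1_and_map(scores, query_ids, gallery_ids):
    """Compute Rank-1 accuracy and mean average precision (mAP).

    scores[i][j] is the similarity between query i and gallery image j
    (higher means more similar). Queries with no true match in the
    gallery are skipped.
    """
    hits, average_precisions = 0, []
    for row, qid in zip(scores, query_ids):
        # Rank gallery images from most to least similar.
        order = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        matches = [gallery_ids[j] == qid for j in order]
        if not any(matches):
            continue
        hits += int(matches[0])  # Rank-1: is the top-ranked image a true match?
        found, precisions = 0, []
        for rank, is_match in enumerate(matches, start=1):
            if is_match:
                found += 1
                precisions.append(found / rank)  # precision at each true match
        average_precisions.append(sum(precisions) / len(precisions))
    n = len(average_precisions)
    if n == 0:
        return 0.0, 0.0
    return hits / n, sum(average_precisions) / n


# Toy data: 2 queries, 3 gallery images with identities 1, 2, 1.
scores = [[0.9, 0.2, 0.8],
          [0.6, 0.5, 0.3]]
rank1, mean_ap = rank1_and_map(scores, query_ids=[1, 2], gallery_ids=[1, 2, 1])
print(rank1, mean_ap)  # 0.5 0.75
```

In the toy data, the first query ranks both of its true matches first (average precision 1.0), while the second query finds its only true match at rank 2 (average precision 0.5), giving Rank-1 of 0.5 and mAP of 0.75.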
