HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Attribute De-biased Vision Transformer (AD-ViT) for Long-Term Person Re-identification

{and Venu Govindaraju Srirangaraj Setlur Deen Mohan Bhavin Jawade Kyung Won Lee}

Abstract

Person re-identification (re-ID) aims to retrieve images of the same identity from a gallery of person images across cameras and viewpoints. However, most works in person re-ID assume a short-term setting characterized by invariance in appearance. In contrast, a high visual variance can be frequently seen in a long-term setting due to changes in apparel and accessories, which makes the task more challenging. Therefore, learning identity-specific features agnostic of temporally variant features is crucial for robust long-term person Re-ID. To this end, we propose an Attribute De-biased Vision Transformer (AD-ViT) to provide direct supervision to learn identity-specific features. Specifically, we produce attribute labels for person instances and utilize them to guide our model to focus on identity features through gradient reversal. Our experiments on two longterm re-ID datasets - LTCC and NKUP show that the proposed work consistently outperforms current state-of-theart methods.

Benchmarks

BenchmarkMethodologyMetrics
person-re-identification-on-ltccAD-ViT
Rank-1: 72
mAP: 34.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Attribute De-biased Vision Transformer (AD-ViT) for Long-Term Person Re-identification | Papers | HyperAI