HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Dense Interaction Learning for Video-based Person Re-identification

Tianyu He Xin Jin Xu Shen Jianqiang Huang Zhibo Chen Xian-Sheng Hua

Dense Interaction Learning for Video-based Person Re-identification

Abstract

Video-based person re-identification (re-ID) aims at matching the same person across video clips. Efficiently exploiting multi-scale fine-grained features while building the structural interaction among them is pivotal for its success. In this paper, we propose a hybrid framework, Dense Interaction Learning (DenseIL), that takes the principal advantages of both CNN-based and Attention-based architectures to tackle video-based person re-ID difficulties. DenseIL contains a CNN encoder and a Dense Interaction (DI) decoder. The CNN encoder is responsible for efficiently extracting discriminative spatial features while the DI decoder is designed to densely model spatial-temporal inherent interaction across frames. Different from previous works, we additionally let the DI decoder densely attends to intermediate fine-grained CNN features and that naturally yields multi-grained spatial-temporal representation for each video clip. Moreover, we introduce Spatio-TEmporal Positional Embedding (STEP-Emb) into the DI decoder to investigate the positional relation among the spatial-temporal inputs. Our experiments consistently and significantly outperform all the state-of-the-art methods on multiple standard video-based person re-ID datasets.

Benchmarks

BenchmarkMethodologyMetrics
person-re-identification-on-dukemtmc-reidDenseIL
mAP: 97.1
person-re-identification-on-marsDenseIL
Rank-1: 90.8
Rank-20: 98.8
Rank-5: 97.1
mAP: 87

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Dense Interaction Learning for Video-based Person Re-identification | Papers | HyperAI