MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection

Medeiros Heitor R. ; Latortue David ; Granger Eric ; Pedersoli Marco

Abstract

In real-world scenarios, using multiple modalities like visible (RGB) and infrared (IR) can greatly improve the performance of a predictive task such as object detection (OD). Multimodal learning is a common way to leverage these modalities, where multiple modality-specific encoders and a fusion module are used to improve performance. In this paper, we tackle a different way to employ RGB and IR modalities, where only one modality or the other is observed by a single shared vision encoder. This realistic setting requires a lower memory footprint and is more suitable for applications such as autonomous driving and surveillance, which commonly rely on RGB and IR data. However, when learning a single encoder on multiple modalities, one modality can dominate the other, producing uneven recognition results. This work investigates how to efficiently leverage RGB and IR modalities to train a common transformer-based OD vision encoder, while countering the effects of modality imbalance. For this, we introduce a novel training technique to Mix Patches (MiPa) from the two modalities, in conjunction with a patch-wise modality agnostic module, for learning a common representation of both modalities. Our experiments show that MiPa can learn a representation to reach competitive results on traditional RGB/IR benchmarks while only requiring a single modality during inference. Our code is available at: https://github.com/heitorrapela/MiPa.
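
The patch-mixing idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function name `mix_patches`, the `ir_ratio` parameter, and the independent per-patch random draw are assumptions made for illustration, and the patch-wise modality agnostic module is not shown. Images are plain nested lists to keep the example dependency-free; a real pipeline would presumably work on tensors of spatially aligned RGB/IR pairs.

```python
import random


def mix_patches(rgb, ir, patch_size, ir_ratio=0.5, rng=None):
    """Build one image whose patches are drawn at random from rgb or ir.

    rgb, ir: equally sized 2D grids (lists of rows); a pixel can be any value.
    ir_ratio: assumed probability that a patch is taken from the IR image.
    Returns (mixed, mask), where mask[i][j] is 1 if patch (i, j) came from ir.
    """
    if rng is None:
        rng = random.Random()
    height, width = len(rgb), len(rgb[0])
    if len(ir) != height or len(ir[0]) != width:
        raise ValueError("rgb and ir must have the same spatial size")
    if height % patch_size or width % patch_size:
        raise ValueError("image size must be divisible by patch_size")

    rows, cols = height // patch_size, width // patch_size
    # One independent random draw per patch decides its source modality.
    mask = [[1 if rng.random() < ir_ratio else 0 for _ in range(cols)]
            for _ in range(rows)]
    # Copy each pixel from whichever modality owns the patch it falls in.
    mixed = [
        [(ir if mask[y // patch_size][x // patch_size] else rgb)[y][x]
         for x in range(width)]
        for y in range(height)
    ]
    return mixed, mask


if __name__ == "__main__":
    rgb_img = [["R"] * 4 for _ in range(4)]
    ir_img = [["I"] * 4 for _ in range(4)]
    mixed_img, patch_mask = mix_patches(rgb_img, ir_img, patch_size=2,
                                        rng=random.Random(0))
    for row in mixed_img:
        print(" ".join(row))
```

Setting `ir_ratio` to 0.0 or 1.0 reduces the sketch to a single-modality input, which matches the inference setting described in the abstract, where only RGB or only IR is observed by the shared encoder.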

Code Repositories

heitorrapela/mipa (official, PyTorch)

Benchmarks

Benchmark                                   Methodology   Metrics
multispectral-object-detection-on-flir-1    MiPa          mAP: 44.8%, mAP50: 81.3%
object-detection-on-flir                    MiPa          AP 0.5: 0.813
object-detection-on-llvip                   MiPa          AP: 0.665
pedestrian-detection-on-llvip               MiPa          AP: 0.665
