HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Delving into Out-of-Distribution Detection with Vision-Language Representations

Yifei Ming Ziyang Cai Jiuxiang Gu Yiyou Sun Wei Li Yixuan Li

Delving into Out-of-Distribution Detection with Vision-Language Representations

Abstract

Recognizing out-of-distribution (OOD) samples is critical for machine learning systems deployed in the open world. The vast majority of OOD detection methods are driven by a single modality (e.g., either vision or language), leaving the rich information in multi-modal representations untapped. Inspired by the recent success of vision-language pre-training, this paper enriches the landscape of OOD detection from a single-modal to a multi-modal regime. Particularly, we propose Maximum Concept Matching (MCM), a simple yet effective zero-shot OOD detection method based on aligning visual features with textual concepts. We contribute in-depth analysis and theoretical insights to understand the effectiveness of MCM. Extensive experiments demonstrate that MCM achieves superior performance on a wide variety of real-world tasks. MCM with vision-language features outperforms a common baseline with pure visual features on a hard OOD task with semantically similar classes by 13.1% (AUROC). Code is available at https://github.com/deeplearning-wisc/MCM.

Code Repositories

HHU-MMBS/plp-official-tmlr2024
pytorch
Mentioned in GitHub
deeplearning-wisc/mcm
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
out-of-distribution-detection-on-imagenet-1k-10MCM (CLIP-B)
AUROC: 86.11
FPR95: 57.77
out-of-distribution-detection-on-imagenet-1k-10MCM (CLIP-L)
AUROC: 84.88
FPR95: 59.88
out-of-distribution-detection-on-imagenet-1k-12MCM (CLIP-L)
AUROC: 91.49
FPR95: 38.17
out-of-distribution-detection-on-imagenet-1k-3MCM (CLIP-L)
AUROC: 94.95
FPR95: 28.38
out-of-distribution-detection-on-imagenet-1k-3MCM (CLIP-B)
AUROC: 94.61
FPR95: 30.91
out-of-distribution-detection-on-imagenet-1k-8MCM (CLIP-L)
AUROC: 94.14
FPR95: 29.00
out-of-distribution-detection-on-imagenet-1k-8MCM (CLIP-B)
AUROC: 92.57
FPR95: 37.59
out-of-distribution-detection-on-imagenet-1k-9MCM (CLIP-L)
AUROC: 92.00
FPR95: 35.42
out-of-distribution-detection-on-imagenet-1k-9MCM (CLIP-B)
AUROC: 89.77
FPR95: 44.69

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Delving into Out-of-Distribution Detection with Vision-Language Representations | Papers | HyperAI