Command Palette
Search for a command to run...
Delving into Out-of-Distribution Detection with Vision-Language Representations
Yifei Ming Ziyang Cai Jiuxiang Gu Yiyou Sun Wei Li Yixuan Li

Abstract
Recognizing out-of-distribution (OOD) samples is critical for machine learning systems deployed in the open world. The vast majority of OOD detection methods are driven by a single modality (e.g., either vision or language), leaving the rich information in multi-modal representations untapped. Inspired by the recent success of vision-language pre-training, this paper enriches the landscape of OOD detection from a single-modal to a multi-modal regime. Particularly, we propose Maximum Concept Matching (MCM), a simple yet effective zero-shot OOD detection method based on aligning visual features with textual concepts. We contribute in-depth analysis and theoretical insights to understand the effectiveness of MCM. Extensive experiments demonstrate that MCM achieves superior performance on a wide variety of real-world tasks. MCM with vision-language features outperforms a common baseline with pure visual features on a hard OOD task with semantically similar classes by 13.1% (AUROC). Code is available at https://github.com/deeplearning-wisc/MCM.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| out-of-distribution-detection-on-imagenet-1k-10 | MCM (CLIP-B) | AUROC: 86.11 FPR95: 57.77 |
| out-of-distribution-detection-on-imagenet-1k-10 | MCM (CLIP-L) | AUROC: 84.88 FPR95: 59.88 |
| out-of-distribution-detection-on-imagenet-1k-12 | MCM (CLIP-L) | AUROC: 91.49 FPR95: 38.17 |
| out-of-distribution-detection-on-imagenet-1k-3 | MCM (CLIP-L) | AUROC: 94.95 FPR95: 28.38 |
| out-of-distribution-detection-on-imagenet-1k-3 | MCM (CLIP-B) | AUROC: 94.61 FPR95: 30.91 |
| out-of-distribution-detection-on-imagenet-1k-8 | MCM (CLIP-L) | AUROC: 94.14 FPR95: 29.00 |
| out-of-distribution-detection-on-imagenet-1k-8 | MCM (CLIP-B) | AUROC: 92.57 FPR95: 37.59 |
| out-of-distribution-detection-on-imagenet-1k-9 | MCM (CLIP-L) | AUROC: 92.00 FPR95: 35.42 |
| out-of-distribution-detection-on-imagenet-1k-9 | MCM (CLIP-B) | AUROC: 89.77 FPR95: 44.69 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.