HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

Dan Hendrycks; Kevin Gimpel

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

Abstract

We consider the two related problems of detecting if an example is misclassified or out-of-distribution. We present a simple baseline that utilizes probabilities from softmax distributions. Correctly classified examples tend to have greater maximum softmax probabilities than erroneously classified and out-of-distribution examples, allowing for their detection. We assess performance by defining several tasks in computer vision, natural language processing, and automatic speech recognition, showing the effectiveness of this baseline across all. We then show the baseline can sometimes be surpassed, demonstrating the room for future research on these underexplored detection tasks.

Code Repositories

sooonwoo/RotNet-OOD
pytorch
Mentioned in GitHub
oliverzhang42/ood_medical_images
pytorch
Mentioned in GitHub
2sang/OOD-baseline
tf
Mentioned in GitHub
thuiar/textoir-demo
pytorch
Mentioned in GitHub
kingjamessong/rankfeat
pytorch
Mentioned in GitHub
guyAmit/GLOD
pytorch
Mentioned in GitHub
thuiar/textoir
pytorch
Mentioned in GitHub
lancopku/avg-avg
pytorch
Mentioned in GitHub
drumpt/RotNet-OOD
pytorch
Mentioned in GitHub
hendrycks/error-detection
Official
tf
Mentioned in GitHub
zjysteven/mixoe
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
out-of-distribution-detection-on-cifar-10-vsWRN 40-2 (MSP Baseline)
AUPR: 55.8
AUROC: 87.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks | Papers | HyperAI