Command Palette
Search for a command to run...
Li Yuting ; Chen Yingyi ; Yu Xuanlong ; Chen Dexiong ; Shen Xi

Abstract
In this paper, we revisit techniques for uncertainty estimation within deepneural networks and consolidate a suite of techniques to enhance theirreliability. Our investigation reveals that an integrated application ofdiverse techniques--spanning model regularization, classifier andoptimization--substantially improves the accuracy of uncertainty predictions inimage classification tasks. The synergistic effect of these techniquesculminates in our novel SURE approach. We rigorously evaluate SURE against thebenchmark of failure prediction, a critical testbed for uncertainty estimationefficacy. Our results showcase a consistently better performance than modelsthat individually deploy each technique, across various datasets and modelarchitectures. When applied to real-world challenges, such as data corruption,label noise, and long-tailed class distribution, SURE exhibits remarkablerobustness, delivering results that are superior or on par with currentstate-of-the-art specialized methods. Particularly on Animal-10N and Food-101Nfor learning with noisy labels, SURE achieves state-of-the-art performancewithout any task-specific adjustments. This work not only sets a new benchmarkfor robust uncertainty estimation but also paves the way for its application indiverse, real-world scenarios where reliability is paramount. Our code isavailable at \url{https://yutingli0606.github.io/SURE/}.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-food-101n-1 | SURE(ResNet-50) | Accuracy: 88.0 |
| learning-with-noisy-labels-on-animal | SURE | Accuracy: 89.0 ImageNet Pretrained: NO Network: Vgg19-BN |
| long-tail-learning-on-cifar-10-lt-r-10 | SURE(ResNet-32) | Error Rate: 5.04 |
| long-tail-learning-on-cifar-10-lt-r-100 | SURE(ResNet-32) | Error Rate: 13.07 |
| long-tail-learning-on-cifar-10-lt-r-50 | SURE(ResNet-32) | Error Rate: 9.78 |
| long-tail-learning-on-cifar-100-lt-r-10 | SURE(ResNet-32) | Error Rate: 26.76 |
| long-tail-learning-on-cifar-100-lt-r-100 | SURE(ResNet-32) | Error Rate: 43.66 |
| long-tail-learning-on-cifar-100-lt-r-50 | SURE(ResNet-32) | Error Rate: 36.87 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.