VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization

Pankaj Mishra, Riccardo Verk, Daniele Fornasier, Claudio Piciarelli, Gian Luca Foresti

Abstract

We present a transformer-based network for image anomaly detection and localization. The proposed model combines a reconstruction-based approach with patch embedding. The use of transformer networks helps preserve the spatial information of the embedded patches, which are then processed by a Gaussian mixture density network to localize anomalous areas. We also publish BTAD, a real-world industrial anomaly dataset. Our results are compared with other state-of-the-art algorithms on publicly available datasets such as MNIST and MVTec.
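
The localization idea in the abstract (split the image into patches, then score each patch under a Gaussian mixture so that unlikely patches stand out) can be illustrated with a minimal sketch. This is not the authors' implementation: the transformer encoder and the reconstruction branch are omitted, raw pixel values stand in for the encoded patch features, and the mixture parameters are fixed by hand rather than predicted by a network. The names `split_into_patches`, `gmm_nll`, and `anomaly_map` are hypothetical.

```python
import math


def split_into_patches(image, patch):
    """Split a 2-D image (list of rows) into non-overlapping patch x patch
    tiles in row-major order; each tile is flattened into a vector."""
    h, w = len(image), len(image[0])
    if h % patch or w % patch:
        raise ValueError("image size must be divisible by patch size")
    tiles = []
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            tiles.append([image[r][c]
                          for r in range(top, top + patch)
                          for c in range(left, left + patch)])
    return tiles


def gmm_nll(x, weights, means, variances):
    """Negative log-likelihood of vector x under a Gaussian mixture with
    diagonal covariances (assumed here for simplicity)."""
    log_terms = []
    for pi, mu, var in zip(weights, means, variances):
        log_p = math.log(pi)
        for xi, m, v in zip(x, mu, var):
            log_p += -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        log_terms.append(log_p)
    top = max(log_terms)  # log-sum-exp for numerical stability
    return -(top + math.log(sum(math.exp(t - top) for t in log_terms)))


def anomaly_map(image, patch, weights, means, variances):
    """Return a grid of per-patch scores; higher means less likely."""
    tiles = split_into_patches(image, patch)
    cols = len(image[0]) // patch
    scores = [gmm_nll(t, weights, means, variances) for t in tiles]
    return [scores[i:i + cols] for i in range(0, len(scores), cols)]


# Toy example: a 4x4 image of zeros with one bright 2x2 patch (bottom right).
image = [[0, 0, 0, 0],
         [0, 0, 0, 0],
         [0, 0, 5, 5],
         [0, 0, 5, 5]]
# Hand-picked mixture describing "normal" patches (an assumption).
weights = [0.5, 0.5]
means = [[0.0] * 4, [1.0] * 4]
variances = [[1.0] * 4, [1.0] * 4]
score_grid = anomaly_map(image, 2, weights, means, variances)
```

In this toy setup the bottom-right entry of `score_grid` is the largest, which is the sense in which a per-patch likelihood localizes an anomaly. Because the patches keep their grid positions, the scores can be laid back over the image as a coarse anomaly map.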

Code Repositories

pankajmishra000/VT-ADL (official, PyTorch)

Benchmarks

Benchmark: anomaly-detection-on-btad
Methodology: VT-ADL
Metrics: Segmentation AUROC: 81.8
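
For readers unfamiliar with the metric, segmentation AUROC is commonly computed at the pixel level: every pixel has an anomaly score and a ground-truth label, and the AUROC is the probability that a randomly chosen anomalous pixel scores higher than a randomly chosen normal one. The sketch below shows that definition directly. It is an illustration only and not the benchmark's evaluation code; the function name `auroc` and the toy values are assumptions, and the pairwise loop is quadratic, so real evaluations would use a rank-based or library implementation.

```python
def auroc(scores, labels):
    """AUROC as the fraction of (anomalous, normal) pairs in which the
    anomalous item has the higher score; ties count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one anomalous and one normal item")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example: four pixels, two of them anomalous (label 1).
pixel_scores = [0.1, 0.4, 0.35, 0.8]
pixel_labels = [0, 0, 1, 1]
result = auroc(pixel_scores, pixel_labels)  # 3 of 4 pairs ranked correctly
```

The function returns a value between 0 and 1; the 81.8 reported above is presumably the same quantity expressed as a percentage.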
