An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Abstract

While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used to replace certain components of convolutional networks while keeping their overall structure in place. We show that this reliance on CNNs is not necessary and a pure transformer applied directly to sequences of image patches can perform very well on image classification tasks. When pre-trained on large amounts of data and transferred to multiple mid-sized or small image recognition benchmarks (ImageNet, CIFAR-100, VTAB, etc.), Vision Transformer (ViT) attains excellent results compared to state-of-the-art convolutional networks while requiring substantially fewer computational resources to train.
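The phrase "sequences of image patches" can be made concrete with a small sketch. The code below is an illustration only, not the authors' implementation: it cuts an image into non-overlapping 16x16 patches (the patch size in the paper's title) and flattens each one into a vector, giving the token sequence a Transformer would consume. The helper name `patchify`, the 224x224 RGB input, and the plain nested-list image format are assumptions made for this example. The learned linear projection, class token, and position embeddings that a full model would add are left out.

```python
from typing import List

# Hypothetical image format for this sketch: height x width x channels.
Image = List[List[List[float]]]


def patchify(image: Image, patch_size: int = 16) -> List[List[float]]:
    """Split an H x W x C image into flattened, non-overlapping square patches."""
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image sides must be divisible by patch_size")
    patches = []
    # Walk the patch grid in raster order (left to right, top to bottom).
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            patch = []
            for row in range(top, top + patch_size):
                for col in range(left, left + patch_size):
                    patch.extend(image[row][col])  # append all channel values
            patches.append(patch)
    return patches


# Assumed input for illustration: a dummy 224 x 224 RGB image of zeros.
image = [[[0.0, 0.0, 0.0] for _ in range(224)] for _ in range(224)]
tokens = patchify(image, patch_size=16)
print(len(tokens), len(tokens[0]))  # 196 patches, each 16 * 16 * 3 = 768 values
```

Under these assumptions the image becomes a sequence of (224 / 16)^2 = 196 tokens, so the sequence length grows quadratically as the patch size shrinks.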

Code Repositories

rayanramoul/Visual-Transformer-PyTorch (pytorch)
Thanusan19/Vision_Transformer (jax)
YanYan0716/vision_transform (tf)
ludics/ViT-Retri (pytorch)
SupreethRao99/VisionTransformer (pytorch)
quanmario0311/ViT_PyTorch (pytorch)
haiyang-w/git (pytorch)
mtancak1/PyTorch-ViT-Visual-Transformer (pytorch)
ruiqirichard/eegeyenet-vit (pytorch)
james77777778/keras-image-models (pytorch)
KiUngSong/Vision (pytorch)
nima1999nikkhah/ViT-Hybrid (pytorch)
timH6502/VisionTransformer-PyTorch (pytorch)
liuxingwt/CLS (pytorch)
qiaopTDUN/mae-repo (pytorch)
SHI-Labs/Compact-Transformers (pytorch)
faustomorales/vit-keras (tf)
asarigun/TransGAN (pytorch)
shahrukhx01/ocr-test (pytorch)
charchit7/Using_Transoformers (pytorch)
rwightman/pytorch-image-models (pytorch)
BaiqiangGit/15minCode (pytorch)
wangguanan/light-reid (pytorch)
jiangtaoxie/So-ViT (pytorch)
naver-ai/pflayer (pytorch)
SrinjaySarkar/ViT (pytorch)
Westlake-AI/openmixup (pytorch)
Ugenteraan/Masked-AutoEncoder-PyTorch (pytorch)
bshantam97/Attention_Based_Networks (pytorch)
smu-ivpl/DeepfakeDetection (pytorch)
jaketae/mlp-mixer (pytorch)
conceptofmind/ViT-haiku (jax)
Julien-pour/music_classifcation (pytorch)
gnoses/ViT_examples (pytorch)
TACJu/TransFG (pytorch)
kingcong/vit (mindspore)
skchen1993/TrangFG (pytorch)
BrianPulfer/PapersReimplementations (pytorch)
BebDong/MXNetSeg (mxnet)
KatherLab/HIA (pytorch)
facebookresearch/vissl (pytorch)
drumpt/ViT (pytorch)
google-research/vision_transformer (jax, official)
dispink/xpt (pytorch)
kakaobrain/coyo-dataset (pytorch)
IMvision12/keras-vision-models (pytorch)
alililia/vit_base_GPU (mindspore)
sangHa0411/VIT (pytorch)
sneakatyou/ViT-Tensorflow-2.0 (tf)
stevenwalton/scs-cct (pytorch)
huggingface/transformers (pytorch)
04RR/SOTA-Vision (pytorch)
YousefGamal220/Vision-Transformers (pytorch)
junyongyou/triq (pytorch)
nachiket273/Vision_transformer_pytorch (pytorch)
alililia/vit_base_Ascend (mindspore)
facebookresearch/hiera (pytorch)
Mind23-2/MindCode-89 (mindspore)
mtancak/PyTorch-ViT-Visual-Transformer (pytorch)
jacobgil/vit-explain (pytorch)
ttt496/VisionTransformer (jax)
HyeonhoonLee/MAIC2021_Sleep (pytorch)
sliao-mi-luku/Galaxy-Zoo-Classification (pytorch)
davisking/dlib-models
gmum/dl-mo-2021
Mayurji/Image-Classification-PyTorch (pytorch)
holdfire/CLS (pytorch)
Kevinz-code/CSRA (pytorch)
Aedelon/ViT-PyTorch-Replication (pytorch)
staghado/vit.cpp (pytorch)
mahmoodlab/hipt (pytorch)
Ugenteraan/Vanilla-ViT (pytorch)
DominikBatic/EndoViT (pytorch)
tahmid0007/VisionTransformer (pytorch)
SforAiDl/vformer (pytorch)
explainingai-code/VIT-Pytorch (pytorch)
meowbutlerdev/ViT (pytorch)
nasa-impact/hls-foundation-os (pytorch)
Mind23-2/MindCode-1 (paddle)
nachiket273/VisTrans (pytorch)
zpc-666/Paddle-R-Drop (paddle)
modeeric/eegvit-tcnet (pytorch)
nateraw/lightning-vision-transformer (pytorch)
protonx-engineering/vit (tf)
jeonsworld/ViT-pytorch (pytorch)
holdfire/FAS (pytorch)
asyml/vision-transformer-pytorch (jax)
jo1jun/Vision_Transformer (pytorch)
lukas-blecher/LaTeX-OCR (pytorch)
woctezuma/steam-CLIP
tintn/vision-transformer-from-scratch (pytorch)
smitheric95/MoCoViT-PyTorch (pytorch)
uygarkurt/ViT-PyTorch (pytorch)

Benchmarks

Benchmark | Methodology | Metrics
domain-generalization-on-vizwiz | ViT-8/B-224 | Accuracy - Clean Images: 450
domain-generalization-on-vizwiz | ViT-16/L-224 | Accuracy - All Images: 49
fine-grained-image-classification-on-oxford-2 | ViT-B/16 | Top-1 Error Rate: 6.2%
image-classification-on-cifar-10 | ViT-H/14 | Percentage correct: 99.5
image-classification-on-cifar-10 | ViT-L/16 | Percentage correct: 99.42
image-classification-on-flowers-102 | - | Accuracy: 99.68
image-classification-on-imagenet | ViT-L/16 | Top 1 Accuracy: 87.76%
image-classification-on-imagenet | ViT-Large | Top 1 Accuracy: 24%
image-classification-on-imagenet | - | Top 5 Accuracy: 23.72
image-classification-on-imagenet | ViT-H/14 | Top 1 Accuracy: 88.55%
image-classification-on-objectnet | ViT-H/14 | Top-5 Accuracy: 82.1
