| ViT-B-16 (ImageNet-21K-P pretrain) | 94.2 | ImageNet-21K Pretraining for the Masses | |
| Heinsen Routing + BEiT-large 16 224 | 93.8 | An Algorithm for Routing Vectors in Sequences | |
| VIT-L/16 (Spinal FC, Background) | 93.31 | Reduction of Class Activation Uncertainty with Background Information | |
| CeiT-S (384 finetune resolution) | 91.8 | Incorporating Convolution Designs into Visual Transformers | |