| DINOv2 (ViT-g/14, frozen model, linear eval) | 99.5 | DINOv2: Learning Robust Visual Features without Supervision | |
| RDNet-L (224 res, IN-1K pretrained) | 99.31 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | |
| RDNet-B (224 res, IN-1K pretrained) | 99.31 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | |
| Heinsen Routing + BEiT-large 16 224 | 99.2 | An Algorithm for Routing Vectors in Sequences | |
| CeiT-S (384 finetune resolution) | 99.1 | Incorporating Convolution Designs into Visual Transformers | |
| VIT-L/16 (Spinal FC, Background) | 99.05 | Reduction of Class Activation Uncertainty with Background Information | |