Image Classification On Cifar 10

评估指标

Percentage correct

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
DINOv2 (ViT-g/14, frozen model, linear eval)99.5DINOv2: Learning Robust Visual Features without Supervision
ViT-H/1499.5An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
µ2Net (ViT-L/16)99.49An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems
ViT-L/1699.42An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
CaiT-M-36 U 22499.4--
CvT-W2499.39CvT: Introducing Convolutions to Vision Transformers
BiT-L (ResNet)99.37Big Transfer (BiT): General Visual Representation Learning
RDNet-L (224 res, IN-1K pretrained)99.31DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs
RDNet-B (224 res, IN-1K pretrained)99.31DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs
ViT-B (attn fine-tune)99.3Three things everyone should know about Vision Transformers
Heinsen Routing + BEiT-large 16 22499.2An Algorithm for Routing Vectors in Sequences
ViT-B/16 (PUGD)99.13Perturbated Gradients Updating within Unit Space for Deep Learning
Astroformer99.12Astroformer: More Data Might not be all you need for Classification
CeiT-S (384 finetune resolution)99.1Incorporating Convolution Designs into Visual Transformers
TNT-B99.1Transformer in Transformer
DeiT-B99.1Training data-efficient image transformers & distillation through attention
EfficientNetV2-L99.1EfficientNetV2: Smaller Models and Faster Training
AutoFormer-S | 38499.1AutoFormer: Searching Transformers for Visual Recognition
VIT-L/16 (Spinal FC, Background)99.05Reduction of Class Activation Uncertainty with Background Information
LaNet99.03Sample-Efficient Neural Architecture Search by Learning Action Space for Monte Carlo Tree Search-
0 of 264 row(s) selected.
Image Classification On Cifar 10 | SOTA | HyperAI超神经