Image Classification on iNaturalist 2019

Evaluation Metric

Top-1 Accuracy
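
As a rough illustration of the metric, top-1 accuracy is the percentage of samples for which the highest-scoring predicted class matches the ground-truth label. The sketch below assumes per-class scores are available as plain Python lists; the function name `top1_accuracy` and the toy data are hypothetical and do not come from any of the listed papers or from the benchmark's own evaluation code.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose highest-scoring class equals the label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score; ties resolve to the lowest class index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy data (illustrative only): 4 samples, 3 classes.
scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.5, 0.3, 0.2],  # predicted class 0
    [0.2, 0.2, 0.6],  # predicted class 2
    [0.4, 0.5, 0.1],  # predicted class 1
]
labels = [1, 0, 2, 0]

accuracy = top1_accuracy(scores, labels)
print(f"Top-1 accuracy: {accuracy:.1f}%")
```

In this toy case three of the four predictions match their labels. The figures reported on the leaderboard come from each paper's own evaluation setup, which may differ in preprocessing and resolution.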

Evaluation Results

Performance of each model on this benchmark

| Model | Top-1 Accuracy (%) | Paper Title |
| --- | --- | --- |
| Hiera-H (448px) | 88.5 | Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles |
| MAE (ViT-H, 448) | 88.3 | Masked Autoencoders Are Scalable Vision Learners |
| Grafit (RegnetY 8GF) | 84.1 | Grafit: Learning fine-grained image representations with coarse labels |
| MixMIM-L | 83.9 | MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers |
| RDNet-L (224 res, IN-1K pretrained) | 83.7 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
| RDNet-B (224 res, IN-1K pretrained) | 83.5 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
| RDNet-S (224 res, IN-1K pretrained) | 82.9 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
| Conviformer-B | 82.85 | Conviformers: Convolutionally guided Vision Transformer |
| CeiT-S (384 finetune resolution) | 82.7 | Incorporating Convolution Designs into Visual Transformers |
| CaiT-M-36 U 224 | 81.8 | - |
| RDNet-T (224 res, IN-1K pretrained) | 81.2 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
| CeiT-S | 78.9 | Incorporating Convolution Designs into Visual Transformers |
| CeiT-T (384 finetune resolution) | 77.9 | Incorporating Convolution Designs into Visual Transformers |
| ResNet50 (A2) | 75.0 | ResNet strikes back: An improved training procedure in timm |
| LeViT-384 | 74.3 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference |
| CeiT-T | 72.8 | Incorporating Convolution Designs into Visual Transformers |
| ResMLP-24 | 72.5 | ResMLP: Feedforward networks for image classification with data-efficient training |
| LeViT-256 | 72.3 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference |
| ResMLP-12 | 71.0 | ResMLP: Feedforward networks for image classification with data-efficient training |
| LeViT-192 | 70.8 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference |