Self Supervised Image Classification On

评估指标

Number of Params
Top 1 Accuracy

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
DINOv2+reg (ViT-g/14)1100M87.1Vision Transformers Need Registers
DINOv2 (ViT-g/14 @448)1100M86.7%DINOv2: Learning Robust Visual Features without Supervision
DINOv2 (ViT-g/14)1100M86.5%DINOv2: Learning Robust Visual Features without Supervision
DINOv2 distilled (ViT-L/14)307M86.3%DINOv2: Learning Robust Visual Features without Supervision
MIM-Refiner (D2V2-ViT-H/14)632M84.7%MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
DINOv2 distilled (ViT-B/14)85M84.5%DINOv2: Learning Robust Visual Features without Supervision
MIM-Refiner (MAE-ViT-2B/14)1890M84.5%MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
MIM-Refiner (MAE-ViT-H/14632M83.7%MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
MIM-Refiner (D2V2-ViT-L/16)307M83.5%MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
MIM-Refiner (MAE-ViT-L/16)307M82.8%MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
iBOT (ViT-L/16) (IN22k)307M82.3%iBOT: Image BERT Pre-Training with Online Tokenizer
MAE-CT (ViT-H/16)632M82.2%Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget
Mugs (VIT-L/16)307M82.1%Mugs: A Multi-Granular Self-Supervised Learning Framework
MAE-CT (ViT-L/16307M81.5%Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget
EsViT (Swin-B)87M81.3Efficient Self-supervised Vision Transformers for Representation Learning
iBOT (ViT-L/16)307M81.3%iBOT: Image BERT Pre-Training with Online Tokenizer
DINOv2 distilled (ViT-S/14)21M81.1%DINOv2: Learning Robust Visual Features without Supervision
MoCo v3 (ViT-BN-L/7)304M81.0%An Empirical Study of Training Self-Supervised Vision Transformers
EsViT(Swin-S)49M80.8Efficient Self-supervised Vision Transformers for Representation Learning
MSN (ViT-L/7)306M80.7%Masked Siamese Networks for Label-Efficient Learning
0 of 142 row(s) selected.
Self Supervised Image Classification On | SOTA | HyperAI超神经