Scene Text Recognition On Wost
评估指标
1:1 Accuracy
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| CLIP4STR-H (DFN-5B) | 90.9 | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | |
| CLIP4STR-L (DataComp-1B) | 90.6 | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | |
| CLIP4STR-L | 88.8 | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | |
| CLIP4STR-B | 87.0 | CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | |
| CCD-ViT-Base | 86.0 | Self-supervised Character-to-Character Distillation for Text Recognition |
0 of 5 row(s) selected.