HyperAI
HyperAI超神经
首页
算力平台
文档
资讯
论文
教程
数据集
百科
SOTA
LLM 模型天梯
GPU 天梯
顶会
开源项目
全站搜索
关于
中文
HyperAI
HyperAI超神经
Toggle sidebar
全站搜索…
⌘
K
全站搜索…
⌘
K
首页
SOTA
场景文本识别
Scene Text Recognition On Icdar2013
Scene Text Recognition On Icdar2013
评估指标
Accuracy
评测结果
各个模型在此基准测试上的表现结果
Columns
模型名称
Accuracy
Paper Title
Repository
CLIP4STR-L*
99.42
An Empirical Study of Scaling Law for OCR
DTrOCR 105M
99.4
DTrOCR: Decoder-only Transformer for Optical Character Recognition
CLIP4STR-L (DataComp-1B)
99.0
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
CLIP4STR-L
98.5
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
MGP-STR
98.5
Multi-Granularity Prediction for Scene Text Recognition
PARSeq
98.4±0.2
Scene Text Recognition with Permuted Autoregressive Sequence Models
CLIP4STR-B
98.3
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
CCD-ViT-Base(ARD_2.8M)
98.3
Self-supervised Character-to-Character Distillation for Text Recognition
CCD-ViT-Small(ARD_2.8M)
98.3
Self-supervised Character-to-Character Distillation for Text Recognition
MATRN
97.9
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
SIGA_T
97.8
Self-supervised Implicit Glyph Attention for Text Recognition
S-GTR
97.8
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
DPAN
97.7
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition
-
CDistNet (Ours)
97.67
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition
CCD-ViT-Tiny(ARD_2.8M)
97.5
Self-supervised Character-to-Character Distillation for Text Recognition
SVTR-L (Large)
97.2
SVTR: Scene Text Recognition with a Single Visual Model
SVTR-B (Base)
97.1
SVTR: Scene Text Recognition with a Single Visual Model
DiffusionSTR
97.1
DiffusionSTR: Diffusion Model for Scene Text Recognition
-
Yet Another Text Recognizer
96.8
Why You Should Try the Real Data for the Scene Text Recognition
SVTR-T (Tiny)
96.3
SVTR: Scene Text Recognition with a Single Visual Model
0 of 38 row(s) selected.
Previous
Next
Scene Text Recognition On Icdar2013 | SOTA | HyperAI超神经