HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Baoguang Shi; Xiang Bai; Cong Yao

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Abstract

Image-based sequence recognition has been a long-standing research topic in computer vision. In this paper, we investigate the problem of scene text recognition, which is among the most important and challenging tasks in image-based sequence recognition. A novel neural network architecture, which integrates feature extraction, sequence modeling and transcription into a unified framework, is proposed. Compared with previous systems for scene text recognition, the proposed architecture possesses four distinctive properties: (1) It is end-to-end trainable, in contrast to most of the existing algorithms whose components are separately trained and tuned. (2) It naturally handles sequences in arbitrary lengths, involving no character segmentation or horizontal scale normalization. (3) It is not confined to any predefined lexicon and achieves remarkable performances in both lexicon-free and lexicon-based scene text recognition tasks. (4) It generates an effective yet much smaller model, which is more practical for real-world application scenarios. The experiments on standard benchmarks, including the IIIT-5K, Street View Text and ICDAR datasets, demonstrate the superiority of the proposed algorithm over the prior arts. Moreover, the proposed algorithm performs well in the task of image-based music score recognition, which evidently verifies the generality of it.

Code Repositories

Yuting-Gao/CRNN_Mxnet
tf
Mentioned in GitHub
courao/ocr.pytorch
pytorch
Mentioned in GitHub
Liumihan/CRNN_pytorch
pytorch
Mentioned in GitHub
9ruddls3/CRNN_Pytorch
pytorch
Mentioned in GitHub
cjxxx0/license
tf
Mentioned in GitHub
chauthehan/CRNN_OCR_CMND
Mentioned in GitHub
Crespo-dong/caffe_ocr
Mentioned in GitHub
zwenwang/CTPN_Pytorch
pytorch
Mentioned in GitHub
lostsword/character_recognition
mindspore
Mentioned in GitHub
bai-shang/crnn_ctc_ocr_tf
tf
Mentioned in GitHub
tranbahien/CTC-OCR
tf
Mentioned in GitHub
bai-shang/CRNN_CTC_Tensorflow
tf
Mentioned in GitHub
githubharald/simplehtr
tf
Mentioned in GitHub
zhiqwang/image-captioning
pytorch
Mentioned in GitHub
DnanaDev/CRNN_for_OCR
tf
Mentioned in GitHub
HassamChundrigar/Urdu-Ocr
tf
Mentioned in GitHub
nithyadurai87/pottan-ocr-tamil
pytorch
Mentioned in GitHub
qjadud1994/CRNN-Keras
tf
Mentioned in GitHub
zhiqwang/crnn.pytorch
pytorch
Mentioned in GitHub
solivr/tf-crnn
tf
Mentioned in GitHub
CodeAchieveDream/crnn_model
pytorch
Mentioned in GitHub
mindee/doctr
pytorch
Mentioned in GitHub
shivaverma/Score-Time-Detection
pytorch
Mentioned in GitHub
sbillburg/CRNN-with-STN
tf
Mentioned in GitHub
WenmuZhou/PytorchOCR
pytorch
Mentioned in GitHub
mineshmathew/pytorch_rnn_examples
pytorch
Mentioned in GitHub
jackknife007/crnn
tf
Mentioned in GitHub
zyasjtu/CNN-RNN-CTC
tf
Mentioned in GitHub
MaybeShewill-CV/CRNN_Tensorflow
tf
Mentioned in GitHub
GitYCC/crnn-pytorch
pytorch
Mentioned in GitHub
DCSong/CRNN-DenseNet
pytorch
Mentioned in GitHub
FLming/CRNN.tf2
tf
Mentioned in GitHub
xmy0916/pytorch_crnn
pytorch
Mentioned in GitHub
WenmuZhou/Segmentation-Free_OCR
tf
Mentioned in GitHub
JaidedAI/EasyOCR
pytorch
Mentioned in GitHub
sonamghosh/local_hack_day_2018
pytorch
Mentioned in GitHub
bgshih/crnn
pytorch
Mentioned in GitHub
shreshtashetty/OCR
tf
Mentioned in GitHub
anuragcp/iocl-deepocr
tf
Mentioned in GitHub
topdu/openocr
pytorch
Mentioned in GitHub
cipri-tom/type-aware-crnn
tf
Mentioned in GitHub
lidongliang666/cv_deep_learning
pytorch
Mentioned in GitHub
xusongpei/crnn-ctc
tf
Mentioned in GitHub
sgenza/tf_crnn
tf
Mentioned in GitHub
bharatsush/TextSpotting
tf
Mentioned in GitHub
PaddlePaddle/PaddleOCR
paddle
Mentioned in GitHub
foamliu/CRNN
pytorch
Mentioned in GitHub
senlinuc/caffe_ocr
Mentioned in GitHub
abhiraman/Capstone_Project
pytorch
Mentioned in GitHub
harish2704/pottan-ocr
pytorch
Mentioned in GitHub
juanluisrosaramos/CRNN_OCR
tf
Mentioned in GitHub
carnotaur/crnn-tutorial
pytorch
Mentioned in GitHub
bai-shang/OCR_TF_CRNN_CTC
tf
Mentioned in GitHub
Liumihan/CRNN_kreas
tf
Mentioned in GitHub
weinman/cnn_lstm_ctc_ocr
tf
Mentioned in GitHub
SYR-Aegis/BrailleOCR
pytorch
Mentioned in GitHub
Media-Smart/vedastr
pytorch
Mentioned in GitHub
L706077/OCR-CRNN
pytorch
Mentioned in GitHub
wacr2008/tensorflow_crnn
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
scene-text-recognition-on-icdar-2003CRNN
Accuracy: 89.4
scene-text-recognition-on-icdar2013CRNN
Accuracy: 86.7
scene-text-recognition-on-svtCRNN
Accuracy: 80.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp