4 months ago

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Baoguang Shi; Xiang Bai; Cong Yao

Abstract

Image-based sequence recognition has been a long-standing research topic in computer vision. In this paper, we investigate the problem of scene text recognition, which is among the most important and challenging tasks in image-based sequence recognition. A novel neural network architecture, which integrates feature extraction, sequence modeling and transcription into a unified framework, is proposed. Compared with previous systems for scene text recognition, the proposed architecture possesses four distinctive properties: (1) It is end-to-end trainable, in contrast to most of the existing algorithms whose components are separately trained and tuned. (2) It naturally handles sequences in arbitrary lengths, involving no character segmentation or horizontal scale normalization. (3) It is not confined to any predefined lexicon and achieves remarkable performances in both lexicon-free and lexicon-based scene text recognition tasks. (4) It generates an effective yet much smaller model, which is more practical for real-world application scenarios. The experiments on standard benchmarks, including the IIIT-5K, Street View Text and ICDAR datasets, demonstrate the superiority of the proposed algorithm over the prior arts. Moreover, the proposed algorithm performs well in the task of image-based music score recognition, which evidently verifies the generality of it.

Code Repositories

bai-shang/crnn_ctc_ocr.Tensorflow

Mentioned in GitHub

Yuting-Gao/CRNN_Mxnet

Mentioned in GitHub

courao/ocr.pytorch

pytorch

Mentioned in GitHub

Liumihan/CRNN_pytorch

pytorch

Mentioned in GitHub

9ruddls3/CRNN_Pytorch

pytorch

Mentioned in GitHub

cjxxx0/license

Mentioned in GitHub

sartaj0/TextRecognition-Pytorch

pytorch

PratirupG/Handwriting-Recognition

Mentioned in GitHub

chauthehan/CRNN_OCR_CMND

Mentioned in GitHub

Crespo-dong/caffe_ocr

Mentioned in GitHub

oyxhust/CNN-LSTM-CTC-text-recognition

Mentioned in GitHub

zwenwang/CTPN_Pytorch

pytorch

Mentioned in GitHub

lostsword/character_recognition

mindspore

Mentioned in GitHub

bai-shang/crnn_ctc_ocr_tf

Mentioned in GitHub

tranbahien/CTC-OCR

Mentioned in GitHub

samueltin/tf-crnn_backup20180808

Mentioned in GitHub

bai-shang/CRNN_CTC_Tensorflow

Mentioned in GitHub

WenmuZhou/crnn.pytorch

pytorch

githubharald/simplehtr

Mentioned in GitHub

naveen-kumar-123/Handwritten-text-recognition---CNN-and-LSTM

Mentioned in GitHub

FaceOnLive/ID-Card-Passport-Recognition-SDK-Android

zhiqwang/image-captioning

pytorch

Mentioned in GitHub

DnanaDev/CRNN_for_OCR

Mentioned in GitHub

HassamChundrigar/Urdu-Ocr

Mentioned in GitHub

Holmeyoung/crnn-pytorch

pytorch

nithyadurai87/pottan-ocr-tamil

pytorch

Mentioned in GitHub

qjadud1994/CRNN-Keras

Mentioned in GitHub

zhiqwang/crnn.pytorch

pytorch

Mentioned in GitHub

junstar92/hangul-syllable-recognition

Mentioned in GitHub

tanmaysheoran/CRNN-Sequence-Text-Recognition

Mentioned in GitHub

solivr/tf-crnn

Mentioned in GitHub

CodeAchieveDream/crnn_model

pytorch

Mentioned in GitHub

wangrui1996/crnnLicensePlateRecognition

Mentioned in GitHub

rajmuchhala/Raj-Muchhala---Sr.-ML-Engineer--ML-Assignmen

Mentioned in GitHub

mindee/doctr

pytorch

Mentioned in GitHub

shivaverma/Score-Time-Detection

pytorch

Mentioned in GitHub

sbillburg/CRNN-with-STN

Mentioned in GitHub

WenmuZhou/PytorchOCR

pytorch

Mentioned in GitHub

mineshmathew/pytorch_rnn_examples

pytorch

Mentioned in GitHub

jackknife007/crnn

Mentioned in GitHub

anuragbaurai/Portable-camera-based-assistive-text-reader-for-blind-persons

Mentioned in GitHub

ztoString/CRNN_CTC_OCR_TensorFlow

Mentioned in GitHub

mrzaizai2k/Information-recognition-on-the-university-test-paper

Mentioned in GitHub

zyasjtu/CNN-RNN-CTC

Mentioned in GitHub

meijieru/crnn.pytorch

pytorch

MaybeShewill-CV/CRNN_Tensorflow

Mentioned in GitHub

GitYCC/crnn-pytorch

pytorch

Mentioned in GitHub

DCSong/CRNN-DenseNet

pytorch

Mentioned in GitHub

FLming/CRNN.tf2

Mentioned in GitHub

xmy0916/pytorch_crnn

pytorch

Mentioned in GitHub

2023-MindSpore-1/ms-code-210/tree/main/cnnctc

mindspore

WenmuZhou/Segmentation-Free_OCR

Mentioned in GitHub

JaidedAI/EasyOCR

pytorch

Mentioned in GitHub

sonamghosh/local_hack_day_2018

pytorch

Mentioned in GitHub

bgshih/crnn

pytorch

Mentioned in GitHub

mindspore-lab/mindocr

mindspore

shreshtashetty/OCR

Mentioned in GitHub

anuragcp/iocl-deepocr

Mentioned in GitHub

topdu/openocr

pytorch

Mentioned in GitHub

moto8xpk/DataExtractionJejuMLCamp

Mentioned in GitHub

cipri-tom/type-aware-crnn

Mentioned in GitHub

lidongliang666/cv_deep_learning

pytorch

Mentioned in GitHub

xusongpei/crnn-ctc

Mentioned in GitHub

kurapan/CRNN

Mingtzge/2019-CCF-BDCI-OCR-MCZJ-OCR-IdentificationIDElement

pytorch

Mentioned in GitHub

sgenza/tf_crnn

Mentioned in GitHub

bharatsush/TextSpotting

Mentioned in GitHub

PaddlePaddle/PaddleOCR

paddle

Mentioned in GitHub

foamliu/CRNN

pytorch

Mentioned in GitHub

senlinuc/caffe_ocr

Mentioned in GitHub

abhiraman/Capstone_Project

pytorch

Mentioned in GitHub

harish2704/pottan-ocr

pytorch

Mentioned in GitHub

juanluisrosaramos/CRNN_OCR

Mentioned in GitHub

carnotaur/crnn-tutorial

pytorch

Mentioned in GitHub

bai-shang/OCR_TF_CRNN_CTC

Mentioned in GitHub

Liumihan/CRNN_kreas

Mentioned in GitHub

weinman/cnn_lstm_ctc_ocr

Mentioned in GitHub

rickyHong/CRNN-Tensorflow-Text-repl

Mentioned in GitHub

abdulwaheedsoudagar/ImageTextTranslation

Mentioned in GitHub

bhavitvyamalik/OCR-using-CRNN

Mentioned in GitHub

SYR-Aegis/BrailleOCR

pytorch

Mentioned in GitHub

Media-Smart/vedastr

pytorch

Mentioned in GitHub

L706077/OCR-CRNN

pytorch

Mentioned in GitHub

chandan5362/Indian-Number-Plate-Recognition

Mentioned in GitHub

wacr2008/tensorflow_crnn

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
scene-text-recognition-on-icdar-2003	CRNN	Accuracy: 89.4
scene-text-recognition-on-icdar2013	CRNN	Accuracy: 86.7
scene-text-recognition-on-svt	CRNN	Accuracy: 80.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Baoguang Shi; Xiang Bai; Cong Yao

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters