HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition

Denis Coquenet Clément Chatelain Thierry Paquet

SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition

Abstract

Unconstrained handwriting recognition is an essential task in document analysis. It is usually carried out in two steps. First, the document is segmented into text lines. Second, an Optical Character Recognition model is applied on these line images. We propose the Simple Predict & Align Network: an end-to-end recurrence-free Fully Convolutional Network performing OCR at paragraph level without any prior segmentation stage. The framework is as simple as the one used for the recognition of isolated lines and we achieve competitive results on three popular datasets: RIMES, IAM and READ 2016. The proposed model does not require any dataset adaptation, it can be trained from scratch, without segmentation labels, and it does not require line breaks in the transcription labels. Our code and trained model weights are available at https://github.com/FactoDeepLearning/SPAN.

Code Repositories

FactoDeepLearning/SPAN
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
handwritten-text-recognition-on-read2016-lineSpan
Test CER: 4.6
Test WER: 21.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition | Papers | HyperAI