HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network

Denis Coquenet Clément Chatelain Thierry Paquet

End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network

Abstract

Unconstrained handwritten text recognition remains challenging for computer vision systems. Paragraph text recognition is traditionally achieved by two models: the first one for line segmentation and the second one for text line recognition. We propose a unified end-to-end model using hybrid attention to tackle this task. This model is designed to iteratively process a paragraph image line by line. It can be split into three modules. An encoder generates feature maps from the whole paragraph image. Then, an attention module recurrently generates a vertical weighted mask enabling to focus on the current text line features. This way, it performs a kind of implicit line segmentation. For each text line features, a decoder module recognizes the character sequence associated, leading to the recognition of a whole paragraph. We achieve state-of-the-art character error rate at paragraph level on three popular datasets: 1.91% for RIMES, 4.45% for IAM and 3.59% for READ 2016. Our code and trained model weights are available at https://github.com/FactoDeepLearning/VerticalAttentionOCR.

Code Repositories

FactoDeepLearning/VerticalAttentionOCR
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
handwritten-text-recognition-on-iamVAN
CER: 4.32
WER: 16.24
handwritten-text-recognition-on-iam-lineVAN
Test CER: 5.0
Test WER: 16.3
handwritten-text-recognition-on-read2016-lineVAN
Test CER: 4.1
Test WER: 16.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network | Papers | HyperAI