HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

A Multi-Object Rectified Attention Network for Scene Text Recognition

Canjie Luo; Lianwen Jin; Zenghui Sun

A Multi-Object Rectified Attention Network for Scene Text Recognition

Abstract

Irregular text is widely used. However, it is considerably difficult to recognize because of its various shapes and distorted patterns. In this paper, we thus propose a multi-object rectified attention network (MORAN) for general scene text recognition. The MORAN consists of a multi-object rectification network and an attention-based sequence recognition network. The multi-object rectification network is designed for rectifying images that contain irregular text. It decreases the difficulty of recognition and enables the attention-based sequence recognition network to more easily read irregular text. It is trained in a weak supervision way, thus requiring only images and corresponding text labels. The attention-based sequence recognition network focuses on target characters and sequentially outputs the predictions. Moreover, to improve the sensitivity of the attention-based sequence recognition network, a fractional pickup method is proposed for an attention-based decoder in the training phase. With the rectification mechanism, the MORAN can read both regular and irregular scene text. Extensive experiments on various benchmarks are conducted, which show that the MORAN achieves state-of-the-art performance. The source code is available.

Code Repositories

jeasung-pf/MORAN_v2
pytorch
Mentioned in GitHub
ModelBunker/MORAN-PyTorch
pytorch
Mentioned in GitHub
lzmisscc/emoran
pytorch
Mentioned in GitHub
dipu-bd/craft-moran-ocr
pytorch
Mentioned in GitHub
Canjie-Luo/MORAN_v2
Official
pytorch
Mentioned in GitHub
agiletechvn/moran_v2_text_recognition
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
optical-character-recognition-on-benchmarkingMORAN
Accuracy (%): 64.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
A Multi-Object Rectified Attention Network for Scene Text Recognition | Papers | HyperAI