HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

DiffusionSTR: Diffusion Model for Scene Text Recognition

Masato Fujitake

DiffusionSTR: Diffusion Model for Scene Text Recognition

Abstract

This paper presents Diffusion Model for Scene Text Recognition (DiffusionSTR), an end-to-end text recognition framework using diffusion models for recognizing text in the wild. While existing studies have viewed the scene text recognition task as an image-to-text transformation, we rethought it as a text-text one under images in a diffusion model. We show for the first time that the diffusion model can be applied to text recognition. Furthermore, experimental results on publicly available datasets show that the proposed method achieves competitive accuracy compared to state-of-the-art methods.

Benchmarks

BenchmarkMethodologyMetrics
scene-text-recognition-on-cute80DiffusionSTR
Accuracy: 92.5
scene-text-recognition-on-icdar2013DiffusionSTR
Accuracy: 97.1
scene-text-recognition-on-icdar2015DiffusionSTR
Accuracy: 86
scene-text-recognition-on-iiit5kDiffusionSTR
Accuracy: 97.3
scene-text-recognition-on-svtDiffusionSTR
Accuracy: 93.6
scene-text-recognition-on-svtpDiffusionSTR
Accuracy: 89.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
DiffusionSTR: Diffusion Model for Scene Text Recognition | Papers | HyperAI