TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes

Long Shangbang; Ruan Jiaqiang; Zhang Wenjie; He Xin; Wu Wenhao; Yao Cong

Abstract

Driven by deep neural networks and large scale datasets, scene text detection methods have progressed substantially over the past years, continuously refreshing the performance records on various standard benchmarks. However, limited by the representations (axis-aligned rectangles, rotated rectangles or quadrangles) adopted to describe text, existing methods may fall short when dealing with much more free-form text instances, such as curved text, which are actually very common in real-world scenarios. To tackle this problem, we propose a more flexible representation for scene text, termed as TextSnake, which is able to effectively represent text instances in horizontal, oriented and curved forms. In TextSnake, a text instance is described as a sequence of ordered, overlapping disks centered at symmetric axes, each of which is associated with potentially variable radius and orientation. Such geometry attributes are estimated via a Fully Convolutional Network (FCN) model. In experiments, the text detector based on TextSnake achieves state-of-the-art or comparable performance on Total-Text and SCUT-CTW1500, the two newly published benchmarks with special emphasis on curved text in natural images, as well as the widely-used datasets ICDAR 2015 and MSRA-TD500. Specifically, TextSnake outperforms the baseline on Total-Text by more than 40% in F-measure.
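
The disk-based representation described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the `Disk` class, the helper functions, and the synthetic half-circle instance are all hypothetical names and values chosen for illustration, and the FCN that would predict these attributes is omitted entirely. The sketch only shows how an ordered sequence of overlapping disks, each with a center, radius, and orientation, can cover a curved text region.

```python
import math
from dataclasses import dataclass


@dataclass
class Disk:
    """One disk on the text center line (hypothetical structure)."""
    cx: float      # x coordinate of the disk center
    cy: float      # y coordinate of the disk center
    radius: float  # roughly half of the local text height
    theta: float   # orientation of the center line at this disk, in radians


def disks_overlap(a: Disk, b: Disk) -> bool:
    """True if two disks share some area."""
    return math.hypot(a.cx - b.cx, a.cy - b.cy) < a.radius + b.radius


def point_in_text(disks: list[Disk], x: float, y: float) -> bool:
    """Treat the text region as the union of all disks."""
    return any(math.hypot(x - d.cx, y - d.cy) <= d.radius for d in disks)


def make_arc_instance(n: int = 12, arc_radius: float = 50.0,
                      disk_radius: float = 10.0) -> list[Disk]:
    """Build a synthetic curved instance: disks ordered along a half circle."""
    disks = []
    for i in range(n):
        phi = math.pi * i / (n - 1)          # angle along the arc, 0..pi
        cx = arc_radius * math.cos(phi)
        cy = arc_radius * math.sin(phi)
        theta = phi + math.pi / 2            # tangent direction of the arc
        disks.append(Disk(cx, cy, disk_radius, theta))
    return disks


if __name__ == "__main__":
    instance = make_arc_instance()
    # Consecutive disks must overlap for the sequence to cover the text.
    print(all(disks_overlap(a, b) for a, b in zip(instance, instance[1:])))
    print(point_in_text(instance, 0.0, 50.0))  # on the arc
    print(point_in_text(instance, 0.0, 0.0))   # center of the circle, off the arc
```

A rectangle or quadrangle would need to enclose the whole arc, including the empty middle, whereas the union of disks follows the curve closely; that is the flexibility the abstract refers to.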

Code Repositories

speakstone/TextSnake (PyTorch, mentioned in GitHub)
princewang1994/TextSnake.pytorch (PyTorch, mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
curved-text-detection-on-scut-ctw1500 | TextSnake (Long et al., 2018) | F-Measure: 75.6%
scene-text-detection-on-icdar-2015 | PixelLink+VGG16 2s | F-Measure: 83.7%, Precision: 85.5%, Recall: 82%
scene-text-detection-on-msra-td500 | TextSnake | F-Measure: 78.3%, Precision: 83.2%, Recall: 73.9%
scene-text-detection-on-scut-ctw1500 | TextSnake | F-Measure: 75.6%, Precision: 67.9%, Recall: 85.3%
scene-text-detection-on-total-text | TextSnake | F-Measure: 78.4%, Precision: 82.7%, Recall: 74.5%
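
The F-measure figures listed for these benchmarks are consistent with the standard harmonic mean of precision and recall, F = 2PR / (P + R). The short check below recomputes them from the listed precision and recall values; the function name and the dictionary layout are illustrative choices, not part of any benchmark tooling.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both in the same unit, e.g. percent)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, listed F-measure), copied from the benchmark entries.
listed = {
    "icdar-2015": (85.5, 82.0, 83.7),
    "msra-td500": (83.2, 73.9, 78.3),
    "scut-ctw1500": (67.9, 85.3, 75.6),
    "total-text": (82.7, 74.5, 78.4),
}

for name, (p, r, f_listed) in listed.items():
    f_computed = round(f_measure(p, r), 1)
    print(f"{name}: computed {f_computed}, listed {f_listed}")
```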
