HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

BN-DRISHTI: Bangla Document Recognition through Instance-level Segmentation of Handwritten Text Images

Jubaer Sheikh Mohammad ; Tabassum Nazifa ; Rahman Md. Ataur ; Islam Mohammad Khairul

BN-DRISHTI: Bangla Document Recognition through Instance-level
  Segmentation of Handwritten Text Images

Abstract

Handwriting recognition remains challenging for some of the most spokenlanguages, like Bangla, due to the complexity of line and word segmentationbrought by the curvilinear nature of writing and lack of quality datasets. Thispaper solves the segmentation problem by introducing a state-of-the-art method(BN-DRISHTI) that combines a deep learning-based object detection framework(YOLO) with Hough and Affine transformation for skew correction. However,training deep learning models requires a massive amount of data. Thus, we alsopresent an extended version of the BN-HTRd dataset comprising 786 full-pagehandwritten Bangla document images, line and word-level annotation forsegmentation, and corresponding ground truths for word recognition. Evaluationon the test portion of our dataset resulted in an F-score of 99.97% for lineand 98% for word segmentation. For comparative analysis, we used three externalBangla handwritten datasets, namely BanglaWriting, WBSUBNdb_text, and ICDAR2013, where our system outperformed by a significant margin, further justifyingthe performance of our approach on completely unseen samples.

Code Repositories

crusnic-corp/BN-DRISHTI
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
handwritten-line-segmentation-on-bn-htrdBN-DRISHTI Line Segmentation
F-Score: 0.9997
handwritten-word-segmentation-onBN-DRISHTI Word Segmentation
F-Score: 0.97
handwritten-word-segmentation-on-bn-htrdBN-DRISHTI Word Segmentation
F-Score: 0.98

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
BN-DRISHTI: Bangla Document Recognition through Instance-level Segmentation of Handwritten Text Images | Papers | HyperAI