Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition

LiRong Dai, Si Wei, Jinshui Hu, Yulong Hu, Dan Liu, Shiliang Zhang, Jun Du, Jianshu Zhang

Abstract

Machine recognition of a handwritten mathematical expression (HME) is challenging due to the ambiguities of handwritten symbols and the two-dimensional structure of mathematical expressions. Inspired by recent work in deep learning, we present Watch, Attend and Parse (WAP), a novel end-to-end approach based on neural networks that learns to recognize HMEs in a two-dimensional layout and outputs them as one-dimensional character sequences in LaTeX format. Unlike traditional methods, our proposed model avoids problems that stem from symbol segmentation, and it does not require a predefined expression grammar. The problems of symbol recognition and structural analysis are handled by a watcher and a parser, respectively. The watcher is a convolutional neural network encoder that takes HME images as input, and the parser is a recurrent neural network decoder equipped with an attention mechanism that generates LaTeX sequences. Moreover, the correspondence between the input expressions and the output LaTeX sequences is learned automatically by the attention mechanism. We validate the proposed approach on a benchmark published by the CROHME international competition. Using the official training dataset, WAP significantly outperformed the state-of-the-art method with an expression recognition accuracy of 46.55% on CROHME 2014 and 44.55% on CROHME 2016.
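To make the role of the attention mechanism in the parser more concrete, the following is a minimal sketch of a single attention step in plain Python. It assumes the watcher has already produced one feature vector per image location and that the parser holds a hidden state of the same dimension. The names `attend`, `annotations`, and `state` are hypothetical, and the dot-product scoring used here is a simplification chosen for illustration; the abstract does not specify the scoring function WAP uses.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(annotations, state):
    """One attention step (illustrative, not the WAP scoring function).

    annotations: list of feature vectors, one per image location
                 (assumed output of the CNN encoder, flattened).
    state:       current decoder hidden state, same length as each vector.
    Returns the attention weights and the weighted-sum context vector.
    """
    # Assumption: dot-product relevance score between state and each location.
    scores = [sum(a * s for a, s in zip(vec, state)) for vec in annotations]
    weights = softmax(scores)
    dim = len(annotations[0])
    context = [
        sum(w * vec[d] for w, vec in zip(weights, annotations))
        for d in range(dim)
    ]
    return weights, context


# Three image locations with 2-dimensional features (toy values).
annotations = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
state = [2.0, 0.0]
weights, context = attend(annotations, state)
```

In a full decoder, the context vector would be fed, together with the previous output symbol, into the recurrent unit that predicts the next LaTeX token. The weights show which image locations the parser is attending to at that step.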

Benchmarks

Benchmark                               Methodology   Metrics
handwritten-mathmatical-expression      WAP           ExpRate: 46.55%
handwritten-mathmatical-expression-1    WAP           ExpRate: 44.55%
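
As a rough illustration of the ExpRate metric reported above, the sketch below computes the percentage of expressions whose predicted LaTeX sequence matches the reference exactly. It assumes that LaTeX strings are tokenized on whitespace and that a match means the token sequences are identical; the function name `exp_rate` and the toy data are hypothetical, and the official CROHME evaluation tooling may compare expressions differently.

```python
def exp_rate(predictions, references):
    """Percentage of predictions whose token sequence equals the reference.

    Assumption: tokens are separated by whitespace, so differences in
    spacing alone do not count as errors.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have equal length")
    if not references:
        return 0.0
    correct = sum(
        pred.split() == ref.split()
        for pred, ref in zip(predictions, references)
    )
    return 100.0 * correct / len(references)


# Toy data: the second prediction has one wrong symbol.
predictions = ["x ^ { 2 }", "a + b", "\\frac { 1 } { 2 }", "y"]
references = ["x ^ { 2 }", "a - b", "\\frac { 1 } { 2 }", "y"]
score = exp_rate(predictions, references)  # 3 of 4 correct -> 75.0
```

Because a single wrong symbol makes the whole expression count as incorrect, ExpRate is a strict metric, which helps explain why the reported values stay below 50%.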
