4 months ago

Teaching Machines to Code: Neural Markup Generation with Visual Attention

Sumeet S. Singh

Abstract

We present a neural transducer model with visual attention that learns to generate LaTeX markup of a real-world math formula given its image. Applying sequence modeling and transduction techniques that have been very successful across modalities such as natural language, image, handwriting, speech, and audio, we construct an image-to-markup model that learns to produce syntactically and semantically correct LaTeX markup code over 150 words long and achieves a BLEU score of 89%, improving upon the previous state of the art for the Im2Latex problem. We also demonstrate with heat-map visualization how attention helps in interpreting the model and can pinpoint (detect and localize) symbols on the image accurately despite having been trained without any bounding box data.

Code Repositories

untrix/im2latex (TensorFlow, mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
optical-character-recognition-on-i2l-140k | I2L-NOPOOL | BLEU: 89.09%
optical-character-recognition-on-i2l-140k | I2L-STRIPS | BLEU: 89%
optical-character-recognition-on-im2latex-1 | I2L-STRIPS | BLEU: 88.86%
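
The benchmark figures above are BLEU scores, which measure n-gram overlap between the generated markup and the reference markup. This page does not specify the tokenization or BLEU variant used in the paper, so the following is only a minimal sketch of standard sentence-level BLEU-4 (single reference, uniform weights, no smoothing). The function names `ngrams` and `bleu` and the sample token sequences are illustrative assumptions, not taken from the paper or its repository.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count every contiguous n-gram in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, reference, max_n=4):
    """Sentence-level BLEU against one reference, without smoothing.

    Assumption: tokens are whitespace-separated LaTeX tokens; the paper's
    actual tokenization is not given on this page.
    """
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        ref_counts = ngrams(reference, n)
        total = sum(cand_counts.values())
        # Clip each candidate n-gram count by its count in the reference.
        clipped = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        if total == 0 or clipped == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU 0
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    c, r = len(candidate), len(reference)
    bp = 1.0 if c > r else math.exp(1 - r / c)
    # Geometric mean of the n-gram precisions, scaled by the penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


# Hypothetical example: the prediction gets one symbol wrong.
reference = r"\frac { a } { b } + c".split()
candidate = r"\frac { a } { d } + c".split()
exact_score = bleu(reference, reference)
partial_score = bleu(candidate, reference)
print(f"exact match: {exact_score:.4f}, one wrong symbol: {partial_score:.4f}")
```

A single wrong token lowers the score more than its share of the sequence suggests, because it also breaks every bigram, trigram, and 4-gram that spans it.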
