LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

Wang Jiapeng; Jin Lianwen; Ding Kai

Abstract

Structured document understanding has attracted considerable attention and made significant progress recently, owing to its crucial role in intelligent document processing. However, most existing related models can only deal with the document data of specific language(s) (typically English) included in the pre-training collection, which is extremely limited. To address this issue, we propose a simple yet effective Language-independent Layout Transformer (LiLT) for structured document understanding. LiLT can be pre-trained on the structured documents of a single language and then directly fine-tuned on other languages with the corresponding off-the-shelf monolingual/multilingual pre-trained textual models. Experimental results on eight languages have shown that LiLT can achieve competitive or even superior performance on diverse widely-used downstream benchmarks, which enables language-independent benefit from the pre-training of document layout structure. Code and model are publicly available at https://github.com/jpWang/LiLT.
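The reuse pattern described in the abstract (keep the layout parameters learned on one language, pair them with a separately pre-trained text model for another) can be illustrated with a toy weight-merging sketch. This is not the authors' code: the function `combine_weights`, the `layout.` and `text.` key prefixes, and the list-of-floats "tensors" are all assumptions made up for this illustration.

```python
from typing import Dict, List

# Toy stand-in for a checkpoint: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def combine_weights(layout_pretrained: Weights, text_encoder: Weights) -> Weights:
    """Merge a pre-trained layout stream with a separately pre-trained text encoder.

    The "layout." and "text." prefixes are hypothetical naming conventions
    chosen for this sketch, not the key names of any released checkpoint.
    """
    combined: Weights = {}
    # Keep only the layout parameters from the layout-pre-trained checkpoint.
    for name, value in layout_pretrained.items():
        if name.startswith("layout."):
            combined[name] = list(value)
    # Take every text parameter from the text model of the target language.
    for name, value in text_encoder.items():
        combined["text." + name] = list(value)
    return combined


# Toy checkpoint pre-trained on English documents (layout + English text parameters).
english_pretrained = {
    "layout.embed": [0.1, 0.2],
    "text.embed": [1.0, 1.0],
}
# Toy stand-in for an off-the-shelf multilingual text encoder.
multilingual_text = {"embed": [5.0, 6.0, 7.0]}

model_init = combine_weights(english_pretrained, multilingual_text)
print(sorted(model_init))        # parameter names in the merged initialisation
print(model_init["text.embed"])  # text weights now come from the multilingual model
```

The merged dictionary would then serve as the starting point for fine-tuning on the target language; the English text parameters are discarded while the layout parameters carry over unchanged.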

Benchmarks

Benchmark | Methodology | Metrics
Document image classification on RVL-CDIP | LiLT[EN-R]BASE | Accuracy: 95.68%
Key information extraction on CORD | LiLT | F1: 96.07
Key-value pair extraction on RFUND-EN | LiLT ([InfoXLM]_base) | Key-value pair F1: 52.18
Key-value pair extraction on RFUND-EN | LiLT ([EN-R]_base) | Key-value pair F1: 54.33
Key-value pair extraction on SIBR | LiLT ([InfoXLM]_base) | Key-value pair F1: 72.76
Semantic entity labeling on FUNSD | LiLT | F1: 88.41
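
For readers unfamiliar with the metric, F1 is the harmonic mean of precision and recall. The sketch below computes it over sets of predicted and gold key-value pairs under exact matching. The matching criterion and the helper name `pair_f1` are assumptions for illustration; each benchmark defines its own evaluation protocol, and the scores listed above appear to be reported on a 0 to 100 scale.

```python
from typing import Set, Tuple

Pair = Tuple[str, str]  # (key text, value text)


def pair_f1(predicted: Set[Pair], gold: Set[Pair]) -> float:
    """Return F1 in [0, 1] for exact-match key-value pairs."""
    if not predicted or not gold:
        return 0.0
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Made-up example data: two of four predictions match the three gold pairs.
gold_pairs = {("Name", "Alice"), ("Date", "2022-01-01"), ("Total", "42")}
predicted_pairs = {
    ("Name", "Alice"),
    ("Date", "2022-01-01"),
    ("Total", "24"),
    ("Tax", "3"),
}

score = pair_f1(predicted_pairs, gold_pairs)
print(f"F1: {100 * score:.2f}")
```

Here precision is 2/4 and recall is 2/3, so the score is 4/7 before scaling to a percentage.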
