HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

Heng-Da Xu; Zhongli Li; Qingyu Zhou; Chao Li; Zizhen Wang; Yunbo Cao; Heyan Huang; Xian-Ling Mao

Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

Abstract

Chinese Spell Checking (CSC) aims to detect and correct erroneous characters for user-generated text in the Chinese language. Most of the Chinese spelling errors are misused semantically, phonetically or graphically similar characters. Previous attempts noticed this phenomenon and try to use the similarity for this task. However, these methods use either heuristics or handcrafted confusion sets to predict the correct character. In this paper, we propose a Chinese spell checker called ReaLiSe, by directly leveraging the multimodal information of the Chinese characters. The ReaLiSe model tackles the CSC task by (1) capturing the semantic, phonetic and graphic information of the input characters, and (2) selectively mixing the information in these modalities to predict the correct output. Experiments on the SIGHAN benchmarks show that the proposed model outperforms strong baselines by a large margin.

Code Repositories

DaDaMrX/ReaLiSe
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
chinese-spell-checking-on-sighan-2015ReaLiSe
Correction F1: 77.8
Detection F1: 79.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking | Papers | HyperAI