HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Bilingual Rhetorical Structure Parsing with Large Parallel Annotations

Elena Chistova

Bilingual Rhetorical Structure Parsing with Large Parallel Annotations

Abstract

Discourse parsing is a crucial task in natural language processing that aims to reveal the higher-level relations in a text. Despite growing interest in cross-lingual discourse parsing, challenges persist due to limited parallel data and inconsistencies in the Rhetorical Structure Theory (RST) application across languages and corpora. To address this, we introduce a parallel Russian annotation for the large and diverse English GUM RST corpus. Leveraging recent advances, our end-to-end RST parser achieves state-of-the-art results on both English and Russian corpora. It demonstrates effectiveness in both monolingual and bilingual settings, successfully transferring even with limited second-language annotation. To the best of our knowledge, this work is the first to evaluate the potential of cross-lingual end-to-end RST parsing on a manually annotated parallel corpus.

Code Repositories

tchewik/bilingualrsp
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
discourse-parsing-on-rst-dtDMRST
Standard Parseval (Full): 55.7 ± 0.3
Standard Parseval (Nuclearity): 68.0 ± 0.6
Standard Parseval (Relation): 57.3 ± 0.2
Standard Parseval (Span): 78.7 ± 0.4
end-to-end-rst-parsing-on-rst-dt-1DMRST + ToNy + E-BiLSTM
Standard Parseval (Full): 53.0 ± 0.7
Standard Parseval (Nuclearity): 64.5 ± 0.8
Standard Parseval (Relation): 54.5 ± 0.7
Standard Parseval (Span): 74.8 ± 0.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Bilingual Rhetorical Structure Parsing with Large Parallel Annotations | Papers | HyperAI