3 months ago

Enhancing Structure-aware Encoder with Extremely Limited Data for Graph-based Dependency Parsing

Fei Xia, Yan Song, Yuanhe Tian

Abstract

Dependency parsing is a fundamental natural language processing task that analyzes the syntactic structure of an input sentence by identifying the syntactic relations between its words. Leveraging existing dependency parsers and extra data (e.g., through semi-supervised learning) has been shown to improve dependency parsing, even though the final parsers are trained on inaccurate (but massive) data. In this paper, we propose a frustratingly easy approach to improve graph-based dependency parsing: a structure-aware encoder is pre-trained on auto-parsed data by predicting word dependencies and then fine-tuned on gold dependency trees. This differs from the usual pre-training process, which aims to predict the context words along dependency paths. Experimental results and analyses demonstrate that our approach is effective and robust in benefiting from data (even noisy data) processed by different parsers: it outperforms strong baselines under different settings, with different dependency standards and model architectures used in pre-training and fine-tuning. More importantly, further analyses find that only 2K auto-parsed sentences are required to obtain an improvement when pre-training a vanilla BERT-large-based parser, without requiring extra parameters.
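
The pre-training signal described in the abstract, predicting word dependencies from auto-parsed sentences, can be illustrated with a minimal sketch of how training targets might be derived from a parser's output. The head-index encoding (0 for the root), the function name `dependency_targets`, and the binary word-pair matrix are illustrative assumptions, not the paper's implementation.

```python
from typing import List


def dependency_targets(heads: List[int]) -> List[List[int]]:
    """Build a binary word-pair target matrix from head indices.

    heads[i] is the 1-based index of the head of word i+1, with 0 marking
    the root (a CoNLL-style convention, assumed for this sketch).
    targets[i][j] == 1 means word j+1 is the head of word i+1.
    """
    n = len(heads)
    targets = [[0] * n for _ in range(n)]
    for dependent, head in enumerate(heads):
        if head < 0 or head > n:
            raise ValueError(f"head index {head} out of range for {n} words")
        if head == 0:
            continue  # the root has no head inside the sentence
        targets[dependent][head - 1] = 1
    return targets


# Hypothetical auto-parsed sentence: "She reads books".
# "reads" is the root; "She" and "books" both attach to "reads".
auto_parsed_heads = [2, 0, 2]
for row in dependency_targets(auto_parsed_heads):
    print(row)
```

In a setup like this, an encoder would be trained to reproduce such a matrix from the raw words, so the targets may contain parser errors; the abstract's claim is that pre-training on this noisy signal still helps once the model is fine-tuned on gold trees.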

Benchmarks

Benchmark | Methodology | Metrics
dependency-parsing-on-penn-treebank | DMPar + XLNet | LAS: 95.92, UAS: 97.30
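
For readers unfamiliar with the two metrics, the sketch below shows how unlabeled and labeled attachment scores (UAS and LAS) are commonly computed: UAS counts words whose predicted head is correct, and LAS additionally requires the relation label to match. The function name and the example arcs are hypothetical. Published evaluations often exclude punctuation tokens, which this sketch does not do.

```python
from typing import List, Tuple

Arc = Tuple[int, str]  # (head index, relation label) for one word


def attachment_scores(gold: List[Arc], pred: List[Arc]) -> Tuple[float, float]:
    """Return (UAS, LAS) as percentages over all words."""
    if len(gold) != len(pred):
        raise ValueError("gold and predicted arcs must have the same length")
    if not gold:
        raise ValueError("cannot score an empty sentence")
    # UAS: the predicted head matches the gold head.
    uas_hits = sum(g[0] == p[0] for g, p in zip(gold, pred))
    # LAS: both the head and the relation label match.
    las_hits = sum(g == p for g, p in zip(gold, pred))
    total = len(gold)
    return 100.0 * uas_hits / total, 100.0 * las_hits / total


# Hypothetical four-word sentence: one wrong label and one wrong head.
gold = [(2, "nsubj"), (0, "root"), (2, "obj"), (2, "punct")]
pred = [(2, "nsubj"), (0, "root"), (2, "nmod"), (3, "punct")]
uas, las = attachment_scores(gold, pred)
print(f"UAS: {uas:.2f}  LAS: {las:.2f}")
```

Because a correct label only counts when the head is also correct, LAS can never exceed UAS, which is consistent with the scores reported above.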
