Decomposed Meta-Learning for Few-Shot Sequence Labeling

Chin-Yew Lin, Tiejun Zhao, Börje F. Karlsson, Jieru Lin, Huiqiang Jiang, Qianhui Wu, Tingting Ma

Abstract

Few-shot sequence labeling is a general problem formulation for many natural language understanding tasks in data-scarcity scenarios, which require models to generalize to new types via only a few labeled examples. Recent advances mostly adopt metric-based meta-learning and thus face the challenges of modeling the miscellaneous Other prototype and the inability to generalize to classes with large domain gaps. To overcome these challenges, we propose a decomposed meta-learning framework for few-shot sequence labeling that breaks down the task into few-shot mention detection and few-shot type classification, and sequentially tackles them via meta-learning. Specifically, we employ model-agnostic meta-learning (MAML) to prompt the mention detection model to learn boundary knowledge shared across types. With the detected mention spans, we further leverage the MAML-enhanced span-level prototypical network for few-shot type classification. In this way, the decomposition framework bypasses the requirement of modeling the miscellaneous Other prototype. Meanwhile, the adoption of the MAML algorithm enables us to explore the knowledge contained in support examples more efficiently, so that our model can quickly adapt to new types using only a few labeled examples. Under our framework, we explore a basic implementation that uses two separate models for the two subtasks. We further propose a joint model to reduce model size and inference time, making our framework more applicable to scenarios with limited resources. Extensive experiments on nine benchmark datasets, including named entity recognition, slot tagging, event detection, and part-of-speech tagging, show that the proposed approach achieves state-of-the-art performance across various few-shot sequence labeling tasks.
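The type-classification stage described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the encoder and the MAML adaptation step are omitted, and the mean pooling over span tokens, the Euclidean distance, the toy 2-dimensional vectors, and the function names (`span_embedding`, `build_prototypes`, `classify`) are all assumptions made for illustration. The point it shows is that once mention spans are already detected, prototypes are needed only for the real types, so no Other prototype is built.

```python
import math
from collections import defaultdict

def span_embedding(token_vecs, start, end):
    """Mean-pool token vectors over the inclusive span [start, end] (assumed pooling)."""
    span = token_vecs[start:end + 1]
    dim = len(span[0])
    return [sum(v[d] for v in span) / len(span) for d in range(dim)]

def build_prototypes(support_spans):
    """Average the support span embeddings of each type; no 'Other' class is created."""
    grouped = defaultdict(list)
    for emb, label in support_spans:
        grouped[label].append(emb)
    return {
        label: [sum(col) / len(embs) for col in zip(*embs)]
        for label, embs in grouped.items()
    }

def classify(span_emb, prototypes):
    """Return the type whose prototype is nearest in Euclidean distance."""
    return min(prototypes, key=lambda label: math.dist(span_emb, prototypes[label]))

# Toy support set: hypothetical token vectors standing in for encoder output.
support_tokens = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
support = [
    (span_embedding(support_tokens, 0, 1), "person"),
    (span_embedding(support_tokens, 2, 3), "location"),
]
prototypes = build_prototypes(support)

# A query span that the mention detector is assumed to have already found.
query_tokens = [[0.8, 0.2], [0.7, 0.1]]
predicted = classify(span_embedding(query_tokens, 0, 1), prototypes)
print(predicted)  # person
```

In the framework the abstract describes, the spans passed to the classifier would come from the few-shot mention detection model rather than being given by hand as they are here.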

Benchmarks

Benchmark: few-shot-ner-on-few-nerd-inter
Methodology: DecomposedMetaSL
Metrics:
10 way 1~2 shot: 55.61±0.32
10 way 5~10 shot: 67.85±0.18
5 way 1~2 shot: 62.09±0.93
5 way 5~10 shot: 71.26±0.15
Average: 63.99

Benchmark: few-shot-ner-on-few-nerd-intra
Methodology: DecomposedMetaSL
Metrics:
10 way 1~2 shot: 43.03±0.29
10 way 5~10 shot: 57.58±0.26
5 way 1~2 shot: 49.90±0.33
5 way 5~10 shot: 64.36±0.20
Average: 53.72

Benchmark: pos-tagging-on-twitter-pos
Methodology: DecomposedMetaSL
Metrics:
Accuracy: 81.01±0.15

Benchmark: pos-tagging-on-wsj-pos
Methodology: DecomposedMetaSL
Metrics:
Accuracy: 91.78±0.21

Benchmark: slot-filling-on-snips
Methodology: DecomposedMetaSL
Metrics:
F1 (1-shot) avg: 74.89
F1 (5-shot) avg: 84.54
