HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Decomposed Meta-Learning for Few-Shot Named Entity Recognition

Tingting Ma Huiqiang Jiang Qianhui Wu Tiejun Zhao Chin-Yew Lin

Decomposed Meta-Learning for Few-Shot Named Entity Recognition

Abstract

Few-shot named entity recognition (NER) systems aim at recognizing novel-class named entities based on only a few labeled examples. In this paper, we present a decomposed meta-learning approach which addresses the problem of few-shot NER by sequentially tackling few-shot span detection and few-shot entity typing using meta-learning. In particular, we take the few-shot span detection as a sequence labeling problem and train the span detector by introducing the model-agnostic meta-learning (MAML) algorithm to find a good model parameter initialization that could fast adapt to new entity classes. For few-shot entity typing, we propose MAML-ProtoNet, i.e., MAML-enhanced prototypical networks to find a good embedding space that can better distinguish text span representations from different entity classes. Extensive experiments on various benchmarks show that our approach achieves superior performance over prior methods.

Code Repositories

microsoft/vert-papers
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
few-shot-ner-on-few-nerd-interDecomposedMetaNER
10 way 1~2 shot: 58.65±0.43
10 way 5~10 shot: 68.11±0.05
5 way 1~2 shot: 64.75±0.35
5 way 5~10 shot: 71.49±0.47
few-shot-ner-on-few-nerd-intraDecomposedMetaNER
10 way 1~2 shot: 42.84±0.46
10 way 5~10 shot: 57.31±0.25
5 way 1~2 shot: 49.48±0.85
5 way 5~10 shot: 62.92±0.57

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Decomposed Meta-Learning for Few-Shot Named Entity Recognition | Papers | HyperAI