HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition

Yongqi Li Yu Yu Tieyun Qian

Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition

Abstract

Despite the recent success achieved by several two-stage prototypical networks in few-shot named entity recognition (NER) task, the overdetected false spans at the span detection stage and the inaccurate and unstable prototypes at the type classification stage remain to be challenging problems. In this paper, we propose a novel Type-Aware Decomposed framework, namely TadNER, to solve these problems. We first present a type-aware span filtering strategy to filter out false spans by removing those semantically far away from type names. We then present a type-aware contrastive learning strategy to construct more accurate and stable prototypes by jointly exploiting support samples and type names as references. Extensive experiments on various benchmarks prove that our proposed TadNER framework yields a new state-of-the-art performance. Our code and data will be available at https://github.com/NLPWM-WHU/TadNER.

Code Repositories

nlpwm-whu/tadner
Official
pytorch
Mentioned in GitHub
liyongqi2002/TadNER
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
few-shot-ner-on-few-nerd-interTadNER
10 way 1~2 shot: 64.06±0.19
10 way 5~10 shot: 69.94±0.15
5 way 1~2 shot: 64.83±0.14
5 way 5~10 shot: 72.12±0.12
few-shot-ner-on-few-nerd-intraTadNER
10 way 1~2 shot: 55.44±0.08
10 way 5~10 shot: 60.87±0.22
5 way 1~2 shot: 60.78±0.32
5 way 5~10 shot: 67.94±0.17

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition | Papers | HyperAI