HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Few-NERD: A Few-Shot Named Entity Recognition Dataset

Ning Ding Guangwei Xu Yulin Chen Xiaobin Wang Xu Han Pengjun Xie Hai-Tao Zheng Zhiyuan Liu

Few-NERD: A Few-Shot Named Entity Recognition Dataset

Abstract

Recently, considerable literature has grown up around the theme of few-shot named entity recognition (NER), but little published benchmark data specifically focused on the practical and challenging task. Current approaches collect existing supervised NER datasets and re-organize them to the few-shot setting for empirical study. These strategies conventionally aim to recognize coarse-grained entity types with few examples, while in practice, most unseen entity types are fine-grained. In this paper, we present Few-NERD, a large-scale human-annotated few-shot NER dataset with a hierarchy of 8 coarse-grained and 66 fine-grained entity types. Few-NERD consists of 188,238 sentences from Wikipedia, 4,601,160 words are included and each is annotated as context or a part of a two-level entity type. To the best of our knowledge, this is the first few-shot NER dataset and the largest human-crafted NER dataset. We construct benchmark tasks with different emphases to comprehensively assess the generalization capability of models. Extensive empirical results and analysis show that Few-NERD is challenging and the problem requires further research. We make Few-NERD public at https://ningding97.github.io/fewnerd/.

Code Repositories

katzurik/neretrieve
Mentioned in GitHub
wangpeiyi9979/esd
pytorch
Mentioned in GitHub
thunlp/Few-NERD
Official
pytorch
Mentioned in GitHub
psunlpgroup/container
pytorch
Mentioned in GitHub
renll/sparselt
pytorch
Mentioned in GitHub
zifengcheng/cdap
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
few-shot-ner-on-few-nerd-interNNShot
10 way 1~2 shot: 38.87±0.21
10 way 5~10 shot: 49.57±2.73
5 way 1~2 shot: 47.24±1.00
5 way 5~10 shot: 55.64±0.63
few-shot-ner-on-few-nerd-interProtoBERT
10 way 1~2 shot: 32.45±0.79
10 way 5~10 shot: 52.92±0.37
5 way 1~2 shot: 38.83±1.49
5 way 5~10 shot: 58.79±0.44
few-shot-ner-on-few-nerd-interStructShot
10 way 1~2 shot: 43.34±0.10
10 way 5~10 shot: 49.57±3.08
5 way 1~2 shot: 51.88±0.69
5 way 5~10 shot: 57.32±0.63
few-shot-ner-on-few-nerd-intraProtoBERT
10 way 1~2 shot: 15.05±0.44
10 way 5~10 shot: 35.40±0.13
5 way 1~2 shot: 20.76±0.84
5 way 5~10 shot: 42.54±0.94
few-shot-ner-on-few-nerd-intraNNShot
10 way 1~2 shot: 18.27±0.41
10 way 5~10 shot: 27.38±0.53
5 way 1~2 shot: 25.78±0.91
5 way 5~10 shot: 36.18±0.79
few-shot-ner-on-few-nerd-intraStructShot
10 way 1~2 shot: 21.03±1.13
10 way 5~10 shot: 26.42±0.60
5 way 1~2 shot: 30.21±0.90
5 way 5~10 shot: 38.00±1.29
named-entity-recognition-on-few-nerd-supBERT-Tagger
F1-Measure: 67.13
Precision: 65.56
Recall: 68.78

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Few-NERD: A Few-Shot Named Entity Recognition Dataset | Papers | HyperAI