Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition

Junhao Zheng; Zhanxian Liang; Haibin Chen; Qianli Ma


Abstract

Continual Learning for Named Entity Recognition (CL-NER) aims to learn a growing number of entity types over time from a stream of data. However, simply learning Other-Class in the same way as new entity types amplifies catastrophic forgetting and leads to a substantial performance drop. The main cause is that Other-Class samples usually contain old entity types, and the old knowledge in these Other-Class samples is not preserved properly. Using causal inference, we identify that the forgetting is caused by the missing causal effect from the old data. To this end, we propose a unified causal framework to retrieve the causality from both new entity types and Other-Class. Furthermore, we apply curriculum learning to mitigate the impact of label noise and introduce a self-adaptive weight for balancing the causal effects between new entity types and Other-Class. Experimental results on three benchmark datasets show that our method outperforms the state-of-the-art method by a large margin. Moreover, our method can be combined with existing state-of-the-art methods to improve performance in CL-NER.

Code Repositories

zzz47zzz/CFNER (Official, PyTorch; mentioned in GitHub)

Benchmarks

Benchmark | Methodology | F1 (macro) | F1 (micro)
fg-1-pg-1-on-2010-i2b2-va | CFNER | 0.3626 | 0.6273
fg-1-pg-1-on-conll2003 | CFNER | 0.7911 | 0.8091
fg-1-pg-1-on-ontonotes-5-0 | CFNER | 0.4222 | 0.5894
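
The benchmarks above report both macro and micro F1, and the two can differ widely on the same predictions. The sketch below shows how each average is usually computed from per-type counts of true positives, false positives, and false negatives. The entity types and counts are hypothetical toy values, not data from the paper, and the helper names `f1` and `macro_micro_f1` are illustrative only; the exact evaluation protocol behind the reported scores is not stated here.

```python
def f1(tp, fp, fn):
    """F1 from raw counts; taken as 0.0 when there are no true positives."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_micro_f1(counts):
    """counts maps entity type -> (tp, fp, fn). Returns (macro_f1, micro_f1)."""
    # Macro: compute F1 per entity type, then take the unweighted mean.
    per_type = [f1(tp, fp, fn) for tp, fp, fn in counts.values()]
    macro = sum(per_type) / len(per_type)

    # Micro: pool the counts over all entity types, then compute one F1.
    total_tp = sum(c[0] for c in counts.values())
    total_fp = sum(c[1] for c in counts.values())
    total_fn = sum(c[2] for c in counts.values())
    micro = f1(total_tp, total_fp, total_fn)
    return macro, micro


# Hypothetical counts: two frequent types and one rare, poorly recognized type.
toy_counts = {
    "PER": (90, 10, 10),
    "ORG": (80, 20, 20),
    "RARE": (1, 4, 9),
}

macro, micro = macro_micro_f1(toy_counts)
print(f"macro F1 = {macro:.4f}, micro F1 = {micro:.4f}")
```

In this toy case the rare type drags the macro average well below the micro average, because macro F1 weights every entity type equally while micro F1 is dominated by frequent types. Whether that explains the gaps in the table above is an assumption, not something the source states.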
