| DeepStruct multi-task w/ finetune | 80.8 | DeepStruct: Pretraining of Language Models for Structure Prediction |  |
| Second-best learning and decoding + BERT + Flair | 77.36 | Nested Named Entity Recognition via Second-best Sequence Learning and Decoding |  |
| Neural transition-based model | 73.9 | A Neural Transition-based Model for Nested Mention Recognition |  |