HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Mimic-IV-ICD: A new benchmark for eXtreme MultiLabel Classification

Thanh-Tung Nguyen Viktor Schlegel Abhinav Kashyap Stefan Winkler Shao-Syuan Huang Jie-Jyun Liu Chih-Jen Lin

Mimic-IV-ICD: A new benchmark for eXtreme MultiLabel Classification

Abstract

Clinical notes are assigned ICD codes - sets of codes for diagnoses and procedures. In the recent years, predictive machine learning models have been built for automatic ICD coding. However, there is a lack of widely accepted benchmarks for automated ICD coding models based on large-scale public EHR data. This paper proposes a public benchmark suite for ICD-10 coding using a large EHR dataset derived from MIMIC-IV, the most recent public EHR dataset. We implement and compare several popular methods for ICD coding prediction tasks to standardize data preprocessing and establish a comprehensive ICD coding benchmark dataset. This approach fosters reproducibility and model comparison, accelerating progress toward employing automated ICD coding in future studies. Furthermore, we create a new ICD-9 benchmark using MIMIC-IV data, providing more data points and a higher number of ICD codes than MIMIC-III. Our open-source code offers easy access to data processing steps, benchmark creation, and experiment replication for those with MIMIC-IV access, providing insights, guidance, and protocols to efficiently develop ICD coding models.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
medical-code-prediction-on-mimic-iv-icd-10-1MSMN
Macro-AUC: 97.07
Macro-F1: 5.42
Micro-AUC: 99.61
Micro-F1: 55.91
Precision@8: 67.66
medical-code-prediction-on-mimic-iv-icd-10-1CAML
Macro-AUC: 89.91
Macro-F1: 4.07
Micro-AUC: 98.79
Micro-F1: 52.67
Precision@8: 64.43
medical-code-prediction-on-mimic-iv-icd-10-1LAAT
Macro-AUC: 92.96
Macro-F1: 4.47
Micro-AUC: 99.14
Micro-F1: 55.40
Precision@8: 66,97
medical-code-prediction-on-mimic-iv-icd-10-1PLM
Macro-AUC: 91.85
Macro-F1: 4.90
Micro-AUC: 99.02
Micro-F1: 56.95
Precision@8: 69.47
medical-code-prediction-on-mimic-iv-icd-10-1Joint LAAT
Macro-AUC: 93.64
Macro-F1: 5.71
Micro-AUC: 99.27
Micro-F1: 55.89
Precision@8: 66.89
medical-code-prediction-on-mimic-iv-icd10Joint LAAT
AUC (Macro): 93.39
AUC (Micro): 95.57
F1 (macro): 68.41
F1 (micro): 72.85
Precision@5: 64.49
medical-code-prediction-on-mimic-iv-icd10CAML
AUC (Macro): 91.05
AUC (Micro): 93.18
F1 (macro): 64.30
F1 (micro): 67.56
Precision@5: 59.58
medical-code-prediction-on-mimic-iv-icd10LAAT
AUC (Macro): 93.21
AUC (Micro): 95.49
F1 (macro): 68.15
F1 (micro): 72.56
Precision@5: 64.39
medical-code-prediction-on-mimic-iv-icd10MSMN
AUC (Macro): 93.60
AUC (Micro): 95.61
F1 (macro): 69.01
F1 (micro): 74.15
Precision@5: 65.16
medical-code-prediction-on-mimic-iv-icd10PLM-ICD
AUC (Macro): 93.37
AUC (Micro): 95.69
F1 (macro): 70.31
F1 (micro): 73.27
Precision@5: 64.57
medical-code-prediction-on-mimic-iv-icd9CAML
AUC Macro: 93.07
AUC Micro: 94.05
F1 Macro: 65.33
F1 Micro: 69.23
Precision @5: 58.64
medical-code-prediction-on-mimic-iv-icd9PLM-ICD
AUC Macro: 94.97
AUC Micro: 96.41
F1 Macro: 71.35
F1 Micro: 75.46
Precision @5: 62.44
medical-code-prediction-on-mimic-iv-icd9LAAT
AUC Macro: 94.88
AUC Micro: 96.29
F1 Macro: 69.99
F1 Micro: 74.46
Precision @5: 62.01
medical-code-prediction-on-mimic-iv-icd9Joint LAAT
AUC Macro: 94.92
AUC Micro: 96.31
F1 Macro: 69.93
F1 Micro: 74.33
Precision @5: 61.95
medical-code-prediction-on-mimic-iv-icd9MSMN
AUC Macro: 95.13
AUC Micro: 96.46
F1 Macro: 71.85
F1 Micro: 75.78
Precision @5: 62.60
medical-code-prediction-on-mimic-iv-icd9-fullCAML
F1 Macro: 11.06
F1 Micro: 57.28
Macro AUC: 93.45
Micro AUC: 99.29
Precision@8: 64.91
medical-code-prediction-on-mimic-iv-icd9-fullPLM-ICD
F1 Macro: 14.40
F1 Micro: 62.45
Macro AUC: 96.61
Micro AUC: 99.53
Precision@8: 70.34
medical-code-prediction-on-mimic-iv-icd9-fullLAAT
F1 Macro: 13.12
F1 Micro: 60.31
Macro AUC: 95.18
Micro AUC: 99.47
Precision@8: 67.47
medical-code-prediction-on-mimic-iv-icd9-fullMSMN
F1 Macro: 13.94
F1 Micro: 61.15
Macro AUC: 96.79
Micro AUC: 99.56
Precision@8: 68.89
medical-code-prediction-on-mimic-iv-icd9-fullJoint LAAT
F1 Macro: 14.17
F1 Micro: 60.37
Macro AUC: 95.57
Micro AUC: 99.49
Precision@8: 67.46

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Mimic-IV-ICD: A new benchmark for eXtreme MultiLabel Classification | Papers | HyperAI