HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review and Replicability Study

Joakim Edin; Alexander Junge; Jakob D. Havtorn; Lasse Borgholt; Maria Maistro; Tuukka Ruotsalo; Lars Maaløe

Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review and Replicability Study

Abstract

Medical coding is the task of assigning medical codes to clinical free-text documentation. Healthcare professionals manually assign such codes to track patient diagnoses and treatments. Automated medical coding can considerably alleviate this administrative burden. In this paper, we reproduce, compare, and analyze state-of-the-art automated medical coding machine learning models. We show that several models underperform due to weak configurations, poorly sampled train-test splits, and insufficient evaluation. In previous work, the macro F1 score has been calculated sub-optimally, and our correction doubles it. We contribute a revised model comparison using stratified sampling and identical experimental setups, including hyperparameters and decision boundary tuning. We analyze prediction errors to validate and falsify assumptions of previous works. The analysis confirms that all models struggle with rare codes, while long documents only have a negligible impact. Finally, we present the first comprehensive results on the newly released MIMIC-IV dataset using the reproduced models. We release our code, model parameters, and new MIMIC-III and MIMIC-IV training and evaluation pipelines to accommodate fair future comparisons.

Code Repositories

joakimedin/medical-coding-reproducibility
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
medical-code-prediction-on-mimic-iv-icd-10CAML
AUC Macro: 91.1
AUC Micro: 98.5
Exact Match Ratio: 0.3
F1 Macro: 16.0
F1 Micro: 55.4
Precision@15: 52.2
Precision@8: 66.8
R-Prec: 54.5
mAP: 57.4
medical-code-prediction-on-mimic-iv-icd-10PLM-ICD
AUC Macro: 96.6
AUC Micro: 99.2
Exact Match Ratio: 0.4
F1 Macro: 21.1
F1 Micro: 58.5
Precision@15: 55.0
Precision@8: 69.9
R-Prec: 57.9
mAP: 61.9
medical-code-prediction-on-mimic-iv-icd-10CNN
AUC Macro: 87.9
AUC Micro: 97.5
Exact Match Ratio: 0.3
F1 Macro: 8.0
F1 Micro: 47.2
Precision@15: 45.7
Precision@8: 60.3
R-Prec: 47.3
mAP: 48.2
medical-code-prediction-on-mimic-iv-icd-10Bi-GRU
AUC Macro: 92.4
AUC Micro: 98.3
Exact Match Ratio: 0.3
F1 Macro: 10.6
F1 Micro: 50.1
Precision@15: 47.7
Precision@8: 62.6
R-Prec: 49.6
mAP: 51.1
medical-code-prediction-on-mimic-iv-icd-10LAAT
AUC Macro: 95.4
AUC Micro: 99.0
Exact Match Ratio: 0.4
F1 Macro: 20.3
F1 Micro: 57.9
Precision@15: 54.3
Precision@8: 68.9
R-Prec: 57.2
mAP: 60.6
medical-code-prediction-on-mimic-iv-icd-10MultiResCNN
AUC Macro: 94.5
AUC Micro: 99.0
Exact Match Ratio: 0.4
F1 Macro: 21.1
F1 Micro: 56.9
Precision@15: 53.5
Precision@8: 67.8
R-Prec: 56.1
mAP: 59.3
medical-code-prediction-on-mimic-iv-icd-9MultiResCNN
AUC Macro: 95.1
AUC Micro: 99.2
Exact Match Ratio: 0.8
F1 Macro: 27.7
F1 Micro: 60.4
Precision@15: 51.8
Precision@8: 67.6
R-Prec: 60.4
mAP: 64.7
medical-code-prediction-on-mimic-iv-icd-9PLM-ICD
AUC Macro: 97.2
AUC Micro: 99.4
Exact Match Ratio: 1.0
F1 Macro: 29.8
F1 Micro: 62.6
Precision@15: 53.5
Precision@8: 70.0
R-Prec: 62.7
mAP: 68.0
medical-code-prediction-on-mimic-iv-icd-9Bi-GRU
AUC Macro: 93.8
AUC Micro: 98.8
Exact Match Ratio: 0.7
F1 Macro: 16.6
F1 Micro: 55.5
Precision@15: 47.8
Precision@8: 64.1
R-Prec: 55.8
mAP: 58.9
medical-code-prediction-on-mimic-iv-icd-9CNN
AUC Macro: 89.4
AUC Micro: 98.1
Exact Match Ratio: 0.6
F1 Macro: 12.6
F1 Micro: 52.4
Precision@15: 45.6
Precision@8: 61.3
R-Prec: 52.9
mAP: 55.2
medical-code-prediction-on-mimic-iv-icd-9LAAT
AUC Macro: 96.0
AUC Micro: 99.3
Exact Match Ratio: 0.9
F1 Macro: 26.4
F1 Micro: 61.7
Precision@15: 52.7
Precision@8: 68.9
R-Prec: 61.7
mAP: 66.3
medical-code-prediction-on-mimic-iv-icd-9CAML
AUC Macro: 90.7
AUC Micro: 98.8
Exact Match Ratio: 0.6
F1 Macro: 19.3
F1 Micro: 58.6
Precision@15: 50.3
Precision@8: 66.3
R-Prec: 58.5
mAP: 62.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review and Replicability Study | Papers | HyperAI