3 months ago

A New Public Corpus for Clinical Section Identification: MedSecId

Cornelia Caragea, Barbara Di Eugenio, Adam Webb, Sean S. Huang, Kunal Patel, Paul Landes

Abstract

Section identification is the process of demarcating and labeling the sections of a document. Such sections help readers search for information and contextualize specific topics. The goal of this work is to segment clinical documentation into its sections. The primary contribution is MedSecId, a publicly available set of 2,002 fully annotated medical notes from the MIMIC-III corpus. We include several baselines, source code, a pretrained model, and an analysis of the data that uses principal component analysis to show a relationship between medical concepts across sections.

Benchmarks

Benchmark | Methodology | Metrics
classification-on-medsecid | BiLSTM-CRF | 1 shot Micro-F1: 82.2
clinical-section-identification-on-medsecid | MedSecId | 1 shot Micro-F1: 95.5
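
The benchmark scores above are reported as Micro-F1. As a rough illustration of what that metric measures, the sketch below pools true positives, false positives, and false negatives over all labels before computing F1. It assumes one label per token; the section names in the example are hypothetical and are not taken from the MedSecId label set, and this page does not state whether the reported scores were computed per token or per section.

```python
def micro_f1(gold, pred):
    """Micro-averaged F1: pool TP/FP/FN over every label, then compute F1."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    labels = set(gold) | set(pred)
    tp = fp = fn = 0
    for label in labels:
        for g, p in zip(gold, pred):
            if p == label and g == label:
                tp += 1   # predicted this label and it was correct
            elif p == label:
                fp += 1   # predicted this label, gold was something else
            elif g == label:
                fn += 1   # gold was this label, prediction missed it
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical token-level section labels, for illustration only.
gold = ["history", "history", "medications", "medications", "allergies"]
pred = ["history", "medications", "medications", "medications", "allergies"]
print(micro_f1(gold, pred))
```

When every token carries exactly one label and all labels are counted, each error adds one false positive and one false negative, so Micro-F1 coincides with plain accuracy (4 of 5 tokens correct in the example).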
