Analysis of Clinical Text: Disorder Identification and Normalization
Identify disorder mentions and their attributes in clinical discharge summaries, based on SemEval-2015 Task 14 (Elhadad et al.). Annotators mark disorder spans, body locations, severity indicators, and classify the assertion status of each disorder.
Configuration File: config.yaml
# Clinical Text Analysis - Disorder Identification
# Based on Elhadad et al., SemEval 2015
# Paper: https://aclanthology.org/S15-2070/
# Dataset: https://www.i2b2.org/NLP/DataSets/
#
# This task requires annotators to identify disorder mentions and their
# attributes in de-identified clinical discharge summaries and progress
# notes. Annotators mark disorder spans, body locations, severity
# indicators, disease course markers, and negation cues, then classify
# the assertion status of each disorder.
#
# Span Types:
# - Disorder Mention: A disease, condition, or clinical finding
# - Body Location: An anatomical site associated with a disorder
# - Severity: An indicator of disorder severity (e.g., severe, mild, moderate)
# - Course: A marker of disease progression (e.g., progressive, worsening, stable)
# - Negation Indicator: A cue that negates a disorder (e.g., denies, no, negative for)
#
# Assertion Types:
# - Present: The disorder is currently present in the patient
# - Absent: The disorder has been ruled out or is not present
# - Possible: The disorder is suspected but not confirmed
# - Conditional: The disorder occurs under certain conditions
# - Hypothetical: The disorder is discussed as a future possibility
# - Associated with Someone Else: The disorder is attributed to someone other than the patient
annotation_task_name: "Clinical Text Analysis - Disorder Identification"
task_dir: "."
data_files:
  - sample-data.json
item_properties:
  id_key: "id"
  text_key: "text"
output_annotation_dir: "annotation_output/"
output_annotation_format: "json"
port: 8000
server_name: localhost
annotation_schemes:
  - annotation_type: span
    name: clinical_entity_spans
    description: "Highlight disorder mentions and their associated attributes in the clinical text."
    labels:
      - "Disorder Mention"
      - "Body Location"
      - "Severity"
      - "Course"
      - "Negation Indicator"
    tooltips:
      "Disorder Mention": "A disease, condition, symptom, or clinical finding (e.g., congestive heart failure, pneumonia, diabetes mellitus)"
      "Body Location": "An anatomical site or body part associated with a disorder (e.g., right knee, lower extremity, chest)"
      "Severity": "An indicator of how severe the disorder is (e.g., severe, mild, moderate, significant)"
      "Course": "A marker describing disease progression or trajectory (e.g., progressive, worsening, stable, acute)"
      "Negation Indicator": "A word or phrase that negates the presence of a disorder (e.g., denies, no, negative for, ruled out)"
  - annotation_type: radio
    name: assertion_status
    description: "What is the assertion status of the primary disorder mentioned in this text?"
    labels:
      - "Present"
      - "Absent"
      - "Possible"
      - "Conditional"
      - "Hypothetical"
      - "Associated with Someone Else"
    keyboard_shortcuts:
      "Present": "1"
      "Absent": "2"
      "Possible": "3"
      "Conditional": "4"
      "Hypothetical": "5"
      "Associated with Someone Else": "6"
    tooltips:
      "Present": "The disorder is currently present and confirmed in the patient"
      "Absent": "The disorder has been ruled out or is explicitly stated as not present"
      "Possible": "The disorder is suspected or being considered but not yet confirmed"
      "Conditional": "The disorder is present only under specific conditions or circumstances"
      "Hypothetical": "The disorder is discussed as a future possibility or preventive concern"
      "Associated with Someone Else": "The disorder is mentioned in reference to someone other than the patient (e.g., family history)"
annotation_instructions: |
  You will be shown de-identified clinical text from discharge summaries
  or progress notes.

  **Step 1: Entity Annotation**
  Highlight and classify the following spans in the text:
  - **Disorder Mention**: Any disease, condition, symptom, or finding
    (e.g., "congestive heart failure", "pneumonia", "edema")
  - **Body Location**: Anatomical sites associated with disorders
    (e.g., "right knee", "lower lobe", "bilateral lower extremity")
  - **Severity**: Severity indicators (e.g., "severe", "mild", "moderate")
  - **Course**: Disease progression markers (e.g., "progressive", "worsening", "acute")
  - **Negation Indicator**: Words negating a disorder (e.g., "denies", "no", "negative for")

  **Step 2: Assertion Classification**
  Classify the assertion status of the primary disorder:
  - **Present**: Disorder is confirmed in the patient
  - **Absent**: Disorder is ruled out or explicitly absent
  - **Possible**: Disorder is suspected but unconfirmed
  - **Conditional**: Disorder present only under certain conditions
  - **Hypothetical**: Discussed as a future possibility
  - **Associated with Someone Else**: Attributed to another person
html_layout: |
  <div style="padding: 15px; max-width: 800px; margin: auto;">
    <div style="background: #fef2f2; border: 1px solid #fca5a5; border-radius: 8px; padding: 12px; margin-bottom: 16px;">
      <strong style="color: #991b1b;">Note Type:</strong>
      <span style="font-size: 15px; margin-left: 8px; text-transform: capitalize;">{{note_type}}</span>
    </div>
    <div style="background: #f0f9ff; border: 1px solid #bae6fd; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
      <strong style="color: #0369a1;">Clinical Text:</strong>
      <p style="font-size: 16px; line-height: 1.7; margin: 8px 0 0 0;">{{text}}</p>
    </div>
  </div>
allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 2
allow_skip: true
skip_reason_required: false
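The assertion_status scheme repeats each label under labels, keyboard_shortcuts, and tooltips, so a label renamed in one place can silently drift from the others. The sketch below shows one way such drift might be caught. It is an illustration under assumptions: ASSERTION_SCHEME is a hand-copied, trimmed dictionary (tooltips omitted) standing in for the parsed YAML, and check_scheme is a hypothetical helper, not part of Potato.

```python
# Hand-copied subset of the radio scheme from config.yaml (an assumption for
# this sketch; in practice the dict would come from a YAML parser).
ASSERTION_SCHEME = {
    "annotation_type": "radio",
    "name": "assertion_status",
    "labels": [
        "Present",
        "Absent",
        "Possible",
        "Conditional",
        "Hypothetical",
        "Associated with Someone Else",
    ],
    "keyboard_shortcuts": {
        "Present": "1",
        "Absent": "2",
        "Possible": "3",
        "Conditional": "4",
        "Hypothetical": "5",
        "Associated with Someone Else": "6",
    },
}


def check_scheme(scheme):
    """Return a list of human-readable problems found in one scheme."""
    problems = []
    labels = scheme.get("labels", [])
    if len(set(labels)) != len(labels):
        problems.append("labels: duplicate entries")

    # Every per-label mapping that is present must cover exactly the labels.
    for field in ("keyboard_shortcuts", "tooltips"):
        mapping = scheme.get(field)
        if mapping is None:
            continue  # optional field, not used by this scheme
        missing = [label for label in labels if label not in mapping]
        extra = [key for key in mapping if key not in labels]
        if missing:
            problems.append(f"{field}: no entry for {missing}")
        if extra:
            problems.append(f"{field}: unknown label {extra}")

    # Two labels bound to the same key would make the shortcut ambiguous.
    keys = list(scheme.get("keyboard_shortcuts", {}).values())
    if len(set(keys)) != len(keys):
        problems.append("keyboard_shortcuts: same key bound to two labels")
    return problems


if __name__ == "__main__":
    for problem in check_scheme(ASSERTION_SCHEME):
        print(problem)
```

For the scheme as written in the config, the function returns an empty list. The same function could be applied to the span scheme, which has tooltips but no keyboard shortcuts.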
Sample Data: sample-data.json
[
{
"id": "clinical_001",
"text": "DISCHARGE SUMMARY: The patient is a 67-year-old male admitted with acute exacerbation of congestive heart failure. He presented with progressive dyspnea on exertion and bilateral lower extremity edema over the past two weeks. Chest X-ray revealed bilateral pleural effusions. He was treated with IV furosemide with significant improvement in symptoms.",
"note_type": "discharge_summary"
},
{
"id": "clinical_002",
"text": "HOSPITAL COURSE: Patient is a 54-year-old female with a history of type 2 diabetes mellitus and chronic kidney disease stage III. She was admitted for evaluation of worsening renal function. Creatinine on admission was 3.2 mg/dL, up from baseline of 1.8 mg/dL. Renal ultrasound showed no hydronephrosis. Nephrology was consulted.",
"note_type": "discharge_summary"
}
]
// ... and 8 more items
Get This Design
Clone or download from the repository
Quick start:
git clone https://github.com/davidjurgens/potato-showcase.git
cd potato-showcase/semeval/2015/task14-clinical-text-analysis
potato start config.yaml
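Before starting the server on your own data, it may help to confirm that every item carries the fields the config refers to: the id_key and text_key from item_properties, plus any names used in {{...}} placeholders in html_layout. The following is a minimal pre-flight sketch, not a Potato feature. It assumes placeholders are filled from same-named keys of each item; HTML_LAYOUT is a shortened copy of the layout, and layout_fields, check_items, and check_file are hypothetical helper names.

```python
import json
import re

# Values copied from config.yaml; the layout is trimmed to its placeholders.
ID_KEY = "id"
TEXT_KEY = "text"
HTML_LAYOUT = "<span>{{note_type}}</span><p>{{text}}</p>"


def layout_fields(layout):
    """Names inside {{...}} placeholders, in order of first appearance."""
    seen = []
    for name in re.findall(r"\{\{\s*(\w+)\s*\}\}", layout):
        if name not in seen:
            seen.append(name)
    return seen


def check_items(items, required):
    """Report missing or empty required fields and duplicate ids."""
    problems = []
    seen_ids = set()
    for index, item in enumerate(items):
        for field in required:
            if not str(item.get(field, "")).strip():
                problems.append(f"item {index}: missing or empty '{field}'")
        item_id = item.get(ID_KEY)
        if item_id is not None:
            if item_id in seen_ids:
                problems.append(f"item {index}: duplicate id '{item_id}'")
            seen_ids.add(item_id)
    return problems


def check_file(path):
    """Load a JSON array of items and run check_items on it."""
    with open(path, encoding="utf-8") as handle:
        items = json.load(handle)
    required = sorted({ID_KEY, TEXT_KEY, *layout_fields(HTML_LAYOUT)})
    return check_items(items, required)
```

Calling check_file("sample-data.json") from the design directory would return a list of problem strings, empty when every item has a non-empty id, text, and note_type and no id is repeated.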
Related Designs
Drug-Drug Interaction Extraction from Biomedical Texts
Extract drug entities and classify drug-drug interactions from biomedical texts, based on SemEval-2013 Task 9 (Segura-Bedmar et al.). Annotators identify drug mentions by type and classify interaction relationships between drug pairs.
Causal Medical Claim Detection and PICO Extraction
Detection of causal claims in medical texts and extraction of PICO (Population, Intervention, Comparator, Outcome) elements. Based on SemEval-2023 Task 8 (Khetan et al.).
Clinical TempEval - Temporal Information Extraction from Clinical Notes
Extraction of temporal information from clinical text, identifying time expressions, event mentions, and their temporal relations. Based on SemEval-2016 Task 12 (Clinical TempEval).