TY - GEN
T1 - Reducing Diagnostic Uncertainty Using Large Language Models
AU - Finkelstein, Joseph
AU - Cui, Wanting
AU - Morgan, Keaton
AU - Kawamoto, Kensaku
N1 - Publisher Copyright:
©2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The study used ClinicalBERT to predict body system categories based on clinical notes from the first three days of admission, using the MIMIC-III dataset. After data preprocessing, including the extraction of admission details, clinical notes, and diagnoses, the dataset comprised 510,956 notes associated with 44,270 unique hospitalizations. Discharge diagnoses were categorized into body systems, and the ClinicalBERT model was fine-tuned to predict associations with these diagnoses, resulting in 19 classification models - one for each body system. Around 80% of the models achieved F1 scores exceeding 0.7. Models for diseases of the circulatory, infectious and parasitic, respiratory, nervous, digestive, and genitourinary systems had F1 scores surpassing 0.8. Conversely, models for congenital malformations, eye and adnexa diseases, and ear and mastoid process diseases showed notably lower F1 scores. To explore model robustness, a comparison between three days and one day of notes per patient was conducted. While F1 scores generally decreased, a significant finding was that most body system models maintained satisfactory performance due to the statistical distribution similarities in note types and lengths between one and three days. This suggests the potential for ClinicalBERT's adaptability to varied data availability scenarios. Future studies could delve into developing a multiple notes model, testing its flexibility and robustness across different prediction durations, thereby potentially reducing the time and effort associated with model implementation in diverse clinical settings.
AB - The study used ClinicalBERT to predict body system categories based on clinical notes from the first three days of admission, using the MIMIC-III dataset. After data preprocessing, including the extraction of admission details, clinical notes, and diagnoses, the dataset comprised 510,956 notes associated with 44,270 unique hospitalizations. Discharge diagnoses were categorized into body systems, and the ClinicalBERT model was fine-tuned to predict associations with these diagnoses, resulting in 19 classification models - one for each body system. Around 80% of the models achieved F1 scores exceeding 0.7. Models for diseases of the circulatory, infectious and parasitic, respiratory, nervous, digestive, and genitourinary systems had F1 scores surpassing 0.8. Conversely, models for congenital malformations, eye and adnexa diseases, and ear and mastoid process diseases showed notably lower F1 scores. To explore model robustness, a comparison between three days and one day of notes per patient was conducted. While F1 scores generally decreased, a significant finding was that most body system models maintained satisfactory performance due to the statistical distribution similarities in note types and lengths between one and three days. This suggests the potential for ClinicalBERT's adaptability to varied data availability scenarios. Future studies could delve into developing a multiple notes model, testing its flexibility and robustness across different prediction durations, thereby potentially reducing the time and effort associated with model implementation in diverse clinical settings.
KW - Large Language Models
KW - diagnostic uncertainty
KW - emergency department
UR - http://www.scopus.com/inward/record.url?scp=85192231909&partnerID=8YFLogxK
U2 - 10.1109/AIMHC59811.2024.00049
DO - 10.1109/AIMHC59811.2024.00049
M3 - Conference contribution
AN - SCOPUS:85192231909
T3 - Proceedings - 2024 IEEE 1st International Conference on Artificial Intelligence for Medicine, Health and Care, AIMHC 2024
SP - 236
EP - 242
BT - Proceedings - 2024 IEEE 1st International Conference on Artificial Intelligence for Medicine, Health and Care, AIMHC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1st IEEE International Conference on Artificial Intelligence for Medicine, Health and Care, AIMHC 2024
Y2 - 5 February 2024 through 7 February 2024
ER -