TY - JOUR
T1 - Estimating Local Costs Associated with Clostridium difficile Infection Using Machine Learning and Electronic Medical Records
AU - Pak, Theodore R.
AU - Chacko, Kieran I.
AU - O'Donnell, Timothy
AU - Huprikar, Shirish S.
AU - Van Bakel, Harm
AU - Kasarskis, Andrew
AU - Scott, Erick R.
N1 - Funding Information:
Financial support: This study was supported by the Icahn Institute for Genomics and Multiscale Biology at Mount Sinai, in part by the National Institute of Allergy and Infectious Diseases (grant nos. F30AI122673 and R01AI119145), and through the resources and expertise of the Department of Scientific Computing at the Icahn School of Medicine at Mount Sinai.
Funding Information:
This study was supported by the Icahn Institute for Genomics and Multiscale Biology at Mount Sinai, in part by the National Institute of Allergy and Infectious Diseases (grant nos. F30AI122673 and R01AI119145)
Publisher Copyright:
© 2017 by The Society for Healthcare Epidemiology of America. All rights reserved This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence.
PY - 2017/12/1
Y1 - 2017/12/1
N2 - BACKGROUND Reported per-patient costs of Clostridium difficile infection (CDI) vary by 2 orders of magnitude among different hospitals, implying that infection control officers need precise, local analyses to guide rational decision making between interventions. OBJECTIVE We sought to comprehensively estimate changes in length of stay (LOS) attributable to CDI at a single urban tertiary-care facility using only data automatically extractable from the electronic medical record (EMR). METHODS We performed a retrospective cohort study of 171,938 visits spanning a 7-year period. In total, 23,968 variables were extracted from EMR data recorded within 24 hours of admission to train elastic-net regularized logistic regression models for propensity score matching. To address time-dependent bias (reverse causation), we separately stratified comparisons by time of infection, and we fit multistate models. RESULTS The estimated difference in median LOS for propensity-matched cohorts varied from 3.1 days (95% CI, 2.2-3.9) to 10.1 days (95% CI, 7.3-12.2) depending on the case definition; however, dependency of the estimate on time to infection was observed. Stratification by time to first positive toxin assay, excluding probable community-Acquired infections, showed a minimum excess LOS of 3.1 days (95% CI, 1.7-4.4). Under the same case definition, the multistate model averaged an excess LOS of 3.3 days (95% CI, 2.6-4.0). CONCLUSIONS In this study, 2 independent time-To-infection adjusted methods converged on similar excess LOS estimates. Changes in LOS can be extrapolated to marginal dollar costs by multiplying by average costs of an inpatient day. Infection control officers can leverage automatically extractable EMR data to estimate costs of CDI at their own institutions. Infect Control Hosp Epidemiol. 2017;38:1478-1486.
AB - BACKGROUND Reported per-patient costs of Clostridium difficile infection (CDI) vary by 2 orders of magnitude among different hospitals, implying that infection control officers need precise, local analyses to guide rational decision making between interventions. OBJECTIVE We sought to comprehensively estimate changes in length of stay (LOS) attributable to CDI at a single urban tertiary-care facility using only data automatically extractable from the electronic medical record (EMR). METHODS We performed a retrospective cohort study of 171,938 visits spanning a 7-year period. In total, 23,968 variables were extracted from EMR data recorded within 24 hours of admission to train elastic-net regularized logistic regression models for propensity score matching. To address time-dependent bias (reverse causation), we separately stratified comparisons by time of infection, and we fit multistate models. RESULTS The estimated difference in median LOS for propensity-matched cohorts varied from 3.1 days (95% CI, 2.2-3.9) to 10.1 days (95% CI, 7.3-12.2) depending on the case definition; however, dependency of the estimate on time to infection was observed. Stratification by time to first positive toxin assay, excluding probable community-Acquired infections, showed a minimum excess LOS of 3.1 days (95% CI, 1.7-4.4). Under the same case definition, the multistate model averaged an excess LOS of 3.3 days (95% CI, 2.6-4.0). CONCLUSIONS In this study, 2 independent time-To-infection adjusted methods converged on similar excess LOS estimates. Changes in LOS can be extrapolated to marginal dollar costs by multiplying by average costs of an inpatient day. Infection control officers can leverage automatically extractable EMR data to estimate costs of CDI at their own institutions. Infect Control Hosp Epidemiol. 2017;38:1478-1486.
UR - http://www.scopus.com/inward/record.url?scp=85033371762&partnerID=8YFLogxK
U2 - 10.1017/ice.2017.214
DO - 10.1017/ice.2017.214
M3 - Article
C2 - 29103378
AN - SCOPUS:85033371762
SN - 0899-823X
VL - 38
SP - 1478
EP - 1486
JO - Infection Control and Hospital Epidemiology
JF - Infection Control and Hospital Epidemiology
IS - 12
ER -