TY - JOUR
T1 - Development of a machine learning model using electrocardiogram signals to improve acute pulmonary embolism screening
AU - Somani, Sulaiman S.
AU - Honarvar, Hossein
AU - Narula, Sukrit
AU - Landi, Isotta
AU - Lee, Shawn
AU - Khachatoorian, Yeraz
AU - Rehmani, Arsalan
AU - Kim, Andrew
AU - De Freitas, Jessica K.
AU - Teng, Shelly
AU - Jaladanki, Suraj
AU - Kumar, Arvind
AU - Russak, Adam
AU - Zhao, Shan P.
AU - Freeman, Robert
AU - Levin, Matthew A.
AU - Nadkarni, Girish N.
AU - Kagen, Alexander C.
AU - Argulian, Edgar
AU - Glicksberg, Benjamin S.
N1 - Publisher Copyright:
© 2021 The Author(s). Published by Oxford University Press on behalf of the European Society of Cardiology.
PY - 2022/3/1
Y1 - 2022/3/1
N2 - Aims: Clinical scoring systems for pulmonary embolism (PE) screening have low specificity and contribute to computed tomography pulmonary angiogram (CTPA) overuse. We assessed whether deep learning models using an existing and routinely collected data modality, electrocardiogram (ECG) waveforms, can increase specificity for PE detection. Methods and results: We create a retrospective cohort of 21-183 patients at moderate-to high suspicion of PE and associate 23-793 CTPAs (10.0% PE-positive) with 320-746 ECGs and encounter-level clinical data (demographics, comorbidities, vital signs, and labs). We develop three machine learning models to predict PE likelihood: An ECG model using only ECG waveform data, an EHR model using tabular clinical data, and a Fusion model integrating clinical data and an embedded representation of the ECG waveform. We find that a Fusion model [area under the receiver-operating characteristic curve (AUROC) 0.81 ± 0.01] outperforms both the ECG model (AUROC 0.59 ± 0.01) and EHR model (AUROC 0.65 ± 0.01). On a sample of 100 patients from the test set, the Fusion model also achieves greater specificity (0.18) and performance (AUROC 0.84 ± 0.01) than four commonly evaluated clinical scores: Wells' Criteria, Revised Geneva Score, Pulmonary Embolism Rule-Out Criteria, and 4-Level Pulmonary Embolism Clinical Probability Score (AUROC 0.50-0.58, specificity 0.00-0.05). The model is superior to these scores on feature sensitivity analyses (AUROC 0.66-0.84) and achieves comparable performance across sex (AUROC 0.81) and racial/ethnic (AUROC 0.77-0.84) subgroups. Conclusion: Synergistic deep learning of ECG waveforms with traditional clinical variables can increase the specificity of PE detection in patients at least at moderate suspicion for PE.
AB - Aims: Clinical scoring systems for pulmonary embolism (PE) screening have low specificity and contribute to computed tomography pulmonary angiogram (CTPA) overuse. We assessed whether deep learning models using an existing and routinely collected data modality, electrocardiogram (ECG) waveforms, can increase specificity for PE detection. Methods and results: We create a retrospective cohort of 21-183 patients at moderate-to high suspicion of PE and associate 23-793 CTPAs (10.0% PE-positive) with 320-746 ECGs and encounter-level clinical data (demographics, comorbidities, vital signs, and labs). We develop three machine learning models to predict PE likelihood: An ECG model using only ECG waveform data, an EHR model using tabular clinical data, and a Fusion model integrating clinical data and an embedded representation of the ECG waveform. We find that a Fusion model [area under the receiver-operating characteristic curve (AUROC) 0.81 ± 0.01] outperforms both the ECG model (AUROC 0.59 ± 0.01) and EHR model (AUROC 0.65 ± 0.01). On a sample of 100 patients from the test set, the Fusion model also achieves greater specificity (0.18) and performance (AUROC 0.84 ± 0.01) than four commonly evaluated clinical scores: Wells' Criteria, Revised Geneva Score, Pulmonary Embolism Rule-Out Criteria, and 4-Level Pulmonary Embolism Clinical Probability Score (AUROC 0.50-0.58, specificity 0.00-0.05). The model is superior to these scores on feature sensitivity analyses (AUROC 0.66-0.84) and achieves comparable performance across sex (AUROC 0.81) and racial/ethnic (AUROC 0.77-0.84) subgroups. Conclusion: Synergistic deep learning of ECG waveforms with traditional clinical variables can increase the specificity of PE detection in patients at least at moderate suspicion for PE.
KW - Deep learning
KW - Electrocardiogram
KW - Machine learning
KW - Pulmonary embolism
UR - http://www.scopus.com/inward/record.url?scp=85148577699&partnerID=8YFLogxK
U2 - 10.1093/ehjdh/ztab101
DO - 10.1093/ehjdh/ztab101
M3 - Article
AN - SCOPUS:85148577699
SN - 2634-3916
VL - 3
SP - 56
EP - 66
JO - European Heart Journal - Digital Health
JF - European Heart Journal - Digital Health
IS - 1
ER -