TY - JOUR
T1 - Coronary Risk Estimation Based on Clinical Data in Electronic Health Records
AU - Petrazzini, Ben O.
AU - Chaudhary, Kumardeep
AU - Márquez-Luna, Carla
AU - Forrest, Iain S.
AU - Rocheleau, Ghislain
AU - Cho, Judy
AU - Narula, Jagat
AU - Nadkarni, Girish
AU - Do, Ron
N1 - Publisher Copyright:
© 2022 American College of Cardiology Foundation
PY - 2022/3/29
Y1 - 2022/3/29
N2 - Background: Clinical features from electronic health records (EHRs) can be used to build a complementary tool to predict coronary artery disease (CAD) susceptibility. Objectives: The purpose of this study was to determine whether an EHR score can improve CAD prediction and reclassification 1 year before diagnosis, beyond conventional clinical guidelines as determined by the pooled cohort equations (PCE) and a polygenic risk score for CAD. Methods: We applied a machine learning framework using clinical features from the EHR in a multiethnic, clinical care cohort (BioMe) comprising 555 CAD cases and 6,349 control subjects and in a population-based cohort (UK Biobank) comprising 3,130 CAD cases and 378,344 control subjects for external validation. Results: Compared with the PCE, the EHR score improved CAD prediction by 12% in the BioMe Biobank and by 9% in the UK Biobank. The EHR score reclassified 25.8% and 15.2% individuals in each cohort respectively, compared with the PCE score. We observed larger improvements in the EHR score over the PCE in a subgroup of individuals with low CAD risk, with 20% increased discrimination and 34.4% increased reclassification. In all models, the polygenic risk score for CAD did not improve CAD prediction, compared with the PCE or EHR score. Conclusions: The EHR score resulted in increased prediction and reclassification for CAD, demonstrating its potential use for population health monitoring of short-term CAD risk in large health systems.
AB - Background: Clinical features from electronic health records (EHRs) can be used to build a complementary tool to predict coronary artery disease (CAD) susceptibility. Objectives: The purpose of this study was to determine whether an EHR score can improve CAD prediction and reclassification 1 year before diagnosis, beyond conventional clinical guidelines as determined by the pooled cohort equations (PCE) and a polygenic risk score for CAD. Methods: We applied a machine learning framework using clinical features from the EHR in a multiethnic, clinical care cohort (BioMe) comprising 555 CAD cases and 6,349 control subjects and in a population-based cohort (UK Biobank) comprising 3,130 CAD cases and 378,344 control subjects for external validation. Results: Compared with the PCE, the EHR score improved CAD prediction by 12% in the BioMe Biobank and by 9% in the UK Biobank. The EHR score reclassified 25.8% and 15.2% individuals in each cohort respectively, compared with the PCE score. We observed larger improvements in the EHR score over the PCE in a subgroup of individuals with low CAD risk, with 20% increased discrimination and 34.4% increased reclassification. In all models, the polygenic risk score for CAD did not improve CAD prediction, compared with the PCE or EHR score. Conclusions: The EHR score resulted in increased prediction and reclassification for CAD, demonstrating its potential use for population health monitoring of short-term CAD risk in large health systems.
KW - biobank
KW - coronary artery disease
KW - electronic health record
KW - machine learning
KW - polygenic risk score
KW - pooled cohort equations
KW - prevention
UR - http://www.scopus.com/inward/record.url?scp=85126270071&partnerID=8YFLogxK
U2 - 10.1016/j.jacc.2022.01.021
DO - 10.1016/j.jacc.2022.01.021
M3 - Article
C2 - 35331410
AN - SCOPUS:85126270071
SN - 0735-1097
VL - 79
SP - 1155
EP - 1166
JO - Journal of the American College of Cardiology
JF - Journal of the American College of Cardiology
IS - 12
ER -