TY - JOUR
T1 - Evaluation of a machine learning-based metabolic marker for coronary artery disease in the UK Biobank
AU - Gibson, Kyle
AU - Forrest, Iain S.
AU - Petrazzini, Ben O.
AU - Duffy, Áine
AU - Park, Joshua K.
AU - Malick, Waqas
AU - Rosenson, Robert S.
AU - Rocheleau, Ghislain
AU - Jordan, Daniel M.
AU - Do, Ron
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2025/2
Y1 - 2025/2
N2 - Background and aims: An in silico quantitative score of coronary artery disease (ISCAD), built using machine learning and clinical data from electronic health records, has been shown to result in gradations of risk of subclinical atherosclerosis, coronary artery disease (CAD) sequelae, and mortality. Large-scale metabolite biomarker profiling provides increased portability and objectivity in machine learning for disease prediction and gradation. However, these models have not been fully leveraged. We evaluated a quantitative score of CAD derived from probabilities of a machine learning model trained on metabolomic data. Methods: We developed a CAD-predictive learning model using metabolic data from 93,642 individuals from the UK Biobank (median [IQR] age, 57 [14] years; 39,796 [42 %] male; 5640 [6 %] with diagnosed CAD), and assessed its probabilities as a quantitative metabolic risk score for CAD (M-CAD; range 0 [lowest probability] to 1 [highest probability]) in participants of the UK Biobank. The relationship of M-CAD with arterial stiffness index, ejection fraction, CAD sequelae, and mortality was assessed. Results: The model predicted CAD with an area under the receiver-operating-characteristic curve of 0.712. Arterial Stiffness Index increased by 0.19 and ejection fraction decreased by 0.2 % per 0.1 increase in M-CAD. Both incident and recurrent myocardial infarction increased stepwise over M-CAD quartiles (odds ratio (OR) 15.3 [4.2 %] and 12.5 [0.2 %]) in top quartiles as compared to the first quartile of incident and recurrent MI respectively). Likewise, the hazard ratio and prevalence of all-cause mortality, CVD-associated mortality, and CAD-associated mortality increased stepwise over M-CAD deciles (2.98 [14 %], 9.34 [4.3 %], 26.7 [2.7 %] in the top deciles as compared to the first decile of all-cause, CVD, and CAD mortality respectively). Conclusions: Metabolic-based machine learning can be used to build a quantitative risk score for CAD that is associated with atherosclerotic burden, CAD sequelae and mortality.
AB - Background and aims: An in silico quantitative score of coronary artery disease (ISCAD), built using machine learning and clinical data from electronic health records, has been shown to result in gradations of risk of subclinical atherosclerosis, coronary artery disease (CAD) sequelae, and mortality. Large-scale metabolite biomarker profiling provides increased portability and objectivity in machine learning for disease prediction and gradation. However, these models have not been fully leveraged. We evaluated a quantitative score of CAD derived from probabilities of a machine learning model trained on metabolomic data. Methods: We developed a CAD-predictive learning model using metabolic data from 93,642 individuals from the UK Biobank (median [IQR] age, 57 [14] years; 39,796 [42 %] male; 5640 [6 %] with diagnosed CAD), and assessed its probabilities as a quantitative metabolic risk score for CAD (M-CAD; range 0 [lowest probability] to 1 [highest probability]) in participants of the UK Biobank. The relationship of M-CAD with arterial stiffness index, ejection fraction, CAD sequelae, and mortality was assessed. Results: The model predicted CAD with an area under the receiver-operating-characteristic curve of 0.712. Arterial Stiffness Index increased by 0.19 and ejection fraction decreased by 0.2 % per 0.1 increase in M-CAD. Both incident and recurrent myocardial infarction increased stepwise over M-CAD quartiles (odds ratio (OR) 15.3 [4.2 %] and 12.5 [0.2 %]) in top quartiles as compared to the first quartile of incident and recurrent MI respectively). Likewise, the hazard ratio and prevalence of all-cause mortality, CVD-associated mortality, and CAD-associated mortality increased stepwise over M-CAD deciles (2.98 [14 %], 9.34 [4.3 %], 26.7 [2.7 %] in the top deciles as compared to the first decile of all-cause, CVD, and CAD mortality respectively). Conclusions: Metabolic-based machine learning can be used to build a quantitative risk score for CAD that is associated with atherosclerotic burden, CAD sequelae and mortality.
KW - Biobank
KW - Coronary artery disease
KW - Machine learning
KW - Metabolic
KW - Personalized
KW - Quantitative risk score
UR - http://www.scopus.com/inward/record.url?scp=85214510692&partnerID=8YFLogxK
U2 - 10.1016/j.atherosclerosis.2024.119103
DO - 10.1016/j.atherosclerosis.2024.119103
M3 - Article
AN - SCOPUS:85214510692
SN - 0021-9150
VL - 401
JO - Atherosclerosis
JF - Atherosclerosis
M1 - 119103
ER -