TY - JOUR
T1 - Deep learning-based polygenic risk analysis for Alzheimer’s disease prediction
AU - Alzheimer’s Disease Neuroimaging Initiative
AU - Zhou, Xiaopu
AU - Chen, Yu
AU - Ip, Fanny C.F.
AU - Jiang, Yuanbing
AU - Cao, Han
AU - Lv, Ge
AU - Zhong, Huan
AU - Chen, Jiahang
AU - Ye, Tao
AU - Chen, Yuewen
AU - Zhang, Yulin
AU - Ma, Shuangshuang
AU - Lo, Ronnie M.N.
AU - Tong, Estella P.S.
AU - Furst, Ansgar J.
AU - Taylor, Joy L.
AU - Yesavage, Jerome A.
AU - Li, Gail
AU - Petrie, Eric C.
AU - Peskind, Elaine R.
AU - Harding, Sandra
AU - Fruehling, J. Jay
AU - Massoglia, Dino
AU - James, Olga
AU - Arfanakis, Konstantinos
AU - Fleischman, Debra
AU - Friedl, Karl
AU - Finley, Shannon
AU - Hayes, Jacqueline
AU - Morrison, Rosemary
AU - Davis, Melissa
AU - Grafman, Jordan
AU - Neylan, Thomas
AU - Raj, Balebail Ashok
AU - Fargher, Kristin
AU - Smith, Amanda
AU - Raudin, Lisa
AU - Chaing, Gloria
AU - Relkin, Norman
AU - Smith, Karen Elizabeth
AU - Shim, Hyungsub
AU - Boles Ponto, Laura L.
AU - Schultz, Susan K.
AU - Sarrael, Antero
AU - Hernando, Raymundo
AU - Pomara, Nunzio
AU - Drost, Dick
AU - Rachinsky, Irina
AU - Grossman, Hillel
AU - Fillit, Howard
N1 - Publisher Copyright:
© The Author(s) 2023.
PY - 2023/12
Y1 - 2023/12
N2 - Background: The polygenic nature of Alzheimer’s disease (AD) suggests that multiple variants jointly contribute to disease susceptibility. As an individual’s genetic variants are constant throughout life, evaluating the combined effects of multiple disease-associated genetic risks enables reliable AD risk prediction. Because of the complexity of genomic data, current statistical analyses cannot comprehensively capture the polygenic risk of AD, resulting in unsatisfactory disease risk prediction. However, deep learning methods, which capture nonlinearity within high-dimensional genomic data, may enable more accurate disease risk prediction and improve our understanding of AD etiology. Accordingly, we developed deep learning neural network models for modeling AD polygenic risk. Methods: We constructed neural network models to model AD polygenic risk and compared them with the widely used weighted polygenic risk score and lasso models. We conducted robust linear regression analysis to investigate the relationship between the AD polygenic risk derived from deep learning methods and AD endophenotypes (i.e., plasma biomarkers and individual cognitive performance). We stratified individuals by applying unsupervised clustering to the outputs from the hidden layers of the neural network model. Results: The deep learning models outperform other statistical models for modeling AD risk. Moreover, the polygenic risk derived from the deep learning models enables the identification of disease-associated biological pathways and the stratification of individuals according to distinct pathological mechanisms. Conclusion: Our results suggest that deep learning methods are effective for modeling the genetic risks of AD and other diseases, classifying disease risks, and uncovering disease mechanisms.
AB - Background: The polygenic nature of Alzheimer’s disease (AD) suggests that multiple variants jointly contribute to disease susceptibility. As an individual’s genetic variants are constant throughout life, evaluating the combined effects of multiple disease-associated genetic risks enables reliable AD risk prediction. Because of the complexity of genomic data, current statistical analyses cannot comprehensively capture the polygenic risk of AD, resulting in unsatisfactory disease risk prediction. However, deep learning methods, which capture nonlinearity within high-dimensional genomic data, may enable more accurate disease risk prediction and improve our understanding of AD etiology. Accordingly, we developed deep learning neural network models for modeling AD polygenic risk. Methods: We constructed neural network models to model AD polygenic risk and compared them with the widely used weighted polygenic risk score and lasso models. We conducted robust linear regression analysis to investigate the relationship between the AD polygenic risk derived from deep learning methods and AD endophenotypes (i.e., plasma biomarkers and individual cognitive performance). We stratified individuals by applying unsupervised clustering to the outputs from the hidden layers of the neural network model. Results: The deep learning models outperform other statistical models for modeling AD risk. Moreover, the polygenic risk derived from the deep learning models enables the identification of disease-associated biological pathways and the stratification of individuals according to distinct pathological mechanisms. Conclusion: Our results suggest that deep learning methods are effective for modeling the genetic risks of AD and other diseases, classifying disease risks, and uncovering disease mechanisms.
UR - http://www.scopus.com/inward/record.url?scp=85203814889&partnerID=8YFLogxK
U2 - 10.1038/s43856-023-00269-x
DO - 10.1038/s43856-023-00269-x
M3 - Article
AN - SCOPUS:85203814889
SN - 2730-664X
VL - 3
JO - Communications Medicine
JF - Communications Medicine
IS - 1
M1 - 49
ER -