TY - JOUR
T1 - Machine learning methods applied to genotyping data capture interactions between single nucleotide variants in late onset Alzheimer's disease
AU - Arnal Segura, Magdalena
AU - Bini, Giorgio
AU - Fernandez Orth, Dietmar
AU - Samaras, Eleftherios
AU - Kassis, Maya
AU - Aisopos, Fotis
AU - Rambla De Argila, Jordi
AU - Paliouras, George
AU - Garrard, Peter
AU - Giambartolomei, Claudia
AU - Tartaglia, Gian Gaetano
N1 - Publisher Copyright:
© 2022 The Authors. Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring published by Wiley Periodicals, LLC on behalf of Alzheimer's Association.
PY - 2022
Y1 - 2022
N2 - Introduction: Genome-wide association studies (GWAS) in late onset Alzheimer's disease (LOAD) provide lists of individual genetic determinants. However, GWAS do not capture the synergistic effects among multiple genetic variants and lack good specificity. Methods: We applied tree-based machine learning algorithms (MLs) to discriminate LOAD (>700 individuals) and age-matched unaffected subjects in UK Biobank with single nucleotide variants (SNVs) from Alzheimer's disease (AD) studies, obtaining specific genomic profiles with the prioritized SNVs. Results: MLs prioritized a set of SNVs located in genes PVRL2, TOMM40, APOE, and APOC1, also influencing gene expression and splicing. The genomic profiles in this region showed interaction patterns involving rs405509 and rs1160985, also present in the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. rs405509 located in APOE promoter interacts with rs429358 among others, seemingly neutralizing their predisposing effect. Discussion: Our approach efficiently discriminates LOAD from controls, capturing genomic profiles defined by interactions among SNVs in a hot-spot region.
AB - Introduction: Genome-wide association studies (GWAS) in late onset Alzheimer's disease (LOAD) provide lists of individual genetic determinants. However, GWAS do not capture the synergistic effects among multiple genetic variants and lack good specificity. Methods: We applied tree-based machine learning algorithms (MLs) to discriminate LOAD (>700 individuals) and age-matched unaffected subjects in UK Biobank with single nucleotide variants (SNVs) from Alzheimer's disease (AD) studies, obtaining specific genomic profiles with the prioritized SNVs. Results: MLs prioritized a set of SNVs located in genes PVRL2, TOMM40, APOE, and APOC1, also influencing gene expression and splicing. The genomic profiles in this region showed interaction patterns involving rs405509 and rs1160985, also present in the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. rs405509 located in APOE promoter interacts with rs429358 among others, seemingly neutralizing their predisposing effect. Discussion: Our approach efficiently discriminates LOAD from controls, capturing genomic profiles defined by interactions among SNVs in a hot-spot region.
KW - Apolipoprotein E
KW - genetic determinants
KW - genomic interactions
KW - genomic profiles
KW - late onset Alzheimer's disease
KW - machine learning
KW - single nucleotide variants
KW - variant prioritization
UR - https://www.scopus.com/pages/publications/85145056776
U2 - 10.1002/dad2.12300
DO - 10.1002/dad2.12300
M3 - Article
AN - SCOPUS:85145056776
SN - 2352-8729
VL - 14
JO - Alzheimer's and Dementia: Diagnosis, Assessment and Disease Monitoring
JF - Alzheimer's and Dementia: Diagnosis, Assessment and Disease Monitoring
IS - 1
M1 - e12300
ER -