TY - JOUR
T1 - A Rare Variant Nonparametric Linkage Method for Nuclear and Extended Pedigrees with Application to Late-Onset Alzheimer Disease via WGS Data
AU - Zhao, Linhai
AU - He, Zongxiao
AU - Zhang, Di
AU - Wang, Gao T.
AU - Renton, Alan E.
AU - Vardarajan, Badri N.
AU - Nothnagel, Michael
AU - Goate, Alison M.
AU - Mayeux, Richard
AU - Leal, Suzanne M.
N1 - Funding Information:
We wish to thank the family members who participated in the Alzheimer Disease Sequencing Project and made this research possible. The datasets used for the analyses in this manuscript were obtained from the database of Genotypes and Phenotypes (dbGaP) at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000572.v7.p4 through dbGaP accession study number phs000572.v7.p4. We would like to thank dbGaP for distributing the data used in this study. The National Institute on Aging (NIA)-LOAD study supported the collection of samples used in this study through NIA grants U24AG026395 and R01AG041797. We thank contributors, including the Alzheimer Disease Centers who collected samples used in the NIA-LOAD study, as well as patients and their families, whose help and participation made this work possible. Data collection for this project was also supported by the Genetic Studies of Alzheimer Disease in Caribbean Hispanics (EFIGA) funded by the NIA grants 5R37AG015473, RF1AG015473, and R56AG051876. We acknowledge the EFIGA study participants and the EFIGA research and support staff for their contributions to this study. This work was also supported by grants from the National Human Genome Research Institute R01 HG008972 and NIA RF1 AG058131. Complete acknowledgments can be found in the Supplemental Acknowledgments.
Funding Information:
We wish to thank the family members who participated in the Alzheimer Disease Sequencing Project and made this research possible. The datasets used for the analyses in this manuscript were obtained from the database of Genotypes and Phenotypes (dbGaP) at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000572.v7.p4 through dbGaP accession study number phs000572.v7.p4. We would like to thank dbGaP for distributing the data used in this study. The National Institute on Aging (NIA)-LOAD study supported the collection of samples used in this study through NIA grants U24AG026395 and R01AG041797 . We thank contributors, including the Alzheimer Disease Centers who collected samples used in the NIA-LOAD study, as well as patients and their families, whose help and participation made this work possible. Data collection for this project was also supported by the Genetic Studies of Alzheimer Disease in Caribbean Hispanics (EFIGA) funded by the NIA grants 5R37AG015473 , RF1AG015473 , and R56AG051876 . We acknowledge the EFIGA study participants and the EFIGA research and support staff for their contributions to this study. This work was also supported by grants from the National Human Genome Research Institute R01 HG008972 and NIA RF1 AG058131 . Complete acknowledgments can be found in the Supplemental Acknowledgments .
Publisher Copyright:
© 2019 American Society of Human Genetics
PY - 2019/10/3
Y1 - 2019/10/3
N2 - To analyze family-based whole-genome sequence (WGS) data for complex traits, we developed a rare variant (RV) non-parametric linkage (NPL) analysis method, which has advantages over association methods. The RV-NPL differs from the NPL in that RVs are analyzed, and allele sharing among affected relative-pairs is estimated only for minor alleles. Analyzing families can increase power because causal variants with familial aggregation usually have larger effect sizes than those underlying sporadic diseases. Differing from association analysis, for NPL only affected individuals are analyzed, which can increase power, since unaffected family members can be susceptibility variant carriers. RV-NPL is robust to population substructure and admixture, inclusion of nonpathogenic variants, as well as allelic and locus heterogeneity and can readily be applied outside of coding regions. In contrast to analyzing common variants using NPL, where loci localize to large genomic regions (e.g., >50 Mb), mapped regions are well defined for RV-NPL. Using simulation studies, we demonstrate that RV-NPL is substantially more powerful than applying traditional NPL methods to analyze RVs. The RV-NPL was applied to analyze 107 late-onset Alzheimer disease (LOAD) pedigrees of Caribbean Hispanic and European ancestry with WGS data, and statistically significant linkage (LOD ≥ 3.8) was found with RVs in PSMF1 and PTPN21 which have been shown to be involved in LOAD etiology. Additionally, nominally significant linkage was observed with RVs in ABCA7, ACE, EPHA1, and SORL1, genes that were previously reported to be associated with LOAD. RV-NPL is an ideal method to elucidate the genetic etiology of complex familial diseases.
AB - To analyze family-based whole-genome sequence (WGS) data for complex traits, we developed a rare variant (RV) non-parametric linkage (NPL) analysis method, which has advantages over association methods. The RV-NPL differs from the NPL in that RVs are analyzed, and allele sharing among affected relative-pairs is estimated only for minor alleles. Analyzing families can increase power because causal variants with familial aggregation usually have larger effect sizes than those underlying sporadic diseases. Differing from association analysis, for NPL only affected individuals are analyzed, which can increase power, since unaffected family members can be susceptibility variant carriers. RV-NPL is robust to population substructure and admixture, inclusion of nonpathogenic variants, as well as allelic and locus heterogeneity and can readily be applied outside of coding regions. In contrast to analyzing common variants using NPL, where loci localize to large genomic regions (e.g., >50 Mb), mapped regions are well defined for RV-NPL. Using simulation studies, we demonstrate that RV-NPL is substantially more powerful than applying traditional NPL methods to analyze RVs. The RV-NPL was applied to analyze 107 late-onset Alzheimer disease (LOAD) pedigrees of Caribbean Hispanic and European ancestry with WGS data, and statistically significant linkage (LOD ≥ 3.8) was found with RVs in PSMF1 and PTPN21 which have been shown to be involved in LOAD etiology. Additionally, nominally significant linkage was observed with RVs in ABCA7, ACE, EPHA1, and SORL1, genes that were previously reported to be associated with LOAD. RV-NPL is an ideal method to elucidate the genetic etiology of complex familial diseases.
KW - Alzheimer disease
KW - nonparametric linkage analysis
KW - rare variant
UR - http://www.scopus.com/inward/record.url?scp=85072554077&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2019.09.006
DO - 10.1016/j.ajhg.2019.09.006
M3 - Article
C2 - 31585107
AN - SCOPUS:85072554077
SN - 0002-9297
VL - 105
SP - 822
EP - 835
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 4
ER -