TY - JOUR
T1 - KnockoffTrio
T2 - A knockoff framework for the identification of putative causal variants in genome-wide association studies with trio design
AU - Yang, Yi
AU - Wang, Chen
AU - Liu, Linxi
AU - Buxbaum, Joseph
AU - He, Zihuai
AU - Ionita-Laza, Iuliana
N1 - Funding Information:
This research was supported by NIH / National Institute of Mental Health Awards MH106910 and MH095797 (to I.I.-L.). We appreciate obtaining access to genetic and phenotypic data from dbGaP and SFARI Base and gratefully acknowledge the participants who provided data for the AGP, SPARK, and SSC projects.
Publisher Copyright:
© 2022 American Society of Human Genetics
PY - 2022/10/6
Y1 - 2022/10/6
N2 - Family-based designs can eliminate confounding due to population substructure and can distinguish direct from indirect genetic effects, but these designs are underpowered due to limited sample sizes. Here, we propose KnockoffTrio, a statistical method to identify putative causal genetic variants for father-mother-child trio design built upon a recently developed knockoff framework in statistics. KnockoffTrio controls the false discovery rate (FDR) in the presence of arbitrary correlations among tests and is less conservative and thus more powerful than the conventional methods that control the family-wise error rate via Bonferroni correction. Furthermore, KnockoffTrio is not restricted to family-based association tests and can be used in conjunction with more powerful, potentially nonlinear models to improve the power of standard family-based tests. We show, using empirical simulations, that KnockoffTrio can prioritize causal variants over associations due to linkage disequilibrium and can provide protection against confounding due to population stratification. In applications to 14,200 trios from three study cohorts for autism spectrum disorders (ASDs), including AGP, SPARK, and SSC, we show that KnockoffTrio can identify multiple significant associations that are missed by conventional tests applied to the same data. In particular, we replicate known ASD association signals with variants in several genes such as MACROD2, NRXN1, PRKAR1B, CADM2, PCDH9, and DOCK4 and identify additional associations with variants in other genes including ARHGEF10, SLC28A1, ZNF589, and HINT1 at FDR 10%.
AB - Family-based designs can eliminate confounding due to population substructure and can distinguish direct from indirect genetic effects, but these designs are underpowered due to limited sample sizes. Here, we propose KnockoffTrio, a statistical method to identify putative causal genetic variants for father-mother-child trio design built upon a recently developed knockoff framework in statistics. KnockoffTrio controls the false discovery rate (FDR) in the presence of arbitrary correlations among tests and is less conservative and thus more powerful than the conventional methods that control the family-wise error rate via Bonferroni correction. Furthermore, KnockoffTrio is not restricted to family-based association tests and can be used in conjunction with more powerful, potentially nonlinear models to improve the power of standard family-based tests. We show, using empirical simulations, that KnockoffTrio can prioritize causal variants over associations due to linkage disequilibrium and can provide protection against confounding due to population stratification. In applications to 14,200 trios from three study cohorts for autism spectrum disorders (ASDs), including AGP, SPARK, and SSC, we show that KnockoffTrio can identify multiple significant associations that are missed by conventional tests applied to the same data. In particular, we replicate known ASD association signals with variants in several genes such as MACROD2, NRXN1, PRKAR1B, CADM2, PCDH9, and DOCK4 and identify additional associations with variants in other genes including ARHGEF10, SLC28A1, ZNF589, and HINT1 at FDR 10%.
KW - GWAS
KW - causal variant identification
KW - family-based design
KW - knockoff framework
UR - http://www.scopus.com/inward/record.url?scp=85139338979&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2022.08.013
DO - 10.1016/j.ajhg.2022.08.013
M3 - Article
C2 - 36150388
AN - SCOPUS:85139338979
VL - 109
SP - 1761
EP - 1776
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
SN - 0002-9297
IS - 10
ER -