TY - JOUR
T1 - Optimized high-throughput screening of non-coding variants identified from genome-wide association studies
AU - Morova, Tunc
AU - Ding, Yi
AU - Huang, Chia Chi F.
AU - Sar, Funda
AU - Schwarz, Tommer
AU - Giambartolomei, Claudia
AU - Baca, Sylvan C.
AU - Grishin, Dennis
AU - Hach, Faraz
AU - Gusev, Alexander
AU - Freedman, Matthew L.
AU - Pasaniuc, Bogdan
AU - Lack, Nathan A.
N1 - Publisher Copyright:
© 2023 The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research.
PY - 2023/2/22
Y1 - 2023/2/22
N2 - The vast majority of disease-associated single nucleotide polymorphisms (SNP) identified from genome-wide association studies (GWAS) are localized in non-coding regions. A significant fraction of these variants impact transcription factors binding to enhancer elements and alter gene expression. To functionally interrogate the activity of such variants we developed snpSTARRseq, a high-throughput experimental method that can interrogate the functional impact of hundreds to thousands of non-coding variants on enhancer activity. snpSTARRseq dramatically improves signal-to-noise by utilizing a novel sequencing and bioinformatic approach that increases both insert size and the number of variants tested per loci. Using this strategy, we interrogated known prostate cancer (PCa) risk-associated loci and demonstrated that 35% of them harbor SNPs that significantly altered enhancer activity. Combining these results with chromosomal looping data we could identify interacting genes and provide a mechanism of action for 20 PCa GWAS risk regions. When benchmarked to orthogonal methods, snpSTARRseq showed a strong correlation with in vivo experimental allelic-imbalance studies whereas there was no correlation with predictive in silico approaches. Overall, snpSTARRseq provides an integrated experimental and computational framework to functionally test non-coding genetic variants.
AB - The vast majority of disease-associated single nucleotide polymorphisms (SNP) identified from genome-wide association studies (GWAS) are localized in non-coding regions. A significant fraction of these variants impact transcription factors binding to enhancer elements and alter gene expression. To functionally interrogate the activity of such variants we developed snpSTARRseq, a high-throughput experimental method that can interrogate the functional impact of hundreds to thousands of non-coding variants on enhancer activity. snpSTARRseq dramatically improves signal-to-noise by utilizing a novel sequencing and bioinformatic approach that increases both insert size and the number of variants tested per loci. Using this strategy, we interrogated known prostate cancer (PCa) risk-associated loci and demonstrated that 35% of them harbor SNPs that significantly altered enhancer activity. Combining these results with chromosomal looping data we could identify interacting genes and provide a mechanism of action for 20 PCa GWAS risk regions. When benchmarked to orthogonal methods, snpSTARRseq showed a strong correlation with in vivo experimental allelic-imbalance studies whereas there was no correlation with predictive in silico approaches. Overall, snpSTARRseq provides an integrated experimental and computational framework to functionally test non-coding genetic variants.
UR - https://www.scopus.com/pages/publications/85159711358
U2 - 10.1093/nar/gkac1198
DO - 10.1093/nar/gkac1198
M3 - Article
AN - SCOPUS:85159711358
SN - 0305-1048
VL - 51
SP - E18-E18
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 3
ER -