TY - JOUR
T1 - Identifying regulatory relationships among genomic loci, biological pathways, and disease
AU - Woo, Jung Hoon
AU - Cho, Sung Bum
AU - Lee, Eunjee
AU - Kim, Ju Han
N1 - Funding Information:
This study was supported by a grant of the Korea Health 21 R&D Project, Ministry of Health & Welfare ( A040002 ) and of KOSEF, Ministry of Sciences and Technology ( M10729070001-07N2907-00110 and MM10641000104-06N4100-1040 ), Republic of Korea. J.H.W. was supported in part by a GAW grant, R01 GM031575 .
PY - 2010/7
Y1 - 2010/7
N2 - Objective: Elucidating genetic factors of complex diseases is one of the most important challenges in biomedical research. Recently, a genetical genomics approach of mapping genotype to transcripts has been used in complex disease analysis. This approach treats messenger ribonucleic acid (mRNA) expression as a quantitative trait and identifies putative regulatory loci for the expression of an individual gene. However, the single-gene approach could not detect single nucleotide polymorphisms (SNP's) associated with the concerted activity of multiple genes. Since complex diseases result from the interactions of multiple genes, it is important to consider associations between genotype and multiple gene expression. Methods and materials: We developed the differential allelic co-expression (DACE) that identifies regulatory loci that affect the inter-correlation structure of multiple genes or a gene set. We applied DACE to two benchmark datasets: the normal human lymphoblastoid cell dataset and the glioblastoma dataset. These datasets consist of both SNPs and mRNA expression profiles for each sample. When analyzing the lymphoblastoid cell dataset, principal component analysis (PCA) was compared with the DACE test. Results: While PCA identified associations found by single-gene analysis, the DACE test detected associations not identified by the single-gene approach. Using the DACE test, seven putative regulatory loci of immune-related pathways were identified in lymphoblastoid cells after controlling for family-wise error rate. In the glioblastoma dataset, DACE identified 4582 SNPs associated with six pathways. In 231 of the 4582 SNPs, patient survival length was correlated significantly with the SNP genotype. This finding suggests that our integrative approach may provide a biological explanation for the putative relationship between sequence level variation and disease outcome, via expression of a functional pathway. Conclusion: The DACE test shows promise for finding regulatory relationships between a genomic locus and sets of genes which may be related to disease outcome.
AB - Objective: Elucidating genetic factors of complex diseases is one of the most important challenges in biomedical research. Recently, a genetical genomics approach of mapping genotype to transcripts has been used in complex disease analysis. This approach treats messenger ribonucleic acid (mRNA) expression as a quantitative trait and identifies putative regulatory loci for the expression of an individual gene. However, the single-gene approach could not detect single nucleotide polymorphisms (SNP's) associated with the concerted activity of multiple genes. Since complex diseases result from the interactions of multiple genes, it is important to consider associations between genotype and multiple gene expression. Methods and materials: We developed the differential allelic co-expression (DACE) that identifies regulatory loci that affect the inter-correlation structure of multiple genes or a gene set. We applied DACE to two benchmark datasets: the normal human lymphoblastoid cell dataset and the glioblastoma dataset. These datasets consist of both SNPs and mRNA expression profiles for each sample. When analyzing the lymphoblastoid cell dataset, principal component analysis (PCA) was compared with the DACE test. Results: While PCA identified associations found by single-gene analysis, the DACE test detected associations not identified by the single-gene approach. Using the DACE test, seven putative regulatory loci of immune-related pathways were identified in lymphoblastoid cells after controlling for family-wise error rate. In the glioblastoma dataset, DACE identified 4582 SNPs associated with six pathways. In 231 of the 4582 SNPs, patient survival length was correlated significantly with the SNP genotype. This finding suggests that our integrative approach may provide a biological explanation for the putative relationship between sequence level variation and disease outcome, via expression of a functional pathway. Conclusion: The DACE test shows promise for finding regulatory relationships between a genomic locus and sets of genes which may be related to disease outcome.
KW - Differential allelic co-expression test
KW - Functional pathway and disease outcome
KW - Genetical genomics
KW - Integrative approach
KW - Principal component analysis
KW - Relationship among sequence level variation
UR - http://www.scopus.com/inward/record.url?scp=77954310451&partnerID=8YFLogxK
U2 - 10.1016/j.artmed.2010.03.001
DO - 10.1016/j.artmed.2010.03.001
M3 - Article
C2 - 20554166
AN - SCOPUS:77954310451
SN - 0933-3657
VL - 49
SP - 161
EP - 165
JO - Artificial Intelligence in Medicine
JF - Artificial Intelligence in Medicine
IS - 3
ER -