TY - JOUR
T1 - Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
AU - VA Million Veteran Program
AU - Kim, Minsu
AU - Huffman, Jennifer E.
AU - Justice, Amy
AU - Goethert, Ian
AU - Agasthya, Greeshma
AU - Sun, Yan
AU - McArdle, Rachel
AU - Dellitalia, Louis
AU - Stephens, Brady
AU - Cho, Kelly
AU - Pyarajan, Saiju
AU - Mattocks, Kristin
AU - Harley, John
AU - Whittle, Jeffrey
AU - Mathew, Roy
AU - Beckham, Jean
AU - Smith, River
AU - Wells, John
AU - Gutierrez, Salvador
AU - Hammer, Kimberly
AU - Iruvanti, Pran
AU - Ballas, Zuhair
AU - Mastorides, Stephen
AU - Moorman, Jonathan
AU - Gappy, Saib
AU - Klein, Jon
AU - Ratcliffe, Nora
AU - Palacio, Ana
AU - Okusaga, Olaoluwa
AU - Murdoch, Maureen
AU - Sriram, Peruvemba
AU - Argyres, Dean P.
AU - Connor, Todd
AU - Villareal, Gerardo
AU - Kinlay, Scott
AU - Yeh, Shing Shing
AU - Jhala, Darshana
AU - Tandon, Neeraj
AU - Chang, Kyong Mi
AU - Aguayo, Samuel
AU - Cohen, David
AU - Sharma, Satish
AU - Hamner, Mark
AU - Liangpunsakul, Suthat
AU - Godschalk, Michael
AU - Oursler, Kris Ann
AU - Whooley, Mary
AU - Greco, Jennifer
AU - Ahuja, Sunil
AU - Pyarajan, Saiju
N1 - Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - Background: Genome-wide Association Studies (GWAS) aims to uncover the link between genomic variation and phenotype. They have been actively applied in cancer biology to investigate associations between variations and cancer phenotypes, such as susceptibility to certain types of cancer and predisposed responsiveness to specific treatments. Since GWAS primarily focuses on finding associations between individual genomic variations and cancer phenotypes, there are limitations in understanding the mechanisms by which cancer phenotypes are cooperatively affected by more than one genomic variation. Results: This paper proposes a network representation learning approach to learn associations among genomic variations using a prostate cancer cohort. The learned associations are encoded into representations that can be used to identify functional modules of genomic variations within genes associated with early- and late-onset prostate cancer. The proposed method was applied to a prostate cancer cohort provided by the Veterans Administration’s Million Veteran Program to identify candidates for functional modules associated with early-onset prostate cancer. The cohort included 33,159 prostate cancer patients, 3181 early-onset patients, and 29,978 late-onset patients. The reproducibility of the proposed approach clearly showed that the proposed approach can improve the model performance in terms of robustness. Conclusions: To our knowledge, this is the first attempt to use a network representation learning approach to learn associations among genomic variations within genes. Associations learned in this way can lead to an understanding of the underlying mechanisms of how genomic variations cooperatively affect each cancer phenotype. This method can reveal unknown knowledge in the field of cancer biology and can be utilized to design more advanced cancer-targeted therapies.
AB - Background: Genome-wide Association Studies (GWAS) aims to uncover the link between genomic variation and phenotype. They have been actively applied in cancer biology to investigate associations between variations and cancer phenotypes, such as susceptibility to certain types of cancer and predisposed responsiveness to specific treatments. Since GWAS primarily focuses on finding associations between individual genomic variations and cancer phenotypes, there are limitations in understanding the mechanisms by which cancer phenotypes are cooperatively affected by more than one genomic variation. Results: This paper proposes a network representation learning approach to learn associations among genomic variations using a prostate cancer cohort. The learned associations are encoded into representations that can be used to identify functional modules of genomic variations within genes associated with early- and late-onset prostate cancer. The proposed method was applied to a prostate cancer cohort provided by the Veterans Administration’s Million Veteran Program to identify candidates for functional modules associated with early-onset prostate cancer. The cohort included 33,159 prostate cancer patients, 3181 early-onset patients, and 29,978 late-onset patients. The reproducibility of the proposed approach clearly showed that the proposed approach can improve the model performance in terms of robustness. Conclusions: To our knowledge, this is the first attempt to use a network representation learning approach to learn associations among genomic variations within genes. Associations learned in this way can lead to an understanding of the underlying mechanisms of how genomic variations cooperatively affect each cancer phenotype. This method can reveal unknown knowledge in the field of cancer biology and can be utilized to design more advanced cancer-targeted therapies.
KW - Genome-wide Association Study
KW - Machine Learning
KW - Network Representation Learning
UR - http://www.scopus.com/inward/record.url?scp=85133605498&partnerID=8YFLogxK
U2 - 10.1186/s12920-022-01298-6
DO - 10.1186/s12920-022-01298-6
M3 - Article
C2 - 35794577
AN - SCOPUS:85133605498
SN - 1471-2350
VL - 15
JO - BMC Medical Genomics
JF - BMC Medical Genomics
IS - 1
M1 - 151
ER -