TY - JOUR
T1 - Protein structure-based gene expression signatures
AU - Rahman, Rayees
AU - Zatorski, Nicole
AU - Hansen, Jens
AU - Xiong, Yuguang
AU - Van Hasselt, J. G.Coen
AU - Sobie, Eric A.
AU - Birtwistle, Marc R.
AU - Azeloglu, Evren U.
AU - Iyengar, Ravi
AU - Schlessinger, Avner
N1 - Publisher Copyright:
© 2021 National Academy of Sciences. All rights reserved.
PY - 2021/5/11
Y1 - 2021/5/11
N2 - Gene expression signatures (GES) connect phenotypes to differential messenger RNA (mRNA) expression of genes, providing a powerful approach to define cellular identity, function, and the effects of perturbations. The use of GES has suffered from vague assessment criteria and limited reproducibility. Because the structure of proteins defines the functional capability of genes, we hypothesized that enrichment of structural features could be a generalizable representation of gene sets. We derive structural gene expression signatures (sGES) using features from multiple levels of protein structure (e.g., domain and fold) encoded by the mRNAs in GES. Comprehensive analyses of data from the Genotype-Tissue Expression Project (GTEx), the all RNA-seq and ChIP-seq sample and signature search (ARCHS4) database, and mRNA expression of drug effects on cardiomyocytes show that sGES are useful for characterizing biological phenomena. sGES enable phenotypic characterization across experimental platforms, facilitates interoperability of expression datasets, and describe drug action on cells.
AB - Gene expression signatures (GES) connect phenotypes to differential messenger RNA (mRNA) expression of genes, providing a powerful approach to define cellular identity, function, and the effects of perturbations. The use of GES has suffered from vague assessment criteria and limited reproducibility. Because the structure of proteins defines the functional capability of genes, we hypothesized that enrichment of structural features could be a generalizable representation of gene sets. We derive structural gene expression signatures (sGES) using features from multiple levels of protein structure (e.g., domain and fold) encoded by the mRNAs in GES. Comprehensive analyses of data from the Genotype-Tissue Expression Project (GTEx), the all RNA-seq and ChIP-seq sample and signature search (ARCHS4) database, and mRNA expression of drug effects on cardiomyocytes show that sGES are useful for characterizing biological phenomena. sGES enable phenotypic characterization across experimental platforms, facilitates interoperability of expression datasets, and describe drug action on cells.
KW - Gene expression signatures
KW - Reproducibility
KW - Structural bioinformatics
UR - http://www.scopus.com/inward/record.url?scp=85105268876&partnerID=8YFLogxK
U2 - 10.1073/pnas.2014866118
DO - 10.1073/pnas.2014866118
M3 - Article
C2 - 33941686
AN - SCOPUS:85105268876
SN - 0027-8424
VL - 118
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 19
M1 - e2014866118
ER -