Combination use of protein-protein interaction network topological features improves the predictive scores of deleterious non-synonymous single-nucleotide polymorphisms

Yiming Wu, Runyu Jing, Lin Jiang, Yanping Jiang, Qifan Kuang, Ling Ye, Lijun Yang, Yizhou Li, Menglong Li

Research output: Contribution to journalArticlepeer-review

20 Scopus citations

Abstract

Single-nucleotide polymorphisms (SNPs) are the most frequent form of genetic variations. Non-synonymous SNPs (nsSNPs) occurring in coding region result in single amino acid substitutions that associate with human hereditary diseases. Plenty of approaches were designed for distinguishing deleterious from neutral nsSNPs based on sequence level information. Novel in this work, combinations of protein-protein interaction (PPI) network topological features were introduced in predicting disease-related nsSNPs. Based on a dataset that was compiled from Swiss-Prot, a random forest model was constructed with an average accuracy value of 80.43 % and an MCC value of 0.60 in a rigorous tenfold crossvalidation test. For an independent dataset, our model achieved an accuracy of 88.05 % and an MCC of 0.67. Compared with previous studies, our approach presented superior prediction ability. Results showed that the incorporated PPI network topological features outperform conventional features. Our further analysis indicated that disease-related proteins are topologically different from other proteins. This study suggested that nsSNPs may share some topological information of proteins and the change of topological attributes could provide clues in illustrating functional shift due to nsSNPs.

Original languageEnglish
Pages (from-to)2025-2035
Number of pages11
JournalAmino Acids
Volume46
Issue number8
DOIs
StatePublished - Aug 2014
Externally publishedYes

Keywords

  • Disease
  • Non-synonymous single-nucleotide polymorphisms
  • Prediction
  • Protein-protein interaction network
  • Topological features

Fingerprint

Dive into the research topics of 'Combination use of protein-protein interaction network topological features improves the predictive scores of deleterious non-synonymous single-nucleotide polymorphisms'. Together they form a unique fingerprint.

Cite this