Property-based sequence representations do not adequately encode local protein folding information

  • A. D. Solis
  • , S. Rackovsky

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

We examine the informatic characteristics of amino acid representations based on physical properties. We demonstrate that sequences rewritten using contracted alphabets based on physical properties do not encode local folding information well. The best four-character alphabet can only encode ∼57% of the maximum possible amount of structural information. This result suggests that property-based representations that operate on a local length scale are not likely to be useful in homology searches and fold-recognition exercises.

Original languageEnglish
Pages (from-to)785-788
Number of pages4
JournalProteins: Structure, Function and Bioinformatics
Volume67
Issue number4
DOIs
StatePublished - Jun 2007

Keywords

  • Amino acids
  • Bioinformatics
  • Fold recognition
  • Homology search
  • Reduced alphabets

Fingerprint

Dive into the research topics of 'Property-based sequence representations do not adequately encode local protein folding information'. Together they form a unique fingerprint.

Cite this