Abstract
We examine the informatic characteristics of amino acid representations based on physical properties. We demonstrate that sequences rewritten using contracted alphabets based on physical properties do not encode local folding information well. The best four-character alphabet can only encode ∼57% of the maximum possible amount of structural information. This result suggests that property-based representations that operate on a local length scale are not likely to be useful in homology searches and fold-recognition exercises.
| Original language | English |
|---|---|
| Pages (from-to) | 785-788 |
| Number of pages | 4 |
| Journal | Proteins: Structure, Function and Bioinformatics |
| Volume | 67 |
| Issue number | 4 |
| DOIs | |
| State | Published - Jun 2007 |
Keywords
- Amino acids
- Bioinformatics
- Fold recognition
- Homology search
- Reduced alphabets
Fingerprint
Dive into the research topics of 'Property-based sequence representations do not adequately encode local protein folding information'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver