Property-based sequence representations do not adequately encode local protein folding information |
| |
Authors: | Solis A D Rackovsky S |
| |
Institution: | Department of Pharmacology and Biological Chemistry, Mount Sinai School of Medicine, One Gustave L. Levy Place, New York, New York 10029, USA. |
| |
Abstract: | We examine the informatic characteristics of amino acid representations based on physical properties. We demonstrate that sequences rewritten using contracted alphabets based on physical properties do not encode local folding information well. The best four-character alphabet can only encode approximately 57% of the maximum possible amount of structural information. This result suggests that property-based representations that operate on a local length scale are not likely to be useful in homology searches and fold-recognition exercises. |
| |
Keywords: | bioinformatics reduced alphabets homology search fold recognition amino acids |
本文献已被 PubMed 等数据库收录! |
|