A new method for identification of protein (sub)families in a set of proteins based on hydropathy distribution in proteins |
| |
Authors: | Pánek Josef; Eidhammer Ingvar; Aasland Rein |
| |
Affiliation: | Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia. panek@biomed.cas.cz |
| |
Abstract: | Structural similarity among proteins is reflected in the distribution of hydropathicity along the amino acids in the protein sequence. Similarities in hydropathy distributions are obvious for homologous proteins within a protein family. They have also been observed for proteins with related structures, even when sequence similarity was undetectable. Here we present a novel method that uses the hydropathy distribution in proteins to identify (sub)families in a set of (homologous) proteins. We represent proteins as points in a generalized hydropathy space, each described by a vector of specifically defined features. The features are derived from the hydropathy of the individual amino acids. Projection of this space onto its principal axes reveals groups of proteins with related hydropathy distributions. The groups identified correspond well to families of structurally and functionally related proteins. We found that this method accurately identifies protein families in a set of proteins, or subfamilies in a set of homologous proteins. Our results show that protein families can be identified by the analysis of hydropathy distribution, without the need for sequence alignment. |
| |
Keywords: | hydropathy; hydropathy distribution; protein feature; protein family; protein subfamily |
This article is indexed in PubMed and other databases. |
|
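The pipeline outlined in the abstract (hydropathy-derived feature vectors, then projection onto principal axes) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the feature definitions, so the features below (global mean, standard deviation, and per-segment mean hydropathy) are hypothetical stand-ins, the Kyte-Doolittle scale is an assumed choice of hydropathy index, the function names and `n_bins` parameter are invented for illustration, and the toy sequences are made up.

```python
import numpy as np

# Kyte-Doolittle hydropathy index (assumed scale; the abstract does not name one).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_features(seq, n_bins=4):
    """Hypothetical feature vector: global mean, global std, and the mean
    hydropathy of n_bins consecutive segments of the sequence."""
    if len(seq) < n_bins:
        raise ValueError("sequence must have at least n_bins residues")
    values = np.array([KD[aa] for aa in seq.upper()], dtype=float)
    segment_means = [seg.mean() for seg in np.array_split(values, n_bins)]
    return np.array([values.mean(), values.std()] + segment_means)


def project_onto_principal_axes(X, n_components=2):
    """Center the feature matrix and project it onto its leading principal
    axes, obtained from a singular value decomposition."""
    centered = X - X.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


# Toy sequences (made up): two hydrophobic, two hydrophilic.
sequences = [
    "LLIVVAALLIVVAILLVVAL",
    "IVLLAVVILLAAVVLLIIVA",
    "KKDEERRKKDDEENQKRDEK",
    "RDEKKNQEEDDKKRREEDKQ",
]
X = np.vstack([hydropathy_features(s) for s in sequences])
coords = project_onto_principal_axes(X)
print(coords)
```

In this toy setting the two hydrophobic sequences fall on one side of the first principal axis and the two hydrophilic ones on the other. Grouping real proteins into (sub)families would need the feature definitions from the paper itself and a clustering or visual inspection step on the projected points.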