Prediction of function divergence in protein families using the substitution rate variation parameter alpha |
| |
Authors: | Abhiman Saraswathi Daub Carsten O Sonnhammer Erik L L |
| |
Affiliation: | Center for Genomics and Bioinformatics, Karolinska Institutet, Stockholm, Sweden. Abhiman.Saraswathi@cgb.ki.se |
| |
Abstract: | Protein families typically embody a range of related functions and may thus be decomposed into subfamilies with, for example, distinct substrate specificities. Detection of functionally divergent subfamilies is possible by methods for recognizing branches of adaptive evolution in a gene tree. As the number of genome sequences is growing rapidly, it is highly desirable to automatically detect subfamily function divergence. To this end, we here introduce a method for large-scale prediction of function divergence within protein families. It is called the alpha shift measure (ASM) as it is based on detecting a shift in the shape parameter (alpha [alpha]) of the substitution rate gamma distribution. Four different methods for estimating alpha were investigated. We benchmarked the accuracy of ASM using function annotation from Enzyme Commission numbers within Pfam protein families divided into subfamilies by the automatic tree-based method BETE. In a test using 563 subfamily pairs in 162 families, ASM outperformed functional site-based methods using rate or conservation shifting (rate shift measure [RSM] and conservation shift measure [CSM]). The best results were obtained using the "GZ-Gamma" method for estimating alpha. By combining ASM with RSM and CSM using linear discriminant analysis, the prediction accuracy was further improved. |
| |
Keywords: | protein evolution adaptive evolution enzyme protein function protein subfamily substitution rates gamma distribution alpha parameter function shift |
本文献已被 PubMed Oxford 等数据库收录! |