首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Clustering of protein families into functional subtypes using Relative Complexity Measure with reduced amino acid alphabets
Authors:Aydin Albayrak  Hasan H Otu  Ugur O Sezerman
Institution:(1) Biological Sciences and Bioengineering, Sabanci University, Orhanli, Tuzla, Istanbul, Turkey;(2) Department of Medicine, BIDMC Genomics Center, Harvard Medical School, Boston, MA 02115, USA;(3) Department of Bioengineering, Istanbul Bilgi University, 34060 Istanbul, Turkey
Abstract:

Background  

Phylogenetic analysis can be used to divide a protein family into subfamilies in the absence of experimental information. Most phylogenetic analysis methods utilize multiple alignment of sequences and are based on an evolutionary model. However, multiple alignment is not an automated procedure and requires human intervention to maintain alignment integrity and to produce phylogenies consistent with the functional splits in underlying sequences. To address this problem, we propose to use the alignment-free Relative Complexity Measure (RCM) combined with reduced amino acid alphabets to cluster protein families into functional subtypes purely on sequence criteria. Comparison with an alignment-based approach was also carried out to test the quality of the clustering.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号