首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Sensitive detection of sequence similarity using combinatorial pattern discovery: a challenging study of two distantly related protein families
Authors:Darzentas Nikos  Rigoutsos Isidore  Ouzounis Christos A
Institution:Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge, UK.
Abstract:We investigate the performance of combinatorial pattern discovery to detect remote sequence similarities in terms of both biological accuracy and computational efficiency for a pair of distantly related families, as a case study. The two families represent the cupredoxins and multicopper oxidases, both containing blue copper-binding domains. These families present a challenging case due to low sequence similarity, different local structure, and variable sequence conservation at their copper-binding active sites. In this study, we investigate a new approach for automatically identifying weak sequence similarities that is based on combinatorial pattern discovery. We compare its performance with a traditional, HMM-based scheme and obtain estimates for sensitivity and specificity of the two approaches. Our analysis suggests that pattern discovery methods can be substantially more sensitive in detecting remote protein relationships while at the same time guaranteeing high specificity.
Keywords:sensitivity detection  sequence similarity  protein families
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号