首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Detecting non-orthology in the COGs database and other approaches grouping orthologs using genome-specific best hits
Authors:Dessimoz Christophe  Boeckmann Brigitte  Roth Alexander C J  Gonnet Gaston H
Institution:ETH Zurich, Institute of Computational Science, CH-8092 Zürich. cdessimoz@inf.ethz.ch
Abstract:Correct orthology assignment is a critical prerequisite of numerous comparative genomics procedures, such as function prediction, construction of phylogenetic species trees and genome rearrangement analysis. We present an algorithm for the detection of non-orthologs that arise by mistake in current orthology classification methods based on genome-specific best hits, such as the COGs database. The algorithm works with pairwise distance estimates, rather than computationally expensive and error-prone tree-building methods. The accuracy of the algorithm is evaluated through verification of the distribution of predicted cases, case-by-case phylogenetic analysis and comparisons with predictions from other projects using independent methods. Our results show that a very significant fraction of the COG groups include non-orthologs: using conservative parameters, the algorithm detects non-orthology in a third of all COG groups. Consequently, sequence analysis sensitive to correct orthology assignments will greatly benefit from these findings.
Keywords:
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号