Whole Proteome Prokaryote Phylogeny Without Sequence Alignment: A K-String Composition Approach |
| |
Authors: | Qi Ji Wang Bin Hao Bai-Iin |
| |
Institution: | (1) The Institute of Theoretical Physics, Academia Sinica, Beijing 100080, China;(2) The T-Life Research Center, Fudan University, Shanghai 200433, China |
| |
Abstract: | A systematic way of inferring evolutionary relatedness of microbial organisms from the oligopeptide content, i.e., frequency of amino acid K-strings in their complete proteomes, is proposed. The new method circumvents the ambiguity of choosing the genes for phylogenetic reconstruction and avoids the necessity of aligning sequences of essentially different length and gene content. The only parameter in the method is the length K of the oligopeptides, which serves to tune the resolution power of the method. The topology of the trees converges with K increasing. Applied to a total of 109 organisms, including 16 Archaea, 87 Bacteria, and 6 Eukarya, it yields an unrooted tree that agrees with the biologists tree of life based on SSU rRNA comparison in a majority of basic branchings, and especially, in all lower taxa. |
| |
Keywords: | Prokaryote Phylogeny Archaea K-strings Compositional distance Tree of life |
本文献已被 PubMed SpringerLink 等数据库收录! |
|