Similar literature
20 similar articles found.
1.
We perform an exhaustive, taxon-by-taxon comparison of the branchings in the composition vector trees (CVTrees) inferred from 432 prokaryotic genomes available on 31 December 2006 with the bacteriologists' taxonomy, primarily the latest online Outline of Bergey's Manual of Systematic Bacteriology. The CVTree phylogeny agrees very well with Bergey's taxonomy in the majority of fine branchings and in overall structure. At the same time, most of the differences between the trees and the Manual have been known to biologists to some extent and may hint at taxonomic revisions. Instead of demonstrating the overwhelming agreement, this paper puts emphasis on the biological implications of the differences.

2.
We perform an exhaustive, taxon-by-taxon comparison of the branchings in the composition vector trees (CVTrees) inferred from 432 prokaryotic genomes available on 31 December 2006 with the bacteriologists' taxonomy, primarily the latest online Outline of Bergey's Manual of Systematic Bacteriology. The CVTree phylogeny agrees very well with Bergey's taxonomy in the majority of fine branchings and in overall structure. At the same time, most of the differences between the trees and the Manual have been known to biologists to some extent and may hint at taxonomic revisions. Instead of demonstrating the overwhelming agreement, this paper puts emphasis on the biological implications of the differences.

3.
The Composition Vector Tree (CVTree) is a parameter-free and alignment-free method for inferring the phylogeny of prokaryotes from their complete genomes. It is distinct from traditional 16S rRNA analysis in both input data and methodology. The prokaryotic phylogenetic trees constructed with the CVTree method agree well with Bergey's taxonomy in all major groupings and fine branching patterns. Thus, combined use of the CVTree approach and 16S rRNA analysis may provide an objective and reliable reconstruction of the prokaryotic branch of the Tree of Life.

4.
The process of inferring phylogenetic trees from molecular sequences almost always starts with a multiple alignment of these sequences but can also be based on methods that do not involve multiple sequence alignment. Very little is known about the accuracy with which such alignment-free methods recover the correct phylogeny or about the potential for increasing their accuracy. We conducted a large-scale comparison of ten alignment-free methods, among them one new approach that does not calculate distances and a faster variant of our pattern-based approach; all distance-based alignment-free methods are freely available from http://www.bioinformatics.org.au (as Python package decaf+py). We show that most methods exhibit a higher overall reconstruction accuracy in the presence of high among-site rate variation. Under all conditions that we considered, variants of the pattern-based approach were significantly better than the other alignment-free methods. The new pattern-based variant achieved a speed-up of an order of magnitude in the distance calculation step, accompanied by a small loss of tree reconstruction accuracy. A method of Bayesian inference from k-mers did not improve on classical alignment-free (and distance-based) methods but may still offer other advantages due to its Bayesian nature. We found the optimal word length k of word-based methods to be stable across various data sets, and we provide parameter ranges for two different alphabets. The influence of these alphabets was analyzed to reveal a trade-off in reconstruction accuracy between long and short branches. We have mapped the phylogenetic accuracy for many alignment-free methods, among them several recently introduced ones, and increased our understanding of their behavior in response to biologically important parameters. In all experiments, the pattern-based approach emerged as superior, at the expense of higher resource consumption. Nonetheless, no alignment-free method that we examined recovers the correct phylogeny as accurately as does an approach based on maximum-likelihood distance estimates of multiply aligned sequences.

5.
Most methods for phylogenetic tree reconstruction are based on sequence alignments; they infer phylogenies from substitutions that may have occurred at the aligned sequence positions. Gaps in alignments are usually not employed as phylogenetic signal. In this paper, we explore an alignment-free approach that uses insertions and deletions (indels) as an additional source of information for phylogeny inference. For a set of four or more input sequences, we generate so-called quartet blocks of four putative homologous segments each. For pairs of such quartet blocks involving the same four sequences, we compare the distances between the two blocks in these sequences, to obtain hints about indels that may have happened between the blocks since the respective four sequences have evolved from their last common ancestor. A prototype implementation that we call Gap-SpaM is presented to infer phylogenetic trees from these data, using a quartet-tree approach or, alternatively, under the maximum-parsimony paradigm. This approach should not be regarded as an alternative to established methods, but rather as a complementary source of phylogenetic information. Interestingly, however, our software is able to produce phylogenetic trees from putative indels alone that are comparable to trees obtained with existing alignment-free methods.

6.
The ITS2 gene class shows a high sequence divergence among its members, which has complicated its annotation and its use for reconstructing phylogenies at higher taxonomic levels (beyond species and genus). Several alignment strategies have been implemented to improve ITS2 annotation quality and its use for phylogenetic inference. Although alignment-based methods have been pushed to their limits to tackle both issues, no alignment-free approach has been able to address both successfully. By contrast, simple alignment-free classifiers, such as topological indices (TIs) containing information about the sequence and structure of ITS2, may prove to be a useful approach for gene prediction and for assessing the phylogenetic relationships of the ITS2 class in eukaryotes. Thus, we used the TI2BioP (Topological Indices to BioPolymers) methodology [1], [2], freely available at http://ti2biop.sourceforge.net/, to calculate two different classes of TIs. One class was derived from the ITS2 artificial 2D structures generated from DNA strings and the other from the secondary structure inferred from RNA folding algorithms. Two alignment-free models based on Artificial Neural Networks were developed for ITS2 class prediction using the two classes of TIs referred to above. Both models showed similar performance on the training and test sets, reaching values above 95% in overall classification. Due to the importance of the ITS2 region for fungal identification, a novel ITS2 genomic sequence was isolated from Petrakia sp. This sequence and the test set were used to comparatively evaluate conventional classification models based on multiple sequence alignments, such as hidden Markov model-based approaches, revealing the success of our models in identifying novel ITS2 members. The isolated sequence was assessed using traditional and alignment-free techniques for phylogenetic inference to complement the taxonomy of the Petrakia sp. fungal isolate.

7.
Advances in sequencing have generated a large number of complete genomes. Traditionally, phylogenetic analysis relies on alignments of orthologs, but defining orthologs and separating them from paralogs is a complex task that may not always be suited to the large datasets of the future. Whole-genome, alignment-free methods are an alternative to traditional, alignment-based approaches. These methods are scalable and require minimal manual intervention. We developed SlopeTree, a new alignment-free method that estimates evolutionary distances by measuring the decay of exact substring matches as a function of match length. SlopeTree corrects for horizontal gene transfer, for composition variation and low-complexity sequences, and for branch-length nonlinearity caused by multiple mutations at the same site. We tested SlopeTree on 495 bacteria, 73 archaea, and 72 strains of Escherichia coli and Shigella. We compared our trees to the NCBI taxonomy, to trees based on concatenated alignments, and to trees produced by other alignment-free methods. The results were consistent with current knowledge about prokaryotic evolution. We assessed differences in tree topology over different methods and settings and found that the majority of bacteria and archaea have a core set of proteins that evolves by descent. In trees built from complete genomes rather than sets of core genes, we observed some grouping by phenotype rather than phylogeny, for instance with a cluster of sulfur-reducing thermophilic bacteria coming together irrespective of their phyla. The source code for SlopeTree is available at: http://prodata.swmed.edu/download/pub/slopetree_v1/slopetree.tar.gz.
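
The idea of "decay of exact substring matches as a function of match length" can be illustrated with a rough sketch. This is not SlopeTree's implementation and applies none of its corrections. It assumes that the match count at length k is the number of distinct k-mers shared by two sequences, and that the decay is summarized by the least-squares slope of log(count) against k. The function names are hypothetical.

```python
import math

def shared_kmer_count(a, b, k):
    """Number of distinct length-k substrings that occur in both sequences."""
    kmers_a = {a[i:i + k] for i in range(len(a) - k + 1)}
    kmers_b = {b[i:i + k] for i in range(len(b) - k + 1)}
    return len(kmers_a & kmers_b)

def match_decay_slope(a, b, k_values):
    """Least-squares slope of log(shared k-mer count) against k.

    Assumption for illustration only: a steeper (more negative) slope is
    read as faster decay of exact matches, i.e. more divergent sequences.
    """
    points = []
    for k in sorted(set(k_values)):
        n = shared_kmer_count(a, b, k)
        if n > 0:  # log is undefined for zero matches, so skip those lengths
            points.append((k, math.log(n)))
    if len(points) < 2:
        raise ValueError("need at least two match lengths with shared substrings")
    mean_k = sum(k for k, _ in points) / len(points)
    mean_y = sum(y for _, y in points) / len(points)
    num = sum((k - mean_k) * (y - mean_y) for k, y in points)
    den = sum((k - mean_k) ** 2 for k, _ in points)
    return num / den
```

Calling `match_decay_slope(seq_a, seq_b, range(4, 12))` on two protein or DNA strings gives a single number per pair; turning such numbers into calibrated evolutionary distances is the part that the published method handles and this sketch does not.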

8.
Gillet EM, Gregorius HR. Biometrics, 2000, 56(3): 801-807
In forest trees, classical techniques of studying modes of inheritance are usually not feasible due to the difficulty of performing controlled crosses. The limited information on inheritance extractable from readily available data, such as the large progenies collectable from single seed trees, must be compensated by the design of appropriately parameterized models. For this purpose, a system analytic approach is used to develop a new inferential framework for testing a single-locus codominant mode of inheritance of genetic traits using the inferred genotypes within progenies of single trees of inferred heterozygous genotype. Model assumptions are random gametic fusion between the local gamete pools and absence of postzygotic selection; ovule segregation distortion is allowed. The method yields estimates of the allele frequencies in both local gamete pools. Since tests of modes of inheritance must be tests of models rather than of parameters, the utility of the classical statistical testing procedures is limited, particularly concerning the qualification of a sampling method to attain a preassigned level of precision. Consistent application of this principle makes it possible to design qualified sampling methods prior to the actual experiment as well as to specify qualification levels for tests of completed experiments.

9.
The phylogeny of most of the species in the avian passerine family Locustellidae is inferred using a Bayesian species tree approach (Bayesian Estimation of Species Trees, BEST), as well as a traditional Bayesian gene tree method (MrBayes), based on a dataset comprising one mitochondrial and four nuclear loci. The trees inferred by the different methods agree fairly well in topology, although in a few cases there are marked differences. Some of these discrepancies might be due to convergence problems for BEST (despite up to 1×10^9 iterations). The phylogeny strongly disagrees with the current taxonomy at the generic level, and we propose a revised classification that recognizes four instead of seven genera. These results emphasize the well-known but still often neglected problem of basing classifications on non-cladistic evaluations of morphological characters. An analysis of an extended mitochondrial dataset with multiple individuals from most species, including many subspecies, suggests that several taxa presently treated as subspecies or as monotypic species, as well as a few taxa recognized as separate species, are in need of further taxonomic work.

10.
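An alignment-free analysis based on the K-tuple distribution of DNA sequences   Total citations: 1 (self-citations: 0, citations by others: 1)
Shen Juan, Wu Wenwu, Xie Xiaoli, Guo Mancai, Yuan Zhifa. 《遗传》, 2010, 32(6): 606-612
Based on the K-tuple distribution of genomes, this article presents an alignment-free method for estimating the degree of difference between biological sequences. The method can be used to measure the difference in K-tuple distribution between a real DNA sequence and its randomly shuffled counterpart. When the method was used to build a phylogenetic tree from the complete mitochondrial genomes of 26 placental mammals, the classification in the tree matched the generally accepted biological results increasingly well as K increased. The results indicate that phylogenetic trees built with this method are more reasonable than those built with other alignment-free methods.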
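
As an illustration of comparing the K-tuple distribution of a real sequence with that of a randomly shuffled copy, here is a minimal sketch. The abstract does not give the article's difference measure, so the sum of absolute frequency differences used here is an assumption, as are the function names and the fixed random seed.

```python
import random
from collections import Counter

def ktuple_distribution(seq, k):
    """Relative frequency of every overlapping K-tuple in seq."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {kt: c / total for kt, c in counts.items()}

def l1_difference(p, q):
    """Sum of absolute frequency differences over all K-tuples seen in p or q.

    Assumed measure for illustration; the article's own measure may differ.
    """
    return sum(abs(p.get(kt, 0.0) - q.get(kt, 0.0)) for kt in set(p) | set(q))

def shuffle_difference(seq, k, seed=0):
    """Difference between the K-tuple distributions of seq and a shuffled copy."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    letters = list(seq)
    rng.shuffle(letters)
    shuffled = "".join(letters)
    return l1_difference(ktuple_distribution(seq, k),
                         ktuple_distribution(shuffled, k))
```

Shuffling preserves base composition, so for K = 1 the difference is zero; for larger K, any nonzero value reflects ordering in the real sequence that the shuffle destroys.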

11.
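Alignment-free sequence analysis methods use the k-mer frequencies of specific oligonucleotides in a sequence as features, and compare and analyze sequences according to these features. Because such methods take into account the effect of all types of variation on the overall character of a sequence, they have unique advantages for the analysis of omics data. However, their applicability to the phylogenomics of complex multicellular organisms remains to be tested. In this paper, we used the alignment-free CVTree software to build a neighbor-joining (NJ) phylogenetic tree from the proteome data of 45 mammals, and on this basis discuss several questions in mammalian phylogeny. On the widely debated relationships among the four superorders of Eutheria, CVTree supports the prevailing morphological conclusion, the Epitheria hypothesis. This differs from the Exafroplacentalia hypothesis supported by alignment-based methods. At the ordinal level within mammals, the CVTree tree is largely consistent with the phylogenetic relationships generally accepted on molecular and morphological grounds. Within orders, however, the CVTree tree shows more discrepancies. These results provide preliminary evidence that alignment-free methods are feasible for phylogenetic analysis of omics data from complex multicellular organisms. Improvements to alignment-free methods themselves, and comparisons with traditional substitution-based alignment methods, require further study.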
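
The general idea of using k-mer frequencies as features and comparing sequences by those features can be sketched as follows. This is a simplified illustration, not the CVTree procedure: it uses raw k-mer frequencies and a cosine-based distance, both of which are assumptions here, and the function names are hypothetical. The resulting pairwise distances are the kind of input a neighbor-joining program would take.

```python
import math
from collections import Counter
from itertools import combinations

def kmer_frequencies(seq, k):
    """Relative frequency of each overlapping k-mer in seq."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {kmer: c / total for kmer, c in counts.items()}

def cosine_distance(p, q):
    """1 minus the cosine similarity of two sparse frequency vectors."""
    dot = sum(v * q.get(kmer, 0.0) for kmer, v in p.items())
    norm = (math.sqrt(sum(v * v for v in p.values()))
            * math.sqrt(sum(v * v for v in q.values())))
    # An empty profile has no direction; treat it as maximally distant.
    return 1.0 - dot / norm if norm else 1.0

def distance_matrix(seqs, k):
    """Pairwise distances for a {name: sequence} mapping, keyed by name pair."""
    profiles = {name: kmer_frequencies(s, k) for name, s in seqs.items()}
    return {(a, b): cosine_distance(profiles[a], profiles[b])
            for a, b in combinations(sorted(profiles), 2)}
```

For proteome data, each value in `seqs` could be the concatenation of a species' protein sequences, although k-mers spanning protein boundaries would then need to be excluded; that detail is left out of the sketch.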

12.
13.
This study uses traditional and contemporary phylogenetic and population genetic analyses to assess the causes of discordance (i.e., lineage sorting and introgression) among mitochondrial and nuclear gene trees for a clade of eastern North American scarab beetles (fraterna species group, genus Phyllophaga). I estimated gene trees using individual and combined analysis of one mitochondrial and two nuclear loci in MrBayes, and inferred a species tree using a hierarchical coalescent approach based on all loci in the program BEST. Because hybridization violates the assumptions of BEST, I tested for introgression by comparing species monophyly between the mitochondrial and nuclear gene trees, based on the prediction that cytoplasmic genomes introgress more readily than nuclear genomes. Haplotype exclusivity was identified using Bayesian tests of monophyly and the genealogical sorting index. I used the results of the phylogenetic analyses and monophyly tests to develop an explicit hypothesis of introgression that could be tested in the program IMa. Results from these analyses provided evidence for introgression across clades within the fraterna group. The tiered analytical approach used in this study demonstrated how the use of multiple methods can identify when assumptions are violated and methods are prone to yield misleading results.

14.
A faithful phylogeny and an objective taxonomy for prokaryotes should agree with each other and ultimately follow the genome data. With the number of sequenced genomes reaching tens of thousands, both tree inference and detailed comparison with taxonomy are great challenges. We now provide one solution in the latest Release 3.0 of the alignment-free and whole-genome-based web server CVTree3. The server resides in a cluster of 64 cores and is equipped with an interactive, collapsible, and expandable tree display. It is capable of comparing the tree branching order with prokaryotic classification at all taxonomic ranks from domains down to species and strains. CVTree3 allows for inquiry by taxon names and trial on lineage modifications. In addition, it reports a summary of monophyletic and non-monophyletic taxa at all ranks and produces print-quality subtree figures. After giving an overview of retrospective verification of the CVTree approach, the power of the new server is described for the mega-classification of prokaryotes and determination of taxonomic placement of some newly sequenced genomes. A few discrepancies between CVTree and 16S rRNA analyses are also summarized with regard to possible taxonomic revisions. CVTree3 is freely accessible to all users at http://tlife.fudan.edu.cn/cvtree3/ without login requirements.

15.
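This paper applies 12S rRNA gene sequence analysis to the phylogenetic relationships of several important spider groups, in order to verify and supplement the conclusions of traditional taxonomic studies and to explore the applicability of 12S rRNA gene sequence analysis to spider phylogenetics. The molecular phylogenetic tree built from domain III of the 12S rRNA gene leads to the following conclusions: 1. the orb-weavers (Deinopoidea and Araneoidea) are not monophyletic; 2. Coelotes (隙蛛) and Amaurobius (暗蛛) are more closely related to each other than either is to Agelena (漏斗蛛); 3. Uroctea (壁钱) and Oecobius (拟壁钱) are not closely related; 4. the cribellate spiders are a polyphyletic group; 5. the domain III fragment of the 12S rRNA gene is an effective genetic marker for inferring phylogenetic relationships among closely related families and genera.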

16.
Composition Vector Tree (CVTree) is an alignment-free algorithm for inferring phylogenetic relationships from genome sequences. It has been successfully applied to study the phylogeny and taxonomy of viruses, prokaryotes, and fungi based on whole genomes, as well as chloroplast genomes, mitochondrial genomes, and metagenomes. Here we present standalone software for the CVTree algorithm. In the software, an extensible parallel workflow for the CVTree algorithm was designed. Based on the workflow, new alignment-free methods were also implemented. By examining the phylogeny and taxonomy of 13,903 prokaryotes based on 16S rRNA sequences, we showed that the CVTree software is an efficient and effective tool for studying phylogeny and taxonomy based on genome sequences. The code of the CVTree software is available at https://github.com/ghzuo/cvtree.

17.
There has been considerable disagreement regarding the relationships among Pestalotiopsis species and their delimitations. A molecular phylogenetic analysis was conducted on 32 species of Pestalotiopsis in order to evaluate the utility of morphological characters currently used in their taxonomy. Phylogenetic relationships were inferred from nucleotide sequences in the ITS regions and 5.8S gene of the rDNA under four optimality criteria: maximum parsimony, weighted parsimony, maximum likelihood, and neighbor joining. Phylogenies estimated from all analyses yielded trees of essentially similar topology and revealed three major groups that correspond with morphology-based classification systems. Molecular data indicated that the genus contains two distinct lineages based on pigmentation of median cells and four distinct groupings based on morphology of apical appendages. The analyses did not support the reliability of other phenotypic characters of this genus, such as spore dimensions. Characters with particular phylogenetic significance are discussed in relation to the taxonomy of Pestalotiopsis.

18.

Background  

Accurate taxonomy is best maintained if species are arranged as hierarchical groups in phylogenetic trees. This is especially important as trees grow larger as a consequence of a rapidly expanding sequence database. Hierarchical group names are typically manually assigned in trees, an approach that becomes unfeasible for very large topologies.

19.
Phylogenetic relationships in Hyaloperonospora (Oomycetes) were investigated by molecular analyses using internal transcribed spacer (ITS) sequences and collections from different host plants. Trees were inferred with Bayesian Markov chain Monte Carlo, neighbor-joining and maximum parsimony methods and rooted with Perofascia. The results are discussed with respect to host taxonomy and species concepts of downy mildews from the literature. Molecular data mainly support the use of narrow species delimitations and host range as a taxonomic marker. Hyaloperonospora brassicae turns out to be a non-monophyletic assemblage of different species. New combinations are proposed in accordance with the phylogenetic trees.

20.
Taxonomic arrangements for the cormorants and shags (Phalacrocoracidae) had varied greatly until two quite similar arrangements, one based on behavior and the other on osteological characters, became the basis for current thought on the evolutionary relationships of these birds. The terms cormorant and shag, which had previously been haphazardly applied to members of the group, became the vernacular terms for the two major subdivisions within this family. The two taxonomies differ in places, however, with the behavioral taxonomy placing several species within the shags and the osteological taxonomy and phylogeny grouping those species (as the marine cormorants) and placing them within the cormorants. In an attempt to resolve the differences in the relationships hypothesized by behavior and morphology, we sequenced three mitochondrial genes (12S, ATPase 6, and ATPase 8). Initial equally weighted parsimony trees differed slightly from our two weighted parsimony trees, one of which was also our maximum-likelihood tree. Many of the branches within our trees were well supported, but some sections of the phylogeny proved difficult to resolve with confidence. Our sequence trees differ substantially from the morphological phylogeny and show that neither the shags nor the cormorants are monophyletic, but form an intermingled group. Some of the groups supported by both the behavioral and the morphological taxonomies (e.g., the cliff shags, Stictocarbo) appear to be polyphyletic. Conversely, the monophyly of the blue-eyed shags, a traditional group that the osteological analysis had found to be paraphyletic, was supported by the sequence data. Until more taxa are sampled and a fully robust phylogeny is obtained, a conservative approach accepting a single genus, Phalacrocorax, for the shags and cormorants is recommended.
