Similar articles
 20 similar articles found (search time: 171 ms)
1.
Muscodor is a genus of non-sporulating endophytic fungi that produce volatile organic compounds and have been extensively explored as bio-fumigants and bio-preservatives. Novel species of this genus have been identified mainly using ITS sequences. However, the hyper-variability of ITS hinders the creation of reproducible alignments and stable phylogenetic trees. Conserved structural data of the ITS region provide vital auxiliary information for accurate species identification in fungi. In the present study, secondary structural data for the ITS1, 5.8S, and ITS2 regions of all Muscodor species were generated using the LocARNA web server. The predicted secondary structures displayed greater variability in the ITS1 region than in ITS2. The structural data of all sequences exhibited characteristic conserved features of eukaryotic rRNA. Evolutionarily conserved motifs were found among all 5.8S and ITS2 sequences. A profile neighbor joining (PNJ) tree based on combined sequence-structure information of the ITS region was generated in ProfDistS. The PNJ tree resolved into four major groups in which M. fengyangensis and M. albus formed monophyletic clades. However, three M. albus species along with other Muscodor species emerged as sister branches to the existing clades, thereby improving the precision of phylogenetic analysis for identifying novel species of the genus Muscodor. Hence, the results indicate that structural analysis combined with primary sequence information can provide new insights for the precise identification of Muscodor species.

2.
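Assessing the validity of the species status of the dwarf blue sheep based on the mitochondrial cytochrome b gene   Total citations: 21, self-citations: 0, citations by others: 21
Using 18 skull or skin specimens of blue sheep and dwarf blue sheep collected in Sichuan Province, we analyzed the complete 1140-bp sequence of the mitochondrial Cyt b gene (for one sample, only an 802-bp fragment was obtained). Phylogenetic trees reconstructed separately with the NJ, MP, and ML methods had identical topologies, and none supported the dwarf blue sheep as a monophyletic group. The results suggest that all samples studied belong to a single species, the blue sheep P. nayaur, and do not support species status for the dwarf blue sheep. Based on the phylogenetic tree built from the complete Cyt b sequences of 17 samples, together with genetic variation and geographic distribution, these blue sheep samples clustered into five clades. According to geographic divisions and the results of this analysis, the blue sheep of Sichuan can be divided into five populations: Motianling, western Sichuan, northwestern Sichuan, southwestern Sichuan, and northern Sichuan. Based on the 1137-bp coding region of the Cyt b gene in 17 samples, 16 haplotypes of blue sheep were defined, and no haplotype was shared among the five populations.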

3.
Using simulated data, we compared five methods of phylogenetic tree estimation: parsimony, compatibility, maximum likelihood, Fitch-Margoliash, and neighbor joining. For each combination of substitution rates and sequence length, 100 data sets were generated for each of 50 trees, for a total of 5,000 replications per condition. Accuracy was measured by two measures of the distance between the true tree and the estimate of the tree, one measure sensitive to accuracy of branch lengths and the other not. The distance-matrix methods (Fitch-Margoliash and neighbor joining) performed best when they were constrained from estimating negative branch lengths; all comparisons with other methods used this constraint. Parsimony and compatibility had similar results, with compatibility generally inferior; Fitch-Margoliash and neighbor joining had similar results, with neighbor joining generally slightly inferior. Maximum likelihood was the most successful method overall, although for short sequences Fitch-Margoliash and neighbor joining were sometimes better. Bias of the estimates was inferred by measuring whether the independent estimates of a tree for different data sets were closer to the true tree than to each other. Parsimony and compatibility had particular difficulty with inaccuracy and bias when substitution rates varied among different branches. When rates of evolution varied among different sites, all methods showed signs of inaccuracy and bias.
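
The abstract above does not name its two tree-distance measures. As an illustration of a measure that ignores branch lengths, the following minimal sketch computes the Robinson-Foulds symmetric difference between two unrooted trees. The choice of this measure, the nested-tuple tree encoding, and the function names are assumptions made for the example, not details taken from the study.

```python
def leaves(tree):
    """Return the set of leaf labels of a tree given as nested tuples of strings."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaves(child) for child in tree))


def splits(tree):
    """Collect the non-trivial bipartitions (splits) induced by internal branches."""
    all_leaves = leaves(tree)
    found = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        clade = frozenset().union(*(visit(child) for child in node))
        other = all_leaves - clade
        # Splits with a single leaf on one side occur in every tree; skip them.
        if len(clade) > 1 and len(other) > 1:
            found.add(frozenset([clade, other]))
        return clade

    visit(tree)
    return found


def rf_distance(tree_a, tree_b):
    """Symmetric difference: number of splits present in exactly one tree."""
    if leaves(tree_a) != leaves(tree_b):
        raise ValueError("trees must have the same leaf set")
    return len(splits(tree_a) ^ splits(tree_b))


true_tree = (("A", "B"), ("C", "D"), "E")
estimate = (("A", "C"), ("B", "D"), "E")
print(rf_distance(true_tree, estimate))  # 4: no internal split is shared
```

Because each split is stored as an unordered pair of leaf sets, the result does not depend on where either tree is rooted.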

4.
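Tian Peng, Liu Zhanlin. 《生物信息学》 (Bioinformatics), 2009, 7(3): 232-233
Building on existing distance methods for phylogenetic tree construction and drawing on parts of the theory behind the NJ and FM methods, we propose a new, simple method based on node introduction. Molecular phylogenetic trees were constructed with this method; the results show that it is faster and that its results are fully consistent with those of the FM method.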

5.
Clearcut: a fast implementation of relaxed neighbor joining   Total citations: 1, self-citations: 0, citations by others: 1
SUMMARY: Clearcut is an open source implementation of the relaxed neighbor joining (RNJ) algorithm. While traditional neighbor joining (NJ) remains a popular method for distance-based phylogenetic tree reconstruction, it suffers from an O(N^3) time complexity, where N represents the number of taxa in the input. Due to this steep asymptotic time complexity, NJ cannot reasonably handle very large datasets. In contrast, RNJ realizes a typical-case time complexity on the order of N^2 log N without any significant qualitative difference in output. RNJ is particularly useful when inferring a very large tree or a large number of trees. In addition, RNJ retains the desirable property that it will always reconstruct the true tree given a matrix of additive pairwise distances. Clearcut implements RNJ as a C program, which takes either a set of aligned sequences or a pre-computed distance matrix as input and produces a phylogenetic tree. Alternatively, Clearcut can reconstruct phylogenies using an extremely fast standard NJ implementation. AVAILABILITY: Clearcut source code is available for download at: http://bioinformatics.hungry.com/clearcut
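
For orientation, the traditional NJ procedure that the abstract contrasts with RNJ can be sketched in a few lines of Python. This is a minimal textbook version, not Clearcut's C code and not the relaxed variant; the function name, the edge-list output, and the `node0`, `node1` labels for internal nodes are assumptions made for the example. Each of the N-2 iterations scans all remaining pairs, which is where the O(N^3) cost comes from.

```python
from itertools import combinations


def neighbor_joining(names, dist):
    """Classic neighbor joining on a symmetric distance matrix.

    Returns the unrooted tree as a list of (node, node, branch_length) edges.
    """
    nodes = list(names)
    # Copy the matrix into a dict of dicts so new internal nodes can be added.
    d = {a: {b: dist[i][j] for j, b in enumerate(names)}
         for i, a in enumerate(names)}
    edges = []
    next_id = 0
    while len(nodes) > 2:
        n = len(nodes)
        # Net divergence of each node from all remaining nodes.
        r = {a: sum(d[a][b] for b in nodes) for a in nodes}
        # Join the pair minimizing Q(a, b) = (n - 2) * d(a, b) - r(a) - r(b).
        a, b = min(
            combinations(nodes, 2),
            key=lambda p: (n - 2) * d[p[0]][p[1]] - r[p[0]] - r[p[1]],
        )
        u = f"node{next_id}"
        next_id += 1
        # Branch lengths from the joined pair to the new internal node u.
        len_a = 0.5 * d[a][b] + (r[a] - r[b]) / (2 * (n - 2))
        len_b = d[a][b] - len_a
        edges += [(a, u, len_a), (b, u, len_b)]
        # Distances from u to every node that is still unjoined.
        d[u] = {u: 0.0}
        for c in nodes:
            if c not in (a, b):
                d[u][c] = d[c][u] = 0.5 * (d[a][c] + d[b][c] - d[a][b])
        nodes = [c for c in nodes if c not in (a, b)] + [u]
    a, b = nodes
    edges.append((a, b, d[a][b]))
    return edges


# Additive distances from a tree with branches A:1, B:2, C:4, D:5
# and an internal branch of length 3 separating {A, B} from {C, D}.
names = ["A", "B", "C", "D"]
dist = [
    [0, 3, 8, 9],
    [3, 0, 9, 10],
    [8, 9, 0, 9],
    [9, 10, 9, 0],
]
for edge in neighbor_joining(names, dist):
    print(edge)
```

On this additive matrix the sketch recovers the generating branch lengths, which illustrates for standard NJ the same guarantee that the abstract states RNJ retains.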

6.
We examine whether phylogenetic methods provide biased estimates of tree shape with respect to the random branching model. We investigate the performance of five commonly used phylogenetic methods using computer simulation: (1) maximum parsimony; (2) neighbor joining; (3) UPGMA with an outgroup taxon; (4) UPGMA without an outgroup taxon; and (5) maximum likelihood. All methods provide estimates of tree shape that are, on average, more asymmetrical than the true tree, especially when rates of evolution are high. We suggest a simple explanation for the bias and propose a modified test of tree shape that corrects for it.

7.
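To explore the influence of evolutionary models on DNA barcode classification, this study obtained COI gene sequences from specimens of 44 species of Noctuidae from Wuling Mountain. Phylogenetic trees were constructed using neighbor joining, maximum parsimony, maximum likelihood, and Bayesian inference, and model success rates were evaluated for 12 models under neighbor joining, 7 models under maximum likelihood, and 2 models under Bayesian inference. The results show that the success rates of the 12 neighbor-joining models differed little and were relatively stable; the success rates of different models under maximum likelihood and Bayesian inference differed markedly and were unstable; maximum parsimony is not model-based, and its success rate was relatively stable. Neighbor joining and maximum likelihood share six models, and the success rates of these six models differed between the two methods. In addition, cases in which a species was represented by only a single sequence significantly reduced model success rates, indicating that DNA barcoding studies need multiple samples per species.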

8.
Finding correct species relationships using phylogeny reconstruction based on molecular data is dependent on several empirical and technical factors. These include the choice of DNA sequence from which phylogeny is to be inferred, the establishment of character homology within a sequence alignment, and the phylogeny algorithm used. Nevertheless, sequencing and phylogeny tools provide a way of testing certain hypotheses regarding the relationship among the organisms for which phenotypic characters demonstrate conflicting evolutionary information. The protozoan family Sarcocystidae is one such group for which molecular data have been applied phylogenetically to resolve questionable relationships. However, analyses carried out to date, particularly based on small-subunit ribosomal DNA, have not resolved all of the relationships within this family. Analysis of more than one gene is necessary in order to obtain a robust species signal, and some DNA sequences may not be appropriate in terms of their phylogenetic information content. With this in mind, we tested the informativeness of our chosen molecule, the large-subunit ribosomal DNA (lsu rDNA), by using subdivisions of the sequence in phylogenetic analysis through PAUP, fastDNAml, and neighbor joining. The segments of sequence applied correspond to areas of higher nucleotide variation in a secondary-structure alignment involving 21 taxa. We found that subdivision of the entire lsu rDNA is inappropriate for phylogenetic analysis of the Sarcocystidae. There are limited informative nucleotide sites in the lsu rDNA for certain clades, such as the one encompassing the subfamily Toxoplasmatinae. Consequently, the removal of any segment of the alignment compromises the final tree topology. We also tested the effect of using two different alignment procedures (CLUSTAL W and the structure alignment using DCSE) and three different tree-building methods on the final tree topology. 
This work shows that congruence between different methods in the formation of clades may be a feature of robust topology; however, a sequence alignment based on primary structure may not be comparing homologous nucleotides even though the expected topology is obtained. Our results support previous findings showing the paraphyly of the current genera Sarcocystis and Hammondia and again bring to question the relationships of Sarcocystis muris, Isospora felis, and Neospora caninum. In addition, results based on phylogenetic analysis of the structure alignment suggest that Sarcocystis zamani and Sarcocystis singaporensis, which have reptilian definitive hosts, are monophyletic with Sarcocystis species using mammalian definitive hosts if the genus Frenkelia is synonymized with Sarcocystis.

9.
A phylogenetic method is a consistent estimator of phylogeny if and only if it is guaranteed to give the correct tree, given that sufficient (possibly infinite) independent data are examined. The following methods are examined for consistency: UPGMA (unweighted pair-group method, averages), NJ (neighbor joining), MF (modified Farris), and P (parsimony). A two-parameter model of nucleotide sequence substitution is used, and the expected distribution of character states is calculated. Without perfect correction for superimposed substitutions, all four methods may be inconsistent if there is but one branch evolving at a faster rate than the other branches. Partial correction of observed distances improves the robustness of the NJ method to rate variation, and perfect correction makes the NJ method a consistent estimator for all combinations of rates that were examined. The sensitivity of all the methods to unequal rates varies over a wide range, so relative-rate tests are unlikely to be a reliable guide for accepting or rejecting phylogenies based on parsimony analysis.
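
To make "correction for superimposed substitutions" concrete, the sketch below estimates a corrected distance between two aligned sequences under the Kimura two-parameter model, d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q), where P and Q are the observed proportions of transitions and transversions. The abstract says only that a two-parameter model was used, so the choice of Kimura's model, the function name, and the handling of gaps are assumptions made for the example.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for x, y in zip(seq1.upper(), seq2.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue  # skip gaps and ambiguity codes
        compared += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1  # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    a = 1 - 2 * p - q
    b = 1 - 2 * q
    if a <= 0 or b <= 0:
        return math.inf  # too saturated for the correction to be defined
    return -0.5 * math.log(a) - 0.25 * math.log(b)


observed = 2 / 10  # one transition and one transversion in ten sites
corrected = k2p_distance("AAAAAAAAAA", "GAAAAAAAAC")
print(observed, corrected)  # the corrected distance exceeds the observed one
```

The corrected value is larger than the raw proportion of differing sites because the model accounts for multiple substitutions at the same site that the raw count cannot see.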

10.
We introduce a distance-based phylogeny reconstruction method called "weighted neighbor joining," or "Weighbor" for short. As in neighbor joining, two taxa are joined in each iteration; however, the Weighbor criterion for choosing a pair of taxa to join takes into account that errors in distance estimates are exponentially larger for longer distances. The criterion embodies a likelihood function on the distances, which are modeled as correlated Gaussian random variables with different means and variances, computed under a probabilistic model for sequence evolution. The Weighbor criterion consists of two terms, an additivity term and a positivity term, that quantify the implications of joining the pair. The first term evaluates deviations from additivity of the implied external branches, while the second term evaluates confidence that the implied internal branch has a positive branch length. Compared with maximum-likelihood phylogeny reconstruction, Weighbor is much faster, while building trees that are qualitatively and quantitatively similar. Weighbor appears to be relatively immune to the "long branches attract" and "long branch distracts" drawbacks observed with neighbor joining, BIONJ, and parsimony.

11.
It is generally accepted that the plastids arose from a cyanobacterial ancestor, but the exact phylogenetic relationships between cyanobacteria and plastids are still controversial. Most studies based on partial 16S rRNA sequences suggested a relatively late origin of plastids within the cyanobacterial divergence. In order to clarify the exact relationship and divergence order of cyanobacteria and plastids, we studied their phylogeny on the basis of nearly complete 16S rRNA gene sequences. The data set comprised 15 strains of cyanobacteria from different morphological groups, 1 prochlorophyte, and plastids belonging to 8 species of plants and 12 species of diverse algae. This set included three cyanobacterial sequences determined in this study. This is the most comprehensive set of complete cyanobacterial and plastidial 16S rRNA sequences used so far. Phylogenetic trees were constructed using neighbor joining and maximum parsimony, and the reliability of the tree topologies was tested by different methods. Our results suggest an early origin of plastids within the cyanobacterial divergence, preceded only by the divergence of two cyanobacterial genera, Gloeobacter and Pseudanabaena.

12.
Our ability to construct very large phylogenetic trees is becoming more important as vast amounts of sequence data are becoming readily available. Neighbor joining (NJ) is a widely used distance-based phylogenetic tree construction method that has historically been considered fast, but it is prohibitively slow for building trees from increasingly large datasets. We developed a fast variant of NJ called relaxed neighbor joining (RNJ) and performed experiments to measure the speed improvement over NJ. Since repeated runs of the RNJ algorithm generate a superset of the trees that repeated NJ runs generate, we also assessed tree quality. RNJ is dramatically faster than NJ, and the quality of resulting trees is very similar for the two algorithms. The results indicate that RNJ is a reasonable alternative to NJ and that it is especially well suited for uses that involve large numbers of taxa or highly repetitive procedures such as bootstrapping. [Reviewing Editor: Dr. James Bull]

13.
With the development of genome sequencing, more whole genomes of microorganisms have been completed, and many methods have been introduced to reconstruct the phylogenetic tree of those microorganisms from information extracted from the whole genomes, using various ways of transforming or mapping the whole genome sequences into other forms that can describe evolutionary distance in a new way. We think it is possible that information buried in the whole genome is transferred along lineages, remains stable, and is more essential than the sequence conservation of individual genes or the arrangement of some genes of a selected set. We need to find one measurement that can involve as many phylogenetic features as possible beyond the genome sequence itself. We converted each genome sequence of the microorganisms into another linear sequence representing the functional structure of the sequence, used a new information function to calculate the discrepancy between sequences and obtain a distance matrix of the genomes, and built a phylogenetic tree with a neighbor joining method. The resulting tree shows that the major lineages are consistent with the result based on their 16S rRNA sequences. Our method discovered one phylogenetic feature, derived from the genome sequences and the encoded genes, that can rebuild the phylogenetic tree correctly. The mapping of a genome sequence to a new form representing the relative positions of the functional genes provides a new way to measure phylogenetic relationships, and with a more specific classification of gene functions the result could be more sensitive.

14.
The molecular phylogenetic relationship between two species of the genus Leiurus from Saudi Arabia is presented, together with additional comparative sequence data available from Egypt, Oman and Turkey. The molecular phylogeny was inferred using maximum parsimony, neighbor joining and Bayesian inference. Our results indicate a clear, deep split between the Western clade, represented by L. quinquestriatus sequences from Egypt, and the Eastern clade, which encompasses different Leiurus species from Saudi Arabia, Oman and Turkey. The phylogenetic relationships also provide additional support for the taxonomic status of Arabian Leiurus species.

15.
We conducted a simulation study of the phylogenetic methods UPGMA, neighbor joining, maximum parsimony, and maximum likelihood for a five-taxon tree under a molecular clock. The parameter space included a small region where maximum parsimony is inconsistent, so we tested inconsistency correction for parsimony and distance correction for neighbor joining. As expected, corrected parsimony was consistent. For these data, maximum likelihood with the clock assumption outperformed each of the other methods tested. The distance-based methods performed marginally better than did maximum parsimony and maximum likelihood without the clock assumption. Data correction was generally detrimental to accuracy, especially for short sequence lengths. We identified another region of the parameter space where, although consistent for a given method, some incorrect trees were each selected with up to twice the frequency of the correct (generating) tree for sequences of bounded length. These incorrect trees are those where the outgroup has been incorrectly placed. In addition to this problem, the placement of the outgroup sequence can have a confounding effect on the ingroup tree, whereby the ingroup is correct when using the ingroup sequences alone, but with the inclusion of the outgroup the ingroup tree becomes incorrect.

16.
Evolution operates on whole genomes through direct rearrangements of genes, such as inversions, transpositions, and inverted transpositions, as well as through operations, such as duplications, losses, and transfers, that also affect the gene content of the genomes. Because these events are rare relative to nucleotide substitutions, gene order data offer the possibility of resolving ancient branches in the tree of life; the combination of gene order data with sequence data also has the potential to provide more robust phylogenetic reconstructions, since each can elucidate evolution at different time scales. Distance corrections greatly improve the accuracy of phylogeny reconstructions from DNA sequences, enabling distance-based methods to approach the accuracy of the more elaborate methods based on parsimony or likelihood at a fraction of the computational cost. This paper focuses on developing distance correction methods for phylogeny reconstruction from whole genomes. The main question we investigate is how to estimate evolutionary histories from whole genomes with equal gene content, and we present a technique, the empirically derived estimator (EDE), that we have developed for this purpose. We study the use of EDE on whole genomes with identical gene content, and we explore the accuracy of phylogenies inferred using EDE with the neighbor joining and minimum evolution methods under a wide range of model conditions. Our study shows that tree reconstruction under these two methods is much more accurate when based on EDE distances than when based on other distances previously suggested for whole genomes. Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users. [Reviewing Editor: Dr. Martin Kreitman]

17.
The Malayan gaur (Bos gaurus hubbacki) is one of the three subspecies of gaurs that can be found in Malaysia. We examined the phylogenetic relationships of this subspecies with other species of the genus Bos (B. javanicus, B. indicus, B. taurus, and B. grunniens). The sequence of a key gene, cytochrome b, was compared among 20 Bos species and the bongo antelope, used as an outgroup. Phylogenetic reconstruction was performed using neighbor joining and maximum parsimony in PAUP and Bayesian inference in MrBayes 3.1. All tree topologies indicated that the Malayan gaur is in its own monophyletic clade, distinct from other species of the genus Bos. We also found significant branching differences in the tree topologies between wild and domestic cattle.

18.
The evolutionary relationships of the thiamine pyrophosphate (TPP)-dependent family of enzymes were investigated by generation of a neighbor joining phylogenetic tree using sequences from the conserved pyrophosphate (PP) and pyrimidine (Pyr) binding domains of 17 TPP-dependent enzymes. This represents the most comprehensive analysis of TPP-dependent enzyme evolution to date. The phylogeny was shown to be robust by comparison with maximum likelihood trees generated for each individual enzyme and also broadly confirms the evolutionary history proposed recently from structural comparisons alone (Duggleby 2006). The phylogeny is most parsimonious with the TPP enzymes having arisen from a homotetramer which subsequently diverged into an α2β2 heterotetramer. The relationship between the PP- and Pyr-domains and the recruitment of additional protein domains was examined using the transketolase C-terminal (TKC)-domain as an example. This domain has been recruited by several members of the family and yet forms no part of the active site and has unknown function. Removal of the TKC-domain was found to increase activity toward β-hydroxypyruvate and glycolaldehyde. Further truncations of the Pyr-domain yielded several variants with retained activity. This suggests that the influence of TKC-domain recruitment on the evolution of the mechanism and specificity of transketolase (TK) has been minor, and that the smallest functioning unit of TK comprises the PP- and Pyr-domains, whose evolutionary histories extend to all TPP-dependent enzymes.

19.
The robustness (sensitivity to violation of assumptions) of the maximum-likelihood and neighbor-joining methods was examined using simulation. Maximum likelihood and neighbor joining were implemented with Jukes-Cantor, Kimura, and gamma models of DNA substitution. Simulations were performed in which the assumptions of the methods were violated to varying degrees on three model four-taxon trees. The performance of the methods was evaluated with respect to ability to correctly estimate the unrooted four-taxon tree. Maximum likelihood outperformed neighbor joining in 29 of the 36 cases in which the assumptions of both methods were satisfied. In 133 of 180 of the simulations in which the assumptions of the maximum-likelihood and neighbor-joining methods were violated, maximum likelihood outperformed neighbor joining. These results are consistent with a general superiority of maximum likelihood over neighbor joining under comparable conditions. They extend and clarify an earlier study that found an advantage for neighbor joining over maximum likelihood for gamma-distributed mutation rates.

20.
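Tissue expression and bioinformatic analysis of GHMP gene family members in Arabidopsis   Total citations: 1, self-citations: 0, citations by others: 1
Twelve members of the GHMP gene family were identified in the whole Arabidopsis genome using bioinformatic methods. Expression of these 12 genes in different tissues was examined by real-time quantitative PCR, and the results show that their expression is tissue-specific. A phylogenetic tree of the GHMP gene family members in Arabidopsis was constructed. Analysis of regulatory elements in the promoter regions showed that most GHMP members contain elements related to light response, the circadian clock, and other stress responses, suggesting that these GHMP gene family members may participate in light signaling, circadian clock, and related stress signal transduction pathways in plants.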
