首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 93 毫秒
1.
We have used three independent phylogenomic approaches (concatenated alignments, single-, and multi-gene supertrees) to reconstruct the fungal tree of life (FTOL) using publicly available fungal genomes. This is the first time multi-gene families have been used in fungal supertree reconstruction and permits us to use up to 66% of the 1,001,217 genes in our fungal database. Our analyses show that different phylogenomic datasets derived from varying clustering criteria and alignment orientation do not have a major effect on phylogenomic supertree reconstruction. Overall the resultant phylogenomic trees are relatively congruent with one another and successfully recover the major fungal phyla, subphyla and classes. We find that where incongruences do occur, the inferences are usually poorly supported. Within the Ascomycota phylum, our phylogenies reconstruct monophyletic Saccharomycotina and Pezizomycotina subphyla clades and infer a sister group relationship between these to the exclusion of the Taphrinomycotina. Within the Pezizomycotina subphylum, all three phylogenies infer a sister group relationship between the Leotiomycetes and Sordariomycetes classes. However, there is conflict regarding the relationships with the Dothideomycetes and Eurotiomycetes classes. Within the Basidiomycota phylum, supertrees derived from single- and multi-gene families infer a sister group relationship between the Pucciniomycotina and Agaricomycotina subphyla while the concatenated phylogeny infers a poorly supported relationship between the Agaricomycotina and Ustilagomycotina. The reconstruction of a robust FTOL is important for future fungal comparative analyses. We illustrate this point by performing a preliminary investigation into the phyletic distribution of yeast prion-like proteins in the fungal kingdom.  相似文献   

2.
Remipedes are a small and enigmatic group of crustaceans, first described only 30 years ago. Analyses of both morphological and molecular data have recently suggested a close relationship between Remipedia and Hexapoda. If true, the remipedes occupy an important position in pancrustacean evolution and may be pivotal for understanding the evolutionary history of crustaceans and hexapods. However, it is important to test this hypothesis using new data and new types of analytical approaches. Here, we assembled a phylogenomic data set of 131 taxa, incorporating newly generated 454 expressed sequence tag (EST) data from six species of crustaceans, representing five lineages (Remipedia, Laevicaudata, Spinicaudata, Ostracoda, and Malacostraca). This data set includes all crustacean species for which EST data are available (46 species), and our largest alignment encompasses 866,479 amino acid positions and 1,886 genes. A series of phylogenomic analyses was performed to evaluate pancrustacean relationships. We significantly improved the quality of our data for predicting putative orthologous genes and for generating data subsets by matrix reduction procedures, thereby improving the signal to noise ratio in the data. Eight different data sets were constructed, representing various combinations of orthologous genes, data subsets, and taxa. Our results demonstrate that the different ways to compile an initial data set of core orthologs and the selection of data subsets by matrix reduction can have marked effects on the reconstructed phylogenetic trees. Nonetheless, all eight data sets strongly support Pancrustacea with Remipedia as the sister group to Hexapoda. This is the first time that a sister group relationship of Remipedia and Hexapoda has been inferred using a comprehensive phylogenomic data set that is based on EST data. We also show that selecting data subsets with increased overall signal can help to identify and prevent artifacts in phylogenetic analyses.  相似文献   

3.
The increasing availability of complete genome sequences and the development of new, faster methods for phylogenetic reconstruction allow the exploration of the set of evolutionary trees for each gene in the genome of any species. This has led to the development of new phylogenomic methods. Here, we have compared different phylogenetic and phylogenomic methods in the analysis of the monophyletic origin of insect endosymbionts from the gamma-Proteobacteria, a hotly debated issue with several recent, conflicting reports. We have obtained the phylogenetic tree for each of the 579 identified protein-coding genes in the genome of the primary endosymbiont of carpenter ants, Blochmannia floridanus, after determining their presumed orthologs in 20 additional Proteobacteria genomes. A reference phylogeny reflecting the monophyletic origin of insect endosymbionts was further confirmed with different approaches, which led us to consider it as the presumed species tree. Remarkably, only 43 individual genes produced exactly the same topology as this presumed species tree. Most discrepancies between this tree and those obtained from individual genes or by concatenation of different genes were due to the grouping of Xanthomonadales with beta-Proteobacteria and not to uncertainties over the monophyly of insect endosymbionts. As previously noted, operational genes were more prone to reject the presumed species tree than those included in information-processing categories, but caution should be exerted when selecting genes for phylogenetic inference on the basis of their functional category assignment. We have obtained strong evidence in support of the monophyletic origin of gamma-Proteobacteria insect endosymbionts by a combination of phylogenetic and phylogenomic methods. In our analysis, the use of concatenated genes has shown to be a valuable tool for analyzing primary phylogenetic signals coded in the genomes. Nevertheless, other phylogenomic methods such as supertree approaches were useful in revealing alternative phylogenetic signals and should be included in comprehensive phylogenomic studies.  相似文献   

4.
Ding G  Yu Z  Zhao J  Wang Z  Li Y  Xing X  Wang C  Liu L  Li Y 《PloS one》2008,3(10):e3357
Efforts in phylogenomics have greatly improved our understanding of the backbone tree of life. However, due to the systematic error in sequence data, a sequence-based phylogenomic approach leads to well-resolved but statistically significant incongruence. Thus, independent test of current phylogenetic knowledge is required. Here, we have devised a distance-based strategy to reconstruct a highly resolved backbone tree of life, on the basis of the genome context networks of 195 fully sequenced representative species. Along with strongly supporting the monophylies of three superkingdoms and most taxonomic sub-divisions, the derived tree also suggests some intriguing results, such as high G+C gram positive origin of Bacteria, classification of Symbiobacterium thermophilum and Alcanivorax borkumensis in Firmicutes. Furthermore, simulation analyses indicate that addition of more gene relationships with high accuracy can greatly improve the resolution of the phylogenetic tree. Our results demonstrate the feasibility of the reconstruction of highly resolved phylogenetic tree with extensible gene networks across all three domains of life. This strategy also implies that the relationships between the genes (gene network) can define what kind of species it is.  相似文献   

5.
Reconstructing the evolutionary relationships of species is a major goal in biology. Despite the increasing number of completely sequenced genomes, a large number of phylogenetic projects rely on targeted sequencing and analysis of a relatively small sample of marker genes. The selection of these phylogenetic markers should ideally be based on accurate predictions of their combined, rather than individual, potential to accurately resolve the phylogeny of interest. Here we present and validate a new phylogenomics strategy to efficiently select a minimal set of stable markers able to reconstruct the underlying species phylogeny. In contrast to previous approaches, our methodology does not only rely on the ability of individual genes to reconstruct a known phylogeny, but it also explores the combined power of sets of concatenated genes to accurately infer phylogenetic relationships of species not previously analyzed. We applied our approach to two broad sets of cyanobacterial and ascomycetous fungal species, and provide two minimal sets of six and four genes, respectively, necessary to fully resolve the target phylogenies. This approach paves the way for the informed selection of phylogenetic markers in the effort of reconstructing the tree of life.  相似文献   

6.
The advances accelerated by next-generation sequencing and long-read sequencing technologies continue to provide an impetus for plant phylogenetic study.In the past decade,a large number of phylogenetic studies adopting hundreds to thousands of genes across a wealth of clades have emerged and ushered plant phylogenetics and evolution into a new era.In the meantime,a roadmap for researchers when making decisions across different approaches for their phylogenomic research design is imminent.This r...  相似文献   

7.
Phylogenomics reveal a robust fungal tree of life   总被引:3,自引:0,他引:3  
Our understanding of the tree of life (TOL) is still fragmentary. Until recently, molecular phylogeneticists have built trees based on ribosomal RNA sequences and selected protein sequences, which, however, usually suffered from lack of support for the deeper branches and inconsistencies probably due to limited subsampling of the entire genome. Now, phylogenetic hypotheses can be based on the analysis of full genomes. We used available complete genome data as well as the eukaryote orthologous group (KOG) proteins to reconstruct with confidence basal branches of the fungal TOL. Phylogenetic analysis of a core of 531 KOGs shared among 21 fungal genomes, three animal genomes and one plant genome showed a single tree with high support resulting from four different methods of phylogenetic reconstruction. The single tree that we inferred from our dataset showed excellent nodal support for each branch, suggesting that it reflects the true phylogenetic relationships of the species involved.  相似文献   

8.
In this study, we investigated the relationships among insect orders with a main focus on Polyneoptera (lower Neoptera: roaches, mantids, earwigs, grasshoppers, etc.), and Paraneoptera (thrips, lice, bugs in the wide sense). The relationships between and within these groups of insects are difficult to resolve because only few informative molecular and morphological characters are available. Here, we provide the first phylogenomic expressed sequence tags data ('EST': short sub-sequences from a c(opy) DNA sequence encoding for proteins) for stick insects (Phasmatodea) and webspinners (Embioptera) to complete published EST data. As recent EST datasets are characterized by a heterogeneous distribution of available genes across taxa, we use different rationales to optimize the data matrix composition. Our results suggest a monophyletic origin of Polyneoptera and Eumetabola (Paraneoptera + Holometabola). However, we identified artefacts of tree reconstruction (human louse Pediculus humanus assigned to Odonata (damselflies and dragonflies) or Holometabola (insects with a complete metamorphosis); mayfly genus Baetis nested within Neoptera), which were most probably rooted in a data matrix composition bias due to the inclusion of sequence data of entire proteomes. Until entire proteomes are available for each species in phylogenomic analyses, this potential pitfall should be carefully considered.  相似文献   

9.

Background  

To date, most fungal phylogenies have been derived from single gene comparisons, or from concatenated alignments of a small number of genes. The increase in fungal genome sequencing presents an opportunity to reconstruct evolutionary events using entire genomes. As a tool for future comparative, phylogenomic and phylogenetic studies, we used both supertrees and concatenated alignments to infer relationships between 42 species of fungi for which complete genome sequences are available.  相似文献   

10.

Background

Genome level analyses have enhanced our view of phylogenetics in many areas of the tree of life. With the production of whole genome DNA sequences of hundreds of organisms and large-scale EST databases a large number of candidate genes for inclusion into phylogenetic analysis have become available. In this work, we exploit the burgeoning genomic data being generated for plant genomes to address one of the more important plant phylogenetic questions concerning the hierarchical relationships of the several major seed plant lineages (angiosperms, Cycadales, Gingkoales, Gnetales, and Coniferales), which continues to be a work in progress, despite numerous studies using single, few or several genes and morphology datasets. Although most recent studies support the notion that gymnosperms and angiosperms are monophyletic and sister groups, they differ on the topological arrangements within each major group.

Methodology

We exploited the EST database to construct a supermatrix of DNA sequences (over 1,200 concatenated orthologous gene partitions for 17 taxa) to examine non-flowering seed plant relationships. This analysis employed programs that offer rapid and robust orthology determination of novel, short sequences from plant ESTs based on reference seed plant genomes. Our phylogenetic analysis retrieved an unbiased (with respect to gene choice), well-resolved and highly supported phylogenetic hypothesis that was robust to various outgroup combinations.

Conclusions

We evaluated character support and the relative contribution of numerous variables (e.g. gene number, missing data, partitioning schemes, taxon sampling and outgroup choice) on tree topology, stability and support metrics. Our results indicate that while missing characters and order of addition of genes to an analysis do not influence branch support, inadequate taxon sampling and limited choice of outgroup(s) can lead to spurious inference of phylogeny when dealing with phylogenomic scale data sets. As expected, support and resolution increases significantly as more informative characters are added, until reaching a threshold, beyond which support metrics stabilize, and the effect of adding conflicting characters is minimized.  相似文献   

11.
生命之树的概念由达尔文在1859年提出, 用以反映分类群的亲缘关系和进化历史。近30年来, 随着建树性状种类的多样化、数据量的快速增长以及建树方法的不断发展和完善, 生命之树的规模越来越大, 可信度也越来越高。分子生物学、生态学、基因组学、生物信息学及计算机科学等的快速发展, 使得生命之树成为开展学科间交叉研究的桥梁, 其用途日益广泛。本文综述了生命之树研究的历史和现状, 介绍了生命之树在以下几个方面的应用: (1)通过构建不同尺度的生命之树, 理解生物类群间的系统发育关系; (2)通过时间估算和地理分布区重建, 推测现存生物的起源和地理分布格局及其成因; (3)基于时间树, 结合生态、环境因子及关键创新性状, 探讨生物的多样化进程和成因; (4)揭示生物多样性的来源和格局, 预测生物多样性动态变化, 并提出相应的保护策略。最后, 本文评估了生命之树在目前海量数据情况下遇到的序列比对困难、基因树冲突、“流浪类群”干扰等建树难题, 并指出了构建“超大树”的发展趋势。  相似文献   

12.
MOTIVATION: Phylogenomics integrates the vast amount of phylogenetic information contained in complete genome sequences, and is rapidly becoming the standard for reliably inferring species phylogenies. There are, however, fundamental differences between the ways in which phylogenomic approaches like gene content, superalignment, superdistance and supertree integrate the phylogenetic information from separate orthologous groups. Furthermore, they all depend on the method by which the orthologous groups are initially determined. Here, we systematically compare these four phylogenomic approaches, in parallel with three approaches for large-scale orthology determination: pairwise orthology, cluster orthology and tree-based orthology. RESULTS: Including various phylogenetic methods, we apply a total of 54 fully automated phylogenomic procedures to the fungi, the eukaryotic clade with the largest number of sequenced genomes, for which we retrieved a golden standard phylogeny from the literature. Phylogenomic trees based on gene content show, relative to the other methods, a bias in the tree topology that parallels convergence in lifestyle among the species compared, indicating convergence in gene content. CONCLUSIONS: Complete genomes are no guarantee for good or even consistent phylogenies. However, the large amounts of data in genomes enable us to carefully select the data most suitable for phylogenomic inference. In terms of performance, the superalignment approach, combined with restrictive orthology, is the most successful in recovering a fungal phylogeny that agrees with current taxonomic views, and allows us to obtain a high-resolution phylogeny. We provide solid support for what has grown to be a common practice in phylogenomics during its advance in recent years. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

13.
Hess J  Goldman N 《PloS one》2011,6(8):e22783
Phylogenomic approaches to the resolution of inter-species relationships have become well established in recent years. Often these involve concatenation of many orthologous genes found in the respective genomes followed by analysis using standard phylogenetic models. Genome-scale data promise increased resolution by minimising sampling error, yet are associated with well-known but often inappropriately addressed caveats arising through data heterogeneity and model violation. These can lead to the reconstruction of highly-supported but incorrect topologies. With the aim of obtaining a species tree for 18 species within the ascomycetous yeasts, we have investigated the use of appropriate evolutionary models to address inter-gene heterogeneities and the scalability and validity of supermatrix analysis as the phylogenetic problem becomes more difficult and the number of genes analysed approaches truly phylogenomic dimensions. We have extended a widely-known early phylogenomic study of yeasts by adding additional species to increase diversity and augmenting the number of genes under analysis. We have investigated sophisticated maximum likelihood analyses, considering not only a concatenated version of the data but also partitioned models where each gene constitutes a partition and parameters are free to vary between the different partitions (thereby accounting for variation in the evolutionary processes at different loci). We find considerable increases in likelihood using these complex models, arguing for the need for appropriate models when analyzing phylogenomic data. Using these methods, we were able to reconstruct a well-supported tree for 18 ascomycetous yeasts spanning about 250 million years of evolution.  相似文献   

14.

Background

The availability of sequences from whole genomes to reconstruct the tree of life has the potential to enable the development of phylogenomic hypotheses in ways that have not been before possible. A significant bottleneck in the analysis of genomic-scale views of the tree of life is the time required for manual curation of genomic data into multi-gene phylogenetic matrices.

Results

To keep pace with the exponentially growing volume of molecular data in the genomic era, we have developed an automated technique, ASAP (Automated Simultaneous Analysis Phylogenetics), to assemble these multigene/multi species matrices and to evaluate the significance of individual genes within the context of a given phylogenetic hypothesis.

Conclusion

Applications of ASAP may enable scientists to re-evaluate species relationships and to develop new phylogenomic hypotheses based on genome-scale data.  相似文献   

15.
The early diversification of angiosperms is thought to have been a rapid process, which may complicate phylogenetic analyses of early angiosperm relationships. Plastid and nuclear phylogenomic studies have raised several conflicting hypotheses regarding overall angiosperm phylogeny, but mitochondrial genomes have been largely ignored as a relevant source of information. Here we sequenced mitochondrial genomes from 18 angiosperms to fill taxon-sampling gaps in Austrobaileyales, magnoliids, Chloranthales, Ceratophyllales, and major lineages of eudicots and monocots. We assembled a data matrix of 38 mitochondrial genes from 107 taxa to assess how well mitochondrial genomic data address current uncertainties in angiosperm relationships. Although we recovered conflicting phylogenies based on different data sets and analytical methods, we also observed congruence regarding deep relationships of several major angiosperm lineages: Chloranthales were always inferred to be the sister group of Ceratophyllales, Austrobaileyales to mesangiosperms, and the unplaced Dilleniales was consistently resolved as the sister to superasterids. Substitutional saturation, GC compositional heterogeneity, and codon-usage bias are possible reasons for the noise/conflict that may impact phylogenetic reconstruction; and angiosperm mitochondrial genes may not be substantially affected by these factors. The third codon positions of the mitochondrial genes appear to contain more parsimony-informative sites than the first and second codon positions, and therefore produced better resolved phylogenetic relationships with generally strong support. The relationships among these major lineages remain incompletely resolved, perhaps as a result of the rapidity of early radiations. Nevertheless, data from mitochondrial genomes provide additional evidence and alternative hypotheses for exploring the early evolution and diversification of the angiosperms.  相似文献   

16.
Research Coordination Networks: a phylogeny for kingdom Fungi (Deep Hypha)   总被引:3,自引:0,他引:3  
Research in fungal phylogenetics and systematics progressed rapidly in the past decade due to advances in DNA sequencing technologies and analytical methods. A newfound wealth of sequence data acquired through community-wide initiatives has advanced the process of acquiring a stable phylogenetic classification of many fungal taxa. Financial support from the National Science Foundation Research Coordination Networks: a phylogeny for kingdom Fungi (Deep Hypha) for 5 y enabled more than 100 fungal systematists to assess the taxon sampling, molecular markers and analytical methods necessary to facilitate such a project. Later a second NSF program provided financial support for the Assembling the Fungal Tree of Life (AFTOL) project to accomplish much of the research. Deep Hypha may be viewed as an involved parent of AFTOL with a continuing role as coordinator of likeminded workers. Many questions posed at the beginning of the Deep Hypha project have been addressed, at least in part, although some details remain to be clarified. Many of the main branches of the fungal tree are stable and well supported, often as a result of multigene analyses that involved collaboration of many laboratories. More work is necessary, however, to resolve certain branching events near the base of the tree, as well as to reconstruct relationships in some terminal groups. The phylogenetic classification in this issue of Mycologia is a product of the AFTOL project and many other independent research initiatives, and it is an initial synthesis of a working classification designed to be used for all major publications that require a phylogenetic classification of fungi.  相似文献   

17.
In the fungal kingdom, the ability to cause disease in plants appears to have arisen multiple times during evolution. In many cases, the ability to infect particular plant species depends on specific genes that distinguish virulent fungi from their sometimes closely related nonvirulent relatives. These genes encode host-determining "virulence factors," including small, secreted proteins and enzymes involved in the synthesis of toxins. These virulence factors typically are involved in evolutionary arms races between plants and pathogens. We briefly summarize current knowledge of these virulence factors from several fungal species in terms of function, phylogenetic distribution, sequence variation, and genomic location. Second, we address some issues that are relevant to the evolution of virulence in fungi toward plants; in particular, horizontal gene transfer and the genomic organization of virulence genes.  相似文献   

18.
油茶是我国热带和亚热带地区广泛栽培的一类重要的经济树种,其种子可生产优质食用油(茶油)。为了比较全面地了解油茶的内生真菌多样性,采用基于rDNA ITS的免培养法从油茶叶片中提取了总DNA,再从总DNA样本中直接扩增了内生真菌核糖体RNA基因内转录间隔区序列(ITS),进而构建了ITS克隆文库。共计从50个克隆子中获得了22种不同的克隆序列,其中3个为嵌合体,7个归属为植物,其余克隆序列根据序列相似性和系统发育分析归为12个不同的分类操作单元(OTUs),全部为子囊菌,分属于4纲4目,其中Aspergillus属2种,Mycosphaerellaceae科1种,Sordariales目4种,Helotiales目5种,Aspergillus属真菌是优势菌。结果表明油茶叶片内生真菌的种类分布较广。  相似文献   

19.
MOTIVATION: A major issue in computational biology is the reconstruction of pathways from several genomic datasets, such as expression data, protein interaction data and phylogenetic profiles. As a first step toward this goal, it is important to investigate the amount of correlation which exists between these data. RESULTS: These methods are successfully tested on their ability to recognize operons in the Escherichia coli genome, from the comparison of three datasets corresponding to functional relationships between genes in metabolic pathways, geometrical relationships along the chromosome, and co-expression relationships as observed by gene expression data.  相似文献   

20.

Background  

Large multigene sequence alignments have over recent years been increasingly employed for phylogenomic reconstruction of the eukaryote tree of life. Such supermatrices of sequence data are preferred over single gene alignments as they contain vastly more information about ancient sequence characteristics, and are thus more suitable for resolving deeply diverging relationships. However, as alignments are expanded, increasingly numbers of sites with misleading phylogenetic information are also added. Therefore, a major goal in phylogenomic analyses is to maximize the ratio of information to noise; this can be achieved by the reduction of fast evolving sites.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号