首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Next-generation sequencing and phylogenomics hold great promise for elucidating complex relationships among large plant families. Here, we performed targeted capture of low copy sequences followed by next-generation sequencing on the Illumina platform in the large and diverse angiosperm family Compositae (Asteraceae). The family is monophyletic, based on morphology and molecular data, yet many areas of the phylogeny have unresolved polytomies and interpreting phylogenetic patterns has been historically difficult. In order to outline a method and provide a framework and for future phylogenetic studies in the Compositae, we sequenced 23 taxa from across the family in which the relationships were well established as well as a member of the sister family Calyceraceae. We generated nuclear data from 795 loci and assembled chloroplast genomes from off-target capture reads enabling the comparison of nuclear and chloroplast genomes for phylogenetic analyses. We also analyzed multi-copy nuclear genes in our data set using a clustering method during orthology detection, and we applied a network approach to these clusters—analyzing all related locus copies. Using these data, we produced hypotheses of phylogenetic relationships employing both a conservative (restricted to only loci with one copy per targeted locus) and a multigene approach (including all copies per targeted locus). The methods and bioinformatics workflow presented here provide a solid foundation for future work aimed at understanding gene family evolution in the Compositae as well as providing a model for phylogenomic analyses in other plant mega-families.  相似文献   

2.
Background and AimsWith the advance of high-throughput sequencing, reduced-representation methods such as target capture sequencing (TCS) emerged as cost-efficient ways of gathering genomic information, particularly from coding regions. As the off-target reads from such sequencing are expected to be similar to genome skimming (GS), we assessed the quality of repeat characterization in plant genomes using these data.MethodsRepeat composition obtained from TCS datasets of five Rhynchospora (Cyperaceae) species were compared with GS data from the same taxa. In addition, a FISH probe was designed based on the most abundant satellite found in the TCS dataset of Rhynchospora cephalotes. Finally, repeat-based phylogenies of the five Rhynchospora species were constructed based on the GS and TCS datasets and the topologies were compared with a gene-alignment-based phylogenetic tree.Key ResultsAll the major repetitive DNA families were identified in TCS, including repeats that showed abundances as low as 0.01 % in the GS data. Rank correlations between GS and TCS repeat abundances were moderately high (r = 0.58–0.85), increasing after filtering out the targeted loci from the raw TCS reads (r = 0.66–0.92). Repeat data obtained by TCS were also reliable in developing a cytogenetic probe of a new variant of the holocentromeric satellite Tyba. Repeat-based phylogenies from TCS data were congruent with those obtained from GS data and the gene-alignment tree.ConclusionsOur results show that off-target TCS reads can be recycled to identify repeats for cyto- and phylogenomic investigations. Given the growing availability of TCS reads, driven by global phylogenomic projects, our strategy represents a way to recycle genomic data and contribute to a better characterization of plant biodiversity.  相似文献   

3.
Native grasslands are one of the most endangered ecosystems in North America. In this study, we examined the ecological and evolutionary roles of endangered and threatened (e/t) grasses by establishing robust evolutionary relationships with other nonthreatened native and introduced grass species of the community. We hypothesized that the phylogenomic distribution of e/t species of grasses in Illinois would be phylogenetically clustered because closely related species would be vulnerable to the same threats and have similar requirements for survival. This study presents the first time a phylogeny based on complete plastome DNA of Poaceae was analyzed by phylogenetic diversity analysis. To avoid the disturbance of e/t populations, DNA was extracted from herbarium specimens. Next‐generation sequencing (NGS) techniques were used to sequence DNA of plastid genomes (plastomes). The resulting phylogenomic tree was analyzed by phylogenetic diversity metrics. The extracted DNA successfully produced complete plastomes demonstrating that herbarium material is a practical source of DNA for genomic studies. The phylogenomic tree was strongly supported and defined Dichanthelium as a separate clade from Panicum. The phylogenetic metrics revealed phylogenetic clustering of e/t species, confirming our hypothesis.  相似文献   

4.
全基因组测序及其在遗传性疾病研究及诊断中的应用   总被引:1,自引:0,他引:1  
邵谦之  姜毅  吴金雨 《遗传》2014,36(11):1087-1098
最近,随着测序成本的不断降低,数据分析策略的不断提升,全基因组测序(whole-genome sequencing,WGS)已经在癌症、孟德尔遗传病、复杂疾病的致病基因检测中得到了一定运用,并逐步走向了临床诊断。全基因组测序不但可以检测编码区和非编码区的点突变(SNVs)和插入缺失(InDels),还可以在全基因组范围内检测拷贝数变异(copy number variation,CNV)以及结构变异(structure variation,SV)。本文详细地介绍了全基因组测序的标准生物信息分析流程与方法,及其在疾病研究、临床诊断中的应用,并对全基因组测序在医学遗传学中的应用与研究进展,以及数据分析方面面临的挑战进行了概述。  相似文献   

5.
The increasing availability of complete genome sequences and the development of new, faster methods for phylogenetic reconstruction allow the exploration of the set of evolutionary trees for each gene in the genome of any species. This has led to the development of new phylogenomic methods. Here, we have compared different phylogenetic and phylogenomic methods in the analysis of the monophyletic origin of insect endosymbionts from the gamma-Proteobacteria, a hotly debated issue with several recent, conflicting reports. We have obtained the phylogenetic tree for each of the 579 identified protein-coding genes in the genome of the primary endosymbiont of carpenter ants, Blochmannia floridanus, after determining their presumed orthologs in 20 additional Proteobacteria genomes. A reference phylogeny reflecting the monophyletic origin of insect endosymbionts was further confirmed with different approaches, which led us to consider it as the presumed species tree. Remarkably, only 43 individual genes produced exactly the same topology as this presumed species tree. Most discrepancies between this tree and those obtained from individual genes or by concatenation of different genes were due to the grouping of Xanthomonadales with beta-Proteobacteria and not to uncertainties over the monophyly of insect endosymbionts. As previously noted, operational genes were more prone to reject the presumed species tree than those included in information-processing categories, but caution should be exerted when selecting genes for phylogenetic inference on the basis of their functional category assignment. We have obtained strong evidence in support of the monophyletic origin of gamma-Proteobacteria insect endosymbionts by a combination of phylogenetic and phylogenomic methods. In our analysis, the use of concatenated genes has shown to be a valuable tool for analyzing primary phylogenetic signals coded in the genomes. Nevertheless, other phylogenomic methods such as supertree approaches were useful in revealing alternative phylogenetic signals and should be included in comprehensive phylogenomic studies.  相似文献   

6.
重建生物进化树一直以来都是进化生物学家的梦想。大量物种全基因组的测序使得我们可以从全基因组水平上构建进化树,来研究各个物种之间的进化关系。本文采用2种统计方法和3种距离计算方法,在全基因组水平上建立基于蛋白质结构的进化树。选取93个物种的全基因组作为分析对象,涵盖了3个超界:真核生物,细菌和古细菌。而结果也正确地将这些物种分为三个大类,每个大分支内部的物种聚类情况也基本和这些物种的形态学分类相吻合。并将这些方法的聚类结果与物种分类的结果相比较,得出丰度的统计方法和基于两向量夹角的距离计算方法这种组合在构建进化树上比其他组合更好。  相似文献   

7.
8.
Recent advances in high‐throughput sequencing library preparation and subgenomic enrichment methods have opened new avenues for population genetics and phylogenetics of nonmodel organisms. To multiplex large numbers of indexed samples while sequencing predominantly orthologous, targeted regions of the genome, we propose modifications to an existing, in‐solution capture that utilizes PCR products as target probes to enrich library pools for the genomic subset of interest. The sequence capture using PCR‐generated probes (SCPP) protocol requires no specialized equipment, is highly flexible and significantly reduces experimental costs for projects where a modest scale of genetic data is optimal (25–100 genomic loci). Our alterations enable application of this method across a wider phylogenetic range of taxa and result in higher capture efficiencies and coverage at each locus. Efficient and consistent capture over multiple SCPP experiments and at various phylogenetic distances is demonstrated, extending the utility of this method to both phylogeographic and phylogenomic studies.  相似文献   

9.
Genome-scale sequence data have become increasingly available in the phylogenetic studies for understanding the evolutionary histories of species. However, it is challenging to develop probabilistic models to account for heterogeneity of phylogenomic data. The multispecies coalescent model describes gene trees as independent random variables generated from a coalescence process occurring along the lineages of the species tree. Since the multispecies coalescent model allows gene trees to vary across genes, coalescent-based methods have been popularly used to account for heterogeneous gene trees in phylogenomic data analysis. In this paper, we summarize and evaluate the performance of coalescent-based methods for estimating species trees from genome-scale sequence data. We investigate the effects of deep coalescence and mutation on the performance of species tree estimation methods. We found that the coalescent-based methods perform well in estimating species trees for a large number of genes, regardless of the degree of deep coalescence and mutation. The performance of the coalescent methods is negatively correlated with the lengths of internal branches of the species tree.  相似文献   

10.
Sequencing them all. That is the ambitious goal of the recently launched Earth BioGenome project (Proceedings of the National Academy of Sciences of the United States of America, 115, 4325–4333), which aims to produce reference genomes for all eukaryotic species within the next decade. In this perspective, we discuss the opportunities of this project with a plant focus, but highlight also potential limitations. This includes the question of how to best capture all plant diversity, as the green taxon is one of the most complex clades in the tree of life, with over 300 000 species. For this, we highlight four key points: (i) the unique biological insights that could be gained from studying plants, (ii) their apparent underrepresentation in sequencing efforts given the number of threatened species, (iii) the necessity of phylogenomic methods that are aware of differences in genome complexity and quality, and (iv) the accounting for within‐species genetic diversity and the historical aspect of conservation genetics.  相似文献   

11.
The paper reviews the current state of low and single copy nuclear markers that have been applied successfully in plant phylogenetics to date, and discusses case studies highlighting the potential of massively parallel high throughput or next-generation sequencing (NGS) approaches for molecular phylogenetic and evolutionary investigations. The current state, prospects and challenges of specific single- or low-copy plant nuclear markers as well as phylogenomic case studies are presented and evaluated.  相似文献   

12.
The paper reviews the current state of low and single copy nuclear markers that have been applied successfully in plant phylogenetics to date, and discusses case studies highlighting the potential of massively parallel high throughput or next-generation sequencing (NGS) approaches for molecular phylogenetic and evolutionary investigations. The current state, prospects and challenges of specific single- or low-copy plant nuclear markers as well as phylogenomic case studies are presented and evaluated.  相似文献   

13.
14.
15.
The kingdom of fungi provides model organisms for biotechnology, cell biology, genetics, and life sciences in general. Only when their phylogenetic relationships are stably resolved, can individual results from fungal research be integrated into a holistic picture of biology. However, and despite recent progress, many deep relationships within the fungi remain unclear. Here, we present the first phylogenomic study of an entire eukaryotic kingdom that uses a consistency criterion to strengthen phylogenetic conclusions. We reason that branches (splits) recovered with independent data and different tree reconstruction methods are likely to reflect true evolutionary relationships. Two complementary phylogenomic data sets based on 99 fungal genomes and 109 fungal expressed sequence tag (EST) sets analyzed with four different tree reconstruction methods shed light from different angles on the fungal tree of life. Eleven additional data sets address specifically the phylogenetic position of Blastocladiomycota, Ustilaginomycotina, and Dothideomycetes, respectively. The combined evidence from the resulting trees supports the deep-level stability of the fungal groups toward a comprehensive natural system of the fungi. In addition, our analysis reveals methodologically interesting aspects. Enrichment for EST encoded data-a common practice in phylogenomic analyses-introduces a strong bias toward slowly evolving and functionally correlated genes. Consequently, the generalization of phylogenomic data sets as collections of randomly selected genes cannot be taken for granted. A thorough characterization of the data to assess possible influences on the tree reconstruction should therefore become a standard in phylogenomic analyses.  相似文献   

16.
Plastid sequencing is an essential tool in the study of plant evolution. This high‐copy organelle is one of the most technically accessible regions of the genome, and its sequence conservation makes it a valuable region for comparative genome evolution, phylogenetic analysis and population studies. Here, we discuss recent innovations and approaches for de novo plastid assembly that harness genomic tools. We focus on technical developments including low‐cost sequence library preparation approaches for genome skimming, enrichment via hybrid baits and methylation‐sensitive capture, sequence platforms with higher read outputs and longer read lengths, and automated tools for assembly. These developments allow for a much more streamlined assembly than via conventional short‐range PCR. Although newer methods make complete plastid sequencing possible for any land plant or green alga, there are still challenges for producing finished plastomes particularly from herbarium material or from structurally divergent plastids such as those of parasitic plants.  相似文献   

17.
Whole genome sequencing is helping generate robust phylogenetic hypotheses for a range of taxonomic groups that were previously recalcitrant to classical molecular phylogenetic approaches. As a case study, we performed a shallow shotgun sequencing of eight species in the tropical tree family Chrysobalanaceae to retrieve large fragments of high‐copy number DNA regions and test the potential of these regions for phylogeny reconstruction. We were able to assemble the nuclear ribosomal cluster (nrDNA), the complete plastid genome (ptDNA) and a large fraction of the mitochondrial genome (mtDNA) with approximately 1000×, 450× and 120× sequencing depth respectively. The phylogenetic tree obtained with ptDNA resolved five of the seven internal nodes. In contrast, the tree obtained with mtDNA and nrDNA data were largely unresolved. This study demonstrates that genome skimming is a cost‐effective approach and shows potential in plant molecular systematics within Chrysobalanaceae and other under‐studied groups.  相似文献   

18.
19.
Given the considerable promise whole-genome sequencing offers for phylogeny and classification, it is surprising that microbial systematics and genomics have not yet been reconciled. This might be due to the intrinsic difficulties in inferring reasonable phylogenies from genomic sequences, particularly in the light of the significant amount of lateral gene transfer in prokaryotic genomes. However, recent studies indicate that the species tree and the hierarchical classification based on it are still meaningful concepts, and that state-of-the-art phylogenetic inference methods are able to provide reliable estimates of the species tree to the benefit of taxonomy. Conversely, we suspect that the current lack of completely sequenced genomes for many of the major lineages of prokaryotes and for most type strains is a major obstacle in progress towards a genome-based classification of microorganisms. We conclude that phylogeny-driven microbial genome sequencing projects such as the Genomic Encyclopaedia of Archaea and Bacteria (GEBA) project are likely to rectify this situation.  相似文献   

20.
Double-barreled (DB) data have been widely used for the assembly of large genomes. Based on the experience of building the whole-genome working draft of Oryza sativa L.ssp. Indica, we present here the prevailing and improved uses of DB data in the assembly procedure and report on novel applications during the following data-mining processes such as acquiring precise insert fragment information of each clone across the genome, and a new kind of Iow-cost whole-genome microarray. With the increasing number of organisms being sequenced,we believe that DB data will play an important role both in other assembly procedures and infuture genomic studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号