首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Next‐generation sequencing (NGS) is emerging as an efficient and cost‐effective tool in population genomic analyses of nonmodel organisms, allowing simultaneous resequencing of many regions of multi‐genomic DNA from multiplexed samples. Here, we detail our synthesis of protocols for targeted resequencing of mitochondrial and nuclear loci by generating indexed genomic libraries for multiplexing up to 100 individuals in a single sequencing pool, and then enriching the pooled library using custom DNA capture arrays. Our use of DNA sequence from one species to capture and enrich the sequencing libraries of another species (i.e. cross‐species DNA capture) indicates that efficient enrichment occurs when sequences are up to about 12% divergent, allowing us to take advantage of genomic information in one species to sequence orthologous regions in related species. In addition to a complete mitochondrial genome on each array, we have included between 43 and 118 nuclear loci for low‐coverage sequencing of between 18 kb and 87 kb of DNA sequence per individual for single nucleotide polymorphisms discovery from 50 to 100 individuals in a single sequencing lane. Using this method, we have generated a total of over 500 whole mitochondrial genomes from seven cetacean species and green sea turtles. The greater variation detected in mitogenomes relative to short mtDNA sequences is helping to resolve genetic structure ranging from geographic to species‐level differences. These NGS and analysis techniques have allowed for simultaneous population genomic studies of mtDNA and nDNA with greater genomic coverage and phylogeographic resolution than has previously been possible in marine mammals and turtles.  相似文献   

2.
This is a time of unprecedented transition in DNA sequencing technologies. Next-generation sequencing (NGS) clearly holds promise for fast and cost-effective generation of multilocus sequence data for phylogeography and phylogenetics. However, the focus on non-model organisms, in addition to uncertainty about which sample preparation methods and analyses are appropriate for different research questions and evolutionary timescales, have contributed to a lag in the application of NGS to these fields. Here, we outline some of the major obstacles specific to the application of NGS to phylogeography and phylogenetics, including the focus on non-model organisms, the necessity of obtaining orthologous loci in a cost-effective manner, and the predominate use of gene trees in these fields. We describe the most promising methods of sample preparation that address these challenges. Methods that reduce the genome by restriction digest and manual size selection are most appropriate for studies at the intraspecific level, whereas methods that target specific genomic regions (i.e., target enrichment or sequence capture) have wider applicability from the population level to deep-level phylogenomics. Additionally, we give an overview of how to analyze NGS data to arrive at data sets applicable to the standard toolkit of phylogeography and phylogenetics, including initial data processing to alignment and genotype calling (both SNPs and loci involving many SNPs). Even though whole-genome sequencing is likely to become affordable rather soon, because phylogeography and phylogenetics rely on analysis of hundreds of individuals in many cases, methods that reduce the genome to a subset of loci should remain more cost-effective for some time to come.  相似文献   

3.
Contemporary crop improvement relies on the genetic analysis of progeny derived from a cross between different lines with contrasting phenotypes. Such analysis allowed positioning of genes for agronomically important traits, enabling development of DNA makers for marker-assisted selection (MAS). So far the identification of loci for desirable traits have been carried out by linkage analysis using DNA markers. This process required the development of DNA markers that are distributed over the genome as well as the genotyping of each progeny. Due to recent development in next generation sequencing (NGS) technology, whole genome sequencing (WGS) is becoming easier and cheaper. Using NGS, we developed a new method called MutMap that allows rapid isolation of useful alleles from rice mutant lines. An important feature of MutMap is that it does not require marker development. We foresee that the era of genetic markers will be eventually eclipsed by that of WGS applied to all the individuals in the breeding processes.  相似文献   

4.
Twyford AD  Ennos RA 《Heredity》2012,108(3):179-189
Hybridization has a major role in evolution-from the introgression of important phenotypic traits between species, to the creation of new species through hybrid speciation. Molecular studies of hybridization aim to understand the class of hybrids and the frequency of introgression, detect the signature of ancient hybridization, and understand the behaviour of introgressed loci in their new genomic background. This often involves a large investment in the design and application of molecular markers, leading to a compromise between the depth and breadth of genomic data. New techniques designed to assay a large sub-section of the genome, in association with next-generation sequencing (NGS) technologies, will allow genome-wide hybridization and introgression studies in organisms with no prior sequence data. These detailed genotypic data will unite the breadth of sampling of loci characteristic of population genetics with the depth of sequence information associated with molecular phylogenetics. In this review, we assess the theoretical and methodological constraints that limit our understanding of natural hybridization, and promote the use of NGS for detecting hybridization and introgression between non-model organisms. We also make recommendations for the ways in which emerging techniques, such as pooled barcoded amplicon sequencing and restriction site-associated DNA tags, should be used to overcome current limitations, and enhance our understanding of this evolutionary significant process.  相似文献   

5.
Molecular markers produced by next‐generation sequencing (NGS) technologies are revolutionizing genetic research. However, the costs of analysing large numbers of individual genomes remain prohibitive for most population genetics studies. Here, we present results based on mathematical derivations showing that, under many realistic experimental designs, NGS of DNA pools from diploid individuals allows to estimate the allele frequencies at single nucleotide polymorphisms (SNPs) with at least the same accuracy as individual‐based analyses, for considerably lower library construction and sequencing efforts. These findings remain true when taking into account the possibility of substantially unequal contributions of each individual to the final pool of sequence reads. We propose the intuitive notion of effective pool size to account for unequal pooling and derive a Bayesian hierarchical model to estimate this parameter directly from the data. We provide a user‐friendly application assessing the accuracy of allele frequency estimation from both pool‐ and individual‐based NGS population data under various sampling, sequencing depth and experimental error designs. We illustrate our findings with theoretical examples and real data sets corresponding to SNP loci obtained using restriction site–associated DNA (RAD) sequencing in pool‐ and individual‐based experiments carried out on the same population of the pine processionary moth (Thaumetopoea pityocampa). NGS of DNA pools might not be optimal for all types of studies but provides a cost‐effective approach for estimating allele frequencies for very large numbers of SNPs. It thus allows comparison of genome‐wide patterns of genetic variation for large numbers of individuals in multiple populations.  相似文献   

6.
Next generation sequencing (NGS) is revolutionizing genomics and is providing novel insights into genome organization, evolution and function. The number of plant genomes targeted for sequencing is rising. For the moment, however, the acquisition of full genome sequences in large genome species remains difficult, largely because the short reads produced by NGS platforms are inadequate to cope with repeat-rich DNA, which forms a large part of these genomes. The problem of sequence redundancy is compounded in polyploids, which dominate the plant kingdom. An approach to overcoming some of these difficulties is to reduce the full nuclear genome to its individual chromosomes using flow-sorting. The DNA acquired in this way has proven to be suitable for many applications, including PCR-based physical mapping, in situ hybridization, forming DNA arrays, the development of DNA markers, the construction of BAC libraries and positional cloning. Coupling chromosome sorting with NGS offers opportunities for the study of genome organization at the single chromosomal level, for comparative analyses between related species and for the validation of whole genome assemblies. Apart from the primary aim of reducing the complexity of the template, taking a chromosome-based approach enables independent teams to work in parallel, each tasked with the analysis of a different chromosome(s). Given that the number of plant species tractable for chromosome sorting is increasing, the likelihood is that chromosome genomics – the marriage of cytology and genomics – will make a significant contribution to the field of plant genetics.  相似文献   

7.
8.
9.
Siberian stone pine, Pinus sibirica Du Tour is one of the most economically and environmentally important forest-forming species of conifers in Russia. To study these forests a large number of highly polymorphic molecular genetic markers, such as microsatellite loci, are required. Prior to the new high-throughput next generation sequencing (NGS) methods, discovery of microsatellite loci and development of micro-satellite markers were very time consuming and laborious. The recently developed draft assembly of the Siberian stone pine genome, sequenced using the NGS methods, allowed us to identify a large number of microsatellite loci in the Siberian stone pine genome and to develop species-specific PCR primers for amplification and genotyping of 70 microsatellite loci. The primers were designed using contigs containing short simple sequence tandem repeats from the Siberian stone pine whole genome draft assembly. Based on the testing of primers for 70 microsatellite loci with tri-, tetra- or pentanucleotide repeats, 18 most promising, reliable and polymorphic loci were selected that can be used further as molecular genetic markers in population genetic studies of Siberian stone pine.  相似文献   

10.
Next-generation sequencing (NGS) technologies have enabled high-throughput and low-cost generation of sequence data; however, de novo genome assembly remains a great challenge, particularly for large genomes. NGS short reads are often insufficient to create large contigs that span repeat sequences and to facilitate unambiguous assembly. Plant genomes are notorious for containing high quantities of repetitive elements, which combined with huge genome sizes, makes accurate assembly of these large and complex genomes intractable thus far. Using two-color genome mapping of tiling bacterial artificial chromosomes (BAC) clones on nanochannel arrays, we completed high-confidence assembly of a 2.1-Mb, highly repetitive region in the large and complex genome of Aegilops tauschii, the D-genome donor of hexaploid wheat (Triticum aestivum). Genome mapping is based on direct visualization of sequence motifs on single DNA molecules hundreds of kilobases in length. With the genome map as a scaffold, we anchored unplaced sequence contigs, validated the initial draft assembly, and resolved instances of misassembly, some involving contigs <2 kb long, to dramatically improve the assembly from 75% to 95% complete.  相似文献   

11.
Unlocking the vast genomic diversity stored in natural history collections would create unprecedented opportunities for genome-scale evolutionary, phylogenetic, domestication and population genomic studies. Many researchers have been discouraged from using historical specimens in molecular studies because of both generally limited success of DNA extraction and the challenges associated with PCR-amplifying highly degraded DNA. In today''s next-generation sequencing (NGS) world, opportunities and prospects for historical DNA have changed dramatically, as most NGS methods are actually designed for taking short fragmented DNA molecules as templates. Here we show that using a standard multiplex and paired-end Illumina sequencing approach, genome-scale sequence data can be generated reliably from dry-preserved plant, fungal and insect specimens collected up to 115 years ago, and with minimal destructive sampling. Using a reference-based assembly approach, we were able to produce the entire nuclear genome of a 43-year-old Arabidopsis thaliana (Brassicaceae) herbarium specimen with high and uniform sequence coverage. Nuclear genome sequences of three fungal specimens of 22–82 years of age (Agaricus bisporus, Laccaria bicolor, Pleurotus ostreatus) were generated with 81.4–97.9% exome coverage. Complete organellar genome sequences were assembled for all specimens. Using de novo assembly we retrieved between 16.2–71.0% of coding sequence regions, and hence remain somewhat cautious about prospects for de novo genome assembly from historical specimens. Non-target sequence contaminations were observed in 2 of our insect museum specimens. We anticipate that future museum genomics projects will perhaps not generate entire genome sequences in all cases (our specimens contained relatively small and low-complexity genomes), but at least generating vital comparative genomic data for testing (phylo)genetic, demographic and genetic hypotheses, that become increasingly more horizontal. Furthermore, NGS of historical DNA enables recovering crucial genetic information from old type specimens that to date have remained mostly unutilized and, thus, opens up a new frontier for taxonomic research as well.  相似文献   

12.
Next generation sequencing (NGS) technology has had a transformatory effect upon population-level studies linking genetic variation to gene function. In this review, I briefly describe recent studies that have used top-down genome scanning and population genetic approaches to identify loci under recent selection, as well as some examples of how large NGS datasets can be deployed to detect the total level of deleterious, neutral and advantageous variation present in standing genetic variation. I then explore studies that have used some of these approaches to study gene function along with advances in sequencing populations under selection, QTL mapping techniques and emerging methodologies utilising targeted capture and NGS.  相似文献   

13.
14.

Background

The relatively short read lengths from next generation sequencing (NGS) technologies still pose a challenge for de novo assembly of complex mammal genomes. One important solution is to use paired-end (PE) sequence information experimentally obtained from long-range DNA fragments (>1 kb). Here, we characterize and extend a long-range PE library construction method based on direct intra-molecule ligation (or molecular linker-free circularization) for NGS.

Results

We found that the method performs stably for PE sequencing of 2- to 5- kb DNA fragments, and can be extended to 10–20 kb (and even in extremes, up to ∼35 kb). We also characterized the impact of low quality input DNA on the method, and develop a whole-genome amplification (WGA) based protocol using limited input DNA (<1 µg). Using this PE dataset, we accurately assembled the YanHuang (YH) genome, the first sequenced Asian genome, into a scaffold N50 size of >2 Mb, which is over100-times greater than the initial size produced with only small insert PE reads(17 kb). In addition, we mapped two 7- to 8- kb insertions in the YH genome using the larger insert sizes of the long-range PE data.

Conclusions

In conclusion, we demonstrate here the effectiveness of this long-range PE sequencing method and its use for the de novo assembly of a large, complex genome using NGS short reads.  相似文献   

15.
The roles of intra- and interspecific gene flow in speciation and species evolution are topics of great current interest in molecular ecology and evolutionary biology. Recent modelling studies call for new empirical data to test hypotheses arising from the recent shift from a 'whole-genome reproductive isolation' view to a 'genic' view of species and speciation. Particularly scarce (and thus of particular interest) are molecular genetic data on recently radiated, naturally hybridizing species in strongly structured and species-rich environments. Here, we studied four sympatric plant species (Pitcairnia spp.; Bromeliaceae) adapted to Neotropical inselbergs (isolated outcrops resembling habitat 'islands' in tropical rainforests) using nuclear and plastid DNA. Patterns of plastid DNA haplotype sharing and nuclear genomic admixture suggest the presence of both, incomplete lineage sorting and interspecific gene flow over extended periods of time. Integrity and cohesion of inselberg species of Pitcairnia are maintained despite introgression and in the face of extremely low within-species migration rates (N(e)m < 1 migrant per generation). Cross-evaluation of our genetic data against published pollination experiments indicate that species integrity is maintained by the simultaneous action of multiple prezygotic barriers, including flowering phenology, pollinator isolation and divergent mating systems. Postzygotic Bateson-Dobzhansky-Muller incompatibilities appear to contribute to isolation, as suggested by asymmetric introgression rates of single loci. Our results suggest that incomplete lineage sorting, hybridization and introgression form integral aspects of adaptive radiation in Neotropical inselberg 'archipelagos'. Inselbergs with multiple closely related co-occurring species should be of special interest to students of speciation in mountain systems, and to ongoing conservation programmes in the Atlantic Rainforest biodiversity hotspot.  相似文献   

16.
Pleistocene glacial cycles are thought to have played a major role in the diversification of temperate and boreal species of North American birds. Given that coalescence times between sister taxa typically range from 0.1 to 2.0 Myr, it has been assumed that diversification occurred as populations were isolated in refugia over long periods of time, probably spanning one to several full glacial cycles. In contrast, the rapid postglacial range expansions and recolonization of northern latitudes following glacial maxima have received less attention as potential promoters of speciation. Here we report a case of extremely rapid diversification in the songbird genus Junco as a result of a single continent-wide range expansion within the last 10 000 years. Molecular data from 264 juncos sampled throughout their range reveal that as the yellow-eyed junco (Junco phaeonotus) of Mesoamerica expanded northward following the last glacial maximum, it speciated into the dark-eyed junco (Junco hyemalis), which subsequently diversified itself into at least five markedly distinct and geographically structured morphotypes in the USA and Canada. Patterns of low genetic structure and diversity in mitochondrial DNA and amplified fragment length polymorphism loci found in dark-eyed juncos relative to Mesoamerican yellow-eyed juncos provide support for the hypothesis of an expansion from the south, followed by rapid diversification in the north. These results underscore the role of postglacial expansions in promoting diversification and speciation through a mechanism that represents an alternative to traditional modes of Pleistocene speciation.  相似文献   

17.
Rapid development of next generation sequencing (NGS) technologies in recent years has made whole genome sequencing of bacterial genomes widely accessible. However, it is often unnecessary or not feasible to sequence the whole genome for most applications of genetic analyses in bacteria. Selectively capturing defined genomic regions followed by NGS analysis could be a promising approach for high-resolution molecular typing of a large set of strains. In this study, we describe a novel and straightforward PCR-based target-capturing method, hairpin-primed multiplex amplification (HPMA), which allows for simultaneous amplification of numerous target genes. To test the feasibility of NGS-based strain typing using HPMA, 20 target gene sequences were simultaneously amplified with barcode tagging in each of 41 Salmonella strains. The amplicons were then pooled and analyzed by 454 pyrosequencing. Analysis of the sequence data, as an extension of multilocus sequence typing (MLST), demonstrated the utility and potential of this novel typing method, MLST-seq, as a high-resolution strain typing method. With the rapidly increasing sequencing capacity of NGS, MLST-seq or its variations using different target enrichment methods can be expected to become a high-resolution typing method in the near future for high-throughput analysis of a large collection of bacterial strains.  相似文献   

18.
This article reviews basic concepts,general applications,and the potential impact of next-generation sequencing(NGS)technologies on genomics,with particular reference to currently available and possible future platforms and bioinformatics.NGS technologies have demonstrated the capacity to sequence DNA at unprecedented speed,thereby enabling previously unimaginable scientific achievements and novel biological applications.But,the massive data produced by NGS also presents a significant challenge for data storage,analyses,and management solutions.Advanced bioinformatic tools are essential for the successful application of NGS technology.As evidenced throughout this review,NGS technologies will have a striking impact on genomic research and the entire biological field.With its ability to tackle the unsolved challenges unconquered by previous genomic technologies,NGS is likely to unravel the complexity of the human genome in terms of genetic variations,some of which may be confined to susceptible loci for some common human conditions.The impact of NGS technologies on genomics will be far reaching and likely change the field for years to come.  相似文献   

19.
Predicting protein domains is essential for understanding a protein’s function at the molecular level. However, up till now, there has been no direct and straightforward method for predicting protein domains in species without a reference genome sequence. In this study, we developed a functionality with a set of programs that can predict protein domains directly from genomic sequence data without a reference genome. Using whole genome sequence data, the programming functionality mainly comprised DNA assembly in combination with next-generation sequencing (NGS) assembly methods and traditional methods, peptide prediction and protein domain prediction. The proposed new functionality avoids problems associated with de novo assembly due to micro reads and small single repeats. Furthermore, we applied our functionality for the prediction of leucine rich repeat (LRR) domains in four species of Ficus with no reference genome, based on NGS genomic data. We found that the LRRNT_2 and LRR_8 domains are related to plant transpiration efficiency, as indicated by the stomata index, in the four species of Ficus. The programming functionality established in this study provides new insights for protein domain prediction, which is particularly timely in the current age of NGS data expansion.  相似文献   

20.
Next‐generation sequencing technologies now allow researchers of non‐model systems to perform genome‐based studies without the requirement of a (often unavailable) closely related genomic reference. We evaluated the role of restriction endonuclease (RE) selection in double‐digest restriction‐site‐associated DNA sequencing (ddRADseq) by generating reduced representation genome‐wide data using four different RE combinations. Our expectation was that RE selections targeting longer, more complex restriction sites would recover fewer loci than RE with shorter, less complex sites. We sequenced a diverse sample of non‐model arachnids, including five congeneric pairs of harvestmen (Opiliones) and four pairs of spiders (Araneae). Sample pairs consisted of either conspecifics or closely related congeneric taxa, and in total 26 sample pair analyses were tested. Sequence demultiplexing, read clustering and variant calling were performed in the pyRAD program. The 6‐base pair cutter EcoRI combined with methylated site‐specific 4‐base pair cutter MspI produced, on average, the greatest numbers of intra‐individual loci and shared loci per sample pair. As expected, the number of shared loci recovered for a sample pair covaried with the degree of genetic divergence, estimated with cytochrome oxidase I sequences, although this relationship was non‐linear. Our comparative results will prove useful in guiding protocol selection for ddRADseq experiments on many arachnid taxa where reference genomes, even from closely related species, are unavailable.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号