首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
Next‐generation sequencing (NGS) is emerging as an efficient and cost‐effective tool in population genomic analyses of nonmodel organisms, allowing simultaneous resequencing of many regions of multi‐genomic DNA from multiplexed samples. Here, we detail our synthesis of protocols for targeted resequencing of mitochondrial and nuclear loci by generating indexed genomic libraries for multiplexing up to 100 individuals in a single sequencing pool, and then enriching the pooled library using custom DNA capture arrays. Our use of DNA sequence from one species to capture and enrich the sequencing libraries of another species (i.e. cross‐species DNA capture) indicates that efficient enrichment occurs when sequences are up to about 12% divergent, allowing us to take advantage of genomic information in one species to sequence orthologous regions in related species. In addition to a complete mitochondrial genome on each array, we have included between 43 and 118 nuclear loci for low‐coverage sequencing of between 18 kb and 87 kb of DNA sequence per individual for single nucleotide polymorphisms discovery from 50 to 100 individuals in a single sequencing lane. Using this method, we have generated a total of over 500 whole mitochondrial genomes from seven cetacean species and green sea turtles. The greater variation detected in mitogenomes relative to short mtDNA sequences is helping to resolve genetic structure ranging from geographic to species‐level differences. These NGS and analysis techniques have allowed for simultaneous population genomic studies of mtDNA and nDNA with greater genomic coverage and phylogeographic resolution than has previously been possible in marine mammals and turtles.  相似文献   

3.
4.
The increased numbers of genetic markers produced by genomic techniques have the potential to both identify hybrid individuals and localize chromosomal regions responding to selection and contributing to introgression. We used restriction-site-associated DNA sequencing to identify a dense set of candidate SNP loci with fixed allelic differences between introduced rainbow trout (Oncorhynchus mykiss) and native westslope cutthroat trout (Oncorhynchus clarkii lewisi). We distinguished candidate SNPs from homeologs (paralogs resulting from whole-genome duplication) by detecting excessively high observed heterozygosity and deviations from Hardy-Weinberg proportions. We identified 2923 candidate species-specific SNPs from a single Illumina sequencing lane containing 24 barcode-labelled individuals. Published sequence data and ongoing genome sequencing of rainbow trout will allow physical mapping of SNP loci for genome-wide scans and will also provide flanking sequence for design of qPCR-based TaqMan(?) assays for high-throughput, low-cost hybrid identification using a subset of 50-100 loci. This study demonstrates that it is now feasible to identify thousands of informative SNPs in nonmodel species quickly and at reasonable cost, even if no prior genomic information is available.  相似文献   

5.
6.
With the availability of next-generation sequencing (NGS) technology, it is expected that sequence variants may be called on a genomic scale. Here, we demonstrate that a deeper understanding of the distribution of the variant call frequencies at heterozygous loci in NGS data sets is a prerequisite for sensitive variant detection. We model the crucial steps in an NGS protocol as a stochastic branching process and derive a mathematical framework for the expected distribution of alleles at heterozygous loci before measurement that is sequencing. We confirm our theoretical results by analyzing technical replicates of human exome data and demonstrate that the variance of allele frequencies at heterozygous loci is higher than expected by a simple binomial distribution. Due to this high variance, mutation callers relying on binomial distributed priors are less sensitive for heterozygous variants that deviate strongly from the expected mean frequency. Our results also indicate that error rates can be reduced to a greater degree by technical replicates than by increasing sequencing depth.  相似文献   

7.
8.
9.
The maintenance or breakdown of reproductive isolation is an observable outcome of secondary contact between species. In cases where hybrids beyond the F1 are formed, the representation of each species' ancestry can vary dramatically among genomic regions. This genomic heterogeneity in ancestry and introgression can offer insight into evolutionary processes, particularly if introgression is compared in multiple hybrid zones. Similarly, considerable heterogeneity exists across the genome in the extent to which populations and species have diverged, reflecting the combined effects of different evolutionary processes on genetic variation. We studied hybridization across two hybrid zones of two phenotypically well‐differentiated bird species in Mexico (Pipilo maculatus and P. ocai), to investigate genomic heterogeneity in differentiation and introgression. Using genotyping‐by‐sequencing (GBS) and hierarchical Bayesian models, we genotyped 460 birds at over 41 000 single nucleotide polymorphism (SNP) loci. We identified loci exhibiting extreme introgression relative to the genome‐wide expectation using a Bayesian genomic cline model. We also estimated locus‐specific FST and identified loci with exceptionally high genetic divergence between the parental species. We found some concordance of locus‐specific introgression in the two independent hybrid zones (6–20% of extreme loci shared across zones), reflecting areas of the genome that experience similar gene flow when the species interact. Additionally, heterogeneity in introgression and divergence across the genome revealed another subset of loci under the influence of locally specific factors. These results are consistent with a history in which reproductive isolation has been influenced by a common set of loci in both hybrid zones, but where local environmental and stochastic factors also lead to genomic differentiation.  相似文献   

10.
This is a time of unprecedented transition in DNA sequencing technologies. Next-generation sequencing (NGS) clearly holds promise for fast and cost-effective generation of multilocus sequence data for phylogeography and phylogenetics. However, the focus on non-model organisms, in addition to uncertainty about which sample preparation methods and analyses are appropriate for different research questions and evolutionary timescales, have contributed to a lag in the application of NGS to these fields. Here, we outline some of the major obstacles specific to the application of NGS to phylogeography and phylogenetics, including the focus on non-model organisms, the necessity of obtaining orthologous loci in a cost-effective manner, and the predominate use of gene trees in these fields. We describe the most promising methods of sample preparation that address these challenges. Methods that reduce the genome by restriction digest and manual size selection are most appropriate for studies at the intraspecific level, whereas methods that target specific genomic regions (i.e., target enrichment or sequence capture) have wider applicability from the population level to deep-level phylogenomics. Additionally, we give an overview of how to analyze NGS data to arrive at data sets applicable to the standard toolkit of phylogeography and phylogenetics, including initial data processing to alignment and genotype calling (both SNPs and loci involving many SNPs). Even though whole-genome sequencing is likely to become affordable rather soon, because phylogeography and phylogenetics rely on analysis of hundreds of individuals in many cases, methods that reduce the genome to a subset of loci should remain more cost-effective for some time to come.  相似文献   

11.
Simple sequence repeats (SSRs) are widely used genetic markers in ecology, evolution, and conservation even in the genomics era, while a general limitation to their application is the difficulty of developing polymorphic SSR markers. Next‐generation sequencing (NGS) offers the opportunity for the rapid development of SSRs; however, previous studies developing SSRs using genomic data from only one individual need redundant experiments to test the polymorphisms of SSRs. In this study, we designed a pipeline for the rapid development of polymorphic SSR markers from multi‐sample genomic data. We used bioinformatic software to genotype multiple individuals using resequencing data, detected highly polymorphic SSRs prior to experimental validation, significantly improved the efficiency and reduced the experimental effort. The pipeline was successfully applied to a globally threatened species, the brown eared‐pheasant (Crossoptilon mantchuricum), which showed very low genomic diversity. The 20 newly developed SSR markers were highly polymorphic, the average number of alleles was much higher than the genomic average. We also evaluated the effect of the number of individuals and sequencing depth on the SSR mining results, and we found that 10 individuals and ~10X sequencing data were enough to obtain a sufficient number of polymorphic SSRs, even for species with low genetic diversity. Furthermore, the genome assembly of NGS data from the optimal number of individuals and sequencing depth can be used as an alternative reference genome if a high‐quality genome is not available. Our pipeline provided a paradigm for the application of NGS technology to mining and developing molecular markers for ecological and evolutionary studies.  相似文献   

12.
This article reviews basic concepts,general applications,and the potential impact of next-generation sequencing(NGS)technologies on genomics,with particular reference to currently available and possible future platforms and bioinformatics.NGS technologies have demonstrated the capacity to sequence DNA at unprecedented speed,thereby enabling previously unimaginable scientific achievements and novel biological applications.But,the massive data produced by NGS also presents a significant challenge for data storage,analyses,and management solutions.Advanced bioinformatic tools are essential for the successful application of NGS technology.As evidenced throughout this review,NGS technologies will have a striking impact on genomic research and the entire biological field.With its ability to tackle the unsolved challenges unconquered by previous genomic technologies,NGS is likely to unravel the complexity of the human genome in terms of genetic variations,some of which may be confined to susceptible loci for some common human conditions.The impact of NGS technologies on genomics will be far reaching and likely change the field for years to come.  相似文献   

13.
Predicting protein domains is essential for understanding a protein’s function at the molecular level. However, up till now, there has been no direct and straightforward method for predicting protein domains in species without a reference genome sequence. In this study, we developed a functionality with a set of programs that can predict protein domains directly from genomic sequence data without a reference genome. Using whole genome sequence data, the programming functionality mainly comprised DNA assembly in combination with next-generation sequencing (NGS) assembly methods and traditional methods, peptide prediction and protein domain prediction. The proposed new functionality avoids problems associated with de novo assembly due to micro reads and small single repeats. Furthermore, we applied our functionality for the prediction of leucine rich repeat (LRR) domains in four species of Ficus with no reference genome, based on NGS genomic data. We found that the LRRNT_2 and LRR_8 domains are related to plant transpiration efficiency, as indicated by the stomata index, in the four species of Ficus. The programming functionality established in this study provides new insights for protein domain prediction, which is particularly timely in the current age of NGS data expansion.  相似文献   

14.
Closely related marine species with large overlapping ranges provide opportunities to study mechanisms of speciation, particularly when there is evidence of gene flow between such lineages. Here, we focus on a case of hybridization between the sympatric sister‐species Haemulon maculicauda and H. flaviguttatum, using Sanger sequencing of mitochondrial and nuclear loci, as well as 2422 single nucleotide polymorphisms (SNPs) obtained via restriction site‐associated DNA sequencing (RADSeq). Mitochondrial markers revealed a shared haplotype for COI and low divergence for CytB and CR between the sister‐species. On the other hand, complete lineage sorting was observed at the nuclear loci and most of the SNPs. Under neutral expectations, the smaller effective population size of mtDNA should lead to fixation of mutations faster than nDNA. Thus, these results suggest that hybridization in the recent past (0.174–0.263 Ma) led to introgression of the mtDNA, with little effect on the nuclear genome. Analyses of the SNP data revealed 28 loci potentially under divergent selection between the two species. The combination of mtDNA introgression and limited nuclear DNA introgression provides a mechanism for the evolution of independent lineages despite recurrent hybridization events. This study adds to the growing body of research that exemplifies how genetic divergence can be maintained in the presence of gene flow between closely related species.  相似文献   

15.
16.
ABSTRACT: BACKGROUND: Introgression likely plays a significant role in evolution, but understanding the extent and consequences of this process requires a clear identification of species boundaries in each focal group. The delimitation of species, however, is a contentious endeavor. This is true not only because of the inadequacy of current tools to identify species lineages, but also because of the inherent ambiguity between natural populations and species paradigms. The result has been a debate about the supremacy of various species concepts and criteria. Here, we utilized multiple separate sources of molecular data, mtDNA, nuclear sequences, and microsatellites, to delimit species under a polytypic species concept (PTSC) and estimate the frequency and genomic extent of introgression in a Neotropical genus of cichlid fishes (Cichla). We compared our inferences of species boundaries and introgression under this paradigm to those when species are identified under a diagnostic species concept (DSC). RESULTS: We find that, based on extensive molecular data and an inclusive species concept, 8 separate biological entities should be recognized rather than the 15 described species of Cichla. Under the PTSC, fewer individuals are expected to exhibit hybrid ancestry than under the DSC (~2% vs. ~12%), but more of the species exhibit introgression from at least one other species (75% vs. 60%). Under either species concept, the phylogenetic breadth of introgression in this group is notable, with both sister species and species from different major mtDNA clades exhibiting introgression. CONCLUSIONS: Introgression was observed to be a widespread phenomenon for delimited species in this group. While several instances of introgressive hybridization were observed in anthropogenically altered habitats, most were found in undisturbed natural habitats, suggesting that introgression is a natural but ephemeral part of the evolution of many tropical species. Nevertheless, even transient introgression may facilitate an increase in genetic diversity or transfer of adaptive mutations that have important consequences in the evolution of tropical biodiversity.  相似文献   

17.
In recently diverged species, ancestral polymorphism and introgression can cause incongruence between gene and species trees. In the face of hybridization, few genomic regions may exhibit reciprocal monophyly, and these regions, usually evolving rapidly under selection, may be important for the maintenance of species boundaries. In animals with internal fertilization, genes encoding seminal protein are candidate barrier genes. Recently diverged hybridizing species such as the field crickets Gryllus firmus and G. pennsylvanicus , offer excellent opportunities to investigate the origins of barriers to gene exchange. These recently diverged species form a well-characterized hybrid zone, and share ancestral polymorphisms across the genome. We analyzed DNA sequence divergence for seminal protein loci, housekeeping loci, and mtDNA, using a combination of analytical approaches and extensive sampling across both species and the hybrid zone. We report discordant genealogical patterns and differential introgression rates across the genome. The most dramatic outliers, showing near-zero introgression and more structured species trees, are also the only two seminal protein loci under selection. These are candidate barrier genes with possible reproductive functions. We also use genealogical data to examine the demographic history of the field crickets and the current structure of the hybrid zone.  相似文献   

18.
19.
Numerous fungal morphospecies include cryptic species that routinely are detected by sequencing a few unlinked DNA loci. However, whether the patterns observed by multi-locus sequencing are compatible with genome wide data, such as amplified fragment length polymorphisms (AFLPs), is not well known for fungi. In this study we compared the ability of three DNA loci and AFLP data to discern between cryptic fungal lineages in the three morphospecies Coniophora olivacea, Coniophora arida, and Coniophora puteana. The sequences and the AFLP data were highly congruent in delimiting the morphotaxa into multiple cryptic species. However, while the DNA sequences indicated introgression or hybridization between some of the cryptic lineages the AFLP data did not. We conclude that as few as three polymorphic DNA loci was sufficient to recognize cryptic lineages within the studied Coniophora taxa. However, based on analyses of a few (three) sequenced loci the hybridization could not easily be distinguished from incomplete lineage sorting. Hence, great caution should be taken when concluding about hybridization based on data from just a few loci.  相似文献   

20.
Salmonid fishes rank among species being most severely affected by introgressive hybridization as a result of a long tradition of stocking with hatchery-reared conspecifics. The objectives of this study were (i) to evaluate the genetic consequences of stocking and resulting introgression rates on the genetic integrity of natural populations of brook charr, (ii) to identify genomic regions potentially associated with adaptation to natural and artificial rearing environments, and (iii) to test the null hypothesis that introgression from domesticated brook charr into wild populations is homogeneous among loci. A total of 336 individuals were sampled from nine lakes, which were stocked at different intensities with domestic fish. Individuals were genotyped at 280 SNPs located in transcribed regions and developed by means of next-generation sequencing. As previously reported with microsatellites, we observed a positive relationship between stocking intensity and genetic diversity among stocking groups, and a decrease in population differentiation. Individual admixture proportions also increased with stocking intensity. Moreover, genomic cline analysis revealed 27 SNPs, seven of which were also identified as outliers in a genome scan, which showed an introgression rate either more restricted or enhanced relative to neutral expectations. This indicated that selection, mainly for growth-related biological processes, has favored or hampered the introgression of genomic blocks into the introgressed wild populations. Overall, this study highlights the usefulness of investigating the impact of stocking on the dynamics of introgression of potentially adaptive genetic variation to better understand the consequences of such practice on the genomic integrity of wild populations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号