首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 359 毫秒
1.
For many taxa, including isomorphic haplodiplontic macroalgae, determining sex and ploidy is challenging, thereby limiting the scope of some population demographic and genetic studies. Here, we used double‐digest restriction site‐associated DNA sequencing (ddRAD‐seq) to identify sex‐linked molecular markers in the widespread red alga Agarophyton vermiculophyllum. In the ddRAD‐seq library, we included 10 female gametophytes, 10 male gametophytes, and 16 tetrasporophytes from one native and one non‐native site (N = 40 gametophytes and N = 32 tetrasporophytes total). We identified seven putatively female‐linked and 19 putatively male‐linked sequences. Four female‐ and eight male‐linked markers amplified in all three life cycle stages. Using one female‐ and one male‐linked marker that were sex‐specific, we developed a duplex PCR and tested the efficacy of this assay on a subset of thalli sampled at two sites in the non‐native range. We confirmed ploidy based on the visual observation of reproductive structures and previous microsatellite genotyping at 10 polymorphic loci. For 32 vegetative thalli, we were able to assign sex and confirm ploidy in these previously genotyped thalli. These markers will be integral to ongoing studies of A. vermiculophyllum invasion. We discuss the utility of RAD‐seq over other approaches previously used, such as RAPDs (random amplified polymorphic DNA), for future work designing sex‐linked markers in other haplodiplontic macroalgae for which genomes are lacking.  相似文献   

2.
Information on genetic relationships among individuals is essential to many studies of the behaviour and ecology of wild organisms. Parentage and relatedness assays based on large numbers of single nucleotide polymorphism (SNP) loci hold substantial advantages over the microsatellite markers traditionally used for these purposes. We present a double‐digest restriction site‐associated DNA sequencing (ddRAD‐seq) analysis pipeline that, as such, simultaneously achieves the SNP discovery and genotyping steps and which is optimized to return a statistically powerful set of SNP markers (typically 150–600 after stringent filtering) from large numbers of individuals (up to 240 per run). We explore the trade‐offs inherent in this approach through a set of experiments in a species with a complex social system, the variegated fairy‐wren (Malurus lamberti) and further validate it in a phylogenetically broad set of other bird species. Through direct comparisons with a parallel data set from a robust panel of highly variable microsatellite markers, we show that this ddRAD‐seq approach results in substantially improved power to discriminate among potential relatives and considerably more precise estimates of relatedness coefficients. The pipeline is designed to be universally applicable to all bird species (and with minor modifications to many other taxa), to be cost‐ and time‐efficient, and to be replicable across independent runs such that genotype data from different study periods can be combined and analysed as field samples are accumulated.  相似文献   

3.
Data from a large‐scale restriction site‐associated DNA sequencing (RAD‐Seq) study of nine butterflyfish species in the Red Sea and Arabian Sea provided a means to test the utility of a recently published draft genome (Chaetodon austriacus) and assess apparent bias in this method of isolating nuclear loci. We here processed double‐digest restriction site‐associated DNA (ddRAD) sequencing data to identify single nucleotide polymorphism (SNP) markers and their associated function with and without our reference genome to see whether it improves the quality of RAD‐Seq. Our analyses indicate (i) a modest gap between the number of nonannotated versus annotated SNPs across all species, (ii) an advantage of using genomic resources for closely related but not distantly related butterflyfish species based on the ability to assign putative gene function to SNPs and (iii) an enrichment of genes among sister butterflyfish taxa related to calcium transmembrane transport and binding. The latter result highlights the potential for this approach to reveal insights into adaptive mechanisms in populations inhabiting challenging coral reef environments such as the Red Sea, Arabian Sea and Arabian Gulf with further study.  相似文献   

4.
A major goal of evolutionary biology is to identify the genome‐level targets of natural and sexual selection. With the advent of next‐generation sequencing, whole‐genome selection components analysis provides a promising avenue in the search for loci affected by selection in nature. Here, we implement a genome‐wide selection components analysis in the sex role reversed Gulf pipefish, Syngnathus scovelli. Our approach involves a double‐digest restriction‐site associated DNA sequencing (ddRAD‐seq) technique, applied to adult females, nonpregnant males, pregnant males, and their offspring. An FST comparison of allele frequencies among these groups reveals 47 genomic regions putatively experiencing sexual selection, as well as 468 regions showing a signature of differential viability selection between males and females. A complementary likelihood ratio test identifies similar patterns in the data as the FST analysis. Sexual selection and viability selection both tend to favor the rare alleles in the population. Ultimately, we conclude that genome‐wide selection components analysis can be a useful tool to complement other approaches in the effort to pinpoint genome‐level targets of selection in the wild.  相似文献   

5.
Parentage assignment is defined as the identification of the true parents of one focal offspring among a list of candidates and has been commonly used in zoological, ecological, and agricultural studies. Although likelihood‐based parentage assignment is the preferred method in most cases, it requires genotyping a predefined set of DNA markers and providing their population allele frequencies. In the present study, we proposed an alternative method of parentage assignment that does not depend on genotype data and prior information of allele frequencies. Our method employs the restriction site‐associated DNA sequencing (RAD‐seq) reads for clustering into the overlapped RAD loci among the compared individuals, following which the likelihood ratio of parentage assignment could be directly calculated using two parameters—the genome heterozygosity and error rate of sequencing reads. This method was validated on one simulated and two real data sets with the accurate assignment of true parents to focal offspring. However, our method could not provide a statistical confidence to conclude that the first ranked candidate is a true parent.  相似文献   

6.
Research in evolutionary biology involving nonmodel organisms is rapidly shifting from using traditional molecular markers such as mtDNA and microsatellites to higher throughput SNP genotyping methodologies to address questions in population genetics, phylogenetics and genetic mapping. Restriction site associated DNA sequencing (RAD sequencing or RADseq) has become an established method for SNP genotyping on Illumina sequencing platforms. Here, we developed a protocol and adapters for double‐digest RAD sequencing for Ion Torrent (Life Technologies; Ion Proton, Ion PGM) semiconductor sequencing. We sequenced thirteen genomic libraries of three different nonmodel vertebrate species on Ion Proton with PI chips: Arctic charr Salvelinus alpinus, European whitefish Coregonus lavaretus and common lizard Zootoca vivipara. This resulted in ~962 million single‐end reads overall and a mean of ~74 million reads per library. We filtered the genomic data using Stacks, a bioinformatic tool to process RAD sequencing data. On average, we obtained ~11 000 polymorphic loci per library of 6–30 individuals. We validate our new method by technical and biological replication, by reconstructing phylogenetic relationships, and using a hybrid genetic cross to track genomic variants. Finally, we discuss the differences between using the different sequencing platforms in the context of RAD sequencing, assessing possible advantages and disadvantages. We show that our protocol can be used for Ion semiconductor sequencing platforms for the rapid and cost‐effective generation of variable and reproducible genetic markers.  相似文献   

7.
8.
Interpretation of high‐throughput sequence data requires an understanding of how decisions made during bioinformatic data processing can influence results. One source of bias that is often cited is PCR clones (or PCR duplicates). PCR clones are common in restriction site‐associated sequencing (RAD‐seq) data sets, which are increasingly being used for molecular ecology. To determine the influence PCR clones and the bioinformatic handling of clones have on genotyping, we evaluate four RAD‐seq data sets. Data sets were compared before and after clones were removed to estimate the number of clones present in RAD‐seq data, quantify how often the presence of clones in a data set causes genotype calls to change compared to when clones were removed, investigate the mechanisms that lead to genotype call changes and test whether clones bias heterozygosity estimates. Our RAD‐seq data sets contained 30%–60% PCR clones, but 95% of RAD‐tags had five or fewer clones. Relatively few genotypes changed once clones were removed (5%–10%), and the vast majority of these changes (98%) were associated with genotypes switching from a called to no‐call state or vice versa. PCR clones had a larger influence on genotype calls in individuals with low read depth but appeared to influence genotype calls at all loci similarly. Removal of PCR clones reduced the number of called genotypes by 2% but had almost no influence on estimates of heterozygosity. As such, while steps should be taken to limit PCR clones during library preparation, PCR clones are likely not a substantial source of bias for most RAD‐seq studies.  相似文献   

9.
A major barrier to evolutionary studies of sex determination and sex chromosomes has been a lack of information on the types of sex‐determining mechanisms that occur among different species. This is particularly problematic in groups where most species lack visually heteromorphic sex chromosomes, such as fish, amphibians and reptiles, because cytogenetic analyses will fail to identify the sex chromosomes in these species. We describe the use of restriction site‐associated DNA (RAD) sequencing, or RAD‐seq, to identify sex‐specific molecular markers and subsequently determine whether a species has male or female heterogamety. To test the accuracy of this technique, we examined the lizard Anolis carolinensis. We performed RAD‐seq on seven male and ten female A. carolinensis and found one male‐specific molecular marker. Anolis carolinensis has previously been shown to possess male heterogamety and the recently published A. carolinensis genome facilitated the characterization of the sex‐specific RAD‐seq marker. We validated the male specificity of the new marker using PCR on additional individuals and also found that it is conserved in some other Anolis species. We discuss the utility of using RAD‐seq to identify sex‐determining mechanisms in other species with cryptic or homomorphic sex chromosomes and the implications for the evolution of male heterogamety in Anolis.  相似文献   

10.
Genotyping‐by‐sequencing (GBS) and related methods are increasingly used for studies of non‐model organisms from population genetic to phylogenetic scales. We present GIbPSs, a new genotyping toolkit for the analysis of data from various protocols such as RAD, double‐digest RAD, GBS, and two‐enzyme GBS without a reference genome. GIbPSs can handle paired‐end GBS data and is able to assign reads from both strands of a restriction fragment to the same locus. GIbPSs is most suitable for population genetic and phylogeographic analyses. It avoids genotyping errors due to indel variation by identifying and discarding affected loci. GIbPSs creates a genotype database that offers rich functionality for data filtering and export in numerous formats. We performed comparative analyses of simulated and real GBS data with GIbPSs and another program, pyRAD. This program accounts for indel variation by aligning homologous sequences. GIbPSs performed better than pyRAD in several aspects. It required much less computation time and displayed higher genotyping accuracy. GIbPSs retained smaller numbers of loci overall in analyses of real GBS data. It nevertheless delivered more complete genotype matrices with greater locus overlap between individuals and greater numbers of loci sampled in all individuals.  相似文献   

11.
High‐throughput sequencing has revolutionized population and conservation genetics. RAD sequencing methods, such as 2b‐RAD, can be used on species lacking a reference genome. However, transferring protocols across taxa can potentially lead to poor results. We tested two different IIB enzymes (AlfI and CspCI) on two species with different genome sizes (the loggerhead turtle Caretta caretta and the sharpsnout seabream Diplodus puntazzo) to build a set of guidelines to improve 2b‐RAD protocols on non‐model organisms while optimising costs. Good results were obtained even with degraded samples, showing the value of 2b‐RAD in studies with poor DNA quality. However, library quality was found to be a critical parameter on the number of reads and loci obtained for genotyping. Resampling analyses with different number of reads per individual showed a trade‐off between number of loci and number of reads per sample. The resulting accumulation curves can be used as a tool to calculate the number of sequences per individual needed to reach a mean depth ≥20 reads to acquire good genotyping results. Finally, we demonstrated that selective‐base ligation does not affect genomic differentiation between individuals, indicating that this technique can be used in species with large genome sizes to adjust the number of loci to the study scope, to reduce sequencing costs and to maintain suitable sequencing depth for a reliable genotyping without compromising the results. Here, we provide a set of guidelines to improve 2b‐RAD protocols on non‐model organisms with different genome sizes, helping decision‐making for a reliable and cost‐effective genotyping.  相似文献   

12.
Restriction‐site‐associated DNA sequencing (RAD‐seq) and related methods are revolutionizing the field of population genomics in nonmodel organisms as they allow generating an unprecedented number of single nucleotide polymorphisms (SNPs) even when no genomic information is available. Yet, RAD‐seq data analyses rely on assumptions on nature and number of nucleotide variants present in a single locus, the choice of which may lead to an under‐ or overestimated number of SNPs and/or to incorrectly called genotypes. Using the Atlantic mackerel (Scomber scombrus L.) and a close relative, the Atlantic chub mackerel (Scomber colias), as case study, here we explore the sensitivity of population structure inferences to two crucial aspects in RAD‐seq data analysis: the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides insights into the effects of alternative RAD‐seq data analysis strategies on population structure inferences that are directly applicable to other species.  相似文献   

13.
Predicting the consequences of environmental changes, including human‐mediated climate change on species, requires that we quantify range‐wide patterns of genetic diversity and identify the ecological, environmental, and historical factors that have contributed to it. Here, we generate baseline data on polar bear population structure across most Canadian subpopulations (n = 358) using 13,488 genome‐wide single nucleotide polymorphisms (SNPs) identified with double‐digest restriction site‐associated DNA sequencing (ddRAD). Our ddRAD dataset showed three genetic clusters in the sampled Canadian range, congruent with previous studies based on microsatellites across the same regions; however, due to a lack of sampling in Norwegian Bay, we were unable to confirm the existence of a unique cluster in that subpopulation. These data on the genetic structure of polar bears using SNPs provide a detailed baseline against which future shifts in population structure can be assessed, and opportunities to develop new noninvasive tools for monitoring polar bears across their range.  相似文献   

14.
15.
Harmful algal blooms (HABs), which can be lethal in marine species and cause illness in humans, are increasing worldwide. In the Gulf of Mexico, HABs of Karenia brevis produce neurotoxic brevetoxins that cause large‐scale marine mortality events. The long history of such blooms, combined with the potentially severe effects of exposure, may have produced a strong selective pressure for evolved resistance. Advances in next‐generation sequencing, in particular genotyping‐by‐sequencing, greatly enable the genomic study of such adaptation in natural populations. We used restriction site‐associated DNA (RAD) sequencing to investigate brevetoxicosis resistance in common bottlenose dolphins (Tursiops truncatus). To improve our understanding of the epidemiology and aetiology of brevetoxicosis and the potential for evolved resistance in an upper trophic level predator, we sequenced pools of genomic DNA from dolphins sampled from both coastal and estuarine populations in Florida and during multiple HAB‐associated mortality events. We sequenced 129 594 RAD loci and analysed 7431 single nucleotide polymorphisms (SNPs). The allele frequencies of many of these polymorphic loci differed significantly between live and dead dolphins. Some loci associated with survival showed patterns suggesting a common genetic‐based mechanism of resistance to brevetoxins in bottlenose dolphins along the Gulf coast of Florida, but others suggested regionally specific mechanisms of resistance or reflected differences among HABs. We identified candidate genes that may be the evolutionary target for brevetoxin resistance by searching the dolphin genome for genes adjacent to survival‐associated SNPs.  相似文献   

16.
Recently evolved species typically share genetic variation across their genomes due to incomplete lineage sorting and/or ongoing gene flow. Given only subtle allele frequency differences at most loci and the expectation that divergent selection may affect only a tiny fraction of the genome, distinguishing closely related species based on multi‐locus data requires substantial genomic coverage. In this study, we used ddRAD‐seq to sample the genomes of five recently diverged, New World “mallards” (Anas spp.), a group of dabbling duck species characterized by diagnosable phenotypic differences but minimal genetic differentiation. With increased genomic sampling, we aimed to characterize population structure within this group and identify genomic regions that may have experienced divergent selection during speciation. We analyzed 3,017 autosomal ddRAD‐seq loci and 177 loci from the Z‐chromosome. In contrast to previous studies, the ddRAD‐seq data were sufficient to assign individuals to their respective species or subspecies and to generate estimates of gene flow in a phylogenetic framework. We find limited evidence of contemporary gene flow between the dichromatic mallard and several monochromatic taxa, but find evidence for historical gene flow between some monochromatic species pairs. We conclude that the overall genetic similarity of these taxa likely reflects retained ancestral polymorphism rather than recent and extensive gene flow. Thus, despite recurring cases of hybridization in this group, our results challenge the current dogma predicting the genetic extinction of the New World monochromatic dabbling ducks via introgressive hybridization with mallards. Moreover, ddRAD‐seq data were sufficient to identify previously unknown outlier regions across the Z‐chromosome and several autosomal chromosomes that may have been involved in the diversification of species in this recent radiation.  相似文献   

17.
The identification of thousands of variants across the genomes and their accurate genotyping are crucial for estimating the genetic parameters needed to address a host of molecular ecological and evolutionary questions. With rapid advances of massively parallel high‐throughput sequencing technologies, several methods have recently been developed to access genomewide data on population variation. One of the most successful and widely used techniques relies on the combination of restriction enzymes and sequencing‐by‐synthesis: restriction‐site‐associated DNA sequencing (RADSeq). We developed a new, more time‐ and cost‐efficient double‐digest RAD paired‐end protocol (quaddRAD) that simplifies and speeds up the identification of PCR duplicates and permits large‐scale multiplexing. Assessing its performance on a technical data set, we also applied the quaddRAD method on population samples of a Neotropical cichlid fish lineage (Archocentrus centrarchus) to assess its genetic structure and demographic history. While we identified allopatric interlake genetic divergence, most likely driven by drift, no signature of sympatric divergence was detected. This differs from what has been observed in the clade of Midas cichlids (Amphilophus citrinellus spp.), another cichlid lineage that inhabits the same lakes and shares a similar demographic history, but has evolved into small‐scale adaptive radiations via sympatric speciation. We demonstrate that quaddRAD is a robust and efficient method for genotyping a massive number and widely overlapping set of loci with high accuracy. Furthermore, the results on A. centrarchus open new research avenues providing an ideal system to investigate genome‐level mechanisms that could alter the speciation potential of different but closely related cichlid lineages.  相似文献   

18.
Single nucleotide polymorphisms (SNPs) are essential to the understanding of population genetic variation and diversity. Here, we performed restriction‐site‐associated DNA sequencing (RAD‐seq) on 72 individuals from 13 Chinese indigenous and three introduced chicken breeds. A total of 620 million reads were obtained using an Illumina Hiseq2000 sequencer. An average of 75 587 SNPs were identified from each individual. Further filtering strictly validated 28 895 SNPs candidates for all populations. When compared with the NCBI dbSNP (chicken_9031), 15 404 SNPs were new discoveries. In this study, RAD‐seq was performed for the first time on chickens, implicating the remarkable effectiveness and potential applications on genetic analysis and breeding technique for whole‐genome selection in chicken and other agricultural animals.  相似文献   

19.
We are writing in response to the population and phylogenomics meeting review by Andrews & Luikart ( 2014 ) entitled ‘Recent novel approaches for population genomics data analysis’. Restriction‐site‐associated DNA (RAD) sequencing has become a powerful and useful approach in molecular ecology, with several different published methods now available to molecular ecologists, none of which can be considered the best option in all situations. A&L report that the original RAD protocol of Miller et al. ( 2007 ) and Baird et al. ( 2008 ) is superior to all other RAD variants because putative PCR duplicates can be identified (see Baxter et al. 2011 ), thereby reducing the impact of PCR artefacts on allele frequency estimates (Andrews & Luikart 2014 ). In response, we (i) challenge the assertion that the original RAD protocol minimizes the impact of PCR artefacts relative to that of other RAD protocols, (ii) present additional biases in RADseq that are at least as important as PCR artefacts in selecting a RAD protocol and (iii) highlight the strengths and weaknesses of four different approaches to RADseq which are a representative sample of all RAD variants: the original RAD protocol (mbRAD, Miller et al. 2007 ; Baird et al. 2008 ), double digest RAD (ddRAD, Peterson et al. 2012 ), ezRAD (Toonen et al. 2013 ) and 2bRAD (Wang et al. 2012 ). With an understanding of the strengths and weaknesses of different RAD protocols, researchers can make a more informed decision when selecting a RAD protocol.  相似文献   

20.
For half a century population genetics studies have put type II restriction endonucleases to work. Now, coupled with massively‐parallel, short‐read sequencing, the family of RAD protocols that wields these enzymes has generated vast genetic knowledge from the natural world. Here, we describe the first software natively capable of using paired‐end sequencing to derive short contigs from de novo RAD data. Stacks version 2 employs a de Bruijn graph assembler to build and connect contigs from forward and reverse reads for each de novo RAD locus, which it then uses as a reference for read alignments. The new architecture allows all the individuals in a metapopulation to be considered at the same time as each RAD locus is processed. This enables a Bayesian genotype caller to provide precise SNPs, and a robust algorithm to phase those SNPs into long haplotypes, generating RAD loci that are 400–800 bp in length. To prove its recall and precision, we tested the software with simulated data and compared reference‐aligned and de novo analyses of three empirical data sets. Our study shows that the latest version of Stacks is highly accurate and outperforms other software in assembling and genotyping paired‐end de novo data sets.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号