首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The trade‐offs of using single‐digest vs. double‐digest restriction site‐associated DNA sequencing (RAD‐seq) protocols have been widely discussed. However, no direct empirical comparisons of the two methods have been conducted. Here, we sampled a single population of Gulf pipefish (Syngnathus scovelli) and genotyped 444 individuals using RAD‐seq. Sixty individuals were subjected to single‐digest RAD‐seq (sdRAD‐seq), and the remaining 384 individuals were genotyped using a double‐digest RAD‐seq (ddRAD‐seq) protocol. We analysed the resulting Illumina sequencing data and compared the two genotyping methods when reads were analysed either together or separately. Coverage statistics, observed heterozygosity, and allele frequencies differed significantly between the two protocols, as did the results of selection components analysis. We also performed an in silico digestion of the Gulf pipefish genome and modelled five major sources of bias: PCR duplicates, polymorphic restriction sites, shearing bias, asymmetric sampling (i.e., genotyping fewer individuals with sdRAD‐seq than with ddRAD‐seq) and higher major allele frequencies. This combination of approaches allowed us to determine that polymorphic restriction sites, an asymmetric sampling scheme, mean allele frequencies and to some extent PCR duplicates all contribute to different estimates of allele frequencies between samples genotyped using sdRAD‐seq versus ddRAD‐seq. Our finding that sdRAD‐seq and ddRAD‐seq can result in different allele frequencies has implications for comparisons across studies and techniques that endeavour to identify genomewide signatures of evolutionary processes in natural populations.  相似文献   

2.
In the context of parentage assignment using genomic markers, key issues are genotyping errors and an absence of parent genotypes because of sampling, traceability or genotyping problems. Most likelihood‐based parentage assignment software programs require a priori estimates of genotyping errors and the proportion of missing parents to set up meaningful assignment decision rules. We present here the R package APIS, which can assign offspring to their parents without any prior information other than the offspring and parental genotypes, and a user‐defined, acceptable error rate among assigned offspring. Assignment decision rules use the distributions of average Mendelian transmission probabilities, which enable estimates of the proportion of offspring with missing parental genotypes. APIS has been compared to other software (CERVUS, VITASSIGN), on a real European seabass (Dicentrarchus labrax) single nucleotide polymorphism data set. The type I error rate (false positives) was lower with APIS than with other software, especially when parental genotypes were missing, but the true positive rate was also lower, except when the theoretical exclusion power reached 0.99999. In general, APIS provided assignments that satisfied the user‐set acceptable error rate of 1% or 5%, even when tested on simulated data with high genotyping error rates (1% or 3%) and up to 50% missing sires. Because it uses the observed distribution of Mendelian transmission probabilities, APIS is best suited to assigning parentage when numerous offspring (>200) are genotyped. We have demonstrated that APIS is an easy‐to‐use and reliable software for parentage assignment, even when up to 50% of sires are missing.  相似文献   

3.
Molecular markers produced by next‐generation sequencing (NGS) technologies are revolutionizing genetic research. However, the costs of analysing large numbers of individual genomes remain prohibitive for most population genetics studies. Here, we present results based on mathematical derivations showing that, under many realistic experimental designs, NGS of DNA pools from diploid individuals allows to estimate the allele frequencies at single nucleotide polymorphisms (SNPs) with at least the same accuracy as individual‐based analyses, for considerably lower library construction and sequencing efforts. These findings remain true when taking into account the possibility of substantially unequal contributions of each individual to the final pool of sequence reads. We propose the intuitive notion of effective pool size to account for unequal pooling and derive a Bayesian hierarchical model to estimate this parameter directly from the data. We provide a user‐friendly application assessing the accuracy of allele frequency estimation from both pool‐ and individual‐based NGS population data under various sampling, sequencing depth and experimental error designs. We illustrate our findings with theoretical examples and real data sets corresponding to SNP loci obtained using restriction site–associated DNA (RAD) sequencing in pool‐ and individual‐based experiments carried out on the same population of the pine processionary moth (Thaumetopoea pityocampa). NGS of DNA pools might not be optimal for all types of studies but provides a cost‐effective approach for estimating allele frequencies for very large numbers of SNPs. It thus allows comparison of genome‐wide patterns of genetic variation for large numbers of individuals in multiple populations.  相似文献   

4.
High‐throughput sequencing has revolutionized population and conservation genetics. RAD sequencing methods, such as 2b‐RAD, can be used on species lacking a reference genome. However, transferring protocols across taxa can potentially lead to poor results. We tested two different IIB enzymes (AlfI and CspCI) on two species with different genome sizes (the loggerhead turtle Caretta caretta and the sharpsnout seabream Diplodus puntazzo) to build a set of guidelines to improve 2b‐RAD protocols on non‐model organisms while optimising costs. Good results were obtained even with degraded samples, showing the value of 2b‐RAD in studies with poor DNA quality. However, library quality was found to be a critical parameter on the number of reads and loci obtained for genotyping. Resampling analyses with different number of reads per individual showed a trade‐off between number of loci and number of reads per sample. The resulting accumulation curves can be used as a tool to calculate the number of sequences per individual needed to reach a mean depth ≥20 reads to acquire good genotyping results. Finally, we demonstrated that selective‐base ligation does not affect genomic differentiation between individuals, indicating that this technique can be used in species with large genome sizes to adjust the number of loci to the study scope, to reduce sequencing costs and to maintain suitable sequencing depth for a reliable genotyping without compromising the results. Here, we provide a set of guidelines to improve 2b‐RAD protocols on non‐model organisms with different genome sizes, helping decision‐making for a reliable and cost‐effective genotyping.  相似文献   

5.
Genotypes are frequently used to assess alternative reproductive strategies such as extra‐pair paternity and conspecific brood parasitism in wild populations. However, such analyses are vulnerable to genotyping error or molecular artefacts that can bias results. For example, when using multilocus microsatellite data, a mismatch at a single locus, suggesting the offspring was not directly related to its putative parents, can occur quite commonly even when the offspring is truly related. Some recent studies have advocated an ad‐hoc rule that offspring must differ at more than one locus in order to conclude that they are not directly related. While this reduces the frequency with which true offspring are identified as not directly related young, it also introduces bias in the opposite direction, wherein not directly related young are categorized as true offspring. More importantly, it ignores the additional information on allele frequencies which would reduce overall bias. In this study, we present a novel technique for assessing extra‐pair paternity and conspecific brood parasitism using a likelihood‐based approach in a new version of program cervus . We test the suitability of the technique by applying it to a simulated data set and then present an example to demonstrate its influence on the estimation of alternative reproductive strategies.  相似文献   

6.
Modern genetic parentage methods reveal that alternative reproductive strategies are common in both males and females. Under ideal conditions, genetic methods accurately connect the parents to offspring produced by extra-pair matings or conspecific brood parasitism. However, some breeding systems and sampling scenarios present significant complications for accurate parentage assignment. We used simulated genetic pedigrees to assess the reliability of parentage assignment for a series of challenging sampling regimes that reflect realistic conditions for many brood-parasitic birds: absence of genetic samples from sires, absence of samples from brood parasites and female kin-structured populations. Using 18 microsatellite markers and empirical allele frequencies from two populations of a conspecific brood parasite, the wood duck (Aix sponsa), we simulated brood parasitism and determined maternity using two widely used programs, cervus and colony . Errors in assignment were generally modest for most sampling scenarios but differed by program: cervus suffered from false assignment of parasitic offspring, whereas colony sometimes failed to assign offspring to their known mothers. Notably, colony was able to accurately infer unsampled parents. Reducing the number of markers (nine loci rather than 18) caused the assignment error to slightly worsen with colony but balloon with cervus . One potential error with important biological implications was rare in all cases—few nesting females were incorrectly excluded as the mother of their own offspring, an error that could falsely indicate brood parasitism. We consider the implications of our findings for both a retrospective assessment of previous studies and suggestions for best practices for future studies.  相似文献   

7.
Validation of microsatellite markers for routine horse parentage testing   总被引:4,自引:1,他引:3  
A parallel testing of 4803 routine Quarter Horse parentage cases, using 15 loci of blood group and protein polymorphisms (blood typing) and 11 loci of dinucleotide repeat microsatellites (DNA typing), validated DNA markers for horse pedigree verification. For the 26 loci, taken together, the theoretical effectiveness of detecting incorrect parentage was 99·999%, making it extremely unlikely that false parentage would fail to be recognized. The tests identified incorrect parentage assignment for 95 offspring (2% of cases). Despite fewer loci, DNA typing was as effective as blood typing and, in parentage exclusion cases, provided more systems to substantiate the genetic incompatibility. Five offspring presented potential genetic incompatibilities with their parents in only a single microsatellite system, but the parentage exclusions could not be confirmed with discordant results at additional loci. Two of these five incompatibilities could be explained as consequences of a null allele and three as fragment size increases or decreases (putative mutations). Provided that an exclusion assignment was based on at least two systems of genetic incompatibility, such rare genetic events did not lead to false exclusions. Notwithstanding the near 100% effectiveness estimations for either typing panel alone to identify incorrect parentage, this validation test showed an actual effectiveness of 97·3% for blood typing and 98·2% for DNA typing. The DNA-based test, however, may feasibly achieve higher efficacy than reported here by adding selected systems to the parentage test panel.  相似文献   

8.
Research in evolutionary biology involving nonmodel organisms is rapidly shifting from using traditional molecular markers such as mtDNA and microsatellites to higher throughput SNP genotyping methodologies to address questions in population genetics, phylogenetics and genetic mapping. Restriction site associated DNA sequencing (RAD sequencing or RADseq) has become an established method for SNP genotyping on Illumina sequencing platforms. Here, we developed a protocol and adapters for double‐digest RAD sequencing for Ion Torrent (Life Technologies; Ion Proton, Ion PGM) semiconductor sequencing. We sequenced thirteen genomic libraries of three different nonmodel vertebrate species on Ion Proton with PI chips: Arctic charr Salvelinus alpinus, European whitefish Coregonus lavaretus and common lizard Zootoca vivipara. This resulted in ~962 million single‐end reads overall and a mean of ~74 million reads per library. We filtered the genomic data using Stacks, a bioinformatic tool to process RAD sequencing data. On average, we obtained ~11 000 polymorphic loci per library of 6–30 individuals. We validate our new method by technical and biological replication, by reconstructing phylogenetic relationships, and using a hybrid genetic cross to track genomic variants. Finally, we discuss the differences between using the different sequencing platforms in the context of RAD sequencing, assessing possible advantages and disadvantages. We show that our protocol can be used for Ion semiconductor sequencing platforms for the rapid and cost‐effective generation of variable and reproducible genetic markers.  相似文献   

9.
We demonstrate a genotyping‐by‐sequencing approach to identify homomorphic sex chromosomes and their homolog in a distantly related reference genome, based on noninvasive sampling of wild‐caught individuals, in the moor frog Rana arvalis. Double‐digest RADseq libraries were generated using buccal swabs from 30 males and 21 females from the same population. Search for sex‐limited markers from the unfiltered data set (411 446 RAD tags) was more successful than searches from a filtered data set (33 073 RAD tags) for markers showing sex differences in heterozygosity or in allele frequencies. Altogether, we obtained 292 putatively sex‐linked RAD loci, 98% of which point to male heterogamety. We could map 15 of them to the Xenopus tropicalis genome, all but one on chromosome pair 1, which seems regularly co‐opted for sex determination among amphibians. The most efficient mapping strategy was a three‐step hierarchical approach, where R. arvalis reads were first mapped to a low‐coverage genome of Rana temporaria (17 My divergence), then the R. temporaria scaffolds to the Nanorana parkeri genome (90 My divergence), and finally the N. parkeri scaffolds to the X. tropicalis genome (210 My). We validated our conclusions with PCR primers amplifying part of Dmrt1, a candidate sex determination gene mapping to chromosome 1: a sex‐diagnostic allele was present in all 30 males but in none of the 21 females. Our approach is likely to be productive in many situations where biological samples and/or genomic resources are limited.  相似文献   

10.
Genetic markers are important resources for individual identification and parentage assessment. Although short tandem repeats (STRs) have been the traditional DNA marker, technological advances have led to single nucleotide polymorphisms (SNPs) becoming an attractive alternative. SNPs can be highly multiplexed and automatically scored, which allows for easier standardization and sharing among laboratories. Equine parentage is currently assessed using STRs. We obtained a publicly available SNP dataset of 729 horses representing 32 diverse breeds. A proposed set of 101 SNPs was analyzed for DNA typing suitability. The overall minor allele frequency of the panel was 0.376 (range 0.304–0.419), with per breed probability of identities ranging from 5.6 × 10?35 to 1.86 × 10?42. When one parent was available, exclusion probabilities ranged from 0.9998 to 0.999996, although when both parents were available, all breeds had exclusion probabilities greater than 0.9999999. A set of 388 horses from 35 breeds was genotyped to evaluate marker performance on known families. The set included 107 parent–offspring pairs and 101 full trios. No horses shared identical genotypes across all markers, indicating that the selected set was sufficient for individual identification. All pairwise comparisons were classified using ISAG rules, with one or two excluding markers considered an accepted parent–offspring pair, two or three excluding markers considered doubtful and four or more excluding markers rejecting parentage. The panel had an overall accuracy of 99.9% for identifying true parent–offspring pairs. Our developed marker set is both present on current generation SNP chips and can be highly multiplexed in standalone panels and thus is a promising resource for SNP‐based DNA typing.  相似文献   

11.
Restriction site‐associated DNA sequencing (RAD‐seq) was used to illuminate the genetic relationships among Eriobotrya species. The raw data were filtered, and 221 million clean reads were used for further analysis. A total of 1,983,332 SNPs were obtained from 23 Eriobotrya species and two relative genera. We obtained similar results by neighbor‐joining and maximum likelihood phylogenetic trees. All Eriobotrya plants grouped together into a big clade, and two out‐groups clustered together into a single or separate clade. Chinese and Vietnam accessions were distributed throughout the dendrogram. There was nonsignificant correlation between genotype and geographical distance. However, clustering results were correlated with leaf size to some extent. The Eriobotrya species could be divided into following three groups based on leaf size and phylogenetic analysis: group A and group B comprised of small leaves with <10 cm length except E. stipularis (16.76 cm), and group C can be further divided into two subgroups, which contained medium‐size leaves with a leaf length ranged from 10 to 20 cm and a leaf length bigger than 20 cm.  相似文献   

12.
A major goal of evolutionary biology is to identify the genome‐level targets of natural and sexual selection. With the advent of next‐generation sequencing, whole‐genome selection components analysis provides a promising avenue in the search for loci affected by selection in nature. Here, we implement a genome‐wide selection components analysis in the sex role reversed Gulf pipefish, Syngnathus scovelli. Our approach involves a double‐digest restriction‐site associated DNA sequencing (ddRAD‐seq) technique, applied to adult females, nonpregnant males, pregnant males, and their offspring. An FST comparison of allele frequencies among these groups reveals 47 genomic regions putatively experiencing sexual selection, as well as 468 regions showing a signature of differential viability selection between males and females. A complementary likelihood ratio test identifies similar patterns in the data as the FST analysis. Sexual selection and viability selection both tend to favor the rare alleles in the population. Ultimately, we conclude that genome‐wide selection components analysis can be a useful tool to complement other approaches in the effort to pinpoint genome‐level targets of selection in the wild.  相似文献   

13.
Determining how many and which codominant marker loci are required for accurate parentage assignment is not straightforward because levels of marker polymorphism, linkage, allelic distributions among potential parents and other factors produce differences in the discriminatory power of individual markers and sets of markers. p-loci software identifies the most efficient set of codominant markers for assigning parentage at a user-defined level of success, using either simulated or actual offspring genotypes of known parentage. Simulations can incorporate linkage among markers, mating design and frequencies of null alleles and/or genotyping errors. p-loci is available for windows systems at http://marineresearch.oregonstate.edu/genetics/ploci.htm.  相似文献   

14.
The evolution of locally adapted phenotypes among populations that experience divergent selective pressures is a central mechanism for generating and maintaining biodiversity. Recently, the advent of high‐throughput DNA sequencing technology has provided tools for investigating the genetic basis of this process in natural populations of nonmodel organisms. Kokanee, the freshwater form of sockeye salmon (Oncorhynchus nerka), occurs as two reproductive ecotypes, which differ in spawning habitat (tributaries vs. shorelines); however, outside of the spawning season the two ecotypes co‐occur in many lakes and lack diagnostic morphological characteristics. We used restriction site‐associated DNA (RAD) sequencing to identify 6145 SNPs and genotype kokanee from multiple spawning sites in Okanagan Lake (British Columbia, Canada). Outlier tests revealed 18 loci putatively under divergent selection between ecotypes, all of which exhibited temporally stable allele frequencies within ecotypes. Six outliers were annotated to sequences in the NCBI database, two of which matched genes associated with early development. There was no evidence for neutral genetic differentiation; however, outlier loci demonstrated significant structure with respect to ecotype and had high assignment accuracy in mixed composition simulations. The absence of neutral structure combined with a small number of highly divergent outlier loci is consistent with theoretical predictions for the early stages of ecological divergence. These outlier loci were then applied to a realistic fisheries scenario in which additional RAD sequencing was used to genotype kokanee collected by trawl in Okanagan Lake, providing preliminary evidence that this approach may be an effective tool for conservation and management.  相似文献   

15.
Single nucleotide polymorphisms (SNPs) are essential to the understanding of population genetic variation and diversity. Here, we performed restriction‐site‐associated DNA sequencing (RAD‐seq) on 72 individuals from 13 Chinese indigenous and three introduced chicken breeds. A total of 620 million reads were obtained using an Illumina Hiseq2000 sequencer. An average of 75 587 SNPs were identified from each individual. Further filtering strictly validated 28 895 SNPs candidates for all populations. When compared with the NCBI dbSNP (chicken_9031), 15 404 SNPs were new discoveries. In this study, RAD‐seq was performed for the first time on chickens, implicating the remarkable effectiveness and potential applications on genetic analysis and breeding technique for whole‐genome selection in chicken and other agricultural animals.  相似文献   

16.
Restriction‐site‐associated DNA sequencing (RAD‐seq) and related methods are revolutionizing the field of population genomics in nonmodel organisms as they allow generating an unprecedented number of single nucleotide polymorphisms (SNPs) even when no genomic information is available. Yet, RAD‐seq data analyses rely on assumptions on nature and number of nucleotide variants present in a single locus, the choice of which may lead to an under‐ or overestimated number of SNPs and/or to incorrectly called genotypes. Using the Atlantic mackerel (Scomber scombrus L.) and a close relative, the Atlantic chub mackerel (Scomber colias), as case study, here we explore the sensitivity of population structure inferences to two crucial aspects in RAD‐seq data analysis: the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides insights into the effects of alternative RAD‐seq data analysis strategies on population structure inferences that are directly applicable to other species.  相似文献   

17.
Parentage analysis is a cornerstone of molecular ecology that has delivered fundamental insights into behaviour, ecology and evolution. Microsatellite markers have long been the king of parentage, their hypervariable nature conferring sufficient power to correctly assign offspring to parents. However, microsatellite markers have seen a sharp decline in use with the rise of next‐generation sequencing technologies, especially in the study of population genetics and local adaptation. The time is ripe to review the current state of parentage analysis and see how it stands to be affected by the emergence of next‐generation sequencing approaches. We find that single nucleotide polymorphisms (SNPs), the typical next‐generation sequencing marker, remain underutilized in parentage analysis but are gaining momentum, with 58 SNP‐based parentage analyses published thus far. Many of these papers, particularly the earlier ones, compare the power of SNPs and microsatellites in a parentage context. In virtually every case, SNPs are at least as powerful as microsatellite markers. As few as 100–500 SNPs are sufficient to resolve parentage completely in most situations. We also provide an overview of the analytical programs that are commonly used and compatible with SNP data. As the next‐generation parentage enterprise grows, a reliance on likelihood and Bayesian approaches, as opposed to strict exclusion, will become increasingly important. We discuss some of the caveats surrounding the use of next‐generation sequencing data for parentage analysis and conclude that the future is bright for this important realm of molecular ecology.  相似文献   

18.
Identifying patterns of larval dispersal within marine metapopulations is vital for effective fisheries management, appropriate marine reserve design, and conservation efforts. We employed genetic markers (microsatellites) to determine dispersal patterns in bicolour damselfish (Pomacentridae: Stegastes partitus). Tissue samples of 751 fish were collected in 2004 and 2005 from 11 sites encompassing the Exuma Sound, Bahamas. Bayesian parentage analysis identified two parent–offspring pairs, which is remarkable given the large population sizes and 28 day pelagic larval duration of bicolour damselfish. The two parent–offspring pairs directly documented self‐recruitment at the two northern‐most sites, one of which is a long‐established marine reserve. Principal coordinates analyses of pair‐wise relatedness values further indicated that self‐recruitment was common in all sampled populations. Nevertheless, measures of genetic differentiation (FST) and results from assignment methods suggested high levels of gene flow among populations. Comparisons of heterozygosity and relatedness among samples of adults and recruits indicated spatially and temporally independent sweepstakes events, whereby only a subset of adults successfully contribute to subsequent generations. These results indicate that self‐recruitment and sweepstakes reproduction are the predominant, ecologically‐relevant processes that shape patterns of larval dispersal in this system.  相似文献   

19.
Zoysiagrass (Zoysia spp.), belonging to the genus Zoysia in the subfamily Chloridoideae, is widely used in domestic lawns, sports fields and as forage. We constructed high‐density genetic maps of Zoysia japonica using a restriction site‐associated DNA sequencing (RAD‐Seq) approach and an F1 mapping population derived from a cross between ‘Carrizo’ and ‘El Toro’. Two linkage maps were constructed, one for each of the parents. A map consisting of 2408 RAD markers distributed on 21 linkage groups was constructed for ‘Carrizo’. Another map with 1230 RAD markers mapped on 20 linkage groups was constructed for ‘El Toro’. The average distance between adjacent markers of the two maps was at 0.56 and 1.4 cM, respectively. Comparative genomics analysis was carried out among zoysiagrass, rice and sorghum genomes and a highly conserved collinearity in the gene order was observed among the three genomes. Chromosome collinearity was disrupted at centromeric regions for each chromosome pair between zoysiagrass and sorghum genomes. However, no obvious synteny gaps were observed across the centromeric regions between zoysiagrass and rice genomes. Two homologous chromosomes for each of the 10 sorghum chromosomes were found in the zoysiagrass genome, indicating an allotetraploid origin for zoysiagrass. The reduction of the basic chromosome number from 12 to 10 in chloridoids and panicoids took place via independent single‐step nested chromosome fusion events after the two subfamilies diverged from a common ancestor. The genetic maps will assist in genome sequence assembly, targeted gene isolation and comparative genomic analyses among grasses.  相似文献   

20.
Sequencing pools of individuals rather than individuals separately reduces the costs of estimating allele frequencies at many loci in many populations. Theoretical and empirical studies show that sequencing pools comprising a limited number of individuals (typically fewer than 50) provides reliable allele frequency estimates, provided that the DNA pooling and DNA sequencing steps are carefully controlled. Unequal contributions of different individuals to the DNA pool and the mean and variance in sequencing depth both can affect the standard error of allele frequency estimates. To our knowledge, no study separately investigated the effect of these two factors on allele frequency estimates; so that there is currently no method to a priori estimate the relative importance of unequal individual DNA contributions independently of sequencing depth. We develop a new analytical model for allele frequency estimation that explicitly distinguishes these two effects. Our model shows that the DNA pooling variance in a pooled sequencing experiment depends solely on two factors: the number of individuals within the pool and the coefficient of variation of individual DNA contributions to the pool. We present a new method to experimentally estimate this coefficient of variation when planning a pooled sequencing design where samples are either pooled before or after DNA extraction. Using this analytical and experimental framework, we provide guidelines to optimize the design of pooled sequencing experiments. Finally, we sequence replicated pools of inbred lines of the plant Medicago truncatula and show that the predictions from our model generally hold true when estimating the frequency of known multilocus haplotypes using pooled sequencing.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号