首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Flexibility and low cost make genotyping‐by‐sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI‐MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference‐free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000–11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking.  相似文献   

2.
In a de novo genotyping‐by‐sequencing (GBS) analysis of short, 64‐base tag‐level haplotypes in 4657 accessions of cultivated oat, we discovered 164741 tag‐level (TL) genetic variants containing 241224 SNPs. From this, the marker density of an oat consensus map was increased by the addition of more than 70000 loci. The mapped TL genotypes of a 635‐line diversity panel were used to infer chromosome‐level (CL) haplotype maps. These maps revealed differences in the number and size of haplotype blocks, as well as differences in haplotype diversity between chromosomes and subsets of the diversity panel. We then explored potential benefits of SNP vs. TL vs. CL GBS variants for mapping, high‐resolution genome analysis and genomic selection in oats. A combined genome‐wide association study (GWAS) of heading date from multiple locations using both TL haplotypes and individual SNP markers identified 184 significant associations. A comparative GWAS using TL haplotypes, CL haplotype blocks and their combinations demonstrated the superiority of using TL haplotype markers. Using a principal component‐based genome‐wide scan, genomic regions containing signatures of selection were identified. These regions may contain genes that are responsible for the local adaptation of oats to Northern American conditions. Genomic selection for heading date using TL haplotypes or SNP markers gave comparable and promising prediction accuracies of up to r = 0.74. Genomic selection carried out in an independent calibration and test population for heading date gave promising prediction accuracies that ranged between r = 0.42 and 0.67. In conclusion, TL haplotype GBS‐derived markers facilitate genome analysis and genomic selection in oat.  相似文献   

3.
Establishing the sex of individuals in wild systems can be challenging and often requires genetic testing. Genotyping‐by‐sequencing (GBS) and other reduced‐representation DNA sequencing (RRS) protocols (e.g., RADseq, ddRAD) have enabled the analysis of genetic data on an unprecedented scale. Here, we present a novel approach for the discovery and statistical validation of sex‐specific loci in GBS data sets. We used GBS to genotype 166 New Zealand fur seals (NZFS, Arctocephalus forsteri) of known sex. We retained monomorphic loci as potential sex‐specific markers in the locus discovery phase. We then used (i) a sex‐specific locus threshold (SSLT) to identify significantly male‐specific loci within our data set; and (ii) a significant sex‐assignment threshold (SSAT) to confidently assign sex in silico the presence or absence of significantly male‐specific loci to individuals in our data set treated as unknowns (98.9% accuracy for females; 95.8% for males, estimated via cross‐validation). Furthermore, we assigned sex to 86 individuals of true unknown sex using our SSAT and assessed the effect of SSLT adjustments on these assignments. From 90 verified sex‐specific loci, we developed a panel of three sex‐specific PCR primers that we used to ascertain sex independently of our GBS data, which we show amplify reliably in at least two other pinniped species. Using monomorphic loci normally discarded from large SNP data sets is an effective way to identify robust sex‐linked markers for nonmodel species. Our novel pipeline can be used to identify and statistically validate monomorphic and polymorphic sex‐specific markers across a range of species and RRS data sets.  相似文献   

4.
Species delimitation has seen a paradigm shift as increasing accessibility of genomic‐scale data enables separation of lineages with convergent morphological traits and the merging of recently diverged ecotypes that have distinguishing characteristics. We inferred the process of lineage formation among Australian species in the widespread and highly variable genus Pelargonium by combining phylogenomic and population genomic analyses along with breeding system studies and character analysis. Phylogenomic analysis and population genetic clustering supported seven of the eight currently described species but provided little evidence for differences in genetic structure within the most widely distributed group that containing P. australe. In contrast, morphometric analysis detected three deep lineages within Australian Pelargonium; with P. australe consisting of five previously unrecognized entities occupying separate geographic ranges. The genomic approach enabled elucidation of parallel evolution in some traits formerly used to delineate species, as well as identification of ecotypic morphological differentiation within recognized species. Highly variable morphology and trait convergence each contribute to the discordance between phylogenomic relationships and morphological taxonomy. Data suggest that genetic divergence among species within the Australian Pelargonium may result from allopatric speciation while morphological differentiation within and among species may be more strongly driven by environmental differences.  相似文献   

5.
Ploidy levels sometimes vary among individuals or populations, particularly in plants. When such variation exists, accurate determination of cytotype can inform studies of ecology or trait variation and is required for population genetic analyses. Here, we propose and evaluate a statistical approach for distinguishing low‐level ploidy variants (e.g. diploids, triploids and tetraploids) based on genotyping‐by‐sequencing (GBS) data. The method infers cytotypes based on observed heterozygosity and the ratio of DNA sequences containing different alleles at thousands of heterozygous SNPs (i.e. allelic ratios). Whereas the method does not require prior information on ploidy, a reference set of samples with known ploidy can be included in the analysis if it is available. We explore the power and limitations of this method using simulated data sets and GBS data from natural populations of aspen (Populus tremuloides) known to include both diploid and triploid individuals. The proposed method was able to reliably discriminate among diploids, triploids and tetraploids in simulated data sets, and this was true for different levels of genetic diversity, inbreeding and population structure. Power and accuracy were minimally affected by low coverage (i.e. 2×), but did sometimes suffer when simulated mixtures of diploids, autotetraploids and allotetraploids were analysed. Cytotype assignments based on the proposed method closely matched those from previous microsatellite and flow cytometry data when applied to GBS data from aspen. An R package (gbs2ploidy) implementing the proposed method is available from CRAN.  相似文献   

6.
Analysis of genetic diversity and population structure among Quercus fabri populations is essential for the conservation and utilization of Q. fabri resources. Here, the genetic diversity and structure of 158 individuals from 13 natural populations of Quercus fabri in China were analyzed using genotyping‐by‐sequencing (GBS). A total of 459,564 high‐quality single nucleotide polymorphisms (SNPs) were obtained after filtration for subsequent analysis. Genetic structure analysis revealed that these individuals can be clustered into two groups and the structure can be explained mainly by the geographic barrier, showed gene introgression from coastal to inland areas and high mountains could significantly hinder the mutual introgression of genes. Genetic diversity analysis indicated that the individual differences within groups are greater than the differences between the two groups. These results will help us better understand the genetic backgrounds of Q. fabri.  相似文献   

7.
Genetic relatedness of 24 animals belonging to seven Indian cattle breeds was studied using high throughput genotyping‐by‐sequencing (GBS) markers. GBS produced 93.6 million reads with an average of about 3.9 million reads per animal. A total of 107 488 SNPs were identified in these individuals. When only one SNP per read was considered, a total of 60 261 SNPs representing independent reads were identified with an average SNP‐to‐SNP distance of 45 kb across the bovine reference genome. About 24% of the GBS‐SNP markers were more than 100 kb apart. Of these, 58 322 SNPs mapped to autosomes, 1645 to the X chromosome and 28 to the Y chromosome. The average SNP‐to‐SNP distance on the X chromosome was 91.3 kb, whereas on the Y chromosome it was 1546.4 kb. The minor allele frequency within the Indian cattle varied from 0.103 (Ongole) to 0.177 (Siri), whereas Holstein cattle had the lowest value of 0.089. This is the first application of GBS in cattle of South Asia. The baseline information generated in this study might prompt implementation of GBS in breeding of cattle belonging to this region.  相似文献   

8.
Genotyping‐by‐sequencing (GBS) and related methods are increasingly used for studies of non‐model organisms from population genetic to phylogenetic scales. We present GIbPSs, a new genotyping toolkit for the analysis of data from various protocols such as RAD, double‐digest RAD, GBS, and two‐enzyme GBS without a reference genome. GIbPSs can handle paired‐end GBS data and is able to assign reads from both strands of a restriction fragment to the same locus. GIbPSs is most suitable for population genetic and phylogeographic analyses. It avoids genotyping errors due to indel variation by identifying and discarding affected loci. GIbPSs creates a genotype database that offers rich functionality for data filtering and export in numerous formats. We performed comparative analyses of simulated and real GBS data with GIbPSs and another program, pyRAD. This program accounts for indel variation by aligning homologous sequences. GIbPSs performed better than pyRAD in several aspects. It required much less computation time and displayed higher genotyping accuracy. GIbPSs retained smaller numbers of loci overall in analyses of real GBS data. It nevertheless delivered more complete genotype matrices with greater locus overlap between individuals and greater numbers of loci sampled in all individuals.  相似文献   

9.
Whole‐genome duplications have occurred in the recent ancestors of many plants, fish and amphibians. Signals of these whole‐genome duplications still exist in the form of paralogous loci. Recent advances have allowed reliable identification of paralogs in genotyping‐by‐sequencing (GBS) data such as that generated from restriction‐site‐associated DNA sequencing (RADSeq); however, excluding paralogs from analyses is still routine due to difficulties in genotyping. This exclusion of paralogs may filter a large fraction of loci, including loci that may be adaptively important or informative for population genetic analyses. We present a maximum‐likelihood method for inferring allele dosage in paralogs and assess its accuracy using simulated GBS, empirical RADSeq and amplicon sequencing data from Chinook salmon. We accurately infer allele dosage for some paralogs from a RADSeq data set and show how accuracy is dependent upon both read depth and allele frequency. The amplicon sequencing data set, using RADSeq‐derived markers, achieved sufficient depth to infer allele dosage for all paralogs. This study demonstrates that RADSeq locus discovery combined with amplicon sequencing of targeted loci is an effective method for incorporating paralogs into population genetic analyses.  相似文献   

10.
Whole‐genome duplications have occurred in the recent ancestors of many plants, fish, and amphibians, resulting in a pervasiveness of paralogous loci and the potential for both disomic and tetrasomic inheritance in the same genome. Paralogs can be difficult to reliably genotype and are often excluded from genotyping‐by‐sequencing (GBS) analyses; however, removal requires paralogs to be identified which is difficult without a reference genome. We present a method for identifying paralogs in natural populations by combining two properties of duplicated loci: (i) the expected frequency of heterozygotes exceeds that for singleton loci, and (ii) within heterozygotes, observed read ratios for each allele in GBS data will deviate from the 1:1 expected for singleton (diploid) loci. These deviations are often not apparent within individuals, particularly when sequence coverage is low; but, we postulated that summing allele reads for each locus over all heterozygous individuals in a population would provide sufficient power to detect deviations at those loci. We identified paralogous loci in three species: Chinook salmon (Oncorhynchus tshawytscha) which retains regions with ongoing residual tetrasomy on eight chromosome arms following a recent whole‐genome duplication, mountain barberry (Berberis alpina) which has a large proportion of paralogs that arose through an unknown mechanism, and dusky parrotfish (Scarus niger) which has largely rediploidized following an ancient whole‐genome duplication. Importantly, this approach only requires the genotype and allele‐specific read counts for each individual, information which is readily obtained from most GBS analysis pipelines.  相似文献   

11.
Ongoing hybridization and retained ancestral polymorphism in rapidly radiating lineages could mask recent cladogenetic events. This presents a challenge for the application of molecular phylogenetic methods to resolve differences between closely related taxa. We reanalyzed published genotyping‐by‐sequencing (GBS) data to infer the phylogeny of four species within the Ophrys sphegodes complex, a recently radiated clade of orchids. We used different data filtering approaches to detect different signals contained in the dataset generated by GBS and estimated their effects on maximum likelihood trees, global FST and bootstrap support values. We obtained a maximum likelihood tree with high bootstrap support, separating the species by using a large dataset based on loci shared by at least 30% of accessions. Bootstrap and FST values progressively decreased when filtering for loci shared by a higher number of accessions. However, when filtering more stringently to retain homozygous and organellar loci, we identified two main clades. These clades group individuals independently from their a priori species assignment, but were associated with two organellar haplotype clusters. We infer that a less stringent filtering preferentially selects for rapidly evolving lineage‐specific loci, which might better delimit lineages. In contrast, when using homozygous/organellar DNA loci the signature of a putative hybridization event in the lineage prevails over the most recent phylogenetic signal. These results show that using differing filtering strategies on GBS data could dissect the organellar and nuclear DNA phylogenetic signal and yield novel insights into relationships between closely related species.  相似文献   

12.
13.
Population genetic structure in the marine environment can be influenced by life‐history traits such as developmental mode (biphasic, with distinct adult and larval morphology, and direct development, in which larvae resemble adults) or habitat specificity, as well as geography and selection. Developmental mode is thought to significantly influence dispersal, with direct developers expected to have much lower dispersal potential. However, this prediction can be complicated by the presence of geophysical barriers to dispersal. In this study, we use a panel of 8,020 SNPs to investigate population structure and biogeography over multiple spatial scales for a direct‐developing species, the New Zealand endemic marine isopod Isocladus armatus. Because our sampling range is intersected by two well‐known biogeographic barriers (the East Cape and the Cook Strait), our study provides an opportunity to understand how such barriers influence dispersal in direct developers. On a small spatial scale (20 km), gene flow between locations is extremely high, suggestive of an island model of migration. However, over larger spatial scales (600 km), populations exhibit a clear pattern of isolation‐by‐distance. Our results indicate that I. armatus exhibits significant migration across the hypothesized barriers and suggest that large‐scale ocean currents associated with these locations do not present a barrier to dispersal. Interestingly, we find evidence of a north‐south population genetic break occurring between Māhia and Wellington. While no known geophysical barrier is apparent in this area, it coincides with the location of a proposed border between bioregions. Analysis of loci under selection revealed that both isolation‐by‐distance and adaption may be contributing to the degree of population structure we have observed here. We conclude that developmental life history largely predicts dispersal in the intertidal isopod I. armatus. However, localized biogeographic processes can disrupt this expectation, and this may explain the potential meta‐population detected in the Auckland region.  相似文献   

14.
15.
Blue catfish, Ictalurus furcatus, are valued in the United States as a trophy fishery for their capacity to reach large sizes, sometimes exceeding 45 kg. Additionally, blue catfish × channel catfish (I. punctatus) hybrid food fish production has recently increased the demand for blue catfish broodstock. However, there has been little study of the genetic impacts and interaction of farmed, introduced and stocked populations of blue catfish. We utilized genotyping‐by‐sequencing (GBS) to capture and genotype SNP markers on 190 individuals from five wild and domesticated populations (Mississippi River, Missouri, D&B, Rio Grande and Texas). Stringent filtering of SNP‐calling parameters resulted in 4275 SNP loci represented across all five populations. Population genetics and structure analyses revealed potential shared ancestry and admixture between populations. We utilized the Sequenom MassARRAY to validate two multiplex panels of SNPs selected from the GBS data. Selection criteria included SNPs shared between populations, SNPs specific to populations, number of reads per individual and number of individuals genotyped by GBS. Putative SNPs were validated in the discovery population and in two additional populations not used in the GBS analysis. A total of 64 SNPs were genotyped successfully in 191 individuals from nine populations. Our results should guide the development of highly informative, flexible genotyping multiplexes for blue catfish from the larger GBS SNP set as well as provide an example of a rapid, low‐cost approach to generate and genotype informative marker loci in aquatic species with minimal previous genetic information.  相似文献   

16.
The gene responsible for testis induction in normal male mammals is the Y‐linked Sry. However, there is increasing evidence that other genes may have testis‐determining properties. In XX sex reversal (XXSR), testis tissue develops in the absence of the Y chromosome. Previous polymerase chain reaction (PCR) assays indicated that autosomal recessive XXSR in the American cocker spaniel is Sry‐negative. In this study, genomic DNA from the breeding colony of American cocker spaniels and from privately owned purebred dogs were tested by PCR using canine primers for the Sry HMG box and by Southern blots probed with the complete canine Sry coding sequence. Sry was not detected by either method in genomic DNA of affected American cocker spaniels or in the majority (20/21) of affected privately owned purebred dogs. These results confirm that the autosomal recessive form of XXSR in the American cocker spaniel is Sry‐negative. In combination with previous studies, this indicates that Sry‐negative XXSR occurs in at least 15 dog breeds. The canine disorder may be genetically heterogeneous, potentially with a different mutation in each breed, and may provide several models for human Sry‐negative XXSR. A comparative approach to sex determination should be informative in defining the genetic and cellular mechanisms that are common to all mammals. Mol. Reprod. Dev. 53:266–273, 1999. © 1999 Wiley‐Liss, Inc.  相似文献   

17.
Tony Gamble 《Molecular ecology》2016,25(10):2114-2116
Next‐generation sequencing methods have initiated a revolution in molecular ecology and evolution (Tautz et al. 2010 ). Among the most impressive of these sequencing innovations is restriction site‐associated DNA sequencing or RAD‐seq (Baird et al. 2008 ; Andrews et al. 2016 ). RAD‐seq uses the Illumina sequencing platform to sequence fragments of DNA cut by a specific restriction enzyme and can generate tens of thousands of molecular genetic markers for analysis. One of the many uses of RAD‐seq data has been to identify sex‐specific genetic markers, markers found in one sex but not the other (Baxter et al. 2011 ; Gamble & Zarkower 2014 ). Sex‐specific markers are a powerful tool for biologists. At their most basic, they can be used to identify the sex of an individual via PCR. This is useful in cases where a species lacks obvious sexual dimorphism at some or all life history stages. For example, such tests have been important for studying sex differences in life history (Sheldon 1998 ; Mossman & Waser 1999 ), the management and breeding of endangered species (Taberlet et al. 1993 ; Griffiths & Tiwari 1995 ; Robertson et al. 2006 ) and sexing embryonic material (Hacker et al. 1995 ; Smith et al. 1999 ). Furthermore, sex‐specific markers allow recognition of the sex chromosome system in cases where standard cytogenetic methods fail (Charlesworth & Mank 2010 ; Gamble & Zarkower 2014 ). Thus, species with male‐specific markers have male heterogamety (XY) while species with female‐specific markers have female heterogamety (ZW). In this issue, Fowler & Buonaccorsi ( 2016 ) illustrate the ease by which RAD‐seq data can generate sex‐specific genetic markers in rockfish (Sebastes). Moreover, by examining RAD‐seq data from two closely related rockfish species, Sebastes chrysomelas and Sebastes carnatus (Fig.  1 ), Fowler & Buonaccorsi ( 2016 ) uncover shared sex‐specific markers and a conserved sex chromosome system.  相似文献   

18.
There has been remarkably little attention to using the high resolution provided by genotyping‐by‐sequencing (i.e., RADseq and similar methods) for assessing relatedness in wildlife populations. A major hurdle is the genotyping error, especially allelic dropout, often found in this type of data that could lead to downward‐biased, yet precise, estimates of relatedness. Here, we assess the applicability of genotyping‐by‐sequencing for relatedness inferences given its relatively high genotyping error rate. Individuals of known relatedness were simulated under genotyping error, allelic dropout and missing data scenarios based on an empirical ddRAD data set, and their true relatedness was compared to that estimated by seven relatedness estimators. We found that an estimator chosen through such analyses can circumvent the influence of genotyping error, with the estimator of Ritland (Genetics Research, 67, 175) shown to be unaffected by allelic dropout and to be the most accurate when there is genotyping error. We also found that the choice of estimator should not rely solely on the strength of correlation between estimated and true relatedness as a strong correlation does not necessarily mean estimates are close to true relatedness. We also demonstrated how even a large SNP data set with genotyping error (allelic dropout or otherwise) or missing data still performs better than a perfectly genotyped microsatellite data set of tens of markers. The simulation‐based approach used here can be easily implemented by others on their own genotyping‐by‐sequencing data sets to confirm the most appropriate and powerful estimator for their data.  相似文献   

19.
Species delimitation requires an assessment of varied traits that can contribute to reproductive isolation, as well as of the permanence of evolutionary differentiation among closely related lineages. Integrative taxonomy, including the combination of genome‐wide molecular data with ecological data, offers an effective approach to this issue. We use genotyping‐by‐sequencing together with a review of ecological divergence to assess the traditionally recognized species status of three closely related members of the spruce budworm species complex, Choristoneura fumiferana (Clemens), C. occidentalis Freeman (=C. freemani Razowski) and C. biennis Freeman, each of which is a major defoliator of conifer forests. We sampled a broad region of overlap between these three taxa in Alberta and British Columbia (Canada) where potential for gene flow provides a strong test of the durability of divergence among lineages. A total of 2218 single nucleotide polymorphisms (SNPs) were assayed, and patterns of differentiation were evaluated under the biological, ecological, genotypic cluster and phylogenetic species concepts. Choristoneura fumiferana was genetically distinct with substantial barriers to genetic exchange with C. occidentalis and C. biennis. Conversely, divergence between C. occidentalis and C. biennis was limited to a small subset of outlier loci and was within the range observed within any one of the taxa. Considering both population genetic and ecological patterns of divergence, C. fumiferana should continue to be recognized as a distinct species, and C. biennis ( syn.n. ) should be treated as a subspecies (C. occidentalis biennis Freeman, 1967) of C. occidentalis, thereby automatically establishing the nominate name C. occidentalis occidentalis Freeman, 1967 for univoltine populations of this species.  相似文献   

20.
Batesian mimicry is a striking example of Darwinian evolution, in which a mimetic species resembles toxic or unpalatable model species, thereby receiving protection from predators. In some species exhibiting Batesian mimicry, nonmimetic individuals coexist as polymorphism in the same population despite the benefits of mimicry. In a previous study, we proposed that the abundance of mimics is limited by that of the models, leading to polymorphic Batesian mimicry in the swallowtail butterfly, Papilio polytes, on the Ryukyu Islands in Japan. We found that their mimic ratios (MRs), which varied among the Islands, were explained by the model abundance of each habitat, rather than isolation by distance or phylogenetic constraint based on the mitochondrial DNA (mtDNA) analysis. In the present study, this possibility was reexamined based on hundreds of nuclear single nucleotide polymorphisms (SNPs) of 93 P. polytes individuals from five Islands of the Ryukyus. We found that the population genetic and phylogenetic structures of P. polytes largely corresponded to the geographic arrangement of the habitat Islands, and the genetic distances among island populations show significant correlation with the geographic distances, which was not evident by the mtDNA‐based analysis. A partial Mantel test controlling for the present SNP‐based genetic distances revealed that the MRs of P. polytes were strongly correlated with the model abundance of each island, implying that negative frequency‐dependent selection interacting with model species shaped and maintained the mimetic polymorphism. Taken together, our results support the possibility that predation pressure, not isolation by distance or other neutral factors, is a major driving force of evolution of the Batesian mimicry in P. polytes from the Ryukyus.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号