首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The implications of transitioning to single nucleotide polymorphism (SNPs) from microsatellite markers (MSs) have been investigated in a number of population genetics studies, but the effect of genomic location on the amount of information each type of marker reveals has not been explored in detail. We developed novel SNP markers flanking 1 kb regions of 13 genic (within gene or <1 kb away from gene) and 13 nongenic (>10 kb from annotated gene) MSs in the threespine stickleback genome to obtain comparable data for both types of markers. We analysed patterns of genetic diversity and divergence on various geographic scales after converting the SNP loci within each genomic region into haplotypes. Marker type (SNP haplotype or MS) and location (genic or nongenic) significantly affected most estimates of population diversity and divergence. Between‐lineage divergence was significantly higher in SNP haplotypes (genic and nongenic), however, within‐lineage divergence was similar between marker types. Most divergence and diversity measures were uncorrelated between markers, except for population differentiation which was correlated between MSs and SNP haplotypes (both genic and nongenic). Broad‐scale population structure and assignment were similarly resolved by both marker types, however, only the MSs were able to delimit fine‐scale population structuring, particularly when genic and nongenic markers were combined. These results demonstrate that estimates of genetic variability and differentiation among populations can be strongly influenced by marker type, their genomic location in relation to genes and by the interaction of these two factors. This highlights the importance of having an awareness of the inherent strengths and limitations associated with different molecular tools to select the most appropriate methods for accurately addressing various ecological and evolutionary questions.  相似文献   

2.
3.
Explicitly fitting effects for major genes or QTL that account for a large percentage of variation in a whole genomic prediction model may increase prediction accuracy. This study compared approaches to account for a major effect of an F94L variant in the MSTN gene within the genomic prediction using bovine whole‐genomic SNP markers. Among the beef cattle breeds, Limousin have been known to have an F94L variant that is not present in Angus. The reference population in this study consisted of 3060 beef cattle including pure‐bred Limousin (PL), cross‐bred Limousin with Angus (LF) and pure‐bred Angus, genotyped using a BovineSNP50 BeadChip and directly for the MSTN‐F94L variant. We compared prediction accuracies in PL animals using the three datasets from only the PL population, admixed PL and LF (AL) or multibreed analysis using all of the PL, LF and Angus (MB) population according to four‐fold cross‐validation after K‐means clustering. The MSTN‐F94L variant was the most strongly associated with five traits (birth weight, calving ease direct, milk, weaning weight and yield grade) among the 13 measured traits in PL and AL populations. Fitting the MSTN‐F94L variant as a random effect, the genomic prediction accuracies for birth weight increased by 2.7% in PL, by 2.2% in AL and by 3.2% in MB. Prediction accuracies for five traits increased in the MB analysis. Fitting MSTN‐F94L as a fixed effect in PL, AL and MB analyses resulted in increased prediction accuracy in PL for eight traits. Prediction accuracies can be improved by including a causal variant in genomic evaluation compared with simply using whole‐genome SNP markers. Fitting the causal variant as a fixed effect along with markers fitted as random effects resulted in greater prediction accuracies for most traits. Causal variants should be genotyped along with SNP markers.  相似文献   

4.
Genomic selection uses total breeding values for juvenile animals, predicted from a large number of estimated marker haplotype effects across the whole genome. In this study the accuracy of predicting breeding values is compared for four different models including a large number of markers, at different marker densities for traits with heritabilities of 50 and 10%. The models estimated the effect of (1) each single-marker allele [single-nucleotide polymorphism (SNP)1], (2) haplotypes constructed from two adjacent marker alleles (SNP2), and (3) haplotypes constructed from 2 or 10 markers, including the covariance between haplotypes by combining linkage disequilibrium and linkage analysis (HAP_IBD2 and HAP_IBD10). Between 119 and 2343 polymorphic SNPs were simulated on a 3-M genome. For the trait with a heritability of 10%, the differences between models were small and none of them yielded the highest accuracies across all marker densities. For the trait with a heritability of 50%, the HAP_IBD10 model yielded the highest accuracies of estimated total breeding values for juvenile and phenotyped animals at all marker densities. It was concluded that genomic selection is considerably more accurate than traditional selection, especially for a low-heritability trait.  相似文献   

5.
Extensive genomic resources are available in the model legume Medicago truncatula. Here, we present the discovery and design of the first array of single‐nucleotide polymorphism (SNP) markers in M. truncatula through large‐scale Sanger resequencing of genomic fragments spanning the genome, in a diverse panel of 16 M. truncatula accessions. Both anonymous fragments and fragments targeting candidate genes for flowering phenology and symbiosis were surveyed for nucleotide variation in almost 230 kb of unique genomic regions. A set of 384 SNP markers was designed for an Illumina's GoldenGate assay, genotyped on a collection of 192 inbred lines (CC192) representing the geographical range of the species and used to survey the diversity of two natural populations. Finally, 86% of the tested SNPs were of high quality and exhibited polymorphism in the CC192 collection. Even at the population level, we detected polymorphism for more than 50% of the selected SNPs. Analysis of the allele frequency spectrum in the CC192 showed a reduced ascertainment bias, mostly limited to very rare alleles (frequency <0.01). The substantial polymorphism detected at the species and population levels, the high marker quality and the potential to survey large samples of individuals make this set of SNP markers a valuable tool to improve our understanding of the effect of demographic and selective factors that shape the natural genetic diversity within the selfing species Medicago truncatula.  相似文献   

6.
Flexibility and low cost make genotyping‐by‐sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI‐MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference‐free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000–11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking.  相似文献   

7.
To improve the efficiency of breeding of Miscanthus for biomass yield, there is a need to develop genomics‐assisted selection for this long‐lived perennial crop by relating genotype to phenotype and breeding value across a broad range of environments. We present the first genome‐wide association (GWA) and genomic prediction study of Miscanthus that utilizes multilocation phenotypic data. A panel of 568 Miscanthus sinensis accessions was genotyped with 46,177 single nucleotide polymorphisms (SNPs) and evaluated at one subtropical and five temperate locations over 3 years for biomass yield and 14 yield‐component traits. GWA and genomic prediction were performed separately for different years of data in order to assess reproducibility. The analyses were also performed for individual field trial locations, as well as combined phenotypic data across groups of locations. GWA analyses identified 27 significant SNPs for yield, and a total of 504 associations across 298 unique SNPs across all traits, sites, and years. For yield, the greatest number of significant SNPs was identified by combining phenotypic data across all six locations. For some of the other yield‐component traits, greater numbers of significant SNPs were obtained from single site data, although the number of significant SNPs varied greatly from site to site. Candidate genes were identified. Accounting for population structure, genomic prediction accuracies for biomass yield ranged from 0.31 to 0.35 across five northern sites and from 0.13 to 0.18 for the subtropical location, depending on the estimation method. Genomic prediction accuracies of all traits were similar for single‐location and multilocation data, suggesting that genomic selection will be useful for breeding broadly adapted M. sinensis as well as M. sinensis optimized for specific climates. All of our data, including DNA sequences flanking each SNP, are publicly available. By facilitating genomic selection in M. sinensis and Miscanthus × giganteus, our results will accelerate the breeding of these species for biomass in diverse environments.  相似文献   

8.
We have developed a software analysis package, HapScope, which includes a comprehensive analysis pipeline and a sophisticated visualization tool for analyzing functionally annotated haplotypes. The HapScope analysis pipeline supports: (i) computational haplotype construction with an expectation-maximization or Bayesian statistical algorithm; (ii) SNP classification by protein coding change, homology to model organisms or putative regulatory regions; and (iii) minimum SNP subset selection by either a Brute Force Algorithm or a Greedy Partition Algorithm. The HapScope viewer displays genomic structure with haplotype information in an integrated environment, providing eight alternative views for assessing genetic and functional correlation. It has a user-friendly interface for: (i) haplotype block visualization; (ii) SNP subset selection; (iii) haplotype consolidation with subset SNP markers; (iv) incorporation of both experimentally determined haplotypes and computational results; and (v) data export for additional analysis. Comparison of haplotypes constructed by the statistical algorithms with those determined experimentally shows variation in haplotype prediction accuracies in genomic regions with different levels of nucleotide diversity. We have applied HapScope in analyzing haplotypes for candidate genes and genomic regions with extensive SNP and genotype data. We envision that the systematic approach of integrating functional genomic analysis with population haplotypes, supported by HapScope, will greatly facilitate current genetic disease research.  相似文献   

9.
Whole genome resequencing of 51 Populus nigra (L.) individuals from across Western Europe was performed using Illumina platforms. A total number of 1 878 727 SNPs distributed along the P. nigra reference sequence were identified. The SNP calling accuracy was validated with Sanger sequencing. SNPs were selected within 14 previously identified QTL regions, 2916 expressional candidate genes related to rust resistance, wood properties, water‐use efficiency and bud phenology and 1732 genes randomly spread across the genome. Over 10 000 SNPs were selected for the construction of a 12k Infinium Bead‐Chip array dedicated to association mapping. The SNP genotyping assay was performed with 888 P. nigra individuals. The genotyping success rate was 91%. Our high success rate was due to the discovery panel design and the stringent parameters applied for SNP calling and selection. In the same set of P. nigra genotypes, linkage disequilibrium throughout the genome decayed on average within 5–7 kb to half of its maximum value. As an application test, ADMIXTURE analysis was performed with a selection of 600 SNPs spread throughout the genome and 706 individuals collected along 12 river basins. The admixture pattern was consistent with genetic diversity revealed by neutral markers and the geographical distribution of the populations. These newly developed SNP resources and genotyping array provide a valuable tool for population genetic studies and identification of QTLs through natural‐population based genetic association studies in P. nigra.  相似文献   

10.
Advances in next-generation sequencing offer high-throughput and cost-effective genotyping alternatives, including genotyping-by-sequencing (GBS). Results have shown that this methodology is efficient for genotyping a variety of species, including those with complex genomes. To assess the utility of GBS in cultivated hexaploid oat (Avena sativa L.), seven bi-parental mapping populations and diverse inbred lines from breeding programs around the world were studied. We examined technical factors that influence GBS SNP calls, established a workflow that combines two bioinformatics pipelines for GBS SNP calling, and provided a nomenclature for oat GBS loci. The high-throughput GBS system enabled us to place 45,117 loci on an oat consensus map, thus establishing a positional reference for further genomic studies. Using the diversity lines, we estimated that a minimum density of one marker per 2 to 2.8 cM would be required for genome-wide association studies (GWAS), and GBS markers met this density requirement in most chromosome regions. We also demonstrated the utility of GBS in additional diagnostic applications related to oat breeding. We conclude that GBS is a powerful and useful approach, which will have many additional applications in oat breeding and genomic studies.  相似文献   

11.
Introgression of genomic variation between and within related crop species is a significant evolutionary approach for population differentiation, genome reorganization and trait improvement. Using the Illumina Infinium Brassica 60K SNP array, we investigated genomic changes in a panel of advanced generation new‐type Brassica napus breeding lines developed from hundreds of interspecific crosses between 122 Brassica rapa and 74 Brassica carinata accessions, and compared them with representative accessions of their three parental species. The new‐type B. napus population presented rich genetic diversity and abundant novel genomic alterations, consisting of introgressions from B. rapa and B. carinata, novel allelic combinations, reconstructed linkage disequilibrium patterns and haplotype blocks, and frequent deletions and duplications (nonrandomly distributed), particularly in the C subgenome. After a much shorter, but very intensive, selection history compared to traditional B. napus, a total of 15 genomic regions with strong selective sweeps and 112 genomic regions with putative signals of selective sweeps were identified. Some of these regions were associated with important agronomic traits that were selected for during the breeding process, while others were potentially associated with restoration of genome stability and fertility after interspecific hybridization. Our results demonstrate how a novel method for population‐based crop genetic improvement can lead to rapid adaptation, restoration of genome stability and positive responses to artificial selection.  相似文献   

12.
Blue catfish, Ictalurus furcatus, are valued in the United States as a trophy fishery for their capacity to reach large sizes, sometimes exceeding 45 kg. Additionally, blue catfish × channel catfish (I. punctatus) hybrid food fish production has recently increased the demand for blue catfish broodstock. However, there has been little study of the genetic impacts and interaction of farmed, introduced and stocked populations of blue catfish. We utilized genotyping‐by‐sequencing (GBS) to capture and genotype SNP markers on 190 individuals from five wild and domesticated populations (Mississippi River, Missouri, D&B, Rio Grande and Texas). Stringent filtering of SNP‐calling parameters resulted in 4275 SNP loci represented across all five populations. Population genetics and structure analyses revealed potential shared ancestry and admixture between populations. We utilized the Sequenom MassARRAY to validate two multiplex panels of SNPs selected from the GBS data. Selection criteria included SNPs shared between populations, SNPs specific to populations, number of reads per individual and number of individuals genotyped by GBS. Putative SNPs were validated in the discovery population and in two additional populations not used in the GBS analysis. A total of 64 SNPs were genotyped successfully in 191 individuals from nine populations. Our results should guide the development of highly informative, flexible genotyping multiplexes for blue catfish from the larger GBS SNP set as well as provide an example of a rapid, low‐cost approach to generate and genotype informative marker loci in aquatic species with minimal previous genetic information.  相似文献   

13.
The identification of genetic markers linked to genes of agronomic importance is a major aim of crop research and breeding programmes. Here, we identify markers for Yr15, a major disease resistance gene for wheat yellow rust, using a segregating F2 population. After phenotyping, we implemented RNA sequencing (RNA‐Seq) of bulked pools to identify single‐nucleotide polymorphisms (SNP) associated with Yr15. Over 27 000 genes with SNPs were identified between the parents, and then classified based on the results from the sequenced bulks. We calculated the bulk frequency ratio (BFR) of SNPs between resistant and susceptible bulks, selecting those showing sixfold enrichment/depletion in the corresponding bulks (BFR > 6). Using additional filtering criteria, we reduced the number of genes with a putative SNP to 175. The 35 SNPs with the highest BFR values were converted into genome‐specific KASP assays using an automated bioinformatics pipeline (PolyMarker) which circumvents the limitations associated with the polyploid wheat genome. Twenty‐eight assays were polymorphic of which 22 (63%) mapped in the same linkage group as Yr15. Using these markers, we mapped Yr15 to a 0.77‐cM interval. The three most closely linked SNPs were tested across varieties and breeding lines representing UK elite germplasm. Two flanking markers were diagnostic in over 99% of lines tested, thus providing a reliable haplotype for marker‐assisted selection in these breeding programmes. Our results demonstrate that the proposed methodology can be applied in polyploid F2 populations to generate high‐resolution genetic maps across target intervals.  相似文献   

14.
With the advent of next generation sequencing, new avenues have opened to study genomics in wild populations of non‐model species. Here, we describe a successful approach to a genome‐wide medium density Single Nucleotide Polymorphism (SNP) panel in a non‐model species, the house sparrow (Passer domesticus), through the development of a 10 K Illumina iSelect HD BeadChip. Genomic DNA and cDNA derived from six individuals were sequenced on a 454 GS FLX system and generated a total of 1.2 million sequences, in which SNPs were detected. As no reference genome exists for the house sparrow, we used the zebra finch (Taeniopygia guttata) reference genome to determine the most likely position of each SNP. The 10 000 SNPs on the SNP‐chip were selected to be distributed evenly across 31 chromosomes, giving on average one SNP per 100 000 bp. The SNP‐chip was screened across 1968 individual house sparrows from four island populations. Of the original 10 000 SNPs, 7413 were found to be variable, and 99% of these SNPs were successfully called in at least 93% of all individuals. We used the SNP‐chip to demonstrate the ability of such genome‐wide marker data to detect population sub‐division, and compared these results to similar analyses using microsatellites. The SNP‐chip will be used to map Quantitative Trait Loci (QTL) for fitness‐related phenotypic traits in natural populations.  相似文献   

15.
With the access to draft genome sequence assemblies and whole‐genome resequencing data from population samples, molecular ecology studies will be able to take truly genome‐wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single‐nucleotide polymorphism (SNP)s in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10‐fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later‐generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system.  相似文献   

16.
Soya bean is a major source of edible oil and protein for human consumption as well as animal feed. Understanding the genetic basis of different traits in soya bean will provide important insights for improving breeding strategies for this crop. A genome‐wide association study (GWAS) was conducted to accelerate molecular breeding for the improvement of agronomic traits in soya bean. A genotyping‐by‐sequencing (GBS) approach was used to provide dense genome‐wide marker coverage (>47 000 SNPs) for a panel of 304 short‐season soya bean lines. A subset of 139 lines, representative of the diversity among these, was characterized phenotypically for eight traits under six environments (3 sites × 2 years). Marker coverage proved sufficient to ensure highly significant associations between the genes known to control simple traits (flower, hilum and pubescence colour) and flanking SNPs. Between one and eight genomic loci associated with more complex traits (maturity, plant height, seed weight, seed oil and protein) were also identified. Importantly, most of these GWAS loci were located within genomic regions identified by previously reported quantitative trait locus (QTL) for these traits. In some cases, the reported QTLs were also successfully validated by additional QTL mapping in a biparental population. This study demonstrates that integrating GBS and GWAS can be used as a powerful complementary approach to classical biparental mapping for dissecting complex traits in soya bean.  相似文献   

17.
Crop wild relatives (CWR) provide an important source of allelic diversity for any given crop plant species for counteracting the erosion of genetic diversity caused by domestication and elite breeding bottlenecks. Hordeum bulbosum L. is representing the secondary gene pool of the genus Hordeum. It has been used as a source of genetic introgressions for improving elite barley germplasm (Hordeum vulgare L.). However, genetic introgressions from Hbulbosum have yet not been broadly applied, due to a lack of suitable molecular tools for locating, characterizing, and decreasing by recombination and marker‐assisted backcrossing the size of introgressed segments. We applied next‐generation sequencing (NGS) based strategies for unlocking genetic diversity of three diploid introgression lines of cultivated barley containing chromosomal segments of its close relative H. bulbosum. Firstly, exome capture‐based (re)‐sequencing revealed large numbers of single nucleotide polymorphisms (SNPs) enabling the precise allocation of H. bulbosum introgressions. This SNP resource was further exploited by designing a custom multiplex SNP genotyping assay. Secondly, two‐enzyme‐based genotyping‐by‐sequencing (GBS) was employed to allocate the introgressed H. bulbosum segments and to genotype a mapping population. Both methods provided fast and reliable detection and mapping of the introgressed segments and enabled the identification of recombinant plants. Thus, the utilization of H. bulbosum as a resource of natural genetic diversity in barley crop improvement will be greatly facilitated by these tools in the future.  相似文献   

18.
19.
Estimating the evolutionary potential of quantitative traits and reliably predicting responses to selection in wild populations are important challenges in evolutionary biology. The genomic revolution has opened up opportunities for measuring relatedness among individuals with precision, enabling pedigree‐free estimation of trait heritabilities in wild populations. However, until now, most quantitative genetic studies based on a genomic relatedness matrix (GRM) have focused on long‐term monitored populations for which traditional pedigrees were also available, and have often had access to knowledge of genome sequence and variability. Here, we investigated the potential of RAD‐sequencing for estimating heritability in a free‐ranging roe deer (Capreolous capreolus) population for which no prior genomic resources were available. We propose a step‐by‐step analytical framework to optimize the quality and quantity of the genomic data and explore the impact of the single nucleotide polymorphism (SNP) calling and filtering processes on the GRM structure and GRM‐based heritability estimates. As expected, our results show that sequence coverage strongly affects the number of recovered loci, the genotyping error rate and the amount of missing data. Ultimately, this had little effect on heritability estimates and their standard errors, provided that the GRM was built from a minimum number of loci (above 7,000). Genomic relatedness matrix‐based heritability estimates thus appear robust to a moderate level of genotyping errors in the SNP data set. We also showed that quality filters, such as the removal of low‐frequency variants, affect the relatedness structure of the GRM, generating lower h2 estimates. Our work illustrates the huge potential of RAD‐sequencing for estimating GRM‐based heritability in virtually any natural population.  相似文献   

20.
In the last decade, the revolution in sequencing technologies has deeply impacted crop genotyping practice. New methods allowing rapid, high‐throughput genotyping of entire crop populations have proliferated and opened the door to wider use of molecular tools in plant breeding. These new genotyping‐by‐sequencing (GBS) methods include over a dozen reduced‐representation sequencing (RRS) approaches and at least four whole‐genome resequencing (WGR) approaches. The diversity of methods available, each often producing different types of data at different cost, can make selection of the best‐suited method seem a daunting task. We review the most common genotyping methods used today and compare their suitability for linkage mapping, genomewide association studies (GWAS), marker‐assisted and genomic selection and genome assembly and improvement in crops with various genome sizes and complexity. Furthermore, we give an outline of bioinformatics tools for analysis of genotyping data. WGR is well suited to genotyping biparental cross populations with complex, small‐ to moderate‐sized genomes and provides the lowest cost per marker data point. RRS approaches differ in their suitability for various tasks, but demonstrate similar costs per marker data point. These approaches are generally better suited for de novo applications and more cost‐effective when genotyping populations with large genomes or high heterozygosity. We expect that although RRS approaches will remain the most cost‐effective for some time, WGR will become more widespread for crop genotyping as sequencing costs continue to decrease.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号