首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
 Strawberry genotypes selected for superior fruit yield or chosen at random from first-generation self, full-sib, and half-sib populations were crossed to provide second-generation inbred progenies and composite cross-fertilized control populations. Mean yields for inbred offspring from crosses among selected parents exceeded those from the offspring of unselected parents by 87%, 23%, and 37% for self, full-sib, and half-sib populations, respectively; yields for offspring from unrelated crosses among selected parents were 54% larger than those for crosses among unselected parents. Selection for yield also resulted in significant correlated response for fruit number and plant diameter. Mean yields for second-generation half-sib and full-sib offspring from selected parents were greater than those for offspring from the unselected but non-inbred control population. This suggests that selection can be a powerful force in counteracting most of the inbreeding depression expected in cross-fertilized strawberry breeding programs. Selection treatment× inbreeding rate interactions were non-significant for all traits; thus, selection among partially inbred offspring did not have a large effect on the rate of genetic progress. Differential realized selection intensity among individuals with differing levels of homozygosity accumulated due to inbreeding is suggested as the most likely explanation for the absence of association between pedigree inbreeding coefficients and cross performance detected previously in strawberry. Received: 21 July 1996 / Accepted: 7 March 1997  相似文献   

2.
The prediction accuracies of genomic selection depend on several factors, including the genetic architecture of target traits, the number of traits considered at a given time, and the statistical models. Here, we assessed the potential of single-trait (ST) and multi-trait (MT) genomic prediction models for durum wheat on yield and quality traits using a breeding panel (BP) of 170 varieties and advanced breeding lines, and a doubled-haploid (DH) population of 154 lines. The two populations were genotyped with the Infinium iSelect 90K SNP assay and phenotyped for various traits. Six ST-GS models (RR-BLUP, G-BLUP, BayesA, BayesB, Bayesian LASSO, and RKHS) and three MT prediction approaches (MT-BayesA, MT-Matrix, and MT-SI approaches which use economic selection index as a trait value) were applied for predicting yield, protein content, gluten index, and alveograph measures. The ST prediction accuracies ranged from 0.5 to 0.8 for the various traits and models and revealed comparable prediction accuracies for most of the traits in both populations, except BayesA and BayesB, which better predicted gluten index, tenacity, and strength in the DH population. The MT-GS models were more accurate than the ST-GS models only for grain yield in the BP. Using BP as a training set to predict the DH population resulted in poor predictions. Overall, all the six ST-GS models appear to be applicable for GS of yield and gluten strength traits in durum wheat, but we recommend the simple computational models RR-BLUP or G-BLUP for predicating single trait and MT-SI for predicting yield and protein simultaneously.  相似文献   

3.
The efficiency of marker-assisted prediction of phenotypes has been studied intensively for different types of plant breeding populations. However, one remaining question is how to incorporate and counterbalance information from biparental and multiparental populations into model training for genome-wide prediction. To address this question, we evaluated testcross performance of 1652 doubled-haploid maize (Zea mays L.) lines that were genotyped with 56,110 single nucleotide polymorphism markers and phenotyped for five agronomic traits in four to six European environments. The lines are arranged in two diverse half-sib panels representing two major European heterotic germplasm pools. The data set contains 10 related biparental dent families and 11 related biparental flint families generated from crosses of maize lines important for European maize breeding. With this new data set we analyzed genome-based best linear unbiased prediction in different validation schemes and compositions of estimation and test sets. Further, we theoretically and empirically investigated marker linkage phases across multiparental populations. In general, predictive abilities similar to or higher than those within biparental families could be achieved by combining several half-sib families in the estimation set. For the majority of families, 375 half-sib lines in the estimation set were sufficient to reach the same predictive performance of biomass yield as an estimation set of 50 full-sib lines. In contrast, prediction across heterotic pools was not possible for most cases. Our findings are important for experimental design in genome-based prediction as they provide guidelines for the genetic structure and required sample size of data sets used for model training.  相似文献   

4.
Small reference populations limit the accuracy of genomic prediction in numerically small breeds, such like Danish Jersey. The objective of this study was to investigate two approaches to improve genomic prediction by increasing size of reference population in Danish Jersey. The first approach was to include North American Jersey bulls in Danish Jersey reference population. The second was to genotype cows and use them as reference animals. The validation of genomic prediction was carried out on bulls and cows, respectively. In validation on bulls, about 300 Danish bulls (depending on traits) born in 2005 and later were used as validation data, and the reference populations were: (1) about 1050 Danish bulls, (2) about 1050 Danish bulls and about 1150 US bulls. In validation on cows, about 3000 Danish cows from 87 young half-sib families were used as validation data, and the reference populations were: (1) about 1250 Danish bulls, (2) about 1250 Danish bulls and about 1150 US bulls, (3) about 1250 Danish bulls and about 4800 cows, (4) about 1250 Danish bulls, 1150 US bulls and 4800 Danish cows. Genomic best linear unbiased prediction model was used to predict breeding values. De-regressed proofs were used as response variables. In the validation on bulls for eight traits, the joint DK-US bull reference population led to higher reliability of genomic prediction than the DK bull reference population for six traits, but not for fertility and longevity. Averaged over the eight traits, the gain was 3 percentage points. In the validation on cows for six traits (fertility and longevity were not available), the gain from inclusion of US bull in reference population was 6.6 percentage points in average over the six traits, and the gain from inclusion of cows was 8.2 percentage points. However, the gains from cows and US bulls were not accumulative. The total gain of including both US bulls and Danish cows was 10.5 percentage points. The results indicate that sharing reference data and including cows in reference population are efficient approaches to increase reliability of genomic prediction. Therefore, genomic selection is promising for numerically small population.  相似文献   

5.

Background

Genomic selection (GS) in forestry can substantially reduce the length of breeding cycle and increase gain per unit time through early selection and greater selection intensity, particularly for traits of low heritability and late expression. Affordable next-generation sequencing technologies made it possible to genotype large numbers of trees at a reasonable cost.

Results

Genotyping-by-sequencing was used to genotype 1,126 Interior spruce trees representing 25 open-pollinated families planted over three sites in British Columbia, Canada. Four imputation algorithms were compared (mean value (MI), singular value decomposition (SVD), expectation maximization (EM), and a newly derived, family-based k-nearest neighbor (kNN-Fam)). Trees were phenotyped for several yield and wood attributes. Single- and multi-site GS prediction models were developed using the Ridge Regression Best Linear Unbiased Predictor (RR-BLUP) and the Generalized Ridge Regression (GRR) to test different assumption about trait architecture. Finally, using PCA, multi-trait GS prediction models were developed. The EM and kNN-Fam imputation methods were superior for 30 and 60% missing data, respectively. The RR-BLUP GS prediction model produced better accuracies than the GRR indicating that the genetic architecture for these traits is complex. GS prediction accuracies for multi-site were high and better than those of single-sites while multi-site predictability produced the lowest accuracies reflecting type-b genetic correlations and deemed unreliable. The incorporation of genomic information in quantitative genetics analyses produced more realistic heritability estimates as half-sib pedigree tended to inflate the additive genetic variance and subsequently both heritability and gain estimates. Principle component scores as representatives of multi-trait GS prediction models produced surprising results where negatively correlated traits could be concurrently selected for using PCA2 and PCA3.

Conclusions

The application of GS to open-pollinated family testing, the simplest form of tree improvement evaluation methods, was proven to be effective. Prediction accuracies obtained for all traits greatly support the integration of GS in tree breeding. While the within-site GS prediction accuracies were high, the results clearly indicate that single-site GS models ability to predict other sites are unreliable supporting the utilization of multi-site approach. Principle component scores provided an opportunity for the concurrent selection of traits with different phenotypic optima.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1597-y) contains supplementary material, which is available to authorized users.  相似文献   

6.
Use of microsatellite loci to classify individuals by relatedness   总被引:19,自引:1,他引:18  
This study investigates the use of microsatellite loci for estimating relatedness between individuals in wild, outbred, vertebrate populations. We measured allele frequencies at 20 unlinked, dinucleotide-repeat microsatellite loci in a population of wild mice ( Mus musculus ), and used these observed frequencies to generate the expected distributions of pairwise relatedness among full sib, half sib, and unrelated pairs of individuals, as would be estimated from the microsatellite data. In this population one should be able to discriminate between unrelated and full-sib dyads with at least 97% accuracy, and to discriminate half-sib pairs from unrelated pairs or from full-sib pairs with better than 80% accuracy. If one uses the criterion that parent-offspring pairs must share at least one allele per locus, then only 15% of full-sib pairs, 2% of half-sib pairs, and 0% of unrelated pairs in this population would qualify as potential parent-offspring pairs. We verified that the simulation results (which assume a random mating population in Hardy-Weinberg and linkage equilibrium) accurately predict results one would obtain from this population in real life by scoring laboratory-bred full- and half-sib families whose parents were wild-caught mice from the study population. We also investigated the effects of using different numbers of loci, or loci of different average heterozygosities ( He ), on misclassification frequencies. Both variables have strong effects on misclassification rate. For example, it requires almost twice as many loci of He = 0.62 to achieve the same accuracy as a given number of loci of He = 0.75. Finally, we tested the ability of UPGMA clustering to identify family groups in our population. Clustering of allele matching scores among the offspring of four sets of independent maternal half sibships (four females, each mated to two different males) perfectly recovered the true family relationships.  相似文献   

7.
Hybrid breeding of rice via genomic selection   总被引:1,自引:0,他引:1  
Hybrid breeding is the main strategy for improving productivity in many crops, especially in rice and maize. Genomic hybrid breeding is a technology that uses whole‐genome markers to predict future hybrids. Predicted superior hybrids are then field evaluated and released as new hybrid cultivars after their superior performances are confirmed. This will increase the opportunity of selecting true superior hybrids with minimum costs. Here, we used genomic best linear unbiased prediction to perform hybrid performance prediction using an existing rice population of 1495 hybrids. Replicated 10‐fold cross‐validations showed that the prediction abilities on ten agronomic traits ranged from 0.35 to 0.92. Using the 1495 rice hybrids as a training sample, we predicted six agronomic traits of 100 hybrids derived from half diallel crosses involving 21 parents that are different from the parents of the hybrids in the training sample. The prediction abilities were relatively high, varying from 0.54 (yield) to 0.92 (grain length). We concluded that the current population of 1495 hybrids can be used to predict hybrids from seemingly unrelated parents. Eventually, we used this training population to predict all potential hybrids of cytoplasm male sterile lines from 3000 rice varieties from the 3K Rice Genome Project. Using a breeding index combining 10 traits, we identified the top and bottom 200 predicted hybrids. SNP genotypes of the training population and parameters estimated from this training population are available for general uses and further validation in genomic hybrid prediction of all potential hybrids generated from all varieties of rice.  相似文献   

8.
Genome-based prediction of testcross values in maize   总被引:1,自引:0,他引:1  
This is the first large-scale experimental study on genome-based prediction of testcross values in an advanced cycle breeding population of maize. The study comprised testcross progenies of 1,380 doubled haploid lines of maize derived from 36 crosses and phenotyped for grain yield and grain dry matter content in seven locations. The lines were genotyped with 1,152 single nucleotide polymorphism markers. Pedigree data were available for three generations. We used best linear unbiased prediction and stratified cross-validation to evaluate the performance of prediction models differing in the modeling of relatedness between inbred lines and in the calculation of genome-based coefficients of similarity. The choice of similarity coefficient did not affect prediction accuracies. Models including genomic information yielded significantly higher prediction accuracies than the model based on pedigree information alone. Average prediction accuracies based on genomic data were high even for a complex trait like grain yield (0.72–0.74) when the cross-validation scheme allowed for a high degree of relatedness between the estimation and the test set. When predictions were performed across distantly related families, prediction accuracies decreased significantly (0.47–0.48). Prediction accuracies decreased with decreasing sample size but were still high when the population size was halved (0.67–0.69). The results from this study are encouraging with respect to genome-based prediction of the genetic value of untested lines in advanced cycle breeding populations and the implementation of genomic selection in the breeding process.  相似文献   

9.
Genomic selection (GS) has been implemented in animal and plant species, and is regarded as a useful tool for accelerating genetic gains. Varying levels of genomic prediction accuracy have been obtained in plants, depending on the prediction problem assessed and on several other factors, such as trait heritability, the relationship between the individuals to be predicted and those used to train the models for prediction, number of markers, sample size and genotype × environment interaction (GE). The main objective of this article is to describe the results of genomic prediction in International Maize and Wheat Improvement Center''s (CIMMYT''s) maize and wheat breeding programs, from the initial assessment of the predictive ability of different models using pedigree and marker information to the present, when methods for implementing GS in practical global maize and wheat breeding programs are being studied and investigated. Results show that pedigree (population structure) accounts for a sizeable proportion of the prediction accuracy when a global population is the prediction problem to be assessed. However, when the prediction uses unrelated populations to train the prediction equations, prediction accuracy becomes negligible. When genomic prediction includes modeling GE, an increase in prediction accuracy can be achieved by borrowing information from correlated environments. Several questions on how to incorporate GS into CIMMYT''s maize and wheat programs remain unanswered and subject to further investigation, for example, prediction within and between related bi-parental crosses. Further research on the quantification of breeding value components for GS in plant breeding populations is required.  相似文献   

10.

Key message

Compared with independent validation, cross-validation simultaneously sampling genotypes and environments provided similar estimates of accuracy for genomic selection, but inflated estimates for marker-assisted selection.

Abstract

Estimates of prediction accuracy of marker-assisted (MAS) and genomic selection (GS) require validations. The main goal of our study was to compare the prediction accuracies of MAS and GS validated in an independent sample with results obtained from fivefold cross-validation using genomic and phenotypic data for Fusarium head blight resistance in wheat. In addition, the applicability of the reliability criterion, a concept originally developed in the context of classic animal breeding and GS, was explored for MAS. We observed that prediction accuracies of MAS were overestimated by 127% using cross-validation sampling genotype and environments in contrast to independent validation. In contrast, prediction accuracies of GS determined in independent samples are similar to those estimated with cross-validation sampling genotype and environments. This can be explained by small population differentiation between the training and validation sets in our study. For European wheat breeding, which is so far characterized by a slow temporal dynamic in allele frequencies, this assumption seems to be realistic. Thus, GS models used to improve European wheat populations are expected to possess a long-lasting validity. Since quantitative trait loci information can be exploited more precisely if the predicted genotype is more related to the training population, the reliability criterion is also a valuable tool to judge the level of prediction accuracy of individual genotypes in MAS.
  相似文献   

11.
Genomic Selection (GS) is a new breeding method in which genome-wide markers are used to predict the breeding value of individuals in a breeding population. GS has been shown to improve breeding efficiency in dairy cattle and several crop plant species, and here we evaluate for the first time its efficacy for breeding inbred lines of rice. We performed a genome-wide association study (GWAS) in conjunction with five-fold GS cross-validation on a population of 363 elite breeding lines from the International Rice Research Institute''s (IRRI) irrigated rice breeding program and herein report the GS results. The population was genotyped with 73,147 markers using genotyping-by-sequencing. The training population, statistical method used to build the GS model, number of markers, and trait were varied to determine their effect on prediction accuracy. For all three traits, genomic prediction models outperformed prediction based on pedigree records alone. Prediction accuracies ranged from 0.31 and 0.34 for grain yield and plant height to 0.63 for flowering time. Analyses using subsets of the full marker set suggest that using one marker every 0.2 cM is sufficient for genomic selection in this collection of rice breeding materials. RR-BLUP was the best performing statistical method for grain yield where no large effect QTL were detected by GWAS, while for flowering time, where a single very large effect QTL was detected, the non-GS multiple linear regression method outperformed GS models. For plant height, in which four mid-sized QTL were identified by GWAS, random forest produced the most consistently accurate GS models. Our results suggest that GS, informed by GWAS interpretations of genetic architecture and population structure, could become an effective tool for increasing the efficiency of rice breeding as the costs of genotyping continue to decline.  相似文献   

12.
Bulked co-segregant analysis is a method of rapidly allocating unmapped genetic markers to a specific chromosomal region. Although originally developed for utilization in populations derived from crosses between fully inbred lines, it has been proposed that co-segregant pools could also serve the same purpose in outbreeding populations, if individuals from only a single large family are pooled. Large, fully mapped, single-sire backcross and half-sib families are presently available as part of the international chicken and bovine reference family panels respectively. In this study, power and tests of significance for single-parent co-segregant analysis are derived for full-sib, single-parent back-cross and single-parent half-sib families, as a function of proportion of recombination between index marker and linked marker, proportion of single-parent alleles among the mates, number of individuals in each segregant pool and technical error variance. Power was found to be greater than 0–80 for many reasonable parameter combinations. The method is illustrated using microsatellite markers and a large single-sire bovine family, part of the international bovine reference family panel.  相似文献   

13.
We analysed family relationships among brown trout from two small tributary populations that have been suggested as a source of individuals for supportive breeding, using variation at eight microsatellite loci. As a control, we analysed a sample of supposedly unrelated individuals representing a large anadromous population, and we simulated unrelated individuals based on the allelic distributions in all three samples. Two different approaches were used: (1) pairwise estimates of relatedness between individuals and (2) a method for partitioning individuals into half-sib and full-sib families. The anadromous population did not show evidence of a significant number of closely related individuals. In both tributary populations, however, the distributions of pairwise relatedness estimates suggested the presence of several related individuals, and sibship reconstruction suggested fewer families consisting of more individuals than were observed for the simulated individuals. The expected increase of inbreeding coefficient in the two samples due to family structure was 0.026 and 0.030 respectively. Moreover, tests for recent bottlenecks yielded significant outcomes in both populations suggesting a history of low effective population sizes. Depending on the effective population size of captive spawners and past effective population sizes in the populations it could be beneficial to conduct sib-avoidance matings, though this cannot eliminate inbreeding but only delay it. Alternatively, individuals from different populations could be crossed. Sibship reconstruction provided the clearest evidence for family structure, but pairwise relatedness is the best measure for designing mating schemes, as it allows for mating as unrelated individuals as possible rather than just avoiding mating between sibs.  相似文献   

14.

Key message

We compare genomic selection methods that use correlated traits to help predict biomass yield in sorghum, and find that trait-assisted genomic selection performs best.

Abstract

Genomic selection (GS) is usually performed on a single trait, but correlated traits can also help predict a focal trait through indirect or multi-trait GS. In this study, we use a pre-breeding population of biomass sorghum to compare strategies that use correlated traits to improve prediction of biomass yield, the focal trait. Correlated traits include moisture, plant height measured at monthly intervals between planting and harvest, and the area under the growth progress curve. In addition to single- and multi-trait direct and indirect GS, we test a new strategy called trait-assisted GS, in which correlated traits are used along with marker data in the validation population to predict a focal trait. Single-trait GS for biomass yield had a prediction accuracy of 0.40. Indirect GS performed best using area under the growth progress curve to predict biomass yield, with a prediction accuracy of 0.37, and did not differ from indirect multi-trait GS that also used moisture information. Multi-trait GS and single-trait GS yielded similar results, indicating that correlated traits did not improve prediction of biomass yield in a standard GS scenario. However, trait-assisted GS increased prediction accuracy by up to \(50\%\) when using plant height in both the training and validation populations to help predict yield in the validation population. Coincidence between selected genotypes in phenotypic and genomic selection was also highest in trait-assisted GS. Overall, these results suggest that trait-assisted GS can be an efficient strategy when correlated traits are obtained earlier or more inexpensively than a focal trait.
  相似文献   

15.

Background

Genomic selection (GS) may improve selection response over conventional pedigree-based selection if markers capture more detailed information than pedigrees in recently domesticated tree species and/or make it more cost effective. Genomic prediction accuracies using 1748 trees and 6932 SNPs representative of as many distinct gene loci were determined for growth and wood traits in white spruce, within and between environments and breeding groups (BG), each with an effective size of Ne ≈ 20. Marker subsets were also tested.

Results

Model fits and/or cross-validation (CV) prediction accuracies for ridge regression (RR) and the least absolute shrinkage and selection operator models approached those of pedigree-based models. With strong relatedness between CV sets, prediction accuracies for RR within environment and BG were high for wood (r = 0.71–0.79) and moderately high for growth (r = 0.52–0.69) traits, in line with trends in heritabilities. For both classes of traits, these accuracies achieved between 83% and 92% of those obtained with phenotypes and pedigree information. Prediction into untested environments remained moderately high for wood (r ≥ 0.61) but dropped significantly for growth (r ≥ 0.24) traits, emphasizing the need to phenotype in all test environments and model genotype-by-environment interactions for growth traits. Removing relatedness between CV sets sharply decreased prediction accuracies for all traits and subpopulations, falling near zero between BGs with no known shared ancestry. For marker subsets, similar patterns were observed but with lower prediction accuracies.

Conclusions

Given the need for high relatedness between CV sets to obtain good prediction accuracies, we recommend to build GS models for prediction within the same breeding population only. Breeding groups could be merged to build genomic prediction models as long as the total effective population size does not exceed 50 individuals in order to obtain high prediction accuracy such as that obtained in the present study. A number of markers limited to a few hundred would not negatively impact prediction accuracies, but these could decrease more rapidly over generations. The most promising short-term approach for genomic selection would likely be the selection of superior individuals within large full-sib families vegetatively propagated to implement multiclonal forestry.  相似文献   

16.
QTL mapping experiments involve many animals to be genotyped and performance tested. Consequently, experimental designs need to be optimized to minimize the costs of data collection and genotyping. The present study has analyzed the power and efficiency of experiments with two or three-generation family structures containing full-sib families, half-sib families, or both. The focus was on data from one outbred population because the main interest is to locate genes that can be used for within-line selection. For a two generation experiment more animals had to be typed for marker loci to obtain a certain power than for a three generation experiment. Fewer trait values, however, had to be obtained for a two-generation experiment than for a three-generation experiment. A two or three-generation family structure with full-sib offspring was more efficient than a two or three-generation family structure with half-sib offspring. A family structure with full-sib grand-offspring, however, was less efficient than a family structure with half-sib grand-offspring. For the most efficient family structure each pair of parents had full-sib offspring that were genotyped for the marker. For the most-efficient family structure each full-sib offspring had half-sib grand-offspring for which trait values were obtained. For equal power with a heritability of 0.1 and 100 grand-offspring per full-sib offspring, 30-times less marker typings were required for this most efficient family structure than for a two-generation half-sib structure in which marker genotypes and trait values were obtained for half-sib offspring. The effect of heritability and the type of analysis (single marker or interval analysis) on the efficiency of a family structure is described. The results of this study should help to design QTL mapping experiments in an outbred population.  相似文献   

17.
Explicitly fitting effects for major genes or QTL that account for a large percentage of variation in a whole genomic prediction model may increase prediction accuracy. This study compared approaches to account for a major effect of an F94L variant in the MSTN gene within the genomic prediction using bovine whole‐genomic SNP markers. Among the beef cattle breeds, Limousin have been known to have an F94L variant that is not present in Angus. The reference population in this study consisted of 3060 beef cattle including pure‐bred Limousin (PL), cross‐bred Limousin with Angus (LF) and pure‐bred Angus, genotyped using a BovineSNP50 BeadChip and directly for the MSTN‐F94L variant. We compared prediction accuracies in PL animals using the three datasets from only the PL population, admixed PL and LF (AL) or multibreed analysis using all of the PL, LF and Angus (MB) population according to four‐fold cross‐validation after K‐means clustering. The MSTN‐F94L variant was the most strongly associated with five traits (birth weight, calving ease direct, milk, weaning weight and yield grade) among the 13 measured traits in PL and AL populations. Fitting the MSTN‐F94L variant as a random effect, the genomic prediction accuracies for birth weight increased by 2.7% in PL, by 2.2% in AL and by 3.2% in MB. Prediction accuracies for five traits increased in the MB analysis. Fitting MSTN‐F94L as a fixed effect in PL, AL and MB analyses resulted in increased prediction accuracy in PL for eight traits. Prediction accuracies can be improved by including a causal variant in genomic evaluation compared with simply using whole‐genome SNP markers. Fitting the causal variant as a fixed effect along with markers fitted as random effects resulted in greater prediction accuracies for most traits. Causal variants should be genotyped along with SNP markers.  相似文献   

18.
Genomic selection (GS) is of interest in breeding because of its potential for predicting the genetic value of individuals and increasing genetic gains per unit of time. To date, very few studies have reported empirical results of GS potential in the context of large population sizes and long breeding cycles such as for boreal trees. In this study, we assessed the effectiveness of marker-aided selection in an undomesticated white spruce (Picea glauca (Moench) Voss) population of large effective size using a GS approach. A discovery population of 1694 trees representative of 214 open-pollinated families from 43 natural populations was phenotyped for 12 wood and growth traits and genotyped for 6385 single-nucleotide polymorphisms (SNPs) mined in 2660 gene sequences. GS models were built to predict estimated breeding values using all the available SNPs or SNP subsets of the largest absolute effects, and they were validated using various cross-validation schemes. The accuracy of genomic estimated breeding values (GEBVs) varied from 0.327 to 0.435 when the training and the validation data sets shared half-sibs that were on average 90% of the accuracies achieved through traditionally estimated breeding values. The trend was also the same for validation across sites. As expected, the accuracy of GEBVs obtained after cross-validation with individuals of unknown relatedness was lower with about half of the accuracy achieved when half-sibs were present. We showed that with the marker densities used in the current study, predictions with low to moderate accuracy could be obtained within a large undomesticated population of related individuals, potentially resulting in larger gains per unit of time with GS than with the traditional approach.  相似文献   

19.
Stella A  Boettcher PJ 《Genetics》2004,166(1):341-350
Simulation was used to evaluate the performance of different selective genotyping strategies when using linkage disequilibrium across large half-sib families to position a QTL within a previously defined genomic region. Strategies examined included standard selective genotyping and different approaches of discordant and concordant sib selection applied to arbitrary or selected families. Strategies were compared as a function of effect and frequency of QTL alleles, heritability, and phenotypic expression of the trait. Large half-sib families were simulated for 100 generations and 2% of the population was genotyped in the final generation. Simple ANOVA was applied and the marker with the greatest F-value was considered the most likely QTL position. For traits with continuous phenotypes, genotyping the most divergent pairs of half-sibs from all families was the best strategy in general, but standard selective genotyping was somewhat more precise when heritability was low. When the phenotype was distributed in ordered categories, discordant sib selection was the optimal approach for positioning QTL for traits with high heritability and concordant sib selection was the best approach when genetic effects were small. Genotyping of a few selected sibs from many families was generally more efficient than genotyping many individuals from a few highly selected sires.  相似文献   

20.
Genomic selection (GS) is a modern breeding approach where genome-wide single-nucleotide polymorphism (SNP) marker profiles are simultaneously used to estimate performance of untested genotypes. In this study, the potential of genomic selection methods to predict testcross performance for hybrid canola breeding was applied for various agronomic traits based on genome-wide marker profiles. A total of 475 genetically diverse spring-type canola pollinator lines were genotyped at 24,403 single-copy, genome-wide SNP loci. In parallel, the 950 F1 testcross combinations between the pollinators and two representative testers were evaluated for a number of important agronomic traits including seedling emergence, days to flowering, lodging, oil yield and seed yield along with essential seed quality characters including seed oil content and seed glucosinolate content. A ridge-regression best linear unbiased prediction (RR-BLUP) model was applied in combination with 500 cross-validations for each trait to predict testcross performance, both across the whole population as well as within individual subpopulations or clusters, based solely on SNP profiles. Subpopulations were determined using multidimensional scaling and K-means clustering. Genomic prediction accuracy across the whole population was highest for seed oil content (0.81) followed by oil yield (0.75) and lowest for seedling emergence (0.29). For seed yieId, seed glucosinolate, lodging resistance and days to onset of flowering (DTF), prediction accuracies were 0.45, 0.61, 0.39 and 0.56, respectively. Prediction accuracies could be increased for some traits by treating subpopulations separately; a strategy which only led to moderate improvements for some traits with low heritability, like seedling emergence. No useful or consistent increase in accuracy was obtained by inclusion of a population substructure covariate in the model. Testcross performance prediction using genome-wide SNP markers shows considerable potential for pre-selection of promising hybrid combinations prior to resource-intensive field testing over multiple locations and years.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号