期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

The accuracy of prediction of genomic selection in elite hybrid rye populations surpasses the accuracy of marker-assisted selection and is equally augmented by multiple field evaluation locations and test years

Yu Wang Michael Florian Mette Thomas Miedaner Marlen Gottwald Peer Wilde Jochen C Reif Yusheng Zhao 《BMC genomics》2014,15(1)

Background

Marker-assisted selection (MAS) and genomic selection (GS) based on genome-wide marker data provide powerful tools to predict the genotypic value of selection material in plant breeding. However, case-to-case optimization of these approaches is required to achieve maximum accuracy of prediction with reasonable input.

Results

Based on extended field evaluation data for grain yield, plant height, starch content and total pentosan content of elite hybrid rye derived from testcrosses involving two bi-parental populations that were genotyped with 1048 molecular markers, we compared the accuracy of prediction of MAS and GS in a cross-validation approach. MAS delivered generally lower and in addition potentially over-estimated accuracies of prediction than GS by ridge regression best linear unbiased prediction (RR-BLUP). The grade of relatedness of the plant material included in the estimation and test sets clearly affected the accuracy of prediction of GS. Within each of the two bi-parental populations, accuracies differed depending on the relatedness of the respective parental lines. Across populations, accuracy increased when both populations contributed to estimation and test set. In contrast, accuracy of prediction based on an estimation set from one population to a test set from the other population was low despite that the two bi-parental segregating populations under scrutiny shared one parental line. Limiting the number of locations or years in field testing reduced the accuracy of prediction of GS equally, supporting the view that to establish robust GS calibration models a sufficient number of test locations is of similar importance as extended testing for more than one year.

Conclusions

In hybrid rye, genomic selection is superior to marker-assisted selection. However, it achieves high accuracies of prediction only for selection candidates closely related to the plant material evaluated in field trials, resulting in a rather pessimistic prognosis for distantly related material. Both, the numbers of evaluation locations and testing years in trials contribute equally to prediction accuracy.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-556) contains supplementary material, which is available to authorized users. 相似文献

2.

Practical application of genomic selection in a doubled-haploid winter wheat breeding program

Jiayin Song Brett F. Carver Carol Powers Liuling Yan Jaroslav Klápště Yousry A. El-Kassaby Charles Chen 《Molecular breeding : new strategies in plant improvement》2017,37(10):117

Crop improvement is a long-term, expensive institutional endeavor. Genomic selection (GS), which uses single nucleotide polymorphism (SNP) information to estimate genomic breeding values, has proven efficient to increasing genetic gain by accelerating the breeding process in animal breeding programs. As for crop improvement, with few exceptions, GS applicability remains in the evaluation of algorithm performance. In this study, we examined factors related to GS applicability in line development stage for grain yield using a hard red winter wheat (Triticum aestivum L.) doubled-haploid population. The performance of GS was evaluated in two consecutive years to predict grain yield. In general, the semi-parametric reproducing kernel Hilbert space prediction algorithm outperformed parametric genomic best linear unbiased prediction. For both parametric and semi-parametric algorithms, an upward bias in predictability was apparent in within-year cross-validation, suggesting the prerequisite of cross-year validation for a more reliable prediction. Adjusting the training population’s phenotype for genotype by environment effect had a positive impact on GS model’s predictive ability. Possibly due to marker redundancy, a selected subset of SNPs at an absolute pairwise correlation coefficient threshold value of 0.4 produced comparable results and reduced the computational burden of considering the full SNP set. Finally, in the context of an ongoing breeding and selection effort, the present study has provided a measure of confidence based on the deviation of line selection from GS results, supporting the implementation of GS in wheat variety development. 相似文献

3.

Prediction accuracies for growth and wood attributes of interior spruce in space using genotyping-by-sequencing

Omnia Gamal El-Dien Blaise Ratcliffe Jaroslav Kláp?tě Charles Chen Ilga Porth Yousry A El-Kassaby 《BMC genomics》2015,16(1)

Background

Genomic selection (GS) in forestry can substantially reduce the length of breeding cycle and increase gain per unit time through early selection and greater selection intensity, particularly for traits of low heritability and late expression. Affordable next-generation sequencing technologies made it possible to genotype large numbers of trees at a reasonable cost.

Results

Genotyping-by-sequencing was used to genotype 1,126 Interior spruce trees representing 25 open-pollinated families planted over three sites in British Columbia, Canada. Four imputation algorithms were compared (mean value (MI), singular value decomposition (SVD), expectation maximization (EM), and a newly derived, family-based k-nearest neighbor (kNN-Fam)). Trees were phenotyped for several yield and wood attributes. Single- and multi-site GS prediction models were developed using the Ridge Regression Best Linear Unbiased Predictor (RR-BLUP) and the Generalized Ridge Regression (GRR) to test different assumption about trait architecture. Finally, using PCA, multi-trait GS prediction models were developed. The EM and kNN-Fam imputation methods were superior for 30 and 60% missing data, respectively. The RR-BLUP GS prediction model produced better accuracies than the GRR indicating that the genetic architecture for these traits is complex. GS prediction accuracies for multi-site were high and better than those of single-sites while multi-site predictability produced the lowest accuracies reflecting type-b genetic correlations and deemed unreliable. The incorporation of genomic information in quantitative genetics analyses produced more realistic heritability estimates as half-sib pedigree tended to inflate the additive genetic variance and subsequently both heritability and gain estimates. Principle component scores as representatives of multi-trait GS prediction models produced surprising results where negatively correlated traits could be concurrently selected for using PCA2 and PCA3.

Conclusions

The application of GS to open-pollinated family testing, the simplest form of tree improvement evaluation methods, was proven to be effective. Prediction accuracies obtained for all traits greatly support the integration of GS in tree breeding. While the within-site GS prediction accuracies were high, the results clearly indicate that single-site GS models ability to predict other sites are unreliable supporting the utilization of multi-site approach. Principle component scores provided an opportunity for the concurrent selection of traits with different phenotypic optima.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1597-y) contains supplementary material, which is available to authorized users. 相似文献

4.

Optimization of multi-environment trials for genomic selection based on crop models

R. Rincent E. Kuhn H. Monod F.-X. Oury M. Rousset V. Allard J. Le Gouis 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2017,130(8):1735-1752

Key message

We propose a statistical criterion to optimize multi-environment trials to predict genotype × environment interactions more efficiently, by combining crop growth models and genomic selection models.

Abstract

Genotype × environment interactions (GEI) are common in plant multi-environment trials (METs). In this context, models developed for genomic selection (GS) that refers to the use of genome-wide information for predicting breeding values of selection candidates need to be adapted. One promising way to increase prediction accuracy in various environments is to combine ecophysiological and genetic modelling thanks to crop growth models (CGM) incorporating genetic parameters. The efficiency of this approach relies on the quality of the parameter estimates, which depends on the environments composing this MET used for calibration. The objective of this study was to determine a method to optimize the set of environments composing the MET for estimating genetic parameters in this context. A criterion called OptiMET was defined to this aim, and was evaluated on simulated and real data, with the example of wheat phenology. The MET defined with OptiMET allowed estimating the genetic parameters with lower error, leading to higher QTL detection power and higher prediction accuracies. MET defined with OptiMET was on average more efficient than random MET composed of twice as many environments, in terms of quality of the parameter estimates. OptiMET is thus a valuable tool to determine optimal experimental conditions to best exploit MET and the phenotyping tools that are currently developed.

相似文献

5.

Genomic selection accuracies within and between environments and small breeding groups in white spruce

Jean Beaulieu Trevor K Doerksen John MacKay André Rainville Jean Bousquet 《BMC genomics》2014,15(1)

Background

Genomic selection (GS) may improve selection response over conventional pedigree-based selection if markers capture more detailed information than pedigrees in recently domesticated tree species and/or make it more cost effective. Genomic prediction accuracies using 1748 trees and 6932 SNPs representative of as many distinct gene loci were determined for growth and wood traits in white spruce, within and between environments and breeding groups (BG), each with an effective size of N_e ≈ 20. Marker subsets were also tested.

Results

Model fits and/or cross-validation (CV) prediction accuracies for ridge regression (RR) and the least absolute shrinkage and selection operator models approached those of pedigree-based models. With strong relatedness between CV sets, prediction accuracies for RR within environment and BG were high for wood (r = 0.71–0.79) and moderately high for growth (r = 0.52–0.69) traits, in line with trends in heritabilities. For both classes of traits, these accuracies achieved between 83% and 92% of those obtained with phenotypes and pedigree information. Prediction into untested environments remained moderately high for wood (r ≥ 0.61) but dropped significantly for growth (r ≥ 0.24) traits, emphasizing the need to phenotype in all test environments and model genotype-by-environment interactions for growth traits. Removing relatedness between CV sets sharply decreased prediction accuracies for all traits and subpopulations, falling near zero between BGs with no known shared ancestry. For marker subsets, similar patterns were observed but with lower prediction accuracies.

Conclusions

Given the need for high relatedness between CV sets to obtain good prediction accuracies, we recommend to build GS models for prediction within the same breeding population only. Breeding groups could be merged to build genomic prediction models as long as the total effective population size does not exceed 50 individuals in order to obtain high prediction accuracy such as that obtained in the present study. A number of markers limited to a few hundred would not negatively impact prediction accuracies, but these could decrease more rapidly over generations. The most promising short-term approach for genomic selection would likely be the selection of superior individuals within large full-sib families vegetatively propagated to implement multiclonal forestry. 相似文献

6.

Efficiency of multi-trait,indirect, and trait-assisted genomic selection for improvement of biomass sorghum

Samuel B. Fernandes Kaio O. G. Dias Daniel F. Ferreira Patrick J. Brown 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2018,131(3):747-755

Key message

We compare genomic selection methods that use correlated traits to help predict biomass yield in sorghum, and find that trait-assisted genomic selection performs best.

Abstract

Genomic selection (GS) is usually performed on a single trait, but correlated traits can also help predict a focal trait through indirect or multi-trait GS. In this study, we use a pre-breeding population of biomass sorghum to compare strategies that use correlated traits to improve prediction of biomass yield, the focal trait. Correlated traits include moisture, plant height measured at monthly intervals between planting and harvest, and the area under the growth progress curve. In addition to single- and multi-trait direct and indirect GS, we test a new strategy called trait-assisted GS, in which correlated traits are used along with marker data in the validation population to predict a focal trait. Single-trait GS for biomass yield had a prediction accuracy of 0.40. Indirect GS performed best using area under the growth progress curve to predict biomass yield, with a prediction accuracy of 0.37, and did not differ from indirect multi-trait GS that also used moisture information. Multi-trait GS and single-trait GS yielded similar results, indicating that correlated traits did not improve prediction of biomass yield in a standard GS scenario. However, trait-assisted GS increased prediction accuracy by up to \(50\%\) when using plant height in both the training and validation populations to help predict yield in the validation population. Coincidence between selected genotypes in phenotypic and genomic selection was also highest in trait-assisted GS. Overall, these results suggest that trait-assisted GS can be an efficient strategy when correlated traits are obtained earlier or more inexpensively than a focal trait.

相似文献

7.

Genomic models with genotype × environment interaction for predicting hybrid performance: an application in maize hybrids

Rocío Acosta-Pech José Crossa Gustavo de los Campos Simon Teyssèdre Bruno Claustres Sergio Pérez-Elizalde Paulino Pérez-Rodríguez 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2017,130(7):1431-1440

Key message

A new genomic model that incorporates genotype?×?environment interaction gave increased prediction accuracy of untested hybrid response for traits such as percent starch content, percent dry matter content and silage yield of maize hybrids.

Abstract

The prediction of hybrid performance (HP) is very important in agricultural breeding programs. In plant breeding, multi-environment trials play an important role in the selection of important traits, such as stability across environments, grain yield and pest resistance. Environmental conditions modulate gene expression causing genotype?×?environment interaction (G?×?E), such that the estimated genetic correlations of the performance of individual lines across environments summarize the joint action of genes and environmental conditions. This article proposes a genomic statistical model that incorporates G?×?E for general and specific combining ability for predicting the performance of hybrids in environments. The proposed model can also be applied to any other hybrid species with distinct parental pools. In this study, we evaluated the predictive ability of two HP prediction models using a cross-validation approach applied in extensive maize hybrid data, comprising 2724 hybrids derived from 507 dent lines and 24 flint lines, which were evaluated for three traits in 58 environments over 12 years; analyses were performed for each year. On average, genomic models that include the interaction of general and specific combining ability with environments have greater predictive ability than genomic models without interaction with environments (ranging from 12 to 22%, depending on the trait). We concluded that including G?×?E in the prediction of untested maize hybrids increases the accuracy of genomic models.

相似文献

8.

Genomic prediction for rust resistance in diverse wheat landraces 总被引：1，自引：0，他引：1

Hans D. Daetwyler Urmil K. Bansal Harbans S. Bariana Matthew J. Hayden Ben J. Hayes 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2014,127(8):1795-1803

Key message

We have demonstrated that genomic selection in diverse wheat landraces for resistance to leaf, stem and strip rust is possible, as genomic breeding values were moderately accurate. Markers with large effects in the Bayesian analysis confirmed many known genes, while also discovering many previously uncharacterised genome regions associated with rust scores.

Abstract

Genomic selection, where selection decisions are based on genomic estimated breeding values (GEBVs) derived from genome-wide DNA markers, could accelerate genetic progress in plant breeding. In this study, we assessed the accuracy of GEBVs for rust resistance in 206 hexaploid wheat (Triticum aestivum) landraces from the Watkins collection of phenotypically diverse wheat genotypes from 32 countries. The landraces were genotyped for 5,568 SNPs using an Illumina iSelect 9 K bead chip assay and phenotyped for field-based leaf rust (Lr), stem rust (Sr) and stripe rust (Yr) responses across multiple years. Genomic Best Linear Unbiased Prediction (GBLUP) and a Bayesian Regression method (BayesR) were used to predict GEBVs. Based on fivefold cross-validation, the accuracy of genomic prediction averaged across years was 0.35, 0.27 and 0.44 for Lr, Sr and Yr using GBLUP and 0.33, 0.38 and 0.30 for Lr, Sr and Yr using BayesR, respectively. Inclusion of PCR-predicted genotypes for known rust resistance genes increased accuracy more substantially when the marker was diagnostic (Lr34/Sr57/Yr18) for the presence-absence of the gene rather than just linked (Sr2). Investigation of the impact of genetic relatedness between validation and reference lines on accuracy of genomic prediction showed that accuracy will be higher when each validation line had at least one close relationship to the reference lines. Overall, the prediction accuracies achieved in this study are encouraging, and confirm the feasibility of genomic selection in wheat. In several instances, estimated marker effects were confirmed by published literature and results of mapping experiments using Watkins accessions. 相似文献

9.

Genomic and pedigree-based prediction for leaf,stem, and stripe rust resistance in wheat

Philomin Juliana Ravi P. Singh Pawan K. Singh Jose Crossa Julio Huerta-Espino Caixia Lan Sridhar Bhavani Jessica E. Rutkoski Jesse A. Poland Gary C. Bergstrom Mark E. Sorrells 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2017,130(7):1415-1430

Key message

Genomic prediction for seedling and adult plant resistance to wheat rusts was compared to prediction using few markers as fixed effects in a least-squares approach and pedigree-based prediction.

Abstract

The unceasing plant-pathogen arms race and ephemeral nature of some rust resistance genes have been challenging for wheat (Triticum aestivum L.) breeding programs and farmers. Hence, it is important to devise strategies for effective evaluation and exploitation of quantitative rust resistance. One promising approach that could accelerate gain from selection for rust resistance is ‘genomic selection’ which utilizes dense genome-wide markers to estimate the breeding values (BVs) for quantitative traits. Our objective was to compare three genomic prediction models including genomic best linear unbiased prediction (GBLUP), GBLUP A that was GBLUP with selected loci as fixed effects and reproducing kernel Hilbert spaces-markers (RKHS-M) with least-squares (LS) approach, RKHS-pedigree (RKHS-P), and RKHS markers and pedigree (RKHS-MP) to determine the BVs for seedling and/or adult plant resistance (APR) to leaf rust (LR), stem rust (SR), and stripe rust (YR). The 333 lines in the 45th IBWSN and the 313 lines in the 46th IBWSN were genotyped using genotyping-by-sequencing and phenotyped in replicated trials. The mean prediction accuracies ranged from 0.31–0.74 for LR seedling, 0.12–0.56 for LR APR, 0.31–0.65 for SR APR, 0.70–0.78 for YR seedling, and 0.34–0.71 for YR APR. For most datasets, the RKHS-MP model gave the highest accuracies, while LS gave the lowest. GBLUP, GBLUP A, RKHS-M, and RKHS-P models gave similar accuracies. Using genome-wide marker-based models resulted in an average of 42% increase in accuracy over LS. We conclude that GS is a promising approach for improvement of quantitative rust resistance and can be implemented in the breeding pipeline.

相似文献

10.

Automated prediction of HIV drug resistance from genotype data

ChenHsiang Shen Xiaxia Yu Robert W. Harrison Irene T. Weber 《BMC bioinformatics》2016,17(8):278

相似文献

11.

Genome-based prediction of maize hybrid performance across genetic groups,testers, locations,and years

Theresa Albrecht Hans-Jürgen Auinger Valentin Wimmer Joseph O. Ogutu Carsten Knaak Milena Ouzunova Hans-Peter Piepho Chris-Carolin Schön 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2014,127(6):1375-1386

Key message

The calibration data for genomic prediction should represent the full genetic spectrum of a breeding program. Data heterogeneity is minimized by connecting data sources through highly related test units.

Abstract

One of the major challenges of genome-enabled prediction in plant breeding lies in the optimum design of the population employed in model training. With highly interconnected breeding cycles staggered in time the choice of data for model training is not straightforward. We used cross-validation and independent validation to assess the performance of genome-based prediction within and across genetic groups, testers, locations, and years. The study comprised data for 1,073 and 857 doubled haploid lines evaluated as testcrosses in 2 years. Testcrosses were phenotyped for grain dry matter yield and content and genotyped with 56,110 single nucleotide polymorphism markers. Predictive abilities strongly depended on the relatedness of the doubled haploid lines from the estimation set with those on which prediction accuracy was assessed. For scenarios with strong population heterogeneity it was advantageous to perform predictions within a priori defined genetic groups until higher connectivity through related test units was achieved. Differences between group means had a strong effect on predictive abilities obtained with both cross-validation and independent validation. Predictive abilities across subsequent cycles of selection and years were only slightly reduced compared to predictive abilities obtained with cross-validation within the same year. We conclude that the optimum data set for model training in genome-enabled prediction should represent the full genetic and environmental spectrum of the respective breeding program. Data heterogeneity can be reduced by experimental designs that maximize the connectivity between data sources by common or highly related test units. 相似文献

12.

Will genomic selection be a practical method for plant breeding?

Akihiro Nakaya Sachiko N. Isobe 《Annals of botany》2012,110(6):1303-1316

Background

Genomic selection or genome-wide selection (GS) has been highlighted as a new approach for marker-assisted selection (MAS) in recent years. GS is a form of MAS that selects favourable individuals based on genomic estimated breeding values. Previous studies have suggested the utility of GS, especially for capturing small-effect quantitative trait loci, but GS has not become a popular methodology in the field of plant breeding, possibly because there is insufficient information available on GS for practical use.

Scope

In this review, GS is discussed from a practical breeding viewpoint. Statistical approaches employed in GS are briefly described, before the recent progress in GS studies is surveyed. GS practices in plant breeding are then reviewed before future prospects are discussed.

Conclusions

Statistical concepts used in GS are discussed with genetic models and variance decomposition, heritability, breeding value and linear model. Recent progress in GS studies is reviewed with a focus on empirical studies. For the practice of GS in plant breeding, several specific points are discussed including linkage disequilibrium, feature of populations and genotyped markers and breeding scheme. Currently, GS is not perfect, but it is a potent, attractive and valuable approach for plant breeding. This method will be integrated into many practical breeding programmes in the near future with further advances and the maturing of its theory.Key words: Genomic selection, plant breeding, marker assisted selection, genetic model, linkage disequilibrium 相似文献

13.

Accuracies of genomic breeding values in American Angus beef cattle using K-means clustering for cross-validation

Mahdi Saatchi Mathew C McClure Stephanie D McKay Megan M Rolf JaeWoo Kim Jared E Decker Tasia M Taxis Richard H Chapple Holly R Ramey Sally L Northcutt Stewart Bauck Brent Woodward Jack CM Dekkers Rohan L Fernando Robert D Schnabel Dorian J Garrick Jeremy F Taylor 《遗传、选种与进化》2011,43(1):40

Background

Genomic selection is a recently developed technology that is beginning to revolutionize animal breeding. The objective of this study was to estimate marker effects to derive prediction equations for direct genomic values for 16 routinely recorded traits of American Angus beef cattle and quantify corresponding accuracies of prediction.

Methods

Deregressed estimated breeding values were used as observations in a weighted analysis to derive direct genomic values for 3570 sires genotyped using the Illumina BovineSNP50 BeadChip. These bulls were clustered into five groups using K-means clustering on pedigree estimates of additive genetic relationships between animals, with the aim of increasing within-group and decreasing between-group relationships. All five combinations of four groups were used for model training, with cross-validation performed in the group not used in training. Bivariate animal models were used for each trait to estimate the genetic correlation between deregressed estimated breeding values and direct genomic values.

Results

Accuracies of direct genomic values ranged from 0.22 to 0.69 for the studied traits, with an average of 0.44. Predictions were more accurate when animals within the validation group were more closely related to animals in the training set. When training and validation sets were formed by random allocation, the accuracies of direct genomic values ranged from 0.38 to 0.85, with an average of 0.65, reflecting the greater relationship between animals in training and validation. The accuracies of direct genomic values obtained from training on older animals and validating in younger animals were intermediate to the accuracies obtained from K-means clustering and random clustering for most traits. The genetic correlation between deregressed estimated breeding values and direct genomic values ranged from 0.15 to 0.80 for the traits studied.

Conclusions

These results suggest that genomic estimates of genetic merit can be produced in beef cattle at a young age but the recurrent inclusion of genotyped sires in retraining analyses will be necessary to routinely produce for the industry the direct genomic values with the highest accuracy. 相似文献

14.

Genomic predictions of breeding values in a cloned <Emphasis Type="Italic">Eucalyptus globulus</Emphasis> population in Chile

Ricardo Durán Fikret Isik Jaime Zapata-Valenzuela Claudio Balocchi Sofía Valenzuela 《Tree Genetics & Genomes》2017,13(4):74

In Chile, an intensive Eucalyptus globulus clonal selection program is being carried out to increase forest productivity for pulp production. A breeding population was used to investigate the predicted ability of single nucleotide polymorphism (SNP) markers for genomic selection (GS). A total of 310 clones from 53 families were used. Stem volume and wood density were measured on all clones. Trees were genotyped at 12 K polymorphic markers using the EUChip60K genotype array. Genomic best linear unbiased prediction, Bayesian lasso regression, Bayes B, and Bayes C models were used to predict genomic estimated breeding values (GEBV). For cross-validation, 260 individuals were sampled for model training and 50 individuals for model validation, using 2 folds and 10 replications each. The average predictive ability estimates for wood density and stem volume across the models were 0.58 and 0.75, respectively. The average rank correlations were 0.59 and 0.71, respectively. Models produced very similar bias for both traits. When clones were ranked based on their GEBV, models had similar phenotypic mean for the top 10% of the clones. The predicted ability of markers will likely decrease if the models are used to predict GEBV of new material coming from the breeding program, because of a different marker–trait phase introduced by recombination. The results should be validated with larger populations and across two generations before routine applications of GS in E. globulus. We suggest that GS is a viable strategy to accelerate clonal selection program of E. globulus in Chile. 相似文献

15.

Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions 总被引：1，自引：0，他引：1

Nicolas Heslot Deniz Akdemir Mark E. Sorrells Jean-Luc Jannink 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2014,127(2):463-480

Key message

Development of models to predict genotype by environment interactions, in unobserved environments, using environmental covariates, a crop model and genomic selection. Application to a large winter wheat dataset.

Abstract

Genotype by environment interaction (G*E) is one of the key issues when analyzing phenotypes. The use of environment data to model G*E has long been a subject of interest but is limited by the same problems as those addressed by genomic selection methods: a large number of correlated predictors each explaining a small amount of the total variance. In addition, non-linear responses of genotypes to stresses are expected to further complicate the analysis. Using a crop model to derive stress covariates from daily weather data for predicted crop development stages, we propose an extension of the factorial regression model to genomic selection. This model is further extended to the marker level, enabling the modeling of quantitative trait loci (QTL) by environment interaction (Q*E), on a genome-wide scale. A newly developed ensemble method, soft rule fit, was used to improve this model and capture non-linear responses of QTL to stresses. The method is tested using a large winter wheat dataset, representative of the type of data available in a large-scale commercial breeding program. Accuracy in predicting genotype performance in unobserved environments for which weather data were available increased by 11.1 % on average and the variability in prediction accuracy decreased by 10.8 %. By leveraging agronomic knowledge and the large historical datasets generated by breeding programs, this new model provides insight into the genetic architecture of genotype by environment interactions and could predict genotype performance based on past and future weather scenarios. 相似文献

16.

Genome-wide mapping and prediction suggests presence of local epistasis in a vast elite winter wheat populations adapted to Central Europe

Sang He Jochen C. Reif Viktor Korzun Reiner Bothe Erhard Ebmeyer Yong Jiang 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2017,130(4):635-647

Key message

Genome-wide association mapping as well as marker- and haplotype-based genome-wide selection unraveled a complex genetic architecture of grain yield with absence of large effect QTL and presence of local epistatic effects.

Abstract

The genetic architecture of grain yield determines to a large extent the optimum design of genomic-assisted wheat breeding programs. The main goal of our study was to examine the potential and limitations to dissect the genetic architecture of grain yield in wheat using a large experimental data set. Our study was based on phenotypic information and genomic data of 13,901 SNPs of a diverse set of 3816 elite wheat lines adapted to Central Europe. We applied genome-wide association mapping based on experimental and simulated data sets and performed marker- and haplotype-based genomic prediction. Computer simulations revealed for our mapping population a high power to detect QTL, even if they individually explained only 2.5% of the genetic variation. Despite this, we found no stable marker–trait associations when validating in independent subsets. A two-dimensional scan for marker–marker interactions indicated presence of local epistasis which was further supported by improved prediction abilities when shifting from marker- to haplotype-based genome-wide prediction approaches. We observed that marker effects estimated using genome-wide prediction approaches strongly varied across years albeit resulting in high prediction abilities. Thus, our results suggested that the prediction accuracy of genomic selection in wheat is mainly driven by relatedness rather than by exploiting knowledge of the genetic architecture.

相似文献

17.

Increased genomic prediction accuracy in wheat breeding using a large Australian panel

Adam Norman Julian Taylor Emi Tanaka Paul Telfer James Edwards Jean-Pierre Martinant Haydn Kuchel 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2017,130(12):2543-2555

Key message

Genomic prediction accuracy within a large panel was found to be substantially higher than that previously observed in smaller populations, and also higher than QTL-based prediction.

Abstract

In recent years, genomic selection for wheat breeding has been widely studied, but this has typically been restricted to population sizes under 1000 individuals. To assess its efficacy in germplasm representative of commercial breeding programmes, we used a panel of 10,375 Australian wheat breeding lines to investigate the accuracy of genomic prediction for grain yield, physical grain quality and other physiological traits. To achieve this, the complete panel was phenotyped in a dedicated field trial and genotyped using a custom Axiom^TM Affymetrix SNP array. A high-quality consensus map was also constructed, allowing the linkage disequilibrium present in the germplasm to be investigated. Using the complete SNP array, genomic prediction accuracies were found to be substantially higher than those previously observed in smaller populations and also more accurate compared to prediction approaches using a finite number of selected quantitative trait loci. Multi-trait genetic correlations were also assessed at an additive and residual genetic level, identifying a negative genetic correlation between grain yield and protein as well as a positive genetic correlation between grain size and test weight.

相似文献

18.

Performance of multi-trait genomic selection for <Emphasis Type="Italic">Eucalyptus robusta</Emphasis> breeding program

Tahina Rambolarimanana Lolona Ramamonjisoa Daniel Verhaegen Jean-Michel Leong Pock Tsy Laval Jacquin Tuong-Vi Cao-Hamadou Garel Makouanzi Jean-Marc Bouvet 《Tree Genetics & Genomes》2018,14(5):71

In forest tree genetic improvement, multi-trait genomic selection (GS) may have advantages in improving the accuracy of the genotype estimation and shortening selection cycles. For the breeding of Eucalyptus robusta, one of the most exotic planted species in Madagascar, volume at 49 months (V49), total lignin (TL), and holo-cellulose (Holo) were considered. For GS, 2919 single nucleotide polymorphisms (SNP) were used with the genomic best linear unbiased predictor (GBLUP) method, which was as efficient as the reproducing kernel Hilbert space (RKHS) and elastic net methods (EN), but more adapted to multi-trait modeling. The efficiency of individual I model, including the genomic data, was much higher than the provenance effect P model. For example, with V49, mean goodness-of-fit was: r_{I_Full} =?0.79, r_{P_Full} =?0.37 for I and P, respectively. The prediction accuracies using the cross-validation procedure were lower for V49: r_I =?0.29 r_P =?0.28. The genetic gains resulting from the indexes associating (V49, TL) and (V49, Holo) were higher using I than for the P model; for V49, the relative genetic gain was 37 and 20%, respectively, with 5% of selection intensity. The single-trait approach was as efficient as the multi-trait approach given the weak correlations between V49 and TL or Holo. The I model also brings greater diversity: for V49 the number of provenances represented in a selected population was two and three with the P model, and 6 and 16 with the I model. 相似文献

19.

Genomic prediction of starch content and chipping quality in tetraploid potato using genotyping-by-sequencing

Elsa Sverrisdóttir Stephen Byrne Ea Høegh Riis Sundmark Heidi Øllegaard Johnsen Hanne Grethe Kirk Torben Asp Luc Janss Kåre L. Nielsen 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2017,130(10):2091-2108

Key message

Genomic prediction models for starch content and chipping quality show promising results, suggesting that genomic selection is a feasible breeding strategy in tetraploid potato.

Abstract

Genomic selection uses genome-wide molecular markers to predict performance of individuals and allows selections in the absence of direct phenotyping. It is regarded as a useful tool to accelerate genetic gain in breeding programs, and is becoming increasingly viable for crops as genotyping costs continue to fall. In this study, we have generated genomic prediction models for starch content and chipping quality in tetraploid potato to facilitate varietal development. Chipping quality was evaluated as the colour of a potato chip after frying following cold induced sweetening. We used genotyping-by-sequencing to genotype 762 offspring, derived from a population generated from biparental crosses of 18 tetraploid parents. Additionally, 74 breeding clones were genotyped, representing a test panel for model validation. We generated genomic prediction models from 171,859 single-nucleotide polymorphisms to calculate genomic estimated breeding values. Cross-validated prediction correlations of 0.56 and 0.73 were obtained within the training population for starch content and chipping quality, respectively, while correlations were lower when predicting performance in the test panel, at 0.30–0.31 and 0.42–0.43, respectively. Predictions in the test panel were slightly improved when including representatives from the test panel in the training population but worsened when preceded by marker selection. Our results suggest that genomic prediction is feasible, however, the extremely high allelic diversity of tetraploid potato necessitates large training populations to efficiently capture the genetic diversity of elite potato germplasm and enable accurate prediction across the entire spectrum of elite potatoes. Nonetheless, our results demonstrate that GS is a promising breeding strategy for tetraploid potato.

相似文献

20.

Assessing the variation and genetic architecture of asparagine content in wheat: What can plant breeding contribute to a reduction in the acrylamide precursor?

Matthias Rapp Klaus Schwadorf Willmar L. Leiser Tobias Würschum C. Friedrich H. Longin 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2018,131(11):2427-2437

Key message

A large genetic variation, moderately high heritability, and promising prediction ability for genomic selection show that wheat breeding can substantially reduce the acrylamide forming potential in bread wheat by a reduction in its precursor asparagine.

Abstract

Acrylamide is a potentially carcinogenic substance that is formed in baked products of wheat via the Maillard reaction from carbonyl sources and asparagine. In bread, the acrylamide content increases almost linearly with the asparagine content of the wheat grains. Our objective was, therefore, to investigate the potential of wheat breeding to contribute to a reduction in acrylamide by decreasing the asparagine content in wheat grains. To this end, we evaluated 149 wheat varieties from Central Europe at three locations for asparagine content, as well as for sulfur content, and five important quality traits regularly assessed in bread wheat breeding. The mean asparagine content ranged from 143.25 to 392.75 mg/kg for the different wheat varieties, thus underlining the possibility to reduce the acrylamide content of baked wheat products considerably by selecting appropriate varieties. Furthermore, a moderately high heritability of 0.65 and no negative correlations with quality traits like protein content, sedimentation volume and falling number show that breeding of quality wheat with low asparagine content is feasible. Genome-wide association mapping identified few QTL for asparagine content, the largest explaining 18% of the genotypic variance. Combining these QTL with a genome-wide prediction approach yielded a mean cross-validated prediction ability of 0.62. As we observed a high genotype-by-environment interaction for asparagine content, we recommend the costly and slow laboratory analysis only for late breeding generations, while selection in early generations could be based on marker-assisted or genomic selection.

相似文献