
1.

Comparison of whole-genome prediction models for traits with contrasting genetic architecture in a diversity panel of maize inbred lines

C Riedelsheimer F Technow AE Melchinger 《BMC genomics》2012,13(1):452

ABSTRACT: BACKGROUND: There is increasing empirical evidence that whole-genome prediction (WGP) is a powerful tool for predicting line and hybrid performance in maize. However, there is a lack of knowledge about the sensitivity of WGP models towards the genetic architecture of the trait. Whereas previous studies exclusively focused on highly polygenic traits, important agronomic traits such as disease resistances, nutrifunctional or climate adaptational traits have a genetic architecture which is either much less complex or unknown. For such cases, information about model robustness and guidelines for model selection are lacking. Here, we compared five WGP models with different assumptions about the distribution of the underlying genetic effects. As contrasting model traits, we chose three highly polygenic agronomic traits and three metabolites each with a major QTL explaining 22 to 30% of the genetic variance in a panel of 289 diverse maize inbred lines genotyped with 56,110 SNPs. RESULTS: We found the five WGP models to be remarkably robust towards trait architecture, with the largest differences in prediction accuracies ranging between 0.05 and 0.14 for the same trait, most likely as a result of the high level of linkage disequilibrium prevailing in elite maize germplasm. Whereas RR-BLUP performed best for the agronomic traits, it was inferior to LASSO or elastic net for the three metabolites. We found the approach of genome partitioning of genetic variance, first applied in human genetics, useful in guiding the breeder on which model to choose if prior knowledge of the trait architecture is lacking. CONCLUSIONS: Our results suggest that in diverse germplasm of elite maize inbred lines with a high level of LD, WGP models differ only slightly in their accuracies, irrespective of the number and effects of QTL found in previous linkage or association mapping studies. However, small gains in prediction accuracies can be achieved if the WGP model is selected according to the genetic architecture of the trait. If the trait architecture is unknown, e.g., for novel traits which only recently received attention in breeding, we suggest inspecting the distribution of the genetic variance explained by each chromosome to guide model selection in WGP.

2.

A Comprehensive Genetic Approach for Improving Prediction of Skin Cancer Risk in Humans

Ana I. Vazquez Gustavo de los Campos Yann C. Klimentidis Guilherme J. M. Rosa Daniel Gianola Nengjun Yi David B. Allison 《Genetics》2012,192(4):1493-1502

Prediction of genetic risk for disease is needed for preventive and personalized medicine. Genome-wide association studies have found unprecedented numbers of variants associated with complex human traits and diseases. However, these variants explain only a small proportion of genetic risk. Mounting evidence suggests that many traits relevant to public health are affected by large numbers of small-effect genes and that prediction of genetic risk for those traits and diseases could be improved by incorporating large numbers of markers into whole-genome prediction (WGP) models. We developed a WGP model incorporating thousands of markers for prediction of skin cancer risk in humans. We also considered other ways of incorporating genetic information into prediction models, such as family history or ancestry (using principal components, PCs, of informative markers). Prediction accuracy was evaluated using the area under the receiver operating characteristic curve (AUC) estimated in a cross-validation. Incorporation of genetic information (i.e., familial relationships, PCs, or WGP) yielded a significant increase in prediction accuracy: from an AUC of 0.53 for a baseline model that accounted for nongenetic covariates to AUCs of 0.58 (pedigree), 0.62 (PCs), and 0.64 (WGP). In summary, prediction of skin cancer risk could be improved by considering genetic information and using a large number of single-nucleotide polymorphisms (SNPs) in a WGP model, which allows for the detection of patterns of genetic risk that are above and beyond those that can be captured using family history. We discuss avenues for improving prediction accuracy and speculate on the possible use of WGP to prospectively identify individuals at high risk.
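
As a rough illustration of the evaluation metric mentioned in this abstract, the sketch below computes an AUC from risk scores pooled over cross-validation folds. It is not the authors' code: the function names, the fold count, and the `fit_predict` callback are assumptions made for this example, and any risk model could be plugged in.

```python
import numpy as np

def auc_from_scores(y_true, scores):
    """AUC as the probability that a random case scores higher than a
    random control; tied pairs count as one half."""
    y_true = np.asarray(y_true, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    cases = scores[y_true]
    controls = scores[~y_true]
    # Compare every case with every control.
    diff = cases[:, None] - controls[None, :]
    wins = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return float(wins / (cases.size * controls.size))

def cross_validated_auc(X, y, fit_predict, k=5, seed=0):
    """Pool out-of-fold risk scores over k folds and compute one AUC.

    fit_predict(X_train, y_train, X_test) is a hypothetical callback that
    trains any risk model and returns one score per test row.
    """
    y = np.asarray(y)
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    scores = np.empty(len(y), dtype=float)
    for test_idx in folds:
        train_idx = np.setdiff1d(np.arange(len(y)), test_idx)
        scores[test_idx] = fit_predict(X[train_idx], y[train_idx], X[test_idx])
    return auc_from_scores(y, scores)
```

An AUC of 0.5 corresponds to random ranking of cases and controls, which is why the baseline value of 0.53 reported in the abstract indicates almost no discriminative ability.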

3.

Joint prediction of multiple quantitative traits using a Bayesian multivariate antedependence model

J Jiang Q Zhang L Ma J Li Z Wang J-F Liu 《Heredity》2015,115(1):29-36

Predicting organismal phenotypes from genotype data is important for preventive and personalized medicine as well as plant and animal breeding. Although genome-wide association studies (GWAS) for complex traits have discovered a large number of trait- and disease-associated variants, phenotype prediction based on associated variants usually has low accuracy even for a high-heritability trait, because these variants typically account for only a limited fraction of total genetic variance. In comparison with GWAS, whole-genome prediction (WGP) methods can increase prediction accuracy by making use of a huge number of variants simultaneously. Among the various statistical methods for WGP, the multiple-trait model and the antedependence model each show their respective advantages. To take advantage of both strategies within a unified framework, we proposed a novel multivariate antedependence-based method for joint prediction of multiple quantitative traits using a Bayesian algorithm that models a linear relationship between the effect vectors of each pair of adjacent markers. Through both simulation and real-data analyses, our studies demonstrated that the proposed antedependence-based multiple-trait WGP method is more accurate and robust than corresponding traditional counterparts (Bayes A and multi-trait Bayes A) under various scenarios. Our method can be readily extended to deal with missing phenotypes and resequencing data with rare variants, offering a feasible way to jointly predict phenotypes for multiple complex traits in human genetic epidemiology as well as plant and livestock breeding.

4.

Genomic prediction in CIMMYT maize and wheat breeding programs

J Crossa P Pérez J Hickey J Burgueño L Ornella J Cerón-Rojas X Zhang S Dreisigacker R Babu Y Li D Bonnett K Mathews 《Heredity》2014,112(1):48-60

Genomic selection (GS) has been implemented in animal and plant species, and is regarded as a useful tool for accelerating genetic gains. Varying levels of genomic prediction accuracy have been obtained in plants, depending on the prediction problem assessed and on several other factors, such as trait heritability, the relationship between the individuals to be predicted and those used to train the models for prediction, number of markers, sample size and genotype × environment interaction (GE). The main objective of this article is to describe the results of genomic prediction in the International Maize and Wheat Improvement Center's (CIMMYT's) maize and wheat breeding programs, from the initial assessment of the predictive ability of different models using pedigree and marker information to the present, when methods for implementing GS in practical global maize and wheat breeding programs are being investigated. Results show that pedigree (population structure) accounts for a sizeable proportion of the prediction accuracy when a global population is the prediction problem to be assessed. However, when the prediction uses unrelated populations to train the prediction equations, prediction accuracy becomes negligible. When genomic prediction includes modeling GE, an increase in prediction accuracy can be achieved by borrowing information from correlated environments. Several questions on how to incorporate GS into CIMMYT's maize and wheat programs remain unanswered and subject to further investigation, for example, prediction within and between related bi-parental crosses. Further research on the quantification of breeding value components for GS in plant breeding populations is required.

5.

Evaluation and retrospective optimization of genomic selection for yield and disease resistance in spring barley

Tyler Tiede Kevin P. Smith 《Molecular breeding : new strategies in plant improvement》2018,38(5):55

The general applicability of genomic selection (GS) to plant breeding and principles guiding its use have been established by simulation and empirical cross-validation studies. More recently, studies have demonstrated genetic gains over multiple cycles of selection in a variety of crop species. In this study, we provide additional evidence for the effectiveness of GS in an actual breeding program by demonstrating significant gains of 186.1 kg ha^-1 and -1.85 ppm for grain yield and deoxynivalenol, respectively, two unfavorably correlated quantitative traits, across 3 cycles of selection in a spring six-row barley breeding population. With its general effectiveness established, the next step is to increase the accuracy of predictions used in GS and thereby increase genetic gains. For this, we first showed that updating the training population (TP) with phenotyped lines from recent breeding cycles, specifically selected lines, had an overall positive effect on prediction accuracy. Additionally, we investigated four recently proposed algorithms that seek to optimize the composition of a TP. Overall, the optimization algorithms improved prediction accuracy when compared to a randomly selected TP subset of the same size, but which algorithm performed best depended on the trait being predicted and other factors discussed within. This retrospective investigation highlights the importance of maintaining and optimizing the TP when using GS in applied breeding to maximize prediction accuracy, thereby maximizing gain from selection and resource utilization efficiency.

6.

Evaluation of genome-wide selection efficiency in maize nested association mapping populations

Guo Z Tucker DM Lu J Kishore V Gay G 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2012,124(2):261-275

In comparison to conventional marker-assisted selection (MAS), which utilizes only a subset of genetic markers associated with a trait to predict breeding values (BVs), genome-wide selection (GWS) improves prediction accuracies by incorporating all markers into a model simultaneously. This strategy avoids the risk of missing quantitative trait loci (QTL) with small effects. Here, we evaluated the accuracy of prediction for three corn flowering traits (days to silking, days to anthesis, and anthesis-silking interval) with GWS based on cross-validation experiments using a large data set of 25 nested association mapping populations in maize (Zea mays). We found that GWS via ridge regression-best linear unbiased prediction (RR-BLUP) gave significantly higher prediction accuracies compared to MAS utilizing composite interval mapping (CIM). The CIM method may be selected over multiple linear regression to decrease over-estimations of the efficiency of GWS over a MAS strategy. The RR-BLUP method was the preferred method for estimating marker effects in GWS, with prediction accuracies comparable to or greater than BayesA and BayesB. The accuracy with RR-BLUP increased with training sample proportion, marker density, and heritability until it reached a plateau. In general, gains in accuracy with RR-BLUP over CIM increased with decreases of these factors. Compared to training sample proportion, the accuracy of prediction with RR-BLUP was relatively insensitive to marker density.
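
The idea of fitting all markers at once and scoring the model by cross-validation can be sketched as follows. This is a simplified illustration, not the procedure used in the study: the shrinkage parameter `lam` is assumed to be supplied by the user (in RR-BLUP it corresponds to the ratio of residual to marker-effect variance, usually estimated by REML), the overall mean is estimated by a plain average rather than jointly in mixed-model equations, and all function names are hypothetical.

```python
import numpy as np

def rr_blup_fit(Z, y, lam):
    """Ridge-regression marker effects for an n x p genotype matrix Z.

    Uses the dual form u = Zc' (Zc Zc' + lam I)^-1 (y - mean), which only
    needs an n x n solve and is therefore cheap when p is much larger than n.
    """
    mu = y.mean()
    z_mean = Z.mean(axis=0)
    Zc = Z - z_mean                      # centre marker codes
    n = Z.shape[0]
    alpha = np.linalg.solve(Zc @ Zc.T + lam * np.eye(n), y - mu)
    u = Zc.T @ alpha                     # one shrunken effect per marker
    return mu, z_mean, u

def rr_blup_predict(model, Z_new):
    """Predicted genetic values for new genotypes."""
    mu, z_mean, u = model
    return mu + (Z_new - z_mean) @ u

def cv_accuracy(Z, y, lam, k=5, seed=0):
    """Mean Pearson correlation between predicted and observed phenotypes
    over k cross-validation folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    accs = []
    for test in folds:
        train = np.setdiff1d(np.arange(len(y)), test)
        model = rr_blup_fit(Z[train], y[train], lam)
        pred = rr_blup_predict(model, Z[test])
        accs.append(np.corrcoef(pred, y[test])[0, 1])
    return float(np.mean(accs))
```

Varying the training fraction or subsampling the columns of `Z` in such a loop is one way to examine how accuracy responds to training sample proportion and marker density, the factors discussed in the abstract.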

7.

Empirical and deterministic accuracies of across-population genomic prediction

Yvonne CJ Wientjes Roel F Veerkamp Piter Bijma Henk Bovenhuis Chris Schrooten Mario PL Calus 《Genetics Selection Evolution》2015,47(1)

Background

Differences in linkage disequilibrium and in allele substitution effects of QTL (quantitative trait loci) may hinder genomic prediction across populations. Our objective was to develop a deterministic formula to estimate the accuracy of across-population genomic prediction, for which reference individuals and selection candidates are from different populations, and to investigate the impact of differences in allele substitution effects across populations and of the number of QTL underlying a trait on the accuracy.

Methods

A deterministic formula to estimate the accuracy of across-population genomic prediction was derived based on selection index theory. Moreover, accuracies were deterministically predicted using a formula based on population parameters and empirically calculated using simulated phenotypes and a GBLUP (genomic best linear unbiased prediction) model. Phenotypes of 1033 Holstein-Friesian, 105 Groninger White Headed and 147 Meuse-Rhine-Yssel cows were simulated by sampling 3000, 300, 30 or 3 QTL from the available high-density SNP (single nucleotide polymorphism) information of three chromosomes, assuming a correlation of 1.0, 0.8, 0.6, 0.4, or 0.2 between allele substitution effects across breeds. The simulated heritability was set to 0.95 to resemble the heritability of deregressed proofs of bulls.

Results

Accuracies estimated with the deterministic formula based on selection index theory were similar to empirical accuracies for all scenarios, while accuracies predicted with the formula based on population parameters overestimated empirical accuracies by ~25 to 30%. When the between-breed genetic correlation differed from 1, i.e. allele substitution effects differed across breeds, empirical and deterministic accuracies decreased in proportion to the genetic correlation. Using a multi-trait model, it was possible to accurately estimate the genetic correlation between the breeds based on phenotypes and high-density genotypes. The number of QTL underlying the simulated trait did not affect the accuracy.

Conclusions

The deterministic formula based on selection index theory estimated the accuracy of across-population genomic predictions well. The deterministic formula using population parameters overestimated the across-population genomic accuracy, but may still be useful because of its simplicity. Both formulas could accommodate genetic correlations between populations lower than 1. The number of QTL underlying a trait did not affect the accuracy of across-population genomic prediction using a GBLUP method.

8.

Accuracy of genomic prediction within and across populations for nematode resistance and body weight traits in sheep

《Animal : an international journal of animal bioscience》2014,8(4):520-528

Genomic prediction utilizes single nucleotide polymorphism (SNP) chip data to predict animal genetic merit. It has the advantage of potentially capturing the effects of the majority of loci that contribute to genetic variation in a trait, even when the effects of the individual loci are very small. To implement genomic prediction, marker effects are estimated with a training set, including individuals with marker genotypes and trait phenotypes; subsequently, genomic estimated breeding values (GEBV) for any genotyped individual in the population can be calculated using the estimated marker effects. In this study, we aimed to: (i) evaluate the potential of genomic prediction to predict GEBV for nematode resistance traits and BW in sheep, within and across populations; (ii) evaluate the accuracy of these predictions through within-population cross-validation; and (iii) explore the impact of population structure on the accuracy of prediction. Four data sets comprising 752 lambs from a Scottish Blackface population, 2371 from a Sarda×Lacaune backcross population, 1000 from a Martinik Black-Belly×Romane backcross population and 64 from a British Texel population were used in this study. Traits available for the analysis were faecal egg count for Nematodirus and Strongyles and BW at different ages or as average effect, depending on the population. Moreover, immunoglobulin A was also available for the Scottish Blackface population. Results show that GEBV had moderate to good within-population predictive accuracy, whereas across-population predictions had accuracies close to zero. This can be explained by our finding that in most cases the accuracy estimates were due mainly to additive genetic relatedness between animals, rather than linkage disequilibrium between SNP and quantitative trait loci. Therefore, our results suggest that genomic prediction for nematode resistance and BW may be of value in closely related animals, but that with the current SNP chip genomic predictions are unlikely to work across breeds.

9.

Integrating Crop Growth Models with Whole Genome Prediction through Approximate Bayesian Computation

Frank Technow Carlos D. Messina L. Radu Totir Mark Cooper 《PloS one》2015,10(6)

Genomic selection, enabled by whole genome prediction (WGP) methods, is revolutionizing plant breeding. Existing WGP methods have been shown to deliver accurate predictions in the most common settings, such as prediction of across-environment performance for traits with additive gene effects. However, prediction of traits with non-additive gene effects and prediction of genotype by environment interaction (G×E) continue to be challenging. Previous attempts to increase prediction accuracy for these particularly difficult tasks employed prediction methods that are purely statistical in nature. Augmenting the statistical methods with biological knowledge has been largely overlooked thus far. Crop growth models (CGMs) attempt to represent the impact of functional relationships between plant physiology and the environment in the formation of yield and similar output traits of interest. Thus, they can explain the impact of G×E and certain types of non-additive gene effects on the expressed phenotype. Approximate Bayesian computation (ABC), a novel and powerful computational procedure, allows the incorporation of CGMs directly into the estimation of whole genome marker effects in WGP. Here we provide a proof of concept study for this novel approach and demonstrate its use with synthetic data sets. We show that this novel approach can be considerably more accurate than the benchmark WGP method GBLUP in predicting performance in environments represented in the estimation set as well as in previously unobserved environments for traits determined by non-additive gene effects. We conclude that this proof of concept demonstrates that using ABC for incorporating biological knowledge in the form of CGMs into WGP is a very promising and novel approach to improving prediction accuracy for some of the most challenging scenarios in plant breeding and applied genetics.

10.

The impact of population structure on genomic prediction in stratified populations

Zhigang Guo Dominic M. Tucker Christopher J. Basten Harish Gandhi Elhan Ersoz Baohong Guo Zhanyou Xu Daolong Wang Gilles Gay 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2014,127(3):749-762

Key message

Impacts of population structure on the evaluation of genomic heritability and prediction were investigated and quantified using high-density markers in diverse panels in rice and maize.

Abstract

Population structure is an important factor affecting estimation of genomic heritability and assessment of genomic prediction in stratified populations. In this study, our first objective was to assess effects of population structure on estimations of genomic heritability using the diversity panels in rice and maize. Results indicate population structure explained 33 and 7.5% of genomic heritability for rice and maize, respectively, depending on traits, with the remaining heritability explained by within-subpopulation variation. Estimates of within-subpopulation heritability were higher than those derived from quantitative trait loci identified in genome-wide association studies, suggesting 65% improvement in genetic gains. The second objective was to evaluate effects of population structure on genomic prediction using cross-validation experiments. When population structure exists in both training and validation sets, correcting for population structure led to a significant decrease in accuracy with genomic prediction. In contrast, when prediction was limited to a specific subpopulation, population structure showed little effect on accuracy and within-subpopulation genetic variance dominated predictions. Finally, effects of genomic heritability on genomic prediction were investigated. Accuracies with genomic prediction increased with genomic heritability in both training and validation sets, with the former showing a slightly greater impact. In summary, our results suggest that the population structure contribution to genomic prediction varies based on prediction strategies, and is also affected by the genetic architectures of traits and populations. In practical breeding, these conclusions may be helpful to better understand and utilize the different genetic resources in genomic prediction.

11.

Genome-Wide Variation of Cytosine Modifications Between European and African Populations and the Implications for Complex Traits

Erika L. Moen Xu Zhang Wenbo Mu Shannon M. Delaney Claudia Wing Jennifer McQuade Jamie Myers Lucy A. Godley M. Eileen Dolan Wei Zhang 《Genetics》2013,194(4):987-996

Elucidating cytosine modification differences between human populations can enhance our understanding of ethnic specificity in complex traits. In this study, cytosine modification levels in 133 HapMap lymphoblastoid cell lines derived from individuals of European or African ancestry were profiled using the Illumina HumanMethylation450 BeadChip. Approximately 13% of the analyzed CpG sites showed differential modification between the two populations at a false discovery rate of 1%. The CpG sites with greater modification levels in individuals of European descent were enriched in the proximal regulatory regions, while those with greater levels in individuals of African descent were biased toward gene bodies. More than half of the detected population-specific cytosine modifications could be explained primarily by local genetic variation. In addition, a substantial proportion of local modification quantitative trait loci exhibited population-specific effects, suggesting that genetic epistasis and/or genotype × environment interactions could be common. Distinct correlations were observed between gene expression levels and cytosine modifications in proximal regions and gene bodies, suggesting epigenetic regulation of interindividual expression variation. Furthermore, quantitative trait loci associated with population-specific modifications can be colocalized with expression quantitative trait loci and single nucleotide polymorphisms previously identified for complex traits with known racial disparities. Our findings revealed abundant population-specific cytosine modifications and the underlying genetic basis, as well as the relatively independent contribution of genetic and epigenetic variations to population differences in gene expression.

12.

Niche–trait relationships at individual and population level in three co‐occurring passerine species

Pei-Jen L. Shaner Yin-Kai Chen Yu-Cheng Hsu 《Ecology and evolution》2021,11(12):7378

The niche variation hypothesis (NVH) predicts that populations with wider niches exhibit greater morphological variation through increased interindividual differences in both niche and morphology. In this study, we examined niche–trait relationships in three passerine species (Cyanoderma ruficeps, Sinosuthora webbiana, and Zosterops simplex). A total of 289 C. ruficeps from 7 sites, 259 S. webbiana from 8 sites, and 144 Z. simplex from 6 sites were sampled along an elevation gradient (0–2,700 m) in Taiwan from 2009 to 2017. We measured bill traits (length, width, and depth of bill) and body size traits (length of head, tarsus, and wing) of the birds, which were reduced to four principal components (bill PC1, bill PC2, body size PC1, and body size PC2). We collected feather tissues for stable carbon and nitrogen isotope analyses to quantify their isotope niche. We quantified interindividual differences in isotope space and trait space with four diversity metrics (divergence, dispersion, evenness, and uniqueness) and tested whether interindividual differences in isotope space and trait space are positively associated. We quantified population isotope niche width by Bayesian ellipse area and population morphological variation by variances of the PCs. The results showed that individual uniqueness in isotope niche and bill morphology (average closeness of individuals within the population isotope/trait space) were positively associated across three species. Furthermore, isotope niche width and bill PC1 (reflecting the size of bill) variation at population level were also positively associated across the three species, supporting the NVH. Of the three species, C. ruficeps and S. webbiana showed stronger support for the NVH than Z. simplex, possibly due to the latter having narrower elevational distribution and a more specialized, plant‐based diet. 
The diversity metrics represented different aspects of interindividual differences in niche/trait space, and for the passerines, individual uniqueness appeared to play an important role in their niche–trait dynamics.

13.

Genetic architecture of complex traits and accuracy of genomic prediction: coat colour, milk-fat percentage, and type in Holstein cattle as contrasting model traits (Cited by: 2; self-citations: 0; citations by others: 2)

Hayes BJ Pryce J Chamberlain AJ Bowman PJ Goddard ME 《PLoS genetics》2010,6(9):e1001139

Prediction of genetic merit using dense SNP genotypes can be used for estimation of breeding values for selection of livestock, crops, and forage species; for prediction of disease risk; and for forensics. The accuracy of these genomic predictions depends in part on the genetic architecture of the trait, in particular the number of loci affecting the trait and the distribution of their effects. Here we investigate the difference among three traits in distribution of effects and the consequences for the accuracy of genomic predictions. Proportion of black coat colour in Holstein cattle was used as one model complex trait. Three loci, KIT, MITF, and a locus on chromosome 8, together explain 24% of the variation of proportion of black. However, a surprisingly large number of loci of small effect are necessary to capture the remaining variation. A second trait, fat concentration in milk, had one locus of large effect and a host of loci with very small effects. Both these distributions of effects were in contrast to that for a third trait, an index of scores for a number of aspects of cow conformation ("overall type"), which had only loci of small effect. The differences in distribution of effects among the three traits were quantified by estimating the distribution of variance explained by chromosome segments containing 50 SNPs. This approach was taken to account for the imperfect linkage disequilibrium between the SNPs and the QTL affecting the traits. We also show that the accuracy of predicting genetic values is higher for traits with a proportion of large effects (proportion black and fat percentage) than for a trait with no loci of large effect (overall type), provided the method of analysis takes advantage of the distribution of loci effects.
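
The segment-based summary described in this abstract can be illustrated with a small sketch. It is only an approximation of the idea under stated assumptions, not the authors' estimation procedure: marker effects `u` are assumed to have been estimated already by some genomic prediction model, markers are assumed to be ordered along the genome, and the function names are hypothetical. Because segment genetic values can covary, the shares below are normalised by the sum of segment variances rather than by the total genetic variance.

```python
import numpy as np

def segment_variances(Z, u, seg_size=50):
    """Variance of the genetic value contributed by each run of
    seg_size consecutive SNPs (the last segment may be shorter).

    Z : n x p genotype matrix with columns in genome order
    u : length-p vector of previously estimated marker effects
    """
    p = Z.shape[1]
    return np.array([
        np.var(Z[:, start:start + seg_size] @ u[start:start + seg_size])
        for start in range(0, p, seg_size)
    ])

def variance_shares(seg_var):
    """Each segment's share of the summed segment variances."""
    return seg_var / seg_var.sum()
```

A trait with a locus of large effect would show one or a few segments with a dominant share, whereas a trait with only small-effect loci would show shares spread fairly evenly across segments.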

14.

Accuracy and bias of genomic prediction with different de-regression methods

《Animal : an international journal of animal bioscience》2018,12(6):1111-1117

Genomic selection has become increasingly important in the breeding of animals and plants. The response variable is an important factor influencing the accuracy of genomic selection. The de-regressed proof (DRP) based on the traditional estimated breeding value (EBV) is commonly used as the response variable. In the current study, simulated data from the 16th QTL-MAS Workshop and real data from Chinese Holstein cattle were used to compare the accuracy and bias of genomic prediction with two methods of calculating DRP. Our results with simulated data showed that the correlation between genomic EBV and true breeding value achieved using the Jairath method (DRP_J) was superior to that achieved using the Garrick method (DRP_G) for simulated trait 1, but the reverse was true for simulated trait 3, and the two methods performed comparably for simulated trait 2. For all three simulated traits, DRP_J yielded larger bias of genomic prediction. However, DRP_J outperformed DRP_G in both accuracy and unbiasedness for four milk production traits in Chinese Holstein. In the estimation of genomic breeding values using a genomic BLUP model, two methods for weighting the diagonal elements of the incidence matrix associated with residual error were also compared. As the proportion of genetic variance unexplained by markers increased, the accuracy of genomic prediction decreased and the bias increased. Weighting by the reliability of DRP produced accuracy comparable to the evaluation where the proportion of genetic variance unexplained by markers was considered, but with smaller bias in general.

15.

Joint-multiple family linkage analysis predicts within-family variation better than single-family analysis of the maize nested association mapping population

F Ogut Y Bian P J Bradbury J B Holland 《Heredity》2015,114(6):552-563

Quantitative trait locus (QTL) mapping has been used to dissect the genetic architecture of complex traits and predict phenotypes for marker-assisted selection. Many QTL mapping studies in plants have been limited to one biparental family population. Joint analysis of multiple biparental families offers an alternative approach to QTL mapping with a wider scope of inference. Joint-multiple population analysis should have higher power to detect QTL shared among multiple families, but may have lower power to detect rare QTL. We compared prediction ability of single-family and joint-family QTL analysis methods with fivefold cross-validation for 6 diverse traits using the maize nested association mapping population, which comprises 25 biparental recombinant inbred families. Joint-family QTL analysis had higher mean prediction abilities than single-family QTL analysis for all traits at most significance thresholds, and was always better at more stringent significance thresholds. Most robust QTL (detected in >50% of data samples) were restricted to one family and were often not detected at high frequency by joint-family analysis, implying substantial genetic heterogeneity among families for complex traits in maize. The superior predictive ability of joint-family QTL models despite important genetic differences among families suggests that joint-family models capture enough smaller-effect QTL that are shared across families to compensate for missing some rare large-effect QTL.

16.

The effect of using genealogy-based haplotypes for genomic prediction

Vahid Edriss Rohan L Fernando Guosheng Su Mogens S Lund Bernt Guldbrandtsen 《Genetics Selection Evolution》2013,45(1):5

Background

Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information.

Methods

A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method.

Results

About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers.

Conclusions

Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.

17.

Beyond missing heritability: prediction of complex traits

Makowsky R Pajewski NM Klimentidis YC Vazquez AI Duarte CW Allison DB de los Campos G 《PLoS genetics》2011,7(4):e1002051

Despite rapid advances in genomic technology, our ability to account for phenotypic variation using genetic information remains limited for many traits. This has unfortunately resulted in limited application of genetic data towards preventive and personalized medicine, one of the primary impetuses of genome-wide association studies. Recently, a large proportion of the "missing heritability" for human height was statistically explained by modeling thousands of single nucleotide polymorphisms concurrently. However, it is currently unclear how gains in explained genetic variance will translate to the prediction of yet-to-be observed phenotypes. Using data from the Framingham Heart Study, we explore the genomic prediction of human height in training and validation samples while varying the statistical approach used, the number of SNPs included in the model, the validation scheme, and the number of subjects used to train the model. In our training datasets, we are able to explain a large proportion of the variation in height (h² up to 0.83, R² up to 0.96). However, the proportion of variance accounted for in validation samples is much smaller (ranging from 0.15 to 0.36 depending on the degree of familial information used in the training dataset). While such R² values vastly exceed what has been previously reported using a reduced number of pre-selected markers (<0.10), given the heritability of the trait (~0.80), substantial room for improvement remains.
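
The gap between training and validation R² described above can be reproduced in miniature. The sketch below is not the Framingham analysis. It assumes simulated genotypes, a heritability of roughly 0.5, and a ridge regression fitted in its dual form with an arbitrary penalty `lam`. All variable names are made up for the illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
n, p, n_train = 200, 1000, 150            # more markers than subjects (assumed sizes)

# Simulated genotypes coded 0/1/2, centered per marker.
X = rng.integers(0, 3, size=(n, p)).astype(float)
X -= X.mean(axis=0)

# Assumed architecture: 50 causal markers, noise scaled so heritability is ~0.5.
beta = np.zeros(p)
beta[:50] = rng.normal(size=50)
g = X @ beta
y = g + rng.normal(scale=g.std(), size=n)

Xtr, ytr = X[:n_train], y[:n_train]
Xva, yva = X[n_train:], y[n_train:]

# Ridge regression in dual form: b_hat = X' (X X' + lam I)^-1 (y - mean).
lam = 1.0
mu = ytr.mean()
alpha = np.linalg.solve(Xtr @ Xtr.T + lam * np.eye(n_train), ytr - mu)
b_hat = Xtr.T @ alpha

def r2(obs, pred):
    """Squared correlation between observed and predicted values."""
    return np.corrcoef(obs, pred)[0, 1] ** 2

r2_train = r2(ytr, mu + Xtr @ b_hat)      # fit on the data used for estimation
r2_val = r2(yva, mu + Xva @ b_hat)        # fit on subjects the model has not seen
```

With many more markers than training subjects and a weak penalty, the model nearly interpolates the training phenotypes, so `r2_train` is close to 1 while `r2_val` stays well below it. This is the same qualitative pattern the abstract reports.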

18.

Nonlinear kernels, dominance, and envirotyping data increase the accuracy of genome-based prediction in multi-environment trials

Germano Costa-Neto Roberto Fritsche-Neto José Crossa 《Heredity》2021,126(1):92

Modern whole-genome prediction (WGP) frameworks that focus on multi-environment trials (MET) integrate large-scale genomics, phenomics, and envirotyping data. However, the more complex the statistical model, the longer the computational processing times, which do not always result in accuracy gains. We investigated the use of new kernel methods and modeling structures involving genomics and nongenomic sources of variation in two MET maize data sets. Five WGP models were considered, advancing in complexity from a main-effect additive model (A) to more complex structures, including dominance deviations (D), genotype × environment interaction (AE and DE), and the reaction-norm model using environmental covariables (W) and their interaction with A and D (AW + DW). A combination of those models built with three different kernel methods, Gaussian kernel (GK), Deep kernel (DK), and the benchmark genomic best linear-unbiased predictor (GBLUP/GB), was tested under three prediction scenarios: newly developed hybrids (CV1), sparse MET conditions (CV2), and new environments (CV0). GK and DK outperformed GB in prediction accuracy and reduction of computation time (up to ~20%) under all model–kernel scenarios. GK was more efficient in capturing the variation due to A + AE and D + DE effects and translated it into accuracy gains (up to ~85% compared with GB). DK provided more consistent predictions, even for more complex structures such as W + AW + DW. Our results suggest that DK and GK are more efficient in translating model complexity into accuracy, and more suitable for including dominance and reaction-norm effects in a biologically accurate and faster way. Subject terms: Evolution, Genomics
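
To make the contrast between the GBLUP benchmark and the Gaussian kernel concrete, the sketch below builds both relationship matrices from one marker matrix. It is a simplified illustration, not the authors' implementation. The function names, the scaling of the linear kernel by the marker count, and the bandwidth choice (`h` times the inverse median squared distance) are assumptions. The Deep kernel is not shown.

```python
import numpy as np

def linear_kernel(X):
    """GBLUP-style linear kernel: center markers, then scale by marker count (assumed scaling)."""
    Xc = X - X.mean(axis=0)
    return Xc @ Xc.T / X.shape[1]

def gaussian_kernel(X, h=1.0):
    """Gaussian kernel exp(-h * d^2 / median(d^2)) on squared Euclidean marker distances."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    d2 = np.maximum(d2, 0.0)              # guard against tiny negative rounding errors
    np.fill_diagonal(d2, 0.0)
    off_diag = d2[np.triu_indices_from(d2, k=1)]
    return np.exp(-h * d2 / np.median(off_diag))

# Simulated genotypes (assumption): 6 lines, 50 markers coded 0/1/2.
rng = np.random.default_rng(0)
X = rng.integers(0, 3, size=(6, 50)).astype(float)
G = linear_kernel(X)       # linear in the markers: additive relationships only
K = gaussian_kernel(X)     # nonlinear in the markers: similarity decays with distance
```

Either matrix can serve as the covariance structure of the genetic effect in a mixed model. The linear kernel is tied to additive marker effects, whereas the Gaussian kernel is a nonlinear function of marker distance.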

19.

Patterns of genetic, phenotypic, and acoustic variation across a chiffchaff (Phylloscopus collybita abietinus/tristis) hybrid zone

Daria Shipilina Maksym Serbyn Vladimir Ivanitskii Irina Marova Niclas Backström 《Ecology and evolution》2017,7(7):2169-2180

Characterizing patterns of evolution of genetic and phenotypic divergence between incipient species is essential to understand how evolution of reproductive isolation proceeds. Hybrid zones are excellent for studying such processes, as they provide opportunities to assess trait variation in individuals with mixed genetic background and to quantify gene flow across different genomic regions. Here, we combine plumage, song, mtDNA and whole‐genome sequence data and analyze variation across a sympatric zone between the European and the Siberian chiffchaff (Phylloscopus collybita abietinus/tristis) to study how gene exchange between the lineages affects trait variation. Our results show that chiffchaff within the sympatric region show more extensive trait variation than allopatric birds, with a large proportion of individuals exhibiting intermediate phenotypic characters. The genomic differentiation between the subspecies is lower in sympatry than in allopatry and sympatric birds have a mix of genetic ancestry indicating extensive ongoing and past gene flow. Patterns of phenotypic and genetic variation also vary between regions within the hybrid zone, potentially reflecting differences in population densities, age of secondary contact, or differences in mate recognition or mate preference. The genomic data support the presence of two distinct genetic clades corresponding to allopatric abietinus and tristis and that genetic admixture is the force underlying trait variation in the sympatric region; the previously described subspecies (“fulvescens”) from the region is therefore not likely a distinct taxon. In addition, we conclude that subspecies identification based on appearance is uncertain as an individual with an apparently distinct phenotype can have a considerable proportion of the genome composed of mixed alleles, or even a major part of the genome introgressed from the other subspecies. Our results provide insights into the dynamics of admixture across subspecies boundaries and have implications for understanding speciation processes and for the identification of specific chiffchaff individuals based on phenotypic characters.

20.

Potential and limits to unravel the genetic architecture and predict the variation of Fusarium head blight resistance in European winter wheat (Triticum aestivum L.)

Y Jiang Y Zhao B Rodemann J Plieske S Kollers V Korzun E Ebmeyer O Argillier M Hinze J Ling M S Röder M W Ganal M F Mette J C Reif 《Heredity》2015,114(3):318-326

Genome-wide mapping approaches in diverse populations are powerful tools to unravel the genetic architecture of complex traits. The main goals of our study were to investigate the potential and limits to unravel the genetic architecture and to identify the factors determining the accuracy of prediction of the genotypic variation of Fusarium head blight (FHB) resistance in wheat (Triticum aestivum L.) based on data collected with a diverse panel of 372 European varieties. The wheat lines were phenotyped in multi-location field trials for FHB resistance and genotyped with 782 simple sequence repeat (SSR) markers, and 9k and 90k single-nucleotide polymorphism (SNP) arrays. We applied genome-wide association mapping in combination with fivefold cross-validations and observed surprisingly high accuracies of prediction for marker-assisted selection based on the detected quantitative trait loci (QTLs). Using a random sample of markers not selected for marker–trait associations revealed only a slight decrease in prediction accuracy compared with marker-based selection exploiting the QTL information. The same picture was confirmed in a simulation study, suggesting that relatedness is a main driver of the accuracy of prediction in marker-assisted selection of FHB resistance. When the accuracy of prediction of three genomic selection models was contrasted for the three marker data sets, no significant differences in accuracies among marker platforms and genomic selection models were observed. Marker density impacted the accuracy of prediction only marginally. Consequently, genomic selection of FHB resistance can be implemented most cost-efficiently based on low- to medium-density SNP arrays.