首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The purpose of this study is review and evaluation of computing methods used in genomic selection for animal breeding. Commonly used models include SNP BLUP with extensions (BayesA, etc), genomic BLUP (GBLUP) and single-step GBLUP (ssGBLUP). These models are applied for genomewide association studies (GWAS), genomic prediction and parameter estimation. Solving methods include finite Cholesky decomposition possibly with a sparse implementation, and iterative Gauss–Seidel (GS) or preconditioned conjugate gradient (PCG), the last two methods possibly with iteration on data. Details are provided that can drastically decrease some computations. For SNP BLUP especially with sampling and large number of SNP, the only choice is GS with iteration on data and adjustment of residuals. If only solutions are required, PCG by iteration on data is a clear choice. A genomic relationship matrix (GRM) has limited dimensionality due to small effective population size, resulting in infinite number of generalized inverses of GRM for large genotyped populations. A specific inverse called APY requires only a small fraction of GRM, is sparse and can be computed and stored at a low cost for millions of animals. With APY inverse and PCG iteration, GBLUP and ssGBLUP can be applied to any population. Both tools can be applied to GWAS. When the system of equations is sparse but contains dense blocks, a recently developed package for sparse Cholesky decomposition and sparse inversion called YAMS has greatly improved performance over packages where such blocks were treated as sparse. With YAMS, GREML and possibly single-step GREML can be applied to populations with >50 000 genotyped animals. From a computational perspective, genomic selection is becoming a mature methodology.  相似文献   

2.
Genomic selection (GS) is a DNA-based method of selecting for quantitative traits in animal and plant breeding, and offers a potentially superior alternative to traditional breeding methods that rely on pedigree and phenotype information. Using a 60 K SNP chip with markers spaced throughout the entire chicken genome, we compared the impact of GS and traditional BLUP (best linear unbiased prediction) selection methods applied side-by-side in three different lines of egg-laying chickens. Differences were demonstrated between methods, both at the level and genomic distribution of allele frequency changes. In all three lines, the average allele frequency changes were larger with GS, 0.056 0.064 and 0.066, compared with BLUP, 0.044, 0.045 and 0.036 for lines B1, B2 and W1, respectively. With BLUP, 35 selected regions (empirical P<0.05) were identified across the three lines. With GS, 70 selected regions were identified. Empirical thresholds for local allele frequency changes were determined from gene dropping, and differed considerably between GS (0.167–0.198) and BLUP (0.105–0.126). Between lines, the genomic regions with large changes in allele frequencies showed limited overlap. Our results show that GS applies selection pressure much more locally than BLUP, resulting in larger allele frequency changes. With these results, novel insights into the nature of selection on quantitative traits have been gained and important questions regarding the long-term impact of GS are raised. The rapid changes to a part of the genetic architecture, while another part may not be selected, at least in the short term, require careful consideration, especially when selection occurs before phenotypes are observed.  相似文献   

3.
Combining different swine populations in genomic prediction can be an important tool, leading to an increased accuracy of genomic prediction using single nucleotide polymorphism (SNP) chip data compared with within-population genomic. However, the expected higher accuracy of multi-population genomic prediction has not been realized. This may be due to an inconsistent linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTL) across populations, and the weak genetic relationships across populations. In this study, we determined the impact of different genomic relationship matrices, SNP density and pre-selected variants on prediction accuracy using a combined Yorkshire pig population. Our objective was to provide useful strategies for improving the accuracy of genomic prediction within a combined population. Results showed that the accuracy of genomic best linear unbiased prediction (GBLUP) using imputed whole-genome sequencing (WGS) data in the combined population was always higher than that within populations. Furthermore, the use of imputed WGS data always resulted in a higher accuracy of GBLUP than the use of 80K chip data for the combined population. Additionally, the accuracy of GBLUP with a non-linear genomic relationship matrix was markedly increased (0.87% to 15.17% for 80K chip data, and 0.43% to 4.01% for imputed WGS data) compared with that obtained with a linear genomic relationship matrix, except for the prediction of XD population in the combined population using imputed WGS data. More importantly, the application of pre-selected variants based on fixation index (Fst) scores improved the accuracy of multi-population genomic prediction, especially for 80K chip data. For BLUP|GA (BLUP approach given the genetic architecture), the use of a linear method with an appropriate weight to build a weight-relatedness matrix led to a higher prediction accuracy compared with the use of only pre-selected SNPs for genomic evaluations, especially for the total number of piglets born. However, for the non-linear method, BLUP|GA showed only a small increase or even a decrease in prediction accuracy compared with the use of only pre-selected SNPs. Overall, the best genomic evaluation strategy for reproduction-related traits for a combined population was found to be GBLUP performed with a non-linear genomic relationship matrix using variants pre-selected from the 80K chip data based on Fst scores.  相似文献   

4.
In order to find SNPs and genes affecting shank traits, we performed a GWAS in a chicken F2 population of eight half-sib families from five hatches derived from reciprocal crosses between an Arian fast-growing line and an Urmia indigenous slow-growing chicken. A total of 308 birds were genotyped using a 60K chicken SNP chip. Shank traits including shank length and diameter were measured weekly from birth to 12 weeks of age. A generalized linear model and a compressed mixed linear model (CMLM) were applied to achieve the significant regions. The value of the average genomic inflation factor (λ statistic) of the CMLM model (0.99) indicated that the CMLM was more effective than the generalized linear model in controlling the population structure. The genes surrounding significant SNPs and their biological functions were identified from NCBI, Ensembl and UniProt databases. The results indicated that 12 SNPs at 12 different ages passed the LD-adjusted 5% Bonferroni significant threshold. Two SNPs were significant for shank length and nine SNPs were significant for shank diameter. The significant SNPs were located near to or inside 11 candidate genes. The results showed that a number of significant SNPs in the middle ages were higher than the rest. The MXRA8 gene was related to the significant SNP at week 1 that promotes proliferation of growth plate chondrocytes. A unique SNP of Gga_rs16689511 located on chicken Z chromosome within the LOC101747628 gene was related to shank length at three different ages of birds (weeks 8, 9 and 11). The significant SNPs for shank diameter were found at weeks 4 and 7 (four and five SNPs respectively). The identifications of SNPs and genes here could contribute to a better understanding of the genetic control of shank traits in chicken.  相似文献   

5.
Improving drought tolerance of crop plants is a major goal of plant breeders. In this study, we characterized biomass and drought‐related traits of 220 Medicago truncatula HapMap accessions. Characterized traits included shoot biomass, maximum leaf size, specific leaf weight, stomatal density, trichome density and shoot carbon‐13 isotope discrimination (δ13C) of well‐watered M. truncatula plants, and leaf performance in vitro under dehydration stress. Genome‐wide association analyses were carried out using the general linear model (GLM), the standard mixed linear model (MLM) and compressed MLM (CMLM) in TASSEL, which revealed significant overestimation of P‐values by CMLM. For each trait, candidate genes and chromosome regions containing SNP markers were found that are in significant association with the trait. For plant biomass, a 0.5 Mbp region on chromosome 2 harbouring a plasma membrane intrinsic protein, PIP2, was discovered that could potentially be targeted to increase dry matter yield. A protein disulfide isomerase‐like protein was found to be tightly associated with both shoot biomass and leaf size. A glutamate‐cysteine ligase and an aldehyde dehydrogenase family protein with Arabidopsis homologs strongly expressed in the guard cells were two of the top genes identified by stomata density genome‐wide association studies analysis.  相似文献   

6.

Background

The prediction accuracy of several linear genomic prediction models, which have previously been used for within-line genomic prediction, was evaluated for multi-line genomic prediction.

Methods

Compared to a conventional BLUP (best linear unbiased prediction) model using pedigree data, we evaluated the following genomic prediction models: genome-enabled BLUP (GBLUP), ridge regression BLUP (RRBLUP), principal component analysis followed by ridge regression (RRPCA), BayesC and Bayesian stochastic search variable selection. Prediction accuracy was measured as the correlation between predicted breeding values and observed phenotypes divided by the square root of the heritability. The data used concerned laying hens with phenotypes for number of eggs in the first production period and known genotypes. The hens were from two closely-related brown layer lines (B1 and B2), and a third distantly-related white layer line (W1). Lines had 1004 to 1023 training animals and 238 to 240 validation animals. Training datasets consisted of animals of either single lines, or a combination of two or all three lines, and had 30 508 to 45 974 segregating single nucleotide polymorphisms.

Results

Genomic prediction models yielded 0.13 to 0.16 higher accuracies than pedigree-based BLUP. When excluding the line itself from the training dataset, genomic predictions were generally inaccurate. Use of multiple lines marginally improved prediction accuracy for B2 but did not affect or slightly decreased prediction accuracy for B1 and W1. Differences between models were generally small except for RRPCA which gave considerably higher accuracies for B2. Correlations between genomic predictions from different methods were higher than 0.96 for W1 and higher than 0.88 for B1 and B2. The greater differences between methods for B1 and B2 were probably due to the lower accuracy of predictions for B1 (~0.45) and B2 (~0.40) compared to W1 (~0.76).

Conclusions

Multi-line genomic prediction did not affect or slightly improved prediction accuracy for closely-related lines. For distantly-related lines, multi-line genomic prediction yielded similar or slightly lower accuracies than single-line genomic prediction. Bayesian variable selection and GBLUP generally gave similar accuracies. Overall, RRPCA yielded the greatest accuracies for two lines, suggesting that using PCA helps to alleviate the “n ≪ p” problem in genomic prediction.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-014-0057-5) contains supplementary material, which is available to authorized users.  相似文献   

7.
玉米出籽率全基因组关联分析   总被引:1,自引:0,他引:1  
出籽率与玉米单穗产量密切相关,其遗传机制的解析对玉米高产育种具有重要意义.本研究利用309份玉米自交系为关联群体,利用固定和随机模型交替概率统一(FarmCPU)、压缩混合线性模型(CMLM)和多位点混合线性模型(MLMM)对2017年和2019年河南新乡原阳、周口郸城、海南三亚以及最佳线性无偏估计值(BLUE)的出籽...  相似文献   

8.
Genomic Selection (GS) is a new breeding method in which genome-wide markers are used to predict the breeding value of individuals in a breeding population. GS has been shown to improve breeding efficiency in dairy cattle and several crop plant species, and here we evaluate for the first time its efficacy for breeding inbred lines of rice. We performed a genome-wide association study (GWAS) in conjunction with five-fold GS cross-validation on a population of 363 elite breeding lines from the International Rice Research Institute''s (IRRI) irrigated rice breeding program and herein report the GS results. The population was genotyped with 73,147 markers using genotyping-by-sequencing. The training population, statistical method used to build the GS model, number of markers, and trait were varied to determine their effect on prediction accuracy. For all three traits, genomic prediction models outperformed prediction based on pedigree records alone. Prediction accuracies ranged from 0.31 and 0.34 for grain yield and plant height to 0.63 for flowering time. Analyses using subsets of the full marker set suggest that using one marker every 0.2 cM is sufficient for genomic selection in this collection of rice breeding materials. RR-BLUP was the best performing statistical method for grain yield where no large effect QTL were detected by GWAS, while for flowering time, where a single very large effect QTL was detected, the non-GS multiple linear regression method outperformed GS models. For plant height, in which four mid-sized QTL were identified by GWAS, random forest produced the most consistently accurate GS models. Our results suggest that GS, informed by GWAS interpretations of genetic architecture and population structure, could become an effective tool for increasing the efficiency of rice breeding as the costs of genotyping continue to decline.  相似文献   

9.
Along with the development of high-throughput sequencing technologies, both sample size and SNP number are increasing rapidly in genome-wide association studies (GWAS), and the associated computation is more challenging than ever. Here, we present a memory-efficient, visualization-enhanced, and parallel-accelerated R package called “rMVP” to address the need for improved GWAS computation. rMVP can 1) effectively process large GWAS data, 2) rapidly evaluate population structure, 3) efficiently estimate variance components by Efficient Mixed-Model Association eXpedited (EMMAX), Factored Spectrally Transformed Linear Mixed Models (FaST-LMM), and Haseman-Elston (HE) regression algorithms, 4) implement parallel-accelerated association tests of markers using general linear model (GLM), mixed linear model (MLM), and fixed and random model circulating probability unification (FarmCPU) methods, 5) compute fast with a globally efficient design in the GWAS processes, and 6) generate various visualizations of GWAS-related information. Accelerated by block matrix multiplication strategy and multiple threads, the association test methods embedded in rMVP are significantly faster than PLINK, GEMMA, and FarmCPU_pkg. rMVP is freely available at https://github.com/xiaolei-lab/rMVP.  相似文献   

10.
Genomic selection for marker-assisted improvement in line crosses   总被引:1,自引:0,他引:1  
Efficiency of genomic selection with low-cost genotyping in a composite line from a cross between inbred lines was evaluated for a trait with heritability 0.10 or 0.25 using a low-density marker map. With genomic selection, selection was on the sum of estimates of effects of all marker intervals across the genome, fitted either as fixed (fixed GS) or random (random GS) effects. Reponses to selection over 10 generations, starting from the F2, were compared with standard BLUP selection. Estimates of variance for each interval were assumed independent and equal. Both GS strategies outperformed BLUP selection, especially in initial generations. Random GS outperformed fixed GS in early generations and performed slightly better than fixed GS in later generations. Random GS gave higher genetic gain when the number of marker intervals was greater (180 or 10 cM intervals), whereas fixed GS gave higher genetic gain when the number of marker intervals was low (90 or 20 cM). Including interactions between generation and marker scores in the model resulted in lower genetic gains than models without interactions. When phenotypes were available only in the F2 for GS, treating marker scores as fixed effects led to considerably lower genetic gain than random GS. Benefits of GS over standard BLUP were lower with high heritability. Genomic selection resulted in greater response than MAS based on only significant marker intervals (standard MAS) by increasing the frequency of QTL with both large and small effects. The efficiency of genomic selection over standard MAS depends on stringency of the threshold used for QTL detection. In conclusion, genomic selection can be effective in composite lines using a sparse marker map.  相似文献   

11.
Litter traits are critical economic variables in the pig industry as they represent a production indicator that can serve to determine sow fertility. In this study, a genome-wide association study on litter traits, including total number born (TNB), number born alive (NBA), litter birth weight (LBW), average birth weight (ABW), and piglet uniformity (PU), was carried out on two pig breeds (Yorkshire and Landrace). A total of 3 637 pigs of both breeds were genotyped using the GeneSeek GGP Porcine 50K SNP BeadChip. A mixed linear model (MLM) and fixed and random model circulating probability unification (FarmCPU) were employed in the genome-wide association studies for litter traits using combined data from the two pig breeds and data from each breed separately. Additionally, the heritability of traits was estimated using three methods—pedigree-based best linear unbiased prediction (PBLUP), genomic best linear unbiased prediction (GBLUP), and single-step best linear unbiased prediction (ssGBLUP)—and was found to lie between 0.065 and 0.1289, 0.0478 and 0.0938, 0.0793 and 0.0935, 0.1862 and 0.2163, and 0.0327 and 0.0419 for TNB, NBA, LBW, ABW, and PU, respectively. We also compared the genomic prediction accuracies and unbiasedness for litter traits of the three BLUP models. Our results indicated that the ssGBLUP method provided higher predictive accuracies and more rational unbiasedness compared with the PBLUP and GBLUP methodologies. Furthermore, based on their possible roles, eight candidate genes (INHBA, LEPR, HDHD2, CTNND2, RNF216, HMX1, PAPPA2, and NTN1) were identified as being linked with litter traits. In the middle of the test, these genes were found to be connected with pig metabolism and ovulation rate. Our results provide the insights into the genetic architecture of litter traits in pigs, and the potential single nucleotide polymorphisms (SNPs) and candidate genes identified may benefit economic profits in pig-breeding industry and contribute to improve litter traits.  相似文献   

12.
Selective breeding is a common and effective approach for genetic improvement of aquaculture stocks with parental selection as the key factor. Genomic selection (GS) has been proposed as a promising tool to facilitate selective breeding. Here, we evaluated the predictability of four GS methods in Zhikong scallop (Chlamys farreri) through real dataset analyses of four economical traits (e.g., shell length, shell height, shell width, and whole weight). Our analysis revealed that different GS models exhibited variable performance in prediction accuracy depending on genetic and statistical factors, but non-parametric method, including reproducing kernel Hilbert spaces regression (RKHS) and sparse neural networks (SNN), generally outperformed parametric linear method, such as genomic best linear unbiased prediction (GBLUP) and BayesB. Furthermore, we demonstrated that the predictability relied mainly on the heritability regardless of GS methods. The size of training population and marker density also had considerable effects on the predictive performance. In practice, increasing the training population size could better improve the genomic prediction than raising the marker density. This study is the first to apply non-linear model and neural networks for GS in scallop and should be valuable to help develop strategies for aquaculture breeding programs.  相似文献   

13.
In the era of big data, univariate models have widely been used as a workhorse tool for quickly producing marginal estimators; and this is true even when in a high-dimensional dense setting, in which many features are “true,” but weak signals. Genome-wide association studies (GWAS) epitomize this type of setting. Although the GWAS marginal estimator is popular, it has long been criticized for ignoring the correlation structure of genetic variants (i.e., the linkage disequilibrium [LD] pattern). In this paper, we study the effects of LD pattern on the GWAS marginal estimator and investigate whether or not additionally accounting for the LD can improve the prediction accuracy of complex traits. We consider a general high-dimensional dense setting for GWAS and study a class of ridge-type estimators, including the popular marginal estimator and the best linear unbiased prediction (BLUP) estimator as two special cases. We show that the performance of GWAS marginal estimator depends on the LD pattern through the first three moments of its eigenvalue distribution. Furthermore, we uncover that the relative performance of GWAS marginal and BLUP estimators highly depends on the ratio of GWAS sample size over the number of genetic variants. Particularly, our finding reveals that the marginal estimator can easily become near-optimal within this class when the sample size is relatively small, even though it ignores the LD pattern. On the other hand, BLUP estimator has substantially better performance than the marginal estimator as the sample size increases toward the number of genetic variants, which is typically in millions. Therefore, adjusting for the LD (such as in the BLUP) is most needed when GWAS sample size is large. We illustrate the importance of our results by using the simulated data and real GWAS.  相似文献   

14.
Cuprizone affects the liver of treated mice in a random manner, causing no appreciable change in some cases and inducing the formation of megamito-chondria with altered properties in others. Lack of a full appreciation of this variability may be at the origin of some discrepancies in published work dealing with the properties of cuprizone mouse liver mitochondria (CMLM). CMLM from fully affected livers were remarkably labile and difficult to isolate in a coupled state by homogenization and centrifugation techniques. The integral respiratory chain proteins of CMLM were functionally normal, with the exception of succinic dehydrogenase which showed considerable inhibition. Coupled morphological and functional analysis provided evidence that these properties were independent of CMLM size, a matter which had remained doubtful thus far and bears on the validity of literature reports.Abbreviations MLM mouse liver mitochondria - CMLM cuprizone mouse liver mitochondria - PMS phenazine methosulfate - Mops 4-morphoinopropanesulfonic acid - TMPD tetramethyl-p-phenylenediamine - AA Antimycin A - RCR respiratory control ratio - BSA bovine serum albumin  相似文献   

15.
We compared the accuracies of four genomic-selection prediction methods as affected by marker density, level of linkage disequilibrium (LD), quantitative trait locus (QTL) number, sample size, and level of replication in populations generated from multiple inbred lines. Marker data on 42 two-row spring barley inbred lines were used to simulate high and low LD populations from multiple inbred line crosses: the first included many small full-sib families and the second was derived from five generations of random mating. True breeding values (TBV) were simulated on the basis of 20 or 80 additive QTL. Methods used to derive genomic estimated breeding values (GEBV) were random regression best linear unbiased prediction (RR–BLUP), Bayes-B, a Bayesian shrinkage regression method, and BLUP from a mixed model analysis using a relationship matrix calculated from marker data. Using the best methods, accuracies of GEBV were comparable to accuracies from phenotype for predicting TBV without requiring the time and expense of field evaluation. We identified a trade-off between a method's ability to capture marker-QTL LD vs. marker-based relatedness of individuals. The Bayesian shrinkage regression method primarily captured LD, the BLUP methods captured relationships, while Bayes-B captured both. Under most of the study scenarios, mixed-model analysis using a marker-derived relationship matrix (BLUP) was more accurate than methods that directly estimated marker effects, suggesting that relationship information was more valuable than LD information. When markers were in strong LD with large-effect QTL, or when predictions were made on individuals several generations removed from the training data set, however, the ranking of method performance was reversed and BLUP had the lowest accuracy.  相似文献   

16.
Several methods have been used for genome-enabled prediction (or genomic selection) of complex traits, for example, multiple regression models describing a target trait with a linear function of a set of genetic markers. Genomic selection studies have been focused mostly on single-trait analyses. However, most profitability traits are genetically correlated, and an increase in prediction accuracy of genomic breeding values for genetically correlated traits is expected when using multiple-trait models. Thus, this study was carried out to assess the accuracy of genomic prediction for carcass and meat quality traits in Nelore cattle, using single- and multiple-trait approaches. The study considered 15 780, 15 784, 15 742 and 526 records of rib eye area (REA, cm2), back fat thickness (BF, mm), rump fat (RF, mm) and Warner–Bratzler shear force (WBSF, kg), respectively, in Nelore cattle, from the Nelore Brazil Breeding Program. Animals were genotyped with a low-density single nucleotide polymorphism (SNP) panel and subsequently imputed to arrays with 54 and 777 k SNPs. Four Bayesian specifications of genomic regression models, namely, Bayes A, Bayes B, Bayes Cπ and Bayesian Ridge Regression; blending methods, BLUP; and single-step genomic best linear unbiased prediction (ssGBLUP) methods were compared in terms of prediction accuracy using a fivefold cross-validation. Estimates of heritability ranged from 0.20 to 0.35 and from 0.21 to 0.46 for RF and WBSF on single- and multiple-trait analyses, respectively. Prediction accuracies for REA, BF, RF and WBSF were all similar using the different specifications of regression models. In addition, this study has shown the impact of genomic information upon genetic evaluations in beef cattle using the multiple-trait model, which was also advantageous compared to the single-trait model because it accounted for the selection process using multiple traits at the same time. The advantage of multi-trait analyses is attributed to the consideration of correlations and genetic influences between the traits, in addition to the non-random association of alleles.  相似文献   

17.

Key Message

Genomic prediction using the Brassica 60 k genotyping array is efficient in oilseed rape hybrids. Prediction accuracy is more dependent on trait complexity than on the prediction model.

Abstract

In oilseed rape breeding programs, performance prediction of parental combinations is of fundamental importance. Due to the phenomenon of heterosis, per se performance is not a reliable indicator for F1-hybrid performance, and selection of well-paired parents requires the testing of large quantities of hybrid combinations in extensive field trials. However, the number of potential hybrids, in general, dramatically exceeds breeding capacity and budget. Integration of genomic selection (GS) could substantially increase the number of potential combinations that can be evaluated. GS models can be used to predict the performance of untested individuals based only on their genotypic profiles, using marker effects previously predicted in a training population. This allows for a preselection of promising genotypes, enabling a more efficient allocation of resources. In this study, we evaluated the usefulness of the Illumina Brassica 60 k SNP array for genomic prediction and compared three alternative approaches based on a homoscedastic ridge regression BLUP and three Bayesian prediction models that considered general and specific combining ability (GCA and SCA, respectively). A total of 448 hybrids were produced in a commercial breeding program from unbalanced crosses between 220 paternal doubled haploid lines and five male-sterile testers. Predictive ability was evaluated for seven agronomic traits. We demonstrate that the Brassica 60 k genotyping array is an adequate and highly valuable platform to implement genomic prediction of hybrid performance in oilseed rape. Furthermore, we present first insights into the application of established statistical models for prediction of important agronomical traits with contrasting patterns of polygenic control.
  相似文献   

18.
Nowadays, genome-wide association studies (GWAS) and genomic selection (GS) methods which use genome-wide marker data for phenotype prediction are of much potential interest in plant breeding. However, to our knowledge, no studies have been performed yet on the predictive ability of these methods for structured traits when using training populations with high levels of genetic diversity. Such an example of a highly heterozygous, perennial species is grapevine. The present study compares the accuracy of models based on GWAS or GS alone, or in combination, for predicting simple or complex traits, linked or not with population structure. In order to explore the relevance of these methods in this context, we performed simulations using approx 90,000 SNPs on a population of 3,000 individuals structured into three groups and corresponding to published diversity grapevine data. To estimate the parameters of the prediction models, we defined four training populations of 1,000 individuals, corresponding to these three groups and a core collection. Finally, to estimate the accuracy of the models, we also simulated four breeding populations of 200 individuals. Although prediction accuracy was low when breeding populations were too distant from the training populations, high accuracy levels were obtained using the sole core-collection as training population. The highest prediction accuracy was obtained (up to 0.9) using the combined GWAS-GS model. We thus recommend using the combined prediction model and a core-collection as training population for grapevine breeding or for other important economic crops with the same characteristics.  相似文献   

19.
Genome-wide association studies(GWAS) have identified thousands of genomic loci associated with complex diseases and traits, including cancer. The vast majority of common traitassociated variants identified via GWAS fall in non-coding regions of the genome, posing a challenge in elucidating the causal variants, genes, and mechanisms involved. Expression quantitative trait locus(e QTL) and other molecular QTL studies have been valuable resources in identifying candidate causal genes from GWAS loc...  相似文献   

20.
《Genomics》2021,113(3):1396-1406
Rice is one of the most important cereal crops, providing the daily dietary intake for approximately 50% of the global human population. Here, we re-sequenced 259 rice accessions, generating 1371.65 Gb of raw data. Furthermore, we performed genome-wide association studies (GWAS) on 13 agronomic traits using 2.8 million single nucleotide polymorphisms (SNPs) characterized in 259 rice accessions. Phenotypic data and best linear unbiased prediction (BLUP) values of each of the 13 traits over two years of each trait were used for the GWAS. The results showed that 816 SNP signals were significantly associated with the 13 agronomic traits. Then we detected candidate genes related to target traits within 200 kb upstream and downstream of the associated SNP loci, based on linkage disequilibrium (LD) blocks in the whole rice genome. These candidate genes were further identified through haplotype block constructions. This comprehensive study provides a timely and important genomic resource for breeding high yielding rice cultivars.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号