首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Single-step genomic BLUP (ssGBLUP) has been widely used in genomic evaluation due to relatively higher prediction accuracy and simplicity of use. The prediction accuracy from ssGBLUP depends on the amount of information available concerning both genotype and phenotype. This study investigated how information on genotype and phenotype that had been acquired from previous generations influences the prediction accuracy of ssGBLUP, and thus we sought an optimal balance about genotypic and phenotypic information to achieve a cost-effective and computationally efficient genomic evaluation. We generated two genetically correlated traits (h2 = 0.35 for trait A, h2 = 0.10 for trait B and genetic correlation 0.20) as well as two distinct populations mimicking purebred swine. Phenotypic and genotypic information in different numbers of previous generations and different genotyping rates for each litter were set to generate different datasets. Prediction accuracy was evaluated by correlating genomic estimated breeding values with true breeding values for genotyped animals in the last generation. The results revealed a negligible impact of previous generations that lacked genotyped animals on the prediction accuracy. Phenotypic and genotypic data, including the most recent three to four generations with a genotyping rate of 40% or 50% for each litter, could lead to asymptotic maximum prediction accuracy for genotyped animals in the last generation. Single-step genomic best linear unbiased prediction yielded an optimal balance about genotypic and phenotypic information to ensure a cost-effective and computationally efficient genomic evaluation of populations of polytocous animals such as purebred pigs.  相似文献   

2.
Monitoring the rate of change in inbreeding and genetic diversity within a population is important to guide breeding programmes. Such interest stems from the impact of loss in genetic diversity on sustainable genetic gain but also the impact on performance (i.e. inbreeding depression). The objective of the present study was to evaluate trends in inbreeding and genetic diversity in 43 066 Belclare, 120 753 Charollais, 22 652 Galway, 78 925 Suffolk, 187 395 Texel, and 19 821 Vendeen purebred sheep. The effective population size for each of the six breeds was between 116.0 (Belclare population) and 314.8 (Charollais population). The Charollais population was the most genetically diverse with the greatest number of effective founders, effective ancestors, and effective founder genomes; conversely, the Belclare was the least genetically diverse population with the fewest number of effective founders, effective ancestors, and effective founder genomes for each of the six breeds investigated. Overall, the effective population sizes and the total genetic diversity within each of the six breeds were above the minimum thresholds generally considered to be required for the long-term viability of a population.  相似文献   

3.
Genomic prediction utilizes single nucleotide polymorphism (SNP) chip data to predict animal genetic merit. It has the advantage of potentially capturing the effects of the majority of loci that contribute to genetic variation in a trait, even when the effects of the individual loci are very small. To implement genomic prediction, marker effects are estimated with a training set, including individuals with marker genotypes and trait phenotypes; subsequently, genomic estimated breeding values (GEBV) for any genotyped individual in the population can be calculated using the estimated marker effects. In this study, we aimed to: (i) evaluate the potential of genomic prediction to predict GEBV for nematode resistance traits and BW in sheep, within and across populations; (ii) evaluate the accuracy of these predictions through within-population cross-validation; and (iii) explore the impact of population structure on the accuracy of prediction. Four data sets comprising 752 lambs from a Scottish Blackface population, 2371 from a Sarda×Lacaune backcross population, 1000 from a Martinik Black-Belly×Romane backcross population and 64 from a British Texel population were used in this study. Traits available for the analysis were faecal egg count for Nematodirus and Strongyles and BW at different ages or as average effect, depending on the population. Moreover, immunoglobulin A was also available for the Scottish Blackface population. Results show that GEBV had moderate to good within-population predictive accuracy, whereas across-population predictions had accuracies close to zero. This can be explained by our finding that in most cases the accuracy estimates were mostly because of additive genetic relatedness between animals, rather than linkage disequilibrium between SNP and quantitative trait loci. Therefore, our results suggest that genomic prediction for nematode resistance and BW may be of value in closely related animals, but that with the current SNP chip genomic predictions are unlikely to work across breeds.  相似文献   

4.

Key message

We propose a criterion to predict genomic selection efficiency for structured populations. This criterion is useful to define optimal calibration set and to estimate prediction reliability for multiparental populations.

Abstract

Genomic selection refers to the use of genotypic information for predicting the performance of selection candidates. It has been shown that prediction accuracy depends on various parameters including the composition of the calibration set (CS). Assessing the level of accuracy of a given prediction scenario is of highest importance because it can be used to optimize CS sampling before collecting phenotypes, and once the breeding values are predicted it informs the breeders about the reliability of these predictions. Different criteria were proposed to optimize CS sampling in highly diverse panels, which can be useful to screen collections of genotypes. But plant breeders often work on structured material such as biparental or multiparental populations, for which these criteria are less adapted. We derived from the generalized coefficient of determination (CD) theory different criteria to optimize CS sampling and to assess the reliability associated to predictions in structured populations. These criteria were evaluated on two nested association mapping (NAM) populations and two highly diverse panels of maize. They were efficient to sample optimized CS in most situations. They could also estimate at least partly the reliability associated to predictions between NAM families, but they could not estimate differences in the reliability associated to the predictions of NAM families using the highly diverse panels as calibration sets. We illustrated that the CD criteria could be adapted to various prediction scenarios including inter and intra-family predictions, resulting in higher prediction accuracies.
  相似文献   

5.
The Ascanian multi-foetus and pure-bred Karakul sheep reared in the steppe region of the Ukraine are characterized by five-allelic status of transferrin and by diallelic serum arylesterase and alkaline phosphatase. Besides the basic HbA and HbB alleles the rare HbC type has been revealed in the multi-foetus Karakul. Reliable differences in concentrations of the transferrin and haemoglobin alleles between the multi-foetus Karakul population and that of the pure-bred Karakul have been found.  相似文献   

6.
7.

Background

In crossbreeding programs, genomic selection offers the opportunity to make efficient use of information on crossbred (CB) individuals in the selection of purebred (PB) candidates. In such programs, reference populations often contain genotyped PB animals, although the breeding objective is usually more focused on CB performance. The question is what would be the benefit of including a larger proportion of CB individuals in the reference population.

Methods

In a deterministic simulation study, we evaluated the benefit of including various proportions of CB animals in a reference population for genomic selection of PB animals in a crossbreeding program. We used a pig breeding scheme with selection for a moderately heritable trait and a size of 6000 for the reference population.

Results

Applying genomic selection to improve the performance of CB individuals, with a genetic correlation between PB and CB performance (rPC) of 0.7, selection accuracy of PB candidates increased from 0.49 to 0.52 if the reference population consisted of PB individuals, it increased to 0.55 if the reference population consisted of the same number of CB individuals, and to 0.60 if the size of the CB reference population was twice that of the reference population for each PB line. The advantage of using CB rather than PB individuals increased linearly with the proportion of CB individuals in the reference population. This advantage disappeared quickly if rPC was higher or if the breeding objective put some emphasis on PB performance. The benefit of adding CB individuals to an existing PB reference population was limited for high rPC.

Conclusions

Using CB rather than PB individuals in a reference population for genomic selection can provide substantial advantages, but only when correlations between PB and CB performances are not high and PB performance is not part of the breeding objective.  相似文献   

8.

Background

The purpose of this work was to study the impact of both the size of genomic reference populations and the inclusion of a residual polygenic effect on dairy cattle genetic evaluations enhanced with genomic information.

Methods

Direct genomic values were estimated for German Holstein cattle with a genomic BLUP model including a residual polygenic effect. A total of 17,429 genotyped Holstein bulls were evaluated using the phenotypes of 44 traits. The Interbull genomic validation test was implemented to investigate how the inclusion of a residual polygenic effect impacted genomic estimated breeding values.

Results

As the number of reference bulls increased, both the variance of the estimates of single nucleotide polymorphism effects and the reliability of the direct genomic values of selection candidates increased. Fitting a residual polygenic effect in the model resulted in less biased genome-enhanced breeding values and decreased the correlation between direct genomic values and estimated breeding values of sires in the reference population.

Conclusions

Genetic evaluation of dairy cattle enhanced with genomic information is highly effective in increasing reliability, as well as using large genomic reference populations. We found that fitting a residual polygenic effect reduced the bias in genome-enhanced breeding values, decreased the correlation between direct genomic values and sire''s estimated breeding values and made genome-enhanced breeding values more consistent in mean and variance as is the case for pedigree-based estimated breeding values.  相似文献   

9.
A study was conducted on 851 cows (Gir cows and their crossbreeds) between 1971 and 1984 at the Livestock Farm of the All India Coordinated Research Project on Cattle in Jabalpur, India. The cows were impregnated with frozen or fresh semen from 90 bulls and a total of 2699 pregnancies resulted. An overall incidence of prenatal mortality was recorded at 4.89%. A significantly higher (P / 0.01) incidence of prenatal mortality was encountered in three breed crosses in comparison with Gir cows and their first generation crossbreeds. Parity of dam had no effect on prenatal mortality. However, effects of season and individual sires on prenatal mortality were documented. It was observed that reproductive performance was improved significantly after prenatal mortality.  相似文献   

10.
11.

Background

Genotype imputation is commonly used as an initial step in genomic selection since the accuracy of genomic selection does not decline if accurately imputed genotypes are used instead of actual genotypes but for a lower cost. Performance of imputation has rarely been investigated in crossbred animals and, in particular, in pigs. The extent and pattern of linkage disequilibrium differ in crossbred versus purebred animals, which may impact the performance of imputation. In this study, first we compared different scenarios of imputation from 5 K to 8 K single nucleotide polymorphisms (SNPs) in genotyped Danish Landrace and Yorkshire and crossbred Landrace-Yorkshire datasets and, second, we compared imputation from 8 K to 60 K SNPs in genotyped purebred and simulated crossbred datasets. All imputations were done using software Beagle version 3.3.2. Then, we investigated the reasons that could explain the differences observed.

Results

Genotype imputation performs as well in crossbred animals as in purebred animals when both parental breeds are included in the reference population. When the size of the reference population is very large, it is not necessary to use a reference population that combines the two breeds to impute the genotypes of purebred animals because a within-breed reference population can provide a very high level of imputation accuracy (correct rate ≥ 0.99, correlation ≥ 0.95). However, to ensure that similar imputation accuracies are obtained for crossbred animals, a reference population that combines both parental purebred animals is required. Imputation accuracies are higher when a larger proportion of haplotypes are shared between the reference population and the validation (imputed) populations.

Conclusions

The results from both real data and pedigree-based simulated data demonstrate that genotype imputation from low-density panels to medium-density panels is highly accurate in both purebred and crossbred pigs. In crossbred pigs, combining the parental purebred animals in the reference population is necessary to obtain high imputation accuracy.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-015-0134-4) contains supplementary material, which is available to authorized users.  相似文献   

12.

Background

Genomic prediction faces two main statistical problems: multicollinearity and n ≪ p (many fewer observations than predictor variables). Principal component (PC) analysis is a multivariate statistical method that is often used to address these problems. The objective of this study was to compare the performance of PC regression (PCR) for genomic prediction with that of a commonly used REML model with a genomic relationship matrix (GREML) and to investigate the full potential of PCR for genomic prediction.

Methods

The PCR model used either a common or a semi-supervised approach, where PC were selected based either on their eigenvalues (i.e. proportion of variance explained by SNP (single nucleotide polymorphism) genotypes) or on their association with phenotypic variance in the reference population (i.e. the regression sum of squares contribution). Cross-validation within the reference population was used to select the optimum PCR model that minimizes mean squared error. Pre-corrected average daily milk, fat and protein yields of 1609 first lactation Holstein heifers, from Ireland, UK, the Netherlands and Sweden, which were genotyped with 50 k SNPs, were analysed. Each testing subset included animals from only one country, or from only one selection line for the UK.

Results

In general, accuracies of GREML and PCR were similar but GREML slightly outperformed PCR. Inclusion of genotyping information of validation animals into model training (semi-supervised PCR), did not result in more accurate genomic predictions. The highest achievable PCR accuracies were obtained across a wide range of numbers of PC fitted in the regression (from one to more than 1000), across test populations and traits. Using cross-validation within the reference population to derive the number of PC, yielded substantially lower accuracies than the highest achievable accuracies obtained across all possible numbers of PC.

Conclusions

On average, PCR performed only slightly less well than GREML. When the optimal number of PC was determined based on realized accuracy in the testing population, PCR showed a higher potential in terms of achievable accuracy that was not capitalized when PC selection was based on cross-validation. A standard approach for selecting the optimal set of PC in PCR remains a challenge.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-014-0060-x) contains supplementary material, which is available to authorized users.  相似文献   

13.
The aim of this study was to analyze milk protein composition in purebred and crossbred dairy cattle and estimate the effects of individual sources of variation on the investigated traits. Milk samples were collected from 505 cows from three commercial farms located in Northern Italy, some of which had originated from crossbreeding programs, although most were purebred Holsteins (HO). The basic crossbreeding scheme was a three-breed rotational system using Swedish Red (SR) semen on HO cows (SR×HO), Montbeliarde (MO) semen on SR×HO cows (MO×(SR×HO)) and HO semen again on MO×(SR×HO) cows. A smaller number of purebred HO from each of the herds were mated inverting the breed order (MO×HO and SR×(MO×HO)) or using Brown Swiss (BS) bulls (BS×HO) then MO bulls (MO×(BS×HO)). Milk samples were analyzed by reverse-phase HPLC to obtain protein fraction amounts (g/l) and proportions (% of total true protein). Traits were analyzed using a linear model, which included the fixed effects of herd-test-day (HTD), parity, days in milk and breed combination. Results showed that milk protein fractions were influenced by HTD, stage of lactation, parity and breed combination. The increase in protein concentration during lactation was due in particular to β-casein (β-CN), αS1-CN and β-lactoglobulin (β-LG). The higher protein content of primiparous milk was mainly due to higher concentrations of all casein fractions. The milk from crossbred cows had higher contents and proportions of κ-CN and α-lactalbumin (α-LA), lower proportions of β-LG and greater proportion of caseins/smaller in whey proteins on milk true protein than purebred HO. The three-way crossbreds differed from two-way crossbreds only in having greater proportions of α-LA in their milk. Of the three-way crossbreds, the SR sired cows yielded milk with a smaller content and proportion of β-LG than the MO sired cows, and, consequently, a higher proportion of caseins than whey proteins. Results from this study support the feasibility of using crossbreeding programs to alter milk protein profiles with the aim of improving milk quality and cheese-making properties.  相似文献   

14.

Background

Size of the reference population and reliability of phenotypes are crucial factors influencing the reliability of genomic predictions. It is therefore useful to combine closely related populations. Increased accuracies of genomic predictions depend on the number of individuals added to the reference population, the reliability of their phenotypes, and the relatedness of the populations that are combined.

Methods

This paper assesses the increase in reliability achieved when combining four Holstein reference populations of 4000 bulls each, from European breeding organizations, i.e. UNCEIA (France), VikingGenetics (Denmark, Sweden, Finland), DHV-VIT (Germany) and CRV (The Netherlands, Flanders). Each partner validated its own bulls using their national reference data and the combined data, respectively.

Results

Combining the data significantly increased the reliability of genomic predictions for bulls in all four populations. Reliabilities increased by 10%, compared to reliabilities obtained with national reference populations alone, when they were averaged over countries and the traits evaluated. For different traits and countries, the increase in reliability ranged from 2% to 19%.

Conclusions

Genomic selection programs benefit greatly from combining data from several closely related populations into a single large reference population.  相似文献   

15.
《Genomics》2021,113(3):1407-1415
Genome-wide pattern of runs of homozygosity (ROH) across ovine genome can provide a useful resource for studying diversity and demography history in sheep. We analyzed 50 k SNPs chip data of 2536 animals to identify pattern, distribution and level of ROHs in 68 global sheep populations. A total of 60,301 ROHs were detected in all breeds. The majority of the detected ROHs were <16 Mb and the average total number of ROHs per individual was 23.8 ± 13.8. The ROHs greater than 1 Mb covered on average 8.2% of the sheep autosomes, 1% of which was related to the ROHs with 1–4 Mb of length. The mean sum of ROH length in two-thirds of the populations was less than 250 Mb ranging from 21.7 to near 570 Mb. The level of genomic inbreeding was relatively low. The average of the inbreeding coefficients based on ROH (FROH) was 0.09 ± 0.05. It was rising in a stepwise manner with distance from Southwest Asia and maximum values were detected in North European breeds. A total of 465 ROH hotspots were detected in 25 different autosomes which partially surrounding 257 Refseq genes across the genome. Most of the detected genes were related to growth, body weight, meat production and quality, wool production and pigmentation. In conclusion, our analysis showed that the sheep genome, compared with other livestock species such as cattle and pig, displays low levels of homozygosity and appropriate genetic diversity for selection response and genetic merit gain.  相似文献   

16.
17.

Key message

Impacts of population structure on the evaluation of genomic heritability and prediction were investigated and quantified using high-density markers in diverse panels in rice and maize.

Abstract

Population structure is an important factor affecting estimation of genomic heritability and assessment of genomic prediction in stratified populations. In this study, our first objective was to assess effects of population structure on estimations of genomic heritability using the diversity panels in rice and maize. Results indicate population structure explained 33 and 7.5 % of genomic heritability for rice and maize, respectively, depending on traits, with the remaining heritability explained by within-subpopulation variation. Estimates of within-subpopulation heritability were higher than that derived from quantitative trait loci identified in genome-wide association studies, suggesting 65 % improvement in genetic gains. The second objective was to evaluate effects of population structure on genomic prediction using cross-validation experiments. When population structure exists in both training and validation sets, correcting for population structure led to a significant decrease in accuracy with genomic prediction. In contrast, when prediction was limited to a specific subpopulation, population structure showed little effect on accuracy and within-subpopulation genetic variance dominated predictions. Finally, effects of genomic heritability on genomic prediction were investigated. Accuracies with genomic prediction increased with genomic heritability in both training and validation sets, with the former showing a slightly greater impact. In summary, our results suggest that the population structure contribution to genomic prediction varies based on prediction strategies, and is also affected by the genetic architectures of traits and populations. In practical breeding, these conclusions may be helpful to better understand and utilize the different genetic resources in genomic prediction.  相似文献   

18.

Key message

Genomic prediction accuracy within a large panel was found to be substantially higher than that previously observed in smaller populations, and also higher than QTL-based prediction.

Abstract

In recent years, genomic selection for wheat breeding has been widely studied, but this has typically been restricted to population sizes under 1000 individuals. To assess its efficacy in germplasm representative of commercial breeding programmes, we used a panel of 10,375 Australian wheat breeding lines to investigate the accuracy of genomic prediction for grain yield, physical grain quality and other physiological traits. To achieve this, the complete panel was phenotyped in a dedicated field trial and genotyped using a custom AxiomTM Affymetrix SNP array. A high-quality consensus map was also constructed, allowing the linkage disequilibrium present in the germplasm to be investigated. Using the complete SNP array, genomic prediction accuracies were found to be substantially higher than those previously observed in smaller populations and also more accurate compared to prediction approaches using a finite number of selected quantitative trait loci. Multi-trait genetic correlations were also assessed at an additive and residual genetic level, identifying a negative genetic correlation between grain yield and protein as well as a positive genetic correlation between grain size and test weight.
  相似文献   

19.
Garcia M  Edqvist LE 《Theriogenology》1990,33(5):1091-1103
Five experiments were undertaken to investigate variation in progesterone concentrations as related to the collection and handling of samples of milk and blood, to determine reference values for progesterone and to evaluate clinical findings in relation to progesterone data from pure- and crossbred Zebu cattle reared in the Peruvian tropics. Whole-milk progesterone concentrations obtained from 12 Holstein x Nellore pregnant cows at hourly intervals over a 24-h period were highest immediately after milking; this peak was followed by a sharp drop over the next 3 h. Milk-fat content from 28 Brown Swiss x Nellore pregnant cows increased from 2.4% before milking to 6.7% afterwards (P < 0.05), whereas progesterone concentrations in whole milk increased from 18.4 to 59.5 nmol/1 (P < 0.05). Progesterone concentrations in fat-free milk were stable, with the exception of the fore-milk sample, which was lower than subsequent samples collected during milking (P < 0.05). Blood samples collected from 23 purebred, pregnant Nellore cows, were divided into four aliquots, and plasma and serum were harvested periodically over the next 120 h of storage at +4 degrees C, or in the shade at ambient air temperatures. The results indicate that blood samples can be stored unseparated at both temperatures studied for up to 3 h without severe loss of progesterone. Milk samples collected at different intervals during the luteal phase of the estrous cycle and during early and mid-pregnancy from crossbred cows showed no significant differences in progesterone concentrations between Days 9 to 13 of the cycle and Days 9 to 13 of gestation. Progesterone levels during advanced gestation were higher (P < 0.05). Out of 2,607 clinical examinations, the level of agreement between palpatory findings and progesterone determinations was 95.6 and 81.9% in diagnosing non-luteal and luteal structures, respectively. Significant differences in the level of agreement between palpators were found (P < 0.01). It is concluded that milk samples, preferably composite milk or strippings, must be consistently collected at the same stage of milking, and that centrifugation of blood samples should be done as soon as possible and not later than 3 h after collection.  相似文献   

20.

Background

Marker-assisted selection (MAS) and genomic selection (GS) based on genome-wide marker data provide powerful tools to predict the genotypic value of selection material in plant breeding. However, case-to-case optimization of these approaches is required to achieve maximum accuracy of prediction with reasonable input.

Results

Based on extended field evaluation data for grain yield, plant height, starch content and total pentosan content of elite hybrid rye derived from testcrosses involving two bi-parental populations that were genotyped with 1048 molecular markers, we compared the accuracy of prediction of MAS and GS in a cross-validation approach. MAS delivered generally lower and in addition potentially over-estimated accuracies of prediction than GS by ridge regression best linear unbiased prediction (RR-BLUP). The grade of relatedness of the plant material included in the estimation and test sets clearly affected the accuracy of prediction of GS. Within each of the two bi-parental populations, accuracies differed depending on the relatedness of the respective parental lines. Across populations, accuracy increased when both populations contributed to estimation and test set. In contrast, accuracy of prediction based on an estimation set from one population to a test set from the other population was low despite that the two bi-parental segregating populations under scrutiny shared one parental line. Limiting the number of locations or years in field testing reduced the accuracy of prediction of GS equally, supporting the view that to establish robust GS calibration models a sufficient number of test locations is of similar importance as extended testing for more than one year.

Conclusions

In hybrid rye, genomic selection is superior to marker-assisted selection. However, it achieves high accuracies of prediction only for selection candidates closely related to the plant material evaluated in field trials, resulting in a rather pessimistic prognosis for distantly related material. Both, the numbers of evaluation locations and testing years in trials contribute equally to prediction accuracy.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-556) contains supplementary material, which is available to authorized users.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号