首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.

Background

Genomic selection is increasingly widely practised, particularly in dairy cattle. However, the accuracy of current predictions using GBLUP (genomic best linear unbiased prediction) decays rapidly across generations, and also as selection candidates become less related to the reference population. This is likely caused by the effects of causative mutations being dispersed across many SNPs (single nucleotide polymorphisms) that span large genomic intervals. In this paper, we hypothesise that the use of a nonlinear method (BayesR), combined with a multi-breed (Holstein/Jersey) reference population will map causative mutations with more precision than GBLUP and this, in turn, will increase the accuracy of genomic predictions for selection candidates that are less related to the reference animals.

Results

BayesR improved the across-breed prediction accuracy for Australian Red dairy cattle for five milk yield and composition traits by an average of 7% over the GBLUP approach (Australian Red animals were not included in the reference population). Using the multi-breed reference population with BayesR improved accuracy of prediction in Australian Red cattle by 2 – 5% compared to using BayesR with a single breed reference population. Inclusion of 8478 Holstein and 3917 Jersey cows in the reference population improved accuracy of predictions for these breeds by 4 and 5%. However, predictions for Holstein and Jersey cattle were similar using within-breed and multi-breed reference populations. We propose that the improvement in across-breed prediction achieved by BayesR with the multi-breed reference population is due to more precise mapping of quantitative trait loci (QTL), which was demonstrated for several regions. New candidate genes with functional links to milk synthesis were identified using differential gene expression in the mammary gland.

Conclusions

QTL detection and genomic prediction are usually considered independently but persistence of genomic prediction accuracies across breeds requires accurate estimation of QTL effects. We show that accuracy of across-breed genomic predictions was higher with BayesR than with GBLUP and that BayesR mapped QTL more precisely. Further improvements of across-breed accuracy of genomic predictions and QTL mapping could be achieved by increasing the size of the reference population, including more breeds, and possibly by exploiting pleiotropic effects to improve mapping efficiency for QTL with small effects.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-014-0074-4) contains supplementary material, which is available to authorized users.  相似文献   

2.
Combining different swine populations in genomic prediction can be an important tool, leading to an increased accuracy of genomic prediction using single nucleotide polymorphism (SNP) chip data compared with within-population genomic. However, the expected higher accuracy of multi-population genomic prediction has not been realized. This may be due to an inconsistent linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTL) across populations, and the weak genetic relationships across populations. In this study, we determined the impact of different genomic relationship matrices, SNP density and pre-selected variants on prediction accuracy using a combined Yorkshire pig population. Our objective was to provide useful strategies for improving the accuracy of genomic prediction within a combined population. Results showed that the accuracy of genomic best linear unbiased prediction (GBLUP) using imputed whole-genome sequencing (WGS) data in the combined population was always higher than that within populations. Furthermore, the use of imputed WGS data always resulted in a higher accuracy of GBLUP than the use of 80K chip data for the combined population. Additionally, the accuracy of GBLUP with a non-linear genomic relationship matrix was markedly increased (0.87% to 15.17% for 80K chip data, and 0.43% to 4.01% for imputed WGS data) compared with that obtained with a linear genomic relationship matrix, except for the prediction of XD population in the combined population using imputed WGS data. More importantly, the application of pre-selected variants based on fixation index (Fst) scores improved the accuracy of multi-population genomic prediction, especially for 80K chip data. For BLUP|GA (BLUP approach given the genetic architecture), the use of a linear method with an appropriate weight to build a weight-relatedness matrix led to a higher prediction accuracy compared with the use of only pre-selected SNPs for genomic evaluations, especially for the total number of piglets born. However, for the non-linear method, BLUP|GA showed only a small increase or even a decrease in prediction accuracy compared with the use of only pre-selected SNPs. Overall, the best genomic evaluation strategy for reproduction-related traits for a combined population was found to be GBLUP performed with a non-linear genomic relationship matrix using variants pre-selected from the 80K chip data based on Fst scores.  相似文献   

3.

Background

In China, the reference population of genotyped Holstein cattle is relatively small with to date, 80 bulls and 2091 cows genotyped with the Illumina 54 K chip. Including genotyped Holstein cattle from other countries in the reference population could improve the accuracy of genomic prediction of the Chinese Holstein population. This study investigated the consistency of linkage disequilibrium between adjacent markers between the Chinese and Nordic Holstein populations, and compared the reliability of genomic predictions based on the Chinese reference population only or the combined Chinese and Nordic reference populations.

Methods

Genomic estimated breeding values of Chinese Holstein cattle were predicted using a single-trait GBLUP model based on the Chinese reference dataset, and using a two-trait GBLUP model based on a joint reference dataset that included both the Chinese and Nordic Holstein data.

Results

The extent of linkage disequilibrium was similar in the Chinese and Nordic Holstein populations and the consistency of linkage disequilibrium between the two populations was very high, with a correlation of 0.97. Genomic prediction using the joint versus the Chinese reference dataset increased reliabilities of genomic predictions of Chinese Holstein bulls in the test data from 0.22, 0.15 and 0.11 to 0.51, 0.47 and 0.36 for milk yield, fat yield and protein yield, respectively. Using five-fold cross-validation, reliabilities of genomic predictions of Chinese cows increased from 0.15, 0.12 and 0.15 to 0.26, 0.17 and 0.20 for milk yield, fat yield and protein yield, respectively.

Conclusions

The linkage disequilibrium between the two populations was very consistent and using the combined Nordic and Chinese reference dataset substantially increased reliabilities of genomic predictions for Chinese Holstein cattle.  相似文献   

4.
This study evaluated different female-selective genotyping strategies to increase the predictive accuracy of genomic breeding values (GBVs) in populations that have a limited number of sires with a large number of progeny. A simulated dairy population was utilized to address the aims of the study. The following selection strategies were used: random selection, two-tailed selection by yield deviations, two-tailed selection by breeding value, top yield deviation selection and top breeding value selection. For comparison, two other strategies, genotyping of sires and pedigree indexes from traditional genetic evaluation, were included in the analysis. Two scenarios were simulated, low heritability (h2 = 0.10) and medium heritability (h2 = 0.30). GBVs were estimated using the Bayesian Lasso. The accuracy of predicted GBVs using the two-tailed strategies was better than the accuracy obtained using other strategies (0.50 and 0.63 for the two-tailed selection by yield deviations strategy and 0.48 and 0.63 for the two-tailed selection by breeding values strategy in low- and medium-heritability scenarios, respectively, using 1000 genotyped cows). When 996 genotyped bulls were used as the training population, the sire’ strategy led to accuracies of 0.48 and 0.55 for low- and medium-heritability traits, respectively. The Random strategies required larger training populations to outperform the accuracies of the pedigree index; however, selecting females from the top of the yield deviations or breeding values of the population did not improve accuracy relative to that of the pedigree index. Bias was found for all genotyping strategies considered, although the Top strategies produced the most biased predictions. Strategies that involve genotyping cows can be implemented in breeding programs that have a limited number of sires with a reliable progeny test. The results of this study showed that females that exhibited upper and lower extreme values within the distribution of yield deviations may be included as training population to increase reliability in small reference populations. The strategies that selected only the females that had high estimated breeding values or yield deviations produced suboptimal results.  相似文献   

5.
Ignacy Misztal 《Genetics》2016,202(2):401-409
Many computations with SNP data including genomic evaluation, parameter estimation, and genome-wide association studies use an inverse of the genomic relationship matrix. The cost of a regular inversion is cubic and is prohibitively expensive for large matrices. Recent studies in cattle demonstrated that the inverse can be computed in almost linear time by recursion on any subset of ∼10,000 individuals. The purpose of this study is to present a theory of why such a recursion works and its implication for other populations. Assume that, because of a small effective population size, the additive information in a genotyped population has a small dimensionality, even with a very large number of SNP markers. That dimensionality is visible as a limited number of effective SNP effects, independent chromosome segments, or the rank of the genomic relationship matrix. Decompose a population arbitrarily into core and noncore individuals, with the number of core individuals equal to that dimensionality. Then, breeding values of noncore individuals can be derived by recursions on breeding values of core individuals, with coefficients of the recursion computed from the genomic relationship matrix. A resulting algorithm for the inversion called “algorithm for proven and young” (APY) has a linear computing and memory cost for noncore animals. Noninfinitesimal genetic architecture can be accommodated through a trait-specific genomic relationship matrix, possibly derived from Bayesian regressions. For populations with small effective population size, the inverse of the genomic relationship matrix can be computed inexpensively for a very large number of genotyped individuals.  相似文献   

6.
T Druet  I M Macleod  B J Hayes 《Heredity》2014,112(1):39-47
Genomic prediction from whole-genome sequence data is attractive, as the accuracy of genomic prediction is no longer bounded by extent of linkage disequilibrium between DNA markers and causal mutations affecting the trait, given the causal mutations are in the data set. A cost-effective strategy could be to sequence a small proportion of the population, and impute sequence data to the rest of the reference population. Here, we describe strategies for selecting individuals for sequencing, based on either pedigree relationships or haplotype diversity. Performance of these strategies (number of variants detected and accuracy of imputation) were evaluated in sequence data simulated through a real Belgian Blue cattle pedigree. A strategy (AHAP), which selected a subset of individuals for sequencing that maximized the number of unique haplotypes (from single-nucleotide polymorphism panel data) sequenced gave good performance across a range of variant minor allele frequencies. We then investigated the optimum number of individuals to sequence by fold coverage given a maximum total sequencing effort. At 600 total fold coverage (x 600), the optimum strategy was to sequence 75 individuals at eightfold coverage. Finally, we investigated the accuracy of genomic predictions that could be achieved. The advantage of using imputed sequence data compared with dense SNP array genotypes was highly dependent on the allele frequency spectrum of the causative mutations affecting the trait. When this followed a neutral distribution, the advantage of the imputed sequence data was small; however, when the causal mutations all had low minor allele frequencies, using the sequence data improved the accuracy of genomic prediction by up to 30%.  相似文献   

7.
Most dairy cattle populations found in different countries around the world are small to medium sized and use many artificial insemination bulls imported from different foreign countries. The Walloon population in the southern part of Belgium is a good example for such a small-scale population. Wallonia has also a very active community of Holstein breeders requesting high level genetic evaluation services. Single-step Genomic BLUP (ssGBLUP) methods allow the simultaneous use of genomic, pedigree and phenotypic information and could reduce potential biases in the estimation of genomically enhanced breeding values (GEBV). Therefore, in the context of implementing a Walloon genomic evaluation system for Holsteins, it was considered as the best option. However, in contrast to multi-step genomic predictions, natively ssGBLUP will only use local phenotypic information and is unable to use directly important other sources of information coming from abroad, for example Multiple Across Country Evaluation (MACE) results as provided by the Interbull Center (Uppsala, Sweden). Therefore, we developed and implemented single-step Genomic Bayesian Prediction (ssGBayes), as an alternative method for the Walloon genomic evaluations. The ssGBayes method approximated the correct system of equations directly using estimated breeding values (EBV) and associated reliabilities (REL) without any explicit deregression step. In the Walloon genomic evaluation, local information refers to Walloon EBV and REL and foreign information refers to MACE EBV and associated REL. Combining simultaneously all available genotypes, pedigree, local and foreign information in an evaluation can be achieved but adding contributions to left-hand and right-hand sides subtracting double-counted contributions. Correct propagation of external information avoiding double counting of contributions due to relationships and due to records can be achieved. This ssGBayes method computed more accurate predictions for all types of animals. For example, for genotyped animals with low Walloon REL (<0.25) without MACE results but sired by genotyped bulls with MACE results, the average increase of REL for the studied traits was 0.38 points of which 0.08 points could be traced to the inclusion of MACE information. For other categories of genotyped animals, the contribution by MACE information was also high. The Walloon genomic evaluation system passed for the first time the Interbull GEBV tests for several traits in July 2013. Recent experiences reported here refer to its use in April 2016 for the routine genomic evaluations of milk production, udder health and type traits. Results showed that the proposed methodology should also be of interest for other, similar, populations.  相似文献   

8.
A juxtaposed microsatellite system (JMS) is composed of two microsatellite repeat arrays separated by a sequence of less than 200 bp and more than 20 bp. This paper presents the first empirical evaluation of JMSs for the study of genetic admixture induced by man, with brown trout (Salmo trutta) as model organism. Two distinct admixture situations were studied: native populations from streams of the Atlantic basin and of the Mediterranean basin, respectively, all stocked with domestic strains originating from the Atlantic basin. For these two situations, we first evaluated by simulation the ability of JMSs to differentiate between alien alleles and naturally shared homoplasious or ancestral alleles, and thus to behave as diagnostic markers for admixture. Simulations indicated that JMSs are expected to be reliable diagnostic markers in most divergent (i.e. Mediterranean) populations and nonreliable diagnostic markers in most closely related (i.e. Atlantic) populations. Three JMSs were genotyped in domestic strains as well as in nonstocked and stocked populations of brown trout sampled in different rivers of the Mediterranean and Atlantic basins. The observed distributions of JMS haplotypes were consistent with simulation predictions confirming that JMSs were reliable diagnostic markers only over a given proportion of the species range, i.e. in substantially divergent populations. JMSs also reinforced the diagnostic character of three microsatellite sites for the studied Mediterranean populations. This last result is consistent with our simulation results which showed that, although much less frequently than at JMSs, diagnostic markers are likely to be found at single site microsatellites provided that the native Mediterranean population has a sufficiently small effective population size. For each population of the Mediterranean basin admixture coefficients did not differ significantly across JMSs and mean admixture coefficients sometimes differ among populations. The interpretation of the origin of JMS haplotypes based on the allele length variants was supported by nucleotide sequence analysis.  相似文献   

9.
In modern dairy cattle breeding, genomic breeding programs have the potential to increase efficiency and genetic gain. At the same time, the requirements and the availability of genotypes and phenotypes present a challenge. The set-up of a large enough reference population for genomic prediction is problematic for numerically small breeds but also for hard to measure traits. The first part of this study is a review of the current literature on strategies to overcome the lack of reference data. One solution is the use of combined reference populations from different breeds, different countries, or different research populations. Results reveal that the level of relationship between the merged populations is the most important factor. Compiling closely related populations facilitates the accurate estimation of marker effects and thus results in high accuracies of genomic prediction. Consequently, mixed reference populations of the same breed, but from different countries are more promising than combining different breeds, especially if those are more distantly related. The use of female reference information has the potential to enlarge the reference population size. Including females is advisable for small populations and difficult traits, and maybe combined with genotyping females and imputing those that are un-genotyped.The efficient use of imputation for un-genotyped individuals requires a set of genotyped related animals and well-considered selection strategies which animals to choose for genotyping and phenotyping. Small populations have to find ways to derive additional advantages from the cost-intensive establishment of genomic breeding schemes. Possible solutions may be the use of genomic information for inbreeding control, parentage verification, within-herd selection, adjusted mating plans or conservation strategies.The second part of the paper deals with the issue of high-quality phenotypes against the background of new, difficult and hard to measure traits. The use of contracted herds for phenotyping is recommended, as additional traits, when compared to standard traits used in dairy cattle breeding can be measured at set moments in time. This can be undertaken even for the recording of health traits, thus resulting in complete contemporary groups for health traits. Future traits to be recorded and used in genomic breeding programs, at least partly will be traits for which traditional selection based on widespread phenotyping is not possible. Enabling phenotyping of sufficient numbers to enable genomic selection will rely on cooperation between scientists from different disciplines and may require multidisciplinary approaches.  相似文献   

10.

Background

The most common application of imputation is to infer genotypes of a high-density panel of markers on animals that are genotyped for a low-density panel. However, the increase in accuracy of genomic predictions resulting from an increase in the number of markers tends to reach a plateau beyond a certain density. Another application of imputation is to increase the size of the training set with un-genotyped animals. This strategy can be particularly successful when a set of closely related individuals are genotyped.

Methods

Imputation on completely un-genotyped dams was performed using known genotypes from the sire of each dam, one offspring and the offspring’s sire. Two methods were applied based on either allele or haplotype frequencies to infer genotypes at ambiguous loci. Results of these methods and of two available software packages were compared. Quality of imputation under different population structures was assessed. The impact of using imputed dams to enlarge training sets on the accuracy of genomic predictions was evaluated for different populations, heritabilities and sizes of training sets.

Results

Imputation accuracy ranged from 0.52 to 0.93 depending on the population structure and the method used. The method that used allele frequencies performed better than the method based on haplotype frequencies. Accuracy of imputation was higher for populations with higher levels of linkage disequilibrium and with larger proportions of markers with more extreme allele frequencies. Inclusion of imputed dams in the training set increased the accuracy of genomic predictions. Gains in accuracy ranged from close to zero to 37.14%, depending on the simulated scenario. Generally, the larger the accuracy already obtained with the genotyped training set, the lower the increase in accuracy achieved by adding imputed dams.

Conclusions

Whenever a reference population resembling the family configuration considered here is available, imputation can be used to achieve an extra increase in accuracy of genomic predictions by enlarging the training set with completely un-genotyped dams. This strategy was shown to be particularly useful for populations with lower levels of linkage disequilibrium, for genomic selection on traits with low heritability, and for species or breeds for which the size of the reference population is limited.  相似文献   

11.
Microsatellite, or simple sequence repeat (SSR), loci can be identified by mining expressed sequence tag (EST) databases, and where these are available, marker development time and expense can be decreased considerably over conventional strategies of probing the entire genome. However, it is unclear whether they provide information on population structure similar to that generated by anonymous genomic SSRs. We performed comparative population genetic analyses between EST-derived SSRs (EST-SSRs) and anonymous SSRs developed from genomic DNA for the same set of populations of the insect Diabrotica virgifera, a beetle in the family Chrysomelidae. Compared with noncoding, nontranscribed regions, EST-SSRs were generally less polymorphic but had reduced occurrence of null alleles and greater cross-species amplification. Neutrality tests suggested the loci were not under positive selection. Across all populations and all loci, the genomic and EST-SSRs performed similarly in estimating genetic diversity, F(IS), F(ST), population assignment and exclusion tests, and detection of distinct populations. These findings, therefore, indicate that the EST-SSRs examined can be used with confidence in future genetic studies of Diabrotica populations and suggest that EST libraries can be added as a valuable source of markers for population genetics studies in insects and other animals.  相似文献   

12.
Heritability is a central element in quantitative genetics. New molecular markers to assess genetic variance and heritability are continually under development. The availability of molecular single nucleotide polymorphism (SNP) markers can be applied for estimation of variance components and heritability on population, where relationship information is unknown. In this study, we evaluated the capabilities of two Bayesian genomic models to estimate heritability in simulated populations. The populations comprised different family structures of either no or a limited number of relatives, a single quantitative trait, and with one of two densities of SNP markers. All individuals were both genotyped and phenotyped. Results illustrated that the two models were capable of estimating heritability, when true heritability was 0.15 or higher and populations had a sample size of 400 or higher. For heritabilities of 0.05, all models had difficulties in estimating the true heritability. The two Bayesian models were compared with a restricted maximum likelihood (REML) approach using a genomic relationship matrix. The comparison showed that the Bayesian approaches performed equally well as the REML approach. Differences in family structure were in general not found to influence the estimation of the heritability. For the sample sizes used in this study, a 10-fold increase of SNP density did not improve precision estimates compared with set-ups with a less dense distribution of SNPs. The methods used in this study showed that it was possible to estimate heritabilities on the basis of SNPs in animals with direct measurements. This conclusion is valuable in cases when quantitative traits are either difficult or expensive to measure.  相似文献   

13.
A kinetic model for subtractive hybridization.   总被引:1,自引:0,他引:1       下载免费PDF全文
Nucleic acid sequences that differ in abundance between two populations (target sequences) can be cloned by multiple rounds of subtractive hybridization and amplification by PCR. These sequences can be cDNAs representing up-regulated mRNAs, or genomic DNAs from deletion mutants. We have derived an equation that describes the recovery of such sequences, and have used this to simulate the outcome of up to 10 rounds of subtractive hybridization and PCR amplification. When the model was tested by comparing its predictions with the published results from genomic and cDNA subtractions, the predictions of the model were generally in good agreement with the published data. We have modelled the outcomes of genomic subtractions, for a variety of genomes, and have used it to compare various strategies for enriching targets. The model predicts that for genomes of less than 5 x 10(8) bp, deletions of as small as 1 kbp should represent > 99% of the DNA after three to six rounds of hybridization (depending on the enrichment procedure). As genomes increase in size, the kinetics of hybridization become an important limiting factor. However, even for genomes as large as 3 x 10(9) bp, it should be possible to isolate deletions of 5 kbp using the appropriate conditions. These simulations suggest that such methods offer a realistic alternative to chromosome walking for identifying genomic deletions for which there are known phenotypes, thereby considerably reducing time and effort. For cDNA subtractive hybridization, the model predicts that after six rounds of hybridization, sequences that do not differ in abundance between the tester and driver populations (the background) will represent < 1% of the subtracted population, and even quite modestly upregulated cDNAs should be successfully enriched. Where several up-regulated cDNAs are present, the predicted final representation is dependent on both the initial abundance and the degree of up-regulation.  相似文献   

14.
ABSTRACT: BACKGROUND: Low cost genotyping of individuals using high density genomic markers were recently introduced as genomic selection in genetic improvement programs in dairy cattle. Most implementations of genomic selection only use marker information, in the models used for prediction of genetic merit. However, in other species it has been shown that only a fraction of the total genetic variance can be explained by markers. Using 5217 bulls in the Nordic Holstein population that were genotyped and had genetic evaluations based on progeny, we partitioned the total additive genetic variance into a genomic component explained by markers and a remaining component explained by familial relationships. The traits analyzed were production and fitness related traits in dairy cattle. Furthermore, we estimated the genomic variance that can be attributed to individual chromosomes and we illustrate methods that can predict the amount of additive genetic variance that can be explained by sets of markers with different density. RESULTS: The amount of additive genetic variance that can be explained by markers was estimated by an analysis of the matrix of genomic relationships. For the traits in the analysis, most of the additive genetic variance can be explained by 44 K informative SNP markers. The same amount of variance can be attributed to individual chromosomes but surprisingly the relation between chromosomal variance and chromosome length was weak. In models including both genomic (marker) and familial (pedigree) effects most (on average 77.2%) of total additive genetic variance was explained by genomic effects while the remaining was explained by familial relationships. CONCLUSIONS: Most of the additive genetic variance for the traits in the Nordic Holstein population can be explained using 44 K informative SNP markers. By analyzing the genomic relationship matrix it is possible to predict the amount of additive genetic variance that can be explained by a reduced (or increased) set of markers. For the population analyzed the improvement of genomic prediction by increasing marker density beyond 44 K is limited.  相似文献   

15.
16.
Admixture mapping is a potentially powerful tool for mapping complex genetic diseases. For application of this method, admixed individuals must have genomes composed of large segments derived intact from each founding population. Such segments are thought to be present in African Americans (AA) and should be demonstrable by examination of linkage disequilibrium (LD). Previous studies using a variety of polymorphic markers have variably reported long-range LD or rapid decay of LD. To further define the extent and characteristics of LD caused by admixture in the AA population, the current study utilized a set of 52 diallelic markers that were selected for large standard variances between putative representatives of the founder populations. LD was examined in over 250 marker-pairs, including linked markers from four different chromosomal regions and an equal number of matched unlinked comparisons. In the representative founder populations, strong LD was not observed for markers separated by more than 10 kb. In contrast, results indicated significant LD ( P<0.001, D'>0.3) in AA over large genomic segments exceeding 10 centiMorgans (cM) and 15 megabases (Mb). Only marginally significant LD was present between unlinked markers in this population, suggesting that choosing appropriate levels of significance for admixture mapping can minimize false positive results. The ability to detect LD for extended chromosomal segments in AA decayed not only as a function of the distance between markers, but also as a function of the standard variance of the markers. This examination of several genomic segments provides strong evidence that appropriate selection of informative markers is a crucial prerequisite for the application of admixture mapping to the AA population.  相似文献   

17.
Population genomics is a useful tool to support integrated pest management as it can elucidate population dynamics, demography, and histories of invasion. Here, we use a restriction site‐associated DNA sequencing approach combined with whole‐genome amplification (WGA) to assess genomic population structure of a newly described pest of canola, the diminutive canola flower midge, Contarinia brassicola. Clustering analyses recovered little geographic structure across the main canola production region but differentiated several geographically disparate populations at edges of the agricultural zone. Given a lack of alternative hypotheses for this pattern, we suggest these data support alternative hosts for this species and thus our canola‐centric view of this midge as a pest has limited our understanding of its biology. These results speak to the need for increased surveying efforts across multiple habitats and other potential hosts within Brassicaceae to improve both our ecological and evolutionary knowledge of this species and contribute to effective management strategies. We additionally found that use of WGA prior to library preparation was an effective method for increasing DNA quantity of these small insects prior to restriction site‐associated DNA sequencing and had no discernible impact on genotyping consistency for population genetic analysis; WGA is therefore likely to be tractable for other similar studies that seek to randomly sample markers across the genome in small organisms.  相似文献   

18.

Background

Size of the reference population and reliability of phenotypes are crucial factors influencing the reliability of genomic predictions. It is therefore useful to combine closely related populations. Increased accuracies of genomic predictions depend on the number of individuals added to the reference population, the reliability of their phenotypes, and the relatedness of the populations that are combined.

Methods

This paper assesses the increase in reliability achieved when combining four Holstein reference populations of 4000 bulls each, from European breeding organizations, i.e. UNCEIA (France), VikingGenetics (Denmark, Sweden, Finland), DHV-VIT (Germany) and CRV (The Netherlands, Flanders). Each partner validated its own bulls using their national reference data and the combined data, respectively.

Results

Combining the data significantly increased the reliability of genomic predictions for bulls in all four populations. Reliabilities increased by 10%, compared to reliabilities obtained with national reference populations alone, when they were averaged over countries and the traits evaluated. For different traits and countries, the increase in reliability ranged from 2% to 19%.

Conclusions

Genomic selection programs benefit greatly from combining data from several closely related populations into a single large reference population.  相似文献   

19.
Genomic prediction utilizes single nucleotide polymorphism (SNP) chip data to predict animal genetic merit. It has the advantage of potentially capturing the effects of the majority of loci that contribute to genetic variation in a trait, even when the effects of the individual loci are very small. To implement genomic prediction, marker effects are estimated with a training set, including individuals with marker genotypes and trait phenotypes; subsequently, genomic estimated breeding values (GEBV) for any genotyped individual in the population can be calculated using the estimated marker effects. In this study, we aimed to: (i) evaluate the potential of genomic prediction to predict GEBV for nematode resistance traits and BW in sheep, within and across populations; (ii) evaluate the accuracy of these predictions through within-population cross-validation; and (iii) explore the impact of population structure on the accuracy of prediction. Four data sets comprising 752 lambs from a Scottish Blackface population, 2371 from a Sarda×Lacaune backcross population, 1000 from a Martinik Black-Belly×Romane backcross population and 64 from a British Texel population were used in this study. Traits available for the analysis were faecal egg count for Nematodirus and Strongyles and BW at different ages or as average effect, depending on the population. Moreover, immunoglobulin A was also available for the Scottish Blackface population. Results show that GEBV had moderate to good within-population predictive accuracy, whereas across-population predictions had accuracies close to zero. This can be explained by our finding that in most cases the accuracy estimates were mostly because of additive genetic relatedness between animals, rather than linkage disequilibrium between SNP and quantitative trait loci. Therefore, our results suggest that genomic prediction for nematode resistance and BW may be of value in closely related animals, but that with the current SNP chip genomic predictions are unlikely to work across breeds.  相似文献   

20.
Biallelic markers such as single nucleotide polymorphisms (SNPs) and insertion/deletion polymorphisms have become increasingly popular markers for various population genetics applications. However, the effort required to develop biallelic markers in nonmodel organisms is still substantial. In this study, we compared the estimation of various population genetic parameters (genetic divergence and structuring, isolation-by-distance, genetic diversity) using a limited number of biallelic markers (in total 7 loci) to those estimated with 14 microsatellite loci in 21 Atlantic salmon (Salmo salar) populations from northern Europe. Pairwise FST values were significantly correlated between biallelic loci and microsatellite datasets, as was overall heterozygosity when both anadromous and nonanadromous populations were analyzed together. However, when the anadromous and nonanadromous samples were analyzed separately, only genetic divergence correlations remained significant. Biallelic markers alone were not sufficient for reliable neighbor-joining clustering of populations but gave highly similar isolation-by-distance signals when compared with microsatellites. Finally, although several population prioritization measures for conservation exhibited significant correlation between different marker types, the specific populations highlighted as being most valuable for conservation purposes varied depending on the marker type and conservation criteria applied. This study demonstrates that a relatively small set of biallelic markers can be sufficient for obtaining concordant results in most of the analyses compared with microsatellites, although estimates of genetic distance are generally more concordant than estimates of genetic diversity. This suggests that a relatively small number of biallelic markers can provide useful information for various population genetic applications. However, we emphasize that the use of much higher number of loci is preferable, especially when the genetic differences between populations are subtle or individual multilocus genotype-based analyses are to be performed.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号