首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Genome wide linkage disequilibrium (LD) was investigated in a set of 32 genotypes representing salt tolerant improved varieties and landraces and six salt sensitive genotypes of rice with 64 microsatellite markers to identify the genomic regions that are associated with salt tolerance in rice. Out of 64 markers analyzed, 36% SSR pairs exhibited significant LD at 0.05. A few regions were identified as targets of selection in 10 chromosomes with high r 2 values. The model-based groups from Bayesian clustering analysis are largely consistent with known pedigrees of the lines. The increased percentage of association of SSR loci in the improved varieties indicated the role of selection in linkage disequilibrium especially for salt tolerance. LD was extended as far as 100 cM in the present study. Most of the markers (43.8%) with significant LD values were observed in the genomic regions of reported QTL for salt tolerance in rice.  相似文献   

2.
We compared the accuracies of four genomic-selection prediction methods as affected by marker density, level of linkage disequilibrium (LD), quantitative trait locus (QTL) number, sample size, and level of replication in populations generated from multiple inbred lines. Marker data on 42 two-row spring barley inbred lines were used to simulate high and low LD populations from multiple inbred line crosses: the first included many small full-sib families and the second was derived from five generations of random mating. True breeding values (TBV) were simulated on the basis of 20 or 80 additive QTL. Methods used to derive genomic estimated breeding values (GEBV) were random regression best linear unbiased prediction (RR–BLUP), Bayes-B, a Bayesian shrinkage regression method, and BLUP from a mixed model analysis using a relationship matrix calculated from marker data. Using the best methods, accuracies of GEBV were comparable to accuracies from phenotype for predicting TBV without requiring the time and expense of field evaluation. We identified a trade-off between a method's ability to capture marker-QTL LD vs. marker-based relatedness of individuals. The Bayesian shrinkage regression method primarily captured LD, the BLUP methods captured relationships, while Bayes-B captured both. Under most of the study scenarios, mixed-model analysis using a marker-derived relationship matrix (BLUP) was more accurate than methods that directly estimated marker effects, suggesting that relationship information was more valuable than LD information. When markers were in strong LD with large-effect QTL, or when predictions were made on individuals several generations removed from the training data set, however, the ranking of method performance was reversed and BLUP had the lowest accuracy.  相似文献   

3.

Background

The impact of additive-genetic relationships captured by single nucleotide polymorphisms (SNPs) on the accuracy of genomic breeding values (GEBVs) has been demonstrated, but recent studies on data obtained from Holstein populations have ignored this fact. However, this impact and the accuracy of GEBVs due to linkage disequilibrium (LD), which is fairly persistent over generations, must be known to implement future breeding programs.

Materials and methods

The data set used to investigate these questions consisted of 3,863 German Holstein bulls genotyped for 54,001 SNPs, their pedigree and daughter yield deviations for milk yield, fat yield, protein yield and somatic cell score. A cross-validation methodology was applied, where the maximum additive-genetic relationship (amax) between bulls in training and validation was controlled. GEBVs were estimated by a Bayesian model averaging approach (BayesB) and an animal model using the genomic relationship matrix (G-BLUP). The accuracy of GEBVs due to LD was estimated by a regression approach using accuracy of GEBVs and accuracy of pedigree-based BLUP-EBVs.

Results

Accuracy of GEBVs obtained by both BayesB and G-BLUP decreased with decreasing amax for all traits analyzed. The decay of accuracy tended to be larger for G-BLUP and with smaller training size. Differences between BayesB and G-BLUP became evident for the accuracy due to LD, where BayesB clearly outperformed G-BLUP with increasing training size.

Conclusions

GEBV accuracy of current selection candidates varies due to different additive-genetic relationships relative to the training data. Accuracy of future candidates can be lower than reported in previous studies because information from close relatives will not be available when selection on GEBVs is applied. A Bayesian model averaging approach exploits LD information considerably better than G-BLUP and thus is the most promising method. Cross-validations should account for family structure in the data to allow for long-lasting genomic based breeding plans in animal and plant breeding.  相似文献   

4.
D Gianola  S Qanbari  H Simianer 《Heredity》2013,111(4):275-285
The analysis of systems involving many loci is important in population and quantitative genetics. An important problem is the study of linkage disequilibrium (LD), a concept relevant in genome-enabled prediction of quantitative traits and in exploration of marker–phenotype associations. This article introduces a new estimator of a LD parameter (ρ2) that is much easier to compute than a maximum likelihood (or Bayesian) estimate of a tetra-choric correlation. We examined the conjecture that the sampling distribution of the estimator of ρ2 could be less frequency dependent than that of the estimator of r2, a widely used metric for assessing LD. This was done via an empirical evaluation of LD in 806 Holstein–Friesian cattle using 771 single-nucleotide polymorphism (SNP) markers and of HapMap III data on 21 991 SNPs (chromosome 3) observed in 88 unrelated individuals from Tuscany. Also, 1600 haplotypes over a region of 1 Mb simulated under the coalescent were used to estimate LD using the two measures. Subsequently, a simulation study compared the new estimator with that of r2 using several scenarios of LD and allelic frequencies. From these studies, it is concluded that ρ2 provides a useful metric for the study of LD as the distribution of its estimator is less frequency dependent than that of the standard estimator of r2.  相似文献   

5.

Background

Numerous methods have been developed over the last decade to predict allelic identity at unobserved loci between pairs of chromosome segments along the genome. These loci are often unobserved positions tested for the presence of quantitative trait loci (QTL). The main objective of this study was to understand from a theoretical standpoint the relation between linkage disequilibrium (LD) and allelic identity prediction when using haplotypes for fine mapping of QTL. In addition, six allelic identity predictors (AIP) were also compared in this study to determine which one performed best in theory and application.

Results

A criterion based on a simple measure of matrix distance was used to study the relation between LD and allelic identity prediction when using haplotypes. The consistency of this criterion with the accuracy of QTL localization, another criterion commonly used to compare AIP, was evaluated on a set of real chromosomes. For this set of chromosomes, the criterion was consistent with the mapping accuracy of a simulated QTL with either low or high effect. As measured by the matrix distance, the best AIP for QTL mapping were those that best captured LD between a tested position and a QTL. Moreover the matrix distance between a tested position and a QTL was shown to decrease for some AIP when LD increased. However, the matrix distance for AIP with continuous predictions in the [0,1] interval was algebraically proven to decrease less rapidly up to a lower bound with increasing LD in the simplest situations, than the discrete predictor based on identity by state between haplotypes (IBS hap), for which there was no lower bound. The expected LD between haplotypes at a tested position and alleles at a QTL is a quantity that increases naturally when the tested position gets closer to the QTL. This behavior was demonstrated with pig and unrelated human chromosomes.

Conclusions

When the density of markers is high, and therefore LD between adjacent loci can be assumed to be high, the discrete predictor IBS hap is recommended since it predicts allele identity correctly when taking LD into account.  相似文献   

6.
In genome-based prediction there is considerable uncertainty about the statistical model and method required to maximize prediction accuracy. For traits influenced by a small number of quantitative trait loci (QTL), predictions are expected to benefit from methods performing variable selection [e.g., BayesB or the least absolute shrinkage and selection operator (LASSO)] compared to methods distributing effects across the genome [ridge regression best linear unbiased prediction (RR-BLUP)]. We investigate the assumptions underlying successful variable selection by combining computer simulations with large-scale experimental data sets from rice (Oryza sativa L.), wheat (Triticum aestivum L.), and Arabidopsis thaliana (L.). We demonstrate that variable selection can be successful when the number of phenotyped individuals is much larger than the number of causal mutations contributing to the trait. We show that the sample size required for efficient variable selection increases dramatically with decreasing trait heritabilities and increasing extent of linkage disequilibrium (LD). We contrast and discuss contradictory results from simulation and experimental studies with respect to superiority of variable selection methods over RR-BLUP. Our results demonstrate that due to long-range LD, medium heritabilities, and small sample sizes, superiority of variable selection methods cannot be expected in plant breeding populations even for traits like FRIGIDA gene expression in Arabidopsis and flowering time in rice, assumed to be influenced by a few major QTL. We extend our conclusions to the analysis of whole-genome sequence data and infer upper bounds for the number of causal mutations which can be identified by LASSO. Our results have major impact on the choice of statistical method needed to make credible inferences about genetic architecture and prediction accuracy of complex traits.  相似文献   

7.
Multiparental designs combined with dense genotyping of parents have been proposed as a way to increase the diversity and resolution of quantitative trait loci (QTL) mapping studies, using methods combining linkage disequilibrium information with linkage analysis (LDLA). Two new nested association mapping designs adapted to European conditions were derived from the complementary dent and flint heterotic groups of maize (Zea mays L.). Ten biparental dent families (N = 841) and 11 biparental flint families (N = 811) were genotyped with 56,110 single nucleotide polymorphism markers and evaluated as test crosses with the central line of the reciprocal design for biomass yield, plant height, and precocity. Alleles at candidate QTL were defined as (i) parental alleles, (ii) haplotypic identity by descent, and (iii) single-marker groupings. Between five and 16 QTL were detected depending on the model, trait, and genetic group considered. In the flint design, a major QTL (R2 = 27%) with pleiotropic effects was detected on chromosome 10, whereas other QTL displayed milder effects (R2 < 10%). On average, the LDLA models detected more QTL but generally explained lower percentages of variance, consistent with the fact that most QTL display complex allelic series. Only 15% of the QTL were common to the two designs. A joint analysis of the two designs detected between 15 and 21 QTL for the five traits. Of these, between 27 for silking date and 41% for tasseling date were significant in both groups. Favorable allelic effects detected in both groups open perspectives for improving biomass production.  相似文献   

8.

Background

The genome sequence and a high-density SNP map are now available for the chicken and can be used to identify genetic markers for use in marker-assisted selection (MAS). Effective MAS requires high linkage disequilibrium (LD) between markers and quantitative trait loci (QTL), and sustained marker-QTL LD over generations. This study used data from a 3,000 SNP panel to assess the level and consistency of LD between single nucleotide polymorphisms (SNPs) over consecutive years in two egg-layer chicken lines, and analyzed one line by two methods (SNP-wise association and genome-wise Bayesian analysis) to identify markers associated with egg-quality and egg-production phenotypes.

Results

The LD between markers pairs was high at short distances (r2 > 0.2 at < 2 Mb) and remained high after one generation (correlations of 0.80 to 0.92 at < 5 Mb) in both lines. Single- and 3-SNP regression analyses using a mixed model with SNP as fixed effect resulted in 159 and 76 significant tests (P < 0.01), respectively, across 12 traits. A Bayesian analysis called BayesB, that fits all SNPs simultaneously as random effects and uses model averaging procedures, identified 33 SNPs that were included in the model >20% of the time (φ > 0.2) and an additional ten 3-SNP windows that had a sum of φ greater than 0.35. Generally, SNPs included in the Bayesian model also had a small P-value in the 1-SNP analyses.

Conclusion

High LD correlations between markers at short distances across two generations indicate that such markers will retain high LD with linked QTL and be effective for MAS. The different association analysis methods used provided consistent results. Multiple single SNPs and 3-SNP windows were significantly associated with egg-related traits, providing genomic positions of QTL that can be useful for both MAS and to identify causal mutations.
  相似文献   

9.

Background

A haplotype approach to genomic prediction using high density data in dairy cattle as an alternative to single-marker methods is presented. With the assumption that haplotypes are in stronger linkage disequilibrium (LD) with quantitative trait loci (QTL) than single markers, this study focuses on the use of haplotype blocks (haploblocks) as explanatory variables for genomic prediction. Haploblocks were built based on the LD between markers, which allowed variable reduction. The haploblocks were then used to predict three economically important traits (milk protein, fertility and mastitis) in the Nordic Holstein population.

Results

The haploblock approach improved prediction accuracy compared with the commonly used individual single nucleotide polymorphism (SNP) approach. Furthermore, using an average LD threshold to define the haploblocks (LD≥0.45 between any two markers) increased the prediction accuracies for all three traits, although the improvement was most significant for milk protein (up to 3.1 % improvement in prediction accuracy, compared with the individual SNP approach). Hotelling’s t-tests were performed, confirming the improvement in prediction accuracy for milk protein. Because the phenotypic values were in the form of de-regressed proofs, the improved accuracy for milk protein may be due to higher reliability of the data for this trait compared with the reliability of the mastitis and fertility data. Comparisons between best linear unbiased prediction (BLUP) and Bayesian mixture models also indicated that the Bayesian model produced the most accurate predictions in every scenario for the milk protein trait, and in some scenarios for fertility.

Conclusions

The haploblock approach to genomic prediction is a promising method for genomic selection in animal breeding. Building haploblocks based on LD reduced the number of variables without the loss of information. This method may play an important role in the future genomic prediction involving while genome sequences.  相似文献   

10.
A genome-wide association study of seed protein and oil content in soybean   总被引:8,自引:0,他引:8  

Background

Association analysis is an alternative to conventional family-based methods to detect the location of gene(s) or quantitative trait loci (QTL) and provides relatively high resolution in terms of defining the genome position of a gene or QTL. Seed protein and oil concentration are quantitative traits which are determined by the interaction among many genes with small to moderate genetic effects and their interaction with the environment. In this study, a genome-wide association study (GWAS) was performed to identify quantitative trait loci (QTL) controlling seed protein and oil concentration in 298 soybean germplasm accessions exhibiting a wide range of seed protein and oil content.

Results

A total of 55,159 single nucleotide polymorphisms (SNPs) were genotyped using various methods including Illumina Infinium and GoldenGate assays and 31,954 markers with minor allele frequency >0.10 were used to estimate linkage disequilibrium (LD) in heterochromatic and euchromatic regions. In euchromatic regions, the mean LD (r 2 ) rapidly declined to 0.2 within 360 Kbp, whereas the mean LD declined to 0.2 at 9,600 Kbp in heterochromatic regions. The GWAS results identified 40 SNPs in 17 different genomic regions significantly associated with seed protein. Of these, the five SNPs with the highest associations and seven adjacent SNPs were located in the 27.6-30.0 Mbp region of Gm20. A major seed protein QTL has been previously mapped to the same location and potential candidate genes have recently been identified in this region. The GWAS results also detected 25 SNPs in 13 different genomic regions associated with seed oil. Of these markers, seven SNPs had a significant association with both protein and oil.

Conclusions

This research indicated that GWAS not only identified most of the previously reported QTL controlling seed protein and oil, but also resulted in narrower genomic regions than the regions reported as containing these QTL. The narrower GWAS-defined genome regions will allow more precise marker-assisted allele selection and will expedite positional cloning of the causal gene(s).  相似文献   

11.
Recently, the use of linkage disequilibrium (LD) to locate genes which affect quantitative traits (QTL) has received an increasing interest, but the plausibility of fine mapping using linkage disequilibrium techniques for QTL has not been well studied. The main objectives of this work were to (1) measure the extent and pattern of LD between a putative QTL and nearby markers in finite populations and (2) investigate the usefulness of LD in fine mapping QTL in simulated populations using a dense map of multiallelic or biallelic marker loci. The test of association between a marker and QTL and the power of the test were calculated based on single-marker regression analysis. The results show the presence of substantial linkage disequilibrium with closely linked marker loci after 100 to 200 generations of random mating. Although the power to test the association with a frequent QTL of large effect was satisfactory, the power was low for the QTL with a small effect and/or low frequency. More powerful, multi-locus methods may be required to map low frequent QTL with small genetic effects, as well as combining both linkage and linkage disequilibrium information. The results also showed that multiallelic markers are more useful than biallelic markers to detect linkage disequilibrium and association at an equal distance.  相似文献   

12.
A key question for the implementation of marker-assisted selection (MAS) using markers in linkage disequilibrium with quantitative trait loci (QTLs) is how many markers surrounding each QTL should be used to ensure the marker or marker haplotypes are in sufficient linkage disequilibrium (LD) with the QTL. In this paper we compare the accuracy of MAS using either single markers or marker haplotypes in an Angus cattle data set consisting of 9323 genome-wide single nucleotide polymorphisms (SNPs) genotyped in 379 Angus cattle. The extent of LD in the data set was such that the average marker-marker r2 was 0.2 at 200 kb. The accuracy of MAS increased as the number of markers in the haplotype surrounding the QTL increased, although only when the number of markers in the haplotype was 4 or greater did the accuracy exceed that achieved when the SNP in the highest LD with the QTL was used. A large number of phenotypic records (>1000) were required to accurately estimate the effects of the haplotypes.  相似文献   

13.
Although the concept of genomic selection relies on linkage disequilibrium (LD) between quantitative trait loci and markers, reliability of genomic predictions is strongly influenced by family relationships. In this study, we investigated the effects of LD and family relationships on reliability of genomic predictions and the potential of deterministic formulas to predict reliability using population parameters in populations with complex family structures. Five groups of selection candidates were simulated by taking different information sources from the reference population into account: (1) allele frequencies, (2) LD pattern, (3) haplotypes, (4) haploid chromosomes, and (5) individuals from the reference population, thereby having real family relationships with reference individuals. Reliabilities were predicted using genomic relationships among 529 reference individuals and their relationships with selection candidates and with a deterministic formula where the number of effective chromosome segments (Me) was estimated based on genomic and additive relationship matrices for each scenario. At a heritability of 0.6, reliabilities based on genomic relationships were 0.002 ± 0.0001 (allele frequencies), 0.022 ± 0.001 (LD pattern), 0.018 ± 0.001 (haplotypes), 0.100 ± 0.008 (haploid chromosomes), and 0.318 ± 0.077 (family relationships). At a heritability of 0.1, relative differences among groups were similar. For all scenarios, reliabilities were similar to predictions with a deterministic formula using estimated Me. So, reliabilities can be predicted accurately using empirically estimated Me and level of relationship with reference individuals has a much higher effect on the reliability than linkage disequilibrium per se. Furthermore, accumulated length of shared haplotypes is more important in determining the reliability of genomic prediction than the individual shared haplotype length.  相似文献   

14.
Association mapping is considered to be an important alternative strategy for the identification of quantitative trait loci (QTL) as compared to traditional QTL mapping. A necessary prerequisite for association analysis to succeed is detailed information regarding hidden population structure and the extent of linkage disequilibrium. A collection of 430 tetraploid potato cultivars, comprising two association panels, has been analysed with 41 AFLP® and 53 SSR primer combinations yielding 3364 AFLP fragments and 653 microsatellite alleles, respectively. Polymorphism information content values and detected number of alleles for the SSRs studied illustrate that commercial potato germplasm seems to be equally diverse as Latin American landrace material. Genome-wide linkage disequilibrium (LD)—reported for the first time for tetraploid potato—was observed up to approximately 5 cM using r 2 higher than 0.1 as a criterion for significant LD. Within-group LD, however, stretched on average twice as far when compared to overall LD. A Bayesian approach, a distance-based hierarchical clustering approach as well as principal coordinate analysis were adopted to enquire into population structure. Groups differing in year of market release and market segment (starch, processing industry and fresh consumption) were repeatedly detected. The observation of LD up to 5 cM is promising because the required marker density is not likely to disable the possibilities for association mapping research in tetraploid potato. Population structure appeared to be weak, but strong enough to demand careful modelling of genetic relationships in subsequent marker-trait association analyses. There seems to be a good chance that linkage-based marker-trait associations can be identified at moderate marker densities.  相似文献   

15.
Knowledge of population structure and linkage disequilibrium among the worldwide collections of peppers currently classified as hot, mild, sweet and ornamental types is indispensable for applying association mapping and genomic selection to improve pepper. The current study aimed to resolve the genetic diversity and relatedness of Capsicum annuum germplasm by use of simple sequence repeat (SSR) loci across all chromosomes in samples collected in 2011 and 2012. The physical distance covered by the entire set of SSRs used was 2,265.9 Mb from the 3.48-Gb hot-pepper genome size. The model-based program STRUCTURE was used to infer five clusters, which was further confirmed by classical molecular-genetic diversity analysis. Mean heterozygosity of various loci was estimated to be 0.15. Linkage disequilibrium (LD) was used to identify 17 LD blocks across various chromosomes with sizes from 0.154 Kb to 126.28 Mb. CAMS-142 of chromosome 1 was significantly associated with both capsaicin (CA) and dihydrocapsaicin (DCA) levels. Further, CAMS-142 was located in an LD block of 98.18 Mb. CAMS-142 amplified bands of 244, 268, 283 and 326 bp. Alleles 268 and 283 bp had positive effects on both CA and DCA levels, with an average R 2 of 12.15 % (CA) and 12.3 % (DCA). Eight markers from seven different chromosomes were significantly associated with fruit weight, contributing an average effect of 15 %. CAMS-199, HpmsE082 and CAMS-190 are the three major quantitative trait loci located on chromosomes 8, 9, and 10, respectively, and were associated with fruit weight in samples from both years of the study. This research demonstrates the effectiveness of using genome-wide SSR-based markers to assess features of LD and genetic diversity within C. annuum.  相似文献   

16.
17.
A set of 84 diverse rice genotypes were assessed for seedling stage salt tolerance and their genetic diversity using 41 polymorphic SSR markers comprising of 19 Saltol QTL linked and 22 random markers. Phenotypic screening under hydroponics identified three indica landraces (Badami, Shah Pasand and Pechi Badam), two Oryza rufipogon accessions (NKSWR2 and NKSWR17) and one each of Basmati rice (Seond Basmati) and japonica cultivars (Tompha Khau) as salt tolerant, having similar tolerance as of Pokkali and FL478. Among the salt tolerant genotypes, biomass showed positive correlation with shoot fresh weight and negative association with root and shoot Na+ content. The results indicated repression of Na+ loading within the tolerant plants. Linkage disequilibrium (LD) of the Saltol linked markers was weak, suggestive of high fragmentation of Pokkali haplotype, a result of evolutionary active recombination events. Poor haplotype structure of the Saltol region, may reduce its usefulness in marker assisted breeding programmes, if the target foreground markers chosen are wide apart. LD mapping identified eight robust marker-trait associations (QTLs), of which RM10927 was found linked to root and shoot Na+ content and RM10871 with shoot Na+/K+ ratio. RM271 on chromosome 10, an extra Saltol marker, was found associated to root Na+/K+ ratio. This marker showed a distinct allele among O. rufipogon accessions. There were also other novel loci detected on chromosomes 2, 5 and 10 influencing salt tolerance in the tested germplasm. Although Saltol remained as the key locus, the role of other genomic regions cannot be neglected in tailoring seedling stage salt tolerance in rice.  相似文献   

18.
The genetic diversity, population structure, and linkage disequilibrium (LD) of peaches are greatly important in genome-wide association mapping. In the current study, 104 peach landrace accessions from six Chinese geographical regions were evaluated for fruit and phenological period. The accessions were genotyped with 53 genome-wide simple sequence repeat (SSR) markers. All SSR markers were highly polymorphic across the accessions, and a total of 340 alleles were detected, including 59 private alleles. Of the six regions studied, the northern part of China as well as the middle and lower reaches of the Changjiang River were found to be the most highly diverse genetically. Based on population structure analysis, the peaches were divided into five groups, which well agreed with the geographical distribution. Of the SSR pairs in these accessions, 18.07% (P?<?0.05) were in LD. The mean r 2 value for all intrachromosomal loci pairs was 0.0149, and LD decayed at 6.01?cM. The general linear model was used to calculate the genome-wide marker-trait associations of 10 complex traits. The traits include flesh color around the stone, red pigment in the flesh, flesh texture, flesh adhesion, flesh firmness, fruit weight, chilling requirement, flowering time, ripening time, and fruit development period. These traits were estimated by analyzing the 104 landraces. Many of the associated markers were located in regions where quantitative trait loci (QTLs) were previously identified. Peach association mapping is an effective approach for identifying QTLs and may be an alternative to QTL mapping based on crosses between different lines.  相似文献   

19.
Genomic best linear unbiased prediction (BLUP) is a statistical method that uses relationships between individuals calculated from single-nucleotide polymorphisms (SNPs) to capture relationships at quantitative trait loci (QTL). We show that genomic BLUP exploits not only linkage disequilibrium (LD) and additive-genetic relationships, but also cosegregation to capture relationships at QTL. Simulations were used to study the contributions of those types of information to accuracy of genomic estimated breeding values (GEBVs), their persistence over generations without retraining, and their effect on the correlation of GEBVs within families. We show that accuracy of GEBVs based on additive-genetic relationships can decline with increasing training data size and speculate that modeling polygenic effects via pedigree relationships jointly with genomic breeding values using Bayesian methods may prevent that decline. Cosegregation information from half sibs contributes little to accuracy of GEBVs in current dairy cattle breeding schemes but from full sibs it contributes considerably to accuracy within family in corn breeding. Cosegregation information also declines with increasing training data size, and its persistence over generations is lower than that of LD, suggesting the need to model LD and cosegregation explicitly. The correlation between GEBVs within families depends largely on additive-genetic relationship information, which is determined by the effective number of SNPs and training data size. As genomic BLUP cannot capture short-range LD information well, we recommend Bayesian methods with t-distributed priors.  相似文献   

20.
A novel multiple regression method (RM) is developed to predict identity-by-descent probabilities at a locus L (IBDL), among individuals without pedigree, given information on surrounding markers and population history. These IBDL probabilities are a function of the increase in linkage disequilibrium (LD) generated by drift in a homogeneous population over generations. Three parameters are sufficient to describe population history: effective population size (Ne), number of generations since foundation (T), and marker allele frequencies among founders (p). IBDL are used in a simulation study to map a quantitative trait locus (QTL) via variance component estimation. RM is compared to a coalescent method (CM) in terms of power and robustness of QTL detection. Differences between RM and CM are small but significant. For example, RM is more powerful than CM in dioecious populations, but not in monoecious populations. Moreover, RM is more robust than CM when marker phases are unknown or when there is complete LD among founders or Ne is wrong, and less robust when p is wrong. CM utilises all marker haplotype information, whereas RM utilises information contained in each individual marker and all possible marker pairs but not in higher order interactions. RM consists of a family of models encompassing four different population structures, and two ways of using marker information, which contrasts with the single model that must cater for all possible evolutionary scenarios in CM.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号