首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The aim of this paper was to compare the effect of haplotype definition on the precision of QTL-mapping and on the accuracy of predicted genomic breeding values. In a multiple QTL model using identity-by-descent (IBD) probabilities between haplotypes, various haplotype definitions were tested i.e. including 2, 6, 12 or 20 marker alleles and clustering base haplotypes related with an IBD probability of > 0.55, 0.75 or 0.95. Simulated data contained 1100 animals with known genotypes and phenotypes and 1000 animals with known genotypes and unknown phenotypes. Genomes comprising 3 Morgan were simulated and contained 74 polymorphic QTL and 383 polymorphic SNP markers with an average r2 value of 0.14 between adjacent markers. The total number of haplotypes decreased up to 50% when the window size was increased from two to 20 markers and decreased by at least 50% when haplotypes related with an IBD probability of > 0.55 instead of > 0.95 were clustered. An intermediate window size led to more precise QTL mapping. Window size and clustering had a limited effect on the accuracy of predicted total breeding values, ranging from 0.79 to 0.81. Our conclusion is that different optimal window sizes should be used in QTL-mapping versus genome-wide breeding value prediction.  相似文献   

2.
A novel multiple regression method (RM) is developed to predict identity-by-descent probabilities at a locus L (IBDL), among individuals without pedigree, given information on surrounding markers and population history. These IBDL probabilities are a function of the increase in linkage disequilibrium (LD) generated by drift in a homogeneous population over generations. Three parameters are sufficient to describe population history: effective population size (Ne), number of generations since foundation (T), and marker allele frequencies among founders (p). IBDL are used in a simulation study to map a quantitative trait locus (QTL) via variance component estimation. RM is compared to a coalescent method (CM) in terms of power and robustness of QTL detection. Differences between RM and CM are small but significant. For example, RM is more powerful than CM in dioecious populations, but not in monoecious populations. Moreover, RM is more robust than CM when marker phases are unknown or when there is complete LD among founders or Ne is wrong, and less robust when p is wrong. CM utilises all marker haplotype information, whereas RM utilises information contained in each individual marker and all possible marker pairs but not in higher order interactions. RM consists of a family of models encompassing four different population structures, and two ways of using marker information, which contrasts with the single model that must cater for all possible evolutionary scenarios in CM.  相似文献   

3.
Meuwissen TH  Goddard ME 《Genetics》2000,155(1):421-430
A multimarker linkage disequilibrium mapping method was developed for the fine mapping of quantitative trait loci (QTL) using a dense marker map. The method compares the expected covariances between haplotype effects given a postulated QTL position to the covariances that are found in the data. The expected covariances between the haplotype effects are proportional to the probability that the QTL position is identical by descent (IBD) given the marker haplotype information, which is calculated using the genedropping method. Simulation results showed that a QTL was correctly positioned within a region of 3, 1.5, or 0.75 cM in 70, 62, and 68%, respectively, of the replicates using markers spaced at intervals of 1, 0.5, and 0.25 cM, respectively. These results were rather insensitive to the number of generations since the QTL occurred and to the effective population size, except that 10 generations yielded rather poor estimates of the QTL position. The position estimates of this multimarker disequilibrium mapping method were more accurate than those from a single marker transmission disequilibrium test. A general approach for identifying QTL is suggested, where several stages of disequilibrium mapping are used with increasingly dense marker spacing.  相似文献   

4.
Conditional probability methods for haplotyping in pedigrees   总被引:3,自引:0,他引:3  
Gao G  Hoeschele I  Sorensen P  Du F 《Genetics》2004,167(4):2055-2065
Efficient haplotyping in pedigrees is important for the fine mapping of quantitative trait locus (QTL) or complex disease genes. To reconstruct haplotypes efficiently for a large pedigree with a large number of linked loci, two algorithms based on conditional probabilities and likelihood computations are presented. The first algorithm (the conditional probability method) produces a single, approximately optimal haplotype configuration, with computing time increasing linearly in the number of linked loci and the pedigree size. The other algorithm (the conditional enumeration method) identifies a set of haplotype configurations with high probabilities conditional on the observed genotype data for a pedigree. Its computing time increases less than exponentially with the size of a subset of the set of person-loci with unordered genotypes and linearly with its complement. The size of the subset is controlled by a threshold parameter. The set of identified haplotype configurations can be used to estimate the identity-by-descent (IBD) matrix at a map position for a pedigree. The algorithms have been tested on published and simulated data sets. The new haplotyping methods are much faster and provide more information than several existing stochastic and rule-based methods. The accuracies of the new methods are equivalent to or better than those of these existing methods.  相似文献   

5.
Amplified fragment length polymorphisms (AFLPs) are a widely used marker system: the technique is very cost-effective, easy and rapid, and reproducibly generates hundreds of markers. Unfortunately, AFLP alleles are typically scored as the presence or absence of a band and, thus, heterozygous and dominant homozygous genotypes cannot be distinguished. This results in a significant loss of information, especially as regards mapping of quantitative trait loci (QTLs). We present a Monte Carlo Markov Chain method that allows us to compute the identity by descent probabilities (IBD) in a general pedigree whose individuals have been typed for dominant markers. The method allows us to include the information provided by the fluorescent band intensities of the markers, the rationale being that homozygous individuals have on average higher band intensities than heterozygous individuals, as well as information from linked markers in each individual and its relatives. Once IBD probabilities are obtained, they can be combined into the QTL mapping strategy of choice. We illustrate the method with two simulated populations: an outbred population consisting of full sib families, and an F2 cross between inbred lines. Two marker spacings were considered, 5 or 20 cM, in the outbred population. There was almost no difference, for the practical purpose of QTL estimation, between AFLPs and biallelic codominant markers when the band density is taken into account, especially at the 5 cM spacing. The performance of AFLPs every 5 cM was also comparable to that of highly polymorphic markers (microsatellites) spaced every 20 cM. In economic terms, QTL mapping with a dense map of AFLPs is clearly better than microsatellite QTL mapping and little is lost in terms of accuracy of position. Nevertheless, at low marker densities, AFLPs or other biallelic markers result in very inaccurate estimates of QTL position.  相似文献   

6.
Meuwissen TH  Goddard ME 《Genetics》2007,176(4):2551-2560
A novel multipoint method, based on an approximate coalescence approach, to analyze multiple linked markers is presented. Unlike other approximate coalescence methods, it considers all markers simultaneously but only two haplotypes at a time. We demonstrate the use of this method for linkage disequilibrium (LD) mapping of QTL and estimation of effective population size. The method estimates identity-by-descent (IBD) probabilities between pairs of marker haplotypes. Both LD and combined linkage and LD mapping rely on such IBD probabilities. The method is approximate in that it considers only the information on a pair of haplotypes, whereas a full modeling of the coalescence process would simultaneously consider all haplotypes. However, full coalescence modeling is computationally feasible only for few linked markers. Using simulations of the coalescence process, the method is shown to give almost unbiased estimates of the effective population size. Compared to direct marker and haplotype association analyses, IBD-based QTL mapping showed clearly a higher power to detect a QTL and a more realistic confidence interval for its position. The modeling of LD could be extended to estimate other LD-related parameters such as recombination rates.  相似文献   

7.
S. Xu  W. R. Atchley 《Genetics》1995,141(3):1189-1197
Mapping quantitative trait loci in outbred populations is important because many populations of organisms are noninbred. Unfortunately, information about the genetic architecture of the trait may not be available in outbred populations. Thus, the allelic effects of genes can not be estimated with ease. In addition, under linkage equilibrium, marker genotypes provide no information about the genotype of a QTL (our terminology for a single quantitative trait locus is QTL while multiple loci are referred to as QTLs). To circumvent this problem, an interval mapping procedure based on a random model approach is described. Under a random model, instead of estimating the effects, segregating variances of QTLs are estimated by a maximum likelihood method. Estimation of the variance component of a QTL depends on the proportion of genes identical-by-descent (IBD) shared by relatives at the locus, which is predicted by the IBD of two markers flanking the QTL. The marker IBD shared by two relatives are inferred from the observed marker genotypes. The procedure offers an advantage over the regression interval mapping in terms of high power and small estimation errors and provides flexibility for large sibships, irregular pedigree relationships and incorporation of common environmental and fixed effects.  相似文献   

8.
A linkage disequilibrium-based method for fine mapping quantitative trait loci (QTL) has been described that uses similarity between individuals' marker haplotypes to determine if QTL alleles are identical by descent (IBD) to model covariances among individuals' QTL alleles for a mixed linear model. Mapping accuracy with this method was found to be sensitive to the number of linked markers that was included in the haplotype when fitting the model at a putative position of the QTL. The objective of this study was to determine the optimal haplotype structure for this IBD-based method for fine mapping a QTL in a previously identified QTL region. Haplotypes consisting of 1, 2, 4, 6, or all 10 available markers were fit as a "sliding window" across the QTL region under ideal and nonideal simulated population conditions. It was found that using haplotypes of 4 or 6 markers as a sliding "window" resulted in the greatest mapping accuracy under nearly all conditions, although the true IBD state at a putative QTL position was most accurately predicted by IBD probabilities obtained using all markers. Using 4 or 6 markers resulted in greater discrimination of IBD probabilities between positions while maintaining sufficient accuracy of IBD probabilities to detect the QTL. Fitting IBD probabilities on the basis of a single marker resulted in the worst mapping accuracy under all conditions because it resulted in poor accuracy of IBD probabilities. In conclusion, for fine mapping using IBD methods, marker information must be used in a manner that results in sensitivity of IBD probabilities to the putative position of the QTL while maintaining sufficient accuracy of IBD probabilities to detect the QTL. Contrary to expectation, use of haplotypes of 4-6 markers to derive IBD probabilities, rather than all available markers, best fits these criteria. Thus for populations similar to those simulated here, optimal mapping accuracy for this IBD-based fine-mapping method is obtained with a haplotype structure including a subset of all available markers.  相似文献   

9.
A novel and robust method for the fine-scale mapping of genes affecting complex traits, which combines linkage and linkage-disequilibrium information, is proposed. Linkage information refers to recombinations within the marker-genotyped generations and linkage disequilibrium to historical recombinations before genotyping started. The identity-by-descent (IBD) probabilities at the quantitative trait locus (QTL) between first generation haplotypes were obtained from the similarity of the marker alleles surrounding the QTL, whereas IBD probabilities at the QTL between later generation haplotypes were obtained by using the markers to trace the inheritance of the QTL. The variance explained by the QTL is estimated by residual maximum likelihood using the correlation structure defined by the IBD probabilities. Unlinked background genes were accounted for by fitting a polygenic variance component. The method was used to fine map a QTL for twinning rate in cattle, previously mapped on chromosome 5 by linkage analysis. The data consisted of large half-sib families, but the method could also handle more complex pedigrees. The likelihood of the putative QTL was very small along most of the chromosome, except for a sharp likelihood peak in the ninth marker bracket, which positioned the QTL within a region <1 cM in the middle part of bovine chromosome 5. The method was expected to be robust against multiple genes affecting the trait, multiple mutations at the QTL, and relatively low marker density.  相似文献   

10.
Prediction of multilocus identity-by-descent   总被引:2,自引:1,他引:1  
Hill WG  Hernández-Sánchez J 《Genetics》2007,176(4):2307-2315
Previous studies have enabled exact prediction of probabilities of identity-by-descent (IBD) in random-mating populations for a few loci (up to four or so), with extension to more using approximate regression methods. Here we present a precise predictor of multiple-locus IBD using simple formulas based on exact results for two loci. In particular, the probability of non-IBD X(ABC) at each of ordered loci A, B, and C can be well approximated by X(ABC) = X(AB)X(BC)/X(B) and generalizes to X(123...k) = X(12)X(23...)X(k)(-1,k)/X(k-2), where X is the probability of non-IBD at each locus. Predictions from this chain rule are very precise with population bottlenecks and migration, but are rather poorer in the presence of mutation. From these coefficients, the probabilities of multilocus IBD and non-IBD can also be computed for genomic regions as functions of population size, time, and map distances. An approximate but simple recurrence formula is also developed, which generally is less accurate than the chain rule but is more robust with mutation. Used together with the chain rule it leads to explicit equations for non-IBD in a region. The results can be applied to detection of quantitative trait loci (QTL) by computing the probability of IBD at candidate loci in terms of identity-by-state at neighboring markers.  相似文献   

11.
A fast, partly recursive deterministic method for calculating Identity-by-Descent (IBD) probabilities was developed with the objective of using IBD in Quantitative Trait Locus (QTL) mapping. The method combined a recursive method for a single marker locus with a method to estimate IBD between sibs using multiple markers. Simulated data was used to compare the deterministic method developed in the present paper with a stochastic method (LOKI) for precision in estimating IBD probabilities and performance in the task of QTL detection with the variance component approach. This comparison was made in a variety of situations by varying family size and degree of polymorphism among marker loci. The following were observed for the deterministic method relative to MCMC: (i) it was an order of magnitude faster; (ii) its estimates of IBD probabilities were found to agree closely, even though it does not extract information when haplotypes are not known with certainty; (iii) the shape of the profile for the QTL test statistic as a function of location was similar, although the magnitude of the test statistic was slightly smaller; and (iv) the estimates of QTL variance was similar. It was concluded that the method proposed provided a rapid means of calculating the IBD matrix with only a small loss in precision, making it an attractive alternative to the use of stochastic MCMC methods. Furthermore, developments in marker technology providing denser maps would enhance the relative advantage of this method.  相似文献   

12.
Zhao HH  Fernando RL  Dekkers JC 《Genetics》2007,175(4):1975-1986
Linkage disequilibrium (LD) analysis in outbred populations uses historical recombinations to detect and fine map quantitative trait loci (QTL). Our objective was to evaluate the effect of various factors on power and precision of QTL detection and to compare LD mapping methods on the basis of regression and identity by descent (IBD) in populations of limited effective population size (N(e)). An 11-cM region with 6-38 segregating single-nucleotide polymorphisms (SNPs) and a central QTL was simulated. After 100 generations of random mating with N(e) of 50, 100, or 200, SNP genotypes and phenotypes were generated on 200, 500, or 1000 individuals with the QTL explaining 2 or 5% of phenotypic variance. To detect and map the QTL, phenotypes were regressed on genotypes or (assumed known) haplotypes, in comparison with the IBD method. Power and precision to detect QTL increased with sample size, marker density, and QTL effect. Power decreased with N(e), but precision was affected little by N(e). Single-marker regression had similar or greater power and precision than other regression models, and was comparable to the IBD method. Thus, for rapid initial screening of samples of adequate size in populations in which drift is the primary force that has created LD, QTL can be detected and mapped by regression on SNP genotypes without recovering haplotypes.  相似文献   

13.
The common assumption in quantitative trait locus (QTL) linkage mapping studies that parents of multiple connected populations are unrelated is unrealistic for many plant breeding programs. We remove this assumption and propose a Bayesian approach that clusters the alleles of the parents of the current mapping populations from locus-specific identity by descent (IBD) matrices that capture ancestral marker and pedigree information. Moreover, we demonstrate how the parental IBD data can be incorporated into a QTL linkage analysis framework by using two approaches: a Threshold IBD model (TIBD) and a Latent Ancestral Allele Model (LAAM). The TIBD and LAAM models are empirically tested via numerical simulation based on the structure of a commercial maize breeding program. The simulations included a pilot dataset with closely linked QTL on a single linkage group and 100 replicated datasets with five linkage groups harboring four unlinked QTL. The simulation results show that including parental IBD data (similarly for TIBD and LAAM) significantly improves the power and particularly accuracy of QTL mapping, e.g., position, effect size and individuals’ genotype probability without significantly increasing computational demand.  相似文献   

14.
A multiple marker least squares approach is presented for the analysis of a single three-generation pedigree for quantitative trait locus (QTL) characterisation. It is an extension of the approach by Haley et?al. (1994) to the situation where grandparents cannot be assumed to be homozygous at QTLs for the trait of interest. The method is applied to the analysis of wood specific gravity in loblolly pine (Pinus taeda L.). Within a similar framework, a series of preliminary analyses are carried out, followed by a more detailed search of the genome for one or more QTLs. The preliminary analyses provide information about whether the contribution from each linkage group appears to be polygenic, localised to a small region (e.g. a single QTL) or oligogenic (i.e. several QTLs). Significance levels are obtained using a permutation test that uses the observed phenotypes and marker genotypes. The conclusion of these analyses is that in this pedigree single QTLs with very large effect on wood specific gravity do not appear to be segregating, although there is evidence for QTLs with small effect. Finally, in order to assess the potential power of this pedigree, we simulated QTLs within the framework of the actual marker data. As expected, QTL effects would need to be large to be reliably detected in this study, and the power to detect QTLs varies at different positions in the genome depending on the level of information in the local markers.  相似文献   

15.
Although molecular markers allow more accurate selection in early generations than conventional screens, large numbers can make selection impracticable while screening in later generations may provide little or no advantage over conventional selection techniques. Investigation of different crossing strategies and consideration of when to screen, what proportion to retain and the impacts of dominant vs. codominant marker expression revealed important choices in the design of marker-assisted selection programs that can produce large efficiency gains. Using F2 enrichment increased the frequency of selected alleles allowing large reductions in minimum population size for recovery of target genotypes (commonly around 90%) and/or selection at a greater number of loci. Increasing homozygosity by inbreeding from F2 to F2:3 also reduced population size by around 90% in some crosses with smaller incremental reductions in subsequent generations. Backcrossing was found to be a useful strategy to reduce population size compared with a biparental population where one parent contributed more target alleles than the other and was complementary to F2 enrichment and increasing homozygosity. Codominant markers removed the need for progeny testing reducing the number of individuals that had to be screened to identify a target genotype. However, although codominant markers allow target alleles to be fixed in early generations, minimum population sizes are often so large in F2 that it is not efficient to do so at this stage. Formulae and tables for calculating genotypic frequencies and minimum population sizes are provided to allow extension to different breeding systems, numbers of target loci, and probabilities of failure. Principles outlined are applicable to implementation of markers for both quantitative trait loci (QTL) and major genes.  相似文献   

16.
Association or linkage disequilibrium (LD)-based mapping strategies are receiving increased attention for the identification of quantitative trait loci (QTL) in plants as an alternative to more traditional, purely linkage-based approaches. An attractive property of association approaches is that they do not require specially designed crosses between inbred parents, but can be applied to collections of genotypes with arbitrary and often unknown relationships between the genotypes. A less obvious additional attractive property is that association approaches offer possibilities for QTL identification in crops with hard to model segregation patterns. The availability of candidate genes and targeted marker systems facilitates association approaches, as will appropriate methods of analysis. We propose an association mapping approach based on mixed models with attention to the incorporation of the relationships between genotypes, whether induced by pedigree, population substructure, or otherwise. Furthermore, we emphasize the need to pay attention to the environmental features of the data as well, i.e., adequate representation of the relations among multiple observations on the same genotypes. We illustrate our modeling approach using 25 years of Dutch national variety list data on late blight resistance in the genetically complex crop of potato. As markers, we used nucleotide binding-site markers, a specific type of marker that targets resistance or resistance-analog genes. To assess the consistency of QTL identified by our mixed-model approach, a second independent data set was analyzed. Two markers were identified that are potentially useful in selection for late blight resistance in potato.  相似文献   

17.

Background

Populational linkage disequilibrium and within-family linkage are commonly used for QTL mapping and marker assisted selection. The combination of both results in more robust and accurate locations of the QTL, but models proposed so far have been either single marker, complex in practice or well fit to a particular family structure.

Results

We herein present linear model theory to come up with additive effects of the QTL alleles in any member of a general pedigree, conditional to observed markers and pedigree, accounting for possible linkage disequilibrium among QTLs and markers. The model is based on association analysis in the founders; further, the additive effect of the QTLs transmitted to the descendants is a weighted (by the probabilities of transmission) average of the substitution effects of founders'' haplotypes. The model allows for non-complete linkage disequilibrium QTL-markers in the founders. Two submodels are presented: a simple and easy to implement Haley-Knott type regression for half-sib families, and a general mixed (variance component) model for general pedigrees. The model can use information from all markers. The performance of the regression method is compared by simulation with a more complex IBD method by Meuwissen and Goddard. Numerical examples are provided.

Conclusion

The linear model theory provides a useful framework for QTL mapping with dense marker maps. Results show similar accuracies but a bias of the IBD method towards the center of the region. Computations for the linear regression model are extremely simple, in contrast with IBD methods. Extensions of the model to genomic selection and multi-QTL mapping are straightforward.  相似文献   

18.
A novel multitrait fine-mapping method is presented. The method is implemented by a model that treats QTL effects as random variables. The covariance matrix of allelic effects is proportional to the IBD matrix, where each element is the probability that a pair of alleles is identical by descent, given marker information and QTL position. These probabilities are calculated on the basis of similarities of marker haplotypes of individuals of the first generation of genotyped individuals, using "gene dropping" (linkage disequilibrium) and transmission of markers from genotyped parents to genotyped offspring (linkage). A small simulation study based on a granddaughter design was carried out to illustrate that the method provides accurate estimates of QTL position. Results from the simulation also indicate that it is possible to distinguish between a model postulating one pleiotropic QTL affecting two traits vs. one postulating two closely linked loci, each affecting one of the traits.  相似文献   

19.
R L Wu 《Genetics》1999,152(4):1741-1752
Mapping strategies based on a half- or full-sib family design have been developed to map quantitative trait loci (QTL) for outcrossing species. However, these strategies are dependent on controlled crosses where marker-allelic frequency and linkage disequilibrium between the marker and QTL may limit their application. In this article, a maximum-likelihood method is developed to map QTL segregating in an open-pollinated progeny population using dominant markers derived from haploid tissues from single meiotic events. Results from the haploid-based mapping strategy are not influenced by the allelic frequencies of markers and their linkage disequilibria with QTL, because the probabilities of QTL genotypes conditional on marker genotypes of haploid tissues are independent of these population parameters. Parameter estimation and hypothesis testing are implemented via expectation/conditional maximization algorithm. Parameters estimated include the additive effect, the dominant effect, the population mean, the chromosomal location of the QTL in the interval, and the residual variance within the QTL genotypes, plus two population parameters, outcrossing rate and QTL-allelic frequency. Simulation experiments show that the accuracy and power of parameter estimates are affected by the magnitude of QTL effects, heritability levels of a trait, and sample sizes used. The application and limitation of the method are discussed.  相似文献   

20.
Lee SH  Van der Werf JH 《Genetics》2006,174(2):1009-1016
Dominance (intralocus allelic interactions) plays often an important role in quantitative trait variation. However, few studies about dominance in QTL mapping have been reported in outbred animal or human populations. This is because common dominance effects can be predicted mainly for many full sibs, which do not often occur in outbred or natural populations with a general pedigree. Moreover, incomplete genotypes for such a pedigree make it infeasible to estimate dominance relationship coefficients between individuals. In this study, identity-by-descent (IBD) coefficients are estimated on the basis of population-wide linkage disequilibrium (LD), which makes it possible to track dominance relationships between unrelated founders. Therefore, it is possible to use dominance effects in QTL mapping without full sibs. Incomplete genotypes with a complex pedigree and many markers can be efficiently dealt with by a Markov chain Monte Carlo method for estimating IBD and dominance relationship matrices (D(RM)). It is shown by simulation that the use of D(RM) increases the likelihood ratio at the true QTL position and the mapping accuracy and power with complete dominance, overdominance, and recessive inheritance modes when using 200 genotyped and phenotyped individuals.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号