首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 93 毫秒
1.
多QTL定位的压缩估计方法   总被引:1,自引:0,他引:1  
章元明 《遗传学报》2006,33(10):861-869
本文综述了多标记分析和多QTL定位的压缩估计方法。对于前者,Xu(Genetics,2003,163:789—801)首先提出了Bayesian压缩估计方法。其关键在于让每个效应有一个特定的方差参数,而该方差又服从一定的先验分布,以致能从资料中估计之。由此,能够同时估计大量分子标记基因座的遗传效应,即使大多数标记的效应是可忽略的。然而,对于上位性遗传模型,其运算时间还是过长。为此,笔者将上述思想嵌入极大似然法,提出了惩罚最大似然方法。模拟研究显示:该方法能处理变量个数大于样本容量10倍左右的线性遗传模型。对于后者,本文详细介绍了基于固定区间和可变区间的Bayesian压缩估计方法。固定区间方法可处理中等密度的分子标记资料;可变区间方法则可分析高密度分子标记资料,甚至是上位性遗传模型。对于上位性检测,已介绍的惩罚最大似然方法和可变区间Bayesian压缩估计方法可供利用。应当指出,压缩估计方法在今后的eQTL和QTN定位以及基因互作网络分析等研究中也是有应用价值的。  相似文献   

2.
In this article, shrinkage estimation method for multiple-marker analysis and for mapping multiple quantitative trait loci (QTL) was reviewed. For multiple-marker analysis, Xu (Genetics, 2003, 163:789-801) developed a Bayesian shrinkage estimation (BSE) method. The key to the success of this method is to allow each marker effect have its own variance parameter, which in turn has its own prior distribution so that the variance can be estimated from the data. Under this hierarchical model, a large number of markers can be handled although most of them may have negligible effects. Under epistatic genetic model, however, the running time is very long. To overcome this problem, a novel method of incorporating the idea described above into maximum likelihood, known as penalized likelihood method, was proposed. A simulated study showed that this method can handle a model with multiple effects, which are ten times larger than the sample size. For multiple QTL analysis, two modified versions for the BSE method were introduced: one is the fixed-interval method and another is the variable-interval method. The former deals with markers with intermediate density, and the latter can handle markers with extremely high density as well as model with epistatic effects. For the detection of epistatic effects, penalized likelihood method and the variable-interval approach of the BSE method are available.  相似文献   

3.
Lide Han  Shizhong Xu 《Genetica》2010,138(9-10):1099-1109
The identity-by-descent (IBD) based variance component analysis is an important method for mapping quantitative trait loci (QTL) in outbred populations. The interval-mapping approach and various modified versions of it may have limited use in evaluating the genetic variances of the entire genome because they require evaluation of multiple models and model selection. In this study, we developed a multiple variance component model for genome-wide evaluation using both the maximum likelihood (ML) method and the MCMC implemented Bayesian method. We placed one QTL in every few cM on the entire genome and estimated the QTL variances and positions simultaneously in a single model. Genomic regions that have no QTL usually showed no evidence of QTL while regions with large QTL always showed strong evidence of QTL. While the Bayesian method produced the optimal result, the ML method is computationally more efficient than the Bayesian method. Simulation experiments were conducted to demonstrate the efficacy of the new methods.  相似文献   

4.
Yi N  Xu S 《Genetics》2000,156(1):411-422
Variance component analysis of quantitative trait loci (QTL) is an important strategy of genetic mapping for complex traits in humans. The method is robust because it can handle an arbitrary number of alleles with arbitrary modes of gene actions. The variance component method is usually implemented using the proportion of alleles with identity-by-descent (IBD) shared by relatives. As a result, information about marker linkage phases in the parents is not required. The method has been studied extensively under either the maximum-likelihood framework or the sib-pair regression paradigm. However, virtually all investigations are limited to normally distributed traits under a single QTL model. In this study, we develop a Bayes method to map multiple QTL. We also extend the Bayesian mapping procedure to identify QTL responsible for the variation of complex binary diseases in humans under a threshold model. The method can also treat the number of QTL as a parameter and infer its posterior distribution. We use the reversible jump Markov chain Monte Carlo method to infer the posterior distributions of parameters of interest. The Bayesian mapping procedure ends with an estimation of the joint posterior distribution of the number of QTL and the locations and variances of the identified QTL. Utilities of the method are demonstrated using a simulated population consisting of multiple full-sib families.  相似文献   

5.
Bayesian shrinkage estimation of quantitative trait loci parameters   总被引:13,自引:0,他引:13       下载免费PDF全文
Wang H  Zhang YM  Li X  Masinde GL  Mohan S  Baylink DJ  Xu S 《Genetics》2005,170(1):465-480
Mapping multiple QTL is a typical problem of variable selection in an oversaturated model because the potential number of QTL can be substantially larger than the sample size. Currently, model selection is still the most effective approach to mapping multiple QTL, although further research is needed. An alternative approach to analyzing an oversaturated model is the shrinkage estimation in which all candidate variables are included in the model but their estimated effects are forced to shrink toward zero. In contrast to the usual shrinkage estimation where all model effects are shrunk by the same factor, we develop a Bayesian method that allows the shrinkage factor to vary across different effects. The new shrinkage method forces marker intervals that contain no QTL to have estimated effects close to zero whereas intervals containing notable QTL have estimated effects subject to virtually no shrinkage. We demonstrate the method using both simulated and real data for QTL mapping. A simulation experiment with 500 backcross (BC) individuals showed that the method can localize closely linked QTL and QTL with effects as small as 1% of the phenotypic variance of the trait. The method was also used to map QTL responsible for wound healing in a family of a (MRL/MPJ x SJL/J) cross with 633 F(2) mice derived from two inbred lines.  相似文献   

6.
 A common problem in mapping quantitative trait loci (QTLs) is that marker data are often incomplete. This includes missing data, dominant markers, and partially informative markers, arising in outbred populations. Here we briefly present an iteratively re-weighted least square method (IRWLS) to incorporate dominant and missing markers for mapping QTLs in four-way crosses under a heterogeneous variance model. The algorithm uses information from all markers in a linkage group to infer the QTL genotype. Monte Carlo simulations indicate that with half dominant markers, QTL detection is almost as efficient as with all co-dominant markers. However, the precision of the estimated QTL parameters generally decreases as more markers become missing or dominant. Notable differences are observed on the standard deviation of the estimated QTL position for varying levels of marker information content. The method is relatively simple so that more complex models including multiple QTLs or fixed effects can be fitted. Finally, the method can be readily extended to QTL mapping in full-sib families. Received: 16 June 1998 / Accepted: 29 September 1998  相似文献   

7.
Summary The decision of whether or not to use QTLassociated markers in breeding programs needs further information about the magnitude of the additive and dominance effects that can be estimated. The objectives of this paper are (1) to apply some of the Moreno-Gonzalez (1993) genetic models to backcross simulation data generated by the Monte Carlo method, and (2) to get simulation information about the number of testing progenies and mapping density in relation to the magnitude of gene effect estimates. Results of the Monte Carlo study show that the stepwise regression analysis was able to detect relatively small additive and dominance effects when the QTL are independently segregating. When testing selfed families derived from backcross individuals, dominance effects had a larger error standard deviation and were estimated at a lower frequency. Linked QTL require a higher marker mapping density on the genome and a larger number of progenies to detect small genetic effects. Reduction of the environmental error variance by evaluating selfed backcross families in replicate experiments increased the power of the test. Expressions of the number of progenies for detecting significant additive effects were developed for some genetic situations. The ratio of the within-backcross genetic variance to the square of a gene effect estimate is a function of the number of progenies, the heritability of the trait, the marker map density and the portion of the genetic variance explained by the model. Different values (from 0 to 1) assigned to (relative position of the QTL in the marker segment) did not cause a large shift in the residual mean square of the model.  相似文献   

8.
Bayesian LASSO for quantitative trait loci mapping   总被引:7,自引:1,他引:6       下载免费PDF全文
Yi N  Xu S 《Genetics》2008,179(2):1045-1055
The mapping of quantitative trait loci (QTL) is to identify molecular markers or genomic loci that influence the variation of complex traits. The problem is complicated by the facts that QTL data usually contain a large number of markers across the entire genome and most of them have little or no effect on the phenotype. In this article, we propose several Bayesian hierarchical models for mapping multiple QTL that simultaneously fit and estimate all possible genetic effects associated with all markers. The proposed models use prior distributions for the genetic effects that are scale mixtures of normal distributions with mean zero and variances distributed to give each effect a high probability of being near zero. We consider two types of priors for the variances, exponential and scaled inverse-chi(2) distributions, which result in a Bayesian version of the popular least absolute shrinkage and selection operator (LASSO) model and the well-known Student's t model, respectively. Unlike most applications where fixed values are preset for hyperparameters in the priors, we treat all hyperparameters as unknowns and estimate them along with other parameters. Markov chain Monte Carlo (MCMC) algorithms are developed to simulate the parameters from the posteriors. The methods are illustrated using well-known barley data.  相似文献   

9.
Without consideration of other linked QTLs responsible for dynamic trait, original functional mapping based on a single QTL model is not optimal for analyzing multiple dynamic trait loci. Despite that composite functional mapping incorporates the effects of genetic background outside the tested QTL in mapping model, the arbitrary choice of background markers also impact on the power of QTL detection. In this study, we proposed Bayesian functional mapping strategy that can simultaneously identify multiple QTL controlling developmental patterns of dynamic traits over the genome. Our proposed method fits the change of each QTL effect with the time by Legendre polynomial and takes the residual covariance structure into account using the first autoregressive equation. Also, Bayesian shrinkage estimation was employed to estimate the model parameters. Especially, we specify the gamma distribution as the prior for the first-order auto-regressive coefficient, which will guarantee the convergence of Bayesian sampling. Simulations showed that the proposed method could accurately estimate the QTL parameters and had a greater statistical power of QTL detection than the composite functional mapping. A real data analysis of leaf age growth in rice is used for the demonstration of our method. It shows that our Bayesian functional mapping can detect more QTLs as compared to composite functional mapping.  相似文献   

10.
Summary A Bayesian method was developed for identifying genetic markers linked to quantitative trait loci (QTL) by analyzing data from daughter or granddaughter designs and single markers or marker pairs. Traditional methods may yield unrealistic results because linkage tests depend on number of markers and QTL gene effects associated with selected markers are overestimated. The Bayesian or posterior probability of linkage combines information from a daughter or granddaughter design with the prior probability of linkage between a marker locus and a QTL. If the posterior probability exceeds a certain quantity, linkage is declared. Upon linkage acceptance, Bayesian estimates of marker-QTL recombination rate and QTL gene effects and frequencies are obtained. The Bayesian estimates of QTL gene effects account for different amounts of information by shrinking information from data toward the mean or mode of a prior exponential distribution of gene effects. Computation of the Bayesian analysis is feasible. Exact results are given for biallelic QTL, and extensions to multiallelic QTL are suggested.  相似文献   

11.
Genomewide scans for mapping loci have proved to be extremely powerful and popular. We present a semiparametric method of mapping a quantitative-trait locus (QTL) or QTLs with the use of sib-pair data generated from a two-stage genomic scan. In a two-stage genomic scan, either the entire genome or a large portion of the genome is saturated with low-density markers at the first stage. At the second stage, the intervals that are identified as probable locations of the trait loci, by means of analysis of data from the first stage, are then saturated with higher-density markers. These data are then analyzed for fine mapping of the loci. Our statistical strategy for analysis of data from the first stage is a low-stringency method based on the rank correlation of squared trait-difference values of the sib pairs and the estimated identity-by-descent scores at the marker loci. We suggest the use of a low-stringency method at the first stage, to save on computational time and to avoid missing any marker interval that may contain the trait loci. For analysis of data from the second stage, we have developed a high-stringency nonparametric-regression approach, using the kernel-smoothing technique. Through extensive simulations, we show that this approach is more powerful than is a currently used method for mapping QTLs by use of sib pairs, particularly in the presence of dominance and epistatic effects at the trait loci.  相似文献   

12.
Bayesian mapping of quantitative trait loci for complex binary traits   总被引:13,自引:0,他引:13  
Yi N  Xu S 《Genetics》2000,155(3):1391-1403
A complex binary trait is a character that has a dichotomous expression but with a polygenic genetic background. Mapping quantitative trait loci (QTL) for such traits is difficult because of the discrete nature and the reduced variation in the phenotypic distribution. Bayesian statistics are proved to be a powerful tool for solving complicated genetic problems, such as multiple QTL with nonadditive effects, and have been successfully applied to QTL mapping for continuous traits. In this study, we show that Bayesian statistics are particularly useful for mapping QTL for complex binary traits. We model the binary trait under the classical threshold model of quantitative genetics. The Bayesian mapping statistics are developed on the basis of the idea of data augmentation. This treatment allows an easy way to generate the value of a hypothetical underlying variable (called the liability) and a threshold, which in turn allow the use of existing Bayesian statistics. The reversible jump Markov chain Monte Carlo algorithm is used to simulate the posterior samples of all unknowns, including the number of QTL, the locations and effects of identified QTL, genotypes of each individual at both the QTL and markers, and eventually the liability of each individual. The Bayesian mapping ends with an estimation of the joint posterior distribution of the number of QTL and the locations and effects of the identified QTL. Utilities of the method are demonstrated using a simulated outbred full-sib family. A computer program written in FORTRAN language is freely available on request.  相似文献   

13.
Fine mapping of quantitative trait loci (QTL) from previous linkage studies was performed on pig chromosomes 1, 4, 7, 8, 17, and X which were known to harbor QTL. Traits were divided into: growth performance, carcass, internal organs, cut yields, and meat quality. Fifty families were used of a F2 population produced by crossing local Brazilian Piau boars with commercial sows. The linkage map consisted of 237 SNP and 37 microsatellite markers covering 866 centimorgans. QTL were identified by regression interval mapping using GridQTL. Individual marker effects were estimated by Bayesian LASSO regression using R. In total, 32 QTL affecting the evaluated traits were detected along the chromosomes studied. Seven of the QTL were known from previous studies using our F2 population, and 25 novel QTL resulted from the increased marker coverage. Six of the seven QTL that were significant at the 5% genome-wide level had SNPs within their confidence interval whose effects were among the 5% largest effects. The combined use of microsatellites along with SNP markers increased the saturation of the genome map and led to smaller confidence intervals of the QTL. The results showed that the tested models yield similar improvements in QTL mapping accuracy.  相似文献   

14.
Z Hu  Z Wang  S Xu 《PloS one》2012,7(7):e41336
We developed a marker based infinitesimal model for quantitative trait analysis. In contrast to the classical infinitesimal model, we now have new information about the segregation of every individual locus of the entire genome. Under this new model, we propose that the genetic effect of an individual locus is a function of the genome location (a continuous quantity). The overall genetic value of an individual is the weighted integral of the genetic effect function along the genome. Numerical integration is performed to find the integral, which requires partitioning the entire genome into a finite number of bins. Each bin may contain many markers. The integral is approximated by the weighted sum of all the bin effects. We now turn the problem of marker analysis into bin analysis so that the model dimension has decreased from a virtual infinity to a finite number of bins. This new approach can efficiently handle virtually unlimited number of markers without marker selection. The marker based infinitesimal model requires high linkage disequilibrium of all markers within a bin. For populations with low or no linkage disequilibrium, we develop an adaptive infinitesimal model. Both the original and the adaptive models are tested using simulated data as well as beef cattle data. The simulated data analysis shows that there is always an optimal number of bins at which the predictability of the bin model is much greater than the original marker analysis. Result of the beef cattle data analysis indicates that the bin model can increase the predictability from 10% (multiple marker analysis) to 33% (multiple bin analysis). The marker based infinitesimal model paves a way towards the solution of genetic mapping and genomic selection using the whole genome sequence data.  相似文献   

15.
Prediction of total genetic value using genome-wide dense marker maps   总被引:63,自引:0,他引:63  
Meuwissen TH  Hayes BJ  Goddard ME 《Genetics》2001,157(4):1819-1829
Recent advances in molecular genetic techniques will make dense marker maps available and genotyping many individuals for these markers feasible. Here we attempted to estimate the effects of approximately 50,000 marker haplotypes simultaneously from a limited number of phenotypic records. A genome of 1000 cM was simulated with a marker spacing of 1 cM. The markers surrounding every 1-cM region were combined into marker haplotypes. Due to finite population size N(e) = 100, the marker haplotypes were in linkage disequilibrium with the QTL located between the markers. Using least squares, all haplotype effects could not be estimated simultaneously. When only the biggest effects were included, they were overestimated and the accuracy of predicting genetic values of the offspring of the recorded animals was only 0.32. Best linear unbiased prediction of haplotype effects assumed equal variances associated to each 1-cM chromosomal segment, which yielded an accuracy of 0.73, although this assumption was far from true. Bayesian methods that assumed a prior distribution of the variance associated with each chromosome segment increased this accuracy to 0.85, even when the prior was not correct. It was concluded that selection on genetic values predicted from markers could substantially increase the rate of genetic gain in animals and plants, especially if combined with reproductive techniques to shorten the generation interval.  相似文献   

16.
Yang R  Xu S 《Genetics》2007,176(2):1169-1185
Many quantitative traits are measured repeatedly during the life of an organism. Such traits are called dynamic traits. The pattern of the changes of a dynamic trait is called the growth trajectory. Studying the growth trajectory may enhance our understanding of the genetic architecture of the growth trajectory. Recently, we developed an interval-mapping procedure to map QTL for dynamic traits under the maximum-likelihood framework. We fit the growth trajectory by Legendre polynomials. The method intended to map one QTL at a time and the entire QTL analysis involved scanning the entire genome by fitting multiple single-QTL models. In this study, we propose a Bayesian shrinkage analysis for estimating and mapping multiple QTL in a single model. The method is a combination between the shrinkage mapping for individual quantitative traits and the Legendre polynomial analysis for dynamic traits. The multiple-QTL model is implemented in two ways: (1) a fixed-interval approach where a QTL is placed in each marker interval and (2) a moving-interval approach where the position of a QTL can be searched in a range that covers many marker intervals. Simulation study shows that the Bayesian shrinkage method generates much better signals for QTL than the interval-mapping approach. We propose several alternative methods to present the results of the Bayesian shrinkage analysis. In particular, we found that the Wald test-statistic profile can serve as a mechanism to test the significance of a putative QTL.  相似文献   

17.
A Bayesian approach is presented for mapping a quantitative trait locus (QTL) using the 'Fernando and Grossman' multivariate Normal approximation to QTL inheritance. For this model, a Bayesian implementation that includes QTL position is problematic because standard Markov chain Monte Carlo (MCMC) algorithms do not mix, i.e. the QTL position gets stuck in one marker interval. This is because of the dependence of the covariance structure for the QTL effects on the adjacent markers and may be typical of the 'Fernando and Grossman' model. A relatively new MCMC technique, simulated tempering, allows mixing and so makes possible inferences about QTL position based on marginal posterior probabilities. The model was implemented for estimating variance ratios and QTL position using a continuous grid of allowed positions and was applied to simulated data of a standard granddaughter design. The results showed a smooth mixing of QTL position after implementation of the simulated tempering sampler. In this implementation, map distance between QTL and its flanking markers was artificially stretched to reduce the dependence of markers and covariance. The method generalizes easily to more complicated applications and can ultimately contribute to QTL mapping in complex, heterogeneous, human, animal or plant populations.  相似文献   

18.
Pérez-Enciso M 《Genetics》2003,163(4):1497-1510
We present a Bayesian method that combines linkage and linkage disequilibrium (LDL) information for quantitative trait locus (QTL) mapping. This method uses jointly all marker information (haplotypes) and all available pedigree information; i.e., it is not restricted to any specific experimental design and it is not required that phases are known. Infinitesimal genetic effects or environmental noise ("fixed") effects can equally be fitted. A diallelic QTL is assumed and both additive and dominant effects can be estimated. We have implemented a combined Gibbs/Metropolis-Hastings sampling to obtain the marginal posterior distributions of the parameters of interest. We have also implemented a Bayesian variant of usual disequilibrium measures like D' and r(2) between QTL and markers. We illustrate the method with simulated data in "simple" (two-generation full-sib families) and "complex" (four-generation) pedigrees. We compared the estimates with and without using linkage disequilibrium information. In general, using LDL resulted in estimates of QTL position that were much better than linkage-only estimates when there was complete disequilibrium between the mutant QTL allele and the marker. This advantage, however, decreased when the association was only partial. In all cases, additive and dominant effects were estimated accurately either with or without disequilibrium information.  相似文献   

19.
In most quantitative trait loci (QTL) mapping studies, phenotypes are assumed to follow normal distributions. Deviations from this assumption may affect the accuracy of QTL detection, leading to detection of false positive QTL. To improve the robustness of QTL mapping methods, we replace the normal distribution assumption for residuals in a multiple QTL model with a Student-t distribution that is able to accommodate residual outliers. A Robust Bayesian mapping strategy is proposed on the basis of the Bayesian shrinkage analysis for QTL effects. The simulations show that Robust Bayesian mapping approach can substantially increase the power of QTL detection when the normality assumption does not hold and applying it to data already normally distributed does not influence the result. The proposed QTL mapping method is applied to mapping QTL for the traits associated with physics–chemical characters and quality in rice. Similarly to the simulation study in the real data case the robust approach was able to detect additional QTLs when compared to the traditional approach. The program to implement the method is available on request from the first or the corresponding author. Xin Wang and Zhongze Piao contributed equally to this study.  相似文献   

20.
QTL analysis in arbitrary pedigrees with incomplete marker information   总被引:3,自引:0,他引:3  
Vogl C  Xu S 《Heredity》2002,89(5):339-345
Mapping quantitative trait loci (QTL) in arbitrary outbred pedigrees is complicated by the combinatorial possibilities of allele flow relationships and of the founder allelic configurations. Exact methods are only available for rather short and simple pedigrees. Stochastic simulation using Markov chain Monte Carlo (MCMC) integration offers more flexibility. MCMC methods are less natural in a frequentist than in a Bayesian context, which we therefore adopt. Among the MCMC algorithms for updating marker locus genotypes, we implement the descent-graph algorithm. It can be used to update marker locus allele flow relationships and can handle arbitrarily complex pedigrees and missing marker information. Compared with updating marker genotypic information, updating QTL parameters, such as position, effects, and the allele flow relationships is relatively easy with MCMC. We treat the effect of each diploid combination of founder alleles as a random variable and only estimate the variance of these effects, ie, we model diploid genotypic effects instead of the usual partition in additive and dominance effects. This is a variant of the random model approach. The number of QTL alleles is generally unknown. In the Bayesian context, the number of QTL present on a linkage group can be treated as variable. Computer simulations suggest that the algorithm can indeed handle complex pedigrees and detect two QTL on a linkage group, but that the number of individuals in a single extended family is limited to about 50 to 100 individuals.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号