首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
多QTL定位的压缩估计方法   总被引:1,自引:0,他引:1  
章元明 《遗传学报》2006,33(10):861-869
本文综述了多标记分析和多QTL定位的压缩估计方法。对于前者,Xu(Genetics,2003,163:789—801)首先提出了Bayesian压缩估计方法。其关键在于让每个效应有一个特定的方差参数,而该方差又服从一定的先验分布,以致能从资料中估计之。由此,能够同时估计大量分子标记基因座的遗传效应,即使大多数标记的效应是可忽略的。然而,对于上位性遗传模型,其运算时间还是过长。为此,笔者将上述思想嵌入极大似然法,提出了惩罚最大似然方法。模拟研究显示:该方法能处理变量个数大于样本容量10倍左右的线性遗传模型。对于后者,本文详细介绍了基于固定区间和可变区间的Bayesian压缩估计方法。固定区间方法可处理中等密度的分子标记资料;可变区间方法则可分析高密度分子标记资料,甚至是上位性遗传模型。对于上位性检测,已介绍的惩罚最大似然方法和可变区间Bayesian压缩估计方法可供利用。应当指出,压缩估计方法在今后的eQTL和QTN定位以及基因互作网络分析等研究中也是有应用价值的。  相似文献   

2.
A penalized maximum likelihood method for estimating epistatic effects of QTL   总被引:16,自引:0,他引:16  
Zhang YM  Xu S 《Heredity》2005,95(1):96-104
Although epistasis is an important phenomenon in the genetics and evolution of complex traits, epistatic effects are hard to estimate. The main problem is due to the overparameterized epistatic genetic models. An epistatic genetic model should include potential pair-wise interaction effects of all loci. However, the model is saturated quickly as the number of loci increases. Therefore, a variable selection technique is usually considered to exclude those interactions with negligible effects. With such techniques, we may run a high risk of missing some important interaction effects by not fully exploring the extremely large parameter space of models. We develop a penalized maximum likelihood method. The method developed here adopts a penalty that depends on the values of the parameters. The penalized likelihood method allows spurious QTL effects to be shrunk towards zero, while QTL with large effects are estimated with virtually no shrinkage. A simulation study shows that the new method can handle a model with a number of effects 15 times larger than the sample size. Simulation studies also show that results of the penalized likelihood method are comparable to the Bayesian shrinkage analysis, but the computational speed of the penalized method is orders of magnitude faster.  相似文献   

3.
Methodologies for segregation analysis and QTL mapping in plants   总被引:1,自引:0,他引:1  
Zhang YM  Gai J 《Genetica》2009,136(2):311-318
Most characters of biological interest and economic importance are quantitative traits. To uncover the genetic architecture of quantitative traits, two approaches have become popular in China. One is the establishment of an analytical model for mixed major-gene plus polygenes inheritance and the other the discovery of quantitative trait locus (QTL). Here we review our progress employing these two approaches. First, we proposed joint segregation analysis of multiple generations for mixed major-gene plus polygenes inheritance. Second, we extended the multilocus method of Lander and Green (1987), Jiang and Zeng (1997) to a more generalized approach. Our methodology handles distorted, dominant and missing markers, including the effect of linked segregation distortion loci on the estimation of map distance. Finally, we developed several QTL mapping methods. In the Bayesian shrinkage estimation (BSE) method, we suggested a method to test the significance of QTL effects and studied the effect of the prior distribution of the variance of QTL effect on QTL mapping. To reduce running time, a penalized maximum likelihood method was adopted. To mine novel genes in crop inbred lines generated in the course of normal crop breeding work, three methods were introduced. If a well-documented genealogical history of the lines is available, two-stage variance component analysis and multi-QTL Haseman-Elston regression were suggested; if unavailable, multiple loci in silico mapping was proposed.  相似文献   

4.
Xu S 《Biometrics》2007,63(2):513-521
Summary .   The genetic variance of a quantitative trait is often controlled by the segregation of multiple interacting loci. Linear model regression analysis is usually applied to estimating and testing effects of these quantitative trait loci (QTL). Including all the main effects and the effects of interaction (epistatic effects), the dimension of the linear model can be extremely high. Variable selection via stepwise regression or stochastic search variable selection (SSVS) is the common procedure for epistatic effect QTL analysis. These methods are computationally intensive, yet they may not be optimal. The LASSO (least absolute shrinkage and selection operator) method is computationally more efficient than the above methods. As a result, it has been widely used in regression analysis for large models. However, LASSO has never been applied to genetic mapping for epistatic QTL, where the number of model effects is typically many times larger than the sample size. In this study, we developed an empirical Bayes method (E-BAYES) to map epistatic QTL under the mixed model framework. We also tested the feasibility of using LASSO to estimate epistatic effects, examined the fully Bayesian SSVS, and reevaluated the penalized likelihood (PENAL) methods in mapping epistatic QTL. Simulation studies showed that all the above methods performed satisfactorily well. However, E-BAYES appears to outperform all other methods in terms of minimizing the mean-squared error (MSE) with relatively short computing time. Application of the new method to real data was demonstrated using a barley dataset.  相似文献   

5.
Zhang J  Yue C  Zhang YM 《Heredity》2012,108(4):396-402
A penalized maximum likelihood method has been proposed as an important approach to the detection of epistatic quantitative trait loci (QTL). However, this approach is not optimal in two special situations: (1) closely linked QTL with effects in opposite directions and (2) small-effect QTL, because the method produces downwardly biased estimates of QTL effects. The present study aims to correct the bias by using correction coefficients and shifting from the use of a uniform prior on the variance parameter of a QTL effect to that of a scaled inverse chi-square prior. The results of Monte Carlo simulation experiments show that the improved method increases the power from 25 to 88% in the detection of two closely linked QTL of equal size in opposite directions and from 60 to 80% in the identification of QTL with small effects (0.5% of the total phenotypic variance). We used the improved method to detect QTL responsible for the barley kernel weight trait using 145 doubled haploid lines developed in the North American Barley Genome Mapping Project. Application of the proposed method to other shrinkage estimation of QTL effects is discussed.  相似文献   

6.
Estimating polygenic effects using markers of the entire genome   总被引:26,自引:0,他引:26  
Xu S 《Genetics》2003,163(2):789-801
Molecular markers have been used to map quantitative trait loci. However, they are rarely used to evaluate effects of chromosome segments of the entire genome. The original interval-mapping approach and various modified versions of it may have limited use in evaluating the genetic effects of the entire genome because they require evaluation of multiple models and model selection. Here we present a Bayesian regression method to simultaneously estimate genetic effects associated with markers of the entire genome. With the Bayesian method, we were able to handle situations in which the number of effects is even larger than the number of observations. The key to the success is that we allow each marker effect to have its own variance parameter, which in turn has its own prior distribution so that the variance can be estimated from the data. Under this hierarchical model, we were able to handle a large number of markers and most of the markers may have negligible effects. As a result, it is possible to evaluate the distribution of the marker effects. Using data from the North American Barley Genome Mapping Project in double-haploid barley, we found that the distribution of gene effects follows closely an L-shaped Gamma distribution, which is in contrast to the bell-shaped Gamma distribution when the gene effects were estimated from interval mapping. In addition, we show that the Bayesian method serves as an alternative or even better QTL mapping method because it produces clearer signals for QTL. Similar results were found from simulated data sets of F(2) and backcross (BC) families.  相似文献   

7.
Bayesian shrinkage estimation of quantitative trait loci parameters   总被引:13,自引:0,他引:13       下载免费PDF全文
Wang H  Zhang YM  Li X  Masinde GL  Mohan S  Baylink DJ  Xu S 《Genetics》2005,170(1):465-480
Mapping multiple QTL is a typical problem of variable selection in an oversaturated model because the potential number of QTL can be substantially larger than the sample size. Currently, model selection is still the most effective approach to mapping multiple QTL, although further research is needed. An alternative approach to analyzing an oversaturated model is the shrinkage estimation in which all candidate variables are included in the model but their estimated effects are forced to shrink toward zero. In contrast to the usual shrinkage estimation where all model effects are shrunk by the same factor, we develop a Bayesian method that allows the shrinkage factor to vary across different effects. The new shrinkage method forces marker intervals that contain no QTL to have estimated effects close to zero whereas intervals containing notable QTL have estimated effects subject to virtually no shrinkage. We demonstrate the method using both simulated and real data for QTL mapping. A simulation experiment with 500 backcross (BC) individuals showed that the method can localize closely linked QTL and QTL with effects as small as 1% of the phenotypic variance of the trait. The method was also used to map QTL responsible for wound healing in a family of a (MRL/MPJ x SJL/J) cross with 633 F(2) mice derived from two inbred lines.  相似文献   

8.
It has long been recognized that epistasis or interactions between non-allelic genes plays an important role in the genetic control and evolution of quantitative traits. However, the detection of epistasis and estimation of epistatic effects are difficult due to the complexity of epistatic patterns, insufficient sample size of mapping populations and lack of efficient statistical methods. Under the assumption of additivity of QTL effects on the phenotype of a trait in interest, the additive effect of a QTL can be completely absorbed by the flanking marker variables, and the epistatic effect between two QTL can be completely absorbed by the four marker-pair multiplication variables between the two pairs of flanking markers. Based on this property, we proposed an inclusive composite interval mapping (ICIM) by simultaneously considering marker variables and marker-pair multiplications in a linear model. Stepwise regression was applied to identify the most significant markers and marker-pair multiplications. Then a two-dimensional scanning (or interval mapping) was conducted to identify QTL with significant digenic epistasis using adjusted phenotypic values based on the best multiple regression model. The adjusted values retain the information of QTL on the two current mapping intervals but exclude the influence of QTL on other intervals and chromosomes. Epistatic QTL can be identified by ICIM, no matter whether the two interacting QTL have any additive effects. Simulated populations and one barley doubled haploids (DH) population were used to demonstrate the efficiency of ICIM in mapping both additive QTL and digenic interactions.  相似文献   

9.
It is shown that maximum likelihood estimation of variance components from twin data can be parameterized in the framework of linear mixed models. Standard statistical packages can be used to analyze univariate or multivariate data for simple models such as the ACE and CE models. Furthermore, specialized variance component estimation software that can handle pedigree data and user-defined covariance structures can be used to analyze multivariate data for simple and complex models, including those where dominance and/or QTL effects are fitted. The linear mixed model framework is particularly useful for analyzing multiple traits in extended (twin) families with a large number of random effects.  相似文献   

10.
Four-way crosses (4WC) involving four different inbred lines often appear in plant and animal commercial breeding programs. Direct mapping of quantitative trait loci (QTL) in these commercial populations is both economical and practical. However, the existing statistical methods for mapping QTL in a 4WC population are built on the single-QTL genetic model. This simple genetic model fails to take into account QTL interactions, which play an important role in the genetic architecture of complex traits. In this paper, therefore, we attempted to develop a statistical method to detect epistatic QTL in 4WC population. Conditional probabilities of QTL genotypes, computed by the multi-point single locus method, were used to sample the genotypes of all putative QTL in the entire genome. The sampled genotypes were used to construct the design matrix for QTL effects. All QTL effects, including main and epistatic effects, were simultaneously estimated by the penalized maximum likelihood method. The proposed method was confirmed by a series of Monte Carlo simulation studies and real data analysis of cotton. The new method will provide novel tools for the genetic dissection of complex traits, construction of QTL networks, and analysis of heterosis.  相似文献   

11.
植物QTL分析的理论研究进展   总被引:2,自引:0,他引:2  
数量性状的表型是由数量性状基因座 ( Quantitative trait locus,QTL)和环境效应共同作用的结果。传统的数量遗传学采用统计学的方法由一级统计量和二级统计量描述处理 QTL的复合作用 ,估计各种遗传参数 (例如遗传力、遗传相关、遗传进度、有效因子数等 ) ,用于指导遗传育种实践。然而 ,在传统的数量遗传学分析中 ,往往假设数量性状受微效多基因控制 ,这些基因具有相同的并且是较微小的效应 ,所估计的遗传参数反映的是数量性状多基因系统的整体特征 ,其理论方法不能用于追踪研究和描述单个数量性状基因的作用。近年来 ,由于分子生物学技…  相似文献   

12.
S. Xu  W. R. Atchley 《Genetics》1995,141(3):1189-1197
Mapping quantitative trait loci in outbred populations is important because many populations of organisms are noninbred. Unfortunately, information about the genetic architecture of the trait may not be available in outbred populations. Thus, the allelic effects of genes can not be estimated with ease. In addition, under linkage equilibrium, marker genotypes provide no information about the genotype of a QTL (our terminology for a single quantitative trait locus is QTL while multiple loci are referred to as QTLs). To circumvent this problem, an interval mapping procedure based on a random model approach is described. Under a random model, instead of estimating the effects, segregating variances of QTLs are estimated by a maximum likelihood method. Estimation of the variance component of a QTL depends on the proportion of genes identical-by-descent (IBD) shared by relatives at the locus, which is predicted by the IBD of two markers flanking the QTL. The marker IBD shared by two relatives are inferred from the observed marker genotypes. The procedure offers an advantage over the regression interval mapping in terms of high power and small estimation errors and provides flexibility for large sibships, irregular pedigree relationships and incorporation of common environmental and fixed effects.  相似文献   

13.
Estimating the genetic architecture of quantitative traits   总被引:20,自引:0,他引:20  
Understanding and estimating the structure and parameters associated with the genetic architecture of quantitative traits is a major research focus in quantitative genetics. With the availability of a well-saturated genetic map of molecular markers, it is possible to identify a major part of the structure of the genetic architecture of quantitative traits and to estimate the associated parameters. Multiple interval mapping, which was recently proposed for simultaneously mapping multiple quantitative trait loci (QTL), is well suited to the identification and estimation of the genetic architecture parameters, including the number, genomic positions, effects and interactions of significant QTL and their contribution to the genetic variance. With multiple traits and multiple environments involved in a QTL mapping experiment, pleiotropic effects and QTL by environment interactions can also be estimated. We review the method and discuss issues associated with multiple interval mapping, such as likelihood analysis, model selection, stopping rules and parameter estimation. The potential power and advantages of the method for mapping multiple QTL and estimating the genetic architecture are discussed. We also point out potential problems and difficulties in resolving the details of the genetic architecture as well as other areas that require further investigation. One application of the analysis is to improve genome-wide marker-assisted selection, particularly when the information about epistasis is used for selection with mating.  相似文献   

14.
Yi N  Xu S  Allison DB 《Genetics》2003,165(2):867-883
Most complex traits of animals, plants, and humans are influenced by multiple genetic and environmental factors. Interactions among multiple genes play fundamental roles in the genetic control and evolution of complex traits. Statistical modeling of interaction effects in quantitative trait loci (QTL) analysis must accommodate a very large number of potential genetic effects, which presents a major challenge to determining the genetic model with respect to the number of QTL, their positions, and their genetic effects. In this study, we use the methodology of Bayesian model and variable selection to develop strategies for identifying multiple QTL with complex epistatic patterns in experimental designs with two segregating genotypes. Specifically, we develop a reversible jump Markov chain Monte Carlo algorithm to determine the number of QTL and to select main and epistatic effects. With the proposed method, we can jointly infer the genetic model of a complex trait and the associated genetic parameters, including the number, positions, and main and epistatic effects of the identified QTL. Our method can map a large number of QTL with any combination of main and epistatic effects. Utility and flexibility of the method are demonstrated using both simulated data and a real data set. Sensitivity of posterior inference to prior specifications of the number and genetic effects of QTL is investigated.  相似文献   

15.
Huang J  Harrington D 《Biometrics》2002,58(4):781-791
The Cox proportional hazards model is often used for estimating the association between covariates and a potentially censored failure time, and the corresponding partial likelihood estimators are used for the estimation and prediction of relative risk of failure. However, partial likelihood estimators are unstable and have large variance when collinearity exists among the explanatory variables or when the number of failures is not much greater than the number of covariates of interest. A penalized (log) partial likelihood is proposed to give more accurate relative risk estimators. We show that asymptotically there always exists a penalty parameter for the penalized partial likelihood that reduces mean squared estimation error for log relative risk, and we propose a resampling method to choose the penalty parameter. Simulations and an example show that the bootstrap-selected penalized partial likelihood estimators can, in some instances, have smaller bias than the partial likelihood estimators and have smaller mean squared estimation and prediction errors of log relative risk. These methods are illustrated with a data set in multiple myeloma from the Eastern Cooperative Oncology Group.  相似文献   

16.
Jung J  Fan R  Jin L 《Genetics》2005,170(2):881-898
Using multiple diallelic markers, variance component models are proposed for high-resolution combined linkage and association mapping of quantitative trait loci (QTL) based on nuclear families. The objective is to build a model that may fully use marker information for fine association mapping of QTL in the presence of prior linkage. The measures of linkage disequilibrium and the genetic effects are incorporated in the mean coefficients and are decomposed into orthogonal additive and dominance effects. The linkage information is modeled in variance-covariance matrices. Hence, the proposed methods model both association and linkage in a unified model. On the basis of marker information, a multipoint interval mapping method is provided to estimate the proportion of allele sharing identical by descent (IBD) and the probability of sharing two alleles IBD at a putative QTL for a sib-pair. To test the association between the trait locus and the markers, both likelihood-ratio tests and F-tests can be constructed on the basis of the proposed models. In addition, analytical formulas of noncentrality parameter approximations of the F-test statistics are provided. Type I error rates of the proposed test statistics are calculated to show their robustness. After comparing with the association between-family and association within-family (AbAw) approach by Abecasis and Fulker et al., it is found that the method proposed in this article is more powerful and advantageous based on simulation study and power calculation. By power and sample size comparison, it is shown that models that use more markers may have higher power than models that use fewer markers. The multiple-marker analysis can be more advantageous and has higher power in fine mapping QTL. As an application, the Genetic Analysis Workshop 12 German asthma data are analyzed using the proposed methods.  相似文献   

17.
Boer MP  Ter Braak CJ  Jansen RC 《Genetics》2002,162(2):951-960
Epistasis is a common and important phenomenon, as indicated by results from a number of recent experiments. Unfortunately, the discovery of epistatic quantitative trait loci (QTL) is difficult since one must search for multiple QTL simultaneously in two or more dimensions. Such a multidimensional search necessitates many statistical tests, and a high statistical threshold must be adopted to avoid false positives. Furthermore, the large number of (interaction) parameters in comparison with the number of observations results in a serious danger of overfitting and overinterpretation of the data. In this article we present a new statistical framework for mapping epistasis in inbred line crosses. It is based on reducing the high dimensionality of the problem in two ways. First, epistatic QTL are mapped in a one-dimensional genome scan for high interactions between QTL and the genetic background. Second, the dimension of the search is bounded by penalized likelihood methods. We use simulated backcross data to illustrate the new approach.  相似文献   

18.
A novel and robust method for the fine-scale mapping of genes affecting complex traits, which combines linkage and linkage-disequilibrium information, is proposed. Linkage information refers to recombinations within the marker-genotyped generations and linkage disequilibrium to historical recombinations before genotyping started. The identity-by-descent (IBD) probabilities at the quantitative trait locus (QTL) between first generation haplotypes were obtained from the similarity of the marker alleles surrounding the QTL, whereas IBD probabilities at the QTL between later generation haplotypes were obtained by using the markers to trace the inheritance of the QTL. The variance explained by the QTL is estimated by residual maximum likelihood using the correlation structure defined by the IBD probabilities. Unlinked background genes were accounted for by fitting a polygenic variance component. The method was used to fine map a QTL for twinning rate in cattle, previously mapped on chromosome 5 by linkage analysis. The data consisted of large half-sib families, but the method could also handle more complex pedigrees. The likelihood of the putative QTL was very small along most of the chromosome, except for a sharp likelihood peak in the ninth marker bracket, which positioned the QTL within a region <1 cM in the middle part of bovine chromosome 5. The method was expected to be robust against multiple genes affecting the trait, multiple mutations at the QTL, and relatively low marker density.  相似文献   

19.
Without consideration of other linked QTLs responsible for dynamic trait, original functional mapping based on a single QTL model is not optimal for analyzing multiple dynamic trait loci. Despite that composite functional mapping incorporates the effects of genetic background outside the tested QTL in mapping model, the arbitrary choice of background markers also impact on the power of QTL detection. In this study, we proposed Bayesian functional mapping strategy that can simultaneously identify multiple QTL controlling developmental patterns of dynamic traits over the genome. Our proposed method fits the change of each QTL effect with the time by Legendre polynomial and takes the residual covariance structure into account using the first autoregressive equation. Also, Bayesian shrinkage estimation was employed to estimate the model parameters. Especially, we specify the gamma distribution as the prior for the first-order auto-regressive coefficient, which will guarantee the convergence of Bayesian sampling. Simulations showed that the proposed method could accurately estimate the QTL parameters and had a greater statistical power of QTL detection than the composite functional mapping. A real data analysis of leaf age growth in rice is used for the demonstration of our method. It shows that our Bayesian functional mapping can detect more QTLs as compared to composite functional mapping.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号