首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
A Nonparametric Approach for Mapping Quantitative Trait Loci   总被引:23,自引:3,他引:20       下载免费PDF全文
L. Kruglyak  E. S. Lander 《Genetics》1995,139(3):1421-1428
Genetic mapping of quantitative trait loci (QTLs) is performed typically by using a parametric approach, based on the assumption that the phenotype follows a normal distribution. Many traits of interest, however, are not normally distributed. In this paper, we present a nonparametric approach to QTL mapping applicable to any phenotypic distribution. The method is based on a statistic Z(w), which generalizes the nonparametric Wilcoxon rank-sum test to the situation of whole-genome search by interval mapping. We determine the appropriate significance level for the statistic Z(w), by showing that its asymptotic null distribution follows an Ornstein-Uhlenbeck process. These results provide a robust, distribution-free method for mapping QTLs.  相似文献   

2.
A major consideration in multitrait analysis is which traits should be jointly analyzed. As a common strategy, multitrait analysis is performed either on pairs of traits or on all of traits. To fully exploit the power of multitrait analysis, we propose variable selection to choose a subset of informative traits for multitrait quantitative trait locus (QTL) mapping. The proposed method is very useful for achieving optimal statistical power for QTL identification and for disclosing the most relevant traits. It is also a practical strategy to effectively take advantage of multitrait analysis when the number of traits under consideration is too large, making the usual multivariate analysis of all traits challenging. We study the impact of selection bias and the usage of permutation tests in the context of variable selection and develop a powerful implementation procedure of variable selection for genome scanning. We demonstrate the proposed method and selection procedure in a backcross population, using both simulated and real data. The extension to other experimental mapping populations is straightforward.  相似文献   

3.
This paper demonstrates the advantages of sharing information about unknown features of covariates across multiple model components in various nonparametric regression problems including multivariate, heteroscedastic, and semicontinuous responses. In this paper, we present a methodology which allows for information to be shared nonparametrically across various model components using Bayesian sum-of-tree models. Our simulation results demonstrate that sharing of information across related model components is often very beneficial, particularly in sparse high-dimensional problems in which variable selection must be conducted. We illustrate our methodology by analyzing medical expenditure data from the Medical Expenditure Panel Survey (MEPS). To facilitate the Bayesian nonparametric regression analysis, we develop two novel models for analyzing the MEPS data using Bayesian additive regression trees—a heteroskedastic log-normal hurdle model with a “shrink-toward-homoskedasticity” prior and a gamma hurdle model.  相似文献   

4.
C-L Wang  X-D Ding  J-Y Wang  J-F Liu  W-X Fu  Z Zhang  Z-J Yin  Q Zhang 《Heredity》2013,110(3):213-219
Estimation of genomic breeding values is the key step in genomic selection (GS). Many methods have been proposed for continuous traits, but methods for threshold traits are still scarce. Here we introduced threshold model to the framework of GS, and specifically, we extended the three Bayesian methods BayesA, BayesB and BayesCπ on the basis of threshold model for estimating genomic breeding values of threshold traits, and the extended methods are correspondingly termed BayesTA, BayesTB and BayesTCπ. Computing procedures of the three BayesT methods using Markov Chain Monte Carlo algorithm were derived. A simulation study was performed to investigate the benefit of the presented methods in accuracy with the genomic estimated breeding values (GEBVs) for threshold traits. Factors affecting the performance of the three BayesT methods were addressed. As expected, the three BayesT methods generally performed better than the corresponding normal Bayesian methods, in particular when the number of phenotypic categories was small. In the standard scenario (number of categories=2, incidence=30%, number of quantitative trait loci=50, h2=0.3), the accuracies were improved by 30.4%, 2.4%, and 5.7% points, respectively. In most scenarios, BayesTB and BayesTCπ generated similar accuracies and both performed better than BayesTA. In conclusion, our work proved that threshold model fits well for predicting GEBVs of threshold traits, and BayesTCπ is supposed to be the method of choice for GS of threshold traits.  相似文献   

5.
玉米5个农艺性状的QTL定位   总被引:35,自引:0,他引:35  
利用“豫玉22”构建的266个玉米F2:3家系为材料,通过一年两点的随机区组田间试验和分子标记分析,研究了玉米穗位高、雄穗分支数、茎粗、抽雄期、吐丝期5个重要农艺性状。相关分析表明,穗位高、雄穗分支数、茎粗与单株产量显著正相关,抽雄期与吐丝期高度正相关,雄穗分支数与茎粗显著正相关。采用复合区间作图法,通过500次排列测验分别确定各性状的LOD阈值,在武汉和襄樊两地共定位了7个穗位高QTL、9个雄穗分支数QTL、8个茎粗QTL、9个抽雄期QTL和7个吐丝期QTL;这些QTL在染色体上分布不均匀,具有集中分布的特点。研究表明,数量性状间的表型相关可能源于控制数量性状的QTL位点间的相关。  相似文献   

6.
Models for genome-wide prediction and association studies usually target a single phenotypic trait. However, in animal and plant genetics it is common to record information on multiple phenotypes for each individual that will be genotyped. Modeling traits individually disregards the fact that they are most likely associated due to pleiotropy and shared biological basis, thus providing only a partial, confounded view of genetic effects and phenotypic interactions. In this article we use data from a Multiparent Advanced Generation Inter-Cross (MAGIC) winter wheat population to explore Bayesian networks as a convenient and interpretable framework for the simultaneous modeling of multiple quantitative traits. We show that they are equivalent to multivariate genetic best linear unbiased prediction (GBLUP) and that they are competitive with single-trait elastic net and single-trait GBLUP in predictive performance. Finally, we discuss their relationship with other additive-effects models and their advantages in inference and interpretation. MAGIC populations provide an ideal setting for this kind of investigation because the very low population structure and large sample size result in predictive models with good power and limited confounding due to relatedness.  相似文献   

7.
8.
采用最大似然区间定位法对阈模型与一般线性模型的QTL定位效率进行了比较,并对影响离散性状QTL检测效率的主要因素(QTL效应、性状的遗传力和表型发生率)进行了模拟研究,实验设计为多个家系的女儿设计.资源群体大小为500头。研究结果表明:在QTL参数估计及检验功效方面,阈模型方法具有较大的优势,对离散性状QTL定位的效率明显高于LM(Linear Model)方法,定位的准确性也较高。另外,性状遗传力、QTL效应的大小和性状表型发生率对QTL定位的准确度也有直接的影响,随着性状遗传力和表型发生率的提高,随着QTL效应的增大,QTL定位的效率也进一步提高。  相似文献   

9.
10.
植物数量性状基因定位研究概述   总被引:10,自引:0,他引:10  
植物重要的性状多为数量性状。长期以来,人类一直寻求解释植物数量性状的遗传规律以便对其进行遗传操纵。现代分子生物技术的发展为植物数量性状基因的定位、分离等研究提供了条件。本文从数量性状基因座(QTL)作图群体类型及其特点,QTL定位方法,植物QTL研究现状,以及QTL精细定位、克隆、利用等方面进行了综述,并对今后植物QTL研究进行了展望。  相似文献   

11.
Bayesian shrinkage analysis is arguably the state-of-the-art technique for large-scale multiple quantitative trait locus (QTL) mapping. However, when the shrinkage model does not involve indicator variables for marker inclusion, QTL detection remains heavily dependent on significance thresholds derived from phenotype permutation under the null hypothesis of no phenotype-to-genotype association. This approach is computationally intensive and more importantly, the hypothetical data generation at the heart of the permutation-based method violates the Bayesian philosophy. Here we propose a fully Bayesian decision rule for QTL detection under the recently introduced extended Bayesian LASSO for QTL mapping. Our new decision rule is free of any hypothetical data generation and relies on the well-established Bayes factors for evaluating the evidence for QTL presence at any locus. Simulation results demonstrate the remarkable performance of our decision rule. An application to real-world data is considered as well.  相似文献   

12.
Summary A surrogate marker (S) is a variable that can be measured earlier and often more easily than the true endpoint (T) in a clinical trial. Most previous research has been devoted to developing surrogacy measures to quantify how well S can replace T or examining the use of S in predicting the effect of a treatment (Z). However, the research often requires one to fit models for the distribution of T given S and Z. It is well known that such models do not have causal interpretations because the models condition on a postrandomization variable S. In this article, we directly model the relationship among T, S, and Z using a potential outcomes framework introduced by Frangakis and Rubin (2002, Biometrics 58 , 21–29). We propose a Bayesian estimation method to evaluate the causal probabilities associated with the cross‐classification of the potential outcomes of S and T when S and T are both binary. We use a log‐linear model to directly model the association between the potential outcomes of S and T through the odds ratios. The quantities derived from this approach always have causal interpretations. However, this causal model is not identifiable from the data without additional assumptions. To reduce the nonidentifiability problem and increase the precision of statistical inferences, we assume monotonicity and incorporate prior belief that is plausible in the surrogate context by using prior distributions. We also explore the relationship among the surrogacy measures based on traditional models and this counterfactual model. The method is applied to the data from a glaucoma treatment study.  相似文献   

13.
In this article, shrinkage estimation method for multiple-marker analysis and for mapping multiple quantitative trait loci (QTL) was reviewed. For multiple-marker analysis, Xu (Genetics, 2003, 163:789-801) developed a Bayesian shrinkage estimation (BSE) method. The key to the success of this method is to allow each marker effect have its own variance parameter, which in turn has its own prior distribution so that the variance can be estimated from the data. Under this hierarchical model, a large number of markers can be handled although most of them may have negligible effects. Under epistatic genetic model, however, the running time is very long. To overcome this problem, a novel method of incorporating the idea described above into maximum likelihood, known as penalized likelihood method, was proposed. A simulated study showed that this method can handle a model with multiple effects, which are ten times larger than the sample size. For multiple QTL analysis, two modified versions for the BSE method were introduced: one is the fixed-interval method and another is the variable-interval method. The former deals with markers with intermediate density, and the latter can handle markers with extremely high density as well as model with epistatic effects. For the detection of epistatic effects, penalized likelihood method and the variable-interval approach of the BSE method are available.  相似文献   

14.
多QTL定位的压缩估计方法   总被引:1,自引:0,他引:1  
章元明 《遗传学报》2006,33(10):861-869
本文综述了多标记分析和多QTL定位的压缩估计方法。对于前者,Xu(Genetics,2003,163:789—801)首先提出了Bayesian压缩估计方法。其关键在于让每个效应有一个特定的方差参数,而该方差又服从一定的先验分布,以致能从资料中估计之。由此,能够同时估计大量分子标记基因座的遗传效应,即使大多数标记的效应是可忽略的。然而,对于上位性遗传模型,其运算时间还是过长。为此,笔者将上述思想嵌入极大似然法,提出了惩罚最大似然方法。模拟研究显示:该方法能处理变量个数大于样本容量10倍左右的线性遗传模型。对于后者,本文详细介绍了基于固定区间和可变区间的Bayesian压缩估计方法。固定区间方法可处理中等密度的分子标记资料;可变区间方法则可分析高密度分子标记资料,甚至是上位性遗传模型。对于上位性检测,已介绍的惩罚最大似然方法和可变区间Bayesian压缩估计方法可供利用。应当指出,压缩估计方法在今后的eQTL和QTN定位以及基因互作网络分析等研究中也是有应用价值的。  相似文献   

15.
殷宗俊  张勤  张纪刚  丁向东 《遗传学报》2005,32(11):1147-1155
在广义线性模型的框架内模拟研究了家畜抗性等级性状的QTL定位方法,QTL参数的估计采用最大似然方法,比较了阈模型方法与一般线性方法的QTL定位效率,并对影响等级性状QTL定位效率的主要因素(QTL效应、性状的遗传力)进行了模拟研究,实验设计为多个家系的女儿设计,资源群体大小为500头。研究结果表明:在QTL位置参数估计及检验功效方面,阈模型方法具有一定的优势,对抗性等级性状QTL定位的功效也高于线性方法。另外,性状遗传力和QTL效应的大小对QTL定位的准确度也有直接的影响,随着性状遗传力QTL效应的  相似文献   

16.
L Min  R Yang  X Wang  B Wang 《Heredity》2011,106(1):124-133
The dissection of the genetic architecture of quantitative traits, including the number and locations of quantitative trait loci (QTL) and their main and epistatic effects, has been an important topic in current QTL mapping. We extend the Bayesian model selection framework for mapping multiple epistatic QTL affecting continuous traits to dynamic traits in experimental crosses. The extension inherits the efficiency of Bayesian model selection and the flexibility of the Legendre polynomial model fitting to the change in genetic and environmental effects with time. We illustrate the proposed method by simultaneously detecting the main and epistatic QTLs for the growth of leaf age in a doubled-haploid population of rice. The behavior and performance of the method are also shown by computer simulation experiments. The results show that our method can more quickly identify interacting QTLs for dynamic traits in the models with many numbers of genetic effects, enhancing our understanding of genetic architecture for dynamic traits. Our proposed method can be treated as a general form of mapping QTL for continuous quantitative traits, being easier to extend to multiple traits and to a single trait with repeat records.  相似文献   

17.
A Bayesian model for sparse functional data   总被引:1,自引:0,他引:1  
Thompson WK  Rosen O 《Biometrics》2008,64(1):54-63
Summary.   We propose a method for analyzing data which consist of curves on multiple individuals, i.e., longitudinal or functional data. We use a Bayesian model where curves are expressed as linear combinations of B-splines with random coefficients. The curves are estimated as posterior means obtained via Markov chain Monte Carlo (MCMC) methods, which automatically select the local level of smoothing. The method is applicable to situations where curves are sampled sparsely and/or at irregular time points. We construct posterior credible intervals for the mean curve and for the individual curves. This methodology provides unified, efficient, and flexible means for smoothing functional data.  相似文献   

18.
The development of clinical prediction models requires the selection of suitable predictor variables. Techniques to perform objective Bayesian variable selection in the linear model are well developed and have been extended to the generalized linear model setting as well as to the Cox proportional hazards model. Here, we consider discrete time‐to‐event data with competing risks and propose methodology to develop a clinical prediction model for the daily risk of acquiring a ventilator‐associated pneumonia (VAP) attributed to P. aeruginosa (PA) in intensive care units. The competing events for a PA VAP are extubation, death, and VAP due to other bacteria. Baseline variables are potentially important to predict the outcome at the start of ventilation, but may lose some of their predictive power after a certain time. Therefore, we use a landmark approach for dynamic Bayesian variable selection where the set of relevant predictors depends on the time already spent at risk. We finally determine the direct impact of a variable on each competing event through cause‐specific variable selection.  相似文献   

19.
Summary Physical activity has many well‐documented health benefits for cardiovascular fitness and weight control. For pregnant women, the American College of Obstetricians and Gynecologists currently recommends 30 minutes of moderate exercise on most, if not all, days; however, very few pregnant women achieve this level of activity. Traditionally, studies have focused on examining individual or interpersonal factors to identify predictors of physical activity. There is a renewed interest in whether characteristics of the physical environment in which we live and work may also influence physical activity levels. We consider one of the first studies of pregnant women that examines the impact of characteristics of the built environment on physical activity levels. Using a socioecologic framework, we study the associations between physical activity and several factors including personal characteristics, meteorological/air quality variables, and neighborhood characteristics for pregnant women in four counties of North Carolina. We simultaneously analyze six types of physical activity and investigate cross‐dependencies between these activity types. Exploratory analysis suggests that the associations are different in different regions. Therefore, we use a multivariate regression model with spatially varying regression coefficients. This model includes a regression parameter for each covariate at each spatial location. For our data with many predictors, some form of dimension reduction is clearly needed. We introduce a Bayesian variable selection procedure to identify subsets of important variables. Our stochastic search algorithm determines the probabilities that each covariate's effect is null, non‐null but constant across space, and spatially varying. We found that individual‐level covariates had a greater influence on women's activity levels than neighborhood environmental characteristics, and some individual‐level covariates had spatially varying associations with the activity levels of pregnant women.  相似文献   

20.
直播条件下水稻6个穗部性状的QTL分析   总被引:2,自引:0,他引:2  
在大田直播条件下,利用来源于"Lemont/特青"的重组自交系群体,对水稻6个穗部性状及其相互间遗传相关的分子基础进行了QTL分析,共检测到19个QTL,各性状QTL数为2~4个,单个QTL贡献率为4%~22%。共检测到3个染色体区段能同时影响多个穗部性状,其中第1染色体RM212-RM104和第2染色体RM263-RM221区段的QTL能同时影响单株产量、每穗颖花数、着粒密度和二次枝梗数中的3个或4个性状,且这2个区段的QTL对各性状的效应方向相同,增效等位基因均来自‘特青’,为各性状间表型正相关提供了重要的遗传解释。第11染色体RG1022附近的QTL对着粒密度的效应值为负,来自‘特青’的等位基因增加性状值,而对穗长的效应值为正,来自‘特青’的等位基因降低性状值,为这2个性状间表型负相关也提供了一定的遗传解释。此外,对水稻穗部性状QTL在多种环境和遗传背景下的稳定表达及其在分子标记辅助育种中的应用进行了讨论。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号