期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A Nonparametric Approach for Mapping Quantitative Trait Loci 总被引：23，自引：3，他引：20

L. Kruglyak E. S. Lander 《Genetics》1995,139(3):1421-1428

Genetic mapping of quantitative trait loci (QTLs) is performed typically by using a parametric approach, based on the assumption that the phenotype follows a normal distribution. Many traits of interest, however, are not normally distributed. In this paper, we present a nonparametric approach to QTL mapping applicable to any phenotypic distribution. The method is based on a statistic Z(w), which generalizes the nonparametric Wilcoxon rank-sum test to the situation of whole-genome search by interval mapping. We determine the appropriate significance level for the statistic Z(w), by showing that its asymptotic null distribution follows an Ornstein-Uhlenbeck process. These results provide a robust, distribution-free method for mapping QTLs. 相似文献

2.

Selecting Informative Traits for Multivariate Quantitative Trait Locus Mapping Helps to Gain Optimal Power

Riyan Cheng Justin Borevitz R. W. Doerge 《Genetics》2013,195(3):683-691

A major consideration in multitrait analysis is which traits should be jointly analyzed. As a common strategy, multitrait analysis is performed either on pairs of traits or on all of traits. To fully exploit the power of multitrait analysis, we propose variable selection to choose a subset of informative traits for multitrait quantitative trait locus (QTL) mapping. The proposed method is very useful for achieving optimal statistical power for QTL identification and for disclosing the most relevant traits. It is also a practical strategy to effectively take advantage of multitrait analysis when the number of traits under consideration is too large, making the usual multivariate analysis of all traits challenging. We study the impact of selection bias and the usage of permutation tests in the context of variable selection and develop a powerful implementation procedure of variable selection for genome scanning. We demonstrate the proposed method and selection procedure in a backcross population, using both simulated and real data. The extension to other experimental mapping populations is straightforward. 相似文献

3.

Semiparametric mixed-scale models using shared Bayesian forests

Antonio R. Linero Debajyoti Sinha Stuart R. Lipsitz 《Biometrics》2020,76(1):131-144

This paper demonstrates the advantages of sharing information about unknown features of covariates across multiple model components in various nonparametric regression problems including multivariate, heteroscedastic, and semicontinuous responses. In this paper, we present a methodology which allows for information to be shared nonparametrically across various model components using Bayesian sum-of-tree models. Our simulation results demonstrate that sharing of information across related model components is often very beneficial, particularly in sparse high-dimensional problems in which variable selection must be conducted. We illustrate our methodology by analyzing medical expenditure data from the Medical Expenditure Panel Survey (MEPS). To facilitate the Bayesian nonparametric regression analysis, we develop two novel models for analyzing the MEPS data using Bayesian additive regression trees—a heteroskedastic log-normal hurdle model with a “shrink-toward-homoskedasticity” prior and a gamma hurdle model. 相似文献

4.

玉米5个农艺性状的QTL定位 总被引：35，自引：0，他引：35

汤华严建兵黄益勤郑用琏李建生《遗传学报》2005,32(2):203-209

利用“豫玉22”构建的266个玉米F2：3家系为材料,通过一年两点的随机区组田间试验和分子标记分析,研究了玉米穗位高、雄穗分支数、茎粗、抽雄期、吐丝期5个重要农艺性状。相关分析表明,穗位高、雄穗分支数、茎粗与单株产量显著正相关,抽雄期与吐丝期高度正相关,雄穗分支数与茎粗显著正相关。采用复合区间作图法,通过500次排列测验分别确定各性状的LOD阈值,在武汉和襄樊两地共定位了7个穗位高QTL、9个雄穗分支数QTL、8个茎粗QTL、9个抽雄期QTL和7个吐丝期QTL;这些QTL在染色体上分布不均匀,具有集中分布的特点。研究表明,数量性状间的表型相关可能源于控制数量性状的QTL位点间的相关。相似文献

5.

Bayesian methods for estimating GEBVs of threshold traits

C-L Wang X-D Ding J-Y Wang J-F Liu W-X Fu Z Zhang Z-J Yin Q Zhang 《Heredity》2013,110(3):213-219

Estimation of genomic breeding values is the key step in genomic selection (GS). Manymethods have been proposed for continuous traits, but methods for threshold traits arestill scarce. Here we introduced threshold model to the framework of GS, and specifically,we extended the three Bayesian methods BayesA, BayesB and BayesCπ on the basis ofthreshold model for estimating genomic breeding values of threshold traits, and theextended methods are correspondingly termed BayesTA, BayesTB and BayesTCπ. Computingprocedures of the three BayesT methods using Markov Chain Monte Carlo algorithm werederived. A simulation study was performed to investigate the benefit of the presentedmethods in accuracy with the genomic estimated breeding values (GEBVs) for thresholdtraits. Factors affecting the performance of the three BayesT methods were addressed. Asexpected, the three BayesT methods generally performed better than the correspondingnormal Bayesian methods, in particular when the number of phenotypic categories was small.In the standard scenario (number of categories=2, incidence=30%,number of quantitative trait loci=50, h²=0.3), theaccuracies were improved by 30.4%, 2.4%, and 5.7% points,respectively. In most scenarios, BayesTB and BayesTCπ generated similar accuracies andboth performed better than BayesTA. In conclusion, our work proved that threshold modelfits well for predicting GEBVs of threshold traits, and BayesTCπ is supposed to be themethod of choice for GS of threshold traits. 相似文献

6.

Multiple Quantitative Trait Analysis Using Bayesian Networks

Marco Scutari Phil Howell David J. Balding Ian Mackay 《Genetics》2014,198(1):129-137

Models for genome-wide prediction and association studies usually target a single phenotypic trait. However, in animal and plant genetics it is common to record information on multiple phenotypes for each individual that will be genotyped. Modeling traits individually disregards the fact that they are most likely associated due to pleiotropy and shared biological basis, thus providing only a partial, confounded view of genetic effects and phenotypic interactions. In this article we use data from a Multiparent Advanced Generation Inter-Cross (MAGIC) winter wheat population to explore Bayesian networks as a convenient and interpretable framework for the simultaneous modeling of multiple quantitative traits. We show that they are equivalent to multivariate genetic best linear unbiased prediction (GBLUP) and that they are competitive with single-trait elastic net and single-trait GBLUP in predictive performance. Finally, we discuss their relationship with other additive-effects models and their advantages in inference and interpretation. MAGIC populations provide an ideal setting for this kind of investigation because the very low population structure and large sample size result in predictive models with good power and limited confounding due to relatedness. 相似文献

7.

A Bayesian random partition model for sequential refinement and coagulation

Carlos Tadeu Pagani Zanini Peter Müller Yuan Ji Fernando A. Quintana 《Biometrics》2019,75(3):988-999

相似文献

8.

Hierarchical Bayesian semiparametric procedures for logistic regression

HSU JOHN S. J.; LEONARD TOM 《Biometrika》1997,84(1):85-93

相似文献

9.

离散性状QTL区间定位的最大似然方法

殷宗俊张勤陈宏权张纪刚丁向东王春考《遗传学报》2005,32(9):923-929

采用最大似然区间定位法对阈模型与一般线性模型的QTL定位效率进行了比较,并对影响离散性状QTL检测效率的主要因素（QTL效应、性状的遗传力和表型发生率）进行了模拟研究,实验设计为多个家系的女儿设计．资源群体大小为500头。研究结果表明：在QTL参数估计及检验功效方面,阈模型方法具有较大的优势,对离散性状QTL定位的效率明显高于LM（Linear Model）方法,定位的准确性也较高。另外,性状遗传力、QTL效应的大小和性状表型发生率对QTL定位的准确度也有直接的影响,随着性状遗传力和表型发生率的提高,随着QTL效应的增大,QTL定位的效率也进一步提高。相似文献

10.

植物数量性状基因定位研究概述 总被引：10，自引：0，他引：10

沈金雄易斌傅廷栋杨光圣《植物学通报》2003,20(3):257-263

植物重要的性状多为数量性状。长期以来,人类一直寻求解释植物数量性状的遗传规律以便对其进行遗传操纵。现代分子生物技术的发展为植物数量性状基因的定位、分离等研究提供了条件。本文从数量性状基因座(QTL)作图群体类型及其特点,QTL定位方法,植物QTL研究现状,以及QTL精细定位、克隆、利用等方面进行了综述,并对今后植物QTL研究进行了展望。相似文献

11.

A Decision Rule for Quantitative Trait Locus Detection Under the Extended Bayesian LASSO Model

Crispin M. Mutshinda Mikko J. Sillanp?? 《Genetics》2012,192(4):1483-1491

Bayesian shrinkage analysis is arguably the state-of-the-art technique for large-scale multiple quantitative trait locus (QTL) mapping. However, when the shrinkage model does not involve indicator variables for marker inclusion, QTL detection remains heavily dependent on significance thresholds derived from phenotype permutation under the null hypothesis of no phenotype-to-genotype association. This approach is computationally intensive and more importantly, the hypothetical data generation at the heart of the permutation-based method violates the Bayesian philosophy. Here we propose a fully Bayesian decision rule for QTL detection under the recently introduced extended Bayesian LASSO for QTL mapping. Our new decision rule is free of any hypothetical data generation and relies on the well-established Bayes factors for evaluating the evidence for QTL presence at any locus. Simulation results demonstrate the remarkable performance of our decision rule. An application to real-world data is considered as well. 相似文献

12.

A Bayesian Approach to Surrogacy Assessment Using Principal Stratification in Clinical Trials

Yun Li Jeremy M.G. Taylor Michael R. Elliott 《Biometrics》2010,66(2):523-531

Summary A surrogate marker (S) is a variable that can be measured earlier and often more easily than the true endpoint (T) in a clinical trial. Most previous research has been devoted to developing surrogacy measures to quantify how well S can replace T or examining the use of S in predicting the effect of a treatment (Z). However, the research often requires one to fit models for the distribution of T given S and Z. It is well known that such models do not have causal interpretations because the models condition on a postrandomization variable S. In this article, we directly model the relationship among T, S, and Z using a potential outcomes framework introduced by Frangakis and Rubin (2002, Biometrics 58 , 21–29). We propose a Bayesian estimation method to evaluate the causal probabilities associated with the cross‐classification of the potential outcomes of S and T when S and T are both binary. We use a log‐linear model to directly model the association between the potential outcomes of S and T through the odds ratios. The quantities derived from this approach always have causal interpretations. However, this causal model is not identifiable from the data without additional assumptions. To reduce the nonidentifiability problem and increase the precision of statistical inferences, we assume monotonicity and incorporate prior belief that is plausible in the surrogate context by using prior distributions. We also explore the relationship among the surrogacy measures based on traditional models and this counterfactual model. The method is applied to the data from a glaucoma treatment study. 相似文献

13.

Shrinkage Estimation Method for Mapping Multiple Quantitative Trait Loci

Yuan-Ming ZHANG 《Acta Genetica Sinica》2006,33(10):861-869

In this article, shrinkage estimation method for multiple-marker analysis and for mapping multiple quantitative trait loci (QTL) was reviewed. For multiple-marker analysis, Xu (Genetics, 2003, 163:789-801) developed a Bayesian shrinkage estimation (BSE) method. The key to the success of this method is to allow each marker effect have its own variance parameter, which in turn has its own prior distribution so that the variance can be estimated from the data. Under this hierarchical model, a large number of markers can be handled although most of them may have negligible effects. Under epistatic genetic model, however, the running time is very long. To overcome this problem, a novel method of incorporating the idea described above into maximum likelihood, known as penalized likelihood method, was proposed. A simulated study showed that this method can handle a model with multiple effects, which are ten times larger than the sample size. For multiple QTL analysis, two modified versions for the BSE method were introduced: one is the fixed-interval method and another is the variable-interval method. The former deals with markers with intermediate density, and the latter can handle markers with extremely high density as well as model with epistatic effects. For the detection of epistatic effects, penalized likelihood method and the variable-interval approach of the BSE method are available. 相似文献

14.

多QTL定位的压缩估计方法 总被引：1，自引：0，他引：1

章元明《遗传学报》2006,33(10):861-869

本文综述了多标记分析和多QTL定位的压缩估计方法。对于前者,Xu（Genetics,2003,163：789—801）首先提出了Bayesian压缩估计方法。其关键在于让每个效应有一个特定的方差参数,而该方差又服从一定的先验分布,以致能从资料中估计之。由此,能够同时估计大量分子标记基因座的遗传效应,即使大多数标记的效应是可忽略的。然而,对于上位性遗传模型,其运算时间还是过长。为此,笔者将上述思想嵌入极大似然法,提出了惩罚最大似然方法。模拟研究显示：该方法能处理变量个数大于样本容量10倍左右的线性遗传模型。对于后者,本文详细介绍了基于固定区间和可变区间的Bayesian压缩估计方法。固定区间方法可处理中等密度的分子标记资料;可变区间方法则可分析高密度分子标记资料,甚至是上位性遗传模型。对于上位性检测,已介绍的惩罚最大似然方法和可变区间Bayesian压缩估计方法可供利用。应当指出,压缩估计方法在今后的eQTL和QTN定位以及基因互作网络分析等研究中也是有应用价值的。相似文献

15.

家畜抗性等级性状的QTL定位方法

殷宗俊张勤张纪刚丁向东《遗传学报》2005,32(11):1147-1155

在广义线性模型的框架内模拟研究了家畜抗性等级性状的QTL定位方法,QTL参数的估计采用最大似然方法,比较了阈模型方法与一般线性方法的QTL定位效率,并对影响等级性状QTL定位效率的主要因素（QTL效应、性状的遗传力）进行了模拟研究,实验设计为多个家系的女儿设计,资源群体大小为500头。研究结果表明：在QTL位置参数估计及检验功效方面,阈模型方法具有一定的优势,对抗性等级性状QTL定位的功效也高于线性方法。另外,性状遗传力和QTL效应的大小对QTL定位的准确度也有直接的影响,随着性状遗传力QTL效应的相似文献

16.

A Bayesian model for sparse functional data 总被引：1，自引：0，他引：1

Thompson WK Rosen O 《Biometrics》2008,64(1):54-63

Summary. We propose a method for analyzing data which consist of curves on multiple individuals, i.e., longitudinal or functional data. We use a Bayesian model where curves are expressed as linear combinations of B-splines with random coefficients. The curves are estimated as posterior means obtained via Markov chain Monte Carlo (MCMC) methods, which automatically select the local level of smoothing. The method is applicable to situations where curves are sampled sparsely and/or at irregular time points. We construct posterior credible intervals for the mean curve and for the individual curves. This methodology provides unified, efficient, and flexible means for smoothing functional data. 相似文献

17.

Bayesian analysis for genetic architecture of dynamic traits

L Min R Yang X Wang B Wang 《Heredity》2011,106(1):124-133

The dissection of the genetic architecture of quantitative traits, including the number and locations of quantitative trait loci (QTL) and their main and epistatic effects, has been an important topic in current QTL mapping. We extend the Bayesian model selection framework for mapping multiple epistatic QTL affecting continuous traits to dynamic traits in experimental crosses. The extension inherits the efficiency of Bayesian model selection and the flexibility of the Legendre polynomial model fitting to the change in genetic and environmental effects with time. We illustrate the proposed method by simultaneously detecting the main and epistatic QTLs for the growth of leaf age in a doubled-haploid population of rice. The behavior and performance of the method are also shown by computer simulation experiments. The results show that our method can more quickly identify interacting QTLs for dynamic traits in the models with many numbers of genetic effects, enhancing our understanding of genetic architecture for dynamic traits. Our proposed method can be treated as a general form of mapping QTL for continuous quantitative traits, being easier to extend to multiple traits and to a single trait with repeat records. 相似文献

18.

Dynamic clinical prediction models for discrete time‐to‐event data with competing risks—A case study on the OUTCOMEREA database

Rachel Heyard Jean‐Franois Timsit Wafa Ibn Essaied Leonhard Held 《Biometrical journal. Biometrische Zeitschrift》2019,61(3):514-534

The development of clinical prediction models requires the selection of suitable predictor variables. Techniques to perform objective Bayesian variable selection in the linear model are well developed and have been extended to the generalized linear model setting as well as to the Cox proportional hazards model. Here, we consider discrete time‐to‐event data with competing risks and propose methodology to develop a clinical prediction model for the daily risk of acquiring a ventilator‐associated pneumonia (VAP) attributed to P. aeruginosa (PA) in intensive care units. The competing events for a PA VAP are extubation, death, and VAP due to other bacteria. Baseline variables are potentially important to predict the outcome at the start of ventilation, but may lose some of their predictive power after a certain time. Therefore, we use a landmark approach for dynamic Bayesian variable selection where the set of relevant predictors depends on the time already spent at risk. We finally determine the direct impact of a variable on each competing event through cause‐specific variable selection. 相似文献

19.

Bayesian Variable Selection for Multivariate Spatially Varying Coefficient Regression

Brian J. Reich Montserrat Fuentes Amy H. Herring Kelly R. Evenson 《Biometrics》2010,66(3):772-782

Summary Physical activity has many well‐documented health benefits for cardiovascular fitness and weight control. For pregnant women, the American College of Obstetricians and Gynecologists currently recommends 30 minutes of moderate exercise on most, if not all, days; however, very few pregnant women achieve this level of activity. Traditionally, studies have focused on examining individual or interpersonal factors to identify predictors of physical activity. There is a renewed interest in whether characteristics of the physical environment in which we live and work may also influence physical activity levels. We consider one of the first studies of pregnant women that examines the impact of characteristics of the built environment on physical activity levels. Using a socioecologic framework, we study the associations between physical activity and several factors including personal characteristics, meteorological/air quality variables, and neighborhood characteristics for pregnant women in four counties of North Carolina. We simultaneously analyze six types of physical activity and investigate cross‐dependencies between these activity types. Exploratory analysis suggests that the associations are different in different regions. Therefore, we use a multivariate regression model with spatially varying regression coefficients. This model includes a regression parameter for each covariate at each spatial location. For our data with many predictors, some form of dimension reduction is clearly needed. We introduce a Bayesian variable selection procedure to identify subsets of important variables. Our stochastic search algorithm determines the probabilities that each covariate's effect is null, non‐null but constant across space, and spatially varying. We found that individual‐level covariates had a greater influence on women's activity levels than neighborhood environmental characteristics, and some individual‐level covariates had spatially varying associations with the activity levels of pregnant women. 相似文献

20.

A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation

Simon Aeschbacher Mark A. Beaumont Andreas Futschik 《Genetics》2012,192(3):1027-1047

The choice of summary statistics is a crucial step in approximate Bayesian computation (ABC). Since statistics are often not sufficient, this choice involves a trade-off between loss of information and reduction of dimensionality. The latter may increase the efficiency of ABC. Here, we propose an approach for choosing summary statistics based on boosting, a technique from the machine-learning literature. We consider different types of boosting and compare them to partial least-squares regression as an alternative. To mitigate the lack of sufficiency, we also propose an approach for choosing summary statistics locally, in the putative neighborhood of the true parameter value. We study a demographic model motivated by the reintroduction of Alpine ibex (Capra ibex) into the Swiss Alps. The parameters of interest are the mean and standard deviation across microsatellites of the scaled ancestral mutation rate (θ_anc = 4N_eu) and the proportion of males obtaining access to matings per breeding season (ω). By simulation, we assess the properties of the posterior distribution obtained with the various methods. According to our criteria, ABC with summary statistics chosen locally via boosting with the L₂-loss performs best. Applying that method to the ibex data, we estimate

{\hat{θ}}_{anc} \approx 1.288

and find that most of the variation across loci of the ancestral mutation rate u is between 7.7 × 10⁻⁴ and 3.5 × 10⁻³ per locus per generation. The proportion of males with access to matings is estimated as

\hat{ω} \approx 0.21

, which is in good agreement with recent independent estimates. 相似文献