共查询到20条相似文献,搜索用时 15 毫秒
1.
As the two most popular models in survival analysis, the accelerated failure time (AFT) model can more easily fit survival data than the Cox proportional hazards model (PHM). In this study, we develop a general parametric AFT model for identifying survival trait loci, in which the flexible generalized F distribution, including many commonly used distributions as special cases, is specified as the baseline survival distribution. EM algorithm for maximum likelihood estimation of model parameters is given. Simulations are conducted to validate the flexibility and the utility of the proposed mapping procedure. In analyzing survival time following hyperoxic acute lung injury (HALI) of mice in an F(2) mating population, the generalized F distribution performed best among the six competing survival distributions and detected four QTLs controlling differential HALI survival. 相似文献
2.
3.
4.
5.
Shizhong Xu Zhiqiu Hu 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2010,121(1):47-63
We developed a generalized linear model of QTL mapping for discrete traits in line crossing experiments. Parameter estimation was achieved using two different algorithms, a mixture model-based EM (expectation–maximization) algorithm and a GEE (generalized estimating equation) algorithm under a heterogeneous residual variance model. The methods were developed using ordinal data, binary data, binomial data and Poisson data as examples. Applications of the methods to simulated as well as real data are presented. The two different algorithms were compared in the data analyses. In most situations, the two algorithms were indistinguishable, but when large QTL are located in large marker intervals, the mixture model-based EM algorithm can fail to converge to the correct solutions. Both algorithms were coded in C++ and interfaced with SAS as a user-defined SAS procedure called PROC QTL. 相似文献
6.
Genomic imprinting, a genetic phenomenon of non-equivalent allele expression that depends on parental origins, has been ubiquitously
observed in nature. It does not only control the traits of growth and development but also may be responsible for survival
traits. Based on the accelerated failure time model, we construct a general parametric model for mapping the imprinted QTL
(iQTL). Within the framework of interval mapping, maximum likelihood estimation of iQTL parameters is implemented via EM algorithm.
The imprinting patterns of the detected iQTL are statistically tested according to a series of null hypotheses. BIC model
selection criterion is employed to choose an optimal baseline hazard function with maximum likelihood and parsimonious parameters.
Simulations are used to validate the proposed mapping procedure. A published dataset from a mouse model system was used to
illustrate the proposed framework. Results show that among the five commonly used survival distributions, Log-logistic distribution
is the optimal baseline hazard function for mapping QTL of hyperoxic acute lung injury (HALI) survival; under the log-logistic
distribution, four QTLs were identified, in which only one QTL was inherited in Mendelian fashion, whereas others were imprinted
in different imprinting patterns. 相似文献
7.
Summary . In the presence of high-dimensional predictors, it is challenging to develop reliable regression models that can be used to accurately predict future outcomes. Further complications arise when the outcome of interest is an event time, which is often not fully observed due to censoring. In this article, we develop robust prediction models for event time outcomes by regularizing the Gehan's estimator for the accelerated failure time (AFT) model ( Tsiatis, 1996 , Annals of Statistics 18, 305–328) with least absolute shrinkage and selection operator (LASSO) penalty. Unlike existing methods based on the inverse probability weighting and the Buckley and James estimator ( Buckley and James, 1979 , Biometrika 66, 429–436), the proposed approach does not require additional assumptions about the censoring and always yields a solution that is convergent. Furthermore, the proposed estimator leads to a stable regression model for prediction even if the AFT model fails to hold. To facilitate the adaptive selection of the tuning parameter, we detail an efficient numerical algorithm for obtaining the entire regularization path. The proposed procedures are applied to a breast cancer dataset to derive a reliable regression model for predicting patient survival based on a set of clinical prognostic factors and gene signatures. Finite sample performances of the procedures are evaluated through a simulation study. 相似文献
8.
Rank-based inference for the accelerated failure time model 总被引:10,自引:0,他引:10
9.
The mapping of quantitative trait loci (QTL) is to identify molecular markers or genomic loci that influence the variation of complex traits. The problem is complicated by the facts that QTL data usually contain a large number of markers across the entire genome and most of them have little or no effect on the phenotype. In this article, we propose several Bayesian hierarchical models for mapping multiple QTL that simultaneously fit and estimate all possible genetic effects associated with all markers. The proposed models use prior distributions for the genetic effects that are scale mixtures of normal distributions with mean zero and variances distributed to give each effect a high probability of being near zero. We consider two types of priors for the variances, exponential and scaled inverse-chi(2) distributions, which result in a Bayesian version of the popular least absolute shrinkage and selection operator (LASSO) model and the well-known Student's t model, respectively. Unlike most applications where fixed values are preset for hyperparameters in the priors, we treat all hyperparameters as unknowns and estimate them along with other parameters. Markov chain Monte Carlo (MCMC) algorithms are developed to simulate the parameters from the posteriors. The methods are illustrated using well-known barley data. 相似文献
10.
Two major aspects have made the genetic and genomic study of polyploids extremely difficult. First, increased allelic or nonallelic combinations due to multiple alleles result in complex gene actions and interactions for quantitative trait loci (QTL) in polyploids. Second, meiotic configurations in polyploids undergo a complex biological process including either bivalent or multivalent formation, or both. For bivalent polyploids, different degrees of preferential chromosome pairings may occur during meiosis. In this article, we develop a maximum-likelihood-based model for mapping QTL in tetraploids by considering the quantitative inheritance and meiotic mechanism of bivalent polyploids. This bivalent polyploid model is implemented with the EM algorithm to simultaneously estimate QTL position, QTL effects, and QTL-marker linkage phases by incorporating the impact of a cytological parameter determining bivalent chromosome pairings (the preferential pairing factor). Simulation studies are performed to investigate the performance and robustness of our statistical method for parameter estimation. The implication and extension of the bivalent polyploid model are discussed. 相似文献
11.
We present an approach for quantitative trait locus (QTL) mapping, termed as ‘lineage-specific QTL mapping'', for inferring allelic changes of QTL evolution along with branches in a phylogeny. We describe and analyze the simplest case: by adding a third taxon into the normal procedure of QTL mapping between pairs of taxa, such inferences can be made along lineages to a presumed common ancestor. Although comparisons of QTL maps among species can identify homology of QTLs by apparent co-location, lineage-specific mapping of QTL can classify homology into (1) orthology (shared origin of QTL) versus (2) paralogy (independent origin of QTL within resolution of map distance). In this light, we present a graphical method that identifies six modes of QTL evolution in a three taxon comparison. We then apply our model to map lineage-specific QTLs for inbreeding among three taxa of yellow monkey-flower: Mimulus guttatus and two inbreeders M. platycalyx and M. micranthus, but critically assuming outcrossing was the ancestral state. The two most common modes of homology across traits were orthologous (shared ancestry of mutation for QTL alleles). The outbreeder M. guttatus had the fewest lineage-specific QTL, in accordance with the presumed ancestry of outbreeding. Extensions of lineage-specific QTL mapping to other types of data and crosses, and to inference of ancestral QTL state, are discussed. 相似文献
12.
Case-cohort analysis with accelerated failure time model 总被引:1,自引:0,他引:1
Summary . In a case–cohort design, covariates are assembled only for a subcohort that is randomly selected from the entire cohort and any additional cases outside the subcohort. This design is appealing for large cohort studies of rare disease, especially when the exposures of interest are expensive to ascertain for all the subjects. We propose statistical methods for analyzing the case–cohort data with a semiparametric accelerated failure time model that interprets the covariates effects as to accelerate or decelerate the time to failure. Asymptotic properties of the proposed estimators are developed. The finite sample properties of case–cohort estimator and its relative efficiency to full cohort estimator are assessed via simulation studies. A real example from a study of cardiovascular disease is provided to illustrate the estimating procedure. 相似文献
13.
A Bayesian semiparametric approach is described for an accelerated failure time model. The error distribution is assigned a Pólya tree prior and the regression parameters a noninformative hierarchical prior. Two cases are considered: the first assumes error terms are exchangeable; the second assumes that error terms are partially exchangeable. A Markov chain Monte Carlo algorithm is described to obtain a predictive distribution for a future observation given both uncensored and censored data. 相似文献
14.
Multiple interval mapping for quantitative trait loci. 总被引:72,自引:0,他引:72
A new statistical method for mapping quantitative trait loci (QTL), called multiple interval mapping (MIM), is presented. It uses multiple marker intervals simultaneously to fit multiple putative QTL directly in the model for mapping QTL. The MIM model is based on Cockerham's model for interpreting genetic parameters and the method of maximum likelihood for estimating genetic parameters. With the MIM approach, the precision and power of QTL mapping could be improved. Also, epistasis between QTL, genotypic values of individuals, and heritabilities of quantitative traits can be readily estimated and analyzed. Using the MIM model, a stepwise selection procedure with likelihood ratio test statistic as a criterion is proposed to identify QTL. This MIM method was applied to a mapping data set of radiata pine on three traits: brown cone number, tree diameter, and branch quality scores. Based on the MIM result, seven, six, and five QTL were detected for the three traits, respectively. The detected QTL individually contributed from approximately 1 to 27% of the total genetic variation. Significant epistasis between four pairs of QTL in two traits was detected, and the four pairs of QTL contributed approximately 10.38 and 14.14% of the total genetic variation. The asymptotic variances of QTL positions and effects were also provided to construct the confidence intervals. The estimated heritabilities were 0.5606, 0.5226, and 0. 3630 for the three traits, respectively. With the estimated QTL effects and positions, the best strategy of marker-assisted selection for trait improvement for a specific purpose and requirement can be explored. The MIM FORTRAN program is available on the worldwide web (http://www.stat.sinica.edu.tw/chkao/). 相似文献
15.
Marker pair selection for mapping quantitative trait loci 总被引:10,自引:0,他引:10
Mapping of quantitative trait loci (QTL) for backcross and F(2) populations may be set up as a multiple linear regression problem, where marker types are the regressor variables. It has been shown previously that flanking markers absorb all information on isolated QTL. Therefore, selection of pairs of markers flanking QTL is useful as a direct approach to QTL detection. Alternatively, selected pairs of flanking markers can be used as cofactors in composite interval mapping (CIM). Overfitting is a serious problem, especially if the number of regressor variables is large. We suggest a procedure denoted as marker pair selection (MPS) that uses model selection criteria for multiple linear regression. Markers enter the model in pairs, which reduces the number of models to be considered, thus alleviating the problem of overfitting and increasing the chances of detecting QTL. MPS entails an exhaustive search per chromosome to maximize the chance of finding the best-fitting models. A simulation study is conducted to study the merits of different model selection criteria for MPS. On the basis of our results, we recommend the Schwarz Bayesian criterion (SBC) for use in practice. 相似文献
16.
Most quantitative trait loci (QTL) mapping experiments typically collect phenotypic data on multiple correlated complex traits. However, there is a lack of a comprehensive genomewide mapping strategy for correlated traits in the literature. We develop Bayesian multiple-QTL mapping methods for correlated continuous traits using two multivariate models: one that assumes the same genetic model for all traits, the traditional multivariate model, and the other known as the seemingly unrelated regression (SUR) model that allows different genetic models for different traits. We develop computationally efficient Markov chain Monte Carlo (MCMC) algorithms for performing joint analysis. We conduct extensive simulation studies to assess the performance of the proposed methods and to compare with the conventional single-trait model. Our methods have been implemented in the freely available package R/qtlbim (http://www.qtlbim.org), which greatly facilitates the general usage of the Bayesian methodology for unraveling the genetic architecture of complex traits. 相似文献
17.
The power to separate the variance of a quantitative trait locus (QTL) from the polygenic variance is determined by the variability
of genes identical by descent (IBD) at the QTL. This variability may increase with inbreeding. Selfing, the most extreme form
of inbreeding, increases the variability of the IBD value shared by siblings, and thus has a higher efficiency for QTL mapping
than random mating. In self-incompatible organisms, sib mating is the closest form of inbreeding. Similar to selfing, sib
mating may also increase the power of QTL detection relative to random mating. In this study, we develop an IBD-based method
under sib mating designs for QTL mapping. The efficiency of sib mating is then compared with random mating. Monte Carlo simulations
show that sib mating designs notably increase the power for QTL detection. When power is intermediate, the power to detect
a QTL using full-sib mating is, on average, 7% higher than under random mating. In addition, the IBD-based method proposed
in this paper can be used to combine data from multiple families. As a result, the estimated QTL parameters can be applied
to a wide statistical inference space relating to the entire reference population.
This revised version was published online in July 2006 with corrections to the Cover Date. 相似文献
18.
Nonparametric analysis of an accelerated failure time model 总被引:1,自引:0,他引:1
19.
20.
A general mixture model for mapping quantitative trait loci by using molecular markers 总被引:3,自引:0,他引:3
R. C. Jansen 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》1992,85(2-3):252-260
Summary In a segregating population a quantitative trait may be considered to follow a mixture of (normal) distributions, the mixing proportions being based on Mendelian segregation rules. A general and flexible mixture model is proposed for mapping quantitative trait loci (QTLs) by using molecular markers. A method is discribed to fit the model to data. The model makes it possible to (1) analyse non-normally distributed traits such as lifetimes, counts or percentages in addition to normally distributed traits, (2) reduce environmental variation by taking into account the effects of experimental design factors and interaction between genotype and environment, (3) reduce genotypic variation by taking into account the effects of two or more QTLs simultaneously, (4) carry out a (combined) analysis of different population types, (5) estimate recombination frequencies between markers or use known marker distances, (6) cope with missing marker observations, (7) use markers as covariables in detection and mapping of QTLs, and finally to (8) implement the mapping in standard statistical packages. 相似文献