Found 11 similar articles (search time: 0 ms)
Summary: Standard prospective logistic regression analysis of case–control data often leads to very imprecise estimates of gene-environment interactions because few cases or controls fall in the cells formed by crossing genotype and exposure. In contrast, under the assumption of gene-environment independence, modern "retrospective" methods, including the "case-only" approach, can estimate the interaction parameters much more precisely, but they can be seriously biased when the underlying assumption of gene-environment independence is violated. In this article, we propose a novel empirical Bayes-type shrinkage estimator to analyze case–control data that can relax the gene-environment independence assumption in a data-adaptive fashion. In the special case involving a binary gene and a binary exposure, the method leads to an estimator of the interaction log odds ratio parameter in a simple closed form that corresponds to a weighted average of the standard case-only and case–control estimators. We also describe a general approach for deriving the new shrinkage estimator and its variance within the retrospective maximum-likelihood framework developed by Chatterjee and Carroll (2005, Biometrika 92, 399–418). Both simulated and real data examples suggest that the proposed estimator strikes a balance between bias and efficiency depending on the true nature of the gene-environment association and the sample size for a given study.
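The binary-gene, binary-exposure case described in this abstract can be illustrated with a minimal Python sketch. The case-only and case–control interaction estimators below are the standard 2x2-table log odds ratios. The shrinkage weight, however, is an assumption made for illustration (an empirical-Bayes-style weight built from the case–control variance and the squared gene-environment log odds ratio among controls); the abstract does not state the weight, so it should not be read as the authors' exact formula. All function names are hypothetical.

```python
import math


def log_or(a, b, c, d):
    """Log odds ratio of the 2x2 table [[a, b], [c, d]]."""
    return math.log((a * d) / (b * c))


def interaction_estimates(cases, controls):
    """Counts are ordered (G=1,E=1), (G=1,E=0), (G=0,E=1), (G=0,E=0)."""
    cases, controls = tuple(cases), tuple(controls)
    # Case-only estimator: gene-environment log odds ratio among cases.
    beta_co = log_or(*cases)
    # Gene-environment log odds ratio among controls.
    theta_ge = log_or(*controls)
    # Standard case-control interaction estimator.
    beta_cc = beta_co - theta_ge
    # Large-sample variance of beta_cc: sum of reciprocals of all eight cells.
    var_cc = sum(1.0 / n for n in cases + controls)
    return beta_co, beta_cc, theta_ge, var_cc


def shrinkage_estimate(beta_co, beta_cc, theta_ge, var_cc):
    """Weighted average of the case-only and case-control estimators.

    ASSUMPTION: the weight below is illustrative, not taken from the abstract.
    It moves toward the case-only estimator when the controls show little
    gene-environment association, and toward the case-control estimator
    otherwise.
    """
    w = var_cc / (var_cc + theta_ge ** 2)
    return w * beta_co + (1.0 - w) * beta_cc


if __name__ == "__main__":
    co, cc, theta, var = interaction_estimates((40, 60, 50, 150), (20, 80, 60, 140))
    print(co, cc, shrinkage_estimate(co, cc, theta, var))
```

Because the weight lies between 0 and 1, the combined estimate always falls between the two component estimators, which is the bias-efficiency trade-off the abstract describes.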

Naskar M  Das K 《Biometrics》2006,62(4):1004-1013
In medical studies, paired binary responses are often observed for each study subject over timepoints or clusters. A primary interest is to investigate how the bivariate association and marginal univariate risks are affected by repeated measurements on each subject. To achieve this we propose a very general class of semiparametric bivariate binary models. The subject-specific effects involved in the bivariate log odds ratio and the univariate logit components are assumed to follow a nonparametric Dirichlet process (DP). We propose a hybrid method to draw model-based inferences. In the framework of the proposed hybrid method, estimation of parameters is done by implementing the Monte Carlo expectation-maximization algorithm. The proposed methodology is illustrated through a study on the effectiveness of tibolone for reducing menopausal problems experienced by Indian women. A simulation study is also conducted to evaluate the efficiency of the new methodology.

Logistic regression of family data from case-control studies   Cited by: 3 (self-citations: 0, citations by others: 3)
WHITTEMORE  ALICE S. 《Biometrika》1995,82(1):57-67

In many modern experimental settings, observations are obtained in the form of functions and interest focuses on inferences about a collection of such functions. We propose a hierarchical model that allows us simultaneously to estimate multiple curves nonparametrically by using dependent Dirichlet process mixtures of Gaussian distributions to characterize the joint distribution of predictors and outcomes. Function estimates are then induced through the conditional distribution of the outcome given the predictors. The resulting approach allows for flexible estimation and clustering, while borrowing information across curves. We also show that the function estimates we obtain are consistent on the space of integrable functions. As an illustration, we consider an application to the analysis of conductivity and temperature at depth data in the North Atlantic.

Models of amino acid substitution present challenges beyond those often faced with the analysis of DNA sequences. The alignments of amino acid sequences are often small, whereas the number of parameters to be estimated is potentially large when compared with the number of free parameters for nucleotide substitution models. Most approaches to the analysis of amino acid alignments have focused on the use of fixed amino acid models in which all of the potentially free parameters are fixed to values estimated from a large number of sequences. Often, these fixed amino acid models are specific to a gene or taxonomic group (e.g. the Mtmam model, which has parameters that are specific to mammalian mitochondrial gene sequences). Although the fixed amino acid models succeed in reducing the number of free parameters to be estimated (indeed, they reduce the number of free parameters from approximately 200 to 0), it is possible that none of the currently available fixed amino acid models is appropriate for a specific alignment. Here, we present four approaches to the analysis of amino acid sequences. First, we explore the use of a general time reversible model of amino acid substitution using a Dirichlet prior probability distribution on the 190 exchangeability parameters. Second, we then explore the behaviour of prior probability distributions that are 'centred' on the rates specified by the fixed amino acid model. Third, we consider a mixture of fixed amino acid models. Finally, we consider constraints on the exchangeability parameters as partitions, similar to how nucleotide substitution models are specified, and place a Dirichlet process prior model on all the possible partitioning schemes.
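The parameter counts quoted in this abstract follow from simple combinatorics, which a short Python sketch can make explicit. A general time reversible model on n states has one exchangeability parameter per unordered pair of distinct states, giving 190 for the 20 amino acids. The function name and the convention of counting n - 1 free stationary frequencies are assumptions made for illustration; the abstract itself gives only the figures 190 and "approximately 200".

```python
def gtr_parameter_counts(n_states):
    """Return (exchangeabilities, free stationary frequencies) for n states.

    ASSUMPTION: stationary frequencies are counted as n - 1 free values
    because they must sum to one; other conventions shift the total slightly.
    """
    # One exchangeability parameter per unordered pair of distinct states.
    exchangeabilities = n_states * (n_states - 1) // 2
    free_frequencies = n_states - 1
    return exchangeabilities, free_frequencies


if __name__ == "__main__":
    for name, n in (("nucleotides", 4), ("amino acids", 20)):
        exch, freq = gtr_parameter_counts(n)
        print(name, exch, freq, exch + freq)
```

Comparing the nucleotide and amino acid cases shows why the abstract treats amino acid models as parameter-rich relative to typical alignment sizes.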

Summary: We propose a semiparametric Bayesian method for handling measurement error in nutritional epidemiological data. Our goal is to estimate nonparametrically the form of association between a disease and exposure variable while the true values of the exposure are never observed. Motivated by nutritional epidemiological data, we consider the setting where a surrogate covariate is recorded in the primary data, and a calibration data set contains information on the surrogate variable and repeated measurements of an unbiased instrumental variable of the true exposure. We develop a flexible Bayesian method where not only is the relationship between the disease and exposure variable treated semiparametrically, but also the relationship between the surrogate and the true exposure is modeled semiparametrically. The two nonparametric functions are modeled simultaneously via B‐splines. In addition, we model the distribution of the exposure variable as a Dirichlet process mixture of normal distributions, thus making its modeling essentially nonparametric and placing this work into the context of functional measurement error modeling. We apply our method to the NIH‐AARP Diet and Health Study and examine its performance in a simulation study.

Recent advancements in miniaturized fluorescence microscopy have made it possible to investigate neuronal responses to external stimuli in awake behaving animals through the analysis of intracellular calcium signals. An ongoing challenge is deconvolving the temporal signals to extract the spike trains from the noisy calcium signals' time series. In this article, we propose a nested Bayesian finite mixture specification that allows the estimation of spiking activity and, simultaneously, the reconstruction of the distributions of the calcium transient spikes' amplitudes under different experimental conditions. The proposed model leverages two nested layers of random discrete mixture priors to borrow information between experiments and discover similarities in the distributional patterns of neuronal responses to different stimuli. Furthermore, the spikes' intensity values are also clustered within and between experimental conditions to determine the existence of common (recurring) response amplitudes. Simulation studies and the analysis of a dataset from the Allen Brain Observatory show the effectiveness of the method in clustering and detecting neuronal activities.

Informative drop-out arises in longitudinal studies when the subject's follow-up time depends on the unobserved values of the response variable. We specify a semiparametric linear regression model for the repeatedly measured response variable and an accelerated failure time model for the time to informative drop-out. The error terms from the two models are assumed to have a common, but completely arbitrary joint distribution. Using a rank-based estimator for the accelerated failure time model and an artificial censoring device, we construct an asymptotically unbiased estimating function for the linear regression model. The resultant estimator is shown to be consistent and asymptotically normal. A resampling scheme is developed to estimate the limiting covariance matrix. Extensive simulation studies demonstrate that the proposed methods are suitable for practical use. Illustrations with data taken from two AIDS clinical trials are provided.

Greenland S 《Biometrics》2003,59(1):92-99
Conjugate priors for Bayesian analyses of relative risks can be quite restrictive, because their shape depends on their location. By introducing a separate location parameter, however, these priors generalize to allow modeling of a broad range of prior opinions, while still preserving the computational simplicity of conjugate analyses. The present article illustrates the resulting generalized conjugate analyses using examples from case-control studies of the association of residential wire codes and magnetic fields with childhood leukemia.

Bias reduction of maximum likelihood estimates   Cited by: 9 (self-citations: 0, citations by others: 9)
FIRTH  DAVID 《Biometrika》1993,80(1):27-38

