Similar Articles
20 similar articles found (search time: 0 ms)
3.
Approximate methods using ranks for regression with censored data   (Cited by: 1; self-citations: 0, by others: 1)
PETTITT A. N. Biometrika, 1983, 70(1):121-132

4.
AZZALINI A. Biometrika, 1994, 81(4):767-775

5.
Pan Z, Lin DY. Biometrics, 2005, 61(4):1000-1009
We develop graphical and numerical methods for checking the adequacy of generalized linear mixed models (GLMMs). These methods are based on the cumulative sums of residuals over covariates or predicted values of the response variable. Under the assumed model, the asymptotic distributions of these stochastic processes can be approximated by certain zero-mean Gaussian processes, whose realizations can be generated through Monte Carlo simulation. Each observed process can then be compared, both visually and analytically, to a number of realizations simulated from the null distribution. These comparisons enable one to assess objectively whether the observed residual patterns reflect model misspecification or random variation. The proposed methods are particularly useful for checking the functional form of a covariate or the link function. Extensive simulation studies show that the proposed goodness-of-fit tests have proper sizes and are sensitive to model misspecification. Applications to two medical studies lead to improved models.
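The cumulative-residual idea in entry 5 can be sketched in a few lines. This is a simplified illustration on a plain linear model, not the paper's GLMM construction: instead of simulating the exact zero-mean Gaussian limit, it flips residual signs (a wild-bootstrap analog that ignores the parameter-estimation effect) to generate null realizations of the process. All data and quantities here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated data: a correctly specified linear model fit by least squares.
n = 200
x = rng.uniform(0, 1, n)
y = 2.0 * x + rng.normal(0, 0.5, n)
X = np.column_stack([np.ones(n), x])
beta = np.linalg.lstsq(X, y, rcond=None)[0]
resid = y - X @ beta

# Observed cumulative-residual process, ordered by the covariate.
order = np.argsort(x)
W_obs = np.cumsum(resid[order]) / np.sqrt(n)
stat_obs = np.abs(W_obs).max()          # supremum-type test statistic

# Crude stand-in for the paper's simulated Gaussian realizations:
# flip residual signs at random and recompute the process.
B = 1000
stats = np.empty(B)
for b in range(B):
    signs = rng.choice([-1.0, 1.0], size=n)
    W_b = np.cumsum((resid * signs)[order]) / np.sqrt(n)
    stats[b] = np.abs(W_b).max()

p_value = (stats >= stat_obs).mean()    # large p: no evidence of misspecification
```

A systematic drift in `W_obs` relative to the simulated realizations would suggest a wrong functional form for `x`.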

6.
Roy J, Lin X. Biometrics, 2000, 56(4):1047-1054
Multiple outcomes are often used to properly characterize an effect of interest. This paper proposes a latent variable model for the situation where repeated measures over time are obtained on each outcome. These outcomes are assumed to measure an underlying quantity of main interest from different perspectives. We relate the observed outcomes using regression models to a latent variable, which is then modeled as a function of covariates by a separate regression model. Random effects are used to model the correlation due to repeated measures of the observed outcomes and the latent variable. An EM algorithm is developed to obtain maximum likelihood estimates of model parameters. Unit-specific predictions of the latent variables are also calculated. This method is illustrated using data from a national panel study on changes in methadone treatment practices.

8.
When two binary responses are measured for each study subject across time, it may be of interest to model how the bivariate associations and marginal univariate risks involving the two responses change across time. To achieve such a goal, marginal models with bivariate log odds ratio and univariate logit components are extended to include random effects for all components. Specifically, separate normal random effects are specified on the log odds ratio scale for bivariate responses and on the logit scale for univariate responses. Assuming conditional independence given the random effects facilitates the modeling of bivariate associations across time with missing at random incomplete data. We fit the model to a dataset for which such structures are feasible: a longitudinal randomized trial of a cardiovascular educational program where the responses of interest are change in hypertension and hypercholesterolemia status. The proposed model is compared to a naive bivariate model that assumes independence between time points and univariate mixed effects logit models.

9.
We propose new resampling‐based approaches to construct asymptotically valid time‐simultaneous confidence bands for cumulative hazard functions in multistate Cox models. In particular, we exemplify the methodology in detail for the simple Cox model with time‐dependent covariates, where the data may be subject to independent right‐censoring or left‐truncation. We use simulations to investigate their finite sample behavior. Finally, the methods are utilized to analyze two empirical examples with survival and competing risks data.

10.
A common and important problem in clustered sampling designs is that the effect of within-cluster exposures (i.e., exposures that vary within clusters) on outcome may be confounded by both measured and unmeasured cluster-level factors (i.e., measurements that do not vary within clusters). When some of these are inadequately accounted for or ignored, estimation of this effect through population-averaged models or random-effects models may introduce bias. We accommodate this by developing a general theory for the analysis of clustered data, which enables consistent and asymptotically normal estimation of the effects of within-cluster exposures in the presence of cluster-level confounders. Semiparametric efficient estimators are obtained by solving so-called conditional generalized estimating equations. We compare this approach with a popular proposal by Neuhaus and Kalbfleisch (1998, Biometrics 54, 638–645) who separate the exposure effect into a within- and a between-cluster component within a random intercept model. We find that the latter approach yields consistent and efficient estimators when the model is linear, but is less flexible in terms of model specification. Under nonlinear models, this approach may yield inconsistent and inefficient estimators, though with little bias in most practical settings.

11.
Li Y, Lin X. Biometrics, 2003, 59(1):25-35
In the analysis of clustered categorical data, it is of common interest to test for the correlation within clusters, and the heterogeneity across different clusters. We address this problem by proposing a class of score tests for the null hypothesis that the variance components are zero in random effects models, for clustered nominal and ordinal categorical responses. We extend the results to accommodate clustered censored discrete time-to-event data. We next consider such tests in the situation where covariates are measured with errors. We propose using the SIMEX method to construct the score tests for the null hypothesis that the variance components are zero. Key advantages of the proposed score tests are that they can be easily implemented by fitting standard polytomous regression models and discrete failure time models, and that they are robust in the sense that no assumptions need to be made regarding the distributions of the random effects and the unobserved covariates. The asymptotic properties of the proposed tests are studied. We illustrate these tests by analyzing two data sets and evaluate their performance with simulations.
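Entry 11 builds its tests on the SIMEX method for covariate measurement error. A minimal sketch of the SIMEX idea itself, on a plain linear model rather than the paper's score tests, with simulated data and hypothetical parameter values: add extra measurement error at increasing levels, refit at each level, then extrapolate the estimate back to the no-error level λ = −1.

```python
import numpy as np

rng = np.random.default_rng(1)

# True model: y = beta * x + eps, but we only observe w = x + u
# (classical measurement error with known variance sigma_u^2).
n, beta_true, sigma_u = 5000, 1.0, 0.5
x = rng.normal(0, 1, n)
w = x + rng.normal(0, sigma_u, n)
y = beta_true * x + rng.normal(0, 0.3, n)

def slope(v, y):
    # Least-squares slope of y on a single centered predictor.
    v = v - v.mean()
    return (v @ (y - y.mean())) / (v @ v)

# SIMEX step: add extra error with variance lam * sigma_u^2,
# refit, and average over replicates at each level lam.
lams = np.array([0.0, 0.5, 1.0, 1.5, 2.0])
est = []
for lam in lams:
    reps = [slope(w + rng.normal(0, np.sqrt(lam) * sigma_u, n), y)
            for _ in range(50)]
    est.append(np.mean(reps))

# Extrapolation step: quadratic fit in lam, evaluated at lam = -1,
# i.e., the hypothetical error-free measurement.
coef = np.polyfit(lams, est, 2)
beta_simex = np.polyval(coef, -1.0)
beta_naive = slope(w, y)   # attenuated toward zero by measurement error
```

The naive slope is attenuated by the factor 1/(1 + σ_u²) here; the SIMEX extrapolation recovers most of that bias.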

12.
In many applications of hierarchical models, there is often interest in evaluating the inherent heterogeneity in view of observed data. When the underlying hypothesis involves parameters resting on the boundary of their support space such as variances and mixture proportions, it is a usual practice to entertain testing procedures that rely on common heterogeneity assumptions. Such procedures, albeit omnibus for general alternatives, may entail a substantial loss of power for specific alternatives such as heterogeneity varying with covariates. We introduce a novel and flexible approach that uses covariate information to improve the power to detect heterogeneity, without imposing unnecessary restrictions. With continuous covariates, the approach does not impose a regression model relating heterogeneity parameters to covariates or rely on arbitrary discretizations. Instead, a scanning approach requiring continuous dichotomizations of the covariates is proposed. Empirical processes resulting from these dichotomizations are then used to construct the test statistics, with limiting null distributions shown to be functionals of tight random processes. We illustrate our proposals and results on a popular class of two-component mixture models, followed by simulation studies and applications to two real datasets in cancer and caries research.

13.
Metric data are usually assessed on a continuous scale with good precision, but sometimes agricultural researchers cannot obtain precise measurements of a variable. Values of such a variable cannot then be expressed as real numbers (e.g., 1.51 or 2.56), but often can be represented by intervals into which the values fall (e.g., from 1 to 2 or from 2 to 3). In this situation, statisticians talk about censoring and censored data, as opposed to missing data, where no information is available at all. Traditionally, in agriculture and biology, three methods have been used to analyse such data: (a) when intervals are narrow, some form of imputation (e.g., mid‐point imputation) is used to replace the interval and traditional methods for continuous data are employed (such as analyses of variance [ANOVA] and regression); (b) for time‐to‐event data, the cumulative proportions of individuals that experienced the event of interest are analysed, instead of the individual observed times‐to‐event; (c) when intervals are wide and many individuals are collected, non‐parametric methods of data analysis are favoured, where counts are considered instead of the individual observed value for each sample element. In this paper, we show that these methods may be suboptimal: The first one does not respect the process of data collection, the second leads to unreliable standard errors (SEs), while the third does not make full use of all the available information. As an alternative, methods of survival analysis for censored data can be useful, leading to reliable inferences and sound hypothesis testing. These methods are illustrated using three examples from plant and crop sciences.
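The survival-analysis alternative advocated in entry 13 can be sketched directly: rather than imputing mid-points, maximize the interval-censored likelihood, where each observation contributes P(L < T ≤ R) = S(L) − S(R). This is a hedged illustration under assumptions not taken from the paper (a Weibull model, simulated data, unit-width intervals); mid-points appear only as the naive comparator the text criticizes.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(2)

# Simulate exact Weibull times, then record only the unit-width
# interval (floor(t), floor(t) + 1] each value falls into.
n, shape_true, scale_true = 400, 1.5, 3.0
t = scale_true * rng.weibull(shape_true, n)
left = np.floor(t)
right = left + 1.0

def neg_loglik(params):
    shape, scale = np.exp(params)                  # log scale keeps both positive
    S = lambda u: np.exp(-(u / scale) ** shape)    # Weibull survival function
    p = S(left) - S(right)                         # P(left < T <= right)
    return -np.sum(np.log(np.clip(p, 1e-300, None)))

fit = minimize(neg_loglik, x0=np.log([1.0, 1.0]), method="Nelder-Mead")
shape_hat, scale_hat = np.exp(fit.x)

# The naive alternative criticized in the text: replace each interval
# by its mid-point and pretend the data were observed exactly.
midpoints = (left + right) / 2.0
```

The interval likelihood respects how the data were actually collected, which is the paper's central point.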

14.
Kaitlyn Cook, Wenbin Lu, Rui Wang. Biometrics, 2023, 79(3):1670-1685
The Botswana Combination Prevention Project was a cluster-randomized HIV prevention trial whose follow-up period coincided with Botswana's national adoption of a universal test and treat strategy for HIV management. Of interest is whether, and to what extent, this change in policy modified the preventative effects of the study intervention. To address such questions, we adopt a stratified proportional hazards model for clustered interval-censored data with time-dependent covariates and develop a composite expectation maximization algorithm that facilitates estimation of model parameters without placing parametric assumptions on either the baseline hazard functions or the within-cluster dependence structure. We show that the resulting estimators for the regression parameters are consistent and asymptotically normal. We also propose and provide theoretical justification for the use of the profile composite likelihood function to construct a robust sandwich estimator for the variance. We characterize the finite-sample performance and robustness of these estimators through extensive simulation studies. Finally, we conclude by applying this stratified proportional hazards model to a re-analysis of the Botswana Combination Prevention Project, with the national adoption of a universal test and treat strategy now modeled as a time-dependent covariate.

15.
When novel scientific questions arise after longitudinal binary data have been collected, the subsequent selection of subjects from the cohort for whom further detailed assessment will be undertaken is often necessary to efficiently collect new information. Key examples of additional data collection include retrospective questionnaire data, novel data linkage, or evaluation of stored biological specimens. In such cases, all data required for the new analyses are available except for the new target predictor or exposure. We propose a class of longitudinal outcome-dependent sampling schemes and detail a design corrected conditional maximum likelihood analysis for highly efficient estimation of time-varying and time-invariant covariate coefficients when resource limitations prohibit exposure ascertainment on all participants. Additionally, we detail an important study planning phase that exploits available cohort data to proactively examine the feasibility of any proposed substudy as well as to inform decisions regarding the most desirable study design. The proposed designs and associated analyses are discussed in the context of a study that seeks to examine the modifying effect of an interleukin-10 cytokine single nucleotide polymorphism on asthma symptom regression in adolescents participating in the Childhood Asthma Management Program Continuation Study. Using this example we assume that all data necessary to conduct the study are available except subject-specific genotype data. We also assume that these data would be ascertained by analyzing stored blood samples, the cost of which limits the sample size.

17.
Neurobehavioral tests are used to assess early neonatal behavioral functioning and detect effects of prenatal and perinatal events. However, common measurement and data collection methods create specific data features requiring thoughtful statistical analysis. Assessment response measurements are often ordinal scaled, not interval scaled; the magnitude of the physical response may not directly correlate with the underlying state of developmental maturity; and a subject's assessment record may be censored. Censoring occurs when the milestone is exhibited at the first test (left censoring), when the milestone is not exhibited before the end of the study (right censoring), or when the exact age of attaining the milestone is uncertain due to irregularly spaced test sessions or missing data (interval censoring). Such milestone data is best analyzed using survival analysis methods. Two methods are contrasted: the non-parametric Kaplan-Meier estimator and the fully parametric interval censored regression. The methods represent the spectrum of survival analyses in terms of parametric assumptions, ability to handle simultaneous testing of multiple predictors, and accommodation of different types of censoring. Both methods were used to assess birth weight status and sex effects on 14 separate test items from assessments on 255 healthy pigtailed macaques. The methods gave almost identical results. Compared to the normal birth weight group, the low birth weight group had significantly delayed development on all but one test item. Within the low birth weight group, males had significantly delayed development for some responses relative to females.
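The non-parametric Kaplan-Meier estimator contrasted in entry 17 takes only a few lines to write from scratch. A minimal sketch with hypothetical milestone data handling right censoring only (the interval-censored regression it is compared against is not shown here):

```python
import numpy as np

def kaplan_meier(times, events):
    """Kaplan-Meier survival estimate.
    times: observed ages; events: 1 = milestone attained, 0 = right-censored."""
    times = np.asarray(times, float)
    events = np.asarray(events, int)
    uniq = np.unique(times[events == 1])   # distinct milestone ages
    surv, s = [], 1.0
    for t in uniq:
        at_risk = np.sum(times >= t)                   # still under observation
        d = np.sum((times == t) & (events == 1))       # milestones attained at t
        s *= 1.0 - d / at_risk                         # product-limit update
        surv.append(s)
    return uniq, np.array(surv)

# Hypothetical ages (days) at milestone attainment; 0 marks right-censoring.
ages = [3, 5, 5, 8, 10, 12, 12, 15]
seen = [1, 1, 0, 1, 1, 0, 1, 0]
t_grid, s_hat = kaplan_meier(ages, seen)
# s_hat steps down at each attained-milestone age: 0.875, 0.75, 0.6, 0.45, 0.3
```

Censored subjects contribute to the risk sets before their censoring age but never trigger a step down, which is exactly what the estimator needs for milestone data.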

18.
Asymptotic and explicit formulae are obtained for the n-step stochastic kernel of a special class of non-Markovian models with linear transition rule which occur in learning theory, adaptation theory, control theory, and biological research.

19.
The genetic analysis of characters that change as a function of some independent and continuous variable has received increasing attention in the biological and statistical literature. Previous work in this area has focused on the analysis of normally distributed characters that are directly observed. We propose a framework for the development and specification of models for a quantitative genetic analysis of function-valued characters that are not directly observed, such as genetic variation in age-specific mortality rates or complex threshold characters. We employ a hybrid Markov chain Monte Carlo algorithm involving a Monte Carlo EM algorithm coupled with a Markov chain approximation to the likelihood, which is quite robust and provides accurate estimates of the parameters in our models. The methods are investigated using simulated data and are applied to a large data set measuring mortality rates in the fruit fly, Drosophila melanogaster.

20.
Capture–recapture techniques provide valuable information, but are often more cost‐prohibitive at large spatial and temporal scales than less‐intensive sampling techniques. Model development combining multiple data sources to leverage data source strengths and for improved parameter precision has increased, but with limited discussion on precision gain versus effort. We present a general framework for evaluating trade‐offs between precision gained and costs associated with acquiring multiple data sources, useful for designing future or new phases of current studies.

We illustrated how Bayesian hierarchical joint models using detection/non‐detection and banding data can improve abundance, survival, and recruitment inference, and quantified data source costs in a northern Arizona, USA, western bluebird (Sialia mexicana) population. We used an 8‐year detection/non‐detection (distributed across the landscape) and banding (subset of locations within landscape) data set to estimate parameters. We constructed separate models using detection/non‐detection and banding data, and a joint model using both data types to evaluate parameter precision gain relative to effort.

Joint model parameter estimates were more precise than single data model estimates, but parameter precision varied (apparent survival > abundance > recruitment). Banding provided greater apparent survival precision than detection/non‐detection data. Therefore, little precision was gained when detection/non‐detection data were added to banding data. Additional costs were minimal; however, additional spatial coverage and ability to estimate abundance and recruitment improved inference. Conversely, more precision was gained when adding banding to detection/non‐detection data at higher cost. Spatial coverage was identical, yet survival and abundance estimates were more precise.

Justification of increased costs associated with additional data types depends on project objectives. We illustrate a general framework for evaluating precision gain relative to effort, applicable to joint data models with any data type combination. This framework evaluates the costs, benefits, and effort levels of multiple data types, thus improving population monitoring designs.
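Once standard errors and data-collection costs are in hand, the precision-versus-cost comparison in entry 20 reduces to simple arithmetic. A sketch with entirely hypothetical numbers (none of the paper's actual estimates or costs are reproduced here), treating precision as the inverse squared standard error:

```python
# Hypothetical standard errors for an apparent-survival estimate from each
# single-source model and from the joint model, with assumed costs in dollars.
se_banding, cost_banding = 0.030, 10000.0   # banding data alone
se_detect,  cost_detect  = 0.080,  2000.0   # detection/non-detection alone
se_joint,   cost_joint   = 0.028, 12000.0   # joint model using both sources

def precision(se):
    # Precision as inverse variance of the estimate.
    return 1.0 / se ** 2

# Precision gained per additional dollar when adding the second data source
# to an existing banding study (the low-gain direction in the text)...
gain_add_detect = (precision(se_joint) - precision(se_banding)) / (cost_joint - cost_banding)

# ...versus adding banding to an existing detection/non-detection study
# (the high-gain, high-cost direction).
gain_add_banding = (precision(se_joint) - precision(se_detect)) / (cost_joint - cost_detect)
```

Comparing these per-dollar gains across candidate designs is the kind of trade-off the proposed framework formalizes.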


Copyright © Beijing Qinyun Science and Technology Development Co., Ltd.  京ICP备09084417号