期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Latent transition regression for mixed outcomes

Miglioretti DL 《Biometrics》2003,59(3):710-720

Health status is a complex outcome, often characterized by multiple measures. When assessing changes in health status over time, multiple measures are typically collected longitudinally. Analytic challenges posed by these multivariate longitudinal data are further complicated when the outcomes are combinations of continuous, categorical, and count data. To address these challenges, we propose a fully Bayesian latent transition regression approach for jointly analyzing a mixture of longitudinal outcomes from any distribution. Health status is assumed to be a categorical latent variable, and the multiple outcomes are treated as surrogate measures of the latent health state, observed with error. Using this approach, both baseline latent health state prevalences and the probabilities of transitioning between the health states over time are modeled as functions of covariates. The observed outcomes are related to the latent health states through regression models that include subject-specific effects to account for residual correlation among repeated measures over time, and covariate effects to account for differential measurement of the latent health states. We illustrate our approach with data from a longitudinal study of back pain. 相似文献

2.

A comparison of the multilevel MIMIC model to the multilevel regression and mixed ANOVA model for the estimation and testing of a cross-level interaction effect: A simulation study

Rob Kessels Mirjam Moerbeek 《Biometrical journal. Biometrische Zeitschrift》2023,65(5):2200112

When observing data on a patient-reported outcome measure in, for example, clinical trials, the variables observed are often correlated and intended to measure a latent variable. In addition, such data are also often characterized by a hierarchical structure, meaning that the outcome is repeatedly measured within patients. To analyze such data, it is important to use an appropriate statistical model, such as structural equation modeling (SEM). However, researchers may rely on simpler statistical models that are applied to an aggregated data structure. For example, correlated variables are combined into one sum score that approximates a latent variable. This may have implications when, for example, the sum score consists of indicators that relate differently to the latent variable being measured. This study compares three models that can be applied to analyze such data: the multilevel multiple indicators multiple causes (ML-MIMIC) model, a univariate multilevel model, and a mixed analysis of variance (ANOVA) model. The focus is on the estimation of a cross-level interaction effect that presents the difference over time on the patient-reported outcome between two treatment groups. The ML-MIMIC model is an SEM-type model that considers the relationship between the indicators and the latent variable in a multilevel setting, whereas the univariate multilevel and mixed ANOVA model rely on sum scores to approximate the latent variable. In addition, the mixed ANOVA model uses aggregated second-level means as outcome. This study showed that the ML-MIMIC model produced unbiased cross-level interaction effect estimates when the relationships between the indicators and the latent variable being measured varied across indicators. In contrast, under similar conditions, the univariate multilevel and mixed ANOVA model underestimated the cross-level interaction effect. 相似文献

3.

A semiparametric transition model with latent traits for longitudinal multistate data

Lin H Guo Z Peduzzi PN Gill TM Allore HG 《Biometrics》2008,64(4):1032-1042

SUMMARY: We propose a general multistate transition model. The model is developed for the analysis of repeated episodes of multiple states representing different health status. Transitions among multiple states are modeled jointly using multivariate latent traits with factor loadings. Different types of state transition are described by flexible transition-specific nonparametric baseline intensities. A state-specific latent trait is used to capture individual tendency of the sojourn in the state that cannot be explained by covariates and to account for correlation among repeated sojourns in the same state within an individual. Correlation among sojourns across different states within an individual is accounted for by the correlation between the different latent traits. The factor loadings for a latent trait accommodate the dependence of the transitions to different competing states from a same state. We obtain the semiparametric maximum likelihood estimates through an expectation-maximization (EM) algorithm. The method is illustrated by studying repeated transitions between independence and disability states of activities of daily living (ADL) with death as an absorbing state in a longitudinal aging study. The performance of the estimation procedure is assessed by simulation studies. 相似文献

4.

Joint analysis of time-to-event and multiple binary indicators of latent classes

Larsen K 《Biometrics》2004,60(1):85-92

Multiple categorical variables are commonly used in medical and epidemiological research to measure specific aspects of human health and functioning. To analyze such data, models have been developed considering these categorical variables as imperfect indicators of an individual's "true" status of health or functioning. In this article, the latent class regression model is used to model the relationship between covariates, a latent class variable (the unobserved status of health or functioning), and the observed indicators (e.g., variables from a questionnaire). The Cox model is extended to encompass a latent class variable as predictor of time-to-event, while using information about latent class membership available from multiple categorical indicators. The expectation-maximization (EM) algorithm is employed to obtain maximum likelihood estimates, and standard errors are calculated based on the profile likelihood, treating the nonparametric baseline hazard as a nuisance parameter. A sampling-based method for model checking is proposed. It allows for graphical investigation of the assumption of proportional hazards across latent classes. It may also be used for checking other model assumptions, such as no additional effect of the observed indicators given latent class. The usefulness of the model framework and the proposed techniques are illustrated in an analysis of data from the Women's Health and Aging Study concerning the effect of severe mobility disability on time-to-death for elderly women. 相似文献

5.

A class of latent Markov models for capture-recapture data allowing for time, heterogeneity, and behavior effects 总被引：1，自引：0，他引：1

Bartolucci F Pennoni F 《Biometrics》2007,63(2):568-578

We propose an extension of the latent class model for the analysis of capture-recapture data which allows us to take into account the effect of a capture on the behavior of a subject with respect to future captures. The approach is based on the assumption that the variable indexing the latent class of a subject follows a Markov chain with transition probabilities depending on the previous capture history. Several constraints are allowed on these transition probabilities and on the parameters of the conditional distribution of the capture configuration given the latent process. We also allow for the presence of discrete explanatory variables, which may affect the parameters of the latent process. To estimate the resulting models, we rely on the conditional maximum likelihood approach and for this aim we outline an EM algorithm. We also give some simple rules for point and interval estimation of the population size. The approach is illustrated by applying it to two data sets concerning small mammal populations. 相似文献

6.

Latent class regression on latent factors 总被引：1，自引：0，他引：1

Guo J Wall M Amemiya Y 《Biostatistics (Oxford, England)》2006,7(1):145-163

In the research of public health, psychology, and social sciences, many research questions investigate the relationship between a categorical outcome variable and continuous predictor variables. The focus of this paper is to develop a model to build this relationship when both the categorical outcome and the predictor variables are latent (i.e. not observable directly). This model extends the latent class regression model so that it can include regression on latent predictors. Maximum likelihood estimation is used and two numerical methods for performing it are described: the Monte Carlo expectation and maximization algorithm and Gaussian quadrature followed by quasi-Newton algorithm. A simulation study is carried out to examine the behavior of the model under different scenarios. A data example involving adolescent health is used for demonstration where the latent classes of eating disorders risk are predicted by the latent factor body satisfaction. 相似文献

7.

The Measurement of Dynamic Latent Variables in Longitudinal Aging Research: Quantifying Adult Development

Linda M. Collins 《Gerodontology》1990,9(4):127-134

Dynamic latent variables involve systematic intraindividual change over time. Although it seems natural to apply traditional measurement theory to dynamic latent variables, in fact this is often inappropriate. Traditional measurement theory is based on the idea of static latent variables and offers little guidance to the researcher who wishes to measure a dynamic latent variable with a high degree of accuracy and validity. It is the contention of this article that measurement of a dynamic latent variable must start from a clearly defined substantive theory about human development. Two approaches that take this perspective are presented; the longitudinal Guttman simplex (LGS), a measurement model for dynamic latent variables undergoing irreversible cumulative, unitary development; and latent transition analysis (LTA), a more general latent class measurement model. 相似文献

8.

A joint latent variable model approach to item reduction and validation

Halberstadt SM Schmitz KH Sammel MD 《Biostatistics (Oxford, England)》2012,13(1):48-60

Many applications of biomedical science involve unobservable constructs, from measurement of health states to severity of complex diseases. The primary aim of measurement is to identify relevant pieces of observable information that thoroughly describe the construct of interest. Validation of the construct is often performed separately. Noting the increasing popularity of latent variable methods in biomedical research, we propose a Multiple Indicator Multiple Cause (MIMIC) latent variable model that combines item reduction and validation. Our joint latent variable model accounts for the bias that occurs in the traditional 2-stage process. The methods are motivated by an example from the Physical Activity and Lymphedema clinical trial in which the objectives were to describe lymphedema severity through self-reported Likert scale symptoms and to determine the relationship between symptom severity and a "gold standard" diagnostic measure of lymphedema. The MIMIC model identified 1 symptom as a potential candidate for removal. We present this paper as an illustration of the advantages of joint latent variable models and as an example of the applicability of these models for biomedical research. 相似文献

9.

A composite likelihood approach to latent multivariate Gaussian modeling of SNP data with application to genetic association testing

Han F Pan W 《Biometrics》2012,68(1):307-315

Many statistical tests have been proposed for case-control data to detect disease association with multiple single nucleotide polymorphisms (SNPs) in linkage disequilibrium. The main reason for the existence of so many tests is that each test aims to detect one or two aspects of many possible distributional differences between cases and controls, largely due to the lack of a general and yet simple model for discrete genotype data. Here we propose a latent variable model to represent SNP data: the observed SNP data are assumed to be obtained by discretizing a latent multivariate Gaussian variate. Because the latent variate is multivariate Gaussian, its distribution is completely characterized by its mean vector and covariance matrix, in contrast to much more complex forms of a general distribution for discrete multivariate SNP data. We propose a composite likelihood approach for parameter estimation. A direct application of this latent variable model is to association testing with multiple SNPs in a candidate gene or region. In contrast to many existing tests that aim to detect only one or two aspects of many possible distributional differences of discrete SNP data, we can exclusively focus on testing the mean and covariance parameters of the latent Gaussian distributions for cases and controls. Our simulation results demonstrate potential power gains of the proposed approach over some existing methods. 相似文献

10.

Bayesian modeling of embryonic growth using latent variables

Slaughter JC Herring AH Hartmann KE 《Biostatistics (Oxford, England)》2008,9(2):373-389

In a growth model, individuals move progressively through a series of states in which each state is indicative of developmental status. Interest lies in estimating the rate of progression through each state while incorporating covariates that might affect the transition rates. We develop a Bayesian discrete-time multistate growth model for inference from cross-sectional data with unknown initiation times. For each subject, data are collected at only one time point at which we observe the state as well as covariates that measure developmental progress. We link the developmental progress variables to an underlying latent growth variable that can also affect the state transition rates. A subject with slow latent growth will then have relatively small developmental progress covariates and move through state transitions slowly. We then examine the association between latent growth and the probability of future events in a novel study of embryonic development and pregnancy loss. Using a Markov chain Monte Carlo (MCMC) algorithm for posterior computation, we found evidence in favor of a previously hypothesized but unproven association between slow growth early in pregnancy and increased risk of future spontaneous abortion. 相似文献

11.

Latent variable models for longitudinal data with multiple continuous outcomes

Roy J Lin X 《Biometrics》2000,56(4):1047-1054

Multiple outcomes are often used to properly characterize an effect of interest. This paper proposes a latent variable model for the situation where repeated measures over time are obtained on each outcome. These outcomes are assumed to measure an underlying quantity of main interest from different perspectives. We relate the observed outcomes using regression models to a latent variable, which is then modeled as a function of covariates by a separate regression model. Random effects are used to model the correlation due to repeated measures of the observed outcomes and the latent variable. An EM algorithm is developed to obtain maximum likelihood estimates of model parameters. Unit-specific predictions of the latent variables are also calculated. This method is illustrated using data from a national panel study on changes in methadone treatment practices. 相似文献

12.

A mover-stayer model for longitudinal marker data

Albert PS 《Biometrics》1999,55(4):1252-1257

Studies of chronic disease often focus on estimating prevalence and incidence in which the presence of active disease is based on dichotomizing a continuous marker variable measured with error. Examples include hypertension, asthma, and depression, where active disease is defined by setting a threshold on a continuous measure of blood pressure, respiratory function, and mood, respectively. This paper proposes a model for inference about prevalence and incidence when active disease is determined by dichotomizing a continuous marker variable in a population-based study. In this formulation, it is postulated that there are three groups of people, those that are not susceptible to the disease, those who are always in the disease state, and those who have the potential to transition between the disease and the disease-free states over time. The model is used to estimate the prevalence and incidence of the disease in the population while accounting for measurement error in the marker. An EM algorithm is used for parameter estimation and the methodology is illustrated on Framingham heart study hypertension data. A simulation study is conducted in order to demonstrate the importance of accounting for measurement error in estimating prevalence and incidence for this example. 相似文献

13.

Bayesian inference for stochastic kinetic models using a diffusion approximation

Golightly A Wilkinson DJ 《Biometrics》2005,61(3):781-788

This article is concerned with the Bayesian estimation of stochastic rate constants in the context of dynamic models of intracellular processes. The underlying discrete stochastic kinetic model is replaced by a diffusion approximation (or stochastic differential equation approach) where a white noise term models stochastic behavior and the model is identified using equispaced time course data. The estimation framework involves the introduction of m- 1 latent data points between every pair of observations. MCMC methods are then used to sample the posterior distribution of the latent process and the model parameters. The methodology is applied to the estimation of parameters in a prokaryotic autoregulatory gene network. 相似文献

14.

Structural time series models with feedback mechanisms

Guo W Brown MB 《Biometrics》2000,56(3):686-691

Structural time series models have applications in many different fields such as biology, economics, and meteorology. A structural times series model can be represented as a state-space model where the states of the system represent the unobserved components and the structural parameters have clear interpretations. This paper introduces a class of structural time series models that incorporate feedback from the latent components of the history. An iterative procedure is proposed for estimation. These models allow flexible and robust feedback mechanisms, have clear interpretations, and have a computationally efficient estimation procedure. They are applied to hormone data to characterize hormone secretion and to explore a potential feedback mechanism. 相似文献

15.

Modeling longitudinal data with nonignorable dropouts using a latent dropout class model

Roy J 《Biometrics》2003,59(4):829-836

In longitudinal studies with dropout, pattern-mixture models form an attractive modeling framework to account for nonignorable missing data. However, pattern-mixture models assume that the components of the mixture distribution are entirely determined by the dropout times. That is, two subjects with the same dropout time have the same distribution for their response with probability one. As that is unlikely to be the case, this assumption made lead to classification error. In addition, if there are certain dropout patterns with very few subjects, which often occurs when the number of observation times is relatively large, pattern-specific parameters may be weakly identified or require identifying restrictions. We propose an alternative approach, which is a latent-class model. The dropout time is assumed to be related to the unobserved (latent) class membership, where the number of classes is less than the number of observed patterns; a regression model for the response is specified conditional on the latent variable. This is a type of shared-parameter model, where the shared "parameter" is discrete. Parameter estimates are obtained using the method of maximum likelihood. Averaging the estimates of the conditional parameters over the distribution of the latent variable yields estimates of the marginal regression parameters. The methodology is illustrated using longitudinal data on depression from a study of HIV in women. 相似文献

16.

Joint analysis of longitudinal data with informative right censoring

Liu M Ying Z 《Biometrics》2007,63(2):363-371

Longitudinal data arise when subjects are followed over a period of time. A commonly encountered complication in the analysis of such data is the variable length of follow-up due to right censorship. This can be further exacerbated by the possible dependency between the censoring time and the longitudinal measurements. This article proposes a combination of a semiparametric transformation model for the censoring time and a linear mixed effects model for the longitudinal measurements. The dependency is handled via latent variables which are naturally incorporated. We show that the likelihood function has an explicit form and develops a two-stage estimation procedure to avoid direct maximization over a high-dimensional parameter space. The resulting estimators are shown to be consistent and asymptotically normal, with a closed form for the variance-covariance matrix that can be used to obtain a plug-in estimator. Finite sample performance of the proposed approach is assessed through extensive simulations. The method is applied to renal disease data. 相似文献

17.

Causal mediation analysis with a latent mediator

下载免费PDF全文

Jeffrey M. Albert Cuiyu Geng Suchitra Nelson 《Biometrical journal. Biometrische Zeitschrift》2016,58(3):535-548

Health researchers are often interested in assessing the direct effect of a treatment or exposure on an outcome variable, as well as its indirect (or mediation) effect through an intermediate variable (or mediator). For an outcome following a nonlinear model, the mediation formula may be used to estimate causally interpretable mediation effects. This method, like others, assumes that the mediator is observed. However, as is common in structural equations modeling, we may wish to consider a latent (unobserved) mediator. We follow a potential outcomes framework and assume a generalized structural equations model (GSEM). We provide maximum‐likelihood estimation of GSEM parameters using an approximate Monte Carlo EM algorithm, coupled with a mediation formula approach to estimate natural direct and indirect effects. The method relies on an untestable sequential ignorability assumption; we assess robustness to this assumption by adapting a recently proposed method for sensitivity analysis. Simulation studies show good properties of the proposed estimators in plausible scenarios. Our method is applied to a study of the effect of mother education on occurrence of adolescent dental caries, in which we examine possible mediation through latent oral health behavior. 相似文献

18.

Bayesian modeling of multiple episode occurrence and severity with a terminating event

Herring AH Yang J 《Biometrics》2007,63(2):381-388

An individual's health condition can affect the frequency and intensity of episodes that can occur repeatedly and that may be related to an event time of interest. For example, bleeding episodes during pregnancy may indicate problems predictive of preterm delivery. Motivated by this application, we propose a joint model for a multiple episode process and an event time. The frequency of occurrence and severity of the episodes are characterized by a latent variable model, which allows an individual's episode intensity to change dynamically over time. This latent episode intensity is then incorporated as a predictor in a discrete time model for the terminating event. Time-varying coefficients are used to distinguish among effects earlier versus later in gestation. Formulating the model within a Bayesian framework, prior distributions are chosen so that conditional posterior distributions are conjugate after data augmentation. Posterior computation proceeds via an efficient Gibbs sampling algorithm. The methods are illustrated using bleeding episode and gestational length data from a pregnancy study. 相似文献

19.

Stochastic search variable selection for identifying multiple quantitative trait loci 总被引：9，自引：0，他引：9

Yi N George V Allison DB 《Genetics》2003,164(3):1129-1138

In this article, we utilize stochastic search variable selection methodology to develop a Bayesian method for identifying multiple quantitative trait loci (QTL) for complex traits in experimental designs. The proposed procedure entails embedding multiple regression in a hierarchical normal mixture model, where latent indicators for all markers are used to identify the multiple markers. The markers with significant effects can be identified as those with higher posterior probability included in the model. A simple and easy-to-use Gibbs sampler is employed to generate samples from the joint posterior distribution of all unknowns including the latent indicators, genetic effects for all markers, and other model parameters. The proposed method was evaluated using simulated data and illustrated using a real data set. The results demonstrate that the proposed method works well under typical situations of most QTL studies in terms of number of markers and marker density. 相似文献

20.

A multilevel latent Markov model for the evaluation of nursing homes' performance

下载免费PDF全文

Giorgio E. Montanari Marco Doretti Francesco Bartolucci 《Biometrical journal. Biometrische Zeitschrift》2018,60(5):962-978

The periodic evaluation of health care services is a primary concern for many institutions. We consider services provided by nursing homes with the aim of ranking a set of these structures with respect to their effect on resident health status. Since the overall health status is not directly observable, and given the longitudinal and multilevel structure of the available data, we rely on latent variable models and, in particular, on a multilevel latent Markov model where residents and nursing homes are the first and the second level units, respectively. The model includes individual covariates to account for resident characteristics. The impact of nursing home membership is modelled through a pair of random effects affecting the initial distribution and the transition probabilities between different levels of health status. Through the prediction of these random effects we obtain a ranking of the nursing homes. Furthermore, the proposed model accounts for nonignorable dropout due to resident death, which typically occurs in these contexts. The motivating dataset is gathered from the Long Term Care Facilities programme, a health care protocol implemented in Umbria (Italy). Our results show that differences in performance between nursing homes are statistically significant. 相似文献