首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The information in aggregate data from Markov chains   总被引:1,自引:0,他引:1  
LAWLESS  J. F.; McLEISH  D. L. 《Biometrika》1984,71(3):419-430
  相似文献   

2.
3.
Estimation in linear models with censored data   总被引:1,自引:0,他引:1  
  相似文献   

4.
The integro-differential growth model of Eakman, Fredriekson, and Tsuehiya has been employed to fit cell size distribution data for Schizosaccharomyces pombe grown in a chemostat under severe product inhibition by ethanol. The distributions were obtained with a Coulter aperture and an electronic system patterned after that of Harvey and Marr. Four parameters—mean cell division size, cell division size standard deviation, daughter cell size standard deviation, and a growth rate coefficient—were calculated for models where the cell growth rate was inversely proportional to size, constant, and proportional to size. A fourth model, one where sigmoidal growth behavior was simulated by two linear growth segments, was also investigated. Linear and sigmoidal models fit the distribution data best. While the mean cell division size remained relatively constant at all growth rates, standard deviation of division size distribution increased with increasing holding times. Standard deviation of the daughter size distribution remained small at all dilution rates. Unlike previous findings with other organisms, the average cell size of Schizosaccharomyces pobme increased at low growth rates.  相似文献   

5.
HMMGEP: clustering gene expression data using hidden Markov models   总被引:3,自引:0,他引:3  
SUMMARY: The package HMMGEP performs cluster analysis on gene expression data using hidden Markov models. AVAILABILITY: HMMGEP, including the source code, documentation and sample data files, is available at http://www.bioinfo.tsinghua.edu.cn:8080/~rich/hmmgep_download/index.html.  相似文献   

6.
Surveillance data for communicable nosocomial pathogens usually consist of short time series of low-numbered counts of infected patients. These often show overdispersion and autocorrelation. To date, almost all analyses of such data have ignored the communicable nature of the organisms and have used methods appropriate only for independent outcomes. Inferences that depend on such analyses cannot be considered reliable when patient-to-patient transmission is important. We propose a new method for analysing these data based on a mechanistic model of the epidemic process. Since important nosocomial pathogens are often carried asymptomatically with overt infection developing in only a proportion of patients, the epidemic process is usually only partially observed by routine surveillance data. We therefore develop a 'structured' hidden Markov model where the underlying Markov chain is generated by a simple transmission model. We apply both structured and standard (unstructured) hidden Markov models to time series for three important pathogens. We find that both methods can offer marked improvements over currently used approaches when nosocomial spread is important. Compared to the standard hidden Markov model, the new approach is more parsimonious, is more biologically plausible, and allows key epidemiological parameters to be estimated.  相似文献   

7.
8.
MOTIVATION: Cellular processes cause changes over time. Observing and measuring those changes over time allows insights into the how and why of regulation. The experimental platform for doing the appropriate large-scale experiments to obtain time-courses of expression levels is provided by microarray technology. However, the proper way of analyzing the resulting time course data is still very much an issue under investigation. The inherent time dependencies in the data suggest that clustering techniques which reflect those dependencies yield improved performance. RESULTS: We propose to use Hidden Markov Models (HMMs) to account for the horizontal dependencies along the time axis in time course data and to cope with the prevalent errors and missing values. The HMMs are used within a model-based clustering framework. We are given a number of clusters, each represented by one Hidden Markov Model from a finite collection encompassing typical qualitative behavior. Then, our method finds in an iterative procedure cluster models and an assignment of data points to these models that maximizes the joint likelihood of clustering and models. Partially supervised learning--adding groups of labeled data to the initial collection of clusters--is supported. A graphical user interface allows querying an expression profile dataset for time course similar to a prototype graphically defined as a sequence of levels and durations. We also propose a heuristic approach to automate determination of the number of clusters. We evaluate the method on published yeast cell cycle and fibroblasts serum response datasets, and compare them, with favorable results, to the autoregressive curves method.  相似文献   

9.
Longitudinal ordinal data are common in many scientific studies, including those of multiple sclerosis (MS), and are frequently modeled using Markov dependency. Several authors have proposed random-effects Markov models to account for heterogeneity in the population. In this paper, we go one step further and study prediction based on random-effects Markov models. In particular, we show how to calculate the probabilities of future events and confidence intervals for those probabilities, given observed data on the ordinal outcome and a set of covariates, and how to update them over time. We discuss the usefulness of depicting these probabilities for visualization and interpretation of model results and illustrate our method using data from a phase III clinical trial that evaluated the utility of interferon beta-1a (trademark Avonex) to MS patients of type relapsing-remitting.  相似文献   

10.
In many observational studies, individuals are measured repeatedly over time, although not necessarily at a set of pre-specified occasions. Instead, individuals may be measured at irregular intervals, with those having a history of poorer health outcomes being measured with somewhat greater frequency and regularity. In this paper, we consider likelihood-based estimation of the regression parameters in marginal models for longitudinal binary data when the follow-up times are not fixed by design, but can depend on previous outcomes. In particular, we consider assumptions regarding the follow-up time process that result in the likelihood function separating into two components: one for the follow-up time process, the other for the outcome measurement process. The practical implication of this separation is that the follow-up time process can be ignored when making likelihood-based inferences about the marginal regression model parameters. That is, maximum likelihood (ML) estimation of the regression parameters relating the probability of success at a given time to covariates does not require that a model for the distribution of follow-up times be specified. However, to obtain consistent parameter estimates, the multinomial distribution for the vector of repeated binary outcomes must be correctly specified. In general, ML estimation requires specification of all higher-order moments and the likelihood for a marginal model can be intractable except in cases where the number of repeated measurements is relatively small. To circumvent these difficulties, we propose a pseudolikelihood for estimation of the marginal model parameters. The pseudolikelihood uses a linear approximation for the conditional distribution of the response at any occasion, given the history of previous responses. The appeal of this approximation is that the conditional distributions are functions of the first two moments of the binary responses only. When the follow-up times depend only on the previous outcome, the pseudolikelihood requires correct specification of the conditional distribution of the current outcome given the outcome at the previous occasion only. Results from a simulation study and a study of asymptotic bias are presented. Finally, we illustrate the main results using data from a longitudinal observational study that explored the cardiotoxic effects of doxorubicin chemotherapy for the treatment of acute lymphoblastic leukemia in children.  相似文献   

11.
Hidden Markov models (HMM) are introduced for the offline classification of single-trail EEG data in a brain-computer-interface (BCI). The HMMs are used to classify Hjorth parameters calculated from bipolar EEG data, recorded during the imagination of a left or right hand movement. The effects of different types of HMMs on the recognition rate are discussed. Furthermore a comparison of the results achieved with the linear discriminant (LD) and the HMM, is presented.  相似文献   

12.
Ji X  Li-Ling J  Sun Z 《FEBS letters》2003,542(1-3):125-131
In this work we have developed a new framework for microarray gene expression data analysis. This framework is based on hidden Markov models. We have benchmarked the performance of this probability model-based clustering algorithm on several gene expression datasets for which external evaluation criteria were available. The results showed that this approach could produce clusters of quality comparable to two prevalent clustering algorithms, but with the major advantage of determining the number of clusters. We have also applied this algorithm to analyze published data of yeast cell cycle gene expression and found it able to successfully dig out biologically meaningful gene groups. In addition, this algorithm can also find correlation between different functional groups and distinguish between function genes and regulation genes, which is helpful to construct a network describing particular biological associations. Currently, this method is limited to time series data. Supplementary materials are available at http://www.bioinfo.tsinghua.edu.cn/~rich/hmmgep_supp/.  相似文献   

13.
We examine bias in Markov models of diseases, including both chronic and infectious diseases. We consider two common types of Markov disease models: ones where disease progression changes by severity of disease, and ones where progression of disease changes in time or by age. We find sufficient conditions for bias to exist in models with aggregated transition probabilities when compared to models with state/time dependent transition probabilities. We also find that when aggregating data to compute transition probabilities, bias increases with the degree of data aggregation. We illustrate by examining bias in Markov models of Hepatitis C, Alzheimer’s disease, and lung cancer using medical data and find that the bias is significant depending on the method used to aggregate the data. A key implication is that by not incorporating state/time dependent transition probabilities, studies that use Markov models of diseases may be significantly overestimating or underestimating disease progression. This could potentially result in incorrect recommendations from cost-effectiveness studies and incorrect disease burden forecasts.  相似文献   

14.
Hawkins DL  Han CP 《Biometrics》2000,56(3):848-854
Longitudinal studies often collect only aggregate data, which allows only inefficient transition probability estimates. Barring enormous aggregate samples, improving the efficiency of transition probability estimates seems to be impossible without additional partial-transition data. This paper discusses several sampling plans that collect data of both types, as well as a methodology that combines them into efficient estimates of transition probabilities. The method handles both fixed and time-dependent categorical covariates and requires no assumptions (e.g., time homogeneity, Markov) about the population evolution.  相似文献   

15.
Estimation in parameter-redundant models   总被引:3,自引:0,他引:3  
  相似文献   

16.
A simple method for estimating the gene frequency p and the penetrance value K from data on polymorphic monogenic characteristics on monozygotic twin pairs is presented. In spite of the method here presented having limited value because the results it yields cannot be evaluated on their own, the estimates of p and K it provides can be indirectly tested by comparing them to the ones obtained in familial aggregates through classical segregation analysis or by using the latter to calculate the expected proportions of dominant-dominant, dominant-recessive and recessive-recessive monozygotic twin pairs. When the method is applied to data on tongue-rolling ability published in the literature, a good agreement is observed between twin and familial estimates, thus indicating that the method is reliable and that it can be used as an ancillary way of corroborating or otherwise evidence of monogenic autosomal dominant mechanism inferred from the analysis of familial data.  相似文献   

17.
Estimation of spermarche from longitudinal spermaturia data   总被引:2,自引:0,他引:2  
A number of studies have dealt with determination of the age at onset of sperm emission (spermarche), based on observations of first occurrence of spermatozoa in urine. A major problem in this connection is the intermittent occurrence of sperm-negative urine samples after the achievement of spermarche. We have here considered an empirical Bayes approach for handling the probability of a sperm-positive urine sample after spermarche. The investigation was inspired by a concrete longitudinal study concerning 40 Scottish boys, which is used for illustration throughout. In this study, the urine was tested for spermatozoa from well before spermarche and every 3 months thereafter for at most 7 years. Some of the boys left the study earlier and techniques for handling such censoring were also developed.  相似文献   

18.
19.
Estimation of heritability from varietal trials data   总被引:2,自引:0,他引:2  
We present the estimation of heritabilities of an observed trait in situations where evaluation of several pure breeding lines is performed in a trial at a single location and in trials from several locations. For the single location situation, we evaluate exact confidence intervals, the probability of invalid estimates, and the percentage points of the distribution of heritability. Simulations were performed to numerically verify the results. Additionally, approximations to the bias and standard error of the estimate were obtained and are presented along with their simulated values and coefficients of skewness and kurtosis. For trials in several locations, explicit expressions for exact values of confidence limits are not available. Further, one would require knowledge of one more parameter, represented by the ratio of genotype x environment (G x E) interaction variance to error variance, in addition to the number of genotypes, replication and true heritability value. Approximations were made for bias and the standard error of estimates of heritability. The evaluation of the distribution of heritability and its moments was recognized as a problem of the linear function of an independent chi-square. The methods have been illustrated by data from experiments on grain and straw yield of 64 barley genotypes evaluated at three locations.  相似文献   

20.
A Gottschau 《Biometrics》1992,48(3):751-763
Time-homogeneous Markov chain models with state space [0, 1]k are useful in analysis of binary follow-up data on k individuals that interact. The number of parameters increases exponentially with k so more restrictive models are imperative for statistical inference. The hypothesis that the matrix of transition probabilities is invariant under permutation of individuals is discussed. It is shown that if individuals are exchangeable, then the process counting the number of individuals occupying a given state is a Markov chain. This reduction of data is sufficient if either at most a single individual may change state between two consecutive time points or if a state is absorbing. Similar results are obtained for exchangeability within two subgroups. Inference in the multivariate process reduces to a univariate problem if individuals are independent given the group's previous response. It is shown how conditional independence could be tested assuming exchangeability. The different hypotheses re examined in an analysis of the occurrence of bacteria in milk samples of Danish dairy cattle.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号