
1.

Parameter estimation and goodness-of-fit in log binomial regression

Blizzard L Hosmer DW 《Biometrical journal. Biometrische Zeitschrift》2006,48(1):5-22

An estimate of the risk, adjusted for confounders, can be obtained from a fitted logistic regression model, but it substantially over-estimates when the outcome is not rare. The log binomial model, binomial errors and log link, is increasingly being used for this purpose. However, this model's performance, goodness of fit tests and case-wise diagnostics have not been studied. Extensive simulations are used to compare the performance of the log binomial, a logistic regression based method proposed by Schouten et al. (1993) and a Poisson regression approach proposed by Zou (2004) and Carter, Lipsitz, and Tilley (2005). Log binomial regression resulted in "failure" rates (non-convergence, out-of-bounds predicted probabilities) as high as 59%. Estimates by the method of Schouten et al. (1993) produced fitted log binomial probabilities greater than unity in up to 19% of samples to which a log binomial model had been successfully fit and in up to 78% of samples when the log binomial model fit failed. Similar percentages were observed for the Poisson regression approach. Coefficient and standard error estimates from the three models were similar. Rejection rates for goodness of fit tests for log binomial fit were around 5%. Power of goodness of fit tests was modest when an incorrect logistic regression model was fit. Examples demonstrate the use of the methods. Uncritical use of the log binomial regression model is not recommended.
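The over-estimation mentioned at the start of this abstract can be illustrated with plain arithmetic. The sketch below is not taken from the paper: it assumes the familiar reading that the odds ratio produced by logistic regression is being used as a stand-in for the risk ratio, and the risks and function names are hypothetical values chosen for illustration.

```python
def risk_ratio(p_exposed, p_unexposed):
    """Ratio of the outcome risk in the exposed group to that in the unexposed group."""
    return p_exposed / p_unexposed


def odds_ratio(p_exposed, p_unexposed):
    """Ratio of the outcome odds p / (1 - p) in the two groups."""
    odds_exposed = p_exposed / (1.0 - p_exposed)
    odds_unexposed = p_unexposed / (1.0 - p_unexposed)
    return odds_exposed / odds_unexposed


# Hypothetical risks (illustrative only): same risk ratio, different outcome frequency.
scenarios = {"rare outcome": (0.02, 0.01), "common outcome": (0.60, 0.30)}
for label, (p1, p0) in scenarios.items():
    print(label, "RR =", round(risk_ratio(p1, p0), 3), "OR =", round(odds_ratio(p1, p0), 3))
```

Under these assumed risks the two measures nearly agree when the outcome is rare and diverge when it is common, which is the motivation for models that target the risk ratio directly.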

2.

Analysis of Longitudinal Data of Epileptic Seizure Counts – A Two‐State Hidden Markov Regression Approach

Peiming Wang Martin L. Puterman 《Biometrical journal. Biometrische Zeitschrift》2001,43(8):941-962

This paper discusses a two‐state hidden Markov Poisson regression (MPR) model for analyzing longitudinal data of epileptic seizure counts, which allows for the rate of the Poisson process to depend on covariates through an exponential link function and to change according to the states of a two‐state Markov chain with its transition probabilities associated with covariates through a logit link function. This paper also considers a two‐state hidden Markov negative binomial regression (MNBR) model, as an alternative, by using the negative binomial instead of Poisson distribution in the proposed MPR model when there exists extra‐Poisson variation conditional on the states of the Markov chain. The two proposed models in this paper relax the stationary requirement of the Markov chain, allow for overdispersion relative to the usual Poisson regression model and for correlation between repeated observations. The proposed methodology provides a plausible analysis for the longitudinal data of epileptic seizure counts, and the MNBR model fits the data much better than the MPR model. Maximum likelihood estimation using the EM and quasi‐Newton algorithms is discussed. A Monte Carlo study for the proposed MPR model investigates the reliability of the estimation method, the choice of probabilities for the initial states of the Markov chain, and some finite sample behaviors of the maximum likelihood estimates, suggesting that (1) the estimation method is accurate and reliable as long as the total number of observations is reasonably large, and (2) the choice of probabilities for the initial states of the Markov process has little impact on the parameter estimates.

3.

Calculation of narrower confidence intervals for tree mortality rates when we know nothing but the location of the death/survival events

Gabriel Arellano 《Ecology and evolution》2019,9(17):9644-9653

4.

The log multinomial regression model for nominal outcomes with more than two attributes

Blizzard L Hosmer DW 《Biometrical journal. Biometrische Zeitschrift》2007,49(6):889-902

An estimate of the risk or prevalence ratio, adjusted for confounders, can be obtained from a log binomial model (binomial errors, log link) fitted to binary outcome data. We propose a modification of the log binomial model to obtain relative risk estimates for nominal outcomes with more than two attributes (the "log multinomial model"). Extensive data simulations were undertaken to compare the performance of the log multinomial model with that of an expanded data multinomial logistic regression method based on the approach proposed by Schouten et al. (1993) for binary data, and with that of separate fits of a Poisson regression model based on the approach proposed by Zou (2004) and Carter, Lipsitz and Tilley (2005) for binary data. Log multinomial regression resulted in "inadmissible" solutions (out-of-bounds probabilities) exceeding 50% in some data settings. Coefficient estimates by the alternative methods produced out-of-bounds probabilities for the log multinomial model in up to 27% of samples to which a log multinomial model had been successfully fitted. The log multinomial coefficient estimates generally had lesser relative bias and mean squared error than the alternative methods. The practical utility of the log multinomial regression model was demonstrated with a real data example. The log multinomial model offers a practical solution to the problem of obtaining adjusted estimates of the risk ratio in the multinomial setting, but must be used with some care and attention to detail.

5.

Quasi Likelihood/Moment Method for Generalized and Restricted Generalized Poisson Regression Models and Its Application

İlknur Özmen 《Biometrical journal. Biometrische Zeitschrift》2000,42(3):303-314

This paper reviews the generalized Poisson regression model, the restricted generalized Poisson regression model and the mixed Poisson regression (negative binomial regression and Poisson inverse Gaussian regression) models which can be used for regression analysis of counts. The aim of this study is to demonstrate that the quasi likelihood/moment method, which is used to estimate the parameters of mixed Poisson regression models, is also applicable to estimating the parameters of the generalized Poisson regression and restricted generalized Poisson regression models. An application of this method to zoological data is given at the end of the study.

6.

Validation of statistical models for estimating hospitalization associated with influenza and other respiratory viruses

Yang L Chiu SS Chan KP Chan KH Wong WH Peiris JS Wong CM 《PloS one》2011,6(3):e17882

Background

Reliable estimates of disease burden associated with respiratory viruses are keys to deployment of preventive strategies such as vaccination and resource allocation. Such estimates are particularly needed in tropical and subtropical regions where some methods commonly used in temperate regions are not applicable. While a number of alternative approaches to assess the influenza associated disease burden have been recently reported, none of these models have been validated with virologically confirmed data. Even fewer methods have been developed for other common respiratory viruses such as respiratory syncytial virus (RSV), parainfluenza and adenovirus.

Methods and Findings

We had recently conducted a prospective population-based study of virologically confirmed hospitalization for acute respiratory illnesses in persons <18 years residing in Hong Kong Island. Here we used this dataset to validate two commonly used models for estimation of influenza disease burden, namely the rate difference model and Poisson regression model, and also explored the applicability of these models to estimate the disease burden of other respiratory viruses. The Poisson regression models with different link functions all yielded estimates well correlated with the virologically confirmed influenza associated hospitalization, especially in children older than two years. The disease burden estimates for RSV, parainfluenza and adenovirus were less reliable with wide confidence intervals. The rate difference model was not applicable to RSV, parainfluenza and adenovirus and grossly underestimated the true burden of influenza associated hospitalization.

Conclusion

The Poisson regression model generally produced satisfactory estimates in calculating the disease burden of respiratory viruses in a subtropical region such as Hong Kong.

7.

Unconditional small-sample confidence intervals for the odds ratio

Agresti A Min Y 《Biostatistics (Oxford, England)》2002,3(3):379-386

The traditional approach to 'exact' small-sample interval estimation of the odds ratio for binomial, Poisson, or multinomial samples uses the conditional distribution to eliminate nuisance parameters. This approach can be very conservative. For two independent binomial samples, we study an unconditional approach with overall confidence level guaranteed to equal at least the nominal level. With small samples this interval tends to be shorter and have coverage probabilities nearer the nominal level.

8.

Selecting the right statistical model for analysis of insect count data by using information theoretic measures

Sileshi G 《Bulletin of entomological research》2006,96(5):479-488


9.

Efficient Simulation of Multivariate Binomial and Poisson Distributions

Frank Krummenauer 《Biometrical journal. Biometrische Zeitschrift》1998,40(7):823-832

Power investigations, for example, in statistical procedures for the assessment of agreement among multiple raters often require the simultaneous simulation of several dependent binomial or Poisson distributions to appropriately model the stochastic dependencies between the raters' results. Regarding the rather large dimensions of the random vectors to be generated and the even larger number of interactions to be introduced into the simulation scenarios to determine all necessary information on their distributions' dependence structure, one needs efficient and fast algorithms for the simulation of multivariate Poisson and binomial distributions. Therefore two equivalent models for the multivariate Poisson distribution are combined to obtain an algorithm for the quick implementation of its multivariate dependence structure. Simulation of the multivariate Poisson distribution then becomes feasible by first generating and then convolving independent univariate Poisson variates with appropriate expectations. The latter can be computed via linear recursion formulae. Similar means for simulation are also considered for the binomial setting. In this scenario it turns out, however, that exact computation of the probability function is even easier to perform; therefore corresponding linear recursion formulae for the point probabilities of multivariate binomial distributions are presented, which only require information about the index parameter and the (simultaneous) success probabilities, that is the multivariate dependence structure among the binomial marginals.
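The idea of building dependent Poisson variates by summing independent ones can be sketched in its simplest, bivariate form. This is not the algorithm of the paper: it assumes a single shared ("common shock") component, and the function names and rates are hypothetical.

```python
import math
import random


def poisson_variate(rng, mean):
    """Draw one Poisson variate by Knuth's multiplication method (adequate for small means)."""
    threshold = math.exp(-mean)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def bivariate_poisson(rng, lam1, lam2, lam12):
    """Return (X1, X2) with X1 = Y1 + Y12 and X2 = Y2 + Y12 for independent Poisson Y's.

    The shared term Y12 induces Cov(X1, X2) = lam12 under this assumed construction.
    """
    shared = poisson_variate(rng, lam12)
    return poisson_variate(rng, lam1) + shared, poisson_variate(rng, lam2) + shared


def sample_covariance(pairs):
    """Unbiased sample covariance of a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in pairs) / (n - 1)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
draws = [bivariate_poisson(rng, 2.0, 3.0, 1.0) for _ in range(20000)]
print("sample covariance:", round(sample_covariance(draws), 3))
```

With more than two margins the same construction needs one independent component per interaction term, which is where the efficiency concerns raised in the abstract come in.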

10.

Influences of size and sex on invasive species aggression and native species vulnerability: a case for modern regression techniques

Kit Magellan Emili García-Berthou 《Reviews in Fish Biology and Fisheries》2015,25(3):537-549

Animal behaviour is of fundamental importance but is often overlooked in biological invasion research. A problem with such studies is that they may add pressure to already threatened species and subject vulnerable individuals to increased risk. One solution is to obtain the maximum possible information from the generated data using a variety of statistical techniques, instead of solely using simple versions of linear regression or generalized linear models as is customary. Here, we exemplify and compare the use of modern regression techniques which have very different conceptual backgrounds and aims (negative binomial models, zero-inflated regression, and expectile regression), and which have rarely been applied to behavioural data in biological invasion studies. We show that our data display overdispersion, which is frequent in ecological and behavioural data, and that conventional statistical methods such as Poisson generalized linear models are inadequate in this case. Expectile regression is similar to quantile regression and allows the estimation of functional relationships between variables for all portions of a probability distribution and is thus well suited for modelling boundaries in polygonal relationships or cases with heterogeneous variances which are frequent in behavioural data. We applied various statistical techniques to aggression in invasive mosquitofish, Gambusia holbrooki, and the concomitant vulnerability of native toothcarp, Aphanius iberus, in relation to individual size and sex. We found that medium sized male G. holbrooki carry out the majority of aggressive acts and that smaller and medium size A. iberus are most vulnerable. Of the regression techniques used, only negative binomial models and zero-inflated and expectile Poisson regressions revealed these relationships.

11.

A score test for testing a zero-inflated Poisson regression model against zero-inflated negative binomial alternatives (cited 3 times in total: 0 self-citations, 3 citations by others)

Ridout M Hinde J Demétrio CG 《Biometrics》2001,57(1):219-223

Count data often show a higher incidence of zero counts than would be expected if the data were Poisson distributed. Zero-inflated Poisson regression models are a useful class of models for such data, but parameter estimates may be seriously biased if the nonzero counts are overdispersed in relation to the Poisson distribution. We therefore provide a score test for testing zero-inflated Poisson regression models against zero-inflated negative binomial alternatives.
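For readers unfamiliar with zero inflation, the following sketch shows the usual zero-inflated Poisson probability mass function without covariates. It assumes the standard mixture form (a point mass at zero mixed with a Poisson distribution); the parameter names `lam` and `pi_zero` are illustrative and the score test itself is not implemented here.

```python
import math


def zip_pmf(k, lam, pi_zero):
    """P(Y = k) for a zero-inflated Poisson: extra zeros with probability pi_zero,
    otherwise an ordinary Poisson(lam) count."""
    poisson_prob = math.exp(-lam) * lam ** k / math.factorial(k)
    if k == 0:
        return pi_zero + (1.0 - pi_zero) * poisson_prob
    return (1.0 - pi_zero) * poisson_prob


lam, pi_zero = 2.0, 0.3  # hypothetical values for illustration
print("P(Y = 0) under plain Poisson:", round(math.exp(-lam), 4))
print("P(Y = 0) under ZIP:", round(zip_pmf(0, lam, pi_zero), 4))
```

The nonzero counts in this model are still Poisson distributed apart from rescaling, which is why overdispersion among them calls for a negative binomial alternative.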

12.

Predicting spatial patterns of plant species richness: a comparison of direct macroecological and species stacking modelling approaches

Anne Dubuis Julien Pottier Vanessa Rion Loïc Pellissier Jean‐Paul Theurillat Antoine Guisan 《Diversity & distributions》2011,17(6):1122-1131

Aim: This study compares the direct, macroecological approach (MEM) for modelling species richness (SR) with the more recent approach of stacking predictions from individual species distributions (S‐SDM). We implemented both approaches on the same dataset and discuss their respective theoretical assumptions, strengths and drawbacks. We also tested how both approaches performed in reproducing observed patterns of SR along an elevational gradient. Location: Two study areas in the Alps of Switzerland. Methods: We implemented MEM by relating the species counts to environmental predictors with statistical models, assuming a Poisson distribution. S‐SDM was implemented by modelling each species distribution individually and then stacking the obtained prediction maps in three different ways – summing binary predictions, summing random draws of binomial trials and summing predicted probabilities – to obtain a final species count. Results: The direct MEM approach yields nearly unbiased predictions centred around the observed mean values, but with a lower correlation between predictions and observations than that achieved by the S‐SDM approaches. This method also cannot provide any information on species identity and, thus, community composition. It does, however, accurately reproduce the hump‐shaped pattern of SR observed along the elevational gradient. The S‐SDM approach summing binary maps can predict individual species and thus communities, but tends to overpredict SR. The two other S‐SDM approaches – the summed binomial trials based on predicted probabilities and summed predicted probabilities – do not overpredict richness, but they predict many competing end points of assembly or they lose the individual species predictions, respectively. Furthermore, all S‐SDM approaches fail to appropriately reproduce the observed hump‐shaped patterns of SR along the elevational gradient. Main conclusions: The macroecological approach and S‐SDM have complementary strengths. We suggest that both could be used in combination to obtain better SR predictions by following the suggestion of constraining S‐SDM by MEM predictions.

13.

Zero‐Inflated Negative Binomial Mixed Regression Modeling of Over‐Dispersed Count Data with Extra Zeros

Kelvin K. W. Yau Kui Wang Andy H. Lee 《Biometrical journal. Biometrische Zeitschrift》2003,45(4):437-452

In many biometrical applications, the count data encountered often contain extra zeros relative to the Poisson distribution. Zero‐inflated Poisson regression models are useful for analyzing such data, but parameter estimates may be seriously biased if the nonzero observations are over‐dispersed and simultaneously correlated due to the sampling design or the data collection procedure. In this paper, a zero‐inflated negative binomial mixed regression model is presented to analyze a set of pancreas disorder length of stay (LOS) data that comprised mainly same‐day separations. Random effects are introduced to account for inter‐hospital variations and the dependency of clustered LOS observations. Parameter estimation is achieved by maximizing an appropriate log‐likelihood function using an EM algorithm. Alternative modeling strategies, namely the finite mixture of Poisson distributions and the non‐parametric maximum likelihood approach, are also considered. The determination of pertinent covariates would assist hospital administrators and clinicians to manage LOS and expenditures efficiently.

14.

Promotion time models with time-changing exposure and heterogeneity: application to infectious diseases

Tournoud M Ecochard R 《Biometrical journal. Biometrische Zeitschrift》2008,50(3):395-407

Promotion time models have been recently adapted to the context of infectious diseases to take into account discrete and multiple exposures. However, Poisson distribution of the number of pathogens transmitted at each exposure was a very strong assumption and did not allow for inter-individual heterogeneity. Bernoulli, the negative binomial, and the compound Poisson distributions were proposed as alternatives to Poisson distribution for the promotion time model with time-changing exposure. All were derived within the frailty model framework. All these distributions have a point mass at zero to take into account non-infected people. Bernoulli distribution, the two-component cure rate model, was extended to multiple exposures. Unlike the negative binomial and the compound Poisson distributions, Bernoulli distribution did not make it possible to connect the number of pathogens transmitted to the delay between transmission and infection detection. Moreover, the two former distributions make it possible to account for inter-individual heterogeneity. The delay to surgical site infection was an example of single exposure. The probability of infection was very low; thus, estimates of the effect of selected risk factors on that probability obtained with Bernoulli and Poisson distributions were very close. The delay to nosocomial urinary tract infection was a multiple exposure example. The probabilities of pathogen transmission during catheter placement and catheter presence were estimated. Inter-individual heterogeneity was very high, and the fit was better with the compound Poisson and the negative binomial distributions. The proposed models proved to be also mechanistic. The negative binomial and the compound Poisson distributions were useful alternatives to account for inter-individual heterogeneity.

15.

Comparison of Methods for Estimating Bird Abundance and Trends From Historical Count Data

FRANK R. THOMPSON III FRANK A. LA SORTE 《The Journal of wildlife management》2008,72(8):1674-1682

Abstract: The use of bird counts as indices has come under increasing scrutiny because assumptions concerning detection probabilities may not be met, but there also seems to be some resistance to use of model-based approaches to estimating abundance. We used data from the United States Forest Service, Southern Region bird monitoring program to compare several common approaches for estimating annual abundance or indices and population trends from point-count data. We compared indices of abundance estimated as annual means of counts and from a mixed-Poisson model to abundance estimates from a count-removal model with 3 time intervals and a distance model with 3 distance bands. We compared trend estimates calculated from an autoregressive, exponential model fit to annual abundance estimates from the above methods and also by estimating trend directly by treating year as a continuous covariate in the mixed-Poisson model. We produced estimates for 6 forest songbirds based on an average of 621 and 459 points in 2 physiographic areas from 1997 to 2004. There was strong evidence that detection probabilities varied among species and years. Nevertheless, there was good overall agreement across trend estimates from the 5 methods for 9 of 12 comparisons. In 3 of 12 comparisons, however, patterns in detection probabilities potentially confounded interpretation of uncorrected counts. Estimates of detection probabilities differed greatly between removal and distance models, likely because the methods estimated different components of detection probability and the data collection was not optimally designed for either method. 
Given that detection probabilities often vary among species, years, and observers, investigators should address detection probability in their surveys, whether it be by estimation of probability of detection and abundance, estimation of effects of key covariates when modeling count as an index of abundance, or through design-based methods to standardize these effects.

16.

Comparison of random-effects meta-analysis models for the relative risk in the case of rare events: A simulation study

Marie Beisemann Philipp Doebler Heinz Holling 《Biometrical journal. Biometrische Zeitschrift》2020,62(7):1597-1630

Pooling the relative risk (RR) across studies investigating rare events, for example, adverse events, via meta-analytical methods still presents a challenge to researchers. The main reason for this is the high probability of observing no events in treatment or control group or both, resulting in an undefined log RR (the basis of standard meta-analysis). Other technical challenges ensue, for example, the violation of normality assumptions, or bias due to exclusion of studies and application of continuity corrections, leading to poor performance of standard approaches. In the present simulation study, we compared three recently proposed alternative models (random-effects [RE] Poisson regression, RE zero-inflated Poisson [ZIP] regression, binomial regression) to the standard methods in conjunction with different continuity corrections and to different versions of beta-binomial regression. Based on our investigation of the models' performance in 162 different simulation settings informed by meta-analyses from the Cochrane database and distinguished by different underlying true effects, degrees of between-study heterogeneity, numbers of primary studies, group size ratios, and baseline risks, we recommend the use of the RE Poisson regression model. The beta-binomial model recommended by Kuss (2015) also performed well. Decent performance was also exhibited by the ZIP models, but they also had considerable convergence issues. We stress that these recommendations are only valid for meta-analyses with larger numbers of primary studies. All models are applied to data from two Cochrane reviews to illustrate differences between and issues of the models. Limitations as well as practical implications and recommendations are discussed; a flowchart summarizing recommendations is provided.
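The undefined log RR mentioned in this abstract is easy to reproduce for a single study. The sketch below assumes one common form of continuity correction (adding a constant to every cell of the 2×2 table, so each group size grows by twice that constant); the function name and study counts are hypothetical, and the abstract does not say which corrections the authors compared.

```python
import math


def log_relative_risk(events_trt, n_trt, events_ctl, n_ctl, correction=0.0):
    """Log relative risk for one study, or None when it is undefined.

    `correction` is added to each cell of the 2x2 table (events and non-events),
    so each group size increases by 2 * correction.
    """
    a = events_trt + correction
    c = events_ctl + correction
    if a == 0 or c == 0:
        return None  # log(0) or division by zero: no finite log RR exists
    risk_trt = a / (n_trt + 2 * correction)
    risk_ctl = c / (n_ctl + 2 * correction)
    return math.log(risk_trt / risk_ctl)


# Hypothetical study with no events in the treatment arm.
print(log_relative_risk(0, 100, 3, 100))                  # undefined without correction
print(log_relative_risk(0, 100, 3, 100, correction=0.5))  # finite after adding 0.5 per cell
```

The corrected value depends on the arbitrary constant, which is one source of the bias the abstract attributes to continuity corrections.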

17.

Dynamic information in uncertain and changing worlds (cited 4 times in total: 0 self-citations, 4 citations by others)

M Mangel 《Journal of theoretical biology》1990,146(3):317-332

A general theory for information processing by organisms living in uncertain and changing worlds is developed. The three fundamental properties of the theory are: (i) the use of a memory parameter that allows the organism to forget the more distant past, (ii) a succinct representation of encounters and information and (iii) flexibility in the estimates of parameters by including the uncertainty in these estimates in a consistent manner. The theory is developed using Bayesian methods (but can also be applied to maximum likelihood estimation) and is applied to the encounter models standardly used in ecology (Poisson, binomial, and negative binomial). Two applications are discussed: (i) patch selection and the matching rule and (ii) superparasitism by a parasitoid.

18.

The logistic transform for bounded outcome scores

Lesaffre E Rizopoulos D Tsonaka R 《Biostatistics (Oxford, England)》2007,8(1):72-85

The logistic transformation, originally suggested by Johnson (1949), is applied to analyze responses that are restricted to a finite interval (e.g. (0,1)), so-called bounded outcome scores. Bounded outcome scores often have a non-standard distribution, e.g. J- or U-shaped, precluding classical parametric statistical approaches for analysis. Applying the logistic transformation to a normally distributed random variable gives rise to a logit-normal (LN) distribution. This distribution can take a variety of shapes on (0,1). Further, the model can be extended to correct for (baseline) covariates. Therefore, the method could be useful for comparative clinical trials. Bounded outcomes can be found in many research areas, e.g. drug compliance research, quality-of-life studies, and pain (and pain relief) studies using visual analog scores, but all these scores can attain the boundary values 0 or 1. A natural extension of the above approach is therefore to assume a latent score on (0,1) having a LN distribution. Two cases are considered: (a) the bounded outcome score is a proportion where the true probabilities have a LN distribution on (0,1) and (b) the bounded outcome score on [0,1] is a coarsened version of a latent score with a LN distribution on (0,1). We also allow the variance (on the transformed scale) to depend on treatment. The usefulness of our approach for comparative clinical trials will be assessed in this paper. It turns out to be important to distinguish the case of equal and unequal variances. For a bounded outcome score of the second type and with equal variances, our approach comes close to ordinal probit (OP) regression. However, ignoring the inequality of variances can lead to highly biased parameter estimates. A simulation study compares the performance of our approach with the two-sample Wilcoxon test and with OP regression. Finally, the different methods are illustrated on two data sets.
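The construction of the logit-normal distribution described above can be sketched by transforming normal draws. The parameter values and function names below are hypothetical, and the sketch covers only the latent score on (0,1), not the coarsening or the regression models of the paper.

```python
import math
import random


def logistic(z):
    """Map a real number to the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def sample_logit_normal(rng, mu, sigma, size):
    """Draw `size` logit-normal values by applying the logistic transform to N(mu, sigma^2) draws."""
    return [logistic(rng.gauss(mu, sigma)) for _ in range(size)]


def tail_fraction(values, cutoff=0.1):
    """Share of values within `cutoff` of either boundary, a crude indicator of a U shape."""
    return sum(1 for v in values if v < cutoff or v > 1.0 - cutoff) / len(values)


rng = random.Random(0)  # fixed seed for reproducibility
narrow = sample_logit_normal(rng, 0.0, 0.5, 5000)  # small variance on the logit scale
wide = sample_logit_normal(rng, 0.0, 3.0, 5000)    # large variance on the logit scale
print("share near the boundaries:", tail_fraction(narrow), tail_fraction(wide))
```

In this sketch a larger standard deviation on the transformed scale pushes mass toward 0 and 1, which is one way the distribution can take the U-like shapes mentioned in the abstract.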

19.

Estimating and interpreting secondary attack risk: Binomial considered biased

Yushuf Sharker Eben Kenah 《PLoS computational biology》2021,17(1)

The household secondary attack risk (SAR), often called the secondary attack rate or secondary infection risk, is the probability of infectious contact from an infectious household member A to a given household member B, where we define infectious contact to be a contact sufficient to infect B if he or she is susceptible. Estimation of the SAR is an important part of understanding and controlling the transmission of infectious diseases. In practice, it is most often estimated using binomial models such as logistic regression, which implicitly attribute all secondary infections in a household to the primary case. In the simplest case, the number of secondary infections in a household with m susceptibles and a single primary case is modeled as a binomial(m, p) random variable where p is the SAR. Although it has long been understood that transmission within households is not binomial, it is thought that multiple generations of transmission can be neglected safely when p is small. We use probability generating functions and simulations to show that this is a mistake. The proportion of susceptible household members infected can be substantially larger than the SAR even when p is small. As a result, binomial estimates of the SAR are biased upward and their confidence intervals have poor coverage probabilities even if adjusted for clustering. Accurate point and interval estimates of the SAR can be obtained using longitudinal chain binomial models or pairwise survival analysis, which account for multiple generations of transmission within households, the ongoing risk of infection from outside the household, and incomplete follow-up. We illustrate the practical implications of these results in an analysis of household surveillance data collected by the Los Angeles County Department of Public Health during the 2009 influenza A (H1N1) pandemic.
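The gap between the SAR and the proportion of household members infected can be seen in a small simulation. The sketch below is not the authors' analysis: it assumes a closed household, a Reed-Frost style chain binomial in discrete generations, and no infection from outside; the function name and the values of m and p are hypothetical.

```python
import random


def final_attack_proportion(rng, m, p):
    """Simulate one household with m susceptibles and one primary case.

    Each infectious member makes infectious contact with each susceptible with
    probability p; newly infected members form the next generation (assumed model).
    Returns the proportion of the m initial susceptibles who end up infected.
    """
    susceptible, infectious, total_infected = m, 1, 0
    while infectious > 0 and susceptible > 0:
        escape_prob = (1.0 - p) ** infectious  # chance of avoiding every current case
        new_cases = sum(1 for _ in range(susceptible) if rng.random() > escape_prob)
        susceptible -= new_cases
        total_infected += new_cases
        infectious = new_cases
    return total_infected / m


rng = random.Random(0)  # fixed seed for reproducibility
m, p, households = 4, 0.1, 20000
naive_estimate = sum(final_attack_proportion(rng, m, p) for _ in range(households)) / households
print("true SAR:", p, "mean proportion infected:", round(naive_estimate, 3))
```

Under these assumptions the mean proportion infected exceeds p because later generations add infections that a binomial(m, p) model would attribute to the primary case.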

20.

Complementary Log–Log Regression for the Estimation of Covariate‐Adjusted Prevalence Ratios in the Analysis of Data from Cross‐Sectional Studies

Alan D. Penman William D. Johnson 《Biometrical journal. Biometrische Zeitschrift》2009,51(3):433-442

We assessed complementary log–log (CLL) regression as an alternative statistical model for estimating multivariable‐adjusted prevalence ratios (PR) and their confidence intervals. Using the delta method, we derived an expression for approximating the variance of the PR estimated using CLL regression. Then, using simulated data, we examined the performance of CLL regression in terms of the accuracy of the PR estimates, the width of the confidence intervals, and the empirical coverage probability, and compared it with results obtained from log–binomial regression and stratified Mantel–Haenszel analysis. Within the range of values of our simulated data, CLL regression performed well, with only slight bias of point estimates of the PR and good confidence interval coverage. In addition, and importantly, the computational algorithm did not have the convergence problems occasionally exhibited by log–binomial regression. The technique is easy to implement in SAS (SAS Institute, Cary, NC), and it does not have the theoretical and practical issues associated with competing approaches. CLL regression is an alternative method of binomial regression that warrants further assessment.