共查询到20条相似文献,搜索用时 15 毫秒
1.
Ryan T. Godwin 《Biometrical journal. Biometrische Zeitschrift》2019,61(6):1541-1556
The one‐inflated positive Poisson mixture model (OIPPMM) is presented, for use as the truncated count model in Horvitz–Thompson estimation of an unknown population size. The OIPPMM offers a way to address two important features of some capture–recapture data: one‐inflation and unobserved heterogeneity. The OIPPMM provides markedly different results than some other popular estimators, and these other estimators can appear to be quite biased, or utterly fail due to the boundary problem, when the OIPPMM is the true data‐generating process. In addition, the OIPPMM provides a solution to the boundary problem, by labelling any mixture components on the boundary instead as one‐inflation. 相似文献
2.
Ryan T. Godwin 《Biometrical journal. Biometrische Zeitschrift》2017,59(1):79-93
We present the one‐inflated zero‐truncated negative binomial (OIZTNB) model, and propose its use as the truncated count distribution in Horvitz–Thompson estimation of an unknown population size. In the presence of unobserved heterogeneity, the zero‐truncated negative binomial (ZTNB) model is a natural choice over the positive Poisson (PP) model; however, when one‐inflation is present the ZTNB model either suffers from a boundary problem, or provides extremely biased population size estimates. Monte Carlo evidence suggests that in the presence of one‐inflation, the Horvitz–Thompson estimator under the ZTNB model can converge in probability to infinity. The OIZTNB model gives markedly different population size estimates compared to some existing truncated count distributions, when applied to several capture–recapture data that exhibit both one‐inflation and unobserved heterogeneity. 相似文献
3.
4.
Count data often show a higher incidence of zero counts than would be expected if the data were Poisson distributed. Zero-inflated Poisson regression models are a useful class of models for such data, but parameter estimates may be seriously biased if the nonzero counts are overdispersed in relation to the Poisson distribution. We therefore provide a score test for testing zero-inflated Poisson regression models against zero-inflated negative binomial alternatives. 相似文献
5.
Kelvin K. W. Yau Kui Wang Andy H. Lee 《Biometrical journal. Biometrische Zeitschrift》2003,45(4):437-452
In many biometrical applications, the count data encountered often contain extra zeros relative to the Poisson distribution. Zero‐inflated Poisson regression models are useful for analyzing such data, but parameter estimates may be seriously biased if the nonzero observations are over‐dispersed and simultaneously correlated due to the sampling design or the data collection procedure. In this paper, a zero‐inflated negative binomial mixed regression model is presented to analyze a set of pancreas disorder length of stay (LOS) data that comprised mainly same‐day separations. Random effects are introduced to account for inter‐hospital variations and the dependency of clustered LOS observations. Parameter estimation is achieved by maximizing an appropriate log‐likelihood function using an EM algorithm. Alternative modeling strategies, namely the finite mixture of Poisson distributions and the non‐parametric maximum likelihood approach, are also considered. The determination of pertinent covariates would assist hospital administrators and clinicians to manage LOS and expenditures efficiently. 相似文献
6.
Ridout, Hinde, and Demétrio (2001, Biometrics 57, 219-223) derived a score test for testing a zero-inflated Poisson (ZIP) regression model against zero-inflated negative binomial (ZINB) alternatives. They mentioned that the score test using the normal approximation might underestimate the nominal significance level possibly for small sample cases. To remedy this problem, a parametric bootstrap method is proposed. It is shown that the bootstrap method keeps the significance level close to the nominal one and has greater power uniformly than the existing normal approximation for testing the hypothesis. 相似文献
7.
8.
Grunwald GK Bruce SL Jiang L Strand M Rabinovitch N 《Biometrical journal. Biometrische Zeitschrift》2011,53(4):578-594
We propose a likelihood-based model for correlated count data that display under- or overdispersion within units (e.g. subjects). The model is capable of handling correlation due to clustering and/or serial correlation, in the presence of unbalanced, missing or unequally spaced data. A family of distributions based on birth-event processes is used to model within-subject underdispersion. A computational approach is given to overcome a parameterization difficulty with this family, and this allows use of common Markov Chain Monte Carlo software (e.g. WinBUGS) for estimation. Application of the model to daily counts of asthma inhaler use by children shows substantial within-subject underdispersion, between-subject heterogeneity and correlation due to both clustering of measurements within subjects and serial correlation of longitudinal measurements. The model provides a major improvement over Poisson longitudinal models, and diagnostics show that the model fits well. 相似文献
9.
10.
Andy H. Lee Kui Wang Kelvin K.W. Yau 《Biometrical journal. Biometrische Zeitschrift》2001,43(8):963-975
When analyzing Poisson count data sometimes a high frequency of extra zeros is observed. The Zero‐Inflated Poisson (ZIP) model is a popular approach to handle zero‐inflation. In this paper we generalize the ZIP model and its regression counterpart to accommodate the extent of individual exposure. Empirical evidence drawn from an occupational injury data set confirms that the incorporation of exposure information can exert a substantial impact on the model fit. Tests for zero‐inflation are also considered. Their finite sample properties are examined in a Monte Carlo study. 相似文献
11.
We discuss the problem of estimating the number of nests of different species of seabirds on North East Herald Cay based on the data from a 1996 survey of quadrats along transects and data from similar past surveys. We consider three approaches based on different plausible models, namely a conditional negative binomial model that allows for additional zeroes in the data, a weighting approach (based on a heteroscedastic regression model), and a transform-both-sides regression approach. We find that the conditional negative binomial approach and a linear regression approach work well but that the transform-both-sides approach should not be used. We apply the conditional negative binomial and linear regression approaches with poststratification based on data quality and availability to estimate the number of frigatebird nests on North East Herald Cay. 相似文献
12.
A randomised controlled trial to evaluate a training programme for physician-patient communication required the analysis of paired count data. The impact of departures from the Poisson assumption when paired count data are analysed through use of a conditional likelihood is illustrated. A simple approach to providing robust inference is outlined and illustrated. 相似文献
13.
Russell B. Millar 《Biometrics》2009,65(3):962-969
Summary . When replicate count data are overdispersed, it is common practice to incorporate this extra-Poisson variability by including latent parameters at the observation level. For example, the negative binomial and Poisson-lognormal (PLN) models are obtained by using gamma and lognormal latent parameters, respectively. Several recent publications have employed the deviance information criterion (DIC) to choose between these two models, with the deviance defined using the Poisson likelihood that is obtained from conditioning on these latent parameters. The results herein show that this use of DIC is inappropriate. Instead, DIC was seen to perform well if calculated using likelihood that was marginalized at the group level by integrating out the observation-level latent parameters. This group-level marginalization is explicit in the case of the negative binomial, but requires numerical integration for the PLN model. Similarly, DIC performed well to judge whether zero inflation was required when calculated using the group-marginalized form of the zero-inflated likelihood. In the context of comparing multilevel hierarchical models, the top-level DIC was obtained using likelihood that was further marginalized by additional integration over the group-level latent parameters, and the marginal densities of the models were calculated for the purpose of providing Bayes' factors. The computational viability and interpretability of these different measures is considered. 相似文献
14.
15.
16.
Gillian Z. Heller Dominique‐Laurent Couturier Stephane R. Heritier 《Biometrical journal. Biometrische Zeitschrift》2019,61(2):333-342
In clinical trials one traditionally models the effect of treatment on the mean response. The underlying assumption is that treatment affects the response distribution through a mean location shift on a suitable scale, with other aspects of the distribution (shape/dispersion/variance) remaining the same. This work is motivated by a trial in Parkinson's disease patients in which one of the endpoints is the number of falls during a 10‐week period. Inspection of the data reveals that the Poisson‐inverse Gaussian (PiG) distribution is appropriate, and that the experimental treatment reduces not only the mean, but also the variability, substantially. The conventional analysis assumes a treatment effect on the mean, either adjusted or unadjusted for covariates, and a constant dispersion parameter. On our data, this analysis yields a non‐significant treatment effect. However, if we model a treatment effect on both mean and dispersion parameters, both effects are highly significant. A simulation study shows that if a treatment effect exists on the dispersion and is ignored in the modelling, estimation of the treatment effect on the mean can be severely biased. We show further that if we use an orthogonal parametrization of the PiG distribution, estimates of the mean model are robust to misspecification of the dispersion model. We also discuss inferential aspects that are more difficult than anticipated in this setting. These findings have implications in the planning of statistical analyses for count data in clinical trials. 相似文献
17.
Capture‐recapture studies have attracted a lot of attention over the past few decades, especially in applied disciplines where a direct estimate for the size of a population of interest is not available. Epidemiology, ecology, public health, and biodiversity are just a few examples. The estimation of the number of unseen units has been a challenge for theoretical statisticians, and considerable progress has been made in providing lower bound estimators for the population size. In fact, it is well known that consistent estimators for this cannot be provided in the very general case. Considering a case where capture‐recapture studies are summarized by a frequency of frequencies distribution, we derive a simple upper bound of the population size based on the cumulative distribution function. We introduce two estimators of this bound, without any specific parametric assumption on the distribution of the observed frequency counts. The behavior of the proposed estimators is investigated using several benchmark datasets and a large‐scale simulation experiment based on the scheme discussed by Pledger. 相似文献
18.
19.
Wileyto et al. [E.P. Wileyto, W.J. Ewens, M.A. Mullen, Markov-recapture population estimates: a tool for improving interpretation of trapping experiments, Ecology 75 (1994) 1109] propose a four-state discrete time Markov process, which describes the structure of a marking-capture experiment as a method of population estimation. They propose this method primarily for estimation of closed insect populations. Their method provides a mark-recapture estimate from a single trap observation by allowing subjects to mark themselves. The estimate of the unknown population size is based on the assumption of a closed population and a simple Markov model in which the rates of marking, capture, and recapture are assumed to be equal. Using the one step transition probability matrix of their model, we illustrate how to go from an embedded discrete time Markov process to a continuous time Markov process assuming exponentially distributed holding times. We also compute the transition probabilities after time t for the continuous time case and compare the limiting behavior of the continuous and discrete time processes. Finally, we generalize their model by relaxing the assumption of equal per capita rates for marking, capture, and recapture. Other questions about how their results change when using a continuous time Markov process are examined. 相似文献
20.
Thibaut N. Bouveroux Michelle Caputo Pierre W. Froneman Stephanie Plön 《Marine Mammal Science》2018,34(3):645-665
This study investigates how group size of Indo‐Pacific bottlenose dolphins (Tursiops aduncus) changes temporally, spatially, and/or with predominant behavior at two discreet sites along the Eastern Cape coastline of South Africa: Algoa Bay and the Wild Coast. The mean group size of bottlenose dolphins was large with an average of 52 animals. Significantly larger groups were observed in Algoa Bay ( = 60, range = 1–600) than off the Wild Coast ( = 32.9, range = 1–250). In Algoa Bay, the mean group size increased significantly over the study period, from an average 18 animals in 2008 to 76 animals in 2016. Additionally, the largest average and maximum group sizes ever reported both in South Africa and worldwide, were recorded in Algoa Bay (maximum group size = 600). Neither season nor behavior had a significant effect on mean group size at both sites. Similarly environmental variables such as the depth and substrate type also had no influence on group size. It remains unclear which ecological drivers, such as predation risk and food availability, are leading to the large groups observed in this area, and further research on abundance and distribution of both predators and prey is necessary. 相似文献