共查询到20条相似文献,搜索用时 15 毫秒
1.
Summary Exposure to high levels of air pollution during the pregnancy is associated with increased probability of preterm birth (PTB), a major cause of infant morbidity and mortality. New statistical methodology is required to specifically determine when a particular pollutant impacts the PTB outcome, to determine the role of different pollutants, and to characterize the spatial variability in these results. We develop a new Bayesian spatial model for PTB which identifies susceptible windows throughout the pregnancy jointly for multiple pollutants (PM2.5 , ozone) while allowing these windows to vary continuously across space and time. We geo‐code vital record birth data from Texas (2002–2004) and link them with standard pollution monitoring data and a newly introduced EPA product of calibrated air pollution model output. We apply the fully spatial model to a region of 13 counties in eastern Texas consisting of highly urban as well as rural areas. Our results indicate significant signal in the first two trimesters of pregnancy with different pollutants leading to different critical windows. Introducing the spatial aspect uncovers critical windows previously unidentified when space is ignored. A proper inference procedure is introduced to correctly analyze these windows. 相似文献
2.
Cerioli A 《Biometrics》2002,58(4):888-897
A common feature of data collected in environmental and earth sciences is that they typically exhibit spatial autocorrelation. Violating the assumption of independent observations can have dramatic effects on inferences derived from standard statistical methods. In this article, we examine the consequences of spatial autocorrelation on Pearson's chi-squared test of mutual independence between two categorical responses with a general number of classes. Correspondingly, we suggest a simple modification to the standard test statistic that allows for spatial autocorrelation. Our modified statistic is based on a first-order correction factor and thus provides only an approximate test. However, we show by Monte Carlo simulation that this approximation results in satisfactory inferences in several situations of practical interest. The usefulness of the method is displayed through an application to categorical data arising in the study of the relationship between the distribution pattern of plant species and woodland age in a forest in northern Belgium. 相似文献
3.
Carsten F. Dormann 《Global Ecology and Biogeography》2007,16(2):129-138
Aim Spatial autocorrelation (SAC) in data, i.e. the higher similarity of closer samples, is a common phenomenon in ecology. SAC is starting to be considered in the analysis of species distribution data, and over the last 10 years several studies have incorporated SAC into statistical models (here termed 'spatial models'). Here, I address the question of whether incorporating SAC affects estimates of model coefficients and inference from statistical models.
Methods I review ecological studies that compare spatial and non-spatial models.
Results In all cases coefficient estimates for environmental correlates of species distributions were affected by SAC, leading to a mis-estimation of on average c . 25%. Model fit was also improved by incorporating SAC.
Main conclusions These biased estimates and incorrect model specifications have implications for predicting species occurrences under changing environmental conditions. Spatial models are therefore required to estimate correctly the effects of environmental drivers on species present distributions, for a statistically unbiased identification of the drivers of distribution, and hence for more accurate forecasts of future distributions. 相似文献
Methods I review ecological studies that compare spatial and non-spatial models.
Results In all cases coefficient estimates for environmental correlates of species distributions were affected by SAC, leading to a mis-estimation of on average c . 25%. Model fit was also improved by incorporating SAC.
Main conclusions These biased estimates and incorrect model specifications have implications for predicting species occurrences under changing environmental conditions. Spatial models are therefore required to estimate correctly the effects of environmental drivers on species present distributions, for a statistically unbiased identification of the drivers of distribution, and hence for more accurate forecasts of future distributions. 相似文献
4.
In the fields of medicine and public health, a common application of areal data models is the study of geographical patterns of disease. When we have several measurements recorded at each spatial location (for example, information on p>/= 2 diseases from the same population groups or regions), we need to consider multivariate areal data models in order to handle the dependence among the multivariate components as well as the spatial dependence between sites. In this article, we propose a flexible new class of generalized multivariate conditionally autoregressive (GMCAR) models for areal data, and show how it enriches the MCAR class. Our approach differs from earlier ones in that it directly specifies the joint distribution for a multivariate Markov random field (MRF) through the specification of simpler conditional and marginal models. This in turn leads to a significant reduction in the computational burden in hierarchical spatial random effect modeling, where posterior summaries are computed using Markov chain Monte Carlo (MCMC). We compare our approach with existing MCAR models in the literature via simulation, using average mean square error (AMSE) and a convenient hierarchical model selection criterion, the deviance information criterion (DIC; Spiegelhalter et al., 2002, Journal of the Royal Statistical Society, Series B64, 583-639). Finally, we offer a real-data application of our proposed GMCAR approach that models lung and esophagus cancer death rates during 1991-1998 in Minnesota counties. 相似文献
5.
We propose an approximate Bayesian method for comparing an experimental treatment to a control based on a randomized clinical trial with multivariate patient outcomes. Overall treatment effect is characterized by a vector of parameters corresponding to effects on the individual patient outcomes. We partition the parameter space into four sets where, respectively, the experimental treatment is superior to the control, the control is superior to the experimental, the two treatments are equivalent, and the treatment effects are discordant. We compute posterior probabilities of the parameter sets by treating an estimator of the parameter vector like a random variable in the Bayesian paradigm. The approximation may be used in any setting where a consistent, asymptotically normal estimator of the parameter vector is available. The method is illustrated by application to a breast cancer data set consisting of multiple time-to-event outcomes with covariates and to count data arising from a cross-classification of response, infection, and treatment in an acute leukemia trial. 相似文献
6.
Marcos O. Prates Robert H. Aseltine Jr. Dipak K. Dey Jun Yan 《Biometrical journal. Biometrische Zeitschrift》2013,55(6):912-924
Unhealthy alcohol use is one of the leading causes of morbidity and mortality in the United States. Brief interventions with high‐risk drinkers during an emergency department (ED) visit are of great interest due to their possible efficacy and low cost. In a collaborative study with patients recruited at 14 academic ED across the United States, we examined the self‐reported number of drinks per week by each patient following the exposure to a brief intervention. Count data with overdispersion have been mostly analyzed with generalized linear mixed models (GLMMs), of which only a limited number of link functions are available. Different choices of link function provide different fit and predictive power for a particular dataset. We propose a class of link functions from an alternative way to incorporate random effects in a GLMM, which encompasses many existing link functions as special cases. The methodology is naturally implemented in a Bayesian framework, with competing links selected with Bayesian model selection criteria such as the conditional predictive ordinate (CPO). In application to the ED intervention study, all models suggest that the intervention was effective in reducing the number of drinks, but some new models are found to significantly outperform the traditional model as measured by CPO. The validity of CPO in link selection is confirmed in a simulation study that shared the same characteristics as the count data from high‐risk drinkers. The dataset and the source code for the best fitting model are available in Supporting Information. 相似文献
7.
Semi-competing risks data include the time to a nonterminating event and the time to a terminating event, while competing risks data include the time to more than one terminating event. Our work is motivated by a prostate cancer study, which has one nonterminating event and two terminating events with both semi-competing risks and competing risks present as well as two censoring times. In this paper, we propose a new multi-risks survival (MRS) model for this type of data. In addition, the proposed MRS model can accommodate noninformative right-censoring times for nonterminating and terminating events. Properties of the proposed MRS model are examined in detail. Theoretical and empirical results show that the estimates of the cumulative incidence function for a nonterminating event may be biased if the information on a terminating event is ignored. A Markov chain Monte Carlo sampling algorithm is also developed. Our methodology is further assessed using simulations and also an analysis of the real data from a prostate cancer study. As a result, a prostate-specific antigen velocity greater than 2.0 ng/mL per year and higher biopsy Gleason scores are positively associated with a shorter time to death due to prostate cancer. 相似文献
8.
This paper presents a Bayesian nonlinear approach for the analysis of spatial count data. It extends the Bayesian partition methodology of Holmes, Denison, and Mallick (1999, Bayesian partitioning for classification and regression, Technical Report, Imperial College, London) to handle data that involve counts. A demonstration involving incidence rates of leukemia in New York state is used to highlight the methodology. The model allows us to make probability statements on the incidence rates around point sources without making any parametric assumptions about the nature of the influence between the sources and the surrounding location. 相似文献
9.
A nonparametric Bayesian formulation is given to the problem of modeling nonhomogeneous spatial point patterns influenced by concomitant variables. Only incomplete information on the concomitant variables is assumed, consisting of a relatively small number of point measurements. Residual variation, caused by other unmeasured influential factors, is modeled in terms of a spatially varying baseline intensity function. A Markov chain Monte Carlo scheme is proposed for the simultaneous nonparametric estimation of each unknown function in the model. The suggested method is illustrated by reanalysing a data set in Rathbun (1996, Biometrics 52, 226-242), and the estimated models are compared with those obtained by Rathbun. 相似文献
10.
Even though the reed, Phragmites australis, is an extensively studied wetland species, little is known about reproduction and dispersal modes within and among reed populations at the scale of small river systems. Using microsatellite analysis of 189 individuals from three adjacent river catchments in the Czech Republic, we elucidated the role of the river corridors in the dispersal of P. australis. Using Bayesian clustering of individuals, we found that 19% of clusters were distributed only along one river, which implied dispersal by water (or by wind) along river corridors, whereas 38% of clusters were widely distributed and were likely the product of wind long-distance dispersal among rivers. Intensive exchange of propagules among river systems is further demonstrated by only 6% of total variance being attributed to the variance among rivers in the AMOVA-analysis. Spatial autocorrelation analysis revealed a decreasing pattern up to 5–10 km and no clear pattern over longer distances. This gives an evidence for pollen and seed dispersal at short distances (up to 1 km), whereas most likely only seed dispersal at longer distances up to 10 km. We found five multilocus genotypes distributed in two different populations. The distances between populations with the same genotype ranged from 0.5 to 10.8 km. This can be interpreted as long-distance vegetative dispersal. 相似文献
11.
A Bayesian approach to the direct mapping of a quantitative trait locus (QTL), fully utilizing information from multiple linked gene markers, is presented in this paper. The joint posterior distribution (a mixture distribution modeling the linkage between a biallelic QTL and N gene markers) is computationally challenging and invites exploration via Markov chain Monte Carlo methods. The parameter's complete marginal posterior densities are obtained, allowing a diverse range of inferences. Parameters estimated include the QTL genotype probabilities for the sires and the offspring, the allele frequencies for the QTL, and the position and additive and dominance effects of the QTL. The methodology is applied through simulation to a half-sib design to form an outbred pedigree structure where there is an entire class of missing information. The capacity of the technique to accurately estimate parameters is examined for a range of scenarios. 相似文献
12.
Epperson BK 《Theoretical population biology》2003,64(1):81-87
Spatial distributions of biological variables are often well-characterized with pairwise measures of spatial autocorrelation. In this article, the probability theory for products and covariances of join-count spatial autocorrelation measures are developed for spatial distributions of multiple nominal (e.g. species or genotypes) types. This more fully describes the joint distributions of pairwise measures in spatial distributions of multiple (i.e. more than two) types. An example is given on how the covariances can be used for finding standard errors of weighted averages of join-counts in spatial autocorrelation analysis of more than two types, as is typical for genetic data for multiallelic loci. 相似文献
13.
Researchers often measure stress using questionnaire data on the occurrence of potentially stress-inducing life events and the strength of reaction to these events, characterized as negative or positive and assigned an ordinal ranking. In studying the health effects of stress, one needs to obtain measures of an individual's negative and positive stress levels to be used as predictors. Motivated by data of this type, we propose a latent variable model, which is characterized by event-specific negative and positive reaction scores. If the positive reaction score dominates the negative reaction score for an event, then the individual's reported response to that event will be positive, with an ordinal ranking determined by the value of the score. Measures of overall positive and negative stress can be obtained by summing the reactivity scores across the events that occur for an individual. By incorporating these measures as predictors in a regression model and fitting the stress and outcome models jointly using Bayesian methods, inferences can be conducted without the need to assume known weights for the different events. We propose an MCMC algorithm for posterior computation and apply the approach to study the effects of stress on preterm delivery. 相似文献
14.
Hiebeler D 《Journal of theoretical biology》2005,232(1):143-149
The basic contact process in continuous time is studied, where instead of single occupied sites becoming empty independently, larger-scale disturbance events simultaneously remove the population from contiguous blocks of sites. Stochastic spatial simulations and pair approximations were used to investigate the model. Increasing the spatial scale of disturbance events increases spatial clustering of the population and variability in growth rates within localized regions, reduces the effective overall population density, and increases the critical reproductive rate necessary for the population to persist. Pair approximations yield a closed-form analytic expression for equilibrium population density and the critical value necessary for persistence. 相似文献
15.
Summary We estimate the parameters of a stochastic process model for a macroparasite population within a host using approximate Bayesian computation (ABC). The immunity of the host is an unobserved model variable and only mature macroparasites at sacrifice of the host are counted. With very limited data, process rates are inferred reasonably precisely. Modeling involves a three variable Markov process for which the observed data likelihood is computationally intractable. ABC methods are particularly useful when the likelihood is analytically or computationally intractable. The ABC algorithm we present is based on sequential Monte Carlo, is adaptive in nature, and overcomes some drawbacks of previous approaches to ABC. The algorithm is validated on a test example involving simulated data from an autologistic model before being used to infer parameters of the Markov process model for experimental data. The fitted model explains the observed extra‐binomial variation in terms of a zero‐one immunity variable, which has a short‐lived presence in the host. 相似文献
16.
The aggregate data study design (Prentice and Sheppard, 1995, Biometrika 82, 113-125) estimates individual-level exposure effects by regressing population-based disease rates on covariate data from survey samples in each population group. In this work, we further develop the aggregate data model to allow for residual spatial correlation among disease rates across populations. Geographical variation that is not explained by model predictors and has a spatial component often arises in studies of rare chronic diseases, such as breast cancer. We combine the aggregate and Bayesian disease-mapping models to provide an intuitive approach to the modeling of spatial effects while drawing correct inference regarding the exposure effect. Based on the results of simulation studies, we suggest guidelines for use of the proposed model. 相似文献
17.
Large‐scale biodiversity data are needed to predict species' responses to global change and to address basic questions in macroecology. While such data are increasingly becoming available, their analysis is challenging because of the typically large heterogeneity in spatial sampling intensity and the need to account for observation processes. Two further challenges are accounting for spatial effects that are not explained by covariates, and drawing inference on dynamics at these large spatial scales. We developed dynamic occupancy models to analyze large‐scale atlas data. In addition to occupancy, these models estimate local colonization and persistence probabilities. We accounted for spatial autocorrelation using conditional autoregressive models and autologistic models. We fitted the models to detection/nondetection data collected on a quarter‐degree grid across southern Africa during two atlas projects, using the hadeda ibis (Bostrychia hagedash) as an example. The model accurately reproduced the range expansion between the first (SABAP1: 1987–1992) and second (SABAP2: 2007–2012) Southern African Bird Atlas Project into the drier parts of interior South Africa. Grid cells occupied during SABAP1 generally remained occupied, but colonization of unoccupied grid cells was strongly dependent on the number of occupied grid cells in the neighborhood. The detection probability strongly varied across space due to variation in effort, observer identity, seasonality, and unexplained spatial effects. We present a flexible hierarchical approach for analyzing grid‐based atlas data using dynamical occupancy models. Our model is similar to a species' distribution model obtained using generalized additive models but has a number of advantages. Our model accounts for the heterogeneous sampling process, spatial correlation, and perhaps most importantly, allows us to examine dynamic aspects of species ranges. 相似文献
18.
Bayesian analyses of spatial data often use a conditionally autoregressive (CAR) prior, which can be written as the kernel of an improper density that depends on a precision parameter tau that is typically unknown. To include tau in the Bayesian analysis, the kernel must be multiplied by tau(k) for some k. This article rigorously derives k = (n - I)/2 for the L2 norm CAR prior (also called a Gaussian Markov random field model) and k = n - I for the L1 norm CAR prior, where n is the number of regions and I the number of \"islands\" (disconnected groups of regions) in the spatial map. Since I = 1 for a spatial structure defining a connected graph, this supports Knorr-Held's (2002, in Highly Structured Stochastic Systems, 260-264) suggestion that k = (n - 1)/2 in the L2 norm case, instead of the more common k = n/2. We illustrate the practical significance of our results using a periodontal example. 相似文献
19.
20.
Deborah A. Costain 《Biometrics》2009,65(4):1123-1132
Summary Methods for modeling and mapping spatial variation in disease risk continue to motivate much research. In particular, spatial analyses provide a useful tool for exploring geographical heterogeneity in health outcomes, and consequently can yield clues as to disease etiology, direct public health management, and generate research hypotheses. This article presents a Bayesian partitioning approach for the analysis of individual level geo‐referenced health data. The model makes few assumptions about the underlying form of the risk surface, is data adaptive, and allows for the inclusion of known determinants of disease. The methodology is used to model spatial variation in neonatal mortality in Porto Alegre, Brazil. 相似文献