首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
The K function is a summary of spatial dependence in spatial point processes. In practice one observes a realization of the spatial point process, called a spatial point pattern. Although the K function of a spatial point process is typically unknown, several estimators of the process K function have been put forth. These estimators, however, are based upon empirical averages; the complicated distributional properties of the estimators unfortunately complicates interval estimation. In this paper, we propose a Bayesian inferential framework, allowing inference for the K function of the spatial point process (including interval estimation). Of particular interest is the unique use of the posterior predictive distribution to (efficiently) enable such inferences. To demonstrate our technique, the well known Swedish pine sapling data (Strand, 1972) is analyzed, including a discussion on evaluating model fit.  相似文献   

2.
This article deals with the identification of gene regulatory networks from experimental data using a statistical machine learning approach. A stochastic model of gene interactions capable of handling missing variables is proposed. It can be described as a dynamic Bayesian network particularly well suited to tackle the stochastic nature of gene regulation and gene expression measurement. Parameters of the model are learned through a penalized likelihood maximization implemented through an extended version of EM algorithm. Our approach is tested against experimental data relative to the S.O.S. DNA Repair network of the Escherichia coli bacterium. It appears to be able to extract the main regulations between the genes involved in this network. An added missing variable is found to model the main protein of the network. Good prediction abilities on unlearned data are observed. These first results are very promising: they show the power of the learning algorithm and the ability of the model to capture gene interactions.  相似文献   

3.
Understanding mammalian evolution using Bayesian phylogenetic inference   总被引:1,自引:0,他引:1  
1. Phylogenetic trees are critical in addressing evolutionary hypotheses; however, the reconstruction of a phylogeny is no easy task. This process has recently been made less arduous by using a Bayesian statistical approach. This method offers the advantage that one can determine the probability of some hypothesis (i.e. a phylogeny), conditional on the observed data (i.e. nucleotide sequences). 2. By reconstructing phylogenies using Bayes’ theorem in combination with Markov chain Monte Carlo, phylogeneticists are able to test hypotheses more quickly compared with using standard methods such as neighbour-joining, maximum likelihood or parsimony. Critics of the Bayesian approach suggest that it is not a panacea, and argue that the prior probability is too subjective and the resulting posterior probability is too liberal compared with maximum likelihood. 3. These issues are currently debated in the arena of mammalian evolution. Recently, proponents and opponents of the Bayesian approach have constructed the mammalian phylogeny using different methods under different conditions and with a variety of parameters. These analyses showed the robustness (or lack of) of the Bayesian approach. In the end, consensus suggests that Bayesian methods are robust and give essentially the same answer as maximum likelihood methods but in less time. 4. Approaches based on fossils and molecules typically agree on ordinal-level relationships among mammals but not on higher-level relationships, as Bayesian analyses recognize the African radiation, Afrotheria, and the two Laurasian radiations, Laurasiatheria and Euarchontoglires, whereas fossils did not predict Afrotheria.  相似文献   

4.
A statistical analysis of a weighted averaging procedure for the estimation of small signals buried in noise (Hoke et al. 1984a) is given. The weighting factor used by this method is in inverse proportion to the variance estimated for the noise. It is shown that, compred to conventional averaging, weighted averaging can improve the signal-to-noise ratio to a high extent if the variance of the noise changes as a function of time. On the other hand, uncritical application of the method involves the danger that the signal amplitude is underestimated. How serious this effect is depends on the number of degrees of freedom available for the estimation of the weighting factor. The effect can be neglected, if this number is sufficiently increased by means of an appropriate preprocessing.  相似文献   

5.
6.
Bayesian inference of recent migration rates using multilocus genotypes   总被引:25,自引:0,他引:25  
Wilson GA  Rannala B 《Genetics》2003,163(3):1177-1191
A new Bayesian method that uses individual multilocus genotypes to estimate rates of recent immigration (over the last several generations) among populations is presented. The method also estimates the posterior probability distributions of individual immigrant ancestries, population allele frequencies, population inbreeding coefficients, and other parameters of potential interest. The method is implemented in a computer program that relies on Markov chain Monte Carlo techniques to carry out the estimation of posterior probabilities. The program can be used with allozyme, microsatellite, RFLP, SNP, and other kinds of genotype data. We relax several assumptions of early methods for detecting recent immigrants, using genotype data; most significantly, we allow genotype frequencies to deviate from Hardy-Weinberg equilibrium proportions within populations. The program is demonstrated by applying it to two recently published microsatellite data sets for populations of the plant species Centaurea corymbosa and the gray wolf species Canis lupus. A computer simulation study suggests that the program can provide highly accurate estimates of migration rates and individual migrant ancestries, given sufficient genetic differentiation among populations and sufficient numbers of marker loci.  相似文献   

7.
Using polynomial regression methods this paper introduces a new approoach to computer-aided interpretation of the ECG. An algorithm is presented for the identification and separation of ECG segments. The method of interpretation is efficient and its results are consistent and reliable.  相似文献   

8.
Bayesian inference for variance components using only error contrasts   总被引:6,自引:0,他引:6  
HARVILLE  DAVID A. 《Biometrika》1974,61(2):383-385
  相似文献   

9.
Carey VJ  Baker CJ  Platt R 《Biometrics》2001,57(1):135-142
In the study of immune responses to infectious pathogens, the minimum protective antibody concentration (MPAC) is a quantity of great interest. We use case-control data to estimate the posterior distribution of the conditional risk of disease given a lower bound on antibody concentration in an at-risk subject. The concentration bound beyond which there is high credibility that infection risk is zero or nearly so is a candidate for the MPAC. A very simple Gibbs sampling procedure that permits inference on the risk of disease given antibody level is presented. In problems involving small numbers of patients, the procedure is shown to have favorable accuracy and robustness to choice/misspecification of priors. Frequentist evaluation indicates good coverage probabilities of credibility intervals for antibody-dependent risk, and rules for estimation of the MPAC are illustrated with epidemiological data.  相似文献   

10.
Polytomies and Bayesian phylogenetic inference   总被引:16,自引:0,他引:16  
Bayesian phylogenetic analyses are now very popular in systematics and molecular evolution because they allow the use of much more realistic models than currently possible with maximum likelihood methods. There are, however, a growing number of examples in which large Bayesian posterior clade probabilities are associated with very short branch lengths and low values for non-Bayesian measures of support such as nonparametric bootstrapping. For the four-taxon case when the true tree is the star phylogeny, Bayesian analyses become increasingly unpredictable in their preference for one of the three possible resolved tree topologies as data set size increases. This leads to the prediction that hard (or near-hard) polytomies in nature will cause unpredictable behavior in Bayesian analyses, with arbitrary resolutions of the polytomy receiving very high posterior probabilities in some cases. We present a simple solution to this problem involving a reversible-jump Markov chain Monte Carlo (MCMC) algorithm that allows exploration of all of tree space, including unresolved tree topologies with one or more polytomies. The reversible-jump MCMC approach allows prior distributions to place some weight on less-resolved tree topologies, which eliminates misleadingly high posteriors associated with arbitrary resolutions of hard polytomies. Fortunately, assigning some prior probability to polytomous tree topologies does not appear to come with a significant cost in terms of the ability to assess the level of support for edges that do exist in the true tree. Methods are discussed for applying arbitrary prior distributions to tree topologies of varying resolution, and an empirical example showing evidence of polytomies is analyzed and discussed.  相似文献   

11.
12.
Recently, several statistical methods for estimating fine-scale recombination rates using population samples have been developed. However, currently available methods that can be applied to large-scale data are limited to approximated likelihoods. Here, we developed a full-likelihood Markov chain Monte Carlo method for estimating recombination rate under a Bayesian framework. Genealogies underlying a sampling of chromosomes are effectively modelled by using marginal individual single nucleotide polymorphism genealogies related through an ancestral recombination graph. The method is compared with two existing composite-likelihood methods using simulated data.Simulation studies show that our method performs well for different simulation scenarios. The method is applied to two human population genetic variation datasets that have been studied by sperm typing. Our results are consistent with the estimates from sperm crossover analysis.  相似文献   

13.
1. Data on macroinvertebrates and stream chemistry were collected from sixty-four streams in Finland. Weighted averaging (WA) regression and calibration models were constructed to infer the minimum pH of streams from their invertebrate assemblages. The purpose was to develop an instrument for biological assessment and monitoring of stream acidification. The WA method was compared with simpler approaches, based on qualitative invertebrate data and pH tolerance limits, that are widely used.
2. Performance of the two approaches was assessed in terms of correlation between the inferred and observed minimum pH within the 'training set', and in terms of root mean squared differences (predicted – observed) (RMSEP) estimated by cross-validation or bootstrap resampling techniques. The models were further tested using independent data from the literature representative of a wide geographical range.
3. The predictive power of the WA models was reasonable (RMSEP 0.40–0.44 pH units) in the training set and consistently better than that of the tolerance limit method. In contrast to the latter, the WA models were able to infer a minimum pH above 5.5, suggesting they could detect the early stages of acidification.
4. The WA models performed better than the tolerance limit method in inferring pH from the independent literature, further demonstrating the superiority and generality of the WA approach.
5. The weighted averaging technique could be an effective and widely applicable tool for contemporary biological monitoring and assessment using aquatic invertebrates.  相似文献   

14.
1. Data on macroinvertebrates and stream chemistry were collected from sixty-four streams in Finland. Weighted averaging (WA) regression and calibration models were constructed to infer the minimum pH of streams from their invertebrate assemblages. The purpose was to develop an instrument for biological assessment and monitoring of stream acidification. The WA method was compared with simpler approaches, based on qualitative invertebrate data and pH tolerance limits, that are widely used.
2. Performance of the two approaches was assessed in terms of correlation between the inferred and observed minimum pH within the 'training set', and in terms of root mean squared differences (predicted – observed) (RMSEP) estimated by cross-validation or bootstrap resampling techniques. The models were further tested using independent data from the literature representative of a wide geographical range.
3. The predictive power of the WA models was reasonable (RMSEP 0.40–0.44 pH units) in the training set and consistently better than that of the tolerance limit method. In contrast to the latter, the WA models were able to infer a minimum pH above 5.5, suggesting they could detect the early stages of acidification.
4. The WA models performed better than the tolerance limit method in inferring pH from the independent literature, further demonstrating the superiority and generality of the WA approach.
5. The weighted averaging technique could be an effective and widely applicable tool for contemporary biological monitoring and assessment using aquatic invertebrates.  相似文献   

15.
This article is concerned with the Bayesian estimation of stochastic rate constants in the context of dynamic models of intracellular processes. The underlying discrete stochastic kinetic model is replaced by a diffusion approximation (or stochastic differential equation approach) where a white noise term models stochastic behavior and the model is identified using equispaced time course data. The estimation framework involves the introduction of m- 1 latent data points between every pair of observations. MCMC methods are then used to sample the posterior distribution of the latent process and the model parameters. The methodology is applied to the estimation of parameters in a prokaryotic autoregulatory gene network.  相似文献   

16.
Prevalence of disease in wildlife populations, which is necessary for developing disease models and conducting epidemiologic analyses, is often understudied. Laboratory tests used to screen for diseases in wildlife populations often are validated only for domestic animals. Consequently, the use of these tests for wildlife populations may lead to inaccurate estimates of disease prevalence. We demonstrate the use of Bayesian latent class analysis (LCA) in determining the specificity and sensitivity of a competitive enzyme‐linked immunosorbent assay (cELISA; VMRD®, Inc.) serologic test used to identify exposure to Neospora caninum (hereafter N. caninum) in three wildlife populations in southeastern Ohio, USA. True prevalence of N. caninum exposure in these populations was estimated to range from 0.1% to 3.1% in American bison (Bison bison), 51.0% to 53.8% in Père David's deer (Elaphurus davidianus), and 40.0% to 45.9% in white‐tailed deer (Odocoileus virginianus). The accuracy of the cELISA in American bison and Père David's deer was estimated to be close to the 96% sensitivity and 99% specificity reported by the manufacturer. Sensitivity in white‐tailed deer, however, ranged from 78.9% to 99.9%. Apparent prevalence of N. caninum from the test results is not equal to the true prevalence in white‐tailed deer and Père David's deer populations. Even when these species inhabit the same community, the true prevalence in the two deer populations differed from the true prevalence in the American bison population. Variances in prevalence for some species suggest differences in the epidemiology of N. caninum for these colocated populations. Bayesian LCA methods could be used as in this example to overcome some of the constraints on validating tests in wildlife species. The ability to accurately evaluate disease status and prevalence in a population improves our understanding of the epidemiology of multihost pathogen systems at the community level.  相似文献   

17.
Group testing, also known as pooled testing, and inverse sampling are both widely used methods of data collection when the goal is to estimate a small proportion. Taking a Bayesian approach, we consider the new problem of estimating disease prevalence from group testing when inverse (negative binomial) sampling is used. Using different distributions to incorporate prior knowledge of disease incidence and different loss functions, we derive closed form expressions for posterior distributions and resulting point and credible interval estimators. We then evaluate our new estimators, on Bayesian and classical grounds, and apply our methods to a West Nile Virus data set.  相似文献   

18.

Background

Information for mapping of quantitative trait loci (QTL) comes from two sources: linkage disequilibrium (non-random association of allele states) and cosegregation (non-random association of allele origin). Information from LD can be captured by modeling conditional means and variances at the QTL given marker information. Similarly, information from cosegregation can be captured by modeling conditional covariances. Here, we consider a Bayesian model based on gene frequency (BGF) where both conditional means and variances are modeled as a function of the conditional gene frequencies at the QTL. The parameters in this model include these gene frequencies, additive effect of the QTL, its location, and the residual variance. Bayesian methodology was used to estimate these parameters. The priors used were: logit-normal for gene frequencies, normal for the additive effect, uniform for location, and inverse chi-square for the residual variance. Computer simulation was used to compare the power to detect and accuracy to map QTL by this method with those from least squares analysis using a regression model (LSR).

Results

To simplify the analysis, data from unrelated individuals in a purebred population were simulated, where only LD information contributes to map the QTL. LD was simulated in a chromosomal segment of 1 cM with one QTL by random mating in a population of size 500 for 1000 generations and in a population of size 100 for 50 generations. The comparison was studied under a range of conditions, which included SNP density of 0.1, 0.05 or 0.02 cM, sample size of 500 or 1000, and phenotypic variance explained by QTL of 2 or 5%. Both 1 and 2-SNP models were considered. Power to detect the QTL for the BGF, ranged from 0.4 to 0.99, and close or equal to the power of the regression using least squares (LSR). Precision to map QTL position of BGF, quantified by the mean absolute error, ranged from 0.11 to 0.21 cM for BGF, and was better than the precision of LSR, which ranged from 0.12 to 0.25 cM.

Conclusions

In conclusion given a high SNP density, the gene frequency model can be used to map QTL with considerable accuracy even within a 1 cM region.  相似文献   

19.

Background  

Likelihood (ML)-based phylogenetic inference has become a popular method for estimating the evolutionary relationships among species based on genomic sequence data. This method is used in applications such as RAxML, GARLI, MrBayes, PAML, and PAUP. The Phylogenetic Likelihood Function (PLF) is an important kernel computation for this method. The PLF consists of a loop with no conditional behavior or dependencies between iterations. As such it contains a high potential for exploiting parallelism using micro-architectural techniques. In this paper, we describe a technique for mapping the PLF and supporting logic onto a Field Programmable Gate Array (FPGA)-based co-processor. By leveraging the FPGA's on-chip DSP modules and the high-bandwidth local memory attached to the FPGA, the resultant co-processor can accelerate ML-based methods and outperform state-of-the-art multi-core processors.  相似文献   

20.
Bayesian inference has emerged as a general framework that captures how organisms make decisions under uncertainty. Recent experimental findings reveal disparate mechanisms for how the brain generates behaviors predicted by normative Bayesian theories. Here, we identify two broad classes of neural implementations for Bayesian inference: a modular class, where each probabilistic component of Bayesian computation is independently encoded and a transform class, where uncertain measurements are converted to Bayesian estimates through latent processes. Many recent experimental neuroscience findings studying probabilistic inference broadly fall into these classes. We identify potential avenues for synthesis across these two classes and the disparities that, at present, cannot be reconciled. We conclude that to distinguish among implementation hypotheses for Bayesian inference, we require greater engagement among theoretical and experimental neuroscientists in an effort that spans different scales of analysis, circuits, tasks, and species.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号