期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Robust Bayesian mapping of quantitative trait loci using Student-t distribution for residual

Xin Wang Zhongze Piao Biye Wang Runqing Yang Zhixiang Luo 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2009,118(3):609-617

In most quantitative trait loci (QTL) mapping studies, phenotypes are assumed to follow normal distributions. Deviations from this assumption may affect the accuracy of QTL detection, leading to detection of false positive QTL. To improve the robustness of QTL mapping methods, we replace the normal distribution assumption for residuals in a multiple QTL model with a Student-t distribution that is able to accommodate residual outliers. A Robust Bayesian mapping strategy is proposed on the basis of the Bayesian shrinkage analysis for QTL effects. The simulations show that Robust Bayesian mapping approach can substantially increase the power of QTL detection when the normality assumption does not hold and applying it to data already normally distributed does not influence the result. The proposed QTL mapping method is applied to mapping QTL for the traits associated with physics–chemical characters and quality in rice. Similarly to the simulation study in the real data case the robust approach was able to detect additional QTLs when compared to the traditional approach. The program to implement the method is available on request from the first or the corresponding author. Xin Wang and Zhongze Piao contributed equally to this study. 相似文献

2.

Bayesian monitoring of event rates with censored data

Follmann DA Albert PS 《Biometrics》1999,55(2):603-607

A Bayesian approach to monitoring event rates with censored data is proposed. A Dirichlet prior for discrete time event probabilities is blended with discrete survival times to provide a posterior distribution that is a mixture of Dirichlets. Approximation of the posterior distribution via data augmentation is discussed. Practical issues involved in implementing the procedure are discussed and illustrated with a simulation of the single arm Cord Blood Transplantation Study where 6-month survival is monitored. 相似文献

3.

Bayesian Inference for Generalized Linear Mixed Model Based on the Multivariate t Distribution in Population Pharmacokinetic Study

Fang-Rong Yan Yuan Huang Jun-Lin Liu Tao Lu Jin-Guan Lin 《PloS one》2013,8(3)

This article provides a fully Bayesian approach for modeling of single-dose and complete pharmacokinetic data in a population pharmacokinetic (PK) model. To overcome the impact of outliers and the difficulty of computation, a generalized linear model is chosen with the hypothesis that the errors follow a multivariate Student t distribution which is a heavy-tailed distribution. The aim of this study is to investigate and implement the performance of the multivariate t distribution to analyze population pharmacokinetic data. Bayesian predictive inferences and the Metropolis-Hastings algorithm schemes are used to process the intractable posterior integration. The precision and accuracy of the proposed model are illustrated by the simulating data and a real example of theophylline data. 相似文献

4.

Bayesian network and nonparametric heteroscedastic regression for nonlinear modeling of genetic network

Imoto S Kim S Goto T Miyano S Aburatani S Tashiro K Kuhara S 《Journal of bioinformatics and computational biology》2003,1(2):231-252

We propose a new statistical method for constructing a genetic network from microarray gene expression data by using a Bayesian network. An essential point of Bayesian network construction is the estimation of the conditional distribution of each random variable. We consider fitting nonparametric regression models with heterogeneous error variances to the microarray gene expression data to capture the nonlinear structures between genes. Selecting the optimal graph, which gives the best representation of the system among genes, is still a problem to be solved. We theoretically derive a new graph selection criterion from Bayes approach in general situations. The proposed method includes previous methods based on Bayesian networks. We demonstrate the effectiveness of the proposed method through the analysis of Saccharomyces cerevisiae gene expression data newly obtained by disrupting 100 genes. 相似文献

5.

Bayesian Hierarchical Functional Data Analysis Via Contaminated Informative Priors

Bruno Scarpa David B. Dunson 《Biometrics》2009,65(3):772-780

Summary . A variety of flexible approaches have been proposed for functional data analysis, allowing both the mean curve and the distribution about the mean to be unknown. Such methods are most useful when there is limited prior information. Motivated by applications to modeling of temperature curves in the menstrual cycle, this article proposes a flexible approach for incorporating prior information in semiparametric Bayesian analyses of hierarchical functional data. The proposed approach is based on specifying the distribution of functions as a mixture of a parametric hierarchical model and a nonparametric contamination. The parametric component is chosen based on prior knowledge, while the contamination is characterized as a functional Dirichlet process. In the motivating application, the contamination component allows unanticipated curve shapes in unhealthy menstrual cycles. Methods are developed for posterior computation, and the approach is applied to data from a European fecundability study. 相似文献

6.

Bayesian estimation of NMR restraint potential and weight: a validation on a representative set of protein structures

Bernard A Vranken WF Bardiaux B Nilges M Malliavin TE 《Proteins》2011,79(5):1525-1537

The classical procedure for nuclear magnetic resonance structure calculation allocates empirical distance ranges and uses historical values for weighting factors. However, Bayesian analysis suggests that there are more optimal choices for potential shape (bounds-free log-harmonic shape) and restraints weights. We compare the classical protocol with the Bayesian approach for more than 300 protein structures. We analyze the conformation similarity to the corresponding X-ray crystal structure, the distribution of the conformations around their average, and independent validation criteria. On average, the log-harmonic potential reduces the difference to the X-ray crystal structure. If the log-harmonic potential is used, the constant weighting tightens the distribution around the average conformation, with respect to the distributions obtained with Bayesian weighting. Conversely, the structure quality is improved by the Bayesian weighting over the classical procedure, whereas constant weighting worsens some criteria. The quality improvement obtained with the log-harmonic potential coupled to Bayesian weighting validates this approach on a representative set of protein structures. 相似文献

7.

Hierarchical Bayesian Estimation for the Number of Species

Josemar Rodrigues Luis A. Milan Jos G. Leite 《Biometrical journal. Biometrische Zeitschrift》2001,43(6):737-746

This paper is concerned with the estimation of the number of species in a population through a fully hierarchical Bayesian model using the Metropolis algorithm. The proposed Bayesian estimator is based on Poisson random variables with means that are distributed according to some prior distributions with unknown hyperparameters. An empirical Bayes approach is considered and compared with the fully Bayesian approach based on biological data. 相似文献

8.

Adaptive Bayesian sum of trees model for covariate-dependent spectral analysis

Yakun Wang Zeda Li Scott A. Bruce 《Biometrics》2023,79(3):1826-1839

This paper introduces a flexible and adaptive nonparametric method for estimating the association between multiple covariates and power spectra of multiple time series. The proposed approach uses a Bayesian sum of trees model to capture complex dependencies and interactions between covariates and the power spectrum, which are often observed in studies of biomedical time series. Local power spectra corresponding to terminal nodes within trees are estimated nonparametrically using Bayesian penalized linear splines. The trees are considered to be random and fit using a Bayesian backfitting Markov chain Monte Carlo (MCMC) algorithm that sequentially considers tree modifications via reversible-jump MCMC techniques. For high-dimensional covariates, a sparsity-inducing Dirichlet hyperprior on tree splitting proportions is considered, which provides sparse estimation of covariate effects and efficient variable selection. By averaging over the posterior distribution of trees, the proposed method can recover both smooth and abrupt changes in the power spectrum across multiple covariates. Empirical performance is evaluated via simulations to demonstrate the proposed method's ability to accurately recover complex relationships and interactions. The proposed methodology is used to study gait maturation in young children by evaluating age-related changes in power spectra of stride interval time series in the presence of other covariates. 相似文献

9.

A flexible cure rate model for spatially correlated survival data based on generalized extreme value distribution and Gaussian process priors

下载免费PDF全文

Dan Li Xia Wang Dipak K. Dey 《Biometrical journal. Biometrische Zeitschrift》2016,58(5):1178-1197

Our present work proposes a new survival model in a Bayesian context to analyze right‐censored survival data for populations with a surviving fraction, assuming that the log failure time follows a generalized extreme value distribution. Many applications require a more flexible modeling of covariate information than a simple linear or parametric form for all covariate effects. It is also necessary to include the spatial variation in the model, since it is sometimes unexplained by the covariates considered in the analysis. Therefore, the nonlinear covariate effects and the spatial effects are incorporated into the systematic component of our model. Gaussian processes (GPs) provide a natural framework for modeling potentially nonlinear relationship and have recently become extremely powerful in nonlinear regression. Our proposed model adopts a semiparametric Bayesian approach by imposing a GP prior on the nonlinear structure of continuous covariate. With the consideration of data availability and computational complexity, the conditionally autoregressive distribution is placed on the region‐specific frailties to handle spatial correlation. The flexibility and gains of our proposed model are illustrated through analyses of simulated data examples as well as a dataset involving a colon cancer clinical trial from the state of Iowa. 相似文献

10.

Bayesian Inference in Semiparametric Mixed Models for Longitudinal Data 总被引：1，自引：0，他引：1

Yisheng Li Xihong Lin Peter Müller 《Biometrics》2010,66(1):70-78

Summary . We consider Bayesian inference in semiparametric mixed models (SPMMs) for longitudinal data. SPMMs are a class of models that use a nonparametric function to model a time effect, a parametric function to model other covariate effects, and parametric or nonparametric random effects to account for the within-subject correlation. We model the nonparametric function using a Bayesian formulation of a cubic smoothing spline, and the random effect distribution using a normal distribution and alternatively a nonparametric Dirichlet process (DP) prior. When the random effect distribution is assumed to be normal, we propose a uniform shrinkage prior (USP) for the variance components and the smoothing parameter. When the random effect distribution is modeled nonparametrically, we use a DP prior with a normal base measure and propose a USP for the hyperparameters of the DP base measure. We argue that the commonly assumed DP prior implies a nonzero mean of the random effect distribution, even when a base measure with mean zero is specified. This implies weak identifiability for the fixed effects, and can therefore lead to biased estimators and poor inference for the regression coefficients and the spline estimator of the nonparametric function. We propose an adjustment using a postprocessing technique. We show that under mild conditions the posterior is proper under the proposed USP, a flat prior for the fixed effect parameters, and an improper prior for the residual variance. We illustrate the proposed approach using a longitudinal hormone dataset, and carry out extensive simulation studies to compare its finite sample performance with existing methods. 相似文献

11.

Image resolution enhancement using statistical estimation in wavelet domain

Jinglun Shi Zhilong Shan 《Biomedical signal processing and control》2012,7(6):571-578

The goal of medical image resolution enhancement is to reconstruct a higher-resolution image from its lower-resolution counterpart. This paper proposes a Bayesian approach in the wavelet domain by exploiting a Bayesian inference framework to mathematically formulate the image interpolation problem. Furthermore, the proposed approach jointly estimates both the unknown wavelet coefficients of the high-resolution image and the unknown parameters of the statistical model for wavelet coefficients. Experiments are conducted to demonstrate the superior performance of the proposed approach. 相似文献

12.

Allocation Variable-Based Probabilistic Algorithm to Deal with Label Switching Problem in Bayesian Mixture Models

Jia-Chiun Pan Chih-Min Liu Hai-Gwo Hwu Guan-Hua Huang 《PloS one》2015,10(10)

The label switching problem occurs as a result of the nonidentifiability of posterior distribution over various permutations of component labels when using Bayesian approach to estimate parameters in mixture models. In the cases where the number of components is fixed and known, we propose a relabelling algorithm, an allocation variable-based (denoted by AVP) probabilistic relabelling approach, to deal with label switching problem. We establish a model for the posterior distribution of allocation variables with label switching phenomenon. The AVP algorithm stochastically relabel the posterior samples according to the posterior probabilities of the established model. Some existing deterministic and other probabilistic algorithms are compared with AVP algorithm in simulation studies, and the success of the proposed approach is demonstrated in simulation studies and a real dataset. 相似文献

13.

Bayesian influence measures for joint models for longitudinal and survival data

Zhu H Ibrahim JG Chi YY Tang N 《Biometrics》2012,68(3):954-964

Summary This article develops a variety of influence measures for carrying out perturbation (or sensitivity) analysis to joint models of longitudinal and survival data (JMLS) in Bayesian analysis. A perturbation model is introduced to characterize individual and global perturbations to the three components of a Bayesian model, including the data points, the prior distribution, and the sampling distribution. Local influence measures are proposed to quantify the degree of these perturbations to the JMLS. The proposed methods allow the detection of outliers or influential observations and the assessment of the sensitivity of inferences to various unverifiable assumptions on the Bayesian analysis of JMLS. Simulation studies and a real data set are used to highlight the broad spectrum of applications for our Bayesian influence methods. 相似文献

14.

Identifying differentially expressed genes in meta-analysis via Bayesian model-based clustering

Jung YY Oh MS Shin DW Kang SH Oh HS 《Biometrical journal. Biometrische Zeitschrift》2006,48(3):435-450

A Bayesian model-based clustering approach is proposed for identifying differentially expressed genes in meta-analysis. A Bayesian hierarchical model is used as a scientific tool for combining information from different studies, and a mixture prior is used to separate differentially expressed genes from non-differentially expressed genes. Posterior estimation of the parameters and missing observations are done by using a simple Markov chain Monte Carlo method. From the estimated mixture model, useful measure of significance of a test such as the Bayesian false discovery rate (FDR), the local FDR (Efron et al., 2001), and the integration-driven discovery rate (IDR; Choi et al., 2003) can be easily computed. The model-based approach is also compared with commonly used permutation methods, and it is shown that the model-based approach is superior to the permutation methods when there are excessive under-expressed genes compared to over-expressed genes or vice versa. The proposed method is applied to four publicly available prostate cancer gene expression data sets and simulated data sets. 相似文献

15.

Bayesian back-calculation and nowcasting for line list data during the COVID-19 pandemic

Tenglong Li Laura F. White 《PLoS computational biology》2021,17(7)

Surveillance is critical to mounting an appropriate and effective response to pandemics. However, aggregated case report data suffers from reporting delays and can lead to misleading inferences. Different from aggregated case report data, line list data is a table contains individual features such as dates of symptom onset and reporting for each reported case and a good source for modeling delays. Current methods for modeling reporting delays are not particularly appropriate for line list data, which typically has missing symptom onset dates that are non-ignorable for modeling reporting delays. In this paper, we develop a Bayesian approach that dynamically integrates imputation and estimation for line list data. Specifically, this Bayesian approach can accurately estimate the epidemic curve and instantaneous reproduction numbers, even with most symptom onset dates missing. The Bayesian approach is also robust to deviations from model assumptions, such as changes in the reporting delay distribution or incorrect specification of the maximum reporting delay. We apply the Bayesian approach to COVID-19 line list data in Massachusetts and find the reproduction number estimates correspond more closely to the control measures than the estimates based on the reported curve. 相似文献

16.

Comparison of Bayesian and maximum-likelihood inference of population genetic parameters 总被引：9，自引：0，他引：9

Beerli P 《Bioinformatics (Oxford, England)》2006,22(3):341-345

Comparison of the performance and accuracy of different inference methods, such as maximum likelihood (ML) and Bayesian inference, is difficult because the inference methods are implemented in different programs, often written by different authors. Both methods were implemented in the program MIGRATE, that estimates population genetic parameters, such as population sizes and migration rates, using coalescence theory. Both inference methods use the same Markov chain Monte Carlo algorithm and differ from each other in only two aspects: parameter proposal distribution and maximization of the likelihood function. Using simulated datasets, the Bayesian method generally fares better than the ML approach in accuracy and coverage, although for some values the two approaches are equal in performance. MOTIVATION: The Markov chain Monte Carlo-based ML framework can fail on sparse data and can deliver non-conservative support intervals. A Bayesian framework with appropriate prior distribution is able to remedy some of these problems. RESULTS: The program MIGRATE was extended to allow not only for ML(-) maximum likelihood estimation of population genetics parameters but also for using a Bayesian framework. Comparisons between the Bayesian approach and the ML approach are facilitated because both modes estimate the same parameters under the same population model and assumptions. 相似文献

17.

Bayesian analysis for the meiosis I non-disjunction fraction in numerical chromosomal anomalies

Loschi RH Franco GC 《Biometrical journal. Biometrische Zeitschrift》2006,48(2):220-232

The main causes of numerical chromosomal anomalies, including trisomies, arise from an error in the chromosomal segregation during the meiotic process, named a non-disjunction. One of the most used techniques to analyze chromosomal anomalies nowadays is the polymerase chain reaction (PCR), which counts the number of peaks or alleles in a polymorphic microsatellite locus. It was shown in previous works that the number of peaks has a multinomial distribution whose probabilities depend on the non-disjunction fraction F. In this work, we propose a Bayesian approach for estimating the meiosis I non-disjunction fraction F. in the absence of the parental information. Since samples of trisomic patients are, in general, small, the Bayesian approach can be a good alternative for solving this problem. We consider the sampling/importance resampling technique and the Simpson rule to extract information from the posterior distribution of F. Bayes and maximum likelihood estimators are compared through a Monte Carlo simulation, focusing on the influence of different sample sizes and prior specifications in the estimates. We apply the proposed method to estimate F. for patients with trisomy of chromosome 21 providing a sensitivity analysis for the method. The results obtained show that Bayes estimators are better in almost all situations. 相似文献

18.

Fixed and random effects selection in linear and logistic models

Kinney SK Dunson DB 《Biometrics》2007,63(3):690-698

We address the problem of selecting which variables should be included in the fixed and random components of logistic mixed effects models for correlated data. A fully Bayesian variable selection is implemented using a stochastic search Gibbs sampler to estimate the exact model-averaged posterior distribution. This approach automatically identifies subsets of predictors having nonzero fixed effect coefficients or nonzero random effects variance, while allowing uncertainty in the model selection process. Default priors are proposed for the variance components and an efficient parameter expansion Gibbs sampler is developed for posterior computation. The approach is illustrated using simulated data and an epidemiologic example. 相似文献

19.

A closer look at the distribution of number needed to treat (NNT): a Bayesian approach

Thabane L 《Biostatistics (Oxford, England)》2003,4(3):365-370

In this paper, I present a Bayesian approach to estimation of the number needed to treat (NNT). The use of NNT as a measure of clinical benefit is now becoming commonplace. Various methods of estimation have been proposed, but none of them seem to provide entirely good estimates. Very little has been done to understand the statistical properties of NNT. Here, I derive the posterior distribution of NNT and use simulations to investigate the general behaviour of the distribution. The posterior mode of the distribution is proposed as a point estimate and results are compared with the conventional method of estimation of NNT done by inversion. 相似文献

20.

Bayesian semiparametric copula estimation with application to psychiatric genetics

下载免费PDF全文

Ori Rosen Wesley K. Thompson 《Biometrical journal. Biometrische Zeitschrift》2015,57(3):468-484

This paper proposes a semiparametric methodology for modeling multivariate and conditional distributions. We first build a multivariate distribution whose dependence structure is induced by a Gaussian copula and whose marginal distributions are estimated nonparametrically via mixtures of B‐spline densities. The conditional distribution of a given variable is obtained in closed form from this multivariate distribution. We take a Bayesian approach, using Markov chain Monte Carlo methods for inference. We study the frequentist properties of the proposed methodology via simulation and apply the method to estimation of conditional densities of summary statistics, used for computing conditional local false discovery rates, from genetic association studies of schizophrenia and cardiovascular disease risk factors. 相似文献