
1.

A Bayesian hierarchical model for identifying epitopes in peptide microarray data

Arima S Lin J Pecora V Tardella L 《Biostatistics (Oxford, England)》2012,13(1):101-112

Peptide Microarray Immunoassay (PMI for brevity) is a novel technology that enables researchers to map a large number of proteomic measurements at a peptide level, providing information regarding the relationship between antibody response and clinical sensitivity. PMI studies aim at recognizing antigen-specific antibodies from serum samples and at detecting epitope regions of the protein antigen. PMI data present new challenges for statistical analysis mainly due to the structural dependence among peptides. A PMI is made of a complete library of consecutive peptides. They are synthesized by systematically shifting a window of a fixed number of amino acids through the finite sequence of amino acids of the antigen protein as ordered in the primary structure of the protein. This implies that consecutive peptides have a certain number of amino acids in common and hence are structurally dependent. We propose a new flexible Bayesian hierarchical model framework, which allows one to detect recognized peptides and bound epitope regions in a single framework, taking into account the structural dependence between peptides through a suitable latent Markov structure. The proposed model is illustrated using PMI data from a recent study about egg allergy. A simulation study shows that the proposed model is more powerful and robust in terms of epitope detection than simpler models overlooking some of the dependence structure.
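The sliding-window construction described in this abstract can be sketched in a few lines. This is only an illustration of how consecutive peptides come to share amino acids; the toy sequence, the window length of 6 and the shift of 2 are assumptions and are not taken from the study.

```python
def peptide_library(sequence, length, shift=1):
    """Return the overlapping peptides obtained by sliding a fixed-length window."""
    if length <= 0 or shift <= 0:
        raise ValueError("length and shift must be positive")
    last_start = len(sequence) - length
    # An empty list is returned when the sequence is shorter than the window.
    return [sequence[i:i + length] for i in range(0, last_start + 1, shift)]


def shared_residues(length, shift):
    """Number of amino acids that two consecutive peptides have in common."""
    return max(length - shift, 0)


# Hypothetical toy antigen sequence (not from the study).
antigen = "MKVLAAGIVGLCLA"
peptides = peptide_library(antigen, length=6, shift=2)
print(peptides)
print(shared_residues(6, 2))
```

Because each peptide shares `length - shift` residues with its neighbour, measurements on adjacent peptides cannot be treated as independent, which is the dependence the latent Markov structure is meant to capture.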

2.

A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data (cited by: 13; self-citations: 0; citations by others: 13)

Zou M Conzen SD 《Bioinformatics (Oxford, England)》2005,21(1):71-79


3.

Using hidden Markov models to analyze gene expression time course data

Schliep A Schönhuth A Steinhoff C 《Bioinformatics (Oxford, England)》2003,19(Z1):i255-i263

MOTIVATION: Cellular processes cause changes over time. Observing and measuring those changes over time allows insights into the how and why of regulation. The experimental platform for doing the appropriate large-scale experiments to obtain time-courses of expression levels is provided by microarray technology. However, the proper way of analyzing the resulting time course data is still very much an issue under investigation. The inherent time dependencies in the data suggest that clustering techniques which reflect those dependencies yield improved performance. RESULTS: We propose to use Hidden Markov Models (HMMs) to account for the horizontal dependencies along the time axis in time course data and to cope with the prevalent errors and missing values. The HMMs are used within a model-based clustering framework. We are given a number of clusters, each represented by one Hidden Markov Model from a finite collection encompassing typical qualitative behavior. Then, our method finds in an iterative procedure cluster models and an assignment of data points to these models that maximizes the joint likelihood of clustering and models. Partially supervised learning (adding groups of labeled data to the initial collection of clusters) is supported. A graphical user interface allows querying an expression profile dataset for time courses similar to a prototype graphically defined as a sequence of levels and durations. We also propose a heuristic approach to automate determination of the number of clusters. We evaluate the method on published yeast cell cycle and fibroblasts serum response datasets, and compare them, with favorable results, to the autoregressive curves method.
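As a rough illustration of the assignment step in HMM-based clustering, the sketch below scores one expression profile under each cluster's HMM with a scaled forward algorithm and picks the best-scoring cluster. It is not the authors' implementation: the Gaussian emissions, the two toy models, and the convention that `None` marks a missing value are all assumptions made for this example.

```python
import math


def gaussian_pdf(x, mean, sd):
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def hmm_log_likelihood(series, start, trans, means, sds):
    """Scaled forward algorithm; None marks a missing observation."""
    n_states = len(start)
    log_lik = 0.0
    alpha = None
    for t, x in enumerate(series):
        # A missing value carries no information: emission weight 1 for every state.
        emit = [1.0 if x is None else gaussian_pdf(x, means[k], sds[k])
                for k in range(n_states)]
        if t == 0:
            alpha = [start[k] * emit[k] for k in range(n_states)]
        else:
            alpha = [sum(alpha[j] * trans[j][k] for j in range(n_states)) * emit[k]
                     for k in range(n_states)]
        scale = sum(alpha)
        log_lik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return log_lik


def assign_to_cluster(series, models):
    """Index of the model under which the series is most likely."""
    scores = [hmm_log_likelihood(series, **m) for m in models]
    return max(range(len(models)), key=scores.__getitem__)


# Two hypothetical left-to-right cluster models: "up-regulated" and "down-regulated".
models = [
    dict(start=[1.0, 0.0], trans=[[0.5, 0.5], [0.0, 1.0]], means=[0.0, 2.0], sds=[0.5, 0.5]),
    dict(start=[1.0, 0.0], trans=[[0.5, 0.5], [0.0, 1.0]], means=[0.0, -2.0], sds=[0.5, 0.5]),
]
profile = [0.1, None, 1.8, 2.1]  # second time point is missing
print(assign_to_cluster(profile, models))
```

A full clustering procedure would alternate this assignment step with re-estimation of each cluster's model parameters; only the assignment half is shown here.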

4.

A Bayesian hidden Markov model for detecting differentially methylated regions

Tieming Ji 《Biometrics》2019,75(2):663-673


5.

Bayesian hierarchical modeling for time course microarray experiments

Chi YY Ibrahim JG Bissahoyo A Threadgill DW 《Biometrics》2007,63(2):496-504

Time course microarray experiments designed to characterize the dynamic regulation of gene expression in biological systems are becoming increasingly important. One critical issue that arises when examining time course microarray data is the identification of genes that show different temporal expression patterns among biological conditions. Here we propose a Bayesian hierarchical model to incorporate important experimental factors and to account for correlated gene expression measurements over time and over different genes. A new gene selection algorithm is also presented with the model to simultaneously identify genes that show changes in expression among biological conditions, in response to time and other experimental factors of interest. The algorithm performs well in terms of the false positive and false negative rates in simulation studies. The methodology is applied to a mouse model time course experiment to correlate temporal changes in azoxymethane-induced gene expression profiles with colorectal cancer susceptibility.

6.

Model-based methods for identifying periodically expressed genes based on time course microarray gene expression data (cited by: 4; self-citations: 0; citations by others: 4)

Luan Y Li H 《Bioinformatics (Oxford, England)》2004,20(3):332-339


7.

A hidden Markov movement model for rapidly identifying behavioral states from animal tracks


Kim Whoriskey Marie Auger‐Méthé Christoffer M. Albertsen Frederick G. Whoriskey Thomas R. Binder Charles C. Krueger Joanna Mills Flemming 《Ecology and evolution》2017,7(7):2112-2121

Electronic telemetry is frequently used to document animal movement through time. Methods that can identify underlying behaviors driving specific movement patterns can help us understand how and why animals use available space, thereby aiding conservation and management efforts. For aquatic animal tracking data with significant measurement error, a Bayesian state‐space model called the first‐Difference Correlated Random Walk with Switching (DCRWS) has often been used for this purpose. However, for aquatic animals, highly accurate tracking data are now becoming more common. We developed a new hidden Markov model (HMM) for identifying behavioral states from animal tracks with negligible error, called the hidden Markov movement model (HMMM). We implemented as the basis for the HMMM the process equation of the DCRWS, but we used the method of maximum likelihood and the R package TMB for rapid model fitting. The HMMM was compared to a modified version of the DCRWS for highly accurate tracks, the DCRWS, and to a common HMM for animal tracks fitted with the R package moveHMM. We show that the HMMM is both accurate and suitable for multiple species by fitting it to real tracks from a grey seal, lake trout, and blue shark, as well as to simulated data. The HMMM is a fast and reliable tool for making meaningful inference from animal movement data that is ideally suited for ecologists who want to use the popular DCRWS implementation and have highly accurate tracking data. It additionally provides a groundwork for development of more complex modeling of animal movement with TMB. To facilitate its uptake, we make it available through the R package swim.

8.

A fully Bayesian hidden Ising model for ChIP-seq data analysis

Mo Q 《Biostatistics (Oxford, England)》2012,13(1):113-128


9.

A continuous-index Bayesian hidden Markov model for prediction of nucleosome positioning in genomic DNA

Mitra R Gupta M 《Biostatistics (Oxford, England)》2011,12(3):462-477


10.

A hidden markov model for identifying differentially methylated sites in bisulfite sequencing data

Farhad Shokoohi David A. Stephens Guillaume Bourque Tomi Pastinen Celia M. T. Greenwood Aurélie Labbe 《Biometrics》2019,75(1):210-221


11.

A mutagenetic tree hidden Markov model for longitudinal clonal HIV sequence data (cited by: 1; self-citations: 0; citations by others: 1)

Beerenwinkel N Drton M 《Biostatistics (Oxford, England)》2007,8(1):53-71


12.

A two-sample Bayesian t-test for microarray data

Richard J Fox Matthew W Dimmic 《BMC bioinformatics》2006,7(1):126-11

Background

Determining whether a gene is differentially expressed in two different samples remains an important statistical problem. Prior work in this area has featured the use of t-tests with pooled estimates of the sample variance based on similarly expressed genes. These methods do not display consistent behavior across the entire range of pooling and can be biased when the prior hyperparameters are specified heuristically.
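To make the kind of prior approach mentioned in this background concrete, the sketch below computes a t-statistic whose variance blends a gene's own sample variance with a background variance pooled from similarly expressed genes. It illustrates the heuristic style of method the abstract refers to, not the authors' Bayesian t-test. The function names, the weighting formula and the hand-picked `prior_weight` are assumptions made for this example.

```python
import math
from statistics import mean, variance


def regularized_variance(sample, background_var, prior_weight):
    """Weighted blend of a gene's own sample variance and a background variance.

    prior_weight behaves like a number of pseudo-observations; it is a
    hypothetical tuning parameter chosen by hand in this sketch.
    """
    n = len(sample)
    own = (n - 1) * variance(sample)
    return (prior_weight * background_var + own) / (prior_weight + n - 1)


def regularized_t(sample_a, sample_b, background_var, prior_weight):
    """Two-sample t-like statistic using the blended variances."""
    var_a = regularized_variance(sample_a, background_var, prior_weight)
    var_b = regularized_variance(sample_b, background_var, prior_weight)
    std_err = math.sqrt(var_a / len(sample_a) + var_b / len(sample_b))
    return (mean(sample_a) - mean(sample_b)) / std_err


# Hypothetical log-expression values for one gene in two conditions.
control = [1.0, 2.0, 3.0]
treated = [4.0, 5.0, 6.0]
print(regularized_t(treated, control, background_var=4.0, prior_weight=2))
```

With `prior_weight=0` the statistic reduces to an unpooled two-sample t-statistic; larger weights pull the variance toward the background value, which is where a heuristically chosen hyperparameter can introduce the bias the abstract mentions.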

13.

Bayesian hierarchical model for identifying changes in gene expression from microarray experiments.

Philippe Broët Sylvia Richardson François Radvanyi 《Journal of computational biology》2002,9(4):671-683

Recent developments in microarray technology enable researchers to study simultaneously the expression of thousands of genes from one cell line or tissue sample. This new technology is often used to assess changes in mRNA expression upon a specified transfection for a cell line in order to identify target genes. For such experiments, the range of differential expression is moderate, and teasing out the modified genes is challenging and calls for detailed modeling. The aim of this paper is to propose a methodological framework for studies that investigate differential gene expression through microarray technology that is based on a fully Bayesian mixture approach (Richardson and Green, 1997). A case study that investigated those genes that were differentially expressed in two cell lines (normal and modified by a gene transfection) is provided to illustrate the performance and usefulness of this approach.

14.

A hidden Markov model for continuous longitudinal data with missing responses and dropout

Silvia Pandolfi Francesco Bartolucci Fulvia Pennoni 《Biometrical journal. Biometrische Zeitschrift》2023,65(5):2200016

We propose a hidden Markov model for multivariate continuous longitudinal responses with covariates that accounts for three different types of missing pattern: (I) partially missing outcomes at a given time occasion, (II) completely missing outcomes at a given time occasion (intermittent pattern), and (III) dropout before the end of the period of observation (monotone pattern). The missing-at-random (MAR) assumption is formulated to deal with the first two types of missingness, while to account for the informative dropout, we rely on an extra absorbing state. Estimation of the model parameters is based on the maximum likelihood method that is implemented by an expectation-maximization (EM) algorithm relying on suitable recursions. The proposal is illustrated by a Monte Carlo simulation study and an application based on historical data on primary biliary cholangitis.

15.

A Bayesian method for analysing spotted microarray data

Meiklejohn CD Townsend JP 《Briefings in bioinformatics》2005,6(4):318-330

In the decade since their invention, spotted microarrays have been undergoing technical advances that have increased the utility, scope and precision of their ability to measure gene expression. At the same time, more researchers are taking advantage of the fundamentally quantitative nature of these tools with refined experimental designs and sophisticated statistical analyses. These new approaches utilise the power of microarrays to estimate differences in gene expression levels, rather than just categorising genes as up- or down-regulated, and allow the comparison of expression data across multiple samples. In this review, some of the technical aspects of spotted microarrays that can affect statistical inference are highlighted, and a discussion is provided of how several methods for estimating gene expression level across multiple samples deal with these challenges. The focus is on a Bayesian analysis method, BAGEL, which is easy to implement and produces easily interpreted results.

16.

A hidden Markov model for predicting protein interfaces

Nguyen C Gardiner KJ Cios KJ 《Journal of bioinformatics and computational biology》2007,5(3):739-753

Protein-protein interactions play a defining role in protein function. Identifying the sites of interaction in a protein is a critical problem for understanding its functional mechanisms, as well as for drug design. To predict sites within a protein chain that participate in protein complexes, we have developed a novel method based on the Hidden Markov Model, which combines several biological characteristics of the sequences neighboring a target residue: structural information, accessible surface area, and transition probability among amino acids. We have evaluated the method using 5-fold cross-validation on 139 unique proteins and demonstrated precision of 66% and recall of 61% in identifying interfaces. These results are better than those achieved by other methods used for identification of interfaces.
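For readers unfamiliar with the two reported metrics, the sketch below shows how precision and recall are computed for a set of predicted interface residues. The residue positions are made-up toy values, not data from the paper.

```python
def precision_recall(predicted, actual):
    """Precision and recall of predicted interface residues against known ones."""
    predicted, actual = set(predicted), set(actual)
    true_positives = len(predicted & actual)
    # Precision: fraction of predicted residues that are real interface residues.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: fraction of real interface residues that were predicted.
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Hypothetical residue positions in one protein chain.
predicted_sites = [3, 4, 5, 9]
known_sites = [4, 5, 6]
precision, recall = precision_recall(predicted_sites, known_sites)
print(precision, recall)
```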

17.

Infinite hidden Markov models for multiple multivariate time series with missing data

Lauren Hoskovec Matthew D. Koslovsky Kirsten Koehler Nicholas Good Jennifer L. Peel John Volckens Ander Wilson 《Biometrics》2023,79(3):2592-2604

Exposure to air pollution is associated with increased morbidity and mortality. Recent technological advancements permit the collection of time-resolved personal exposure data. Such data are often incomplete with missing observations and exposures below the limit of detection, which limit their use in health effects studies. In this paper, we develop an infinite hidden Markov model for multiple asynchronous multivariate time series with missing data. Our model is designed to include covariates that can inform transitions among hidden states. We implement beam sampling, a combination of slice sampling and dynamic programming, to sample the hidden states, and a Bayesian multiple imputation algorithm to impute missing data. In simulation studies, our model excels in estimating hidden states and state-specific means and imputing observations that are missing at random or below the limit of detection. We validate our imputation approach on data from the Fort Collins Commuter Study. We show that the estimated hidden states improve imputations for data that are missing at random compared to existing approaches. In a case study of the Fort Collins Commuter Study, we describe the inferential gains obtained from our model including improved imputation of missing data and the ability to identify shared patterns in activity and exposure among repeated sampling days for individuals and among distinct individuals.

18.

A hidden Markov model for progressive multiple alignment (cited by: 4; self-citations: 0; citations by others: 4)

Löytynoja A Milinkovitch MC 《Bioinformatics (Oxford, England)》2003,19(12):1505-1513

MOTIVATION: Progressive algorithms are widely used heuristics for the production of alignments among multiple nucleic-acid or protein sequences. Probabilistic approaches providing measures of global and/or local reliability of individual solutions would constitute valuable developments. RESULTS: We present here a new method for multiple sequence alignment that combines an HMM approach, a progressive alignment algorithm, and a probabilistic evolution model describing the character substitution process. Our method works by iterating pairwise alignments according to a guide tree and defining each ancestral sequence from the pairwise alignment of its child nodes, thus progressively constructing a multiple alignment. Our method allows for the computation of each column's minimum posterior probability, and we show that this value correlates with the correctness of the result, hence providing an efficient means by which unreliably aligned columns can be filtered out from a multiple alignment.
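The filtering idea at the end of this abstract can be sketched as follows. The example assumes that each alignment column already comes with a list of posterior probabilities (for instance one per pairwise alignment step that contributed to it); that data layout, the function name and the 0.8 threshold are assumptions for illustration and are not described in the abstract.

```python
def reliable_columns(column_posteriors, threshold):
    """Indices of columns whose minimum posterior probability reaches the threshold."""
    kept = []
    for index, posteriors in enumerate(column_posteriors):
        # A column is only as reliable as its weakest pairwise step.
        if min(posteriors) >= threshold:
            kept.append(index)
    return kept


# Hypothetical posterior probabilities for three alignment columns.
posteriors = [
    [0.99, 0.95],
    [0.40, 0.90],  # one weak step makes this column unreliable
    [0.85, 0.88],
]
print(reliable_columns(posteriors, threshold=0.8))
```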

19.

Group SCAD regression analysis for microarray time course gene expression data (cited by: 1; self-citations: 0; citations by others: 1)

Wang L Chen G Li H 《Bioinformatics (Oxford, England)》2007,23(12):1486-1494


20.

A hidden Markov model for analysis of frontline veterinary data for emerging zoonotic disease surveillance

Robertson C Sawford K Gunawardana WS Nelson TA Nathoo F Stephen C 《PloS one》2011,6(9):e24833

Surveillance systems tracking health patterns in animals have potential for early warning of infectious disease in humans, yet there are many challenges that remain before this can be realized. Specifically, there remains the challenge of detecting early warning signals for diseases that are not known or are not part of routine surveillance for named diseases. This paper reports on the development of a hidden Markov model for analysis of frontline veterinary sentinel surveillance data from Sri Lanka. Field veterinarians collected data on syndromes and diagnoses using mobile phones. A model for submission patterns accounts for both sentinel-related and disease-related variability. Models for commonly reported cattle diagnoses were estimated separately. Region-specific weekly average prevalence was estimated for each diagnosis and partitioned into normal and abnormal periods. Visualization of state probabilities was used to indicate areas and times of unusual disease prevalence. The analysis suggests that hidden Markov modelling is a useful approach for surveillance datasets from novel populations and/or having few historical baselines.
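The state probabilities mentioned in this abstract can be illustrated with a two-state HMM and the forward-backward algorithm. The sketch below is a generic textbook version, not the model from the paper: the Poisson emissions on weekly case counts, the rates, the transition matrix and the toy counts are all assumptions.

```python
import math


def poisson_pmf(k, rate):
    return math.exp(-rate) * rate ** k / math.factorial(k)


def posterior_state_probs(counts, start, trans, rates):
    """Posterior probability of each hidden state at each week (forward-backward)."""
    n, m = len(counts), len(start)
    emit = [[poisson_pmf(c, rates[s]) for s in range(m)] for c in counts]
    # Forward pass, normalised at every step to avoid underflow.
    fwd, prev = [], None
    for t in range(n):
        if t == 0:
            row = [start[s] * emit[0][s] for s in range(m)]
        else:
            row = [sum(prev[r] * trans[r][s] for r in range(m)) * emit[t][s]
                   for s in range(m)]
        total = sum(row)
        prev = [v / total for v in row]
        fwd.append(prev)
    # Backward pass, also normalised at every step.
    bwd = [[1.0] * m for _ in range(n)]
    for t in range(n - 2, -1, -1):
        row = [sum(trans[s][r] * emit[t + 1][r] * bwd[t + 1][r] for r in range(m))
               for s in range(m)]
        total = sum(row)
        bwd[t] = [v / total for v in row]
    # Combine and renormalise to get per-week state probabilities.
    post = []
    for t in range(n):
        row = [fwd[t][s] * bwd[t][s] for s in range(m)]
        total = sum(row)
        post.append([v / total for v in row])
    return post


# Hypothetical weekly case counts; state 0 = normal, state 1 = abnormal.
weekly_counts = [2, 3, 1, 9, 11, 10, 2]
post = posterior_state_probs(
    weekly_counts,
    start=[0.9, 0.1],
    trans=[[0.9, 0.1], [0.2, 0.8]],
    rates=[2.0, 10.0],
)
abnormal_weeks = [t for t, p in enumerate(post) if p[1] > 0.5]
print(abnormal_weeks)
```

Plotting the second column of `post` over time, one curve per region, would give the kind of state-probability visualization the abstract describes.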