Context-specific infinite mixtures for clustering gene expression profiles across diverse microarray dataset |
| |
Authors: | Liu X Sivaganesan S Yeung K Y Guo J Bumgarner R E Medvedovic Mario |
| |
Institution: | Department of Environmental Health, University of Cincinnati, 3223 Eden Avenue ML 56, Cincinnati, OH 45267, USA. |
| |
Abstract: | MOTIVATION: Identifying groups of co-regulated genes by monitoring their expression over various experimental conditions is complicated by the fact that such co-regulation is condition-specific. Ignoring the context-specific nature of co-regulation significantly reduces the ability of clustering procedures to detect co-expressed genes due to additional 'noise' introduced by non-informative measurements. RESULTS: We have developed a novel Bayesian hierarchical model and corresponding computational algorithms for clustering gene expression profiles across diverse experimental conditions and studies that accounts for context-specificity of gene expression patterns. The model is based on the Bayesian infinite mixtures framework and does not require a priori specification of the number of clusters. We demonstrate that explicit modeling of context-specificity results in increased accuracy of the cluster analysis by examining the specificity and sensitivity of clusters in microarray data. We also demonstrate that probabilities of co-expression derived from the posterior distribution of clusterings are valid estimates of statistical significance of created clusters. AVAILABILITY: The open-source package gimm is available at http://eh3.uc.edu/gimm. |
| |
Keywords: | |
本文献已被 PubMed Oxford 等数据库收录! |