Statistical analysis of microarray data: a Bayesian approach |
| |
Authors: | Gottardo Raphael Pannucci James A Kuske Cheryl R Brettin Thomas |
| |
Affiliation: | University of Washington, Department of Statistics, Box 354322, Seattle, WA 98195-4322, USA. raph@stat.washington.edu |
| |
Abstract: | The potential of microarray data is enormous. It allows us to monitor the expression of thousands of genes simultaneously. A common task with microarray is to determine which genes are differentially expressed between two samples obtained under two different conditions. Recently, several statistical methods have been proposed to perform such a task when there are replicate samples under each condition. Two major problems arise with microarray data. The first one is that the number of replicates is very small (usually 2-10), leading to noisy point estimates. As a consequence, traditional statistics that are based on the means and standard deviations, e.g. t-statistic, are not suitable. The second problem is that the number of genes is usually very large (approximately 10,000), and one is faced with an extreme multiple testing problem. Most multiple testing adjustments are relatively conservative, especially when the number of replicates is small. In this paper we present an empirical Bayes analysis that handles both problems very well. Using different parametrizations, we develop four statistics that can be used to test hypotheses about the means and/or variances of the gene expression levels in both one- and two-sample problems. The methods are illustrated using experimental data with prior knowledge. In addition, we present the result of a simulation comparing our methods to well-known statistics and multiple testing adjustments. |
| |
Keywords: | Differential gene expression Hierarchical Bayes Multiple testing Posterior probabilities Replicated cDNA microarrays |
本文献已被 PubMed Oxford 等数据库收录! |
|