期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A discussion of statistical methods for design and analysis of microarray experiments for plant scientists 总被引：3，自引：0，他引：3

下载免费PDF全文

Nettleton D 《The Plant cell》2006,18(9):2112-2121

相似文献

2.

基因芯片筛选差异表达基因方法比较 总被引：1，自引：0，他引：1

单文娟童春发施季森《遗传》2008,30(12):1640-1646

摘要: 使用计算机模拟数据和真实的芯片数据, 对8种筛选差异表达基因的方法进行了比较分析, 旨在比较不同方法对基因芯片数据的筛选效果。模拟数据分析表明, 所使用的8种方法对均匀分布的差异表达基因有很好的识别、检出作用。算法方面, SAM和Wilcoxon秩和检验方法较好; 数据分布方面, 正态分布的识别效果较好, 卡方分布和指数分布的识别效果较差。杨树cDNA芯片分析表明, SAM、Samroc和回归模型方法相近, 而Wilcoxon秩和检验方法与它们有较大差异。相似文献

3.

A statistical framework for differential network analysis from microarray data 总被引：1，自引：0，他引：1

Ryan Gill Somnath Datta Susmita Datta 《BMC bioinformatics》2010,11(1):95

Background

It has been long well known that genes do not act alone; rather groups of genes act in consort during a biological process. Consequently, the expression levels of genes are dependent on each other. Experimental techniques to detect such interacting pairs of genes have been in place for quite some time. With the advent of microarray technology, newer computational techniques to detect such interaction or association between gene expressions are being proposed which lead to an association network. While most microarray analyses look for genes that are differentially expressed, it is of potentially greater significance to identify how entire association network structures change between two or more biological settings, say normal versus diseased cell types. 相似文献

4.

A conditional density error model for the statistical analysis of microarray data 总被引：1，自引：0，他引：1

Love B Rank DR Penn SG Jenkins DA Thomas RS 《Bioinformatics (Oxford, England)》2002,18(8):1064-1072

相似文献

5.

A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments 总被引：10，自引：0，他引：10

Pan W 《Bioinformatics (Oxford, England)》2002,18(4):546-554

MOTIVATION: A common task in analyzing microarray data is to determine which genes are differentially expressed across two kinds of tissue samples or samples obtained under two experimental conditions. Recently several statistical methods have been proposed to accomplish this goal when there are replicated samples under each condition. However, it may not be clear how these methods compare with each other. Our main goal here is to compare three methods, the t-test, a regression modeling approach (Thomas et al., Genome Res., 11, 1227-1236, 2001) and a mixture model approach (Pan et al., http://www.biostat.umn.edu/cgi-bin/rrs?print+2001,2001a,b) with particular attention to their different modeling assumptions. RESULTS: It is pointed out that all the three methods are based on using the two-sample t-statistic or its minor variation, but they differ in how to associate a statistical significance level to the corresponding statistic, leading to possibly large difference in the resulting significance levels and the numbers of genes detected. In particular, we give an explicit formula for the test statistic used in the regression approach. Using the leukemia data of Golub et al. (Science, 285, 531-537, 1999), we illustrate these points. We also briefly compare the results with those of several other methods, including the empirical Bayesian method of Efron et al. (J. Am. Stat. Assoc., to appear, 2001) and the Significance Analysis of Microarray (SAM) method of Tusher et al. (PROC: Natl Acad. Sci. USA, 98, 5116-5121, 2001). 相似文献

6.

Visualization and analysis of microarray and gene ontology data with treemaps

Eric?H?Baehrecke Email author Niem?Dang Ketan?Babaria Ben?Shneiderman 《BMC bioinformatics》2004,5(1):84

Background

The increasing complexity of genomic data presents several challenges for biologists. Limited computer monitor views of data complexity and the dynamic nature of data in the midst of discovery increase the challenge of integrating experimental results with information resources. The use of Gene Ontology enables researchers to summarize results of quantitative analyses in this framework, but the limitations of typical browser presentation restrict data access. 相似文献

7.

Visualization of microarray gene expression data

下载免费PDF全文

Prasad TV Ahson SI 《Bioinformation》2006,1(4):141-145

Microarray gene expression data is used in various biological and medical investigations. Processing of gene expression data requires algorithms in data mining, process automation and knowledge discovery. Available data mining algorithms exploits various visualization techniques. Here, we describe the merits and demerits of various visualization parameters used in gene expression analysis. 相似文献

8.

Statistical methods for microarray assays 总被引：1，自引：0，他引：1

Krajewski P Bocianowski J 《Journal of applied genetics》2002,43(3):269-278

The paper shortly reviews statistical methods used in the area of DNA microarray studies. All stages of the experiment are taken into account: planning, data collection, data preprocessing, analysis and validation. Among the methods of data analysis, the algorithms for estimating differential expression, multivariate approaches, clustering methods, as well as classification and discrimination are reviewed. The need is stressed for routine statistical data processing protocols and for the search of links of microarray data analysis with quantitative genetic models. 相似文献

9.

Bivariate microarray analysis: statistical interpretation of two-channel functional genomics data

Albert Hsiao Shankar Subramaniam 《Systems and synthetic biology》2008,2(3-4):95-104

Conventional statistical methods for interpreting microarray data require large numbers of replicates in order to provide sufficient levels of sensitivity. We recently described a method for identifying differentially-expressed genes in one-channel microarray data 1. Based on the idea that the variance structure of microarray data can itself be a reliable measure of noise, this method allows statistically sound interpretation of as few as two replicates per treatment condition. Unlike the one-channel array, the two-channel platform simultaneously compares gene expression in two RNA samples. This leads to covariation of the measured signals. Hence, by accounting for covariation in the variance model, we can significantly increase the power of the statistical test. We believe that this approach has the potential to overcome limitations of existing methods. We present here a novel approach for the analysis of microarray data that involves modeling the variance structure of paired expression data in the context of a Bayesian framework. We also describe a novel statistical test that can be used to identify differentially-expressed genes. This method, bivariate microarray analysis (BMA), demonstrates dramatically improved sensitivity over existing approaches. We show that with only two array replicates, it is possible to detect gene expression changes that are at best detected with six array replicates by other methods. Further, we show that combining results from BMA with Gene Ontology annotation yields biologically significant results in a ligand-treated macrophage cell system. 相似文献

10.

goCluster integrates statistical analysis and functional interpretation of microarray expression data 总被引：2，自引：0，他引：2

Wrobel G Chalmel F Primig M 《Bioinformatics (Oxford, England)》2005,21(17):3575-3577

相似文献

11.

A probabilistic framework for microarray data analysis: Fundamental probability models and statistical inference

Babatunde A. Ogunnaike Claudio A. Gelmi 《Journal of theoretical biology》2010,264(2):211-222

Gene expression studies generate large quantities of data with the defining characteristic that the number of genes (whose expression profiles are to be determined) exceed the number of available replicates by several orders of magnitude. Standard spot-by-spot analysis still seeks to extract useful information for each gene on the basis of the number of available replicates, and thus plays to the weakness of microarrays. On the other hand, because of the data volume, treating the entire data set as an ensemble, and developing theoretical distributions for these ensembles provides a framework that plays instead to the strength of microarrays. We present theoretical results that under reasonable assumptions, the distribution of microarray intensities follows the Gamma model, with the biological interpretations of the model parameters emerging naturally. We subsequently establish that for each microarray data set, the fractional intensities can be represented as a mixture of Beta densities, and develop a procedure for using these results to draw statistical inference regarding differential gene expression. We illustrate the results with experimental data from gene expression studies on Deinococcus radiodurans following DNA damage using cDNA microarrays. 相似文献

12.

A benchmark for statistical microarray data analysis that preserves actual biological and technical variance

Benoît De Hertogh Bertrand De Meulder Fabrice Berger Michael Pierre Eric Bareke Anthoula Gaigneaux Eric Depiereux 《BMC bioinformatics》2010,11(1):17

Background

Recent reanalysis of spike-in datasets underscored the need for new and more accurate benchmark datasets for statistical microarray analysis. We present here a fresh method using biologically-relevant data to evaluate the performance of statistical methods. 相似文献

13.

Identification of novel universal housekeeping genes by statistical analysis of microarray data

Lee S Jo M Lee J Koh SS Kim S 《Journal of biochemistry and molecular biology》2007,40(2):226-231

Housekeeping genes are widely used as internal controls in a variety of study types, including real time RT-PCR, microarrays, Northern analysis and RNase protection assays. However, even commonly used housekeeping genes may vary in stability depending on the cell type or disease being studied. Thus, it is necessary to identify additional housekeeping-type genes that show sample-independent stability. Here, we used statistical analysis to examine a large human microarray database, seeking genes that were stably expressed in various tissues, disease states and cell lines. We further selected genes that were expressed at different levels, because reference and target genes should be present in similar copy numbers to achieve reliable quantitative results. Real time RT-PCR amplification of three newly identified reference genes, CGI-119, CTBP1 and GOLGAl, alongside three well-known housekeeping genes, B2M, GAPD, and TUBB, confirmed that the newly identified genes were more stably expressed in individual samples with similar ranges. These results collectively suggest that statistical analysis of microarray data can be used to identify new candidate housekeeping genes showing consistent expression across tissues and diseases. Our analysis identified three novel candidate housekeeping genes (CGI-119, GOLGA1, and CTBP1) that could prove useful for normalization across a variety of RNA-based techniques. 相似文献

14.

Comparison of statistical methods for the analysis of limiting dilution assays

Loren Cobb Louis Cyr Marcia K. Schmehl Harvey L. Bank 《In vitro cellular & developmental biology. Plant》1989,25(1):76-81

Summary This study reports the results of a critical comparison of five statistical methods for estimating the density of viable cells in a limiting dilution assay (LDA). Artificial data were generated using Monte Carlo simulation. The performance of each statistical method was examined with respect to the accuracy of its estimator and, most importantly, the accuracy of its associated estimated standard error (SE). The regression method was found to perform at a level that is unacceptable for scientific research, due primarily to gross underestimation of the SE. The maximum likelihood method exhibited the best overall performance. A corrected version of Taswell's weighted-mean method, which provides the best performance among all noniterative methods examined, is also presented. 相似文献

15.

Integrating biological information into the statistical analysis and design of microarray experiments

Rosa GJ Vazquez AI 《Animal : an international journal of animal bioscience》2010,4(2):165-172

Microarray technology is a powerful tool for animal functional genomics studies, with applications spanning from gene identification and mapping, to function and control of gene expression. Microarray assays, however, are complex and costly, and hence generally performed with relatively small number of animals. Nevertheless, they generate data sets of unprecedented complexity and dimensionality. Therefore, such trials require careful planning and experimental design, in addition to tailored statistical and computational tools for their appropriate data mining. In this review, we discuss experimental design and data analysis strategies, which incorporate prior genomic and biological knowledge, such as genotypes and gene function and pathway membership. We focus the discussion on the design of genetical genomics studies, and on significance testing for detection of differential expression. It is shown that the use of prior biological information can improve the efficiency of microarray experiments. 相似文献

16.

Identification of arthritis-related gene clusters by microarray analysis of two independent mouse models for rheumatoid arthritis

Fujikado N Saijo S Iwakura Y 《Arthritis research & therapy》2006,8(4):R100-13

相似文献

17.

Comprehensive literature review and statistical considerations for microarray meta-analysis

Tseng GC Ghosh D Feingold E 《Nucleic acids research》2012,40(9):3785-3799

With the rapid advances of various high-throughput technologies, generation of '-omics' data is commonplace in almost every biomedical field. Effective data management and analytical approaches are essential to fully decipher the biological knowledge contained in the tremendous amount of experimental data. Meta-analysis, a set of statistical tools for combining multiple studies of a related hypothesis, has become popular in genomic research. Here, we perform a systematic search from PubMed and manual collection to obtain 620 genomic meta-analysis papers, of which 333 microarray meta-analysis papers are summarized as the basis of this paper and the other 249 GWAS meta-analysis papers are discussed in the next companion paper. The review in the present paper focuses on various biological purposes of microarray meta-analysis, databases and software and related statistical procedures. Statistical considerations of such an analysis are further scrutinized and illustrated by a case study. Finally, several open questions are listed and discussed. 相似文献

18.

Evaluation of normalization methods for microarray data

Taesung?Park Email author Sung-Gon?Yi Sung-Hyun?Kang SeungYeoun?Lee Yong-Sung?Lee Richard?Simon 《BMC bioinformatics》2003,4(1):33

Background

Microarray technology allows the monitoring of expression levels for thousands of genes simultaneously. This novel technique helps us to understand gene regulation as well as gene by gene interactions more systematically. In the microarray experiment, however, many undesirable systematic variations are observed. Even in replicated experiment, some variations are commonly observed. Normalization is the process of removing some sources of variation which affect the measured gene expression levels. Although a number of normalization methods have been proposed, it has been difficult to decide which methods perform best. Normalization plays an important role in the earlier stage of microarray data analysis. The subsequent analysis results are highly dependent on normalization.

Results

In this paper, we use the variability among the replicated slides to compare performance of normalization methods. We also compare normalization methods with regard to bias and mean square error using simulated data.

Conclusions

Our results show that intensity-dependent normalization often performs better than global normalization methods, and that linear and nonlinear normalization methods perform similarly. These conclusions are based on analysis of 36 cDNA microarrays of 3,840 genes obtained in an experiment to search for changes in gene expression profiles during neuronal differentiation of cortical stem cells. Simulation studies confirm our findings.

相似文献

19.

Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis

Pan Du Xiao Zhang Chiang-Ching Huang Nadereh Jafari Warren A Kibbe Lifang Hou Simon M Lin 《BMC bioinformatics》2010,11(1):587

Background

High-throughput profiling of DNA methylation status of CpG islands is crucial to understand the epigenetic regulation of genes. The microarray-based Infinium methylation assay by Illumina is one platform for low-cost high-throughput methylation profiling. Both Beta-value and M-value statistics have been used as metrics to measure methylation levels. However, there are no detailed studies of their relations and their strengths and limitations. 相似文献

20.

Evaluation and comparison of gene clustering methods in microarray analysis 总被引：4，自引：0，他引：4

Thalamuthu A Mukhopadhyay I Zheng X Tseng GC 《Bioinformatics (Oxford, England)》2006,22(19):2405-2412

MOTIVATION: Microarray technology has been widely applied in biological and clinical studies for simultaneous monitoring of gene expression in thousands of genes. Gene clustering analysis is found useful for discovering groups of correlated genes potentially co-regulated or associated to the disease or conditions under investigation. Many clustering methods including hierarchical clustering, K-means, PAM, SOM, mixture model-based clustering and tight clustering have been widely used in the literature. Yet no comprehensive comparative study has been performed to evaluate the effectiveness of these methods. RESULTS: In this paper, six gene clustering methods are evaluated by simulated data from a hierarchical log-normal model with various degrees of perturbation as well as four real datasets. A weighted Rand index is proposed for measuring similarity of two clustering results with possible scattered genes (i.e. a set of noise genes not being clustered). Performance of the methods in the real data is assessed by a predictive accuracy analysis through verified gene annotations. Our results show that tight clustering and model-based clustering consistently outperform other clustering methods both in simulated and real data while hierarchical clustering and SOM perform among the worst. Our analysis provides deep insight to the complicated gene clustering problem of expression profile and serves as a practical guideline for routine microarray cluster analysis. 相似文献