期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Identifying significant temporal variation in time course microarray data without replicates

Stephen C Billups Margaret C Neville Michael Rudolph Weston Porter Pepper Schedin 《BMC bioinformatics》2009,10(1):96

Background

An important component of time course microarray studies is the identification of genes that demonstrate significant time-dependent variation in their expression levels. Until recently, available methods for performing such significance tests required replicates of individual time points. This paper describes a replicate-free method that was developed as part of a study of the estrous cycle in the rat mammary gland in which no replicate data was collected. 相似文献

2.

A methodology for global validation of microarray experiments

Mathieu Miron Owen Z Woody Alexandre Marcil Carl Murie Robert Sladek Robert Nadon 《BMC bioinformatics》2006,7(1):333-17

Background

DNA microarrays are popular tools for measuring gene expression of biological samples. This ever increasing popularity is ensuring that a large number of microarray studies are conducted, many of which with data publicly available for mining by other investigators. Under most circumstances, validation of differential expression of genes is performed on a gene to gene basis. Thus, it is not possible to generalize validation results to the remaining majority of non-validated genes or to evaluate the overall quality of these studies. 相似文献

3.

Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation

Comander J Natarajan S Gimbrone MA García-Cardeña G 《BMC genomics》2004,5(1):17-21

Background

Gene microarray technology provides the ability to study the regulation of thousands of genes simultaneously, but its potential is limited without an estimate of the statistical significance of the observed changes in gene expression. Due to the large number of genes being tested and the comparatively small number of array replicates (e.g., N = 3), standard statistical methods such as the Student's t-test fail to produce reliable results. Two other statistical approaches commonly used to improve significance estimates are a penalized t-test and a Z-test using intensity-dependent variance estimates. 相似文献

4.

Difference-based clustering of short time-course microarray data with replicates

Jihoon Kim Ju Han Kim 《BMC bioinformatics》2007,8(1):253

Background

There are some limitations associated with conventional clustering methods for short time-course gene expression data. The current algorithms require prior domain knowledge and do not incorporate information from replicates. Moreover, the results are not always easy to interpret biologically. 相似文献

5.

Improving the accuracy of expression data analysis in time course experiments using resampling

Wencke Walter Bernd Striberny Emmanuel Gaquerel Ian T Baldwin Sang-Gyu Kim Ines Heiland 《BMC bioinformatics》2014,15(1)

相似文献

6.

An assessment of false discovery rates and statistical significance in label-free quantitative proteomics with combined filters

Qingbo Li Bryan AP Roxas 《BMC bioinformatics》2009,10(1):43-18

Background

Many studies have provided algorithms or methods to assess a statistical significance in quantitative proteomics when multiple replicates for a protein sample and a LC/MS analysis are available. But, confidence is still lacking in using datasets for a biological interpretation without protein sample replicates. Although a fold-change is a conventional threshold that can be used when there are no sample replicates, it does not provide an assessment of statistical significance such as a false discovery rate (FDR) which is an important indicator of the reliability to identify differentially expressed proteins. In this work, we investigate whether differentially expressed proteins can be detected with a statistical significance from a pair of unlabeled protein samples without replicates and with only duplicate LC/MS injections per sample. A FDR is used to gauge the statistical significance of the differentially expressed proteins. 相似文献

7.

A comprehensive re-analysis of the Golden Spike data: Towards a benchmark for differential expression methods

Richard D Pearson 《BMC bioinformatics》2008,9(1):164

Background

The Golden Spike data set has been used to validate a number of methods for summarizing Affymetrix data sets, sometimes with seemingly contradictory results. Much less use has been made of this data set to evaluate differential expression methods. It has been suggested that this data set should not be used for method comparison due to a number of inherent flaws. 相似文献

8.

puma: a Bioconductor package for propagating uncertainty in microarray analysis

Richard D Pearson Xuejun Liu Guido Sanguinetti Marta Milo Neil D Lawrence Magnus Rattray 《BMC bioinformatics》2009,10(1):211

Background

Most analyses of microarray data are based on point estimates of expression levels and ignore the uncertainty of such estimates. By determining uncertainties from Affymetrix GeneChip data and propagating these uncertainties to downstream analyses it has been shown that we can improve results of differential expression detection, principal component analysis and clustering. Previously, implementations of these uncertainty propagation methods have only been available as separate packages, written in different languages. Previous implementations have also suffered from being very costly to compute, and in the case of differential expression detection, have been limited in the experimental designs to which they can be applied. 相似文献

9.

Overdispersed logistic regression for SAGE: Modelling multiple groups and covariates

Keith?A?Baggerly Email author Li?Deng Jeffrey?S?Morris C Marcelo?Aldaz 《BMC bioinformatics》2004,5(1):144

Background

Two major identifiable sources of variation in data derived from the Serial Analysis of Gene Expression (SAGE) are within-library sampling variability and between-library heterogeneity within a group. Most published methods for identifying differential expression focus on just the sampling variability. In recent work, the problem of assessing differential expression between two groups of SAGE libraries has been addressed by introducing a beta-binomial hierarchical model that explicitly deals with both of the above sources of variation. This model leads to a test statistic analogous to a weighted two-sample t-test. When the number of groups involved is more than two, however, a more general approach is needed. 相似文献

10.

ArrayD: A general purpose software for Microarray design

Anu?Sharma Gyan?Prakash?Srivastava Vineet?K?Sharma Srinivasan?Ramachandran Email author 《BMC bioinformatics》2004,5(1):142

Background

Microarray is a high-throughput technology to study expression of thousands of genes in parallel. A critical aspect of microarray production is the design aimed at space optimization while maximizing the number of gene probes and their replicates to be spotted. 相似文献

11.

miR-Q: a novel quantitative RT-PCR approach for the expression profiling of small RNA molecules such as miRNAs in a complex sample

Soroush Sharbati-Tehrani Barbara Kutz-Lohroff Ramona Bergbauer Jutta Scholven Ralf Einspanier 《BMC molecular biology》2008,9(1):34

Background

MicroRNAs (miRNAs) are small endogenous non-coding interfering RNA molecules regarded as major regulators in eukaryotic gene expression. Different methods are employed for miRNA expression profiling. For a better understanding of their role in essential biological processes, convenient methods for differential miRNA expression analysis are required. 相似文献

12.

A meta-data based method for DNA microarray imputation

Rebecka Jörnsten Ming Ouyang Hui-Yu Wang 《BMC bioinformatics》2007,8(1):109

Background

DNA microarray experiments are conducted in logical sets, such as time course profiling after a treatment is applied to the samples, or comparisons of the samples under two or more conditions. Due to cost and design constraints of spotted cDNA microarray experiments, each logical set commonly includes only a small number of replicates per condition. Despite the vast improvement of the microarray technology in recent years, missing values are prevalent. Intuitively, imputation of missing values is best done using many replicates within the same logical set. In practice, there are few replicates and thus reliable imputation within logical sets is difficult. However, it is in the case of few replicates that the presence of missing values, and how they are imputed, can have the most profound impact on the outcome of downstream analyses (e.g. significance analysis and clustering). This study explores the feasibility of imputation across logical sets, using the vast amount of publicly available microarray data to improve imputation reliability in the small sample size setting. 相似文献

13.

Merged consensus clustering to assess and improve class discovery with microarray data

T Ian Simpson J Douglas Armstrong Andrew P Jarman 《BMC bioinformatics》2010,11(1):590

Background

One of the most commonly performed tasks when analysing high throughput gene expression data is to use clustering methods to classify the data into groups. There are a large number of methods available to perform clustering, but it is often unclear which method is best suited to the data and how to quantify the quality of the classifications produced. 相似文献

14.

Identifying differential exon splicing using linear models and correlation coefficients

Sonia H Shah Jacqueline A Pallas 《BMC bioinformatics》2009,10(1):26

Background

With the availability of the Affymetrix exon arrays a number of tools have been developed to enable the analysis. These however can be expensive or have several pre-installation requirements. This led us to develop an analysis workflow for analysing differential splicing using freely available software packages that are already being widely used for gene expression analysis. The workflow uses the packages in the standard installation of R and Bioconductor (BiocLite) to identify differential splicing. We use the splice index method with the LIMMA framework. The main drawback with this approach is that it relies on accurate estimates of gene expression from the probe-level data. Methods such as RMA and PLIER may misestimate when a large proportion of exons are spliced. We therefore present the novel concept of a gene correlation coefficient calculated using only the probeset expression pattern within a gene. We show that genes with lower correlation coefficients are likely to be differentially spliced. 相似文献

15.

Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays

Leah?Barrera Chris?Benner Yong-Chuan?Tao Elizabeth?Winzeler Yingyao?Zhou Email author 《BMC bioinformatics》2004,5(1):42

Background

To identify differentially expressed genes across experimental conditions in oligonucleotide microarray experiments, existing statistical methods commonly use a summary of probe-level expression data for each probe set and compare replicates of these values across conditions using a form of the t-test or rank sum test. Here we propose the use of a statistical method that takes advantage of the built-in redundancy architecture of high-density oligonucleotide arrays. 相似文献

16.

Methods for evaluating gene expression from Affymetrix microarray datasets

Ning Jiang Lindsey J Leach Xiaohua Hu Elena Potokina Tianye Jia Arnis Druka Robbie Waugh Michael J Kearsey Zewei W Luo 《BMC bioinformatics》2008,9(1):284

相似文献

17.

The “Window <Emphasis Type="Italic">t</Emphasis> test”: a simple and powerful approach to detect differentially expressed genes in microarray datasets

Fabrice Berger Benoît De Hertogh Michaël Pierre Anthoula Gaigneaux Eric Depiereux 《Central European Journal of Biology》2008,3(3):327-344

This work focuses on differential expression analysis of microarray datasets. One way to improve such statistical analyses is to integrate biological information in the design of these analyses. In this paper, we will use the relationship between the level of gene expression and variability. Using this biological information, we propose to integrate the information from multiple genes to get a better estimate of individual gene variance, when a small number of replicates are available, to increase the power of the statistical analysis. We describe a strategy named the “Window t test” that uses multiple genes which share a similar expression level to compute the variance which is then incorporated a classic t test. The performances of this new method are evaluated by comparison with classic and widely-used methods for differential expression analysis (the classic Student t test, the Regularized t test (reg t test), SAM, Limma, LPE and Shrinkage t). In each case tested, the results obtained were at least equivalent to the best performing method and, in most cases, outperformed it. Moreover, the Window t test relies on a very simple procedure requiring small computing power compared with other methods designed for microarray differential expression analysis. Electronic Supplementary Material Supplementary material is available for this article at and is accessible for authorized users. 相似文献

18.

A nitty-gritty aspect of correlation and network inference from gene expression data

Lev B Klebanov Andrei Yu Yakovlev 《Biology direct》2008,3(1):35

Background

All currently available methods of network/association inference from microarray gene expression measurements implicitly assume that such measurements represent the actual expression levels of different genes within each cell included in the biological sample under study. Contrary to this common belief, modern microarray technology produces signals aggregated over a random number of individual cells, a "nitty-gritty" aspect of such arrays, thereby causing a random effect that distorts the correlation structure of intra-cellular gene expression levels. 相似文献

19.

EDISA: extracting biclusters from multiple time-series of gene expression profiles

Jochen Supper Martin Strauch Dierk Wanke Klaus Harter Andreas Zell 《BMC bioinformatics》2007,8(1):334

Background

Cells dynamically adapt their gene expression patterns in response to various stimuli. This response is orchestrated into a number of gene expression modules consisting of co-regulated genes. A growing pool of publicly available microarray datasets allows the identification of modules by monitoring expression changes over time. These time-series datasets can be searched for gene expression modules by one of the many clustering methods published to date. For an integrative analysis, several time-series datasets can be joined into a three-dimensional gene-condition-time dataset, to which standard clustering or biclustering methods are, however, not applicable. We thus devise a probabilistic clustering algorithm for gene-condition-time datasets. 相似文献

20.

A novel Mixture Model Method for identification of differentially expressed genes from DNA microarray data

Kayvan?Najarian Email author Maryam?Zaheri Ali?A Rad Siamak?Najarian Javad?Dargahi 《BMC bioinformatics》2004,5(1):201

Background

The main goal in analyzing microarray data is to determine the genes that are differentially expressed across two types of tissue samples or samples obtained under two experimental conditions. Mixture model method (MMM hereafter) is a nonparametric statistical method often used for microarray processing applications, but is known to over-fit the data if the number of replicates is small. In addition, the results of the MMM may not be repeatable when dealing with a small number of replicates. In this paper, we propose a new version of MMM to ensure the repeatability of the results in different runs, and reduce the sensitivity of the results on the parameters. 相似文献