1.

A scalable method for integration and functional analysis of multiple microarray datasets (total citations: 6; self-citations: 0; citations by others: 6)

Huttenhower C Hibbs M Myers C Troyanskaya OG 《Bioinformatics (Oxford, England)》2006,22(23):2890-2897

MOTIVATION: The diverse microarray datasets that have become available over the past several years represent a rich opportunity and challenge for biological data mining. Many supervised and unsupervised methods have been developed for the analysis of individual microarray datasets. However, integrated analysis of multiple datasets can provide a broader insight into genetic regulation of specific biological pathways under a variety of conditions. RESULTS: To aid in the analysis of such large compendia of microarray experiments, we present Microarray Experiment Functional Integration Technology (MEFIT), a scalable Bayesian framework for predicting functional relationships from integrated microarray datasets. Furthermore, MEFIT predicts these functional relationships within the context of specific biological processes. All results are provided in the context of one or more specific biological functions, which can be provided by a biologist or drawn automatically from catalogs such as the Gene Ontology (GO). Using MEFIT, we integrated 40 Saccharomyces cerevisiae microarray datasets spanning 712 unique conditions. In tests based on 110 biological functions drawn from the GO biological process ontology, MEFIT provided a 5% or greater performance increase for 54 functions, with a 5% or more decrease in performance in only two functions.

2.

Bioinformatics applications for pathway analysis of microarray data (total citations: 3; self-citations: 0; citations by others: 3)

Werner T 《Current opinion in biotechnology》2008,19(1):50-54

3.

Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments

Maureen A Sartor Craig R Tomlinson Scott C Wesselkamper Siva Sivaganesan George D Leikauf Mario Medvedovic 《BMC bioinformatics》2006,7(1):538-17

Background

The small sample sizes often used for microarray experiments result in poor estimates of variance if each gene is considered independently. Yet accurately estimating variability of gene expression measurements in microarray experiments is essential for correctly identifying differentially expressed genes. Several recently developed methods for testing differential expression of genes utilize hierarchical Bayesian models to "pool" information from multiple genes. We have developed a statistical testing procedure that further improves upon current methods by incorporating the well-documented relationship between the absolute gene expression level and the variance of gene expression measurements into the general empirical Bayes framework.

4.

A non-parametric meta-analysis approach for combining independent microarray datasets: application using two microarray datasets pertaining to chronic allograft nephropathy

Xiangrong Kong Valeria Mas Kellie J Archer 《BMC genomics》2008,9(1):1-13

5.

Methods for evaluating gene expression from Affymetrix microarray datasets

Ning Jiang Lindsey J Leach Xiaohua Hu Elena Potokina Tianye Jia Arnis Druka Robbie Waugh Michael J Kearsey Zewei W Luo 《BMC bioinformatics》2008,9(1):284

6.

Sample size calculation for multiple testing in microarray data analysis

Jung SH Bang H Young S 《Biostatistics (Oxford, England)》2005,6(1):157-169

Microarray technology is rapidly emerging for genome-wide screening of differentially expressed genes between clinical subtypes or different conditions of human diseases. Traditional statistical testing approaches, such as the two-sample t-test or Wilcoxon test, are frequently used for evaluating statistical significance of informative expressions but require adjustment for large-scale multiplicity. Due to its simplicity, Bonferroni adjustment has been widely used to circumvent this problem. It is well known, however, that the standard Bonferroni test is often very conservative. In the present paper, we compare three multiple testing procedures in the microarray context: the original Bonferroni method, a Bonferroni-type improved single-step method and a step-down method. The latter two methods are based on nonparametric resampling, by which the null distribution can be derived with the dependency structure among gene expressions preserved and the family-wise error rate accurately controlled at the desired level. We also present a sample size calculation method for designing microarray studies. Through simulations and data analyses, we find that the proposed methods for testing and sample size calculation are computationally fast and control error and power precisely.
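As a rough illustration of why a step-down procedure can be less conservative than the single-step Bonferroni adjustment mentioned in the abstract above, the following minimal sketch compares Bonferroni with Holm's step-down adjustment. This is an assumption-laden stand-in: Holm's method is a simple, non-resampling step-down procedure and is not the resampling-based method the authors propose, and the p-values and function names are hypothetical.

```python
def bonferroni(pvals):
    """Single-step Bonferroni: multiply every p-value by the number of tests."""
    m = len(pvals)
    return [min(1.0, p * m) for p in pvals]


def holm(pvals):
    """Holm step-down: the k-th smallest p-value is multiplied by (m - k + 1),
    and adjusted values are forced to be non-decreasing in rank order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, i in enumerate(order):
        candidate = min(1.0, (m - rank) * pvals[i])
        running_max = max(running_max, candidate)
        adjusted[i] = running_max
    return adjusted


# Hypothetical raw p-values for four genes (illustration only).
raw = [0.001, 0.012, 0.02, 0.04]
alpha = 0.05

bonf_rejected = sum(p < alpha for p in bonferroni(raw))
holm_rejected = sum(p < alpha for p in holm(raw))
print(bonf_rejected, holm_rejected)
```

Both adjustments control the family-wise error rate, but on these hypothetical values the step-down version rejects more hypotheses, which is the sense in which single-step Bonferroni is conservative.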

7.

MADGene: retrieval and processing of gene identifier lists for the analysis of heterogeneous microarray datasets

Baron D Bihouée A Teusan R Dubois E Savagner F Steenman M Houlgatte R Ramstein G 《Bioinformatics (Oxford, England)》2011,27(5):725-726

MADGene is a software environment comprising a web-based database and a java application. This platform aims at unifying gene identifiers (ids) and performing gene set analysis. MADGene allows the user to perform inter-conversion of clone and gene ids over a large range of nomenclatures relative to 17 species. We propose a set of 23 functions to facilitate the analysis of gene sets and we give two microarray applications to show how MADGene can be used to conduct meta-analyses. AVAILABILITY: The MADGene resources are freely available online from http://www.madtools.org, a website dedicated to the analysis and annotation of DNA microarray data.

8.

Bayesian network based pathway analysis of microarray data

Senol Isci Cengihan Ozturk Jon Jones Hasan Otu 《Current opinion in biotechnology》2011

9.

Special issue: integration of OMICs datasets into metabolic pathway analysis

Kaleta C de Figueiredo LF Heiland I Klamt S Schuster S 《Bio Systems》2011,105(2):107-108

10.

Integrated analysis of multiple microarray datasets identifies a reproducible survival predictor in ovarian cancer

Konstantinopoulos PA Cannistra SA Fountzilas H Culhane A Pillay K Rueda B Cramer D Seiden M Birrer M Coukos G Zhang L Quackenbush J Spentzos D 《PloS one》2011,6(3):e18202

Background

Public data integration may help overcome challenges in clinical implementation of microarray profiles. We integrated several ovarian cancer datasets to identify a reproducible predictor of survival.

Methodology/Principal Findings

Four microarray datasets from different institutions comprising 265 advanced stage tumors were uniformly reprocessed into a single training dataset, also adjusting for inter-laboratory variation (“batch-effect”). Supervised principal component survival analysis was employed to identify prognostic models. Models were independently validated in a 61-patient cohort using a custom array genechip and a publicly available 229-array dataset. Molecular correspondence of high- and low-risk outcome groups between training and validation datasets was demonstrated using Subclass Mapping. Previously established molecular phenotypes in the 2nd validation set were correlated with high and low-risk outcome groups. Functional representational and pathway analysis was used to explore gene networks associated with high and low risk phenotypes. A 19-gene model showed optimal performance in the training set (median OS 31 and 78 months, p < 0.01), 1st validation set (median OS 32 months versus not-yet-reached, p = 0.026) and 2nd validation set (median OS 43 versus 61 months, p = 0.013) maintaining independent prognostic power in multivariate analysis. There was strong molecular correspondence of the respective high- and low-risk tumors between training and 1st validation set. Low and high-risk tumors were enriched for favorable and unfavorable molecular subtypes and pathways, previously defined in the public 2nd validation set.

Conclusions/Significance

Integration of previously generated cancer microarray datasets may lead to robust and widely applicable survival predictors. These predictors are not simply a compilation of prognostic genes but appear to track true molecular phenotypes of good- and poor-outcome.

11.

An interactive power analysis tool for microarray hypothesis testing and generation

Seo J Gordish-Dressman H Hoffman EP 《Bioinformatics (Oxford, England)》2006,22(7):808-814

MOTIVATION: Human clinical projects typically require a priori statistical power analyses. Towards this end, we sought to build a flexible and interactive power analysis tool for microarray studies integrated into our public domain HCE 3.5 software package. We then sought to determine if probe set algorithms or organism type strongly influenced power analysis results. RESULTS: The HCE 3.5 power analysis tool was designed to import any pre-existing Affymetrix microarray project, and interactively test the effects of user-defined definitions of alpha (significance), beta (1-power), sample size and effect size. The tool generates a filter for all probe sets or more focused ontology-based subsets, with or without noise filters that can be used to limit analyses of a future project to appropriately powered probe sets. We studied projects from three organisms (Arabidopsis, rat, human), and three probe set algorithms (MAS5.0, RMA, dChip PM/MM). We found large differences in power results based on probe set algorithm selection and noise filters. RMA provided high sensitivity for low numbers of arrays, but this came at a cost of high false positive results (24% false positive in the human project studied). Our data suggest that a priori power calculations are important for both experimental design in hypothesis testing and hypothesis generation, as well as for the selection of optimized data analysis parameters. AVAILABILITY: The Hierarchical Clustering Explorer 3.5 with the interactive power analysis functions is available at www.cs.umd.edu/hcil/hce or www.cnmcresearch.org/bioinformatics. CONTACT: jseo@cnmcresearch.org

12.

Multi-membership gene regulation in pathway based microarray analysis

Stelios P Pavlidis Annette M Payne Stephen M Swift 《Algorithms for molecular biology : AMB》2011,6(1):1-22

Background

Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway.

Results

We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency.

Conclusions

We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes.

13.

Cross-platform comparability of microarray technology: Intra-platform consistency and appropriate data analysis procedures are essential

Shi L Tong W Fang H Scherf U Han J Puri RK Frueh FW Goodsaid FM Guo L Su Z Han T Fuscoe JC Xu ZA Patterson TA Hong H Xie Q Perkins RG Chen JJ Casciano DA 《BMC bioinformatics》2005,6(Z2):S12

Background

The acceptance of microarray technology in regulatory decision-making is being challenged by the existence of various platforms and data analysis methods. A recent report (E. Marshall, Science, 306, 630–631, 2004), by extensively citing the study of Tan et al. (Nucleic Acids Res., 31, 5676–5684, 2003), portrays a disturbingly negative picture of the cross-platform comparability, and, hence, the reliability of microarray technology.

Results

We reanalyzed Tan's dataset and found that the intra-platform consistency was low, indicating a problem in experimental procedures from which the dataset was generated. Furthermore, by using three gene selection methods (i.e., p-value ranking, fold-change ranking, and Significance Analysis of Microarrays (SAM)) on the same dataset we found that p-value ranking (the method emphasized by Tan et al.) results in much lower cross-platform concordance compared to fold-change ranking or SAM. Therefore, the low cross-platform concordance reported in Tan's study appears to be mainly due to a combination of low intra-platform consistency and a poor choice of data analysis procedures, instead of inherent technical differences among different platforms, as suggested by Tan et al. and Marshall.

Conclusion

Our results illustrate the importance of establishing calibrated RNA samples and reference datasets to objectively assess the performance of different microarray platforms and the proficiency of individual laboratories as well as the merits of various data analysis procedures. Thus, we are progressively coordinating the MAQC project, a community-wide effort for microarray quality control.

14.

A new approach for filtering noise from high-density oligonucleotide microarray datasets (total citations: 2; self-citations: 4; citations by others: 2)

Mills JC Gordon JI 《Nucleic acids research》2001,29(15):e72-E72

15.

Quantitative analysis of DNA hybridization in a flowthrough microarray for molecular testing

Mocanu D Kolesnychenko A Aarts S Dejong AT Pierik A Coene W Vossenaar E Stapert H 《Analytical biochemistry》2008,380(1):84-90

Quantitative information about the nucleic acids hybridization reaction on microarrays is fundamental to designing optimized assays for molecular diagnostics. This study presents the kinetic, equilibrium, and thermodynamic analyses of DNA hybridization in a microarray system designed for fast molecular testing of pathogenic bacteria. Our microarray setup uses a porous, nylon membrane for probe immobilization and flowthrough incubation. The Langmuir model was used to determine the reaction rate constants of hybridization with antisense targets specific to Staphylococcus epidermidis and Staphylococcus aureus strains. The kinetic analysis revealed a sequence-dependent reaction rate, with association rate constants on the order of 10⁵ M⁻¹ s⁻¹ and dissociation rate constants of 10⁻⁴ s⁻¹. We found that by increasing the probe surface density from 10¹¹ to 10¹² molecules/cm², the hybridization rate and efficiency are suppressed while the melting temperature of the DNA duplex increases. The maximum fraction of hybridized capture probes at equilibrium did not exceed 50% for hybridization with antisense sequences and was below 6% for hybridization with long targets obtained from PCR. The van’t Hoff analysis of the temperature denaturation data showed that the DNA hybridization in our porous, flowthrough microarray is thermodynamically less favorable than the hybridization of the same sequences in solution.
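For orientation, the order-of-magnitude rate constants quoted in the abstract above can be related through the textbook Langmuir adsorption model. The abstract does not give the exact equations the authors fitted, so the form below is an assumption, with θ denoting the fraction of hybridized probes and C the target concentration (both symbols introduced here for illustration).

```latex
\[
\frac{d\theta}{dt} = k_a\, C\,(1-\theta) - k_d\,\theta,
\qquad
\theta_{\mathrm{eq}} = \frac{C}{C + K_D},
\qquad
K_D = \frac{k_d}{k_a} \approx \frac{10^{-4}\ \mathrm{s^{-1}}}{10^{5}\ \mathrm{M^{-1}\,s^{-1}}} = 10^{-9}\ \mathrm{M}
\]
```

Under this idealized model the quoted constants would imply a dissociation constant of roughly 1 nM. The ideal model also predicts full probe occupancy at high target concentration, so it does not by itself account for the sub-50% maximum hybridized fraction the abstract reports.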

16.

Recalculation of 23 mouse HDL QTL datasets improves accuracy and allows for better candidate gene analysis

Cheryl Ackert-Bicknell Beverly Paigen Ron Korstanje 《Journal of lipid research》2013,54(4):984-994

In the past 15 years, the quantitative trait locus (QTL) mapping approach has been applied to crosses between different inbred mouse strains to identify genetic loci associated with plasma HDL cholesterol levels. Although successful, a disadvantage of this method is low mapping resolution, as often several hundred candidate genes fall within the confidence interval for each locus. Methods have been developed to narrow these loci by combining the data from the different crosses, but they rely on the accurate mapping of the QTL and the treatment of the data in a consistent manner. We collected 23 raw datasets used for the mapping of previously published HDL QTL and reanalyzed the data from each cross using a consistent method and the latest mouse genetic map. By utilizing this approach, we identified novel QTL and QTL that were mapped to the wrong part of chromosomes. Our new HDL QTL map allows for reliable combining of QTL data and candidate gene analysis, which we demonstrate by identifying Grin3a and Etv6, as candidate genes for QTL on chromosomes 4 and 6, respectively. In addition, we were able to narrow a QTL on Chr 19 to five candidates.

17.

Group SCAD regression analysis for microarray time course gene expression data (total citations: 1; self-citations: 0; citations by others: 1)

Wang L Chen G Li H 《Bioinformatics (Oxford, England)》2007,23(12):1486-1494

18.

Comparison of different microarray data analysis programs and description of a database for microarray data management

Xu L Maresh GA Giardina J Pincus SH 《DNA and cell biology》2004,23(10):643-651

Data analysis and management represent a major challenge for gene expression studies using microarrays. Here, we compare different methods of analysis and demonstrate the utility of a personal microarray database. Gene expression during HIV infection of cell lines was studied using Affymetrix U-133 A and B chips. The data were analyzed using Affymetrix Microarray Suite and Data Mining Tool, Silicon Genetics GeneSpring, and dChip from Harvard School of Public Health. A small-scale database was established with FileMaker Pro Developer to manage and analyze the data. There was great variability among the programs in the lists of significantly changed genes constructed from the same data. Similarly choices of different parameters for normalization, comparison, and standardization greatly affected the outcome. As many probe sets on the U133 chip target the same Unigene clusters, the Unigene information can be used as an internal control to confirm and interpret the probe set results. Algorithms used for the determination of changes in gene expression require further refinement and standardization. The use of a personal database powered with Unigene information can enhance the analysis of gene expression data.

19.

Self-directed student research through analysis of microarray datasets: a computer-based functional genomics practical class for masters-level students

Grenville-Briggs LJ Stansfield I 《Biochemistry and Molecular Biology Education》2011,39(6):440-447

20.

CARMA: A platform for analyzing microarray datasets that incorporate replicate measures

Kevin A Greer Matthew R McReynolds Heddwen L Brooks James B Hoying 《BMC bioinformatics》2006,7(1):149-13

Background

The incorporation of statistical models that account for experimental variability provides a necessary framework for the interpretation of microarray data. A robust experimental design coupled with an analysis of variance (ANOVA) incorporating a model that accounts for known sources of experimental variability can significantly improve the determination of differences in gene expression and estimations of their significance.