期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Gene set enrichment analysis for non-monotone association and multiple experimental categories

Rongheng Lin Shuangshuang Dai Richard D Irwin Alexandra N Heinloth Gary A Boorman Leping Li 《BMC bioinformatics》2008,9(1):481

Background

Recently, microarray data analyses using functional pathway information, e.g., gene set enrichment analysis (GSEA) and significance analysis of function and expression (SAFE), have gained recognition as a way to identify biological pathways/processes associated with a phenotypic endpoint. In these analyses, a local statistic is used to assess the association between the expression level of a gene and the value of a phenotypic endpoint. Then these gene-specific local statistics are combined to evaluate association for pre-selected sets of genes. Commonly used local statistics include t-statistics for binary phenotypes and correlation coefficients that assume a linear or monotone relationship between a continuous phenotype and gene expression level. Methods applicable to continuous non-monotone relationships are needed. Furthermore, for multiple experimental categories, methods that combine multiple GSEA/SAFE analyses are needed. 相似文献

2.

Classification and biomarker identification using gene network modules and support vector machines

Malik Yousef Mohamed Ketany Larry Manevitz Louise C Showe Michael K Showe 《BMC bioinformatics》2009,10(1):337

Background

Classification using microarray datasets is usually based on a small number of samples for which tens of thousands of gene expression measurements have been obtained. The selection of the genes most significant to the classification problem is a challenging issue in high dimension data analysis and interpretation. A previous study with SVM-RCE (Recursive Cluster Elimination), suggested that classification based on groups of correlated genes sometimes exhibits better performance than classification using single genes. Large databases of gene interaction networks provide an important resource for the analysis of genetic phenomena and for classification studies using interacting genes. 相似文献

3.

Effect of data normalization on fuzzy clustering of DNA microarray data

Seo Young Kim Jae Won Lee Jong Sung Bae 《BMC bioinformatics》2006,7(1):134-14

Background

Microarray technology has made it possible to simultaneously measure the expression levels of large numbers of genes in a short time. Gene expression data is information rich; however, extensive data mining is required to identify the patterns that characterize the underlying mechanisms of action. Clustering is an important tool for finding groups of genes with similar expression patterns in microarray data analysis. However, hard clustering methods, which assign each gene exactly to one cluster, are poorly suited to the analysis of microarray datasets because in such datasets the clusters of genes frequently overlap. 相似文献

4.

Zhipeng Cai Randy Goebel Mohammad R Salavatipour Guohui Lin 《BMC bioinformatics》2007,8(1):206

Background

Gene expression microarray is a powerful technology for genetic profiling diseases and their associated treatments. Such a process involves a key step of biomarker identification, which are expected to be closely related to the disease. A most important task of these identified genes is that they can be used to construct a classifier which can effectively diagnose disease and even recognize the disease subtypes. Binary classification, for example, diseased or healthy, in microarray data analysis has been successful, while multi-class classification, such as cancer subtyping, remains challenging. 相似文献

5.

Quadratic regression analysis for gene discovery and pattern recognition for non-cyclic short time-course microarray experiments

Hua?Liu Email author Sergey?Tarima Aaron?S?Borders Thomas?V?Getchell Marilyn?L?Getchell Arnold?J?Stromberg 《BMC bioinformatics》2005,6(1):106

Background

Cluster analyses are used to analyze microarray time-course data for gene discovery and pattern recognition. However, in general, these methods do not take advantage of the fact that time is a continuous variable, and existing clustering methods often group biologically unrelated genes together. 相似文献

6.

Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study

Junbai?Wang Email author Jan?Delabie Hans?Christian?Aasheim Erlend?Smeland Ola?Myklebost 《BMC bioinformatics》2002,3(1):36

Background

A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high dimensionality of microarray gene expression profiles, the dimensional reduction of raw expression data and the feature selections necessary for, for example, classification of disease samples remains a challenge. To solve the problem we propose a two-level analysis. First self-organizing map (SOM) is used. SOM is a vector quantization method that simplifies and reduces the dimensionality of original measurements and visualizes individual tumor sample in a SOM component plane. Next, hierarchical clustering and K-means clustering is used to identify patterns of gene expression useful for classification of samples. 相似文献

7.

FiGS: a filter-based gene selection workbench for microarray data

Taeho Hwang Choong-Hyun Sun Taegyun Yun Gwan-Su Yi 《BMC bioinformatics》2010,11(1):50

Background

The selection of genes that discriminate disease classes from microarray data is widely used for the identification of diagnostic biomarkers. Although various gene selection methods are currently available and some of them have shown excellent performance, no single method can retain the best performance for all types of microarray datasets. It is desirable to use a comparative approach to find the best gene selection result after rigorous test of different methodological strategies for a given microarray dataset. 相似文献

8.

Instance-based concept learning from multiclass DNA microarray data

Daniel Berrar Ian Bradbury Werner Dubitzky 《BMC bioinformatics》2006,7(1):73

Background

Various statistical and machine learning methods have been successfully applied to the classification of DNA microarray data. Simple instance-based classifiers such as nearest neighbor (NN) approaches perform remarkably well in comparison to more complex models, and are currently experiencing a renaissance in the analysis of data sets from biology and biotechnology. While binary classification of microarray data has been extensively investigated, studies involving multiclass data are rare. The question remains open whether there exists a significant difference in performance between NN approaches and more complex multiclass methods. Comparative studies in this field commonly assess different models based on their classification accuracy only; however, this approach lacks the rigor needed to draw reliable conclusions and is inadequate for testing the null hypothesis of equal performance. Comparing novel classification models to existing approaches requires focusing on the significance of differences in performance. 相似文献

9.

Identifying significant temporal variation in time course microarray data without replicates

Stephen C Billups Margaret C Neville Michael Rudolph Weston Porter Pepper Schedin 《BMC bioinformatics》2009,10(1):96

Background

An important component of time course microarray studies is the identification of genes that demonstrate significant time-dependent variation in their expression levels. Until recently, available methods for performing such significance tests required replicates of individual time points. This paper describes a replicate-free method that was developed as part of a study of the estrous cycle in the rat mammary gland in which no replicate data was collected. 相似文献

10.

Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

Jeff W Chou Tong Zhou William K Kaufmann Richard S Paules Pierre R Bushel 《BMC bioinformatics》2007,8(1):427

Background

A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. 相似文献

11.

PAGE: Parametric Analysis of Gene Set Enrichment

Seon-Young?Kim Email author David?J?Volsky Email author 《BMC bioinformatics》2005,6(1):144

Background

Gene set enrichment analysis (GSEA) is a microarray data analysis method that uses predefined gene sets and ranks of genes to identify significant biological changes in microarray data sets. GSEA is especially useful when gene expression changes in a given microarray data set is minimal or moderate. 相似文献

12.

Microarray analysis of relative gene expression stability for selection of internal reference genes in the rhesus macaque brain

Nigel C Noriega Steven G Kohama Henryk F Urbanski 《BMC molecular biology》2010,11(1):47

Background

Normalization of gene expression data refers to the comparison of expression values using reference standards that are consistent across all conditions of an experiment. In PCR studies, genes designated as "housekeeping genes" have been used as internal reference genes under the assumption that their expression is stable and independent of experimental conditions. However, verification of this assumption is rarely performed. Here we assess the use of gene microarray analysis to facilitate selection of internal reference sequences with higher expression stability across experimental conditions than can be expected using traditional selection methods. 相似文献

13.

Combining Affymetrix microarray results

John?R?Stevens RW?Doerge Email author 《BMC bioinformatics》2005,6(1):57

Background

As the use of microarray technology becomes more prevalent it is not unusual to find several laboratories employing the same microarray technology to identify genes related to the same condition in the same species. Although the experimental specifics are similar, typically a different list of statistically significant genes result from each data analysis. 相似文献

14.

Iterative class discovery and feature selection using Minimal Spanning Trees

Sudhir?Varma Email author Richard?Simon 《BMC bioinformatics》2004,5(1):126

Background

Clustering is one of the most commonly used methods for discovering hidden structure in microarray gene expression data. Most current methods for clustering samples are based on distance metrics utilizing all genes. This has the effect of obscuring clustering in samples that may be evident only when looking at a subset of genes, because noise from irrelevant genes dominates the signal from the relevant genes in the distance calculation. 相似文献

15.

An approach for clustering gene expression data with error information

Brian Tjaden 《BMC bioinformatics》2006,7(1):17-15

相似文献

16.

Gene selection for classification of microarray data based on the Bayes error

Zhang JG Deng HW 《BMC bioinformatics》2007,8(1):370

Background

With DNA microarray data, selecting a compact subset of discriminative genes from thousands of genes is a critical step for accurate classification of phenotypes for, e.g., disease diagnosis. Several widely used gene selection methods often select top-ranked genes according to their individual discriminative power in classifying samples into distinct categories, without considering correlations among genes. A limitation of these gene selection methods is that they may result in gene sets with some redundancy and yield an unnecessary large number of candidate genes for classification analyses. Some latest studies show that incorporating gene to gene correlations into gene selection can remove redundant genes and improve classification accuracy. 相似文献

17.

Sample phenotype clusters in high-density oligonucleotide microarray data sets are revealed using Isomap,a nonlinear algorithm 总被引：2，自引：0，他引：2

Kevin?Dawson Email author Raymond?L?Rodriguez Wasyl?Malyj 《BMC bioinformatics》2005,6(1):195

Background

Life processes are determined by the organism's genetic profile and multiple environmental variables. However the interaction between these factors is inherently non-linear [1]. Microarray data is one representation of the nonlinear interactions among genes and genes and environmental factors. Still most microarray studies use linear methods for the interpretation of nonlinear data. In this study, we apply Isomap, a nonlinear method of dimensionality reduction, to analyze three independent large Affymetrix high-density oligonucleotide microarray data sets. 相似文献

18.

Considerations when using the significance analysis of microarrays (SAM) algorithm

Ola?Larsson Email author Claes?Wahlestedt James?A?Timmons Email author 《BMC bioinformatics》2005,6(1):129

Background

Users of microarray technology typically strive to use universally acceptable data analysis strategies to determine significant expression changes in their experiments. One of the most frequently utilised methods for gene expression data analysis is SAM (significance analysis of microarrays). The impact of selection thresholds, on the output from SAM, may critically alter the conclusion of a study, yet this consideration has not been systematically evaluated in any publication. 相似文献

19.

Array2BIO: from microarray expression data to functional annotation of co-regulated genes

Gabriela G Loots Patrick SG Chain Shalini Mabery Amy Rasley Emilio Garcia Ivan Ovcharenko 《BMC bioinformatics》2006,7(1):307-8

Background

There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. 相似文献

20.

The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison

Allan A Sioson Shrinivasrao P Mane Pinghua Li Wei Sha Lenwood S Heath Hans J Bohnert Ruth Grene 《BMC bioinformatics》2006,7(1):215-15

Background

Analysis of DNA microarray data takes as input spot intensity measurements from scanner software and returns differential expression of genes between two conditions, together with a statistical significance assessment. This process typically consists of two steps: data normalization and identification of differentially expressed genes through statistical analysis. The Expresso microarray experiment management system implements these steps with a two-stage, log-linear ANOVA mixed model technique, tailored to individual experimental designs. The complement of tools in TM4, on the other hand, is based on a number of preset design choices that limit its flexibility. In the TM4 microarray analysis suite, normalization, filter, and analysis methods form an analysis pipeline. TM4 computes integrated intensity values (IIV) from the average intensities and spot pixel counts returned by the scanner software as input to its normalization steps. By contrast, Expresso can use either IIV data or median intensity values (MIV). Here, we compare Expresso and TM4 analysis of two experiments and assess the results against qRT-PCR data. 相似文献