期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

New resampling method for evaluating stability of clusters

Irina M Gana Dresen Tanja Boes Johannes Huesing Markus Neuhaeuser Karl-Heinz Joeckel 《BMC bioinformatics》2008,9(1):42

Background

Hierarchical clustering is a widely applied tool in the analysis of microarray gene expression data. The assessment of cluster stability is a major challenge in clustering procedures. Statistical methods are required to distinguish between real and random clusters. Several methods for assessing cluster stability have been published, including resampling methods such as the bootstrap. 相似文献

2.

Domain-oriented functional analysis based on expression profiling

Ding W Wang L Qiu P Kostich M Greene J Hernandez M 《BMC genomics》2002,3(1):32-10

相似文献

3.

Effect of data normalization on fuzzy clustering of DNA microarray data

Seo Young Kim Jae Won Lee Jong Sung Bae 《BMC bioinformatics》2006,7(1):134-14

Background

Microarray technology has made it possible to simultaneously measure the expression levels of large numbers of genes in a short time. Gene expression data is information rich; however, extensive data mining is required to identify the patterns that characterize the underlying mechanisms of action. Clustering is an important tool for finding groups of genes with similar expression patterns in microarray data analysis. However, hard clustering methods, which assign each gene exactly to one cluster, are poorly suited to the analysis of microarray datasets because in such datasets the clusters of genes frequently overlap. 相似文献

4.

A robust measure of correlation between two genes on a microarray

Johanna Hardin Aya Mitani Leanne Hicks Brian VanKoten 《BMC bioinformatics》2007,8(1):220

Background

The underlying goal of microarray experiments is to identify gene expression patterns across different experimental conditions. Genes that are contained in a particular pathway or that respond similarly to experimental conditions could be co-expressed and show similar patterns of expression on a microarray. Using any of a variety of clustering methods or gene network analyses we can partition genes of interest into groups, clusters, or modules based on measures of similarity. Typically, Pearson correlation is used to measure distance (or similarity) before implementing a clustering algorithm. Pearson correlation is quite susceptible to outliers, however, an unfortunate characteristic when dealing with microarray data (well known to be typically quite noisy.) 相似文献

5.

Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis

Tae-Min Kim Yeun-Jun Chung Mun-Gan Rhyu Myeong Ho Jung 《BMC bioinformatics》2007,8(1):453

Background

Gene clustering has been widely used to group genes with similar expression pattern in microarray data analysis. Subsequent enrichment analysis using predefined gene sets can provide clues on which functional themes or regulatory sequence motifs are associated with individual gene clusters. In spite of the potential utility, gene clustering and enrichment analysis have been used in separate platforms, thus, the development of integrative algorithm linking both methods is highly challenging. 相似文献

6.

Comparison study of microarray meta-analysis methods

Anna Campain Yee Hwa Yang 《BMC bioinformatics》2010,11(1):408

Background

Meta-analysis methods exist for combining multiple microarray datasets. However, there are a wide range of issues associated with microarray meta-analysis and a limited ability to compare the performance of different meta-analysis methods. 相似文献

7.

Computational cluster validation for microarray data analysis: experimental assessment of Clest,Consensus Clustering,Figure of Merit,Gap Statistics and Model Explorer

Raffaele Giancarlo Davide Scaturro Filippo Utro 《BMC bioinformatics》2008,9(1):462

Background

Inferring cluster structure in microarray datasets is a fundamental task for the so-called -omic sciences. It is also a fundamental question in Statistics, Data Analysis and Classification, in particular with regard to the prediction of the number of clusters in a dataset, usually established via internal validation measures. Despite the wealth of internal measures available in the literature, new ones have been recently proposed, some of them specifically for microarray data. 相似文献

8.

Methodological study of affine transformations of gene expression data with proposed robust non-parametric multi-dimensional normalization method

Henrik Bengtsson Ola Hössjer 《BMC bioinformatics》2006,7(1):100-18

Background

Low-level processing and normalization of microarray data are most important steps in microarray analysis, which have profound impact on downstream analysis. Multiple methods have been suggested to date, but it is not clear which is the best. It is therefore important to further study the different normalization methods in detail and the nature of microarray data in general. 相似文献

9.

Microarray data mining using landmark gene-guided clustering

Pankaj Chopra Jaewoo Kang Jiong Yang HyungJun Cho Heenam Stanley Kim Min-Goo Lee 《BMC bioinformatics》2008,9(1):92

Background

Clustering is a popular data exploration technique widely used in microarray data analysis. Most conventional clustering algorithms, however, generate only one set of clusters independent of the biological context of the analysis. This is often inadequate to explore data from different biological perspectives and gain new insights. We propose a new clustering model that can generate multiple versions of different clusters from a single dataset, each of which highlights a different aspect of the given dataset. 相似文献

10.

DiffCoEx: a simple and sensitive method to find differentially coexpressed gene modules

Bruno M Tesson Rainer Breitling Ritsert C Jansen 《BMC bioinformatics》2010,11(1):497

Background

Large microarray datasets have enabled gene regulation to be studied through coexpression analysis. While numerous methods have been developed for identifying differentially expressed genes between two conditions, the field of differential coexpression analysis is still relatively new. More specifically, there is so far no sensitive and untargeted method to identify gene modules (also known as gene sets or clusters) that are differentially coexpressed between two conditions. Here, sensitive and untargeted means that the method should be able to construct de novo modules by grouping genes based on shared, but subtle, differential correlation patterns. 相似文献

11.

Sample size calculation for microarray experiments with blocked one-way design

Sin-Ho Jung Insuk Sohn Stephen L George Liping Feng Phyllis C Leppert 《BMC bioinformatics》2009,10(1):164

Background

One of the main objectives of microarray analysis is to identify differentially expressed genes for different types of cells or treatments. Many statistical methods have been proposed to assess the treatment effects in microarray experiments. 相似文献

12.

Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures

Meng P Tan Erin N Smith James R Broach Christodoulos A Floudas 《BMC bioinformatics》2008,9(1):268

Background

DNA microarray technology allows for the measurement of genome-wide expression patterns. Within the resultant mass of data lies the problem of analyzing and presenting information on this genomic scale, and a first step towards the rapid and comprehensive interpretation of this data is gene clustering with respect to the expression patterns. Classifying genes into clusters can lead to interesting biological insights. In this study, we describe an iterative clustering approach to uncover biologically coherent structures from DNA microarray data based on a novel clustering algorithm EP_GOS_Clust. 相似文献

13.

Nearest Neighbor Networks: clustering expression data based on gene neighborhoods

Curtis Huttenhower Avi I Flamholz Jessica N Landis Sauhard Sahi Chad L Myers Kellen L Olszewski Matthew A Hibbs Nathan O Siemers Olga G Troyanskaya Hilary A Coller 《BMC bioinformatics》2007,8(1):250

Background

The availability of microarrays measuring thousands of genes simultaneously across hundreds of biological conditions represents an opportunity to understand both individual biological pathways and the integrated workings of the cell. However, translating this amount of data into biological insight remains a daunting task. An important initial step in the analysis of microarray data is clustering of genes with similar behavior. A number of classical techniques are commonly used to perform this task, particularly hierarchical and K-means clustering, and many novel approaches have been suggested recently. While these approaches are useful, they are not without drawbacks; these methods can find clusters in purely random data, and even clusters enriched for biological functions can be skewed towards a small number of processes (e.g. ribosomes). 相似文献

14.

Phylogenetic detection of conserved gene clusters in microbial genomes

Yu?Zheng Brian?P?Anton Richard?J?Roberts Simon?Kasif Email author 《BMC bioinformatics》2005,6(1):243

Background

Microbial genomes contain an abundance of genes with conserved proximity forming clusters on the chromosome. However, the conservation can be a result of many factors such as vertical inheritance, or functional selection. Thus, identification of conserved gene clusters that are under functional selection provides an effective channel for gene annotation, microarray screening, and pathway reconstruction. The problem of devising a robust method to identify these conserved gene clusters and to evaluate the significance of the conservation in multiple genomes has a number of implications for comparative, evolutionary and functional genomics as well as synthetic biology. 相似文献

15.

Cluster stability scores for microarray data in cancer studies

Mark?Smolkin Debashis?Ghosh Email author 《BMC bioinformatics》2003,4(1):36

Background

A potential benefit of profiling of tissue samples using microarrays is the generation of molecular fingerprints that will define subtypes of disease. Hierarchical clustering has been the primary analytical tool used to define disease subtypes from microarray experiments in cancer settings. Assessing cluster reliability poses a major complication in analyzing output from clustering procedures. While most work has focused on estimating the number of clusters in a dataset, the question of stability of individual-level clusters has not been addressed. 相似文献

16.

Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes

Patrick?Warnat Roland?Eils Email author Benedikt?Brors 《BMC bioinformatics》2005,6(1):265

相似文献

17.

AGGRESCAN: a server for the prediction and evaluation of "hot spots" of aggregation in polypeptides

Oscar Conchillo-Solé Natalia S de Groot Francesc X Avilés Josep Vendrell Xavier Daura Salvador Ventura 《BMC bioinformatics》2007,8(1):1-17

Background

A tremendous amount of efforts have been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure.

Results

We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data.

Conclusion

We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods. 相似文献

18.

Reuse of imputed data in microarray analysis increases imputation efficiency

Ki-Yeol Kim Byoung-Jin Kim Gwan-Su Yi 《BMC bioinformatics》2004,5(1):160

Background

The imputation of missing values is necessary for the efficient use of DNA microarray data, because many clustering algorithms and some statistical analysis require a complete data set. A few imputation methods for DNA microarray data have been introduced, but the efficiency of the methods was low and the validity of imputed values in these methods had not been fully checked. 相似文献

19.

Array2BIO: from microarray expression data to functional annotation of co-regulated genes

Gabriela G Loots Patrick SG Chain Shalini Mabery Amy Rasley Emilio Garcia Ivan Ovcharenko 《BMC bioinformatics》2006,7(1):307-8

Background

There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. 相似文献

20.

Reordering hierarchical tree based on bilateral symmetric distance

Chae M Chen JJ 《PloS one》2011,6(8):e22546

Background

In microarray data analysis, hierarchical clustering (HC) is often used to group samples or genes according to their gene expression profiles to study their associations. In a typical HC, nested clustering structures can be quickly identified in a tree. The relationship between objects is lost, however, because clusters rather than individual objects are compared. This results in a tree that is hard to interpret.

Methodology/Principal Findings

This study proposes an ordering method, HC-SYM, which minimizes bilateral symmetric distance of two adjacent clusters in a tree so that similar objects in the clusters are located in the cluster boundaries. The performance of HC-SYM was evaluated by both supervised and unsupervised approaches and compared favourably with other ordering methods.

Conclusions/Significance

The intuitive relationship between objects and flexibility of the HC-SYM method can be very helpful in the exploratory analysis of not only microarray data but also similar high-dimensional data. 相似文献