期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A temporal precedence based clustering method for gene expression microarray data

Ritesh Krishna Chang-Tsun Li Vicky Buchanan-Wollaston 《BMC bioinformatics》2010,11(1):68

Background

Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together according to certain characteristics. The majority of clustering techniques are based on distance or visual similarity measures which may not be suitable for clustering of temporal microarray data where the sequential nature of time is important. We present a Granger causality based technique to cluster temporal microarray gene expression data, which measures the interdependence between two time-series by statistically testing if one time-series can be used for forecasting the other time-series or not. 相似文献

2.

A robust measure of correlation between two genes on a microarray

Johanna Hardin Aya Mitani Leanne Hicks Brian VanKoten 《BMC bioinformatics》2007,8(1):220

Background

The underlying goal of microarray experiments is to identify gene expression patterns across different experimental conditions. Genes that are contained in a particular pathway or that respond similarly to experimental conditions could be co-expressed and show similar patterns of expression on a microarray. Using any of a variety of clustering methods or gene network analyses we can partition genes of interest into groups, clusters, or modules based on measures of similarity. Typically, Pearson correlation is used to measure distance (or similarity) before implementing a clustering algorithm. Pearson correlation is quite susceptible to outliers, however, an unfortunate characteristic when dealing with microarray data (well known to be typically quite noisy.) 相似文献

3.

Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study

Junbai?Wang Email author Jan?Delabie Hans?Christian?Aasheim Erlend?Smeland Ola?Myklebost 《BMC bioinformatics》2002,3(1):36

Background

A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high dimensionality of microarray gene expression profiles, the dimensional reduction of raw expression data and the feature selections necessary for, for example, classification of disease samples remains a challenge. To solve the problem we propose a two-level analysis. First self-organizing map (SOM) is used. SOM is a vector quantization method that simplifies and reduces the dimensionality of original measurements and visualizes individual tumor sample in a SOM component plane. Next, hierarchical clustering and K-means clustering is used to identify patterns of gene expression useful for classification of samples. 相似文献

4.

HAMSTER: visualizing microarray experiments as a set of minimum spanning trees

Raymond Wan Larisa Kiseleva Hajime Harada Hiroshi Mamitsuka Paul Horton 《Source code for biology and medicine》2009,4(1):1-18

Background

Visualization tools allow researchers to obtain a global view of the interrelationships between the probes or experiments of a gene expression (e.g. microarray) data set. Some existing methods include hierarchical clustering and k-means. In recent years, others have proposed applying minimum spanning trees (MST) for microarray clustering. Although MST-based clustering is formally equivalent to the dendrograms produced by hierarchical clustering under certain conditions; visually they can be quite different. 相似文献

5.

STEM: a tool for the analysis of short time series gene expression data 总被引：2，自引：0，他引：2

Jason Ernst Ziv Bar-Joseph 《BMC bioinformatics》2006,7(1):191

Background

Time series microarray experiments are widely used to study dynamical biological processes. Due to the cost of microarray experiments, and also in some cases the limited availability of biological material, about 80% of microarray time series experiments are short (3–8 time points). Previously short time series gene expression data has been mainly analyzed using more general gene expression analysis tools not designed for the unique challenges and opportunities inherent in short time series gene expression data. 相似文献

6.

Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures

Meng P Tan Erin N Smith James R Broach Christodoulos A Floudas 《BMC bioinformatics》2008,9(1):268

Background

DNA microarray technology allows for the measurement of genome-wide expression patterns. Within the resultant mass of data lies the problem of analyzing and presenting information on this genomic scale, and a first step towards the rapid and comprehensive interpretation of this data is gene clustering with respect to the expression patterns. Classifying genes into clusters can lead to interesting biological insights. In this study, we describe an iterative clustering approach to uncover biologically coherent structures from DNA microarray data based on a novel clustering algorithm EP_GOS_Clust. 相似文献

7.

Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments

Tianqing Liu Nan Lin Ningzhong Shi Baoxue Zhang 《BMC bioinformatics》2009,10(1):146-20

Background

Time-course microarray experiments produce vector gene expression profiles across a series of time points. Clustering genes based on these profiles is important in discovering functional related and co-regulated genes. Early developed clustering algorithms do not take advantage of the ordering in a time-course study, explicit use of which should allow more sensitive detection of genes that display a consistent pattern over time. Peddada et al. [1] proposed a clustering algorithm that can incorporate the temporal ordering using order-restricted statistical inference. This algorithm is, however, very time-consuming and hence inapplicable to most microarray experiments that contain a large number of genes. Its computational burden also imposes difficulty to assess the clustering reliability, which is a very important measure when clustering noisy microarray data. 相似文献

8.

New resampling method for evaluating stability of clusters

Irina M Gana Dresen Tanja Boes Johannes Huesing Markus Neuhaeuser Karl-Heinz Joeckel 《BMC bioinformatics》2008,9(1):42

Background

Hierarchical clustering is a widely applied tool in the analysis of microarray gene expression data. The assessment of cluster stability is a major challenge in clustering procedures. Statistical methods are required to distinguish between real and random clusters. Several methods for assessing cluster stability have been published, including resampling methods such as the bootstrap. 相似文献

9.

Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis

Tae-Min Kim Yeun-Jun Chung Mun-Gan Rhyu Myeong Ho Jung 《BMC bioinformatics》2007,8(1):453

Background

Gene clustering has been widely used to group genes with similar expression pattern in microarray data analysis. Subsequent enrichment analysis using predefined gene sets can provide clues on which functional themes or regulatory sequence motifs are associated with individual gene clusters. In spite of the potential utility, gene clustering and enrichment analysis have been used in separate platforms, thus, the development of integrative algorithm linking both methods is highly challenging. 相似文献

10.

Effect of data normalization on fuzzy clustering of DNA microarray data

Seo Young Kim Jae Won Lee Jong Sung Bae 《BMC bioinformatics》2006,7(1):134-14

Background

Microarray technology has made it possible to simultaneously measure the expression levels of large numbers of genes in a short time. Gene expression data is information rich; however, extensive data mining is required to identify the patterns that characterize the underlying mechanisms of action. Clustering is an important tool for finding groups of genes with similar expression patterns in microarray data analysis. However, hard clustering methods, which assign each gene exactly to one cluster, are poorly suited to the analysis of microarray datasets because in such datasets the clusters of genes frequently overlap. 相似文献

11.

Iterative class discovery and feature selection using Minimal Spanning Trees

Sudhir?Varma Email author Richard?Simon 《BMC bioinformatics》2004,5(1):126

Background

Clustering is one of the most commonly used methods for discovering hidden structure in microarray gene expression data. Most current methods for clustering samples are based on distance metrics utilizing all genes. This has the effect of obscuring clustering in samples that may be evident only when looking at a subset of genes, because noise from irrelevant genes dominates the signal from the relevant genes in the distance calculation. 相似文献

12.

Identification of significant periodic genes in microarray gene expression data

Jie?Chen Email author 《BMC bioinformatics》2005,6(1):286

Background

One frequent application of microarray experiments is in the study of monitoring gene activities in a cell during cell cycle or cell division. A new challenge for analyzing the microarray experiments is to identify genes that are statistically significantly periodically expressed during the cell cycle. Such a challenge occurs due to the large number of genes that are simultaneously measured, a moderate to small number of measurements per gene taken at different time points, and high levels of non-normal random noises inherited in the data. 相似文献

13.

Methods for evaluating gene expression from Affymetrix microarray datasets

Ning Jiang Lindsey J Leach Xiaohua Hu Elena Potokina Tianye Jia Arnis Druka Robbie Waugh Michael J Kearsey Zewei W Luo 《BMC bioinformatics》2008,9(1):284

相似文献

14.

FLAME,a novel fuzzy clustering method for the analysis of DNA microarray data 总被引：3，自引：0，他引：3

Limin Fu Enzo Medico 《BMC bioinformatics》2007,8(1):3

Background

Data clustering analysis has been extensively applied to extract information from gene expression profiles obtained with DNA microarrays. To this aim, existing clustering approaches, mainly developed in computer science, have been adapted to microarray data analysis. However, previous studies revealed that microarray datasets have very diverse structures, some of which may not be correctly captured by current clustering methods. We therefore approached the problem from a new starting point, and developed a clustering algorithm designed to capture dataset-specific structures at the beginning of the process. 相似文献

15.

Characterization and simulation of cDNA microarray spots using a novel mathematical model

Hye Young Kim Seo Eun Lee Min Jung Kim Jin Il Han Bo Kyung Kim Yong Sung Lee Young Seek Lee Jin Hyuk Kim 《BMC bioinformatics》2007,8(1):485

Background

The quality of cDNA microarray data is crucial for expanding its application to other research areas, such as the study of gene regulatory networks. Despite the fact that a number of algorithms have been suggested to increase the accuracy of microarray gene expression data, it is necessary to obtain reliable microarray images by improving wet-lab experiments. As the first step of a cDNA microarray experiment, spotting cDNA probes is critical to determining the quality of spot images. 相似文献

16.

R/BHC: fast Bayesian hierarchical clustering for microarray data

Richard S Savage Katherine Heller Yang Xu Zoubin Ghahramani William M Truman Murray Grant Katherine J Denby David L Wild 《BMC bioinformatics》2009,10(1):242

Background

Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtained. 相似文献

17.

A permutation-based multiple testing method for time-course microarray experiments

Insuk Sohn Kouros Owzar Stephen L George Sujong Kim Sin-Ho Jung 《BMC bioinformatics》2009,10(1):336

Background

Time-course microarray experiments are widely used to study the temporal profiles of gene expression. Storey et al. (2005) developed a method for analyzing time-course microarray studies that can be applied to discovering genes whose expression trajectories change over time within a single biological group, or those that follow different time trajectories among multiple groups. They estimated the expression trajectories of each gene using natural cubic splines under the null (no time-course) and alternative (time-course) hypotheses, and used a goodness of fit test statistic to quantify the discrepancy. The null distribution of the statistic was approximated through a bootstrap method. Gene expression levels in microarray data are often complicatedly correlated. An accurate type I error control adjusting for multiple testing requires the joint null distribution of test statistics for a large number of genes. For this purpose, permutation methods have been widely used because of computational ease and their intuitive interpretation. 相似文献

18.

Clustering gene expression data with a penalized graph-based metric

Ariel E Bayá Pablo M Granitto 《BMC bioinformatics》2011,12(1):2

Background

The search for cluster structure in microarray datasets is a base problem for the so-called "-omic sciences". A difficult problem in clustering is how to handle data with a manifold structure, i.e. data that is not shaped in the form of compact clouds of points, forming arbitrary shapes or paths embedded in a high-dimensional space, as could be the case of some gene expression datasets. 相似文献

19.

Linking microarray reporters with protein functions

Stan Gaj Arie van Erk Rachel IM van Haaften Chris TA Evelo 《BMC bioinformatics》2007,8(1):360

Background

The analysis of microarray experiments requires accurate and up-to-date functional annotation of the microarray reporters to optimize the interpretation of the biological processes involved. Pathway visualization tools are used to connect gene expression data with existing biological pathways by using specific database identifiers that link reporters with elements in the pathways. 相似文献

20.

Analyzing M-CSF dependent monocyte/macrophage differentiation: Expression modes and meta-modes derived from an independent component analysis

Dominik Lutter Peter Ugocsai Margot Grandl Evelyn Orso Fabian Theis Elmar W Lang Gerd Schmitz 《BMC bioinformatics》2008,9(1):100

Background

The analysis of high-throughput gene expression data sets derived from microarray experiments still is a field of extensive investigation. Although new approaches and algorithms are published continuously, mostly conventional methods like hierarchical clustering algorithms or variance analysis tools are used. Here we take a closer look at independent component analysis (ICA) which is already discussed widely as a new analysis approach. However, deep exploration of its applicability and relevance to concrete biological problems is still missing. In this study, we investigate the relevance of ICA in gaining new insights into well characterized regulatory mechanisms of M-CSF dependent macrophage differentiation. 相似文献