1.

ParaKMeans: Implementation of a parallelized K-means algorithm suitable for general laboratory use

Piotr Kraj Ashok Sharma Nikhil Garge Robert Podolsky Richard A McIndoe 《BMC bioinformatics》2008,9(1):200

2.

Reuse of imputed data in microarray analysis increases imputation efficiency

Ki-Yeol Kim Byoung-Jin Kim Gwan-Su Yi 《BMC bioinformatics》2004,5(1):160

Background

The imputation of missing values is necessary for the efficient use of DNA microarray data, because many clustering algorithms and some statistical analyses require a complete data set. A few imputation methods for DNA microarray data have been introduced, but the efficiency of the methods was low and the validity of imputed values in these methods had not been fully checked.

3.

New resampling method for evaluating stability of clusters

Irina M Gana Dresen Tanja Boes Johannes Huesing Markus Neuhaeuser Karl-Heinz Joeckel 《BMC bioinformatics》2008,9(1):42

Background

Hierarchical clustering is a widely applied tool in the analysis of microarray gene expression data. The assessment of cluster stability is a major challenge in clustering procedures. Statistical methods are required to distinguish between real and random clusters. Several methods for assessing cluster stability have been published, including resampling methods such as the bootstrap.

4.

Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures

Meng P Tan Erin N Smith James R Broach Christodoulos A Floudas 《BMC bioinformatics》2008,9(1):268

Background

DNA microarray technology allows for the measurement of genome-wide expression patterns. Within the resultant mass of data lies the problem of analyzing and presenting information on this genomic scale, and a first step towards the rapid and comprehensive interpretation of this data is gene clustering with respect to the expression patterns. Classifying genes into clusters can lead to interesting biological insights. In this study, we describe an iterative clustering approach to uncover biologically coherent structures from DNA microarray data based on a novel clustering algorithm, EP_GOS_Clust.

5.

Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis

Tae-Min Kim Yeun-Jun Chung Mun-Gan Rhyu Myeong Ho Jung 《BMC bioinformatics》2007,8(1):453

Background

Gene clustering has been widely used to group genes with similar expression patterns in microarray data analysis. Subsequent enrichment analysis using predefined gene sets can provide clues on which functional themes or regulatory sequence motifs are associated with individual gene clusters. In spite of the potential utility, gene clustering and enrichment analysis have been used on separate platforms; thus, the development of an integrative algorithm linking both methods is highly challenging.

6.

Identification of coherent patterns in gene expression data using an efficient biclustering algorithm and parallel coordinate visualization (total citations: 1; self-citations: 0; citations by others: 1)

Kin-On Cheng Ngai-Fong Law Wan-Chi Siu Alan Wee-Chung Liew 《BMC bioinformatics》2008,9(1):210

Background

The DNA microarray technology allows the measurement of expression levels of thousands of genes under tens or hundreds of different conditions. In microarray data, genes with similar functions usually co-express under certain conditions only [1]. Thus, biclustering, which clusters genes and conditions simultaneously, is preferred over the traditional clustering technique in discovering these coherent genes. Various biclustering algorithms have been developed using different bicluster formulations. Unfortunately, many useful formulations result in NP-complete problems. In this article, we investigate an efficient method for identifying a popular type of bicluster, the additive model. Furthermore, parallel coordinate (PC) plots are used for bicluster visualization and analysis.
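To make the additive model concrete: a submatrix is perfectly additive when each entry can be written as a_ij = mu + alpha_i + beta_j, which is equivalent to every residue a_ij - (row mean) - (column mean) + (overall mean) being zero. The sketch below scores a candidate submatrix with the mean squared residue, a score commonly used in the biclustering literature. It is an illustration only, not the algorithm of the paper listed above, and the function name `additive_residue` is an assumption.

```python
def additive_residue(matrix):
    """Mean squared residue of a submatrix under the additive model.

    For a_ij = mu + alpha_i + beta_j, each residue
    a_ij - row_mean_i - col_mean_j + overall_mean is zero,
    so a perfectly additive bicluster scores 0.
    Assumes a non-empty rectangular list of lists of numbers.
    """
    n_rows, n_cols = len(matrix), len(matrix[0])
    row_means = [sum(row) / n_cols for row in matrix]
    col_means = [sum(matrix[i][j] for i in range(n_rows)) / n_rows
                 for j in range(n_cols)]
    overall = sum(row_means) / n_rows

    total = 0.0
    for i in range(n_rows):
        for j in range(n_cols):
            residue = matrix[i][j] - row_means[i] - col_means[j] + overall
            total += residue * residue
    return total / (n_rows * n_cols)


# Rows differ only by a constant offset, so the block is additive.
additive_block = [[1, 2, 4], [3, 4, 6], [0, 1, 3]]
# Here the second row is not a shifted copy of the first.
non_additive_block = [[1, 2], [2, 5]]
```

A score near zero suggests that the selected genes and conditions follow the additive pattern; larger scores indicate departures from it.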

7.

R/BHC: fast Bayesian hierarchical clustering for microarray data

Richard S Savage Katherine Heller Yang Xu Zoubin Ghahramani William M Truman Murray Grant Katherine J Denby David L Wild 《BMC bioinformatics》2009,10(1):242

Background

Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data analysis, little attention has been paid to uncertainty in the results obtained.

8.

MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering

Eun-Youn Kim Seon-Young Kim Daniel Ashlock Dougu Nam 《BMC bioinformatics》2009,10(1):260

Background

Uncovering subtypes of disease from microarray samples has important clinical implications such as survival time and sensitivity of individual patients to specific therapies. Unsupervised clustering methods have been used to classify this type of data. However, most existing methods focus on clusters with compact shapes and do not reflect the geometric complexity of the high dimensional microarray clusters, which limits their performance.

9.

GeneBins: a database for classifying gene expression data, with application to plant genome arrays

Nicolas Goffard Georg Weiller 《BMC bioinformatics》2007,8(1):87

Background

To interpret microarray experiments, several ontological analysis tools have been developed. However, current tools are limited to specific organisms.

10.

Portraits of breast cancer progression

Gul S Dalgin Gabriela Alexe Daniel Scanfeld Pablo Tamayo Jill P Mesirov Shridar Ganesan Charles DeLisi Gyan Bhanot 《BMC bioinformatics》

Background

Clustering analysis of microarray data is often criticized for giving ambiguous results because of sensitivity to data perturbation or clustering techniques used. In this paper, we describe a new method based on principal component analysis and ensemble consensus clustering that avoids these problems.

11.

puma: a Bioconductor package for propagating uncertainty in microarray analysis

Richard D Pearson Xuejun Liu Guido Sanguinetti Marta Milo Neil D Lawrence Magnus Rattray 《BMC bioinformatics》2009,10(1):211

Background

Most analyses of microarray data are based on point estimates of expression levels and ignore the uncertainty of such estimates. By determining uncertainties from Affymetrix GeneChip data and propagating these uncertainties to downstream analyses it has been shown that we can improve results of differential expression detection, principal component analysis and clustering. Previously, implementations of these uncertainty propagation methods have only been available as separate packages, written in different languages. Previous implementations have also suffered from being very costly to compute, and in the case of differential expression detection, have been limited in the experimental designs to which they can be applied.

12.

Microarray data mining using landmark gene-guided clustering

Pankaj Chopra Jaewoo Kang Jiong Yang HyungJun Cho Heenam Stanley Kim Min-Goo Lee 《BMC bioinformatics》2008,9(1):92

Background

Clustering is a popular data exploration technique widely used in microarray data analysis. Most conventional clustering algorithms, however, generate only one set of clusters independent of the biological context of the analysis. This is often inadequate to explore data from different biological perspectives and gain new insights. We propose a new clustering model that can generate multiple versions of different clusters from a single dataset, each of which highlights a different aspect of the given dataset.

13.

Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments

Tianqing Liu Nan Lin Ningzhong Shi Baoxue Zhang 《BMC bioinformatics》2009,10(1):146-20

Background

Time-course microarray experiments produce vector gene expression profiles across a series of time points. Clustering genes based on these profiles is important in discovering functionally related and co-regulated genes. Early clustering algorithms do not take advantage of the ordering in a time-course study, explicit use of which should allow more sensitive detection of genes that display a consistent pattern over time. Peddada et al. [1] proposed a clustering algorithm that can incorporate the temporal ordering using order-restricted statistical inference. This algorithm is, however, very time-consuming and hence inapplicable to most microarray experiments that contain a large number of genes. Its computational burden also makes it difficult to assess clustering reliability, which is a very important measure when clustering noisy microarray data.

14.

Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes

Susmita Datta Somnath Datta 《BMC bioinformatics》2006,7(1):397-9

Background

A cluster analysis is the most commonly performed procedure (often regarded as a first step) on a set of gene expression profiles. In most cases, a post hoc analysis is done to see if the genes in the same clusters can be functionally correlated. While past successes of such analyses have often been reported in a number of microarray studies (most of which used the standard hierarchical clustering, UPGMA, with one minus the Pearson correlation coefficient as a measure of dissimilarity), such groupings can often be misleading. More importantly, a systematic evaluation of the entire set of clusters produced by such unsupervised procedures is necessary since they also contain genes that are seemingly unrelated or may have more than one common function. Here we quantify the performance of a given unsupervised clustering algorithm applied to a given microarray study in terms of its ability to produce biologically meaningful clusters using a reference set of functional classes. Such a reference set may come from prior biological knowledge specific to a microarray study or may be formed using the growing databases of gene ontologies (GO) for the annotated genes of the relevant species.
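The dissimilarity mentioned in the abstract above, one minus the Pearson correlation coefficient, can be sketched in a few lines. This is a minimal illustration that assumes each expression profile is a non-constant list of numbers of equal length; the function names are hypothetical, and the hierarchical clustering step itself (UPGMA) is not shown.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles.

    Assumes neither profile is constant (otherwise the denominator is zero).
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def correlation_dissimilarity(x, y):
    """One minus the Pearson correlation: 0 for identical shape, 2 for opposite."""
    return 1.0 - pearson(x, y)


def dissimilarity_matrix(profiles):
    """Pairwise dissimilarities, the usual input to hierarchical clustering."""
    return [[0.0 if i == j else correlation_dissimilarity(p, q)
             for j, q in enumerate(profiles)]
            for i, p in enumerate(profiles)]


rising = [1.0, 2.0, 3.0, 4.0]
rising_scaled = [2.0, 4.0, 6.0, 8.0]   # same shape, different magnitude
falling = [4.0, 3.0, 2.0, 1.0]         # opposite shape
```

Because the measure depends only on the shape of a profile, two genes whose expression differs by a scale factor are treated as identical, while anti-correlated genes sit at the maximum distance of 2.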

15.

ArrayMining: a modular web-application for microarray analysis combining ensemble and consensus methods with cross-study normalization

Enrico Glaab Jonathan M Garibaldi Natalio Krasnogor 《BMC bioinformatics》2009,10(1):358

Background

Statistical analysis of DNA microarray data provides a valuable diagnostic tool for the investigation of genetic components of diseases. To take advantage of the multitude of available data sets and analysis methods, it is desirable to combine both different algorithms and data from different studies. Applying ensemble learning, consensus clustering and cross-study normalization methods for this purpose in an almost fully automated process and linking different analysis modules together under a single interface would simplify many microarray analysis tasks.

16.

Automating dChip: toward reproducible sharing of microarray data analysis

Cheng Li 《BMC bioinformatics》2008,9(1):231

Background

During the past decade, many software packages have been developed for analysis and visualization of various types of microarrays. We have developed and maintained the widely used dChip as a microarray analysis software package accessible to both biologists and data analysts. However, challenges arise when dChip users want to analyze a large number of arrays automatically and share data analysis procedures and parameters. Improvement is also needed when the dChip user support team tries to identify the causes of reported analysis errors or bugs from users.

17.

A temporal precedence based clustering method for gene expression microarray data

Ritesh Krishna Chang-Tsun Li Vicky Buchanan-Wollaston 《BMC bioinformatics》2010,11(1):68

Background

Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together according to certain characteristics. The majority of clustering techniques are based on distance or visual similarity measures which may not be suitable for clustering of temporal microarray data where the sequential nature of time is important. We present a Granger causality based technique to cluster temporal microarray gene expression data, which measures the interdependence between two time-series by statistically testing if one time-series can be used for forecasting the other time-series or not.
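The idea of testing whether one series helps forecast another can be illustrated with a lag-1 Granger-style F statistic: fit y[t] from y[t-1] alone, fit it again with x[t-1] added, and compare the residual sums of squares. The sketch below is an assumption-laden illustration (single lag, ordinary least squares via the normal equations, simulated data), not the clustering method of the paper listed above; all function names are hypothetical.

```python
import random


def _ols_rss(rows, target):
    """Residual sum of squares of a least-squares fit of target on rows."""
    k = len(rows[0])
    # Augmented normal equations: (X^T X | X^T y).
    a = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(rows, target))]
         for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        pivot = max(range(c, k), key=lambda r: abs(a[r][c]))
        a[c], a[pivot] = a[pivot], a[c]
        for r in range(k):
            if r != c:
                factor = a[r][c] / a[c][c]
                a[r] = [u - factor * v for u, v in zip(a[r], a[c])]
    beta = [a[i][k] / a[i][i] for i in range(k)]
    return sum((t - sum(b * v for b, v in zip(beta, r))) ** 2
               for r, t in zip(rows, target))


def granger_f(x, y):
    """Lag-1 F statistic for the hypothesis that x helps forecast y."""
    target = y[1:]
    restricted = [[1.0, y[t - 1]] for t in range(1, len(y))]
    full = [[1.0, y[t - 1], x[t - 1]] for t in range(1, len(y))]
    rss_restricted = _ols_rss(restricted, target)
    rss_full = _ols_rss(full, target)
    dof = len(target) - 3  # observations minus parameters in the full model
    return (rss_restricted - rss_full) / (rss_full / dof)


def simulate_lagged_pair(n=60, seed=0):
    """Simulated data: y follows x with a one-step delay plus small noise."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [rng.gauss(0.0, 0.1)]
    y += [0.9 * x[t - 1] + rng.gauss(0.0, 0.1) for t in range(1, n)]
    return x, y
```

On the simulated pair, `granger_f(x, y)` should be far larger than `granger_f(y, x)`, reflecting that x leads y and not the reverse. A real analysis would compare the statistic against an F distribution and consider more than one lag.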

18.

SED, a normalization free method for DNA microarray data analysis

Huajun Wang Hui Huang 《BMC bioinformatics》2004,5(1):121

Background

Analysis of DNA microarray data usually begins with a normalization step where intensities of different arrays are adjusted to the same scale so that the intensity levels from different arrays can be compared with one another. Both simple total array intensity-based as well as more complex "local intensity level" dependent normalization methods have been developed, some of which are widely used. Much less developed methods for microarray data analysis include those that bypass the normalization step and therefore yield results that are not confounded by potential normalization errors.
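As a point of reference for the simple total array intensity-based normalization mentioned above, the following sketch rescales every array so that its summed intensity equals the mean total across arrays. It illustrates the conventional step that a normalization-free method bypasses; it is not the SED method, and the function name and the choice of the mean total as the common target are assumptions.

```python
def total_intensity_normalize(arrays):
    """Rescale each array so all arrays share the same total intensity.

    `arrays` is a list of equal-length lists of positive intensities,
    one list per array. The common target is the mean of the totals.
    """
    totals = [sum(array) for array in arrays]
    target = sum(totals) / len(totals)
    return [[value * target / total for value in array]
            for array, total in zip(arrays, totals)]


# The second array is uniformly twice as bright as the first.
raw = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0]]
normalized = total_intensity_normalize(raw)
```

After rescaling, a uniform brightness difference between arrays disappears, which is the intended effect, but any genuine global shift in expression is removed as well. That is one way normalization can confound downstream results.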

19.

Cluster stability scores for microarray data in cancer studies

Mark Smolkin Debashis Ghosh 《BMC bioinformatics》2003,4(1):36

Background

A potential benefit of profiling of tissue samples using microarrays is the generation of molecular fingerprints that will define subtypes of disease. Hierarchical clustering has been the primary analytical tool used to define disease subtypes from microarray experiments in cancer settings. Assessing cluster reliability poses a major complication in analyzing output from clustering procedures. While most work has focused on estimating the number of clusters in a dataset, the question of stability of individual-level clusters has not been addressed.

20.

Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein

Gajendra PS Raghava Joon H Han 《BMC bioinformatics》2005,6(1):59

Background

A large number of papers have been published on the analysis of microarray data, with particular emphasis on normalization of data, detection of differentially expressed genes, clustering of genes and regulatory networks. On the other hand, there are only a few studies on the relation between expression level and composition of nucleotide/protein sequence using expression data. There is a need to understand why particular genes/proteins express more in particular conditions. In this study, we analyze 3468 genes of Saccharomyces cerevisiae obtained from Holstege et al. (1998) to understand the relationship between expression level and amino acid composition.
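The amino acid and dipeptide compositions named in the title can be computed directly from a protein sequence. The sketch below assumes a non-empty sequence written in the 20-letter standard amino acid alphabet and reports percentages; the function names are hypothetical, and how the study relates these features to expression level is not shown.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def amino_acid_composition(sequence):
    """Percentage of each standard amino acid in the sequence.

    Non-standard letters are ignored. Assumes at least one standard residue.
    """
    counts = Counter(sequence.upper())
    length = sum(counts[aa] for aa in AMINO_ACIDS)
    return {aa: 100.0 * counts[aa] / length for aa in AMINO_ACIDS}


def dipeptide_composition(sequence):
    """Percentage of each adjacent residue pair that occurs in the sequence.

    Pairs containing a non-standard letter are skipped; pairs that never
    occur are omitted from the result. Assumes at least one valid pair.
    """
    seq = sequence.upper()
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    valid = [p for p in pairs if all(c in AMINO_ACIDS for c in p)]
    counts = Counter(valid)
    return {pair: 100.0 * n / len(valid) for pair, n in counts.items()}


example_composition = amino_acid_composition("AACD")
example_dipeptides = dipeptide_composition("AACD")
```

The amino acid composition is a fixed-length vector of 20 values per protein, and the full dipeptide composition has up to 400, so either can serve as a feature vector for correlation or prediction against measured expression levels.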