期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Integrated analysis of DNA copy number and gene expression microarray data using gene sets

Renée X Menezes Marten Boetzer Melle Sieswerda Gert-Jan B van Ommen Judith M Boer 《BMC bioinformatics》2009,10(1):203-15

Background

Genes that play an important role in tumorigenesis are expected to show association between DNA copy number and RNA expression. Optimal power to find such associations can only be achieved if analysing copy number and gene expression jointly. Furthermore, some copy number changes extend over larger chromosomal regions affecting the expression levels of multiple resident genes. 相似文献

2.

Integrated analysis of gene expression and copy number data on gene shaving using independent component analysis

Sheng J Deng HW Calhoun VD Wang YP 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2011,8(6):1568-1579

DNA microarray gene expression and microarray-based comparative genomic hybridization (aCGH) have been widely used for biomedical discovery. Because of the large number of genes and the complex nature of biological networks, various analysis methods have been proposed. One such method is "gene shaving," a procedure which identifies subsets of the genes with coherent expression patterns and large variation across samples. Since combining genomic information from multiple sources can improve classification and prediction of diseases, in this paper we proposed a new method, "ICA gene shaving" (ICA, independent component analysis), for jointly analyzing gene expression and copy number data. First we used ICA to analyze joint measurements, gene expression and copy number, of a biological system and project the data onto statistically independent biological processes. Next, we used these results to identify patterns of variation in the data and then applied an iterative shaving method. We investigated the properties of our proposed method by analyzing both simulated and real data. We demonstrated that the robustness of our method to noise using simulated data. Using breast cancer data, we showed that our method is superior to the Generalized Singular Value Decomposition (GSVD) gene shaving method for identifying genes associated with breast cancer. 相似文献

3.

MICRAT: a novel algorithm for inferring gene regulatory networks using time series gene expression data

Bei Yang Yaohui Xu Andrew Maxwell Wonryull Koh Ping Gong Chaoyang Zhang 《BMC systems biology》2018,12(7):115

相似文献

4.

Classification using functional data analysis for temporal gene expression data 总被引：1，自引：0，他引：1

Leng X Müller HG 《Bioinformatics (Oxford, England)》2006,22(1):68-76

MOTIVATION: Temporal gene expression profiles provide an important characterization of gene function, as biological systems are predominantly developmental and dynamic. We propose a method of classifying collections of temporal gene expression curves in which individual expression profiles are modeled as independent realizations of a stochastic process. The method uses a recently developed functional logistic regression tool based on functional principal components, aimed at classifying gene expression curves into known gene groups. The number of eigenfunctions in the classifier can be chosen by leave-one-out cross-validation with the aim of minimizing the classification error. RESULTS: We demonstrate that this methodology provides low-error-rate classification for both yeast cell-cycle gene expression profiles and Dictyostelium cell-type specific gene expression patterns. It also works well in simulations. We compare our functional principal components approach with a B-spline implementation of functional discriminant analysis for the yeast cell-cycle data and simulations. This indicates comparative advantages of our approach which uses fewer eigenfunctions/base functions. The proposed methodology is promising for the analysis of temporal gene expression data and beyond. AVAILABILITY: MATLAB programs are available upon request. 相似文献

5.

Epigenetic manipulation of gene expression: a toolkit for cell biologists

下载免费PDF全文

Juliano RL Dixit VR Kang H Kim TY Miyamoto Y Xu D 《The Journal of cell biology》2005,169(6):847-857

相似文献

6.

lga972: a cross-platform application for optimizing LD studies using a genetic algorithm

Robles JR van den Oord EJ 《Bioinformatics (Oxford, England)》2004,20(17):3244-3245

lga972 is a user-friendly cross-platform application with a graphical interface for determining the design features of two-stage genetic linkage disequilibrium studies that minimize the genotyping burden. 相似文献

7.

基于信息论k—modes聚类法的基因表达数据分析

刘文远李建飞王宝文于家新《生物信息学》2009,7(2):95-98

k-均值聚类算法是一种广泛应用于基因表达数据聚类分析中的迭代变换算法,它通常用距离法来表示基因间的关系,但不能有效的反应基因间的相互依赖的关系。为此,提出基于信息论的k-modes聚类算法,克服了以上缺点。另外,还引入了伪F统计量,一方面,可以对空间中有部分重叠的点进行有效的分类;另一方面,可以给出最佳聚类数目,从而弥补了k-modes聚类法的不足。使其成为一种非常有效的算法,从而达到较优的聚类效果。相似文献

8.

QUBIC: a qualitative biclustering algorithm for analyses of gene expression data

Guojun Li Qin Ma Haibao Tang Andrew H. Paterson Ying Xu 《Nucleic acids research》2009,37(15):e101

Biclustering extends the traditional clustering techniques by attempting to find (all) subgroups of genes with similar expression patterns under to-be-identified subsets of experimental conditions when applied to gene expression data. Still the real power of this clustering strategy is yet to be fully realized due to the lack of effective and efficient algorithms for reliably solving the general biclustering problem. We report a QUalitative BIClustering algorithm (QUBIC) that can solve the biclustering problem in a more general form, compared to existing algorithms, through employing a combination of qualitative (or semi-quantitative) measures of gene expression data and a combinatorial optimization technique. One key unique feature of the QUBIC algorithm is that it can identify all statistically significant biclusters including biclusters with the so-called ‘scaling patterns’, a problem considered to be rather challenging; another key unique feature is that the algorithm solves such general biclustering problems very efficiently, capable of solving biclustering problems with tens of thousands of genes under up to thousands of conditions in a few minutes of the CPU time on a desktop computer. We have demonstrated a considerably improved biclustering performance by our algorithm compared to the existing algorithms on various benchmark sets and data sets of our own. QUBIC was written in ANSI C and tested using GCC (version 4.1.2) on Linux. Its source code is available at: http://csbl.bmb.uga.edu/∼maqin/bicluster. A server version of QUBIC is also available upon request. 相似文献

9.

NGSQC: cross-platform quality analysis pipeline for deep sequencing data 总被引：1，自引：0，他引：1

Dai M Thompson RC Maher C Contreras-Galindo R Kaplan MH Markovitz DM Omenn G Meng F 《BMC genomics》2010,11(Z4):S7

相似文献

10.

Pyicos: a versatile toolkit for the analysis of high-throughput sequencing data

Althammer S González-Vallinas J Ballaré C Beato M Eyras E 《Bioinformatics (Oxford, England)》2011,27(24):3333-3340

相似文献

11.

Multiple interval mapping for gene expression QTL analysis

Wei Zou Zhao-Bang Zeng 《Genetica》2009,137(2):125-134

To find the correlations between genome-wide gene expression variations and sequence polymorphisms in inbred cross populations, we developed a statistical method to claim expression quantitative trait loci (eQTL) in a genome. The method is based on multiple interval mapping (MIM), a model selection procedure, and uses false discovery rate (FDR) to measure the statistical significance of the large number of eQTL. We compared our method with a similar procedure proposed by Storey et al. and found that our method can be more powerful. We identified the features in the two methods that resulted in different statistical powers for eQTL detection, and confirmed them by simulation. We organized our computational procedure in an R package which can estimate FDR for positive findings from similar model selection procedures. The R package, MIM-eQTL, can be found at . 相似文献

12.

GenClust: A genetic algorithm for clustering gene expression data

Vito?Di Gesú Raffaele?Giancarlo Email author Giosué?Lo Bosco Alessandra?Raimondi Davide?Scaturro 《BMC bioinformatics》2005,6(1):289

Background

Clustering is a key step in the analysis of gene expression data, and in fact, many classical clustering algorithms are used, or more innovative ones have been designed and validated for the task. Despite the widespread use of artificial intelligence techniques in bioinformatics and, more generally, data analysis, there are very few clustering algorithms based on the genetic paradigm, yet that paradigm has great potential in finding good heuristic solutions to a difficult optimization problem such as clustering. 相似文献

13.

An improved algorithm for clustering gene expression data 总被引：1，自引：0，他引：1

Bandyopadhyay S Mukhopadhyay A Maulik U 《Bioinformatics (Oxford, England)》2007,23(21):2859-2865

MOTIVATION: Recent advancements in microarray technology allows simultaneous monitoring of the expression levels of a large number of genes over different time points. Clustering is an important tool for analyzing such microarray data, typical properties of which are its inherent uncertainty, noise and imprecision. In this article, a two-stage clustering algorithm, which employs a recently proposed variable string length genetic scheme and a multiobjective genetic clustering algorithm, is proposed. It is based on the novel concept of points having significant membership to multiple classes. An iterated version of the well-known Fuzzy C-Means is also utilized for clustering. RESULTS: The significant superiority of the proposed two-stage clustering algorithm as compared to the average linkage method, Self Organizing Map (SOM) and a recently developed weighted Chinese restaurant-based clustering method (CRC), widely used methods for clustering gene expression data, is established on a variety of artificial and publicly available real life data sets. The biological relevance of the clustering solutions are also analyzed. 相似文献

14.

PCA disjoint models for multiclass cancer analysis using gene expression data 总被引：4，自引：0，他引：4

Bicciato S Luchini A Di Bello C 《Bioinformatics (Oxford, England)》2003,19(5):571-578

MOTIVATION: Microarray expression profiling appears particularly promising for a deeper understanding of cancer biology and to identify molecular signatures supporting the histological classification schemes of neoplastic specimens. However, molecular diagnostics based on microarray data presents major challenges due to the overwhelming number of variables and the complex, multiclass nature of tumor samples. Thus, the development of marker selection methods, that allow the identification of those genes that are most likely to confer high classification accuracy of multiple tumor types, and of multiclass classification schemes is of paramount importance. RESULTS: A computational procedure for marker identification and for classification of multiclass gene expression data through the application of disjoint principal component models is described. The identified features represent a rational and dimensionally reduced base for understanding the basic biology of diseases, defining targets for therapeutic intervention, and developing diagnostic tools for the identification and classification of multiple pathological states. The method has been tested on different microarray data sets obtained from various human tumor samples. The results demonstrate that this procedure allows the identification of specific phenotype markers and can classify previously unseen instances in the presence of multiple classes. 相似文献

15.

Integrative Array Analyzer: a software package for analysis of cross-platform and cross-species microarray data

Pan F Kamath K Zhang K Pulapura S Achar A Nunez-Iglesias J Huang Y Yan X Han J Hu H Xu M Hu J Zhou XJ 《Bioinformatics (Oxford, England)》2006,22(13):1665-1667

相似文献

16.

OpenChrom: a cross-platform open source software for the mass spectrometric analysis of chromatographic data

Philip Wenig Juergen Odermatt 《BMC bioinformatics》2010,11(1):405

Background

Today, data evaluation has become a bottleneck in chromatographic science. Analytical instruments equipped with automated samplers yield large amounts of measurement data, which needs to be verified and analyzed. Since nearly every GC/MS instrument vendor offers its own data format and software tools, the consequences are problems with data exchange and a lack of comparability between the analytical results. To challenge this situation a number of either commercial or non-profit software applications have been developed. These applications provide functionalities to import and analyze several data formats but have shortcomings in terms of the transparency of the implemented analytical algorithms and/or are restricted to a specific computer platform. 相似文献

17.

GEDI: a user-friendly toolbox for analysis of large-scale gene expression data

André Fujita João R Sato Carlos E Ferreira Mari C Sogayar 《BMC bioinformatics》2007,8(1):457

Background

Several mathematical and statistical methods have been proposed in the last few years to analyze microarray data. Most of those methods involve complicated formulas, and software implementations that require advanced computer programming skills. Researchers from other areas may experience difficulties when they attempting to use those methods in their research. Here we present an user-friendly toolbox which allows large-scale gene expression analysis to be carried out by biomedical researchers with limited programming skills. 相似文献

18.

AMDA 2.13: A major update for automated cross-platform microarray data analysis

D Kapetis F Clarelli F Vitulli NK de Rosbo O Beretta M Foti P Ricciardi-Castagnoli F Zolezzi 《BioTechniques》2012,53(1):33-40

相似文献

19.

MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity 总被引：15，自引：0，他引：15

Wang Y Tang H Debarry JD Tan X Li J Wang X Lee TH Jin H Marler B Guo H Kissinger JC Paterson AH 《Nucleic acids research》2012,40(7):e49

MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/. 相似文献

20.

A framework for significance analysis of gene expression data using dimension reduction methods

Lars Gidskehaug Endre Anderssen Arnar Flatberg Bjørn K Alsberg 《BMC bioinformatics》2007,8(1):346

Background

The most popular methods for significance analysis on microarray data are well suited to find genes differentially expressed across predefined categories. However, identification of features that correlate with continuous dependent variables is more difficult using these methods, and long lists of significant genes returned are not easily probed for co-regulations and dependencies. Dimension reduction methods are much used in the microarray literature for classification or for obtaining low-dimensional representations of data sets. These methods have an additional interpretation strength that is often not fully exploited when expression data are analysed. In addition, significance analysis may be performed directly on the model parameters to find genes that are important for any number of categorical or continuous responses. We introduce a general scheme for analysis of expression data that combines significance testing with the interpretative advantages of the dimension reduction methods. This approach is applicable both for explorative analysis and for classification and regression problems. 相似文献