期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Enrichment of ultraconserved elements among genomic imbalances causing mental delay and congenital anomalies

Francisco Martínez Sandra Monfort Mónica Roselló Silvestre Oltra David Blesa Ramiro Quiroga Sonia Mayo Carmen Orellana 《BMC medical genomics》2010,3(1):1-6

Background

In translational cancer research, gene expression data is collected together with clinical data and genomic data arising from other chip based high throughput technologies. Software tools for the joint analysis of such high dimensional data sets together with clinical data are required.

Results

We have developed an open source software tool which provides interactive visualization capability for the integrated analysis of high-dimensional gene expression data together with associated clinical data, array CGH data and SNP array data. The different data types are organized by a comprehensive data manager. Interactive tools are provided for all graphics: heatmaps, dendrograms, barcharts, histograms, eventcharts and a chromosome browser, which displays genetic variations along the genome. All graphics are dynamic and fully linked so that any object selected in a graphic will be highlighted in all other graphics. For exploratory data analysis the software provides unsupervised data analytics like clustering, seriation algorithms and biclustering algorithms.

Conclusions

The SEURAT software meets the growing needs of researchers to perform joint analysis of gene expression, genomical and clinical data. 相似文献

2.

Genesis: cluster analysis of microarray data 总被引：26，自引：0，他引：26

Sturn A Quackenbush J Trajanoski Z 《Bioinformatics (Oxford, England)》2002,18(1):207-208

相似文献

3.

CGO: utilizing and integrating gene expression microarray data in clinical research and data management

Bumm K Zheng M Bailey C Zhan F Chiriva-Internati M Eddlemon P Terry J Barlogie B Shaughnessy JD 《Bioinformatics (Oxford, England)》2002,18(2):327-328

Clinical GeneOrganizer (CGO) is a novel windows-based archiving, organization and data mining software for the integration of gene expression profiling in clinical medicine. The program implements various user-friendly tools and extracts data for further statistical analysis. This software was written for Affymetrix GeneChip *.txt files, but can also be used for any other microarray-derived data. The MS-SQL server version acts as a data mart and links microarray data with clinical parameters of any other existing database and therefore represents a valuable tool for combining gene expression analysis and clinical disease characteristics. 相似文献

4.

Identifying differential exon splicing using linear models and correlation coefficients

Sonia H Shah Jacqueline A Pallas 《BMC bioinformatics》2009,10(1):26

Background

With the availability of the Affymetrix exon arrays a number of tools have been developed to enable the analysis. These however can be expensive or have several pre-installation requirements. This led us to develop an analysis workflow for analysing differential splicing using freely available software packages that are already being widely used for gene expression analysis. The workflow uses the packages in the standard installation of R and Bioconductor (BiocLite) to identify differential splicing. We use the splice index method with the LIMMA framework. The main drawback with this approach is that it relies on accurate estimates of gene expression from the probe-level data. Methods such as RMA and PLIER may misestimate when a large proportion of exons are spliced. We therefore present the novel concept of a gene correlation coefficient calculated using only the probeset expression pattern within a gene. We show that genes with lower correlation coefficients are likely to be differentially spliced. 相似文献

5.

Comparative analysis of different label-free mass spectrometry based protein abundance estimates and their correlation with RNA-Seq gene expression data

Ning K Fermin D Nesvizhskii AI 《Journal of proteome research》2012,11(4):2261-2271

相似文献

6.

RNA-Seq vs Dual- and Single-Channel Microarray Data: Sensitivity Analysis for Differential Expression and Clustering

Alina S?rbu Gráinne Kerr Martin Crane Heather J. Ruskin 《PloS one》2012,7(12)

With the fast development of high-throughput sequencing technologies, a new generation of genome-wide gene expression measurements is under way. This is based on mRNA sequencing (RNA-seq), which complements the already mature technology of microarrays, and is expected to overcome some of the latter’s disadvantages. These RNA-seq data pose new challenges, however, as strengths and weaknesses have yet to be fully identified. Ideally, Next (or Second) Generation Sequencing measures can be integrated for more comprehensive gene expression investigation to facilitate analysis of whole regulatory networks. At present, however, the nature of these data is not very well understood. In this paper we study three alternative gene expression time series datasets for the Drosophila melanogaster embryo development, in order to compare three measurement techniques: RNA-seq, single-channel and dual-channel microarrays. The aim is to study the state of the art for the three technologies, with a view of assessing overlapping features, data compatibility and integration potential, in the context of time series measurements. This involves using established tools for each of the three different technologies, and technical and biological replicates (for RNA-seq and microarrays, respectively), due to the limited availability of biological RNA-seq replicates for time series data. The approach consists of a sensitivity analysis for differential expression and clustering. In general, the RNA-seq dataset displayed highest sensitivity to differential expression. The single-channel data performed similarly for the differentially expressed genes common to gene sets considered. Cluster analysis was used to identify different features of the gene space for the three datasets, with higher similarities found for the RNA-seq and single-channel microarray dataset. 相似文献

7.

A Web-based and Grid-enabled dChip version for the analysis of large sets of gene expression data

Luca Corradi Marco Fato Ivan Porro Silvia Scaglione Livia Torterolo 《BMC bioinformatics》2008,9(1):480

Background

Microarray techniques are one of the main methods used to investigate thousands of gene expression profiles for enlightening complex biological processes responsible for serious diseases, with a great scientific impact and a wide application area. Several standalone applications had been developed in order to analyze microarray data. Two of the most known free analysis software packages are the R-based Bioconductor and dChip. The part of dChip software concerning the calculation and the analysis of gene expression has been modified to permit its execution on both cluster environments (supercomputers) and Grid infrastructures (distributed computing). 相似文献

8.

PlasmoDB: exploring genomics and post-genomics data of the malaria parasite, Plasmodium falciparum

Fraunholz MJ Roos DS 《Redox report : communications in free radical research》2003,8(5):317-320

相似文献

9.

Dynamic model-based clustering for time-course gene expression data 总被引：1，自引：0，他引：1

Wu FX Zhang WJ Kusalik AJ 《Journal of bioinformatics and computational biology》2005,3(4):821-836

Microarray technology has produced a huge body of time-course gene expression data. Such gene expression data has proved useful in genomic disease diagnosis and genomic drug design. The challenge is how to uncover useful information in such data. Cluster analysis has played an important role in analyzing gene expression data. Many distance/correlation- and static model-based clustering techniques have been applied to time-course expression data. However, these techniques are unable to account for the dynamics of such data. It is the dynamics that characterize the data and that should be considered in cluster analysis so as to obtain high quality clustering. This paper proposes a dynamic model-based clustering method for time-course gene expression data. The proposed method regards a time-course gene expression dataset as a set of time series, generated by a number of stochastic processes. Each stochastic process defines a cluster and is described by an autoregressive model. A relocation-iteration algorithm is proposed to identity the model parameters and posterior probabilities are employed to assign each gene to an appropriate cluster. A bootstrapping method and an average adjusted Rand index (AARI) are employed to measure the quality of clustering. Computational experiments are performed on a synthetic and three real time-course gene expression datasets to investigate the proposed method. The results show that our method allows the better quality clustering than other clustering methods (e.g. k-means) for time-course gene expression data, and thus it is a useful and powerful tool for analyzing time-course gene expression data. 相似文献

10.

BeadArray expression analysis using bioconductor

Ritchie ME Dunning MJ Smith ML Shi W Lynch AG 《PLoS computational biology》2011,7(12):e1002276

Illumina whole-genome expression BeadArrays are a popular choice in gene profiling studies. Aside from the vendor-provided software tools for analyzing BeadArray expression data (GenomeStudio/BeadStudio), there exists a comprehensive set of open-source analysis tools in the Bioconductor project, many of which have been tailored to exploit the unique properties of this platform. In this article, we explore a number of these software packages and demonstrate how to perform a complete analysis of BeadArray data in various formats. The key steps of importing data, performing quality assessments, preprocessing, and annotation in the common setting of assessing differential expression in designed experiments will be covered. 相似文献

11.

signatureSearch: environment for gene expression signature searching and functional interpretation

Yuzhu Duan Daniel S Evans Richard A Miller Nicholas J Schork Steven R Cummings Thomas Girke 《Nucleic acids research》2020,48(21):e124

signatureSearch is an R/Bioconductor package that integrates a suite of existing and novel algorithms into an analysis environment for gene expression signature (GES) searching combined with functional enrichment analysis (FEA) and visualization methods to facilitate the interpretation of the search results. In a typical GES search (GESS), a query GES is searched against a database of GESs obtained from large numbers of measurements, such as different genetic backgrounds, disease states and drug perturbations. Database matches sharing correlated signatures with the query indicate related cellular responses frequently governed by connected mechanisms, such as drugs mimicking the expression responses of a disease. To identify which processes are predominantly modulated in the GESS results, we developed specialized FEA methods combined with drug-target network visualization tools. The provided analysis tools are useful for studying the effects of genetic, chemical and environmental perturbations on biological systems, as well as searching single cell GES databases to identify novel network connections or cell types. The signatureSearch software is unique in that it provides access to an integrated environment for GESS/FEA routines that includes several novel search and enrichment methods, efficient data structures, and access to pre-built GES databases, and allowing users to work with custom databases. 相似文献

12.

A systems approach to morphogenesis in Arabidopsis thaliana: I. AGNS database

Omelyanchuk N. A. Mironova V. V. Zalevsky E. M. Shamov I. S. Poplavsky A. S. Podkolodny N. L. Ponomaryov D. K. Nikolaev S. V. Mjolsness E. D. Meyerowitz E. M. Kolchanov N. A. 《Biophysics》2008,51(1):75-82

In systems biology, study of a complex and multicomponent system, such as morphogenesis, comprises accumulation of data on morphogenetic processes in databases, classification and logical analysis of this information, and computer simulation of the processes in question using the data accumulated and the results of their analysis. This paper describes realization of the first steps in a systems study of morphogenesis (annotating research papers, compiling information in a database, data systematization, and their logical analysis) by the example of Arabidopsis thaliana, a model object in plant molecular biology. The database AGNS (Arabidopsis GeneNet Supplementary; http://wwwmgs.bionet.nsc.ru/agns) contains the experimentally confirmed information from published papers on specific features of gene expression and phenotypes of wild-type, mutant, and transgenic A. thaliana plants. AGNS queries and logical data analysis with the aid of specially developed software makes it possible to model various morphogenetic processes from gene expression to functioning of gene networks and their contribution to the development of certain traits.

相似文献

13.

Automatic image analysis for gene expression patterns of fly embryos

Peng H Long F Zhou J Leung G Eisen MB Myers EW 《BMC cell biology》2007,8(Z1):S7

相似文献

14.

ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles

Yoshida R Higuchi T Imoto S Miyano S 《Bioinformatics (Oxford, England)》2006,22(12):1538-1539

相似文献

15.

G-InforBIO: integrated system for microbial genomics

Naoto Tanaka Takashi Abe Satoru Miyazaki Hideaki Sugawara 《BMC bioinformatics》2006,7(1):368-7

Background

Genome databases contain diverse kinds of information, including gene annotations and nucleotide and amino acid sequences. It is not easy to integrate such information for genomic study. There are few tools for integrated analyses of genomic data, therefore, we developed software that enables users to handle, manipulate, and analyze genome data with a variety of sequence analysis programs. 相似文献

16.

Discovering semantic features in the literature: a foundation for building functional associations

Monica Chagoyen Pedro Carmona-Saez Hagit Shatkay Jose M Carazo Alberto Pascual-Montano 《BMC bioinformatics》2006,7(1):41

Background

Experimental techniques such as DNA microarray, serial analysis of gene expression (SAGE) and mass spectrometry proteomics, among others, are generating large amounts of data related to genes and proteins at different levels. As in any other experimental approach, it is necessary to analyze these data in the context of previously known information about the biological entities under study. The literature is a particularly valuable source of information for experiment validation and interpretation. Therefore, the development of automated text mining tools to assist in such interpretation is one of the main challenges in current bioinformatics research. 相似文献

17.

MILVA: an interactive tool for the exploration of multidimensional microarray data

D'Alimonte D Lowe D Nabney IT Mersinias V Smith CP 《Bioinformatics (Oxford, England)》2005,21(22):4192-4193

MOTIVATION: Clustering techniques such as k-means and hierarchical clustering are commonly used to analyze DNA microarray derived gene expression data. However, the interactions between processes underlying the cell activity suggest that the complexity of the microarray data structure may not be fully represented with discrete clustering methods. RESULTS: A newly developed software tool called MILVA (microarray latent visualization and analysis) is presented here to investigate microarray data without separating gene expression profiles into discrete classes. The underpinning of the MILVA software is the two-dimensional topographic representation of multidimensional microarray data. On this basis, the interactive MILVA functions allow a continuous exploration of microarray data driven by the direct supervision of the biologist in detecting activity patterns of co-regulated genes. AVAILABILITY: The MILVA software is freely available. The software and the related documentation can be downloaded from http://www.ncrg.aston.ac.uk/Projects/milva. User 'surrey' as username and '3245' as password to login. The software is currently available for Windows platform only. 相似文献

18.

AnaBench: a Web/CORBA-based workbench for biomolecular sequence analysis

Elarbi?Badidi Email author Cristina?De Sousa B?Franz?Lang Gertraud?Burger 《BMC bioinformatics》2003,4(1):63

Background

Sequence data analyses such as gene identification, structure modeling or phylogenetic tree inference involve a variety of bioinformatics software tools. Due to the heterogeneity of bioinformatics tools in usage and data requirements, scientists spend much effort on technical issues including data format, storage and management of input and output, and memorization of numerous parameters and multi-step analysis procedures. 相似文献

19.

Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks 总被引：25，自引：0，他引：25

Trapnell C Roberts A Goff L Pertea G Kim D Kelley DR Pimentel H Salzberg SL Rinn JL Pachter L 《Nature protocols》2012,7(3):562-578

相似文献

20.

Performance comparison and evaluation of software tools for microRNA deep-sequencing data analysis 总被引：1，自引：0，他引：1

Li Y Zhang Z Liu F Vongsangnak W Jing Q Shen B 《Nucleic acids research》2012,40(10):4298-4305

With the development of next-generation sequencing (NGS) techniques, many software tools have emerged for the discovery of novel microRNAs (miRNAs) and for analyzing the miRNAs expression profiles. An overall evaluation of these diverse software tools is lacking. In this study, we evaluated eight software tools based on their common feature and key algorithms. Three deep-sequencing data sets were collected from different species and used to assess the computational time, sensitivity and accuracy of detecting known miRNAs as well as their capacity for predicting novel miRNAs. Our results provide useful information for researchers to facilitate their selection of the optimal software tools for miRNA analysis depending on their specific requirements, i.e. novel miRNAs discovery or miRNA expression profile analysis of sequencing data sets. 相似文献