1.

DASMiner: discovering and integrating data from DAS sources

Diogo FT Veiga Helena F Deus Caner Akdemir Ana Tereza R Vasconcelos Jonas S Almeida 《BMC systems biology》2009,3(1):109-9

Background

DAS is a widely adopted protocol for providing syntactic interoperability among biological databases. The popularity of DAS is due to a simplified and elegant mechanism for data exchange that consists of sources exposing their RESTful interfaces for data access. As a growing number of DAS services are available for molecular biology resources, there is an incentive to explore this protocol in order to advance data discovery and integration among these resources.
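
To make the "RESTful interface" idea concrete, the sketch below builds the kind of URL a client might send to a DAS source to ask for features on a sequence segment. It assumes the DAS 1.x convention of a `/das/<source>/features?segment=<reference>:<start>,<stop>` path; the server `http://example.org`, the source name `demo_source`, and the helper `das_features_url` are hypothetical and are not taken from the DASMiner paper.

```python
from urllib.parse import urlencode


def das_features_url(base, source, reference, start, stop):
    """Build a DAS-style 'features' request URL (hypothetical helper).

    Assumes the DAS 1.x layout: <base>/das/<source>/features?segment=ref:start,stop
    """
    segment = f"{reference}:{start},{stop}"
    # Keep ':' and ',' unescaped so the segment stays human-readable.
    query = urlencode({"segment": segment}, safe=":,")
    return f"{base.rstrip('/')}/das/{source}/features?{query}"


# Hypothetical server and source names, for illustration only.
url = das_features_url("http://example.org", "demo_source", "chr1", 1000, 2000)
print(url)
```

A real client would then fetch this URL and parse the XML response; that step is omitted here because it depends on the specific server.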

2.

An eScience-Bayes strategy for analyzing omics data

Martin Eklund Ola Spjuth Jarl ES Wikberg 《BMC bioinformatics》2010,11(1):282

Background

The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge of analyzing the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in ad hoc approaches to address specific problems.

3.

The BiSciCol Triplifier: bringing biodiversity data to the Semantic Web

Brian J Stucky John Deck Tom Conlin Lukasz Ziemba Nico Cellinese Robert Guralnick 《BMC bioinformatics》2014,15(1)


4.

The Gaggle: An open-source software system for integrating bioinformatics software and data sources

Paul T Shannon David J Reiss Richard Bonneau Nitin S Baliga 《BMC bioinformatics》2006,7(1):176-13

Background

Systems biologists work with many kinds of data, from many different sources, using a variety of software tools. Each of these tools typically excels at one type of analysis, such as of microarrays, of metabolic networks and of predicted protein structure. A crucial challenge is to combine the capabilities of these (and other forthcoming) data resources and tools to create a data exploration and analysis environment that does justice to the variety and complexity of systems biology data sets. A solution to this problem should recognize that data types, formats and software in this high throughput age of biology are constantly changing.

5.

Sig2BioPAX: Java tool for converting flat files to BioPAX Level 3 format

Ryan L Webb Avi Ma'ayan 《Source code for biology and medicine》2011,6(1):5


6.

MultiSeq: unifying sequence and structure data for evolutionary analysis

Elijah Roberts John Eargle Dan Wright Zaida Luthey-Schulten 《BMC bioinformatics》2006,7(1):382

Background

Since the publication of the first draft of the human genome in 2000, bioinformatic data have been accumulating at an overwhelming pace. Currently, more than 3 million sequences and 35 thousand structures of proteins and nucleic acids are available in public databases. Finding correlations in and between these data to answer critical research questions is extremely challenging. This problem needs to be approached from several directions: information science to organize and search the data; information visualization to assist in recognizing correlations; mathematics to formulate statistical inferences; and biology to analyze chemical and physical properties in terms of sequence and structure changes.

7.

The Firegoose: two-way integration of diverse data from different bioinformatics web resources with desktop applications

J Christopher Bare Paul T Shannon Amy K Schmid Nitin S Baliga 《BMC bioinformatics》2007,8(1):456

Background

Information resources on the World Wide Web play an indispensable role in modern biology. But integrating data from multiple sources is often encumbered by the need to reformat data files, convert between naming systems, or perform ongoing maintenance of local copies of public databases. Opportunities for new ways of combining and re-using data are arising as a result of the increasing use of web protocols to transmit structured data.

8.

A joint finite mixture model for clustering genes from independent Gaussian and beta distributed data (total citations: 1; self-citations: 0; citations by others: 1)

Xiaofeng Dai Timo Erkkilä Olli Yli-Harja Harri Lähdesmäki 《BMC bioinformatics》2009,10(1):165

Background

Cluster analysis has become a standard computational method for gene function discovery as well as for more general explanatory data analysis. A number of different approaches have been proposed for that purpose, out of which different mixture models provide a principled probabilistic framework. Cluster analysis is now increasingly often supplemented with multiple data sources, and these heterogeneous information sources should be used as efficiently as possible.

9.

A computational framework for gene regulatory network inference that combines multiple methods and datasets

Rita Gupta Anna Stincone Philipp Antczak Sarah Durant Roy Bicknell Andreas Bikfalvi Francesco Falciani 《BMC systems biology》2011,5(1):52

Background

Reverse engineering in systems biology entails inference of gene regulatory networks from observational data. These data typically include gene expression measurements of wild type and mutant cells in response to a given stimulus. It has been shown that when more than one type of experiment is used in the network inference process, the accuracy is higher. Therefore, the development of generally applicable and effective methodologies that embed multiple sources of information in a single computational framework is a worthwhile objective.

10.

MBEToolbox: a Matlab toolbox for sequence data analysis in molecular biology and evolution

James J Cai David K Smith Xuhua Xia Kwok-yung Yuen 《BMC bioinformatics》2005,6(1):64

Background

MATLAB is a high-performance language for technical computing, integrating computation, visualization, and programming in an easy-to-use environment. It has been widely used in many areas, such as mathematics and computation, algorithm development, data acquisition, modeling, simulation, and scientific and engineering graphics. However, few functions are freely available in MATLAB to perform the sequence data analyses specifically required for molecular biology and evolution.

11.

BIOZON: a system for unification, management and analysis of heterogeneous biological data

Aaron Birkland Golan Yona 《BMC bioinformatics》2006,7(1):70-24

Background

Integration of heterogeneous data types is a challenging problem, especially in biology, where the number of databases and data types increases rapidly. Amongst the problems that one has to face are integrity, consistency, redundancy, connectivity, expressiveness and updatability.

12.

An exploratory data analysis method to reveal modular latent structures in high-throughput data

Tianwei Yu 《BMC bioinformatics》2010,11(1):440

Background

Modular structures are ubiquitous across various types of biological networks. The study of network modularity can help reveal regulatory mechanisms in systems biology, evolutionary biology and developmental biology. Identifying putative modular latent structures from high-throughput data using exploratory analysis can help better interpret the data and generate new hypotheses. Unsupervised learning methods designed for global dimension reduction or clustering fall short of identifying modules with factors acting in linear combinations.

13.

Supervised inference of gene-regulatory networks

Cuong C To Jiri Vohradsky 《BMC bioinformatics》2008,9(1):2

Background

Inference of protein interaction networks from various sources of data has become an important topic of both systems and computational biology. Here we present a supervised approach to identification of gene expression regulatory networks.

14.

Normalization method for metabolomics data using optimal selection of multiple internal standards

Marko Sysi-Aho Mikko Katajamaa Laxman Yetukuri Matej Orešič 《BMC bioinformatics》2007,8(1):93

Background

Success of metabolomics as the phenotyping platform largely depends on its ability to detect various sources of biological variability. Removal of platform-specific sources of variability such as systematic error is therefore one of the foremost priorities in data preprocessing. However, chemical diversity of molecular species included in typical metabolic profiling experiments leads to different responses to variations in experimental conditions, making normalization a very demanding task.

15.

Time warping of evolutionary distant temporal gene expression data based on noise suppression

Yury Goltsev Dmitri Papatsenko 《BMC bioinformatics》2009,10(1):353

Background

Comparative analysis of genome-wide temporal gene expression data has a broad potential area of application, including evolutionary biology, developmental biology, and medicine. However, at large evolutionary distances, the construction of global alignments and the consequent comparison of the time-series data are difficult. The main reason is the accumulation of variability in expression profiles of orthologous genes in the course of evolution.

16.

Incremental genetic K-means algorithm and its application in gene expression data analysis (total citations: 1; self-citations: 0; citations by others: 1)

Yi Lu Shiyong Lu Farshad Fotouhi Youping Deng Susan J Brown 《BMC bioinformatics》2004,5(1):172

Background

In recent years, clustering algorithms have been effectively applied in molecular biology for gene expression data analysis. With the help of clustering algorithms such as K-means, hierarchical clustering, SOM, etc., genes are partitioned into groups based on the similarity between their expression profiles. In this way, functionally related genes are identified. As the amount of laboratory data in molecular biology grows exponentially each year due to advanced technologies such as microarrays, new efficient and effective methods for clustering must be developed to process this growing amount of biological data.
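
As a rough illustration of partitioning genes by expression-profile similarity, here is a minimal sketch of plain K-means (Lloyd's iteration). It is not the incremental genetic K-means algorithm this paper proposes; the function name `kmeans`, its parameters, and the toy profiles are all made up for illustration.

```python
import random


def kmeans(profiles, k, iterations=20, seed=0):
    """Plain K-means on expression profiles given as lists of floats."""
    rng = random.Random(seed)
    # Initialise centroids from k randomly chosen profiles.
    centroids = [list(p) for p in rng.sample(profiles, k)]
    labels = [0] * len(profiles)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(
                range(k),
                key=lambda c: sum((x - y) ** 2 for x, y in zip(p, centroids[c])),
            )
            for p in profiles
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(profiles, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Toy data (invented): four "genes" measured under three conditions.
profiles = [
    [1.0, 1.1, 0.9],
    [0.9, 1.0, 1.0],
    [5.0, 5.2, 4.9],
    [5.1, 4.8, 5.0],
]
labels, centroids = kmeans(profiles, k=2)
print(labels)
```

Genes that end up with the same label have similar profiles across conditions; on real data the number of clusters and the distance measure would need to be chosen with care.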

17.

Extracting consistent knowledge from highly inconsistent cancer gene data sources

Xue Gong Ruihong Wu Yuannv Zhang Wenyuan Zhao Lixin Cheng Yunyan Gu Lin Zhang Jing Wang Jing Zhu Zheng Guo 《BMC bioinformatics》2010,11(1):76

Background

Hundreds of genes that are causally implicated in oncogenesis have been found and collected in various databases. For efficient application of these abundant but diverse data sources, it is of fundamental importance to evaluate their consistency.

18.

Enhanced protein fold recognition through a novel data integration approach

Yiming Ying Kaizhu Huang Colin Campbell 《BMC bioinformatics》2009,10(1):267-18

Background

Protein fold recognition is a key step in protein three-dimensional (3D) structure discovery. There are multiple fold discriminatory data sources which use physicochemical and structural properties as well as further data sources derived from local sequence alignments. This raises the issue of finding the most efficient method for combining these different informative data sources and exploring their relative significance for protein fold classification. Kernel methods have been extensively used for biological data analysis. They can incorporate separate fold discriminatory features into kernel matrices which encode the similarity between samples in their respective data sources.
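
The idea of one kernel matrix per data source can be sketched as follows. This is only a minimal illustration using a fixed weighted sum of linear kernels, one simple way to combine sources; it is not the integration method developed in this paper, and the helper names, weights, and feature values are invented.

```python
def linear_kernel(features):
    """Gram matrix of dot products between samples for one data source."""
    return [[sum(a * b for a, b in zip(x, y)) for y in features] for x in features]


def combine_kernels(kernels, weights):
    """Weighted sum of equally sized kernel matrices.

    With non-negative weights, the sum of valid kernels is again a valid kernel.
    """
    n = len(kernels[0])
    return [
        [sum(w * K[i][j] for w, K in zip(weights, kernels)) for j in range(n)]
        for i in range(n)
    ]


# Invented features for three samples from two hypothetical data sources.
source_a = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
source_b = [[2.0], [0.0], [1.0]]

K_a = linear_kernel(source_a)
K_b = linear_kernel(source_b)
K = combine_kernels([K_a, K_b], weights=[0.5, 0.5])
print(K)
```

The combined matrix `K` could then be passed to any kernel-based classifier; choosing or learning the weights is where approaches to data integration differ.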

19.

Model-based extension of high-throughput to high-content data

Andrea C Pfeifer Daniel Kaschek Julie Bachmann Ursula Klingmüller Jens Timmer 《BMC systems biology》2010,4(1):106

Background

High-quality quantitative data is a major limitation in systems biology. The experimental data used in systems biology can be assigned to one of the following categories: assays yielding average data of a cell population, high-content single cell measurements and high-throughput techniques generating single cell data for large cell populations. For modeling purposes, a combination of data from different categories is highly desirable in order to increase the number of observable species and processes and thereby maximize the identifiability of parameters.

20.

Integrative investigation of metabolic and transcriptomic data

Pınar Pir Betül Kırdar Andrew Hayes Z İlsen Önsan Kutlu Ö Ülgen Stephen G Oliver 《BMC bioinformatics》2006,7(1):203-12
