期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

WGCNA: an R package for weighted correlation network analysis 总被引：12，自引：0，他引：12

Peter Langfelder Steve Horvath 《BMC bioinformatics》2008,9(1):1-13

Background

Modelling the time-related behaviour of biological systems is essential for understanding their dynamic responses to perturbations. In metabolic profiling studies, the sampling rate and number of sampling points are often restricted due to experimental and biological constraints.

Results

A supervised multivariate modelling approach with the objective to model the time-related variation in the data for short and sparsely sampled time-series is described. A set of piecewise Orthogonal Projections to Latent Structures (OPLS) models are estimated, describing changes between successive time points. The individual OPLS models are linear, but the piecewise combination of several models accommodates modelling and prediction of changes which are non-linear with respect to the time course. We demonstrate the method on both simulated and metabolic profiling data, illustrating how time related changes are successfully modelled and predicted.

Conclusion

The proposed method is effective for modelling and prediction of short and multivariate time series data. A key advantage of the method is model transparency, allowing easy interpretation of time-related variation in the data. The method provides a competitive complement to commonly applied multivariate methods such as OPLS and Principal Component Analysis (PCA) for modelling and analysis of short time-series data. 相似文献

2.

The identification of informative genes from multiple datasets with increasing complexity

S Yahya Anvar Peter AC 't Hoen Allan Tucker 《BMC bioinformatics》2010,11(1):32

Background

In microarray data analysis, factors such as data quality, biological variation, and the increasingly multi-layered nature of more complex biological systems complicates the modelling of regulatory networks that can represent and capture the interactions among genes. We believe that the use of multiple datasets derived from related biological systems leads to more robust models. Therefore, we developed a novel framework for modelling regulatory networks that involves training and evaluation on independent datasets. Our approach includes the following steps: (1) ordering the datasets based on their level of noise and informativeness; (2) selection of a Bayesian classifier with an appropriate level of complexity by evaluation of predictive performance on independent data sets; (3) comparing the different gene selections and the influence of increasing the model complexity; (4) functional analysis of the informative genes. 相似文献

3.

Physicochemical property distributions for accurate and rapid pairwise protein homology detection

Bobbie-Jo M Webb-Robertson Kyle G Ratuiste Christopher S Oehmen 《BMC bioinformatics》2010,11(1):145

Background

The challenge of remote homology detection is that many evolutionarily related sequences have very little similarity at the amino acid level. Kernel-based discriminative methods, such as support vector machines (SVMs), that use vector representations of sequences derived from sequence properties have been shown to have superior accuracy when compared to traditional approaches for the task of remote homology detection. 相似文献

4.

Kernel-PCA data integration with enhanced interpretability

Ferran Reverter Esteban Vegas Josep M Oller 《BMC systems biology》2014,8(Z2):S6

Background

Nowadays, combining the different sources of information to improve the biological knowledge available is a challenge in bioinformatics. One of the most powerful methods for integrating heterogeneous data types are kernel-based methods. Kernel-based data integration approaches consist of two basic steps: firstly the right kernel is chosen for each data set; secondly the kernels from the different data sources are combined to give a complete representation of the available data for a given statistical task.

Results

We analyze the integration of data from several sources of information using kernel PCA, from the point of view of reducing dimensionality. Moreover, we improve the interpretability of kernel PCA by adding to the plot the representation of the input variables that belong to any dataset. In particular, for each input variable or linear combination of input variables, we can represent the direction of maximum growth locally, which allows us to identify those samples with higher/lower values of the variables analyzed.

Conclusions

The integration of different datasets and the simultaneous representation of samples and variables together give us a better understanding of biological knowledge.

相似文献

5.

KID - an algorithm for fast and efficient text mining used to automatically generate a database containing kinetic information of enzymes

Stephanie Heinen Bernhard Thielen Dietmar Schomburg 《BMC bioinformatics》2010,11(1):375

Background

The amount of available biological information is rapidly increasing and the focus of biological research has moved from single components to networks and even larger projects aiming at the analysis, modelling and simulation of biological networks as well as large scale comparison of cellular properties. It is therefore essential that biological knowledge is easily accessible. However, most information is contained in the written literature in an unstructured way, so that methods for the systematic extraction of knowledge directly from the primary literature have to be deployed. 相似文献

6.

Probability-based model of protein-protein interactions on biological timescales

Alexander L Tournier Paul W Fitzjohn Paul A Bates 《Algorithms for molecular biology : AMB》2006,1(1):25-11

Background

Simulation methods can assist in describing and understanding complex networks of interacting proteins, providing fresh insights into the function and regulation of biological systems. Recent studies have investigated such processes by explicitly modelling the diffusion and interactions of individual molecules. In these approaches, two entities are considered to have interacted if they come within a set cutoff distance of each other. 相似文献

7.

A data integration approach for cell cycle analysis oriented to model simulation in systems biology

Roberta Alfieri Ivan Merelli Ettore Mosca Luciano Milanesi 《BMC systems biology》2007,1(1):35-12

相似文献

8.

Regulation of excitation-contraction coupling in mouse cardiac myocytes: integrative analysis with mathematical modelling

Jussi T Koivumäki Topi Korhonen Jouni Takalo Matti Weckström Pasi Tavi 《BMC physiology》2009,9(1):16-20

Background

The cardiomyocyte is a prime example of inherently complex biological system with inter- and cross-connected feedback loops in signalling, forming the basic properties of intracellular homeostasis. Functional properties of cells and tissues have been studied e.g. with powerful tools of genetic engineering, combined with extensive experimentation. While this approach provides accurate information about the physiology at the endpoint, complementary methods, such as mathematical modelling, can provide more detailed information about the processes that have lead to the endpoint phenotype. 相似文献

9.

OpenFLUX: efficient modelling software for 13C-based metabolic flux analysis

Lake-Ee Quek Christoph Wittmann Lars K Nielsen Jens O Krömer 《Microbial cell factories》2009,8(1):25-15

Background

The quantitative analysis of metabolic fluxes, i.e., in vivo activities of intracellular enzymes and pathways, provides key information on biological systems in systems biology and metabolic engineering. It is based on a comprehensive approach combining (i) tracer cultivation on ¹³C substrates, (ii) ¹³C labelling analysis by mass spectrometry and (iii) mathematical modelling for experimental design, data processing, flux calculation and statistics. Whereas the cultivation and the analytical part is fairly advanced, a lack of appropriate modelling software solutions for all modelling aspects in flux studies is limiting the application of metabolic flux analysis. 相似文献

10.

Discriminating between rival biochemical network models: three approaches to optimal experiment design

Bence Mélykúti Antonis Papachristodoulou Hana El-Samad 《BMC systems biology》2010,4(1):38

Background

The success of molecular systems biology hinges on the ability to use computational models to design predictive experiments, and ultimately unravel underlying biological mechanisms. A problem commonly encountered in the computational modelling of biological networks is that alternative, structurally different models of similar complexity fit a set of experimental data equally well. In this case, more than one molecular mechanism can explain available data. In order to rule out the incorrect mechanisms, one needs to invalidate incorrect models. At this point, new experiments maximizing the difference between the measured values of alternative models should be proposed and conducted. Such experiments should be optimally designed to produce data that are most likely to invalidate incorrect model structures. 相似文献

11.

Stochastic and deterministic multiscale models for systems biology: an auxin-transport case study

Jamie Twycross Leah R Band Malcolm J Bennett John R King Natalio Krasnogor 《BMC systems biology》2010,4(1):34

Background

Stochastic and asymptotic methods are powerful tools in developing multiscale systems biology models; however, little has been done in this context to compare the efficacy of these methods. The majority of current systems biology modelling research, including that of auxin transport, uses numerical simulations to study the behaviour of large systems of deterministic ordinary differential equations, with little consideration of alternative modelling frameworks. 相似文献

12.

An approach to multiscale modelling with graph grammars

Yongzhi Ong Katarína Streit Michael Henke Winfried Kurth 《Annals of botany》2014,114(4):813-827

Background and Aims

Functional–structural plant models (FSPMs) simulate biological processes at different spatial scales. Methods exist for multiscale data representation and modification, but the advantages of using multiple scales in the dynamic aspects of FSPMs remain unclear. Results from multiscale models in various other areas of science that share fundamental modelling issues with FSPMs suggest that potential advantages do exist, and this study therefore aims to introduce an approach to multiscale modelling in FSPMs.

Methods

A three-part graph data structure and grammar is revisited, and presented with a conceptual framework for multiscale modelling. The framework is used for identifying roles, categorizing and describing scale-to-scale interactions, thus allowing alternative approaches to model development as opposed to correlation-based modelling at a single scale. Reverse information flow (from macro- to micro-scale) is catered for in the framework. The methods are implemented within the programming language XL.

Key Results

Three example models are implemented using the proposed multiscale graph model and framework. The first illustrates the fundamental usage of the graph data structure and grammar, the second uses probabilistic modelling for organs at the fine scale in order to derive crown growth, and the third combines multiscale plant topology with ozone trends and metabolic network simulations in order to model juvenile beech stands under exposure to a toxic trace gas.

Conclusions

The graph data structure supports data representation and grammar operations at multiple scales. The results demonstrate that multiscale modelling is a viable method in FSPM and an alternative to correlation-based modelling. Advantages and disadvantages of multiscale modelling are illustrated by comparisons with single-scale implementations, leading to motivations for further research in sensitivity analysis and run-time efficiency for these models. 相似文献

13.

FORG3D: Force-directed 3D graph editor for visualization of integrated genome scale data

Jussi Paananen Garry Wong 《BMC systems biology》2009,3(1):26-8

Background

Genomics research produces vast amounts of experimental data that needs to be integrated in order to understand, model, and interpret the underlying biological phenomena. Interpreting these large and complex data sets is challenging and different visualization methods are needed to help produce knowledge from the data. 相似文献

14.

Bi-directional gene set enrichment and canonical correlation analysis identify key diet-sensitive pathways and biomarkers of metabolic syndrome

Melissa J Morine Jolene McMonagle Sinead Toomey Clare M Reynolds Aidan P Moloney Isobel C Gormley Peadar Ó Gaora Helen M Roche 《BMC bioinformatics》2010,11(1):499

相似文献

15.

VANTED: A system for advanced data analysis and visualization in the context of biological networks

Björn H Junker Christian Klukas Falk Schreiber 《BMC bioinformatics》2006,7(1):109-13

Background

Recent advances with high-throughput methods in life-science research have increased the need for automatized data analysis and visual exploration techniques. Sophisticated bioinformatics tools are essential to deduct biologically meaningful interpretations from the large amount of experimental data, and help to understand biological processes. 相似文献

16.

Towards a genome-scale kinetic model of cellular metabolism

Kieran Smallbone Evangelos Simeonidis Neil Swainston Pedro Mendes 《BMC systems biology》2010,4(1):6

Background

Advances in bioinformatic techniques and analyses have led to the availability of genome-scale metabolic reconstructions. The size and complexity of such networks often means that their potential behaviour can only be analysed with constraint-based methods. Whilst requiring minimal experimental data, such methods are unable to give insight into cellular substrate concentrations. Instead, the long-term goal of systems biology is to use kinetic modelling to characterize fully the mechanics of each enzymatic reaction, and to combine such knowledge to predict system behaviour. 相似文献

17.

Sources of variation in Affymetrix microarray experiments

Stanislav?O?Zakharkin Kyoungmi?Kim Tapan?Mehta Lang?Chen Stephen?Barnes Katherine?E?Scheirer Rudolph?S?Parrish David?B?Allison Grier?P?Page Email author 《BMC bioinformatics》2005,6(1):214

Background

A typical microarray experiment has many sources of variation which can be attributed to biological and technical causes. Identifying sources of variation and assessing their magnitude, among other factors, are important for optimal experimental design. The objectives of this study were: (1) to estimate relative magnitudes of different sources of variation and (2) to evaluate agreement between biological and technical replicates. 相似文献

18.

Prediction of protein-protein binding site by using core interface residue and support vector machine

Nan Li Zhonghua Sun Fan Jiang 《BMC bioinformatics》2008,9(1):553

Background

The prediction of protein-protein binding site can provide structural annotation to the protein interaction data from proteomics studies. This is very important for the biological application of the protein interaction data that is increasing rapidly. Moreover, methods for predicting protein interaction sites can also provide crucial information for improving the speed and accuracy of protein docking methods. 相似文献

19.

Simulation of microarray data with realistic characteristics

Matti Nykter Tommi Aho Miika Ahdesmäki Pekka Ruusuvuori Antti Lehmussola Olli Yli-Harja 《BMC bioinformatics》2006,7(1):349-17

Background

Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been proposed. 相似文献

20.

Multivariate curve resolution of time course microarray data

Peter D Wentzell Tobias K Karakach Sushmita Roy M Juanita Martinez Christopher P Allen Margaret Werner-Washburne 《BMC bioinformatics》2006,7(1):343

Background

Modeling of gene expression data from time course experiments often involves the use of linear models such as those obtained from principal component analysis (PCA), independent component analysis (ICA), or other methods. Such methods do not generally yield factors with a clear biological interpretation. Moreover, implicit assumptions about the measurement errors often limit the application of these methods to log-transformed data, destroying linear structure in the untransformed expression data. 相似文献