共查询到20条相似文献,搜索用时 15 毫秒
1.
Ames GM George DB Hampson CP Kanarek AR McBee CD Lockwood DR Achter JD Webb CT 《Proceedings. Biological sciences / The Royal Society》2011,278(1724):3544-3550
Recent studies have increasingly turned to graph theory to model more realistic contact structures that characterize disease spread. Because of the computational demands of these methods, many researchers have sought to use measures of network structure to modify analytically tractable differential equation models. Several of these studies have focused on the degree distribution of the contact network as the basis for their modifications. We show that although degree distribution is sufficient to predict disease behaviour on very sparse or very dense human contact networks, for intermediate density networks we must include information on clustering and path length to accurately predict disease behaviour. Using these three metrics, we were able to explain more than 98 per cent of the variation in endemic disease levels in our stochastic simulations. 相似文献
2.
Understanding the relationships between the structure (topology) and function of biological networks is a central question of systems biology. The idea that topology is a major determinant of systems function has become an attractive and highly disputed hypothesis. Although structural analysis of interaction networks demonstrates a correlation between the topological properties of a node (protein, gene) in the network and its functional essentiality, the analysis of metabolic networks fails to find such correlations. In contrast, approaches utilizing both the topology and biochemical parameters of metabolic networks, e.g., flux balance analysis, are more successful in predicting phenotypes of knockout strains. We reconcile these seemingly conflicting results by showing that the topology of the metabolic networks of both Escherichia coli and Saccharomyces cerevisiae are, in fact, sufficient to predict the viability of knockout strains with accuracy comparable to flux balance analysis on large, unbiased mutant data sets. This surprising result is obtained by introducing a novel topology-based measure of network transport: synthetic accessibility. We also show that other popular topology-based characteristics such as node degree, graph diameter, and node usage (betweenness) fail to predict the viability of E. coli mutant strains. The success of synthetic accessibility demonstrates its ability to capture the essential properties of the metabolic network, such as the branching of chemical reactions and the directed transport of material from inputs to outputs. Our results strongly support a link between the topology and function of biological networks and, in agreement with recent genetic studies, emphasize the minimal role of flux rerouting in providing robustness of mutant strains. 相似文献
3.
4.
Background
Predicting the phenotypic effects of mutations is a central goal of genetics research; it has important applications in elucidating how genotype determines phenotype and in identifying human disease genes. 相似文献5.
6.
James C Costello Mehmet M Dalkilic Scott M Beason Jeff R Gehlhausen Rupali Patwardhan Sumit Middha Brian D Eads Justen R Andrews 《Genome biology》2009,10(9):1-29
Background
Discovering the functions of all genes is a central goal of contemporary biomedical research. Despite considerable effort, we are still far from achieving this goal in any metazoan organism. Collectively, the growing body of high-throughput functional genomics data provides evidence of gene function, but remains difficult to interpret.Results
We constructed the first network of functional relationships for Drosophila melanogaster by integrating most of the available, comprehensive sets of genetic interaction, protein-protein interaction, and microarray expression data. The complete integrated network covers 85% of the currently known genes, which we refined to a high confidence network that includes 20,000 functional relationships among 5,021 genes. An analysis of the network revealed a remarkable concordance with prior knowledge. Using the network, we were able to infer a set of high-confidence Gene Ontology biological process annotations on 483 of the roughly 5,000 previously unannotated genes. We also show that this approach is a means of inferring annotations on a class of genes that cannot be annotated based solely on sequence similarity. Lastly, we demonstrate the utility of the network through reanalyzing gene expression data to both discover clusters of coregulated genes and compile a list of candidate genes related to specific biological processes.Conclusions
Here we present the the first genome-wide functional gene network in D. melanogaster. The network enables the exploration, mining, and reanalysis of experimental data, as well as the interpretation of new data. The inferred annotations provide testable hypotheses of previously uncharacterized genes. 相似文献7.
It has been shown that dynamic recurrent neural networks are successful in identifying the complex mapping relationship between
full-wave-rectified electromyographic (EMG) signals and limb trajectories during complex movements. These connectionist models
include two types of adaptive parameters: the interconnection weights between the units and the time constants associated
to each neuron-like unit; they are governed by continuous-time equations. Due to their internal structure, these models are
particularly appropriate to solve dynamical tasks (with time-varying input and output signals). We show in this paper that
the introduction of a modular organization dedicated to different aspects of the dynamical mapping including privileged communication
channels can refine the architecture of these recurrent networks. We first divide the initial individual network into two
communicating subnetworks. These two modules receive the same EMG signals as input but are involved in different identification
tasks related to position and acceleration. We then show that the introduction of an artificial distance in the model (using
a Gaussian modulation factor of weights) induces a reduced modular architecture based on a self-elimination of null synaptic
weights. Moreover, this self-selected reduced model based on two subnetworks performs the identification task better than
the original single network while using fewer free parameters (better learning curve and better identification quality). We
also show that this modular network exhibits several features that can be considered as biologically plausible after the learning
process: self-selection of a specific inhibitory communicating path between both subnetworks after the learning process, appearance
of tonic and phasic neurons, and coherent distribution of the values of the time constants within each subnetwork.
Received: 17 September 2001 / Accepted in revised form: 15 January 2002 相似文献
8.
A three dimensional nutrient-plant-herbivore model was proposed and conditions for boundedness, positive invariance, existence and stability of different equilibrium points, Hopf-bifurcation and global stability were obtained. We performed numerical simulations to observe the simultaneous effect of the top-down and the bottom-up mechanism on the system. It was found that nutrient enrichment destroyed the coexistence steady state of the system. This nutrient enrichment could be due to high nutrient input rate or high nutrient recycling rate. In both cases the system showed instability. Moreover, these results were independent of the grazing pressure and the predation functional form. 相似文献
9.
MOTIVATION: Bayesian network methods have shown promise in gene regulatory network reconstruction because of their capability of capturing causal relationships between genes and handling data with noises found in biological experiments. The problem of learning network structures, however, is NP hard. Consequently, heuristic methods such as hill climbing are used for structure learning. For networks of a moderate size, hill climbing methods are not computationally efficient. Furthermore, relatively low accuracy of the learned structures may be observed. The purpose of this article is to present a novel structure learning method for gene network discovery. RESULTS: In this paper, we present a novel structure learning method to reconstruct the underlying gene networks from the observational gene expression data. Unlike hill climbing approaches, the proposed method first constructs an undirected network based on mutual information between two nodes and then splits the structure into substructures. The directional orientations for the edges that connect two nodes are then obtained by optimizing a scoring function for each substructure. Our method is evaluated using two benchmark network datasets with known structures. The results show that the proposed method can identify networks that are close to the optimal structures. It outperforms hill climbing methods in terms of both computation time and predicted structure accuracy. We also apply the method to gene expression data measured during the yeast cycle and show the effectiveness of the proposed method for network reconstruction. 相似文献
10.
11.
12.
MOTIVATION: Computational gene prediction methods are an important component of whole genome analyses. While ab initio gene finders have demonstrated major improvements in accuracy, the most reliable methods are evidence-based gene predictors. These algorithms can rely on several different sources of evidence including predictions from multiple ab initio gene finders, matches to known proteins, sequence conservation and partial cDNAs to predict the final product. Despite the success of these algorithms, prediction of complete gene structures, especially for alternatively spliced products, remains a difficult task. RESULTS: LOCUS (Length Optimized Characterization of Unknown Spliceforms) is a new evidence-based gene finding algorithm which integrates a length-constraint into a dynamic programming-based framework for prediction of gene products. On a Caenorhabditis elegans test set of alternatively spliced internal exons, its performance exceeds that of current ab initio gene finders and in most cases can accurately predict the correct form of all the alternative products. As the length information used by the algorithm can be obtained in a high-throughput fashion, we propose that integration of such information into a gene-prediction pipeline is feasible and doing so may improve our ability to fully characterize the complete set of mRNAs for a genome. AVAILABILITY: LOCUS is available from http://ural.wustl.edu/software.html 相似文献
13.
Using body size to predict perceptual range 总被引:5,自引:0,他引:5
We examined the relationship between body size and perceptual range (the distance at which an animal can perceive landscape elements) for a group of forest-dwelling rodents. We used previously published data on orientation ability at various distances for three sciurid species (gray squirrel, fox squirrel and chipmunk) and one murid species (white-footed mouse) to build a predictive model. We found a significant positive relationship between perceptual range and body mass. Although this model was built using a 15.5 m high horizon, we used this relation to predict the perceptual range of root voles (3.9–4.3 m) orienting towards a 0.5 m high horizon which was consistent with other empirical work suggesting a value of something less than 5 m. This model illustrates a relationship between perceptual range and body size and can be used to develop starting points for future investigations of perceptual range for similar organisms. 相似文献
14.
Enzyme function is much less conserved than anticipated, i.e., the requirement for sequence similarity that implies similarity in enzymatic function is much higher than the requirement that implies similarity in protein structure. This is because the function of an enzyme is an extremely complicated problem that may involve very subtle structural details as well as many other physical chemistry factors. Accordingly, if simply based on the sequence similarity approach, it would hardly get a decent success rate in predicting enzyme sub-class even for a dataset consisting of samples with 50% sequence identity. To cope with such a situation, the GO-PseAA predictor was adopted to identify the sub-class for each of the six main enzyme families. It has been observed that, even for the much more stringent datasets in which none of the enzymes has 25% sequence identity to any others, the overall success rates are 73-95%, suggesting that the GO-PseAA predictor can catch the core features of the statistical samples concerned and may become a useful high throughput tool in proteomics and bioinformatics. 相似文献
15.
Using subsite coupling to predict signal peptides 总被引:5,自引:0,他引:5
Chou KC 《Protein engineering》2001,14(2):75-79
Given a nascent protein sequence, how can one predict its signal peptide or "Zipcode" sequence? This is a first important problem for scientists to use signal peptides as a vehicle to find new drugs or to reprogram cells for gene therapy. Based on a model that takes into account the coupling effect among some key subsites, the so-called [-3, -1, +1] coupling model, a new prediction algorithm is developed. The overall rate of correct prediction for 1939 secretory proteins and 1440 non-secretary proteins was over 92%. It has not escaped our attention that the new method may also serve as a useful tool for helping investigate further many unclear details regarding the molecular mechanism of the ZIP code protein-sorting system in cells. 相似文献
16.
After the major achievements of the DNA sequencing projects, an equally important challenge now is to uncover the functional relationships among genes (i.e. gene networks). It has become increasingly clear that computational algorithms are crucial for extracting meaningful information from the massive amount of data generated by high-throughput genome-wide technologies. Here, we summarise how systems identification algorithms, originating from physics and control theory, have been adapted for use in biology. We also explain how experimental perturbations combined with genome-wide measurements are being used to uncover gene networks. Perturbation techniques could pave the way for identifying gene networks in more complex settings such as multifactorial diseases and for improving the efficacy of drug evaluation. 相似文献
17.
Background
Cis-regulatory modules (CRMs) are short stretches of DNA that help regulate gene expression in higher eukaryotes. They have been found up to 1 megabase away from the genes they regulate and can be located upstream, downstream, and even within their target genes. Due to the difficulty of finding CRMs using biological and computational techniques, even well-studied regulatory systems may contain CRMs that have not yet been discovered. 相似文献18.
Prediction of protein classification is an important topic in molecular biology. This is because it is able to not only provide useful information from the viewpoint of structure itself, but also greatly stimulate the characterization of many other features of proteins that may be closely correlated with their biological functions. In this paper, the LogitBoost, one of the boosting algorithms developed recently, is introduced for predicting protein structural classes. It performs classification using a regression scheme as the base learner, which can handle multi-class problems and is particularly superior in coping with noisy data. It was demonstrated that the LogitBoost outperformed the support vector machines in predicting the structural classes for a given dataset, indicating that the new classifier is very promising. It is anticipated that the power in predicting protein structural classes as well as many other bio-macromolecular attributes will be further strengthened if the LogitBoost and some other existing algorithms can be effectively complemented with each other. 相似文献
19.
20.
MOTIVATION: The past decade has seen the introduction of fast and relatively inexpensive methods to detect genetic variation across the genome and exponential growth in the number of known single nucleotide variants (SNVs). There is increasing interest in bioinformatics approaches to identify variants that are functionally important from millions of candidate variants. Here, we describe the essential components of bioinformatics tools that predict functional SNVs. RESULTS: Bioinformatics tools have great potential to identify functional SNVs, but the black box nature of many tools can be a pitfall for researchers. Understanding the underlying methods, assumptions and biases of these tools is essential to their intelligent application. 相似文献