首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The Metabolic Models Reconstruction Using Genome-Scale Information (merlin) tool is a user-friendly Java application that aids the reconstruction of genome-scale metabolic models for any organism that has its genome sequenced. It performs the major steps of the reconstruction process, including the functional genomic annotation of the whole genome and subsequent construction of the portfolio of reactions. Moreover, merlin includes tools for the identification and annotation of genes encoding transport proteins, generating the transport reactions for those carriers. It also performs the compartmentalisation of the model, predicting the organelle localisation of the proteins encoded in the genome and thus the localisation of the metabolites involved in the reactions promoted by such enzymes. The gene-proteins-reactions (GPR) associations are automatically generated and included in the model. Finally, merlin expedites the transition from genomic data to draft metabolic models reconstructions exported in the SBML standard format, allowing the user to have a preliminary view of the biochemical network, which can be manually curated within the environment provided by merlin.  相似文献   

2.
Genome-scale metabolic models are central in connecting genotypes to metabolic phenotypes. However, even for well studied organisms, such as Escherichia coli, draft networks do not contain a complete biochemical network. Missing reactions are referred to as gaps. These gaps need to be filled to enable functional analysis, and gap-filling choices influence model predictions. To investigate whether functional networks existed where all gap-filling reactions were supported by sequence similarity to annotated enzymes, four draft networks were supplemented with all reactions from the Model SEED database for which minimal sequence similarity was found in their genomes. Quadratic programming revealed that the number of reactions that could partake in a gap-filling solution was vast: 3,270 in the case of E. coli, where 72% of the metabolites in the draft network could connect a gap-filling solution. Nonetheless, no network could be completed without the inclusion of orphaned enzymes, suggesting that parts of the biochemistry integral to biomass precursor formation are uncharacterized. However, many gap-filling reactions were well determined, and the resulting networks showed improved prediction of gene essentiality compared with networks generated through canonical gap filling. In addition, gene essentiality predictions that were sensitive to poorly determined gap-filling reactions were of poor quality, suggesting that damage to the network structure resulting from the inclusion of erroneous gap-filling reactions may be predictable.  相似文献   

3.
Metabolic databases contain information about thousands of small molecules and reactions, which can be represented as networks. In the context of metabolic reconstruction, pathways can be inferred by searching optimal paths in such networks. A recurrent problem is the presence of pool metabolites (e.g., water, energy carriers, and cofactors), which are connected to hundreds of reactions, thus establishing irrelevant shortcuts between nodes of the network. One solution to this problem relies on weighted networks to penalize highly connected compounds. A more refined solution takes the chemical structure of reactants into account in order to differentiate between side and main compounds of a reaction. Thanks to an intensive annotation effort at KEGG, decompositions of reactions into reactant pairs (RPAIR) categorized by their role (main, trans, cofac, ligase, and leave) are now available.The goal of this article is to evaluate the impact of RPAIR data on pathfinding in metabolic networks. To this end, we measure the impact of different parameters concerning the construction of the metabolic network: mapping of reactions and reactant pairs onto a graph, use of selected categories of reactant pairs, weighting schemes for compounds and reactions, removal of highly connected metabolites, and reaction directionality. In total, we tested 104 combinations of parameters and identified their optimal values for pathfinding on the basis of 55 reference pathways from three organisms.The best-performing metabolic network combines the biochemical knowledge encoded by KEGG RPAIR with a weighting scheme penalizing highly connected compounds. With this network, we could recover reference pathways from Escherichia coli with an average accuracy of 93% (32 pathways), from Saccharomyces cerevisiae with an average accuracy of 66% (11 pathways), and from humans with an average accuracy of 70% (12 pathways). Our pathfinding approach is available as part of the Network Analysis Tools.  相似文献   

4.
Genome-scale metabolic network reconstruction can be used for simulating cellular behaviors by simultaneously monitoring thousands of biochemical reactions, and is therefore important for systems biology studies in microbes. However, the labor-intensive and time-consuming reconstruction process has hindered the progress of this important field. Here we present a web server, MrBac (Metabolic network Reconstructions for Bacteria), to streamline the network reconstruction process for draft genome-scale metabolic networks and to provide annotation information from multiple databases for further curation of the draft reconstructions. MrBac integrates comparative genomics, retrieval of genome annotations, and generation of standard systems biology file format ready for network analyses. We also used MrBac to automatically generate a draft metabolic model of Salmonella enteric serovar Typhimurium LT2. The high similarity between this automatic model and the experimentally validated models further supports the usefulness and accuracy of MrBac. The high efficiency and accuracy of MrBac may accelerate the advances of systems biology studies on microbiology. MrBac is freely available at http://sb.nhri.org.tw/MrBac.  相似文献   

5.

Background  

Current methods for the automated generation of genome-scale metabolic networks focus on genome annotation and preliminary biochemical reaction network assembly, but do not adequately address the process of identifying and filling gaps in the reaction network, and verifying that the network is suitable for systems level analysis. Thus, current methods are only sufficient for generating draft-quality networks, and refinement of the reaction network is still largely a manual, labor-intensive process.  相似文献   

6.
The topology of central carbon metabolism of Aspergillus niger was identified and the metabolic network reconstructed, by integrating genomic, biochemical and physiological information available for this microorganism and other related fungi. The reconstructed network may serve as a valuable database for annotation of genes identified in future genome sequencing projects on aspergilli. Based on the metabolic reconstruction, a stoichiometric model was set up that includes 284 metabolites and 335 reactions, of which 268 represent biochemical conversions and 67 represent transport processes between the different intracellular compartments and between the cell and the extracellular medium. The stoichiometry of the metabolic reactions was used in combination with biosynthetic requirements for growth and pseudo-steady state mass balances over intracellular metabolites for the quantification of metabolic fluxes using metabolite balancing. This framework was employed to perform an in silico characterisation of the phenotypic behaviour of A. niger grown on different carbon sources. The effects on growth of single reaction deletions were assessed and essential biochemical reactions were identified for different carbon sources. Furthermore, application of the stoichiometric model for assessing the metabolic capabilities of A. niger to produce metabolites was evaluated by using succinate production as a case study.  相似文献   

7.
Genome-scale metabolic network models can be reconstructed for well-characterized organisms using genomic annotation and literature information. However, there are many instances in which model predictions of metabolic fluxes are not entirely consistent with experimental data, indicating that the reactions in the model do not match the active reactions in the in vivo system. We introduce a method for determining the active reactions in a genome-scale metabolic network based on a limited number of experimentally measured fluxes. This method, called optimal metabolic network identification (OMNI), allows efficient identification of the set of reactions that results in the best agreement between in silico predicted and experimentally measured flux distributions. We applied the method to intracellular flux data for evolved Escherichia coli mutant strains with lower than predicted growth rates in order to identify reactions that act as flux bottlenecks in these strains. The expression of the genes corresponding to these bottleneck reactions was often found to be downregulated in the evolved strains relative to the wild-type strain. We also demonstrate the ability of the OMNI method to diagnose problems in E. coli strains engineered for metabolite overproduction that have not reached their predicted production potential. The OMNI method applied to flux data for evolved strains can be used to provide insights into mechanisms that limit the ability of microbial strains to evolve towards their predicted optimal growth phenotypes. When applied to industrial production strains, the OMNI method can also be used to suggest metabolic engineering strategies to improve byproduct secretion. In addition to these applications, the method should prove to be useful in general for reconstructing metabolic networks of ill-characterized microbial organisms based on limited amounts of experimental data.  相似文献   

8.
With the emergence of energy scarcity, the use of renewable energy sources such as biodiesel is becoming increasingly necessary. Recently, many researchers have focused their minds on Yarrowia lipolytica, a model oleaginous yeast, which can be employed to accumulate large amounts of lipids that could be further converted to biodiesel. In order to understand the metabolic characteristics of Y. lipolytica at a systems level and to examine the potential for enhanced lipid production, a genome-scale compartmentalized metabolic network was reconstructed based on a combination of genome annotation and the detailed biochemical knowledge from multiple databases such as KEGG, ENZYME and BIGG. The information about protein and reaction associations of all the organisms in KEGG and Expasy-ENZYME database was arranged into an EXCEL file that can then be regarded as a new useful database to generate other reconstructions. The generated model iYL619_PCP accounts for 619 genes, 843 metabolites and 1,142 reactions including 236 transport reactions, 125 exchange reactions and 13 spontaneous reactions. The in silico model successfully predicted the minimal media and the growing abilities on different substrates. With flux balance analysis, single gene knockouts were also simulated to predict the essential genes and partially essential genes. In addition, flux variability analysis was applied to design new mutant strains that will redirect fluxes through the network and may enhance the production of lipid. This genome-scale metabolic model of Y. lipolytica can facilitate system-level metabolic analysis as well as strain development for improving the production of biodiesels and other valuable products by Y. lipolytica and other closely related oleaginous yeasts.  相似文献   

9.
We present an integrated analysis of the molecular repertoire of Chlamydomonas reinhardtii under reference conditions. Bioinformatics annotation methods combined with GCxGC/MS-based metabolomics and LC/MS-based shotgun proteomics profiling technologies have been applied to characterize abundant proteins and metabolites, resulting in the detection of 1069 proteins and 159 metabolites. Of the measured proteins, 204 currently do not have EST sequence support; thus a significant portion of the proteomics-detected proteins provide evidence for the validity of in silico gene models. Furthermore, the generated peptide data lend support to the validity of a number of proteins currently in the proposed model stage. By integrating genomic annotation information with experimentally identified metabolites and proteins, we constructed a draft metabolic network for Chlamydomonas. Computational metabolic modeling allowed an identification of missing enzymatic links. Some experimentally detected metabolites are not producible by the currently known and annotated enzyme set, thus suggesting entry points for further targeted gene discovery or biochemical pathway research. All data sets are made available as supplementary material as well as web-accessible databases and within the functional context via the Chlamydomonas-adapted MapMan annotation platform. Information of identified peptides is also available directly via the JGI-Chlamydomonas genomic resource database (http://genome.jgi-psf.org/Chlre3/Chlre3.home.html).  相似文献   

10.
The construction of dynamic metabolic models at reaction network level requires the use of mechanistic enzymatic rate equations that comprise a large number of parameters. The lack of knowledge on these equations and the difficulty in the experimental identification of their associated parameters, represent nowadays the limiting factor in the construction of such models. In this study, we compare four alternative modeling approaches based on Michaelis–Menten kinetics for the bi-molecular reactions and different types of simplified rate equations for the remaining reactions (generalized mass action, convenience kinetics, lin-log and power-law). Using the mechanistic model for Escherichia coli central carbon metabolism as a benchmark, we investigate the alternative modeling approaches through comparative simulations analyses. The good dynamic behavior and the powerful predictive capabilities obtained using the hybrid model composed of Michaelis–Menten and the approximate lin-log kinetics indicate that this is a possible suitable approach to model complex large-scale networks where the exact rate laws are unknown.  相似文献   

11.
Gene regulatory networks consist of direct interactions but also include indirect interactions mediated by metabolites and signaling molecules. We describe how these indirect interactions can be derived from a model of the underlying biochemical reaction network, using weak time-scale assumptions in combination with sensitivity criteria from metabolic control analysis. We apply this approach to a model of the carbon assimilation network in Escherichia coli. Our results show that the derived gene regulatory network is densely connected, contrary to what is usually assumed. Moreover, the network is largely sign-determined, meaning that the signs of the indirect interactions are fixed by the flux directions of biochemical reactions, independently of specific parameter values and rate laws. An inversion of the fluxes following a change in growth conditions may affect the signs of the indirect interactions though. This leads to a feedback structure that is at the same time robust to changes in the kinetic properties of enzymes and that has the flexibility to accommodate radical changes in the environment.  相似文献   

12.

Background

The iJO1366 reconstruction of the metabolic network of Escherichia coli is one of the most complete and accurate metabolic reconstructions available for any organism. Still, because our knowledge of even well-studied model organisms such as this one is incomplete, this network reconstruction contains gaps and possible errors. There are a total of 208 blocked metabolites in iJO1366, representing gaps in the network.

Results

A new model improvement workflow was developed to compare model based phenotypic predictions to experimental data to fill gaps and correct errors. A Keio Collection based dataset of E. coli gene essentiality was obtained from literature data and compared to model predictions. The SMILEY algorithm was then used to predict the most likely missing reactions in the reconstructed network, adding reactions from a KEGG based universal set of metabolic reactions. The feasibility of these putative reactions was determined by comparing updated versions of the model to the experimental dataset, and genes were predicted for the most feasible reactions.

Conclusions

Numerous improvements to the iJO1366 metabolic reconstruction were suggested by these analyses. Experiments were performed to verify several computational predictions, including a new mechanism for growth on myo-inositol. The other predictions made in this study should be experimentally verifiable by similar means. Validating all of the predictions made here represents a substantial but important undertaking.  相似文献   

13.
14.
In contrast to stoichiometric-based models, the development of large-scale kinetic models of metabolism has been hindered by the challenge of identifying kinetic parameter values and kinetic rate laws applicable to a wide range of environmental and/or genetic perturbations. The recently introduced ensemble modeling (EM) procedure provides a promising remedy to address these challenges by decomposing metabolic reactions into elementary reaction steps and incorporating all phenotypic observations, upon perturbation, in its model parameterization scheme. Here, we present a kinetic model of Escherichia coli core metabolism that satisfies the fluxomic data for wild-type and seven mutant strains by making use of the EM concepts. This model encompasses 138 reactions, 93 metabolites and 60 substrate-level regulatory interactions accounting for glycolysis/gluconeogenesis, pentose phosphate pathway, TCA cycle, major pyruvate metabolism, anaplerotic reactions and a number of reactions in other parts of the metabolism. Parameterization is performed using a formal optimization approach that minimizes the discrepancies between model predictions and flux measurements. The predicted fluxes by the model are within the uncertainty range of experimental flux data for 78% of the reactions (with measured fluxes) for both the wild-type and seven mutant strains. The remaining flux predictions are mostly within three standard deviations of reported ranges. Converting the EM-based parameters into a Michaelis–Menten equivalent formalism revealed that 35% of Km and 77% of kcat parameters are within uncertainty range of the literature-reported values. The predicted metabolite concentrations by the model are also within uncertainty ranges of metabolomic data for 68% of the metabolites. A leave-one-out cross-validation test to evaluate the flux prediction performance of the model showed that metabolic fluxes for the mutants located in the proximity of mutations used for training the model can be predicted more accurately. The constructed model and the parameterization procedure presented in this study pave the way for the construction of larger-scale kinetic models with more narrowly distributed parameter values as new metabolomic/fluxomic data sets are becoming available for E. coli and other organisms.  相似文献   

15.
Bacterial responses to environmental changes rely on a complex network of biochemical reactions. The properties of the metabolic network determining these responses can be divided into two groups: the stoichiometric properties, given by the stoichiometry matrix, and the kinetic/thermodynamic properties, given by the rate equations of the reaction steps. The stoichiometry matrix represents the maximal metabolic capabilities of the organism, and the regulatory mechanisms based on the rate laws could be considered as being responsible for the administration of these capabilities. Post-genomic reconstruction of metabolic networks provides us with the stoichiometry matrix of particular strains of microorganisms, but the kinetic aspects of in vivo rate laws are still largely unknown. Therefore, the validity of predictions of cellular responses requiring detailed knowledge of the rate equations is difficult to assert. In this paper, we show that by applying optimisation criteria to the core stoichiometric network of the metabolism of Escherichia coli, and including information about reversibility/irreversibility only of the reaction steps, it is possible to calculate bacterial responses to growth media with different amounts of glucose and galactose. The target was the minimisation of the number of active reactions (subject to attaining a growth rate higher than a lower limit) and subsequent maximisation of the growth rate (subject to the number of active reactions being equal to the minimum previously calculated). Using this two-level target, we were able to obtain by calculation four fundamental behaviours found experimentally: inhibition of respiration at high glucose concentrations in aerobic conditions, turning on of respiration when glucose decreases, induction of galactose utilisation when the system is depleted of glucose and simultaneous use of glucose and galactose as carbon sources when both sugars are present in low concentrations. Preliminary results of the coarse pattern of sugar utilisation were also obtained with a genome-scale E. coli reconstructed network, yielding similar qualitative results.  相似文献   

16.
The green picoalga Ostreococcus is emerging as a simple plant model organism, and two species, O. lucimarinus and O. tauri, have now been sequenced and annotated manually. To evaluate the completeness of the metabolic annotation of both species, metabolic networks of O. lucimarinus and O. tauri were reconstructed from the KEGG database, thermodynamically constrained, elementally balanced, and functionally evaluated. The draft networks contained extensive gaps and, in the case of O. tauri, no biomass components could be produced due to an incomplete Calvin cycle. To find and remove gaps from the networks, an extensive reference biochemical reaction database was assembled using a stepwise approach that minimized the inclusion of microbial reactions. Gaps were then removed from both Ostreococcus networks using two existing gap-filling methodologies. In the first method, a bottom-up approach, a minimal list of reactions was added to each model to enable the production of all metabolites included in our biomass equation. In the second method, a top-down approach, all reactions in the reference database were added to the target networks and subsequently trimmed away based on the sequence alignment scores of identified orthologues. Because current gap-filling methods do not produce unique solutions, a quality metric that includes a weighting for phylogenetic distance and sequence similarity was developed to distinguish between gap-filling results automatically. The draft O. lucimarinus and O. tauri networks required the addition of 56 and 70 reactions, respectively, in order to produce the same biomass precursor metabolites that were produced by our plant reference database.  相似文献   

17.
In this report, a genome-scale reconstruction of Bacillus subtilis metabolism and its iterative development based on the combination of genomic, biochemical, and physiological information and high-throughput phenotyping experiments is presented. The initial reconstruction was converted into an in silico model and expanded in a four-step iterative fashion. First, network gap analysis was used to identify 48 missing reactions that are needed for growth but were not found in the genome annotation. Second, the computed growth rates under aerobic conditions were compared with high-throughput phenotypic screen data, and the initial in silico model could predict the outcomes qualitatively in 140 of 271 cases considered. Detailed analysis of the incorrect predictions resulted in the addition of 75 reactions to the initial reconstruction, and 200 of 271 cases were correctly computed. Third, in silico computations of the growth phenotypes of knock-out strains were found to be consistent with experimental observations in 720 of 766 cases evaluated. Fourth, the integrated analysis of the large-scale substrate utilization and gene essentiality data with the genome-scale metabolic model revealed the requirement of 80 specific enzymes (transport, 53; intracellular reactions, 27) that were not in the genome annotation. Subsequent sequence analysis resulted in the identification of genes that could be putatively assigned to 13 intracellular enzymes. The final reconstruction accounted for 844 open reading frames and consisted of 1020 metabolic reactions and 988 metabolites. Hence, the in silico model can be used to obtain experimentally verifiable hypothesis on the metabolic functions of various genes.  相似文献   

18.
We have compared 12 genome-scale models of the Saccharomyces cerevisiae metabolic network published since 2003 to evaluate progress in reconstruction of the yeast metabolic network. We compared the genomic coverage, overlap of annotated metabolites, predictive ability for single gene essentiality with a selection of model parameters, and biomass production predictions in simulated nutrient-limited conditions. We have also compared pairwise gene knockout essentiality predictions for 10 of these models. We found that varying approaches to model scope and annotation reflected the involvement of multiple research groups in model development; that single-gene essentiality predictions were affected by simulated medium, objective function, and the reference list of essential genes; and that predictive ability for single-gene essentiality did not correlate well with predictive ability for our reference list of synthetic lethal gene interactions (R = 0.159). We conclude that the reconstruction of the yeast metabolic network is indeed gradually improving through the iterative process of model development, and there remains great opportunity for advancing our understanding of biology through continued efforts to reconstruct the full biochemical reaction network that constitutes yeast metabolism. Additionally, we suggest that there is opportunity for refining the process of deriving a metabolic model from a metabolic network reconstruction to facilitate mechanistic investigation and discovery. This comparative study lays the groundwork for developing improved tools and formalized methods to quantitatively assess metabolic network reconstructions independently of any particular model application, which will facilitate ongoing efforts to advance our understanding of the relationship between genotype and cellular phenotype.  相似文献   

19.
The initial genome‐scale reconstruction of the metabolic network of Escherichia coli K‐12 MG1655 was assembled in 2000. It has been updated and periodically released since then based on new and curated genomic and biochemical knowledge. An update has now been built, named iJO1366, which accounts for 1366 genes, 2251 metabolic reactions, and 1136 unique metabolites. iJO1366 was (1) updated in part using a new experimental screen of 1075 gene knockout strains, illuminating cases where alternative pathways and isozymes are yet to be discovered, (2) compared with its predecessor and to experimental data sets to confirm that it continues to make accurate phenotypic predictions of growth on different substrates and for gene knockout strains, and (3) mapped to the genomes of all available sequenced E. coli strains, including pathogens, leading to the identification of hundreds of unannotated genes in these organisms. Like its predecessors, the iJO1366 reconstruction is expected to be widely deployed for studying the systems biology of E. coli and for metabolic engineering applications.  相似文献   

20.
The human gut microbiota plays a central role in human well-being and disease. In this study, we present an integrated, iterative approach of computational modeling, in vitro experiments, metabolomics, and genomic analysis to accelerate the identification of metabolic capabilities for poorly characterized (anaerobic) microorganisms. We demonstrate this approach for the beneficial human gut microbe Faecalibacterium prausnitzii strain A2-165. We generated an automated draft reconstruction, which we curated against the limited biochemical data. This reconstruction modeling was used to develop in silico and in vitro a chemically defined medium (CDM), which was validated experimentally. Subsequent metabolomic analysis of the spent medium for growth on CDM was performed. We refined our metabolic reconstruction according to in vitro observed metabolite consumption and secretion and propose improvements to the current genome annotation of F. prausnitzii A2-165. We then used the reconstruction to systematically characterize its metabolic properties. Novel carbon source utilization capabilities and inabilities were predicted based on metabolic modeling and validated experimentally. This study resulted in a functional metabolic map of F. prausnitzii, which is available for further applications. The presented workflow can be readily extended to other poorly characterized and uncharacterized organisms to yield novel biochemical insights about the target organism.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号