首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Structural analyses of a hypothetical minimal metabolism   总被引:2,自引:0,他引:2  
By integrating data from comparative genomics and large-scale deletion studies, we previously proposed a minimal gene set comprising 206 protein-coding genes. To evaluate the consistency of the metabolism encoded by such a minimal genome, we have carried out a series of computational analyses. Firstly, the topology of the minimal metabolism was compared with that of the reconstructed networks from natural bacterial genomes. Secondly, the robustness of the metabolic network was evaluated by simulated mutagenesis and, finally, the stoichiometric consistency was assessed by automatically deriving the steady-state solutions from the reaction set. The results indicated that the proposed minimal metabolism presents stoichiometric consistency and that it is organized as a complex power-law network with topological parameters falling within the expected range for a natural metabolism of its size. The robustness analyses revealed that most random mutations do not alter the topology of the network significantly, but do cause significant damage by preventing the synthesis of several compounds or compromising the stoichiometric consistency of the metabolism. The implications that these results have on the origins of metabolic complexity and the theoretical design of an artificial minimal cell are discussed.  相似文献   

2.
The availability of complete genome sequences of cellular life forms creates the opportunity to explore the functional content of the genomes and evolutionary relationships between them at a new qualitative level. With the advent of these sequences, the construction of a minimal gene set sufficient for sustaining cellular life and reconstruction of the genome of the last common ancestor of bacteria, eukaryotes, and archaea become realistic, albeit challenging, research projects. A version of the minimal gene set for modern-type cellular life derived by comparative analysis of two bacterial genomes, those of Haemophilus influenzae and Mycoplasma genitalium, consists of ∼250 genes. A comparison of the protein sequences encoded in these genes with those of the proteins encoded in the complete yeast genome suggests that the last common ancestor of all extant life might have had an RNA genome.  相似文献   

3.
Protein content of minimal and ancestral ribosome   总被引:2,自引:0,他引:2       下载免费PDF全文
Minimal genome approaches seek to define the smallest gene complement compatible with modern-type cellular life on Earth. A consensus of computational and experimental approaches indicates that a minimal genome is close to 300 protein-coding genes, if a rich medium is provided for cell growth. I relate ribosomal gene content in completely sequenced genomes to ribosomal subunit structure and approximate the protein components of the putative minimal ribosome and the ribosome of the Last Universal Common Ancestor of Life. Both sets contain between 35 and 40 proteins. There is evidence of protein-protein and protein-RNA displacement in the evolution of both ribosomal subunits.  相似文献   

4.
Approximately 50% of the predicted protein-coding genes of the Trypanosoma cruzi CL Brener strain are annotated as hypothetical or conserved hypothetical proteins. To further characterize these genes, we generated 1161 open-reading frame expressed sequence tags (ORESTES) from the mammalian stages of the VL10 human strain. Sequence clustering resulted in 435 clusters, consisting of 339 singletons and 96 contigs. Significant matches to the T. cruzi predicted gene database were found for ~94% contigs and ~69% singletons. These included genes encoding surface proteins, known to be intensely expressed in the parasite mammalian stages and implicated in host cell invasion and/or immune evasion mechanisms. Among 151 contigs and singletons with similarity to predicted hypothetical protein-coding genes and conserved hypothetical protein-coding genes, 83% showed no match with T. cruzi EST and/or proteome databases. These ORESTES are the first experimental evidence that the corresponding genes are in fact transcribed. Sequences with no significant match were searched against several T. cruzi and National Center for Biotechnology Information non-redundant sequence databases. The ORESTES analysis indicated that 124 predicted conserved hypothetical protein-coding genes and 27 predicted hypothetical protein-coding genes annotated in the CL Brener genome are transcribed in the VL10 mammalian stages. Six ORESTES annotated as hypothetical protein-coding genes showing no match to EST and/or proteome databases were confirmed by Northern blot in VL10. The generation of this set of ORESTES complements the T. cruzi genome annotation and suggests new stage-regulated genes encoding hypothetical proteins.  相似文献   

5.
The minimal genome concept   总被引:7,自引:0,他引:7  
Complete genome sequences are becoming available for a large number of diverse species. Quantification of gene content, of gene family expansion, of orthologous gene conservation, as well as their displacement, are now possible - laying the ground for the estimation of the minimal set of proteins sufficient for cellular life. The consensus of computational results suggests a set close to 300 genes. These predictions will be evaluated by engineering of small bacterial genomes.  相似文献   

6.
Genome sequences are annotated by computational prediction of coding sequences, followed by similarity searches such as BLAST, which provide a layer of possible functional information. While the existence of processes such as alternative splicing complicates matters for eukaryote genomes, the view of bacterial genomes as a linear series of closely spaced genes leads to the assumption that computational annotations that predict such arrangements completely describe the coding capacity of bacterial genomes. We undertook a proteomic study to identify proteins expressed by Pseudomonas fluorescens Pf0-1 from genes that were not predicted during the genome annotation. Mapping peptides to the Pf0-1 genome sequence identified sixteen non-annotated protein-coding regions, of which nine were antisense to predicted genes, six were intergenic, and one read in the same direction as an annotated gene but in a different frame. The expression of all but one of the newly discovered genes was verified by RT-PCR. Few clues as to the function of the new genes were gleaned from informatic analyses, but potential orthologs in other Pseudomonas genomes were identified for eight of the new genes. The 16 newly identified genes improve the quality of the Pf0-1 genome annotation, and the detection of antisense protein-coding genes indicates the under-appreciated complexity of bacterial genome organization.  相似文献   

7.
With recent breakthroughs in experimental microbiology making it possible to synthesize and implant an entire genome to create a living cell, the challenge of constructing a working blueprint for the first truly minimal synthetic organism is more important than ever. Here we review the significant progress made in the design and creation of a minimal organism. We discuss how comparative genomes, gene essentiality data, naturally small genomes, and metabolic modeling are all being applied to produce a catalogue of the biological functions essential for life. We compare the minimal gene sets from three published sources with functions identified in 13 existing gene essentiality datasets. We examine how genome-scale metabolic models have been applied to design a minimal metabolism for growth in simple and complex media. Additionally, we survey the progress of efforts to construct a minimal organism, either through implementation of combinatorial deletions in Bacillus subtilis and Escherichia coli or through the synthesis and implantation of synthetic genomes.  相似文献   

8.
A large number of complete microorganism genomes has been sequenced and submitted to the public database and then incorporated into our complete genome database, Genome Information Broker (GIB, http://gib.genes.nig.ac.jp/). However, when comparative genomics is carried out, researchers must be aware that there are protein-coding genes not confirmed by homology or motif search and that reliable protein-coding genes are missing. Therefore, we developed a protocol (Gene Trek in Prokaryote Space, GTPS) for finding possible protein-coding genes in bacterial genomes. GTPS assigns a degree of reliability to predicted protein-coding genes. We first systematically applied the protocol to the complete genomes of all 123 bacterial species and strains that were publicly available as of July 2003, and then to those of 183 species and strains available as of September 2004. We found a number of incorrect genes and several new ones in the genome data in question. We also found a way to estimate the total number of orthologous genes in the bacterial world.  相似文献   

9.
With the completion of the sequence of the first bacterial genomes, scientists have been able to address the question: How many genes are required for cell viability? In attempting to reply to this question, the concept of the minimal gene set was developed and validated by systematic gene disruption. In a similar manner, whole genome comparisons and systematic Knock-Out have been performed in eukaryotes and have led to the identification to date of the set of essential genes in yeast and C. elegans. In the plant kingdom, the sequence of the Arabidopsis genome together with large-scale functional genomics programs now allow us to address the question of essentiality in Arabidopsis. These concerted efforts have resulted in the identification to date of up to 219 genes essential for seed development (EMBRYO-DEFECTIVE, EMB, genes). With this basic knowledge, we can start a valid comparison of essentiality in Arabidopsis and in other eukaryotes based on functional categories and orthologous relationships. Furthermore, the function of the EMB genes in the particular context of eukaryote evolution driven by whole genome duplications and selective gene loss will be discussed.  相似文献   

10.
11.
12.

Background  

Gene losses played a role which may have been as important as gene and genome duplications and rearrangements, in modelling today species' genomes from a common ancestral set of genes. The set and diversity of protein-coding genes in a species has direct output at the functional level. While gene losses have been reported in all the major lineages of the metazoan tree of life, none have proposed a focus on specific losses in the vertebrates and mammals lineages. In contrast, genes lost in protostomes (i.e. arthropods and nematodes) but still present in vertebrates have been reported and extensively detailed. This probable over-anthropocentric way of comparing genomes does not consider as an important phenomena, gene losses in species that are usually described as "higher". However reporting universally conserved genes throughout evolution that have recently been lost in vertebrates and mammals could reveal interesting features about the evolution of our genome, particularly if these losses can be related to losses of capability.  相似文献   

13.
Liu W  Fang L  Li M  Li S  Guo S  Luo R  Feng Z  Li B  Zhou Z  Shao G  Chen H  Xiao S 《PloS one》2012,7(4):e35698
Mycoplasma, the smallest self-replicating organism with a minimal metabolism and little genomic redundancy, is expected to be a close approximation to the minimal set of genes needed to sustain bacterial life. This study employs comparative evolutionary analysis of twenty Mycoplasma genomes to gain an improved understanding of essential genes. By analyzing the core genome of mycoplasmas, we finally revealed the conserved essential genes set for mycoplasma survival. Further analysis showed that the core genome set has many characteristics in common with experimentally identified essential genes. Several key genes, which are related to DNA replication and repair and can be disrupted in transposon mutagenesis studies, may be critical for bacteria survival especially over long period natural selection. Phylogenomic reconstructions based on 3,355 homologous groups allowed robust estimation of phylogenetic relatedness among mycoplasma strains. To obtain deeper insight into the relative roles of molecular evolution in pathogen adaptation to their hosts, we also analyzed the positive selection pressures on particular sites and lineages. There appears to be an approximate correlation between the divergence of species and the level of positive selection detected in corresponding lineages.  相似文献   

14.
李金玉  杨姗  崔玉军  王涛  滕越 《遗传》2021,(2):142-159
具有最小基因组的细菌只包含维持自我生命复制所必需的基因,其作为一种潜在的工业生产平台具有诸多优势.由于高通量DNA测序和合成技术的发展,目前已经构建了多种缩减基因组的菌株.本文首先介绍了最小基因组的概念,其次总结了细菌必需基因的相关研究进展,然后梳理了人工缩减与合成微生物基因组的相关工作,最后探讨了在设计和组装基因组的...  相似文献   

15.
Essential genes are absolutely required for cell survival. Determination of the universal minimal set of genes needed to sustain life is, therefore, expected to contribute greatly to our understanding of life at its simplest level, with applications in medicine and synthetic biology. The search for the minimal genome has led to the identification of often variable gene sets. We argue here that, based on the outcome of these analyses, it is becoming increasingly evident that some genes, and the functions encoded by them, are absolutely necessary for the survival of any living entity, whereas others can be omitted. We also examine ways of determining the minimal genome and discuss possible practical applications of a minimal cell.  相似文献   

16.
The bacterial core genome is of intense interest and the volume of whole genome sequence data in the public domain available to investigate it has increased dramatically. The aim of our study was to develop a model to estimate the bacterial core genome from next-generation whole genome sequencing data and use this model to identify novel genes associated with important biological functions. Five bacterial datasets were analysed, comprising 2096 genomes in total. We developed a Bayesian decision model to estimate the number of core genes, calculated pairwise evolutionary distances (p-distances) based on nucleotide sequence diversity, and plotted the median p-distance for each core gene relative to its genome location. We designed visually-informative genome diagrams to depict areas of interest in genomes. Case studies demonstrated how the model could identify areas for further study, e.g. 25% of the core genes with higher sequence diversity in the Campylobacter jejuni and Neisseria meningitidis genomes encoded hypothetical proteins. The core gene with the highest p-distance value in C. jejuni was annotated in the reference genome as a putative hydrolase, but further work revealed that it shared sequence homology with beta-lactamase/metallo-beta-lactamases (enzymes that provide resistance to a range of broad-spectrum antibiotics) and thioredoxin reductase genes (which reduce oxidative stress and are essential for DNA replication) in other C. jejuni genomes. Our Bayesian model of estimating the core genome is principled, easy to use and can be applied to large genome datasets. This study also highlighted the lack of knowledge currently available for many core genes in bacterial genomes of significant global public health importance.  相似文献   

17.
18.
The three green algal mitochondrial genomes completely sequenced to date — those of Chlamydomonas reinhardtii Dangeard, Chlamydomonas eugametos Gerloff, and Prototheca wickerhamii Soneda & Tubaki — revealed very different mitochondrial genome organizations and sequence affiliations. The Chlamydomonas genomes resemble the ciliate / fungal / animal counterparts, and the Prototheca genome resembles land plant homologues. This review points out that all the green algal mitochondrial genomes examined to date resemble either the Chlamydomonas or the Prototheca mitochondrial genome; the Chlamydomonas- like mitochondrial genomes are small and have a reduced gene content (no ribosomal protein or 5S rRNA genes and only a few protein-coding and tRNA genes) and fragmented and scrambled rRNA coding regions, whereas the Prototheca- like mitochondrial genomes are larger and have a larger set of protein-coding genes (including ribosomal protein genes), more tRNA genes, and 5S rRNA and conventional continuous small-subunit (SSU) and large-subunit (LSU) rRNA coding regions. It appears, therefore, that the differences previously observed between the mitochondrial genomes of C. reinhardtii and P. wickerhamii extend to the two green algal mitochondrial lineages to which they belong and are significant enough to raise questions about the causes and mechanisms responsible for such contrasting evolutionary strategies among green algae. This review suggests an integrative approach in explaining the occurrence of distinct evolutionary strategies and apparent phylogenetic affiliations among the known green algal mitochondrial lineages. The observed differences could be the result of distinct genetic potentials differentiated during the previous evolutionary history of the flagellate ancestors and / or of subsequent changes in habitat and life history of the more advanced green algal lineages.  相似文献   

19.
With the completion of the Human Genome Project in 2003, many new projects to sequence bacterial genomes were started and soon many complete bacterial genome sequences were available. The sequenced genomes of pathogenic bacteria provide useful information for understanding host-pathogen interactions. These data prove to be a new weapon in fighting against pathogenic bacteria by providing information about potential drug targets. But the limitation of computational tools for finding potential drug targets has hindered the process and further experimental analysis. There are many in silico approaches proposed for finding drug targets but only few have been automated. One such approach finds essential genes in bacterial genomes with no human homologue and predicts these as potential drug targets. The same approach is used in our tool. T-iDT, a tool for the identification of drug targets, finds essential genes by comparing a bacterial gene set against DEG (Database of Essential Genes) and excludes homologue genes by comparing against a human protein database. The tool predicts both the set of essential genes as well as potential target genes for the given genome. The tool was tested with Mycobacterium tuberculosis and results were validated. With default parameters, the tool predicted 236 essential genes and 52 genes to encode potential drug targets. A pathway-based approach was used to validate these potential drug target genes. The pathway in which the products of these genes are involved was determined. Our analysis shows that almost all these pathways are very essential for the bacterial survival and hence these genes encode possible drug targets. Our tool provides a fast method for finding possible drug targets in bacterial genomes with varying stringency level. The tool will be helpful in finding possible drug targets in various pathogenic organisms and can be used for further analysis in novel therapeutic drug development. The tool can be downloaded from http://www.milser.co.in/research.htm and http://www.srmbioinformatics.edu.in/ forum.htm.  相似文献   

20.
To explore the mitochondrial genes of the Cruciferae family, the mitochondrial genome of Raphanus sativus (sat) was sequenced and annotated. The circular mitochondrial genome of sat is 239,723 bp and includes 33 protein-coding genes, three rRNA genes and 17 tRNA genes. The mitochondrial genome also contains a pair of large repeat sequences 5.9 kb in length, which may mediate genome reorga-nization into two sub-genomic circles, with predicted sizes of 124.8 kb and 115.0 kb, respectively. Furthermore, gene evolution of mitochondrial genomes within the Cruciferae family was analyzed using sat mitochondrial type (mitotype), together with six other re-ported mitotypes. The cruciferous mitochondrial genomes have maintained almost the same set of functional genes. Compared with Cycas taitungensis (a representative gymnosperm), the mitochondrial genomes of the Cruciferae have lost nine protein-coding genes and seven mitochondrial-like tRNA genes, but acquired six chloroplast-like tRNAs. Among the Cruciferae, to maintain the same set of genes that are necessary for mitochondrial function, the exons of the genes have changed at the lowest rates, as indicated by the numbers of single nucleotide polymorphisms. The open reading frames (ORFs) of unknown function in the cruciferous genomes are not conserved. Evolutionary events, such as mutations, genome reorganizations and sequence insertions or deletions (indels), have resulted in the non- conserved ORFs in the cruciferous mitochondrial genomes, which is becoming significantly different among mitotypes. This work represents the first phylogenic explanation of the evolution of genes of known function in the Cruciferae family. It revealed significant variation in ORFs and the causes of such variation.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号