Similar literature (20 documents found)
1.
The recent discovery of diverse very large viruses, such as the mimivirus, has fostered a profusion of hypotheses positing that these viruses define a new domain of life together with the three cellular ones (Archaea, Bacteria and Eucarya). It has also been speculated that they have played a key role in the origin of eukaryotes as donors of important genes or even as the structures at the origin of the nucleus. Thanks to the increasing availability of genome sequences for these giant viruses, those hypotheses are amenable to testing via comparative genomic and phylogenetic analyses. This task is made very difficult by the high evolutionary rate of viruses, which induces phylogenetic artefacts, such as long branch attraction, when inadequate methods are applied. It can be demonstrated that phylogenetic trees supporting viruses as a fourth domain of life are artefactual. In most cases, the presence of homologues of cellular genes in viruses is best explained by recurrent horizontal gene transfer from cellular hosts to their infecting viruses and not the opposite. Today, there is no solid evidence for the existence of a viral domain of life or for a significant involvement of viruses in the origin of the cellular domains.

2.
A crucially important part of the biosphere, the virosphere, is too often overlooked. Inclusion of the virosphere in the global picture of protein structure space reveals that 63 protein domain superfamilies in viruses do not have any structural and evolutionary relatives in modern cellular organisms. More than half of these have functions which are not virus-specific and thus might be a source of new folds and functions for cellular life. The number of viruses on the planet exceeds that of cells by an order of magnitude and viruses evolve up to six orders of magnitude faster. As a result, cellular species are subject to a constitutive 'flow-through' of new viral genetic material. Due to this and the relaxed evolutionary constraints in viruses, the transfer of domains between hosts and viruses could be a mechanism for accelerated protein evolution. The virosphere could be an engine for the genesis of protein structures, and may even have been so before the last universal common ancestor of cellular life.

3.
Viruses are the most abundant life form and infect practically all organisms. Consequently, these obligate parasites are a major cause of human suffering and economic loss. The Rossmann-like fold is the most populated fold among α/β-folds in the Protein Data Bank, and proteins containing a Rossmann-like fold constitute 22% of all known protein 3D structures. Thus, analysis of viral proteins containing Rossmann-like domains could provide an understanding of viral biology and evolution and could suggest possible targets for antiviral therapy. We provide a functional and evolutionary analysis of viral proteins containing a Rossmann-like fold found in the evolutionary classification of protein domains (ECOD) database developed in our lab. We identified 81 protein families of bacterial, archaeal, and eukaryotic viruses in light of their evolution-based ECOD classification and Pfam taxonomy. We defined their functional significance using enzymatic EC number assignments as well as domain-level family annotations.

4.
Mimivirus and Megavirus are the best characterized representatives of an expanding new family of giant viruses infecting Acanthamoeba. Their most distinctive features, megabase-sized genomes carried in particles of size comparable to that of small bacteria, fill the gap between the viral and cellular worlds. These giant viruses are also uniquely equipped with genes coding for central components of the translation apparatus. The presence of those genes, thought to be hallmarks of cellular organisms, revived fundamental questions about the evolutionary origin of these viruses and the link they might have with the emergence of eukaryotes. In this work, we focused on the Mimivirus-encoded translation termination factor gene, the detailed primary structure of which was elucidated using computational and experimental approaches. We demonstrated that the translation of this protein proceeds through two internal stop codons via two distinct recoding events: a frameshift and a readthrough, the combined occurrence of which is unique to these viruses. Unexpectedly, the viral gene carries an autoregulatory mechanism exclusively encountered in bacterial termination factors, though the viral sequence is related to the eukaryotic/archaeal class-I release factors. This finding is a hint that the virally-encoded translation functions may not be strictly redundant with those provided by the host. Lastly, the perplexing occurrence of a bacterial-like regulatory mechanism in a eukaryotic/archaeal homologous gene is yet another oddity brought about by the study of giant viruses.

5.
Holliday junction resolving enzymes are required by all life forms that catalyse homologous recombination, including all cellular organisms and many bacterial and eukaryotic viruses. Here we report the identification of three distinct Holliday junction resolving enzyme activities present in two highly divergent archaeal species. Both Sulfolobus and Pyrococcus share the Hjc activity, and in addition possess unique secondary activities (Hje and Hjr). We propose by analogy with the two other domains of life that the latter enzymes are viral in origin, suggesting the widespread existence of archaeal viruses that rely on homologous recombination as part of their life cycle.

6.
Protein–protein interactions play an essential role in the regulation of most cellular processes. The process of viral infection is no exception and many viral pathogenic strategies involve targeting and perturbing host–protein interactions. The characterization of the host protein subnetworks disturbed by invading viruses is a major goal of viral research and may help to reveal fundamental biological mechanisms and to identify new therapeutic strategies. To assist in this approach, we have developed a database, VirusMINT, which stores in a structured format most of the published interactions between viral and host proteomes. Although SH3 domains are the most ubiquitous and abundant class of protein binding modules, VirusMINT contains only a few interactions mediated by this domain class. To overcome this limitation, we have applied the whole interactome scanning experiment approach to identify interactions between 15 human SH3 domains and viral proline-rich peptides of two oncogenic viruses, human papillomavirus type 16 and human adenovirus A type 12. This approach identifies 114 new potential interactions between the human SH3 domains and proline-rich regions of the two viral proteomes.

7.
PDZ domains are protein-protein interaction modules that recognize specific C-terminal sequences to assemble protein complexes in multicellular organisms. By scanning billions of random peptides, we accurately map binding specificity for approximately half of the over 330 PDZ domains in the human and Caenorhabditis elegans proteomes. The domains recognize features of the last seven ligand positions, and we find 16 distinct specificity classes conserved from worm to human, significantly extending the canonical two-class system based on position -2. Thus, most PDZ domains are not promiscuous, but rather are fine-tuned for specific interactions. Specificity profiling of 91 point mutants of a model PDZ domain reveals that the binding site is highly robust, as all mutants were able to recognize C-terminal peptides. However, many mutations altered specificity for ligand positions both close and far from the mutated position, suggesting that binding specificity can evolve rapidly under mutational pressure. Our specificity map enables the prediction and prioritization of natural protein interactions, which can be used to guide PDZ domain cell biology experiments. Using this approach, we predicted and validated several viral ligands for the PDZ domains of the SCRIB polarity protein. These findings indicate that many viruses produce PDZ ligands that disrupt host protein complexes for their own benefit, and that highly pathogenic strains target PDZ domains involved in cell polarity and growth.

8.
Phylomat: an automated protein motif analysis tool for phylogenomics   Cited by: 2 (self-citations: 0, citations by others: 2)
Recent progress in genomics, proteomics, and bioinformatics enables unprecedented opportunities to examine the evolutionary history of molecular, cellular, and developmental pathways through phylogenomics. Accordingly, we have developed a motif analysis tool for phylogenomics (Phylomat, http://alg.ncsa.uiuc.edu/pmat) that scans predicted proteome sets for proteins containing highly conserved amino acid motifs or domains for in silico analysis of the evolutionary history of these motifs/domains. Phylomat enables the user to download results as full protein or extracted motif/domain sequences from each protein. Tables containing the percent distribution of a motif/domain in organisms normalized to proteome size are displayed. Phylomat can also align the set of full protein or extracted motif/domain sequences and predict a neighbor-joining tree from relative sequence similarity. Together, Phylomat serves as a user-friendly data-mining tool for the phylogenomic analysis of conserved sequence motifs/domains in annotated proteomes from the three domains of life.
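
The proteome-size normalization mentioned in this abstract can be illustrated with a minimal sketch. It is not Phylomat's code: the function name `motif_percent`, the toy sequences, and the choice of a regular expression as the motif representation are all assumptions made for illustration. The pattern used is a simplified form of the N-glycosylation sequon (N, then any residue except P, then S or T).

```python
import re


def motif_percent(proteomes, pattern):
    """Return {organism: percent of its proteins that contain the motif}.

    `proteomes` maps an organism name to a list of protein sequences.
    Dividing by the number of proteins normalizes for proteome size.
    """
    regex = re.compile(pattern)
    result = {}
    for organism, proteins in proteomes.items():
        if not proteins:
            result[organism] = 0.0  # empty proteome: nothing to count
            continue
        hits = sum(1 for seq in proteins if regex.search(seq))
        result[organism] = 100.0 * hits / len(proteins)
    return result


# Hypothetical toy data, not real proteomes.
toy_proteomes = {
    "organism_a": ["MKNASL", "MGGSTP", "MNPSA", "AANGTK"],
    "organism_b": ["MAAA", "MNKTL", "GGGG", "LLLL"],
}

# Simplified N-glycosylation sequon: N, not P, then S or T.
SEQUON = r"N[^P][ST]"

for name, percent in motif_percent(toy_proteomes, SEQUON).items():
    print(f"{name}: {percent:.1f}% of proteins contain the motif")
```

Reporting a percentage rather than a raw hit count is what makes organisms with very different proteome sizes comparable in one table.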

9.
The origin of life has puzzled molecular scientists for over half a century. Yet fundamental questions remain unanswered, including which came first, the metabolic machinery or the encoding nucleic acids. In this study we take a protein-centric view and explore the ancestral origins of proteins. Protein domain structures in proteomes are highly conserved and embody molecular functions and interactions that are needed for cellular and organismal processes. Here we use domain structure to study the evolution of molecular function in the protein world. Timelines describing the age and function of protein domains at fold, fold superfamily, and fold family levels of structural complexity were derived from a structural phylogenomic census in hundreds of fully sequenced genomes. These timelines unfold congruent hourglass patterns in rates of appearance of domain structures and functions, functional diversity, and hierarchical complexity, and reveal a gradual build-up of protein repertoires associated with metabolism, translation and DNA, in that order. The most ancient domain architectures were hydrolase enzymes and the first translation domains had catalytic functions for the aminoacylation and the molecular switch-driven transport of RNA. Remarkably, the most ancient domains had metabolic roles, did not interact with RNA, and preceded the gradual build-up of translation. In fact, the first translation domains also had a metabolic origin and were only later followed by specialized translation machinery. Our results explain how the generation of structure in the protein world and the concurrent crystallization of translation and diversified cellular life created further opportunities for proteomic diversification.

10.
Viruses, as obligate intracellular parasites, are the pathogens that have the most intimate relationship with their host, and as such, their genomes have been shaped directly by interactions with the host proteome. Every step of the viral life cycle, from entry to budding, is orchestrated through interactions with cellular proteins. Accordingly, viruses will hijack and manipulate these proteins utilising any achievable mechanism. Yet, the extensive interactions of viral proteomes have yielded a conundrum: how do viruses commandeer so many diverse pathways and processes, given the obvious spatial constraints imposed by their compact genomes? One important approach is slowly being revealed: the extensive mimicry of host protein short linear motifs (SLiMs).

11.
Viruses constantly adapt to and modulate the host environment during replication and propagation. Both DNA and RNA viruses encode multifunctional proteins that interact with and modify host cell proteins. While viral genomes were the first complete sequences known, the corresponding proteomes are only now being elucidated, with some surprising results. Even more daunting is the task of globally monitoring the impact of viral infection on the proteome of the host cell, and many technical hurdles must still be overcome in order to facilitate robust and reproducible measurements. Further complicating the picture is the dynamic nature of proteins, including post-translational modifications, enzymatic cleavage and activation or destruction by proteolytic events. Nevertheless, several promising studies have been published using high-throughput methods directly measuring protein abundance. In particular, quantitative or semiquantitative mass spectrometry-based analyses of viral and cellular proteomes are now being used to characterize viruses and their host interactions. In addition, the full set of interactions between viral and host proteins, the interactome, is beginning to emerge, with often unexpected interactions that need to be carefully validated. In this review, we will discuss two major areas of viral proteomics: first, virion proteomics (such as the protein characterization of viral particles) and second, proteoviromics, including the viral protein interactomics and the quantitative analysis of host cell proteome during viral infection.

13.
14.
15.
Requirements for species-specific papovavirus DNA replication.   Cited by: 13 (self-citations: 6, citations by others: 7)
Replication of papovavirus DNA requires a functional replication origin, a virus-encoded protein, large T antigen, and species-specific permissive factors. How these components interact to initiate and sustain viral DNA replication is not known. Toward that end, we have attempted to identify the viral target(s) of permissive factors. The functionally defined replication origins of polyomavirus and simian virus 40, two papovaviruses that replicate in different species (mice and monkeys, respectively), are composed of two functionally distinct domains: a core domain and an auxiliary domain. The origin cores of the two viruses are remarkably similar in primary structure and have common binding sites for large T antigen. By contrast, their auxiliary domains share few sequences and serve as binding sites for cellular proteins. It seemed plausible, therefore, that if cellular permissive factors interacted with the replication origin, their targets were likely to be in the auxiliary domain. To test this hypothesis we constructed hybrid origins for DNA replication that were composed of the auxiliary domain of one virus and the origin core of the other and assessed their capacity to replicate in a number of mouse and monkey cell lines, which express the large T antigen of one or the other virus. The results of this analysis showed that the auxiliary domains of the viral replication origins could substitute for one another in DNA replication, provided that the viral origin core and its cognate large T antigen were present in a permissive cellular milieu. Surprisingly, the large T antigens of the viruses could not substitute for one another, regardless of the species of origin of the host cell, even though the two large T antigens bind to the same sequence motif in vitro. 
These results suggest that species-specific permissive factors do not interact with the origin-auxiliary domains but, rather, with either the origin core or the large T antigen or with both components to effect DNA replication.

16.
The infection cycle of viruses creates many opportunities for the exchange of genetic material with the host. Many viruses integrate their sequences into the genome of their host for replication. These processes may lead to the viral acquisition of host sequences. Such sequences are prone to accumulation of mutations and deletions. However, in rare instances, sequences acquired from a host become beneficial for the virus. We searched for unexpected sequence similarity among the 900,000 viral proteins and all proteins from cellular organisms. Here, we focus on viruses that infect metazoa. The high-conservation analysis yielded 187 instances of highly similar viral-host sequences. Only a small number of them represent viruses that hijacked host sequences. The low-conservation sequence analysis utilizes the Pfam family collection. About 5% of the 12,000 statistical models archived in Pfam are composed of viral-metazoan proteins. In about half of Pfam families, we provide indirect support for the directionality from the host to the virus. The other families are either wrongly annotated or reflect an extensive sequence exchange between the viruses and their hosts. In about 75% of cross-taxa Pfam families, the viral proteins are significantly shorter than their metazoan counterparts. The tendency for viral proteins to be shorter than their related host proteins reflects the acquisition of only a fragment of the host gene, the elimination of an internal domain and shortening of the linkers between domains. We conclude that, along viral evolution, the host-originated sequences accommodate simplified domain compositions. We postulate that the trimmed proteins act by interfering with the fundamental function of the host including intracellular signaling, post-translational modification, protein-protein interaction networks and cellular trafficking. We compiled a collection of hijacked protein sequences. 
These sequences are attractive targets for manipulation of viral infection.

17.
18.
Over the past several years fungal infections have shown an increasing incidence in the susceptible population, and caused high mortality rates. In parallel, multi-resistant fungi are emerging in human infections. Therefore, the identification of new potential antifungal targets is a priority. The first task of this study was to analyse the protein domain and domain architecture content of the 137 fungal proteomes (corresponding to 111 species) available in UniProtKB (UniProt KnowledgeBase) by January 2013. The resulting list of core and exclusive domain and domain architectures is provided in this paper. It delineates the different levels of fungal taxonomic classification: phylum, subphylum, order, genus and species. The analysis highlighted Aspergillus as the most diverse genus in terms of exclusive domain content. In addition, we also investigated which domains could be considered promiscuous in the different organisms. As an application of this analysis, we explored three different ways to detect potential targets for antifungal drugs. First, we compared the domain and domain architecture content of the human and fungal proteomes, and identified those domains and domain architectures only present in fungi. Secondly, we looked for information regarding fungal pathways in public repositories, where proteins containing promiscuous domains could be involved. Three pathways were identified as a result: lovastatin biosynthesis, xylan degradation and biosynthesis of siroheme. Finally, we classified a subset of the studied fungi in five groups depending on their occurrence in clinical samples. We then looked for exclusive domains in the groups that were more relevant clinically and determined which of them had the potential to bind small molecules. Overall, this study provides a comprehensive analysis of the available fungal proteomes and shows three approaches that can be used as a first step in the detection of new antifungal targets.

19.
Intrinsically disordered proteins and intrinsically disordered protein regions are highly abundant in nature. However, the quantitative and qualitative measures of protein intrinsic disorder in species with known genomes are still not available. Furthermore, although the correlation between a high fraction of disordered residues and advanced species has been reported, the details of this correlation and the connection between the disorder content and proteome complexity have not yet been reported. To fill this gap, we analysed entire proteomes of 3484 species from three domains of life (archaea, bacteria and eukaryotes) and from viruses. Our analysis revealed that the evolutionary process is characterized by distinctive patterns of changes in the protein intrinsic disorder content. We show here that viruses are characterized by the widest spread of the proteome disorder content (the percentage of disordered residues ranges from 7.3% in human coronavirus NL63 to 77.3% in Avian carcinoma virus). For several organisms, a clear correlation is seen between their disorder contents and habitats. In multicellular eukaryotes, there is a weak correlation between the complexity of an organism (evaluated as a number of different cell types) and its overall disorder content. For both the prokaryotes and eukaryotes, the disorder content is generally independent of the proteome size. However, disorder shows a sharp increase associated with the transition from prokaryotic to eukaryotic cells. This suggests that the increased disorder content in eukaryotic proteomes might be used by nature to deal with the increased cell complexity due to the appearance of the various cellular compartments.
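
The "percentage of disordered residues" quoted in this abstract is a proteome-wide residue count. The sketch below shows one way such a figure could be computed. It makes several assumptions: the abstract does not say which predictor or threshold was used, so the per-residue scores, the 0.5 cut-off, and the function name `proteome_disorder_percent` are all hypothetical.

```python
def proteome_disorder_percent(per_protein_scores, threshold=0.5):
    """Percent of residues in a proteome whose disorder score meets the threshold.

    `per_protein_scores` is a list with one list of per-residue scores per
    protein. The scores and the 0.5 default are assumptions; any disorder
    predictor that emits per-residue values could supply them.
    """
    total = 0
    disordered = 0
    for scores in per_protein_scores:
        total += len(scores)
        disordered += sum(1 for s in scores if s >= threshold)
    if total == 0:
        raise ValueError("proteome has no residues")
    return 100.0 * disordered / total


# Hypothetical scores for a two-protein toy proteome (10 residues in total).
toy_scores = [
    [0.1, 0.9, 0.8, 0.2],
    [0.6, 0.4, 0.3, 0.7, 0.1, 0.2],
]

print(f"disorder content: {proteome_disorder_percent(toy_scores):.1f}%")
```

Pooling residues across all proteins, rather than averaging per-protein percentages, keeps long proteins from being under-weighted.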

20.
Comparative studies of the proteomes from different organisms have provided valuable information about protein domain distribution in the kingdoms of life. Earlier studies have been limited by the fact that only about 50% of the proteomes could be matched to a domain. Here, we have extended these studies by including less well-defined domain definitions, Pfam-B and clustered domains, MAS, in addition to Pfam-A and SCOP domains. It was found that a significant fraction of these domain families are homologous to Pfam-A or SCOP domains. Further, we show that all regions that do not match a Pfam-A or SCOP domain contain a significantly higher fraction of disordered structure. These unstructured regions may be contained within orphan domains or function as linkers between structured domains. Using several different definitions we have re-estimated the number of multi-domain proteins in different organisms and found that several methods all predict that eukaryotes have approximately 65% multi-domain proteins, while prokaryotes have approximately 40% multi-domain proteins. However, these numbers are strongly dependent on the exact choice of cut-off for domains in unassigned regions. In conclusion, all eukaryotes have similar fractions of multi-domain proteins and disorder, whereas a high fraction of repeating domains is characteristic only of multicellular eukaryotes. This implies a role for repeats in cell-cell contacts while the other two features are important for intracellular functions.
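
The dependence of the multi-domain estimate on the cut-off for unassigned regions can be made concrete with a small sketch. It is an illustration under stated assumptions, not the authors' procedure: here an unassigned stretch is counted as one putative domain when it is at least `cutoff` residues long, domain intervals are 0-based and half-open, and the function names and toy proteins are hypothetical.

```python
def count_domains(length, assigned, cutoff):
    """Count assigned domains plus unassigned stretches of >= `cutoff` residues.

    `assigned` is a list of (start, end) half-open intervals. Treating a long
    unassigned stretch as one putative domain is an assumption of this sketch.
    """
    count = len(assigned)
    position = 0
    for start, end in sorted(assigned):
        if start - position >= cutoff:  # gap before this domain
            count += 1
        position = max(position, end)
    if length - position >= cutoff:  # unassigned C-terminal tail
        count += 1
    return count


def multi_domain_percent(proteins, cutoff):
    """Percent of proteins with at least two domains under the given cut-off."""
    multi = sum(
        1 for length, assigned in proteins
        if count_domains(length, assigned, cutoff) >= 2
    )
    return 100.0 * multi / len(proteins)


# Hypothetical toy proteome: (protein length, assigned domain intervals).
toy_proteins = [
    (300, [(0, 100), (120, 250)]),
    (200, [(0, 100)]),
    (80, [(0, 80)]),
    (400, [(150, 250)]),
]

for cutoff in (50, 150):
    percent = multi_domain_percent(toy_proteins, cutoff)
    print(f"cutoff {cutoff}: {percent:.1f}% multi-domain")
```

In this toy set the estimate moves from 75% to 50% when the cut-off is raised from 50 to 150 residues, which is the kind of sensitivity the abstract cautions about.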
