首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 9 毫秒
1.
Zinc is one of the metal ions essential for life, as it is required for the proper functioning of a large number of proteins. Despite its importance, the annotation of zinc-binding proteins in gene banks or protein domain databases still has significant room for improvement. In the present work, we compiled a list of known zinc-binding protein domains and of known zinc-binding sequence motifs (zinc-binding patterns), and then used them jointly to analyze the proteome of 57 different organisms to obtain an overview of zinc usage by archaeal, bacterial, and eukaryotic organisms. Zinc-binding proteins are an abundant fraction of these proteomes, ranging between 4% and 10%. The number of zinc-binding proteins correlates linearly with the total number of proteins encoded by the genome of an organism, but the proportionality constant of Eukaryota (8.8%) is significantly higher than that observed in Bacteria and Archaea (from 5% to 6%). Most of this enrichment is due to the larger portfolio of regulatory proteins in Eukaryota.  相似文献   

2.
Andreini C  Banci L  Bertini I  Elmi S  Rosato A 《Proteins》2007,67(2):317-324
Metalloproteins are proteins capable of binding one or more metal ions, which are often required for their biological function or for regulation of their activities or for structural purposes. In high-throughput genome-level protein investigation efforts, such as Structural Genomics, the systematic experimental characterization of metal-binding properties (i.e. the investigation of the metalloproteome) is not always pursued, and remains far from trivial. In the present work we have applied a bioinformatic approach to investigate the occurrence of (putative) non-heme iron-binding proteins in 57 different organisms spanning the entire tree of life. It is found that the non-heme iron-proteome constitutes between 1% and 10% of the entire proteome of an organism. However, the iron-proteome constitutes a higher fraction of the proteome in archaea (on average 7.1% +/- 2.1%) than in bacteria (3.9% +/- 1.6%) and in eukaryota (1.1% +/- 0.4%). The analysis of the function of each putative iron-protein identified suggests that extant organisms have inherited the large majority of their iron-proteome from the last common ancestor.  相似文献   

3.

Background

Chromosomal orthologs can reveal the shared ancestral gene set and their evolutionary trends. Additionally, physico-chemical properties of encoded proteins could provide information about functional adaptation and ecological niche requirements.

Results

We analyzed 7080 genes (five groups of 1416 orthologs each) from Rhizobiales species (S. meliloti, R. etli, and M. loti, plant symbionts; A. tumefaciens, a plant pathogen; and B. melitensis, an animal pathogen). We evaluated their phylogenetic relationships and observed three main topologies. The first, with closer association of R. etli to A. tumefaciens; the second with R. etli closer to S. meliloti; and the third with A. tumefaciens and S. meliloti as the closest pair. This was not unusual, given the close relatedness of these three species. We calculated the synonymous (dS) and nonsynonymous (dN) substitution rates of these orthologs, and found that informational and metabolic functions showed relatively low dN rates; in contrast, genes from hypothetical functions and cellular processes showed high dN rates. An alternative measure of sequence variability, percentage of changes by species, was used to evaluate the most specific proportion of amino acid residues from alignments. When dN was compared with that measure a high correlation was obtained, revealing that much of evolutive information was extracted with the percentage of changes by species at the amino acid level. By analyzing the sequence variability of orthologs with a set of five properties (polarity, electrostatic charge, formation of secondary structures, molecular volume, and amino acid composition), we found that physico-chemical characteristics of proteins correlated with specific functional roles, and association of species did not follow their typical phylogeny, probably reflecting more adaptation to their life styles and niche preferences. In addition, orthologs with low dN rates had residues with more positive values of polarity, volume and electrostatic charge.

Conclusions

These findings revealed that even when orthologs perform the same function in each genomic background, their sequences reveal important evolutionary tendencies and differences related to adaptation. This article was reviewed by: Dr. Purificación López-García, Prof. Jeffrey Townsend (nominated by Dr. J. Peter Gogarten), and Ms. Olga Kamneva.  相似文献   

4.
Comparative studies of the proteomes from different organisms have provided valuable information about protein domain distribution in the kingdoms of life. Earlier studies have been limited by the fact that only about 50% of the proteomes could be matched to a domain. Here, we have extended these studies by including less well-defined domain definitions, Pfam-B and clustered domains, MAS, in addition to Pfam-A and SCOP domains. It was found that a significant fraction of these domain families are homologous to Pfam-A or SCOP domains. Further, we show that all regions that do not match a Pfam-A or SCOP domain contain a significantly higher fraction of disordered structure. These unstructured regions may be contained within orphan domains or function as linkers between structured domains. Using several different definitions we have re-estimated the number of multi-domain proteins in different organisms and found that several methods all predict that eukaryotes have approximately 65% multi-domain proteins, while the prokaryotes consist of approximately 40% multi-domain proteins. However, these numbers are strongly dependent on the exact choice of cut-off for domains in unassigned regions. In conclusion, all eukaryotes have similar fractions of multi-domain proteins and disorder, whereas a high fraction of repeating domain is distinguished only in multicellular eukaryotes. This implies a role for repeats in cell-cell contacts while the other two features are important for intracellular functions.  相似文献   

5.
Members of the Toll-like receptor (TLR) family are currently under intense scrutiny for their role in the sampling and recognition of pathogens. It has already been reported that both vaccinia virus and Yersinia spp. express proteins that help them evade the TLR mediated immune response, acting through the Toll-interleukin-1 receptor-resistance (TIR) domain and leucine-rich repeat region of the host TLRs respectively. The TIR domain is involved in the dimerisation of the TLRs and their complexation with their adapter molecules. We tested here the hypothesis that bacteria have the ability to secrete proteins containing similar motifs to the intracellular TIR domains that are involved in the TIR-TIR interaction necessary for the subsequent signal transmission. Based upon their sequence homology, proteins expressing TIRs have been divided into three sub-classes, based around the TLRs, the TLR adapter proteins, and the interleukin-1 and -18 adapter proteins. The highly conserved regions from these separate sub-families were then used to identify similar bacterial proteins. The bacterial proteins identified were then included in an iterative MEME-BLAST process to broaden the search. Tollip, a known TLR antagonist and adapter protein, was included in this investigation although it does not fit into any of the three sub-classes outlined above. If suitable bacterial proteins had been identified, it would signify that certain bacteria had evolved a mechanism to aid them in avoiding detection by the innate immune system acting through the TIR domains. At this stage one has to conclude that there is no evidence currently available suggesting such a mechanism, when using the strategy applied here.  相似文献   

6.
7.
A structural model of the transient complex between the yeast copper chaperone Atx1 and the first soluble domain of the copper transporting ATPase Ccc2 was obtained with HADDOCK, combining NMR chemical shift mapping information with in silico docking. These two proteins are involved in copper trafficking in yeast cells. Calculations were performed starting with the copper ion either bound to Atx1 or to Ccc2 and using the experimental structures of the copper-loaded and apo forms of each protein. The copper binding motifs of the two proteins are found in close proximity. Copper tends to move from Atx1 to Ccc2, consistent with the physiological direction of transfer, with concomitant structural rearrangements, in agreement with experimental observations. The interaction is mainly of an electrostatic nature with hydrogen bonds stabilizing the complex. The structural data are relevant for a number of proteins homologous to Atx1 and Ccc2 and conserved from bacteria to humans.  相似文献   

8.
9.
Robinson NP  Bell SD 《The FEBS journal》2005,272(15):3757-3766
Replication of DNA is essential for the propagation of life. It is somewhat surprising then that, despite the vital nature of this process, cellular organisms show a great deal of variety in the mechanisms that they employ to ensure appropriate genome duplication. This diversity is manifested along classical evolutionary lines, with distinct combinations of replicon architecture and replication proteins being found in the three domains of life: the Bacteria, the Eukarya and the Archaea. Furthermore, although there are mechanistic parallels, even within a given domain of life, the way origins of replication are defined shows remarkable variation.  相似文献   

10.
Discrepancies in phylogenetic trees of bacteria and archaea are often explained as lateral gene transfer events. However, such discrepancies may also be due to phylogenetic artifacts or orthology assignment problems. A first step that may help to resolve this dilemma is to estimate the extent of phylogenetic inconsistencies in trees of prokaryotes in comparison with those of higher eukaryotes, where no lateral gene transfer is expected. To test this, we used 21 proteomes each of eukaryotes (mainly opisthokonts), proteobacteria, and archaea that spanned equivalent levels of genetic divergence. In each domain of life, we defined a set of putative orthologous sequences using a phylogenetic-based orthology protocol and, as a reference topology, we used a tree constructed with concatenated genes of each domain. Our results show, for most of the tests performed, that the magnitude of topological inconsistencies with respect to the reference tree was very similar in the trees of proteobacteria and eukaryotes. When clade support was taken into account, prokaryotes showed some more inconsistencies, but then all values were very low. Discrepancies were only consistently higher in archaea but, as shown by simulation analysis, this is likely due to the particular tree of the archaeal species used here being more difficult to reconstruct, whereas the trees of proteobacteria and eukaryotes were of similar difficulty. Although these results are based on a relatively small number of genes, it seems that phylogenetic reconstruction problems, including orthology assignment problems, have a similar overall effect over prokaryotic and eukaryotic trees based on single genes. Consequently, lateral gene transfer between distant prokaryotic species may have been more rare than previously thought, which opens the way to obtain the tree of life of bacterial and archaeal species using genomic data and the concatenation of adequate genes, in the same way as it is usually done in eukaryotes.  相似文献   

11.
The identification of protein-protein interaction networks has often given important information about the functions of specific proteins and on the cross-talk among metabolic and regulatory pathways. The availability of entire genome sequences has rendered feasible the systematic screening of collections of proteins, often of unknown function, aimed to find the cognate ligands. Once identified by genetic and/or biochemical approaches, the interaction between two proteins should be validated in the physiologic environment. Herein we describe an experimental strategy to screen collections of protein-protein interaction domains to find and validate candidate interactors. The approach is based on the assumption that the overexpression in cultured cells of protein-protein interaction domains, isolated from the context of the whole protein, could titrate the endogenous ligand and, in turn, exert a dominant negative effect. The identification of the ligand could provide us with a tool to check the relevance of the interaction because the contemporary overexpression of the isolated domain and of its ligand could rescue the dominant negative phenotype. We explored this approach by analyzing the possible dominant negative effects on the cell cycle progression of a collection of phosphotyrosine binding (PTB) domains of human proteins. Of 47 PTB domains, we found that the overexpression of 10 of them significantly interfered with the cell cycle progression of NIH3T3 cells. Four of them were used as baits to identify the cognate interactors. Among these proteins, CARM1, interacting with the PTB domain of RabGAP1, and EF1alpha, interacting with RGS12, were able to rescue the block of the cell cycle induced by the isolated PTB domain of the partner protein, thus confirming in vivo the relevance of the interaction. These results suggest that the described approach can be used for the systematic screening of the ligands of various protein-protein interaction domains also by using different biological assays.  相似文献   

12.
Cao Y  Li YD  Li H 《Biophysical journal》2011,(7):1794-1799
Rationally enhancing the mechanical stability of proteins remains a challenge in the field of single molecule force spectroscopy. Here we demonstrate that it is feasible to use a “cocktail” approach for combining more than one approach to enhance significantly the mechanical stability of proteins in an additive fashion. As a proof of principle, we show that metal chelation and protein-protein interaction can be combined to enhance the unfolding force of a protein to ∼450 pN, which is >3 times of its original value. This is also higher than the mechanical stability of most of proteins studied so far. We also extend such a cocktail concept to combine two different metal chelation sites to enhance protein mechanical stability. This approach opens new avenues to efficiently regulating the mechanical properties of proteins, and should be applicable to a wide range of elastomeric proteins.  相似文献   

13.
14.

Background

Nucleotide metabolism is central to all biological systems, due to their essential role in genetic information and energy transfer, which in turn suggests its possible presence in the last common ancestor (LCA) of Bacteria, Archaea and Eukarya. In this context, elucidation of the contribution of the origin and diversification of de novo and salvage pathways of nucleotide metabolism will allow us to understand the links between the enzymatic steps associated with the LCA and the emergence of the first metabolic pathways.

Results

In this work, the taxonomical distribution of the enzymes associated with nucleotide metabolism was evaluated in 1,606 complete genomes. 151 sequence profiles associated with 120 enzymatic reactions were used. The evaluation was based on profile comparisons, using RPS-Blast. Organisms were clustered based on their taxonomical classifications, in order to obtain a normalized measure of the taxonomical distribution of enzymes according to the average of presence/absence of enzymes per genus, which in turn was used for the second step, to calculate the average presence/absence of enzymes per Clade.

Conclusion

From these analyses, it was suggested that divergence at the enzymatic level correlates with environmental changes and related modifications of the cell wall and membranes that took place during cell evolution. Specifically, the divergence of the 5-(carboxyamino) imidazole ribonucleotide mutase to phosphoribosylaminoimidazole carboxylase could be related to the emergence of multicellularity in eukaryotic cells. In addition, segments of salvage and de novo pathways were probably complementary in the LCA to the synthesis of purines and pyrimidines. We also suggest that a large portion of the pathway to inosine 5’-monophosphate (IMP) in purines could have been involved in thiamine synthesis or its derivatives in early stages of cellular evolution, correlating with the fact that these molecules may have played an active role in the protein-RNA world. The analysis presented here provides general observations concerning the adaptation of the enzymatic steps in the early stages of the emergence of life and the LCA.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-800) contains supplementary material, which is available to authorized users.  相似文献   

15.
Dehydrogenases from all three domains of life cleave RNA   总被引:6,自引:0,他引:6  
Specific interactions of glyceraldehyde-3-phosphate dehydrogenase (GAPDH) with RNA have been reported both in vitro and in vivo. We show that eukaryotic and bacterial GAPDH and two proteins from the hyperthermophilic archaeon Sulfolobus solfataricus, which are annotated as dehydrogenases, cleave RNA producing similar degradation patterns. RNA cleavage is most efficient at 60 degrees C, at MgCl(2) concentrations up to 5 mm, and takes place between pyrimidine and adenosine. The RNase active center of the putative aspartate semialdehyde dehydrogenase from S. solfataricus is located within the N-terminal 73 amino acids, which comprise the first mononucleotide-binding site of the predicted Rossmann fold. Thus, RNA cleavage has to be taken into account in the ongoing discussion of the possible biological function of RNA binding by dehydrogenases.  相似文献   

16.
17.
The high-affinity cohesin–dockerin interaction was originally discovered as modular components, which mediate the assembly of the various subunits of the multienzyme cellulosome complex that characterizes some cellulolytic bacteria. Until recently, the presence of cohesins and dockerins within a bacterial proteome was considered a definitive signature of a cellulosome-producing bacterium. Widespread genome sequencing has since revealed a wealth of putative cohesin- and dockerin-containing proteins in Bacteria, Archaea, and in primitive eukaryotes. The newly identified modules appear to serve diverse functions that are clearly distinct from the classical cellulosome archetype, and the vast majority of parent proteins are not predicted glycoside hydrolases. In most cases, only a few such genes have been identified in a given microorganism, which encode proteins containing but a single cohesin and/or dockerin. In some cases, one or the other module appears to be missing from a given species, and in other cases both modules occur within the same protein. This review provides a bioinformatics-based survey of the current status of cohesin- and dockerin-like sequences in species from the Bacteria, Archaea, and Eukarya. Surprisingly, many identified modules and their parent proteins are clearly unrelated to cellulosomes. The cellulosome paradigm may thus be the exception rather than the rule for bacterial, archaeal, and eukaryotic employment of cohesin and dockerin modules.  相似文献   

18.
The question as to the origin and relationship between the three domains of life is lodged in a phylogenetic impasse. The dominant paradigm is to see the three domains as separated. However, the recently characterized bacterial species have suggested continuity between the three domains. Here, we review the evidence in support of this hypothesis and evaluate the implications for and against the models of the origin of the three domains of life. The existence of intermediate steps between the three domains discards the need for fusion to explain eukaryogenesis and suggests that the last universal common ancestor was complex. We propose a scenario in which the ancestor of the current bacterial Planctomycetes, Verrucomicrobiae and Chlamydiae superphylum was related to the last archaeal and eukaryotic common ancestor, thus providing a way out of the phylogenetic impasse.  相似文献   

19.
Food hypersensitivity is constantly increasing in Western societies with a prevalence of about 1-2% in Europe and in the USA. Among children, the incidence is even higher. Because of the introduction of foods derived from genetically modified crops on the marketplace, the scientific community, regulatory bodies and international associations have intensified discussions on risk assessment procedures to identify potential food allergenicity of the newly introduced proteins. In this work, we present a novel biocomputational methodology for the classification of amino acid sequences with regard to food allergenicity and non-allergenicity. This method relies on a computerised learning system trained using selected excerpts of amino acid sequences. One example of such a successful learning system is presented which consists of feature extraction from sequence alignments performed with the FASTA3 algorithm (employing the BLOSUM50 substitution matrix) combined with the k-Nearest-Neighbour (kNN) classification algorithm. Briefly, the two features extracted are the alignment score and the alignment length and the kNN algorithm assigns the pair of extracted features from an unknown sequence to the prevalent class among its k nearest neighbours in the training (prototype) set available. 91 food allergens from several specialised public repositories of food allergy and the SWALL database were identified, pre-processed, and stored, yielding one of the most extensively characterised repositories of allergenic sequences known today. All allergenic sequences were classified using a standard one-leave-out cross validation procedure yielding about 81% correctly classified allergens and the classification of 367 non-allergens in an independent test set resulted in about 98% correct classifications. The biocomputational approach presented should be regarded as a significant extension and refinement of earlier attempts suggested for in silico food safety assessment. Our results show that the framework described here is powerful enough to become useful as part of a multiple-procedure test scheme that also depicts other evaluation approaches such as solid phase immunoassay and tests for stability to digestions.  相似文献   

20.
Cladogenesis, coalescence and the evolution of the three domains of life   总被引:3,自引:0,他引:3  
In this article, we explore the large-scale structure of the tree of life by using a simple model with a constant number of species and rates of speciation that equal the rates of extinction. In addition, we discuss the consequences of horizontal gene transfer for the concept of a most recent common ancestor of all living organisms (cenancestor). A simple null hypothesis based on coalescence theory explains some features of the observed topologies of the tree of life. Simulations of genes and organismal lineages suggest that there was no single common ancestor that contained all the genes ancestral to those shared among the three domains of life. Each contemporary molecule has its own history that traces back to an individual molecular cenancestor. However, these molecular ancestors were likely to be present in different organisms and at different times.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号