首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 484 毫秒
1.
Twenty-seven protein sequence elements, six to nine amino acids long, were extracted from 15 phylogenetically diverse complete prokaryotic proteomes. The elements are present in all of these proteomes, with at least one copy each (omnipresent elements), and have presumably been conserved since the last universal common ancestor (LUCA). All these omnipresent elements are identified in crystallized protein structures as parts of highly conserved closed loops, 25–30 residues long, thus representing the closed-loop modules discovered in 2000 by Berezovsky et al. The omnipresent peptides make up seven distinct groups, of which the largest groups, Aleph and Beth, contain 18 and four elements, respectively, which are related but different, while five other groups are represented by only one element each. The LUCA modules appear with one or several copies per protein molecule in a variety of combinations depending on the functional identity of the corresponding protein. The functional involvement of individual LUCA modules is outlined on the basis of known protein annotations. Analyses of all the related sequences in a large, formatted protein sequence space suggest that many, if not all, of the 27 omnipresent elements have a common sequence origin. This sequence space network analysis may lead to elucidation of the earliest stages of protein evolution.  相似文献   

2.
Two previously undetected domains were identified in a variety of RNA-binding proteins, particularly RNA-modifying enzymes, using methods for sequence profile analysis. A small domain consisting of 60–65 amino acid residues was detected in the ribosomal protein S4, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yeast protein containing a pseudouridine synthetase and a deaminase domain, bacterial tyrosyl-tRNA synthetases, and a number of uncharacterized, small proteins that may be involved in translation regulation. Another novel domain, designated PUA domain, after PseudoUridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modification, a family of predicted archaeal and bacterial rRNA methylases. Additionally, the PUA domain was detected in a family of eukaryotic proteins that also contain a domain homologous to the translation initiation factor eIF1/SUI1; these proteins may comprise a novel type of translation factors. Unexpectedly, the PUA domain was detected also in bacterial and yeast glutamate kinases; this is compatible with the demonstrated role of these enzymes in the regulation of the expression of other genes. We propose that the S4 domain and the PUA domain bind RNA molecules with complex folded structures, adding to the growing collection of nucleic acid-binding domains associated with DNA and RNA modification enzymes. The evolution of the translation machinery components containing the S4, PUA, and SUI1 domains must have included several events of lateral gene transfer and gene loss as well as lineage-specific domain fusions. Received: 15 May 1998 / Accepted: 20 July 1998  相似文献   

3.
Classification and evolution of P-loop GTPases and related ATPases   总被引:1,自引:0,他引:1  
Sequences and available structures were compared for all the widely distributed representatives of the P-loop GTPases and GTPase-related proteins with the aim of constructing an evolutionary classification for this superclass of proteins and reconstructing the principal events in their evolution. The GTPase superclass can be divided into two large classes, each of which has a unique set of sequence and structural signatures (synapomorphies). The first class, designated TRAFAC (after translation factors) includes enzymes involved in translation (initiation, elongation, and release factors), signal transduction (in particular, the extended Ras-like family), cell motility, and intracellular transport. The second class, designated SIMIBI (after signal recognition particle, MinD, and BioD), consists of signal recognition particle (SRP) GTPases, the assemblage of MinD-like ATPases, which are involved in protein localization, chromosome partitioning, and membrane transport, and a group of metabolic enzymes with kinase or related phosphate transferase activity. These two classes together contain over 20 distinct families that are further subdivided into 57 subfamilies (ancient lineages) on the basis of conserved sequence motifs, shared structural features, and domain architectures. Ten subfamilies show a universal phyletic distribution compatible with presence in the last universal common ancestor of the extant life forms (LUCA). These include four translation factors, two OBG-like GTPases, the YawG/YlqF-like GTPases (these two subfamilies also consist of predicted translation factors), the two signal-recognition-associated GTPases, and the MRP subfamily of MinD-like ATPases. The distribution of nucleotide specificity among the proteins of the GTPase superclass indicates that the common ancestor of the entire superclass was a GTPase and that a secondary switch to ATPase activity has occurred on several independent occasions during evolution. The functions of most GTPases that are traceable to LUCA are associated with translation. However, in contrast to other superclasses of P-loop NTPases (RecA-F1/F0, AAA+, helicases, ABC), GTPases do not participate in NTP-dependent nucleic acid unwinding and reorganizing activities. Hence, we hypothesize that the ancestral GTPase was an enzyme with a generic regulatory role in translation, with subsequent diversification resulting in acquisition of diverse functions in transport, protein trafficking, and signaling. In addition to the classification of previously known families of GTPases and related ATPases, we introduce several previously undetected families and describe new functional predictions.  相似文献   

4.
5.
6.
Sequence profile searches were used to identify an ancient domain in ThiI-like thiouridine synthases, conserved RNA methylases, archaeal pseudouridine synthases and several uncharacterized proteins. We predict that this domain is an RNA-binding domain that adopts an alpha/beta fold similar to that found in the C-terminal domain of translation initiation factor 3 and ribosomal protein S8.  相似文献   

7.
Universal scale of the sequence conservation has been recently introduced based on omnipresence of the protein sequence motifs across species. A large spectrum of short sequences, up to eight residues has been found to reside in all or almost all prokaryotic organisms. By this discovery a principally novel quantitative approach is introduced to the problem of reconstruction of the last universal common ancestor (LUCA). The most conserved elements (protein modules) with defined structures and sequences harboring the omnipresent motifs are outlined in this work, by combining the sequence and protein crystal structure data. The structurally conserved modules involve 25–30 amino acid residues and have appearance of closed loops, loop-n-lock structures. This confirms earlier conclusions on the loop-fold structure of globular proteins. Many of the topmost conserved modules represent the primary closed loop prototypes, that have been derived by whole genome sequence searches. The data presented, thus, make a basis for further developments toward the earliest stages of protein evolution. [Reviewing Editor: Dr. Martin Kreitman]  相似文献   

8.
Influenza virus polymerase complex is a heterotrimer consisting of polymerase basic protein 1 (PB1), polymerase basic protein 2 (PB2), and polymerase acidic protein (PA). Of these, only PB1, which has been implicated in RNA chain elongation, possesses the four conserved motifs (motifs I, II, III, and IV) and the four invariant amino acids (one in each motif) found among all viral RNA-dependent RNA or RNA-dependent DNA polymerases. We have modified an assay system developed by Huang et al. (T.-J. Huang, P. Palese, and M. Krystal, J. Virol. 64:5669-5673, 1990) to reconstitute the functional polymerase activity in vivo. Using this assay, we have examined the requirement of each of these motifs of PB1 in polymerase activity. We find that each of these invariant amino acids is critical for PB1 activity and that mutation in any one of these residues renders the protein nonfunctional. We also find that in motif III, which contains the SSDD sequence, the signature sequence of influenza virus RNA polymerase, SDD is essentially invariant and cannot accommodate sequences found in other RNA viral polymerases. However, conserved changes in the flanking sequences of SDD can be partially tolerated. These results provide the experimental evidence that influenza virus PB1 possesses a similar polymerase module as has been proposed for other RNA viruses and that the core SDD sequence of influenza virus PB1 represents a sequence variant of the GDN in negative-stranded nonsegmented RNA viruses, GDD in positive-stranded RNA virus and double-stranded RNA viruses, or MDD in retroviruses.  相似文献   

9.
Translation initiation: structures, mechanisms and evolution   总被引:1,自引:0,他引:1  
Translation, the process of mRNA-encoded protein synthesis, requires a complex apparatus, composed of the ribosome, tRNAs and additional protein factors, including aminoacyl tRNA synthetases. The ribosome provides the platform for proper assembly of mRNA, tRNAs and protein factors and carries the peptidyl-transferase activity. It consists of small and large subunits. The ribosomes are ribonucleoprotein particles with a ribosomal RNA core, to which multiple ribosomal proteins are bound. The sequence and structure of ribosomal RNAs, tRNAs, some of the ribosomal proteins and some of the additional protein factors are conserved in all kingdoms, underlying the common origin of the translation apparatus. Translation can be subdivided into several steps: initiation, elongation, termination and recycling. Of these, initiation is the most complex and the most divergent among the different kingdoms of life. A great amount of new structural, biochemical and genetic information on translation initiation has been accumulated in recent years, which led to the realization that initiation also shows a great degree of conservation throughout evolution. In this review, we summarize the available structural and functional data on translation initiation in the context of evolution, drawing parallels between eubacteria, archaea, and eukaryotes. We will start with an overview of the ribosome structure and of translation in general, placing emphasis on factors and processes with relevance to initiation. The major steps in initiation and the factors involved will be described, followed by discussion of the structure and function of the individual initiation factors throughout evolution. We will conclude with a summary of the available information on the kinetic and thermodynamic aspects of translation initiation.  相似文献   

10.
Evolutionary conservation of reactions in translation.   总被引:1,自引:0,他引:1  
Current X-ray diffraction and cryoelectron microscopic data of ribosomes of eubacteria have shed considerable light on the molecular mechanisms of translation. Structural studies of the protein factors that activate ribosomes also point to many common features in the primary sequence and tertiary structure of these proteins. The reconstitution of the complex apparatus of translation has also revealed new information important to the mechanisms. Surprisingly, the latter approach has uncovered a number of proteins whose sequence and/or structure and function are conserved in all cells, indicating that the mechanisms are indeed conserved. The possible mechanisms of a new initiation factor and two elongation factors are discussed in this context.  相似文献   

11.
Evolution of the triplet code is reconstructed on the basis of consensus temporal order of appearance of amino acids. Several important predictions are confirmed by computational sequence analyses. The earliest amino acids, alanine and glycine, have been encoded by GCC and GGC codons, as today. They were succeeded, respectively, by A- and G-series of amino acids, encoded by pyrimidine-central and purine-central codons. The length of the earliest proteins is estimated to be 6–7 residues. The earliest mRNAs were short G+C-rich molecules. These short sequences could have formed hairpins. This is confirmed by analysis of modern prokaryotic mRNA sequences. Predominant size of detected ancient hairpins also corresponds to 6–7 amino acids, as above. Vestiges of last common ancestor can be found in extant proteins in form of entirely conserved short sequences of size six to nine residues present in all or almost all sequenced prokaryotic proteomes (omnipresent motifs). The functions of the topmost conserved octamers are not involved in the basic elementary syntheses. This suggests an initial abiotic supply of amino acids, bases and sugars. Presented at: National Workshop on Astrobiology: Search for Life in the Solar System, Capri, Italy, 26 to 28 October, 2005.  相似文献   

12.
Evolutionary Conservation of Reactions in Translation   总被引:3,自引:0,他引:3       下载免费PDF全文
Current X-ray diffraction and cryoelectron microscopic data of ribosomes of eubacteria have shed considerable light on the molecular mechanisms of translation. Structural studies of the protein factors that activate ribosomes also point to many common features in the primary sequence and tertiary structure of these proteins. The reconstitution of the complex apparatus of translation has also revealed new information important to the mechanisms. Surprisingly, the latter approach has uncovered a number of proteins whose sequence and/or structure and function are conserved in all cells, indicating that the mechanisms are indeed conserved. The possible mechanisms of a new initiation factor and two elongation factors are discussed in this context.  相似文献   

13.
14.
15.
16.
17.
18.
The gene encoding DNA polymerase alpha from Plasmodium falciparum.   总被引:2,自引:1,他引:1       下载免费PDF全文
The gene encoding DNA polymerase alpha from the human malaria parasite Plasmodium falciparum has been sequenced and characterised. The deduced amino acid sequence possesses the seven sequence motifs which characterise eukaryotic replicative DNA polymerases (I-VII) and four of five motifs (A-E) identified in alpha DNA polymerases. The predicted protein also contains sequences which are reminiscent of Plasmodium proteins but absent from other DNA polymerases. These include four blocks of additional amino acids interspersed with the conserved motifs of the DNA polymerases, four asparagine rich sequences and a novel carboxy-terminal extension. Repetitive sequences similar to those found in other malarial proteins are also present. cDNA-directed PCR was used to establish the presence of these features in the approximately 7kb mRNA. The coding sequence contains a single intron. The gene for DNAPol alpha is located on chromosome 4 and is transcribed in both asexual and sexual erythrocytic stages of the parasite.  相似文献   

19.
The origin of life has puzzled molecular scientists for over half a century. Yet fundamental questions remain unanswered, including which came first, the metabolic machinery or the encoding nucleic acids. In this study we take a protein-centric view and explore the ancestral origins of proteins. Protein domain structures in proteomes are highly conserved and embody molecular functions and interactions that are needed for cellular and organismal processes. Here we use domain structure to study the evolution of molecular function in the protein world. Timelines describing the age and function of protein domains at fold, fold superfamily, and fold family levels of structural complexity were derived from a structural phylogenomic census in hundreds of fully sequenced genomes. These timelines unfold congruent hourglass patterns in rates of appearance of domain structures and functions, functional diversity, and hierarchical complexity, and revealed a gradual build up of protein repertoires associated with metabolism, translation and DNA, in that order. The most ancient domain architectures were hydrolase enzymes and the first translation domains had catalytic functions for the aminoacylation and the molecular switch-driven transport of RNA. Remarkably, the most ancient domains had metabolic roles, did not interact with RNA, and preceded the gradual build-up of translation. In fact, the first translation domains had also a metabolic origin and were only later followed by specialized translation machinery. Our results explain how the generation of structure in the protein world and the concurrent crystallization of translation and diversified cellular life created further opportunities for proteomic diversification.  相似文献   

20.
In many gamma-proteobacteria, the conserved GacS/GacA (BarA/UvrY) two-component system positively controls the expression of one to five genes specifying small RNAs (sRNAs) that are characterized by repeated unpaired GGA motifs but otherwise appear to belong to several independent families. The GGA motifs are essential for binding small, dimeric RNA-binding proteins of a single conserved family designated RsmA (CsrA). These proteins, which also occur in bacterial species outside the gamma-proteobacteria, act as translational repressors of certain mRNAs when these contain an RsmA/CsrA binding site at or near the Shine-Dalgarno sequence plus additional binding sites located in the 5' untranslated leader mRNA. Recent structural data have established that the RsmA-like protein RsmE of Pseudomonas fluorescens makes specific contacts with an RNA consensus sequence 5'-(A)/(U)CANGGANG(U)/(A)-3' (where N is any nucleotide). Interaction with an RsmA/CsrA protein promotes the formation of a short stem supporting an ANGGAN loop. This conformation hinders access of 30S ribosomal subunits and hence translation initiation. The output of the Gac/Rsm cascade varies widely in different bacterial species and typically involves management of carbon storage and expression of virulence or biocontrol factors. Unidentified signal molecules co-ordinate the activity of the Gac/Rsm cascade in a cell population density-dependent manner.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号