首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Inteins are naturally occurring intervening sequences that catalyze a protein splicing reaction resulting in intein excision and concatenation of the flanking polypeptides (exteins) with a native peptide bond. Inteins display a diversity of catalytic mechanisms within a highly conserved fold that is shared with hedgehog autoprocessing proteins. The unusual chemistry of inteins has afforded powerful biotechnology tools for controlling enzyme function upon splicing and allowing peptides of different origins to be coupled in a specific, time-defined manner. The extein sequences immediately flanking the intein affect splicing and can be defined as the intein substrate. Because of the enormous potential complexity of all possible flanking sequences, studying intein substrate specificity has been difficult. Therefore, we developed a genetic selection for splicing-dependent kanamycin resistance with no significant bias when six amino acids that immediately flanked the intein insertion site were randomized. We applied this selection to examine the sequence space of residues flanking the Nostoc punctiforme Npu DnaE intein and found that this intein efficiently splices a much wider range of sequences than previously thought, with little N-extein specificity and only two important C-extein positions. The novel selected extein sequences were sufficient to promote splicing in three unrelated proteins, confirming the generalizable nature of the specificity data and defining new potential insertion sites for any target. Kinetic analysis showed splicing rates with the selected exteins that were as fast or faster than the native extein, refuting past assumptions that the naturally selected flanking extein sequences are optimal for splicing.  相似文献   

2.
Summary SecY is a central component of the export machinery that mediates the translocation of secretory proteins across the plasma membrane of Escherichia coli. We have cloned and sequenced the secY genes from Bacillus licheniformis and Staphylococcus carnosus. The deduced amino acid sequences are highly homologous to those of other known SecY polypeptides, all having the potential to form 10 transmembrane segments. Comparative analysis of 9 SecY polypeptides, derived from different bacteria, revealed that 14 amino acid positions (2,7%) are identical in all SecY proteins and 89 (16.9%) show conservative changes. Clusters of conserved amino acid residues were found in 4 of the 10 transmembrane segments and 2 of the 6 cytoplasmic domains. It is suggested that the conserved regions might be involved in the translocation activity of SecY or might be required for the correct interaction of SecY with other components of the secretion apparatus.  相似文献   

3.
It is well established that, within families of homologous enzymes, amino acid residues that are involved in the chemistry of the reaction are highly conserved. To determine if residues at the subunit interface of oligomeric enzymes with shared active sites are also conserved, comparative analysis of five enzyme families was undertaken. For the chosen enzyme families, sequence data were available for a large number of proteins and a three-dimensional structure was known for at least two members of each family. The analysis indicates that the subunit interface and the hydrophobic core of proteins from all five families have diverged to a similar extent to the overall protein sequences.  相似文献   

4.
IgASE1, a C18-Delta9-polyunsaturated fatty acid-specific fatty acid elongase component from Isochrysis galbana, contains a variant histidine box (his-box) with glutamine replacing the first histidine of the conserved histidine-rich motif present in all other known equivalent proteins. The importance of glutamine and other variant amino acid residues in the his-box of IgASE1 was determined by site-directed mutagenesis. Results showed that all the variation in amino acid sequence between this motif in IgASE1 and the consensus sequences of other elongase components was required for optimum enzyme activity. The substrate specificity was shown to be unaffected by these changes suggesting that components of the his-box are not directly responsible for substrate specificity.  相似文献   

5.
Analysis of the predicted amino acid sequence of Bacillus anthracis adenylyl cyclase revealed sequences with homology to consensus sequences for A- and B-type ATP binding domains found in many ATP binding proteins. Based on the analysis of nucleotide binding proteins, a conserved basic amino acid residue in the A-type consensus sequence and a conserved acidic amino acid residue in the B-type consensus sequence have been implicated in the binding of ATP. The putative ATP binding sequences in the B. anthracis adenylyl cyclase possess analogous lysine residues at positions 346 and 353 within two A-type consensus sequences and a glutamate residue at position 436 within a B-type consensus sequence. The two A-type consensus sequences overlap each other and have the opposite orientation. To determine whether Lys-346, Lys-353, or Glu-436 of the B. anthracis adenylyl cyclase are crucial for enzyme activity, Lys-346 and Lys-353 were replaced with methionine and Glu-436 with glutamine by oligonucleotide-directed mutagenesis. Furthermore, Lys-346 was also replaced with arginine. The genes encoding the wild type and mutant adenylyl cyclases were placed under the control of the lac promoter for expression in Escherichia coli, and extracts were assayed for adenylyl cyclase activity. In all cases, a 90-kDa polypeptide corresponding to the catalytic subunit of the enzyme was detected in E. coli extracts by rabbit polyclonal antibodies raised against the purified B. anthracis adenylyl cyclase. The proteins with the Lys-346 to methionine or arginine mutations exhibited no adenylyl cyclase activity, indicating that Lys-346 in the A-type ATP binding consensus sequence plays a critical role for enzyme catalysis. Furthermore, the enzyme with the Lys-353 to methionine mutation was also inactive, suggesting that Lys-353 may also directly contribute to enzyme catalysis. In contrast, the protein with the Glu-436 to glutamine mutation retained 75% of enzyme activity, suggesting that Glu-436 in the B-type ATP binding consensus sequence may not be directly involved in enzyme catalysis. It is concluded that Lys-346 and Lys-353 in B. anthracis adenylyl cyclase may interact directly with ATP and contribute to the binding of the nucleotide to the enzyme.  相似文献   

6.
In a previous paper we obtained ten (orthogonal) factors, linear combinations of which can express the properties of the 20 naturally occurring amino acids. In this paper, we assume that the most important properties (linear combinations of these ten factors) that determine the three-dimensional structure of a protein are conserved properties, i.e., are those that have been conserved during evolution. Two definitions of a conserved property are presented: (1) a conserved property for an average protein is defined as that linear combination of the ten factors that optimally expresses the similarity of one amino acid to another (hence, little change during evolution), as given by the relatedness odds matrix of Dayhoff et al.; (2) a conserved property for each position in the amino acid sequence (locus) of a specific family of homologous proteins (the cytochromec family or the globin family) is defined as that linear combination of the ten factors that is common among a set of amino acids at a given locus when the sequences are properly aligned. When the specificity at each locus is averaged over all loci, the same features are observed for three expressions of these two definitions, namely the conserved property for an average protein, the average conserved property for the cytochromec family, and the average conserved property for the globin family; we find that bulk and hydrophobicity (information about packing and long-range interactions) are more important than other properties, such as the preference for adopting a specific backbone structure (information about short-range interactions). We also demonstrate that the sequence profile of a conserved property, defined for each locus of a protein family (definition 2), corresponds uniquely to the three-dimensional structure, while the conserved property for an average protein (definition 1) is not useful for the prediction of protein structure. The amino acid sequences of numerous proteins are searched to find those that are similar, in terms of the conserved properties (definition 2), to sequences of the same size from one of the homologous families (cytochromec and globin, respectively) for whose loci the conserved properties were defined. Many similar sequences are found, the number of similarities decreasing with increasing size of the segment. However, the segments must be rather long (15 residues) before the comparisons become meaningful. As an example, one sufficiently large sequence (20 residues) from a protein of known structure (apo-liver alcohol dehydrogenase that is not a member of either family) is found to be similar in the conserved properties to a particular sequence of a member of the family of human hemoglobin chains, and the two sequences have similar structures. This means that, since conserved properties are expected to be structure determinants, we can use the conserved properties to predict an initial protein structure for subsequent energy minimization for a protein for which the conserved properties are similar to those of a family of proteins with a sufficiently large number of homologous amino acid sequences; such a large number of homologous sequences is required to define a conserved property for each locus of the homologous protein family.  相似文献   

7.
The exponential growth of sequence data provides abundant information for the discovery of new enzyme reactions. Correctly annotating the functions of highly diverse proteins can be difficult, however, hindering use of this information. Global analysis of large superfamilies of related proteins is a powerful strategy for understanding the evolution of reactions by identifying catalytic commonalities and differences in reaction and substrate specificity, even when only a few members have been biochemically or structurally characterized. A comparison of >2500 sequences sharing the six-bladed β-propeller fold establishes sequence, structural, and functional links among the three subgroups of the functionally diverse N6P superfamily: the arylesterase-like and senescence marker protein-30/gluconolactonase/luciferin-regenerating enzyme-like (SGL) subgroups, representing enzymes that catalyze lactonase and related hydrolytic reactions, and the so-called strictosidine synthase-like (SSL) subgroup. Metal-coordinating residues were identified as broadly conserved in the active sites of all three subgroups except for a few proteins from the SSL subgroup, which have been experimentally determined to catalyze the quite different strictosidine synthase (SS) reaction, a metal-independent condensation reaction. Despite these differences, comparison of conserved catalytic features of the arylesterase-like and SGL enzymes with the SSs identified similar structural and mechanistic attributes between the hydrolytic reactions catalyzed by the former and the condensation reaction catalyzed by SS. The results also suggest that despite their annotations, the great majority of these >500 SSL sequences do not catalyze the SS reaction; rather, they likely catalyze hydrolytic reactions typical of the other two subgroups instead. This prediction was confirmed experimentally for one of these proteins.  相似文献   

8.
Mitochondrial NADH:ubiquinone oxidoreductase (complex I) is the most complicated enzyme in the respiratory chain and is composed of at least 26 distinct polypeptides. Two hydrophilic subfractions of bovine heart complex I were systematically resolved into individual polypeptides by chromatography. Three polypeptides (51, 24, and 9 kDa) were isolated from the flavoprotein fraction (FP) of complex I, and the complete amino acid sequence of the 9 kDa polypeptide was determined. The 9 kDa polypeptide is composed of 75 amino acids with a molecular weight of 8,437. This protein exhibits no obvious sequence similarity to other proteins. The iron-sulfur protein fraction (IP) of complex I was separated into eight polypeptides, 75, 49, 30, 20, 18, 15, 13 kDa-A, and 13 kDa-B. The 20 kDa polypeptide was recognized as a novel component of IP for the first time. The N-terminal and several peptide sequences of the 20 kDa polypeptide were determined. Comparison of the sequences revealed significant sequence similarities of the 20 kDa polypeptide to the psbG gene products encoded in the chloroplast genome. The conserved sequence in these proteins was also found in the small subunit of the nickel-containing hydrogenases. These results suggest that complex I is related to other redox enzyme complexes.  相似文献   

9.
Amino acid sequence analysis of fungal histidine acid phosphatases displaying phytase activity has revealed a conserved eight-cysteine motif. These conserved amino acids are not directly associated with catalytic function; rather they appear to be essential in the formation of disulfide bridges. Their role is seen as being similar to another eight-cysteine motif recently reported in the amino acid sequence of nearly 500 plant polypeptides. An additional disulfide bridge formed by two cysteines at the N-terminus of all the filamentous ascomycete phytases was also observed. Disulfide bridges are known to increase both stability and heat tolerance in proteins. It is therefore plausible that this extra disulfide bridge contributes to the higher stability found in phytase from some Aspergillus species. To engineer an enhanced phytase for the feed industry, it is imperative that the role of disulfide bridges be taken into cognizance and possibly be increased in number to further elevate stability in this enzyme.  相似文献   

10.
THR1, the gene from Saccharomyces cerevisiae, encoding homoserine kinase, one of the threonine biosynthetic enzymes, has been cloned by complementation. The nucleotide sequence of a 3.1-kb region carrying this gene reveals an open reading frame of 356 codons, corresponding to about 40 kDa for the encoded protein. The presence of three canonical GCN4 regulatory sequences in the upstream flanking region suggests that the expression of THR1 is under the general amino acid control. In parallel, the enzyme was purified by four consecutive column chromatographies, monitoring homoserine kinase activity. In SDS gel electrophoresis, homoserine kinase migrates like a 40-kDa protein; the native enzyme appears to be a homodimer. The sequence of the first 15 NH2-terminal amino acids, as determined by automated Edman degradation, is in accordance with the amino acid sequence deduced from the nucleotide sequence. Computer-assisted comparison of the yeast enzyme with the corresponding activities from bacterial sources showed that several segments among these proteins are highly conserved. Furthermore, the observed homology patterns suggest that the ancestral sequences might have been composed from separate (functional) domains. A block of very similar amino acids is found in the homoserine kinases towards the carboxy terminus that is also present in many other proteins involved in threonine (or serine) metabolism; this motif, therefore, may represent the binding site for the hydroxyamino acids. Limited similarity was detected between a motif conserved among the homoserine kinases and consensus sequences found in other mono- or dinucleotide-binding proteins.  相似文献   

11.
The hydrolases and transferases that constitute the alpha-amylase family are multidomain proteins, but each has a catalytic domain in the form of a (beta/alpha)(8)-barrel, with the active site being at the C-terminal end of the barrel beta-strands. Although the enzymes are believed to share the same catalytic acids and a common mechanism of action, they have been assigned to three separate families - 13, 70 and 77 - in the classification scheme for glycoside hydrolases and transferases that is based on amino acid sequence similarities. Each enzyme has one glutamic acid and two aspartic acid residues necessary for activity, while most enzymes of the family also contain two histidine residues critical for transition state stabilisation. These five residues occur in four short sequences conserved throughout the family, and within such sequences some key amino acid residues are related to enzyme specificity. A table is given showing motifs distinctive for each specificity as extracted from 316 sequences, which should aid in identifying the enzyme from primary structure information. Where appropriate, existing problems with identification of some enzymes of the family are pointed out. For enzymes of known three-dimensional structure, action is discussed in terms of molecular architecture. The sequence-specificity and structure-specificity relationships described may provide useful pointers for rational protein engineering.  相似文献   

12.
R L Robson 《FEBS letters》1984,173(2):394-398
Published amino acid sequences for nitrogenase component polypeptides were compared with those of other proteins which also bind adenine nucleotides. Three sequences which might contribute to an adenine nucleotide-binding domain were found for the Fe-protein component of nitrogenase. The beta-subunit of the MoFe-protein (nifK gene product) contains a sequence which is similar to other proteins which exhibit ATPase activity. No similarities were observed for the alpha-subunit of this component. The findings are discussed in relation to the experimental data on adenine nucleotide binding and the proposed role of ATP in the enzyme mechanism.  相似文献   

13.
8-Oxo-7,8-dihydro-2'-deoxyguanosine 5'-triphosphate (8-oxo-dGTP) is produced during normal cellular metabolism, and incorporation into DNA causes transversion mutation. Organisms possess an enzyme, 8-oxo-dGTPase, which catalyzes the hydrolysis of 8-oxo-dGTP to the corresponding nucleoside monophosphate, thereby preventing the occurrence of mutation. There are highly conserved amino acid sequences in prokaryotic and eukaryotic proteins containing this and related enzyme activities. To elucidate the significance of the conserved sequence, amino acid substitutions were introduced by site- directed mutagenesis of the cloned cDNA for human 8-oxo-dGTPase, and the activity and stability of mutant forms of the enzyme were examined. When lysine-38 was replaced by other amino acids, all of the mutants isolated carried the 8-oxo-dGTPase-negative phenotype. 8-Oxo-dGTPase-positive revertants, isolated from one of the negative mutants, carried the codon for lysine. Using the same procedure, the analysis was extended to other residues within the conserved sequence. At the glutamic acid-43, arginine-51 and glutamic acid-52 sites, all the positive revertants isolated carried codons for amino acids identical to those of the wild type protein. We propose that Lys-38, Glu-43, Arg-51 and Glu-52 residues in the conserved region are essential to exert 8-oxo-dGTPase activity.  相似文献   

14.

Background

Diacylglycerol acyltransferase families (DGATs) catalyze the final and rate-limiting step of triacylglycerol (TAG) biosynthesis in eukaryotic organisms. Understanding the roles of DGATs will help to create transgenic plants with value-added properties and provide clues for therapeutic intervention for obesity and related diseases. The objective of this analysis was to identify conserved sequence motifs and amino acid residues for better understanding of the structure-function relationship of these important enzymes.

Results

117 DGAT sequences from 70 organisms including plants, animals, fungi and human are obtained from database search using tung tree DGATs. Phylogenetic analysis separates these proteins into DGAT1 and DGAT2 subfamilies. These DGATs are integral membrane proteins with more than 40% of the total amino acid residues being hydrophobic. They have similar properties and amino acid composition except that DGAT1s are approximately 20 kDa larger than DGAT2s. DGAT1s and DGAT2s have 41 and 16 completely conserved amino acid residues, respectively, although only two of them are shared by all DGATs. These residues are distributed in 7 and 6 sequence blocks for DGAT1s and DGAT2s, respectively, and located at the carboxyl termini, suggesting the location of the catalytic domains. These conserved sequence blocks do not contain the putative neutral lipid-binding domain, mitochondrial targeting signal, or ER retrieval motif. The importance of conserved residues has been demonstrated by site-directed and natural mutants.

Conclusions

This study has identified conserved sequence motifs and amino acid residues in all 117 DGATs and the two subfamilies. None of the completely conserved residues in DGAT1s and DGAT2s is present in recently reported isoforms in the multiple sequences alignment, raising an important question how proteins with completely different amino acid sequences could perform the same biochemical reaction. The sequence analysis should facilitate studying the structure-function relationship of DGATs with the ultimate goal to identify critical amino acid residues for engineering superb enzymes in metabolic engineering and selecting enzyme inhibitors in therapeutic application for obesity and related diseases.  相似文献   

15.
Cloned DNA copies of rotavirus genomic segment 6 from simian 11 (subgroup 1) and human strain Wa (subgroup 2) rotaviruses have been used to determine the nucleotide sequences of the gene that determines viral subgroup specificity. Both genomic segments are 1,356 nucleotides in length and possess 5'- and 3'-terminal untranslated regions of 23 and 142 nucleotides, respectively. The inferred amino acid sequence reveals VP6 to be a polypeptide of 397 amino acids in which more than 90% of the amino acid sequence is conserved between the two viruses. There are 34 amino acid changes between the subgroup 1 and 2 polypeptides, most clustered in three regions of the molecule at residues 39 through 62, 80 through 122, and 281 through 315.  相似文献   

16.
17.
Antirestriction proteins Ard encoded by some self-transmissible plasmids specifically inhibit restriction by members of all three families of type I restriction-modification (R-M) systems in E.coli. Recently, we have identified the amino acid region, 'antirestriction' domain, that is conserved within different plasmid and phage T7-encoded antirestriction proteins and may be involved in interaction with the type I R-M systems. In this paper we demonstrate that this amino acid sequence shares considerable similarity with a well-known conserved sequence (the Argos repeat) found in the DNA sequence specificity (S) polypeptides of type I systems. We suggest that the presence of these similar motifs in restriction and antirestriction proteins may give a structural basis for their interaction and that the antirestriction action of Ard proteins may be a result of the competition between the 'antirestriction' domains of Ard proteins and the similar conserved domains of the S subunits that are believed to play a role in the subunit assembly of type I R-M systems.  相似文献   

18.
19.
Amino acid sequences of proteinaceous proteinase inhibitors have been extensively analysed for deriving information regarding the molecular evolution and functional relationship of these proteins. These sequences have been grouped into several well defined families. It was found that the phylogeny constructed with the sequences corresponding to the exposed loop responsible for inhibition has several branches that resemble those obtained from comparisons using the entire sequence. The major branches of the unrooted tree corresponded to the families to which the inhibitors belonged. Further branching is related to the enzyme specificity of the inhibitor. Examination of the active site loop sequences of trypsin inhibitors revealed that here are strong preferences for specific amino acids at different positions of the loop. These preferences are inhibitor class specific. Inhibitors active against more than one enzyme occur within a class and confirm to class specific sequence in their loops. Hence, only a few positions in the loop seem to determine the specificity. The ability to inhibit the same enzyme by inhibitors that belong to different classes appears to be a result of convergent evolution.  相似文献   

20.
The complete nucleotide sequences of the fomA genes encoding the 40-kDa outer membrane proteins (OMPs) of strains ATCC 10953 and ATCC 25586 of Fusobacterium nucleatum were determined using the genomic DNA, or DNA fragments ligated into a vector plasmid, as template in a polymerase chain reaction. The deduced amino acid sequences of these two proteins were aligned with the amino acid sequence of the corresponding protein of F. nucleatum strain Fev1 and examined for conserved/variable polypeptide segments. A model for the topology of the 40-kDa OMPs is proposed on the basis of this alignment and application of the structural principles derived for OMPs of Escherichia coli. According to this model, sixteen polypeptide segments, which are highly conserved, traverse the outer membrane, thereby creating eight external loops, most of which are highly variable.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号