Similar articles
Found 20 similar articles (search time: 31 ms)
1.
The concept of consensus in multiple sequence alignments (MSAs) has been used to design and engineer proteins previously with some success. However, consensus design implicitly assumes that all amino acid positions function independently, whereas in reality, the amino acids in a protein interact with each other and work cooperatively to produce the optimum structure required for its function. Correlation analysis is a tool that can capture the effect of such interactions. In a previously published study, we made consensus variants of the triosephosphate isomerase (TIM) protein using MSAs that included sequences from both prokaryotic and eukaryotic organisms. These variants were not completely native-like and were also surprisingly different from each other in terms of oligomeric state, structural dynamics, and activity. Extensive correlation analysis of the TIM database has revealed some clues about factors leading to the unusual behavior of the previously constructed consensus proteins. Among other things, we have found that the more ill-behaved consensus mutant had more broken correlations than the better-behaved consensus variant. Moreover, we report three correlation and phylogeny-based consensus variants of TIM. These variants were more native-like than the previous consensus mutants and considerably more stable than a wild-type TIM from a mesophilic organism. This study highlights the importance of choosing the appropriate diversity of MSA for consensus analysis and provides information that can be used to engineer stable enzymes.

2.
A set of combinatorial amphipathic helical peptides referred to as the KIA series has been screened to identify native-like helical bundles. The series contains the following consensus sequence: AKAxAAxxKAxAAxxKAGGY, where "x" positions are occupied by either Ala or Ile. The peptide sequences in the series comprise all possible combinations of four Ile residues occupying the six x positions. In each case, Ala occupied the two x positions not occupied by Ile. There are a total of 15 peptides in the KIA series; all of the peptides differ in the number of ridges and grooves formed by the Ile side chains, and two of the KIA peptides possess a canonical knobs-into-holes heptad repeat. The structure and stability of these 15 peptides and their pairwise mixtures were evaluated. One peptide in the series formed a stable four-helix bundle that folded with cooperativity similar to native proteins. Ten peptides assembled into molten globular helical assemblies, two peptides were unstructured, and two peptides assembled into helical filaments that were several micrometers long. One of the helical filament-forming peptides could be diverted from forming filaments by the addition of another KIA peptide, which resulted in the formation of a heteromeric six-helix bundle. This study demonstrates that combinatorial peptides composed of only three types of amino acids can form a diverse array of structures, some of which are native-like.
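The count of 15 peptides follows from choosing 4 of the 6 "x" positions for Ile (6 choose 4 = 15). A minimal sketch of that enumeration is given here; the function name `kia_series` and its parameters are illustrative assumptions, and only the template string comes from the abstract.

```python
from itertools import combinations

# Consensus template quoted in the abstract; "x" marks the variable positions.
TEMPLATE = "AKAxAAxxKAxAAxxKAGGY"


def kia_series(template: str = TEMPLATE, n_ile: int = 4) -> list[str]:
    """Enumerate every peptide with n_ile Ile residues placed on the x positions.

    The x positions not chosen for Ile are filled with Ala, as the abstract states.
    """
    x_positions = [i for i, ch in enumerate(template) if ch == "x"]
    peptides = []
    for ile_sites in combinations(x_positions, n_ile):
        residues = list(template)
        for i in x_positions:
            residues[i] = "I" if i in ile_sites else "A"
        peptides.append("".join(residues))
    return peptides


if __name__ == "__main__":
    for peptide in kia_series():
        print(peptide)
```

Every sequence produced this way uses only Ala, Ile, Lys, Gly, and Tyr, with the three residue types Lys, Ile, and Ala making up the helical region.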

3.
The 3D structural comparison of families of divergent homologous domains revealed two main populations of hydrophobic amino acids, one with a low and the other with a significantly higher mean solvent accessibility, allowing two regions of the core of protein globular domains to be distinguished. The side chains of hydrophobic amino acids in topologically conserved positions (positions in the structural alignment where only hydrophobic amino acids are found), which we call topohydrophobic positions, are considerably less dispersed than those of the other amino acids (hydrophobic or not). Mean distances between gravity centers of amino acids in topohydrophobic positions are significantly shorter than those for non-topohydrophobic positions and show that the corresponding amino acids are almost all in direct contact in the inner core of globular domains. This study also showed that the small number of topohydrophobic positions is a characteristic of the structural differences between proteins of a family. This criterion is independent of the sequence identity between the sequences and of the root-mean-square distance between their corresponding structures. Using sensitive sequence alignment processes it will be possible, for many protein families, to identify topohydrophobic positions from sequences only. Proteins 33:329–342, 1998. © 1998 Wiley-Liss, Inc.

4.
The consensus concept for thermostability engineering of proteins (total citations: 16; self-citations: 0; citations by others: 16)
Previously, sequence comparisons between a mesophilic enzyme and a more thermostable homologue were shown to be a feasible approach to successfully predict thermostabilizing amino acid substitutions. The 'consensus approach' described in the present paper shows that even a set of amino acid sequences of homologous, mesophilic enzymes contains sufficient information to allow rapid design of a thermostabilized, fully functional variant of this family of enzymes. A sequence alignment of homologous fungal phytases was used to calculate a consensus phytase amino acid sequence. Upon construction of the synthetic gene, recombinant expression and purification, the first phytase obtained, termed consensus phytase-1, displayed an unfolding temperature (Tm) of 78.0 °C, which is 15-22 °C higher than the Tm values of all parent phytases used in its design. Refinement of the approach, combined with site-directed mutagenesis experiments, yielded optimized consensus phytases with Tm values of up to 90.4 °C. These increases in Tm are due to the combination of multiple amino acid exchanges which are distributed over the entire sequence of the protein and mainly affect surface-exposed residues; each individual substitution has only a rather small thermostabilizing effect. Remarkably, in spite of the pronounced increase in thermostability, catalytic activity at 37 °C is not compromised. Thus, the design of consensus proteins is a potentially powerful and novel alternative to directed evolution and to a series of rational approaches for thermostability engineering of enzymes and other proteins.
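The core step of the consensus approach, calculating a consensus sequence from an alignment, can be sketched as a column-wise majority vote. This is only an illustration under stated assumptions: the abstract does not describe the authors' actual procedure, the toy alignment is invented, and the alphabetical tie-break is an arbitrary choice made here to keep the result deterministic.

```python
from collections import Counter


def consensus(alignment: list[str], gap: str = "-") -> str:
    """Return the most frequent residue in each column of an aligned sequence set.

    Gaps are ignored when counting; a column made only of gaps stays a gap.
    Ties are broken alphabetically (an arbitrary, illustrative rule).
    """
    if len({len(seq) for seq in alignment}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    result = []
    for column in zip(*alignment):
        counts = Counter(res for res in column if res != gap)
        if not counts:
            result.append(gap)
            continue
        top = max(counts.values())
        result.append(min(res for res, n in counts.items() if n == top))
    return "".join(result)


if __name__ == "__main__":
    # Hypothetical toy alignment, not taken from the phytase study.
    toy_alignment = ["MKTA", "MRTA", "MKSA"]
    print(consensus(toy_alignment))
```

A simple vote like this treats every column independently, which is the assumption that correlation-based refinements of consensus design try to relax.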

5.
Nilsson MT, Widersten M. Biochemistry, 2004, 43(38): 12038-12047
A single-chain derivative of the lambda Cro repressor (scCro) has been randomly mutated in amino acid residues critical for specific DNA recognition to create libraries of protein variants. Utilizing phage display-afforded affinity selection, scCro variants have been isolated for binding to synthetic DNA ligands. Isolated scCro variants were analyzed functionally, both in fusion with phage particles and after expression of the corresponding free proteins. The binding properties with regard to specificity and affinity in binding to different DNA ligands were investigated by inhibition studies and determination of equilibrium dissociation constants for formed complexes. Variant proteins with altered DNA-sequence specificity were identified, which favored binding of targeted synthetic DNA sequences over a consensus operator sequence, bound with high affinity by wild-type Cro. The specificities were relatively modest (2-3-fold, as calculated from KD values), which can be attributed to the inherent properties in the design of the selection system; one half-site of the synthetic DNA sequences maintains the consensus operator sequence, and one "subunit" of the variant single-chain Cro dimers was conserved as wild-type sequence. The anticipated interaction between the wild-type subunit and the consensus DNA half-site of target DNA ligands is, hence, expected to contribute to the overlap in sequence discrimination. The binding affinity for the synthetic DNA sequences, however, was improved 10-30-fold in selected variant proteins as compared to "wild-type" scCro.

6.
M Turmel, C Otis, V Côté, C Lemieux. Nucleic Acids Research, 1997, 25(13): 2610-2619
Two approaches were used to discern critical amino acid residues for the function of the I-CeuI homing endonuclease: sequence comparison of subfamilies of homologous proteins and genetic selection. The first approach revealed residues potentially involved in catalysis and DNA recognition. Because I-CeuI is lethal in Escherichia coli, enzyme variants not perturbing cell viability were readily selected from an expression library. A collection of 49 variants with single amino acid substitutions at 37 positions was assembled. Most of these positions are clustered within or around the LAGLI-DADG dodecapeptide and the TQH sequence, two motifs found in all protein subfamilies examined. The Km and kcat values of the wild-type and nine variant enzymes synthesized in vitro were determined. Three variants, including one showing a substitution of the glutamine residue in the TQH motif, revealed no detectable endonuclease activity; five others showed reduced activity compared to the wild-type enzyme; whereas the remaining variant cleaved the top strand about three times more efficiently than the wild-type. Our results not only confirm recent reports indicating that amino acids in the LAGLI-DADG dodecapeptide are functionally critical, but they also suggest that some residues outside this motif directly participate in catalysis.

7.
Tanaka J, Yanagawa H, Doi N. PLoS ONE, 2011, 6(3): e18034
Although modern proteins consist of 20 different amino acids, it has been proposed that primordial proteins consisted of a small set of amino acids, and additional amino acids have gradually been recruited into the genetic code. This hypothesis has recently been supported by comparative genome sequence analysis, but no direct experimental approach has been reported. Here, we utilized a novel experimental approach to test the hypothesis that native-like globular proteins might be more easily simplified, with retention of their structure and function, by a set of putative primitive amino acids than by a set of putative new amino acids. We performed in vitro selection of a functional SH3 domain as a model from partially randomized libraries with different sets of amino acids using mRNA display. Consequently, a library rich in putative primitive amino acids included a larger number of functional SH3 sequences than a library rich in putative new amino acids. Further, the functional SH3 sequences were enriched from the primitive library slightly earlier than from a randomized library with the full set of amino acids, while the function and structure of the selected SH3 proteins with the primitive alphabet were comparable with those from the 20 amino acid alphabet. Application of this approach to various combinations of codons in protein sequences may be useful not only for clarifying the precise order of the amino acid expansion in the early stages of protein evolution but also for efficiently creating novel functional proteins in the laboratory.

8.
Loop 8 (residues 232-242) in triosephosphate isomerase (TIM) is a highly conserved loop that forms a tight binding pocket for the phosphate moiety of the substrate. Its sequence includes the fully conserved, solvent-exposed Leu238. The tight phosphate-binding pocket explains the high substrate specificity of TIM, which is limited to the in vivo substrates dihydroxyacetone-phosphate and D-glyceraldehyde-3-phosphate. Here we use the monomeric variant of trypanosomal TIM for exploring the structural consequences of shortening this loop. The mutagenesis, guided by extensive modeling calculations and followed up by crystallographic characterization, is aimed at widening the phosphate-binding pocket and, consequently, changing the substrate specificity. Two new variants were characterized. The crystal structures of these variants indicate that in monomeric forms of TIM, the Leu238 side-chain is nicely buried in a hydrophobic cluster. Monomeric forms of wild-type dimeric TIM are known to exist transiently as folding intermediates; our structural analysis suggests that in this monomeric form, Leu238 of loop 8 also adopts this completely buried conformation, which explains its full conservation across evolution. The much wider phosphate-binding pocket of the new variant allows for the development of a new TIM variant with a different substrate specificity.

9.
Proteins unfolded by high concentrations of chemical denaturants adopt expanded, largely structure-free ensembles of conformations that are well approximated as random coils. In contrast, globular proteins unfolded under less denaturing conditions (via mutations, or transiently unfolded after a rapid jump to native conditions) and molten globules (arising due to mutations or cosolvents) are often compact. Here we explore the origins of this compaction using a truncated equilibrium-unfolded variant of the 57-residue FynSH3 domain. As monitored by far-UV circular dichroism, NMR spectroscopy, and hydrogen-exchange kinetics, CΔ4 (a 4-residue carboxy-terminal deletion variant of FynSH3) appears to be largely unfolded even in the absence of denaturant. Nevertheless, CΔ4 is quite compact under these conditions, with a hydrodynamic radius only slightly larger than that of the native protein. In order to understand the origins of this molten-globule-like compaction, we have characterized a random sequence polypeptide of identical amino acid composition to CΔ4. Notably, we find that the hydrodynamic radius of this random sequence polypeptide also approaches that of the native protein. Thus, while native-like interactions may contribute to the formation of compact “unfolded” states, it appears that non-sequence-specific monomer-monomer interactions can also account for the dramatic compaction observed for molten globules and the “physiological” unfolded state.

10.
Binary patterning of polar and nonpolar amino acids has been used as the key design feature for constructing large combinatorial libraries of de novo proteins. Each position in a binary patterned sequence is designed explicitly to be either polar or nonpolar; however, the precise identities of these amino acids are varied extensively. The combinatorial underpinnings of the "binary code" strategy preclude explicit design of particular side chains at specified positions. Therefore, packing interactions cannot be specified a priori. To assess whether the binary code strategy can nonetheless produce well-folded de novo proteins, we constructed a second-generation library based upon a new structural scaffold designed to fold into 102-residue four-helix bundles. Characterization of five proteins chosen arbitrarily from this new library revealed that (1) all are alpha-helical and quite stable; (2) four of the five contain an abundance of tertiary interactions indicative of well-ordered structures; and (3) one protein forms a well-folded structure with native-like features. The proteins from this new 102-residue library are substantially more stable and dramatically more native-like than those from an earlier binary patterned library of 74-residue sequences. These findings demonstrate that chain length is a crucial determinant of structural order in libraries of de novo four-helix bundles. Moreover, these results show that the binary code strategy, if applied to an appropriately designed structural scaffold, can generate large collections of stably folded and/or native-like proteins.

11.
The sequences of four-alpha-helical bundle proteins are characterized by a pattern of hydrophilic and hydrophobic amino acids which is repeated every seven residues. At each position of the heptad repeat there are specific constraints on the amino acid properties which result from the topology of the tertiary motif. These constraints give rise to patterns of amino acid distribution which are distinct from those of other proteins. The distributions in each of the heptad positions have been determined by a statistical analysis of structural and sequence data derived from seven families of aligned protein sequences. The constitution of each position is dominated by a very small number of different amino acids, with the core positions consisting overwhelmingly of Leu and Ala. The positional preferences of the individual amino acids can be generally interpreted in terms of residue properties and topological constraints. The potential for four-alpha-helix bundle folding is reflected primarily in the pattern of residue occurrence in the heptad and not in the overall amino acid composition of the protein. Possible applications of this analysis in structure predictions, sequence alignments and in the rational design and engineering of four-alpha-helical bundle proteins are discussed.

12.
The modification of proteins by SUMO (small ubiquitin-like modifier) regulates various cellular processes. Sumoylation often occurs on a specific lysine residue within the consensus motif ψKxE/D. However, little is known about the specificity and selectivity of SUMO target sites. We describe here a SUMO assay with peptide array on solid support for the simultaneous characterization of hundreds of different SUMO target sites. This approach was used to characterize known SUMO substrates. The position of the motif within the peptide and the amino acids flanking the acceptor site affected the efficiency of SUMO modification. Interestingly, a sequence of only four amino acids, corresponding to the SUMO consensus motif without flanking amino acids, was a bona fide target site. Analysis of a peptide library for all variants of the ψKxE/D consensus motif revealed that the first and third positions in the tetrapeptide preferably contain aromatic amino acid residues. Furthermore, by adding the SUMO E3 ligase PIAS1 to the reaction mixture, we show specific enhancement of the modification of a PIAS1-dependent SUMO substrate in this system. Overall, our results demonstrate that the sumoylation assay with peptide array on solid support can be used for the high-throughput characterization of SUMO target sites, and provide new insights into the composition, selectivity and specificity of SUMO target sites.
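A scan for the ψKxE/D tetrapeptide can be sketched with a regular expression. This is a hedged illustration, not the peptide-array assay described above: the abstract does not define ψ, so the residue set `PSI` (large hydrophobic residues V, I, L, M, F) is an assumption, as are the function name and the example sequence.

```python
import re

# Assumed residue set for the psi position (large hydrophobic residues).
# The abstract does not define it; adjust PSI to test other definitions.
PSI = "VILMF"

# A lookahead is used so that overlapping candidate sites are all reported.
MOTIF = re.compile(rf"(?=([{PSI}]K[A-Z][ED]))")


def find_sumo_sites(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based position of the acceptor lysine, matched tetrapeptide)."""
    sequence = sequence.upper()
    # The lysine is the second residue of the match: 0-based start + 1,
    # hence start + 2 in 1-based numbering.
    return [(m.start(1) + 2, m.group(1)) for m in MOTIF.finditer(sequence)]


if __name__ == "__main__":
    # Hypothetical sequence, invented for illustration only.
    print(find_sumo_sites("MAVKEEPIKQD"))
```

Because the abstract reports a preference for aromatic residues at the first and third positions, widening `PSI` (for example to include F, W, and Y) is one way to explore that finding with the same scanner.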

13.
14.
Using a protein design algorithm that considers side-chain packing quantitatively, the effect of explicit backbone motion on the selection of amino acids in protein design was assessed in the core of the streptococcal protein G β1 domain (Gβ1). Concerted backbone motion was introduced by varying Gβ1's supersecondary structure parameter values. The stability and structural flexibility of seven of the redesigned proteins were determined experimentally and showed that core variants containing as many as 6 of 10 possible mutations retain native-like properties. This result demonstrates that backbone flexibility can be combined explicitly with amino acid side-chain selection and that the selection algorithm is sufficiently robust to tolerate perturbations as large as 15% of Gβ1's native supersecondary structure parameter values.

15.
Understanding the determinants of protein stability remains one of protein science's greatest challenges. There are still no computational solutions that calculate the stability effects of even point mutations with sufficient reliability for practical use. Amino acid substitutions rarely increase the stability of native proteins; hence, large libraries and high-throughput screens or selections are needed to stabilize proteins using directed evolution. Consensus mutations have proven effective for increasing stability, but these mutations are successful only about half the time. We set out to understand why some consensus mutations fail to stabilize, and what criteria might be useful to predict stabilization more accurately. Overall, consensus mutations at more conserved positions were more likely to be stabilizing in our model, triosephosphate isomerase (TIM) from Saccharomyces cerevisiae. However, positions coupled to other sites were more likely not to stabilize upon mutation. Destabilizing mutations could be removed both by removing sites with high statistical correlations to other positions and by removing nearly invariant positions at which "hidden correlations" can occur. Application of these rules resulted in identification of stabilizing mutations in 9 out of 10 positions, and amalgamation of all predicted stabilizing positions resulted in the most stable yeast TIM variant we produced (+8 °C). In contrast, a multimutant with 14 mutations each found to stabilize TIM independently was destabilized by 2 °C. Our results are a practical extension to the consensus concept of protein stabilization, and they further suggest the importance of positional independence in the mechanism of consensus stabilization.
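One common way to quantify statistical correlation between two alignment positions is the mutual information of their residue distributions. The abstract does not say which statistic the authors used, so the sketch below is an assumption chosen for illustration; the function name and the toy columns are likewise hypothetical.

```python
import math
from collections import Counter


def column_mutual_information(col_a: str, col_b: str) -> float:
    """Mutual information (in bits) between two columns of an alignment.

    col_a[i] and col_b[i] must be the residues of the same sequence i.
    """
    if len(col_a) != len(col_b) or not col_a:
        raise ValueError("columns must be non-empty and of equal length")
    n = len(col_a)
    count_a = Counter(col_a)
    count_b = Counter(col_b)
    count_ab = Counter(zip(col_a, col_b))
    mi = 0.0
    for (a, b), n_ab in count_ab.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) ), written with raw counts.
        mi += (n_ab / n) * math.log2(n_ab * n / (count_a[a] * count_b[b]))
    return mi


if __name__ == "__main__":
    # Hypothetical columns: the first pair co-varies, the second does not.
    print(column_mutual_information("AAVV", "LLII"))
    print(column_mutual_information("AAVV", "LILI"))
```

Note that a fully invariant column always scores zero with this measure, whatever its partner looks like, which is one way to see why couplings at nearly invariant positions can stay hidden from a purely statistical filter.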

16.
Biophysical Journal, 2021, 120(16): 3455-3469
Protein aggregation is involved in a variety of diseases, including neurodegenerative diseases and cancer. The cellular environment is crowded by a plethora of cosolutes comprising small molecules and biomacromolecules at high concentrations, which may influence the aggregation of proteins in vivo. To account for the effect of cosolutes on cancer-related protein aggregation, we studied their effect on the aggregation of the cancer-related L106R mutant of the Axin protein. Axin is a key player in the Wnt signaling pathway, and the L106R mutation in its RGS domain results in a native molten globule that tends to form native-like aggregates. This results in uncontrolled activation of the Wnt signaling pathway, leading to cancer. We monitored the aggregation process of Axin RGS L106R in vitro in the presence of a wide ensemble of cosolutes including polyols, amino acids, betaine, and polyethylene glycol crowders. Except myo-inositol, all polyols decreased RGS L106R aggregation, with carbohydrates exerting the strongest inhibition. Conversely, betaine and polyethylene glycols enhanced aggregation. These results are consistent with the reported effects of osmolytes and crowders on the stability of molten globular proteins and with both amorphous and amyloid aggregation mechanisms. We suggest a model of Axin L106R aggregation in vivo, whereby molecularly small osmolytes keep the protein as a free soluble molecule but the increased crowding of the bound state by macromolecules induces its aggregation at the nanoscale. To our knowledge, this is the first systematic study on the effect of osmolytes and crowders on a process of native-like aggregation involved in pathology, as it sheds light on the contribution of cosolutes to the onset of cancer as a protein misfolding disease and on the relevance of aggregation in the molecular etiology of cancer.

17.

Background

Global residue-specific amino acid mutagenesis can provide important biological insight and generate proteins with altered properties, but at the risk of protein misfolding. Further, targeted libraries are usually restricted to a handful of amino acids because there is an exponential correlation between the number of residues randomized and the size of the resulting ensemble. Using GFP as the model protein, we present a strategy, termed protein evolution via amino acid and codon elimination, through which simplified, native-like polypeptides encoded by a reduced genetic code were obtained via screening of reduced-size ensembles.

Methodology/Principal Findings

The strategy involves combining a sequential mutagenesis scheme to reduce library size with structurally stabilizing mutations, chaperone complementation, and reduced temperature of gene expression. In six steps, we eliminated a common buried residue, Phe, from the green fluorescent protein (GFP), while retaining activity. A GFP variant containing 11 Phe residues was used as starting scaffold to generate 10 separate variants in which each Phe was replaced individually (in one construct two adjacent Phe residues were changed simultaneously), while retaining varying levels of activity. Combination of these substitutions to generate a Phe-free variant of GFP abolished fluorescence. Combinatorial re-introduction of five Phe residues, based on the activities of the respective single amino acid replacements, was sufficient to restore GFP activity. Successive rounds of mutagenesis generated active GFP variants containing three, two, and zero Phe residues. These GFPs all displayed progenitor-like fluorescence spectra, temperature-sensitive folding, a reduced structural stability and, for the least stable variants, a reduced steady state abundance.

Conclusions/Significance

The results provide strategies for the design of novel GFP reporters. The described approach offers a means to enable engineering of active proteins that lack certain amino acids, a key step towards expanding the functional repertoire of uniquely labeled proteins in synthetic biology.
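The exponential growth in library size noted in the Background of this entry can be made concrete with a short calculation. This sketch assumes full randomization over all 20 amino acids at each position; the function name is hypothetical, and the numbers are plain arithmetic rather than values reported in the study.

```python
def library_size(n_positions: int, alphabet_size: int = 20) -> int:
    """Number of distinct protein variants when n_positions are fully randomized."""
    if n_positions < 0 or alphabet_size < 1:
        raise ValueError("n_positions must be >= 0 and alphabet_size >= 1")
    return alphabet_size ** n_positions


if __name__ == "__main__":
    # Randomizing positions jointly grows as 20**n, whereas randomizing them
    # one at a time (a sequential scheme) grows only as 20 * n.
    for n in (1, 2, 5, 11):
        print(n, library_size(n), 20 * n)
```

The gap between the joint and the one-at-a-time counts is the motivation for sequential mutagenesis schemes of the kind the abstract describes.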

18.
19.
Comparative structural studies on proteins derived from organisms with growth optima ranging from 15 to 100 °C are beginning to shed light on the mechanisms of protein thermoadaptation. One means of sustaining hyperthermostability is for proteins to exist in higher oligomeric forms than their mesophilic homologues. Triosephosphate isomerase (TIM) is one of the most studied enzymes, whose fold represents one of nature's most common protein architectures. Most TIMs are dimers of approximately 250 amino acid residues per monomer. Here, we report the 2.7 Å resolution crystal structure of the extremely thermostable TIM from Pyrococcus woesei, a hyperthermophilic archaeon growing optimally at 100 °C, representing the first archaeal TIM structure. P. woesei TIM exists as a tetramer comprising monomers of only 228 amino acid residues. Structural comparisons with other less thermostable TIMs show that although the central beta-barrel is largely conserved, severe pruning of several helices and truncation of some loops give rise to a much more compact monomer in the small hyperthermophilic TIM. The classical TIM dimer formation is conserved in P. woesei TIM. The extreme thermostability of PwTIM appears to be achieved by the creation of a compact tetramer where two classical TIM dimers interact via an extensive hydrophobic interface. The tetramer is formed through largely hydrophobic interactions between some of the pruned helical regions. The equivalent helical regions in less thermostable dimeric TIMs represent regions of high average temperature factor. The PwTIM seems to have removed these regions of potential instability in the formation of the tetramer. This study of PwTIM provides further support for the role of higher oligomerisation states in extreme thermal stabilisation.

20.
Peptides (33-34 amino acids long) corresponding to the helix-turn-helix (EF-hand) motif of the calcium binding site I of Paramecium tetraurelia calmodulin have been synthesized. The linear sequence was unable to acquire a native-like conformation and calcium binding. However, incorporation of a well-positioned disulfide bond bridging the two putative helical regions greatly improved the ordered structure and binding properties. Analyzed by electrospray mass spectrometry, circular dichroism and time-resolved laser-induced fluorescence, such a disulfide-stabilized peptide is shown to acquire a calcium-dependent helical conformation and exhibits native-like affinity for calcium, terbium and europium ions with 30 ± 1, 3.5 ± 0.6 and 0.6 ± 0.1 µM dissociation constants, respectively. Comparable affinities were calculated within the biological construct comprising the entire domain I of Arabidopsis thaliana calmodulin. Single sequence mutation (Glu25Asp) in the binding loop of the peptide abolishes calcium affinity, but preserves lanthanide affinity, showing that metal selectivity can be modulated by specific mutations. Such disulfide-stabilized peptides represent useful models to engineer metal specificity in new calmodulin proteins, facilitating the development of new systems to monitor metal pollution in biosensors and to increase metal binding capability of bacterial and plant cells in bioremediation techniques.
