首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 125 毫秒
1.
Computational protein design is a reverse procedure of protein folding and structure prediction, where constructing structures from evolutionarily related proteins has been demonstrated to be the most reliable method for protein 3-dimensional structure prediction. Following this spirit, we developed a novel method to design new protein sequences based on evolutionarily related protein families. For a given target structure, a set of proteins having similar fold are identified from the PDB library by structural alignments. A structural profile is then constructed from the protein templates and used to guide the conformational search of amino acid sequence space, where physicochemical packing is accommodated by single-sequence based solvation, torsion angle, and secondary structure predictions. The method was tested on a computational folding experiment based on a large set of 87 protein structures covering different fold classes, which showed that the evolution-based design significantly enhances the foldability and biological functionality of the designed sequences compared to the traditional physics-based force field methods. Without using homologous proteins, the designed sequences can be folded with an average root-mean-square-deviation of 2.1 Å to the target. As a case study, the method is extended to redesign all 243 structurally resolved proteins in the pathogenic bacteria Mycobacterium tuberculosis, which is the second leading cause of death from infectious disease. On a smaller scale, five sequences were randomly selected from the design pool and subjected to experimental validation. The results showed that all the designed proteins are soluble with distinct secondary structure and three have well ordered tertiary structure, as demonstrated by circular dichroism and NMR spectroscopy. Together, these results demonstrate a new avenue in computational protein design that uses knowledge of evolutionary conservation from protein structural families to engineer new protein molecules of improved fold stability and biological functionality.  相似文献   

2.
De novo sequence design of foldable proteins provides a way of investigating principles of protein architecture. We performed fully automated sequence design for a target structure having a three-helix bundle topology and synthesized the designed sequences. Our design principle is different from the conventional approach, in that instead of optimizing interactions within the target structure, we design the global shape of the protein folding funnel. This includes automated implementation of negative design by explicitly requiring higher free energy of the denatured state. The designed sequences do not have significant similarity to those of any natural proteins. The NMR and CD spectroscopic data indicated that one designed sequence has a well-defined three-dimensional structure as well as alpha-helical content consistent with the target.  相似文献   

3.
To determine the extent to which protein folding rates and free energy landscapes have been shaped by natural selection, we have examined the folding kinetics of five proteins generated using computational design methods and, hence, never exposed to natural selection. Four of these proteins are complete computer-generated redesigns of naturally occurring structures and the fifth protein, called Top7, has a computer-generated fold not yet observed in nature. We find that three of the four redesigned proteins fold much faster than their naturally occurring counterparts. While natural selection thus does not appear to operate on protein folding rates, the majority of the designed proteins unfold considerably faster than their naturally occurring counterparts, suggesting possible selection for a high free energy barrier to unfolding. In contrast to almost all naturally occurring proteins of less than 100 residues but consistent with simple computational models, the folding energy landscape for Top7 appears to be quite complex, suggesting the smooth energy landscapes and highly cooperative folding transitions observed for small naturally occurring proteins may also reflect the workings of natural selection.  相似文献   

4.
Experiments were designed to explore the tolerance of protein structure and folding to very large insertions of folded protein within a structural domain. Dihydrofolate reductase and beta-lactamase have been inserted in four different positions of phosphoglycerate kinase. The resultant chimeric proteins are all overexpressed, and the host as well as the inserted partners are functional. Although not explicitly designed, functional coupling between the two fused partners was observed in some of the chimeras. These results show that the tolerance of protein structures to very large structured insertions is more general than previously expected and supports the idea that the natural sequence continuity of a structural domain is not required for the folding process. These results directly suggest a new experimental approach to screen, for example, for folded protein in randomized polypeptide sequences.  相似文献   

5.
Measurements of protection against exchange of main chain amide hydrogens (NH) with solvent hydrogens in globular proteins have provided remarkable insights into the structures of rare high‐energy states that populate their folding free‐energy surfaces. Lacking, however, has been a unifying theory that rationalizes these high‐energy states in terms of the structures and sequences of their resident proteins. The Branched Aliphatic Side Chain (BASiC) hypothesis has been developed to explain the observed patterns of protection in a pair of TIM barrel proteins. This hypothesis supposes that the side chains of isoleucine, leucine, and valine (ILV) residues often form large hydrophobic clusters that very effectively impede the penetration of water to their underlying hydrogen bond networks and, thereby, enhance the protection against solvent exchange. The linkage between the secondary and tertiary structures enables these ILV clusters to serve as cores of stability in high‐energy partially folded states. Statistically significant correlations between the locations of large ILV clusters in native conformations and strong protection against exchange for a variety of motifs reported in the literature support the generality of the BASiC hypothesis. The results also illustrate the necessity to elaborate this simple hypothesis to account for the roles of adjacent hydrocarbon moieties in defining stability cores of partially folded states along folding reaction coordinates.  相似文献   

6.
Many single-domain proteins with <100 residues fold cooperatively; but the recently designed 92-residue Top7 protein exhibits clearly non-two-state behaviors. In apparent agreement with experiment, we found that coarse-grained, native-centric chain models, including potentials with and without elementary desolvation barriers, predicted that Top7 has a stable intermediate state in which the C-terminal fragment is folded while the rest of the chain remains disordered. We observed noncooperative folding in Top7 models that incorporated nonnative hydrophobic interactions as well. In contrast, free energy profiles deduced from models with desolvation barriers for a set of thirteen natural proteins with similar chain lengths and secondary structure elements suggested that they fold much more cooperatively than Top7. Buttressed by related studies on smaller natural proteins with chain lengths of ∼40 residues, our findings argue that the de novo native topology of Top7 likely imposed a significant restriction on the cooperativity achievable by any design for this target structure.  相似文献   

7.
Photoswitchable distance constraints in the form of photoisomerizable chemical cross-links offer a general approach to the design of reversibly photocontrolled proteins. To apply these effectively, however, one must have guidelines for the choice of cross-linker structure and cross-linker attachment sites. Here we investigate the effects of varying cross-linker structure on the photocontrol of folding of the Fyn SH3 domain, a well-studied model protein. We develop a theoretical framework based on an explicit-chain model of protein folding, modified to include detailed model linkers, that allows prediction of the effect of a given linker on the free energy of folding of a protein. Using this framework, we were able to quantitatively explain the experimental result that a longer, but somewhat flexible, cross-linker is less destabilizing to the folded state than a shorter more rigid cross-linker. The models also suggest how misfolded states may be generated by cross-linking, providing a rationale for altered dynamics seen in nuclear magnetic resonance analyses of these proteins. The theoretical framework is readily portable to any protein of known folded state structure and thus can be used to guide the design of photoswitchable proteins generally.  相似文献   

8.
This work investigates whether mRNA has a lower estimated folding free energy than random sequences. The free energy estimates are calculated by the mfold program for prediction of RNA secondary structures. For a set of 46 mRNAs it is shown that the predicted free energy is not significantly different from random sequences with the same dinucleotide distribution. For random sequences with the same mononucleotide distribution it has previously been shown that the native mRNA sequences have a lower predicted free energy, which indicates a more stable structure than random sequences. However, dinucleotide content is important when assessing the significance of predicted free energy as the physical stability of RNA secondary structure is known to depend on dinucleotide base stacking energies. Even known RNA secondary structures, like tRNAs, can be shown to have predicted free energies indistinguishable from randomized sequences. This suggests that the predicted free energy is not always a good determinant for RNA folding.  相似文献   

9.
Intrinsically disordered proteins (IDPs) are extensively involved in dynamic signaling processes which require a high association rate and a high dissociation rate for rapid binding/unbinding events and at the same time a sufficient high affinity for specific recognition. Although the coupled folding-binding processes of IDPs have been extensively studied, it is still impossible to predict whether an unfolded protein is suitable for molecular signaling via coupled folding-binding. In this work, we studied the interplay between intrinsic folding mechanisms and coupled folding-binding process for unfolded proteins through molecular dynamics simulations. We first studied the folding process of three representative IDPs with different folded structures, that is, c-Myb, AF9, and E3 rRNase. We found the folding free energy landscapes of IDPs are downhill or show low barriers. To further study the influence of intrinsic folding mechanism on the binding process, we modulated the folding mechanism of barnase via circular permutation and simulated the coupled folding-binding process between unfolded barnase permutant and folded barstar. Although folding of barnase was coupled to target binding, the binding kinetics was significantly affected by the intrinsic folding free energy barrier, where reducing the folding free energy barrier enhances binding rate up to two orders of magnitude. This accelerating effect is different from previous results which reflect the effect of structure flexibility on binding kinetics. Our results suggest that coupling the folding of an unfolded protein with no/low folding free energy barrier with its target binding may provide a way to achieve high specificity and rapid binding/unbinding kinetics simultaneously.  相似文献   

10.
The folding specificity of proteins can be simulated using simplified structural models and knowledge-based pair-potentials. However, when the same models are used to simulate systems that contain many proteins, large aggregates tend to form. In other words, these models cannot account for the fact that folded, globular proteins are soluble. Here we show that knowledge-based pair-potentials, which include explicitly calculated energy terms between the solvent and each amino acid, enable the simulation of proteins that are much less aggregation-prone in the folded state. Our analysis clarifies why including a solvent term improves the foldability. The aggregation for potentials without water is due to the unrealistically attractive interactions between polar residues, causing artificial clustering. When a water-based potential is used instead, polar residues prefer to interact with water; this leads to designed protein surfaces rich in polar residues and well-defined hydrophobic cores, as observed in real protein structures. We developed a simple knowledge-based method to calculate interactions between the solvent and amino acids. The method provides a starting point for modeling the folding and aggregation of soluble proteins. Analysis of our simple model suggests that inclusion of these solvent terms may also improve off-lattice potentials for protein simulation, design, and structure prediction.  相似文献   

11.
12.
13.
从蛋白质折叠成自由能最小的稳定结构类型为研究的出发点,为揭示蛋白质空间折叠的动力学本质,对非同源蛋白质数据库,以蛋白质序列的氮基酸频率和自协方差函数为特征矢量,求出表征特征矢量中各分量耦合作用与协同作用的协方差矩阵所对应的特征值.与Chou的方法相比,更全面地反映了蛋白质折叠密码的简并性、全局性和多意性,为定量表征折叠成不同结构类的蛋白质,提供了一种动力学参数分析方法.  相似文献   

14.
Rashin AA  Rashin AH 《Proteins》2007,66(2):321-341
Two-dimensional lattice protein models were studied in two approximations of the conformational equilibrium to elucidate the role of surface hydrophobic groups in their stabilities. We demonstrate that stability of any compactly folded sequence is determined by its ability to "flip-flop" (refold) into alternative compact structures. The degree of stability required for folded sequences determines the average numbers of surface hydrophobic groups in stable lattice structures which are in good agreement with ratios of core to surface hydrophobic groups in real proteins. However, the average destabilization of the native structure per surface hydrophobic group is small (0-0.25 kcal/mol), often disagrees with the free energies derived from the ratios of core to surface hydrophobic groups in the same structures, and has a combinatorial entropic nature independent of the strength of structure stabilizing interactions. This suggests that the free energies derived from the core to surface ratios of hydrophobic groups in real proteins have little to do with folding thermodynamics. On average, sequences with highly stable native structures are the least hydrophobic. The results suggest that in designing novel stable proteins hydrophobic groups on the surface should be avoided to reduce the possibility of flip-flopping. The average stability of highly designable structures is never higher than that of some low designability structures, contrary to the accepted view. In the equilibrium approximation with alternative compact and partially unfolded structures, the requirement of high stability selects a unique 5 x 5 structure formed by only a few sequences, suggesting much stronger sequence selectivity than commonly thought.  相似文献   

15.
Predicting RNA secondary structure is often the first step to determining the structure of RNA. Prediction approaches have historically avoided searching for pseudoknots because of the extreme combinatorial and time complexity of the problem. Yet neglecting pseudoknots limits the utility of such approaches. Here, an algorithm utilizing structure mapping and thermodynamics is introduced for RNA pseudoknot prediction that finds the minimum free energy and identifies information about the flexibility of the RNA. The heuristic approach takes advantage of the 5' to 3' folding direction of many biological RNA molecules and is consistent with the hierarchical folding hypothesis and the contact order model. Mapping methods are used to build and analyze the folded structure for pseudoknots and to add important 3D structural considerations. The program can predict some well known pseudoknot structures correctly. The results of this study suggest that many functional RNA sequences are optimized for proper folding. They also suggest directions we can proceed in the future to achieve even better results.  相似文献   

16.
The protein folding problem was apparently solved recently by the advent of a deep learning method for protein structure prediction called AlphaFold. However, this program is not able to make predictions about the protein folding pathways. Moreover, it only treats about half of the human proteome, as the remaining proteins are intrinsically disordered or contain disordered regions. By definition these proteins differ from natively folded proteins and do not adopt a properly folded structure in solution. However these intrinsically disordered proteins (IDPs) also systematically differ in amino acid composition and uniquely often become folded upon binding to an interaction partner. These factors preclude solving IDP structures by current machine-learning methods like AlphaFold, which also cannot solve the protein aggregation problem, since this meta-folding process can give rise to different aggregate sizes and structures. An alternative computational method is provided by molecular dynamics simulations that already successfully explored the energy landscapes of IDP conformational switching and protein aggregation in multiple cases. These energy landscapes are very different from those of ‘simple’ protein folding, where one energy funnel leads to a unique protein structure. Instead, the energy landscapes of IDP conformational switching and protein aggregation feature a number of minima for different competing low-energy structures. In this review, I discuss the characteristics of these multifunneled energy landscapes in detail, illustrated by molecular dynamics simulations that elucidated the underlying conformational transitions and aggregation processes.  相似文献   

17.
A previously developed computer program for protein design, RosettaDesign, was used to predict low free energy sequences for nine naturally occurring protein backbones. RosettaDesign had no knowledge of the naturally occurring sequences and on average 65% of the residues in the designed sequences differ from wild-type. Synthetic genes for ten completely redesigned proteins were generated, and the proteins were expressed, purified, and then characterized using circular dichroism, chemical and temperature denaturation and NMR experiments. Although high-resolution structures have not yet been determined, eight of these proteins appear to be folded and their circular dichroism spectra are similar to those of their wild-type counterparts. Six of the proteins have stabilities equal to or up to 7kcal/mol greater than their wild-type counterparts, and four of the proteins have NMR spectra consistent with a well-packed, rigid structure. These encouraging results indicate that the computational protein design methods can, with significant reliability, identify amino acid sequences compatible with a target protein backbone.  相似文献   

18.
To illuminate the evolutionary pressure acting on the folding free energy landscapes of naturally occurring proteins, we have systematically characterized the folding free energy landscape of Top7, a computationally designed protein lacking an evolutionary history. Stopped-flow kinetics, circular dichroism, and NMR experiments reveal that there are at least three distinct phases in the folding of Top7, that a nonnative conformation is stable at equilibrium, and that multiple fragments of Top7 are stable in isolation. These results indicate that the folding of Top7 is significantly less cooperative than the folding of similarly sized naturally occurring proteins, suggesting that the cooperative folding and smooth free energy landscapes observed for small naturally occurring proteins are not general properties of polypeptide chains that fold to unique stable structures but are instead a product of natural selection.  相似文献   

19.
Nobuhiro G   Haruo Abe 《Biopolymers》1981,20(5):991-1011
A statistical-mechanical model (a noninteracting local structure model) of folding and unfolding transition in globular proteins is described and a formulation is given to calculate the partition function. The process of transition is discussed in this model within the framework of equilibrium statistical mechanics. In order to clarify the range of applicability of such an approach, the characteristics of the folding and unfolding transition in globular proteins are analyzed from the statistical-physical point of view. A theoretical advantage is pointed out in studying folding and unfolding processes taking place as conformational fluctuations in individual protein molecules under macroscopic equilibrium at the melting temperature. In this case, paths of folding and unfolding are shown to be identical in the statistical sense. A key to the noninteracting local structure model lies in the concept of local structures and the assumption of the absence of interactions between local structures. A local structure is defined as a continuous section of the chain which takes the same or similar local conformation as in the native conformation. The assumption of the absence of inter-actions between local structures endows the model with the remarkable character that its partition function can be calculated exactly; thereby the equilibrium population of various conformations along the folding and unfolding paths can be discussed only by a knowledge of the folded native conformation.  相似文献   

20.
Recent advances in modeling protein structures at the atomic level have made it possible to tackle "de novo" computational protein design. Most procedures are based on combinatorial optimization using a scoring function that estimates the folding free energy of a protein sequence on a given main-chain structure. However, the computation of the conformational entropy in the folded state is generally an intractable problem, and its contribution to the free energy is not properly evaluated. In this article, we propose a new automated protein design methodology that incorporates such conformational entropy based on statistical mechanics principles. We define the free energy of a protein sequence by the corresponding partition function over rotamer states. The free energy is written in variational form in a pairwise approximation and minimized using the Belief Propagation algorithm. In this way, a free energy is associated to each amino acid sequence: we use this insight to rescore the results obtained with a standard minimization method, with the energy as the cost function. Then, we set up a design method that directly uses the free energy as a cost function in combination with a stochastic search in the sequence space. We validate the methods on the design of three superficial sites of a small SH3 domain, and then apply them to the complete redesign of 27 proteins. Our results indicate that accounting for entropic contribution in the score function affects the outcome in a highly nontrivial way, and might improve current computational design techniques based on protein stability.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号