首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Currently, one of the most serious problems in protein-folding simulations for de novo structure prediction is conformational sampling of medium-to-large proteins. In vivo, folding of these proteins is mediated by molecular chaperones. Inspired by the functions of chaperonins, we designed a simple chaperonin-like simulation protocol within the framework of the standard fragment assembly method: in our protocol, the strength of the hydrophobic interaction is periodically modulated to help the protein escape from misfolded structures. We tested this protocol for 38 proteins and found that, using a certain defined criterion of success, our method could successfully predict the native structures of 14 targets, whereas only those of 10 targets were successfully predicted using the standard protocol. In particular, for non-α-helical proteins, our method yielded significantly better predictions than the standard approach. This chaperonin-inspired protocol that enhanced de novo structure prediction using folding simulations may, in turn, provide new insights into the working principles underlying the chaperonin system.  相似文献   

2.
Konermann L 《Proteins》2006,65(1):153-163
It should take an astronomical time span for unfolded protein chains to find their native state based on an unguided conformational random search. The experimental observation that folding is fast can be rationalized by assuming that protein energy landscapes are sloped towards the native state minimum, such that rapid folding can proceed from virtually any point in conformational space. Folding transitions often exhibit two-state behavior, involving extensively disordered and highly structured conformers as the only two observable kinetic species. This study employs a simple Brownian dynamics model of "protein particles" moving in a spherically symmetrical potential. As expected, the presence of an overall slope towards the native state minimum is an effective means to speed up folding. However, the two-state nature of the transition is eradicated if a significant energetic bias extends too far into the non-native conformational space. The breakdown of two-state cooperativity under these conditions is caused by a continuous conformational drift of the unfolded proteins. Ideal two-state behavior can only be maintained on surfaces exhibiting large regions that are energetically flat, a result that is supported by other recent data in the literature (Kaya and Chan, Proteins: Struct Funct Genet 2003;52:510-523). Rapid two-state folding requires energy landscapes exhibiting the following features: (i) A large region in conformational space that is energetically flat, thus allowing for a significant degree of random sampling, such that unfolded proteins can retain a random coil structure; (ii) a trapping area that is strongly sloped towards the native state minimum.  相似文献   

3.
Computational de novo protein structure prediction is limited to small proteins of simple topology. The present work explores an approach to extend beyond the current limitations through assembling protein topologies from idealized α-helices and β-strands. The algorithm performs a Monte Carlo Metropolis simulated annealing folding simulation. It optimizes a knowledge-based potential that analyzes radius of gyration, β-strand pairing, secondary structure element (SSE) packing, amino acid pair distance, amino acid environment, contact order, secondary structure prediction agreement and loop closure. Discontinuation of the protein chain favors sampling of non-local contacts and thereby creation of complex protein topologies. The folding simulation is accelerated through exclusion of flexible loop regions further reducing the size of the conformational search space. The algorithm is benchmarked on 66 proteins with lengths between 83 and 293 amino acids. For 61 out of these proteins, the best SSE-only models obtained have an RMSD100 below 8.0 Å and recover more than 20% of the native contacts. The algorithm assembles protein topologies with up to 215 residues and a relative contact order of 0.46. The method is tailored to be used in conjunction with low-resolution or sparse experimental data sets which often provide restraints for regions of defined secondary structure.  相似文献   

4.
We report here a multiprotein blind test of a computer method to predict native protein structures based solely on an all-atom physics-based force field. We use the AMBER 96 potential function with an implicit (GB/SA) model of solvation, combined with replica-exchange molecular-dynamics simulations. Coarse conformational sampling is performed using the zipping and assembly method (ZAM), an approach that is designed to mimic the putative physical routes of protein folding. ZAM was applied to the folding of six proteins, from 76 to 112 monomers in length, in CASP7, a community-wide blind test of protein structure prediction. Because these predictions have about the same level of accuracy as typical bioinformatics methods, and do not utilize information from databases of known native structures, this work opens up the possibility of predicting the structures of membrane proteins, synthetic peptides, or other foldable polymers, for which there is little prior knowledge of native structures. This approach may also be useful for predicting physical protein folding routes, non-native conformations, and other physical properties from amino acid sequences.  相似文献   

5.
Contact order and ab initio protein structure prediction   总被引:1,自引:0,他引:1       下载免费PDF全文
Although much of the motivation for experimental studies of protein folding is to obtain insights for improving protein structure prediction, there has been relatively little connection between experimental protein folding studies and computational structural prediction work in recent years. In the present study, we show that the relationship between protein folding rates and the contact order (CO) of the native structure has implications for ab initio protein structure prediction. Rosetta ab initio folding simulations produce a dearth of high CO structures and an excess of low CO structures, as expected if the computer simulations mimic to some extent the actual folding process. Consistent with this, the majority of failures in ab initio prediction in the CASP4 (critical assessment of structure prediction) experiment involved high CO structures likely to fold much more slowly than the lower CO structures for which reasonable predictions were made. This bias against high CO structures can be partially alleviated by performing large numbers of additional simulations, selecting out the higher CO structures, and eliminating the very low CO structures; this leads to a modest improvement in prediction quality. More significant improvements in predictions for proteins with complex topologies may be possible following significant increases in high-performance computing power, which will be required for thoroughly sampling high CO conformations (high CO proteins can take six orders of magnitude longer to fold than low CO proteins). Importantly for such a strategy, simulations performed for high CO structures converge much less strongly than those for low CO structures, and hence, lack of simulation convergence can indicate the need for improved sampling of high CO conformations. The parallels between Rosetta simulations and folding in vivo may extend to misfolding: The very low CO structures that accumulate in Rosetta simulations consist primarily of local up-down beta-sheets that may resemble precursors to amyloid formation.  相似文献   

6.
Elucidation of the high-resolution structures of folding intermediates is a necessary but difficult step toward the ultimate understanding of the mechanism of protein folding. Here, using hydrogen-exchange-directed protein engineering, we populated the folding intermediate of the Thermus thermophilus ribonuclease H, which forms before the rate-limiting transition state, by removing the unfolded regions of the intermediate, including an α-helix and two β-strands (51 folded residues). Using multidimensional NMR, we solved the structure of this intermediate mimic to an atomic resolution (backbone rmsd, 0.51 Å). It has a native-like backbone topology and shows some local deviations from the native structure, revealing that the structure of the folded region of an early folding intermediate can be as well defined as the native structure. The topological parameters calculated from the structures of the intermediate mimic and the native state predict that the intermediate should fold on a millisecond time scale or less and form much faster than the native state. Other factors that may lead to the slow folding of the native state and the accumulation of the intermediate before the rate-limiting transition state are also discussed.  相似文献   

7.
It has long been proposed that much of the information encoding how a protein folds is contained locally in the peptide chain. Here we present a large-scale simulation study designed to examine the extent to which conformations of peptide fragments in water predict native conformations in proteins. We perform replica exchange molecular dynamics (REMD) simulations of 872 8-mer, 12-mer, and 16-mer peptide fragments from 13 proteins using the AMBER 96 force field and the OBC implicit solvent model. To analyze the simulations, we compute various contact-based metrics, such as contact probability, and then apply Bayesian classifier methods to infer which metastable contacts are likely to be native vs. non-native. We find that a simple measure, the observed contact probability, is largely more predictive of a peptide''s native structure in the protein than combinations of metrics or multi-body components. Our best classification model is a logistic regression model that can achieve up to 63% correct classifications for 8-mers, 71% for 12-mers, and 76% for 16-mers. We validate these results on fragments of a protein outside our training set. We conclude that local structure provides information to solve some but not all of the conformational search problem. These results help improve our understanding of folding mechanisms, and have implications for improving physics-based conformational sampling and structure prediction using all-atom molecular simulations.  相似文献   

8.
Development of a tightly packed hydrophobic core drives the folding of water-soluble globular proteins and is a key determinant of protein stability. Despite this, there remains much to be learnt about how and when the hydrophobic core becomes desolvated and tightly packed during protein folding. We have used the bacterial immunity protein Im7 to examine the specificity of hydrophobic core packing during folding. This small, four-helix protein has previously been shown to fold via a compact three-helical intermediate state. Here, overpacking substitutions, in which residue side-chain size is increased, were used to examine the specificity and malleability of core packing in the folding intermediate and rate-limiting transition state. In parallel, polar groups were introduced into the Im7 hydrophobic core via Val→Thr or Phe→Tyr substitutions and used to determine the solvation status of core residues at different stages of folding. Over 30 Im7 variants were created allowing both series of substitutions to cover all regions of the protein structure. Φ-value analysis demonstrated that the major changes in Im7 core solvation occur prior to the population of the folding intermediate, with key regions involved in docking of the short helix III remaining solvent-exposed until after the rate-limiting transition state has been traversed. In contrast, overpacking core residues revealed that some regions of the native Im7 core are remarkably malleable to increases in side-chain volume. Overpacking residues in other regions of the Im7 core result in substantial (> 2.5 kJ mol− 1) destabilisation of the native structure or even prevents efficient folding to the native state. This study provides new insights into Im7 folding; demonstrating that whilst desolvation occurs early during folding, adoption of a specifically packed core is achieved only at the very last step in the folding mechanism.  相似文献   

9.
De novo protein structure prediction requires location of the lowest energy state of the polypeptide chain among a vast set of possible conformations. Powerful approaches include conformational space annealing, in which search progressively focuses on the most promising regions of conformational space, and genetic algorithms, in which features of the best conformations thus far identified are recombined. We describe a new approach that combines the strengths of these two approaches. Protein conformations are projected onto a discrete feature space which includes backbone torsion angles, secondary structure, and beta pairings. For each of these there is one “native” value: the one found in the native structure. We begin with a large number of conformations generated in independent Monte Carlo structure prediction trajectories from Rosetta. Native values for each feature are predicted from the frequencies of feature value occurrences and the energy distribution in conformations containing them. A second round of structure prediction trajectories are then guided by the predicted native feature distributions. We show that native features can be predicted at much higher than background rates, and that using the predicted feature distributions improves structure prediction in a benchmark of 28 proteins. The advantages of our approach are that features from many different input structures can be combined simultaneously without producing atomic clashes or otherwise physically inviable models, and that the features being recombined have a relatively high chance of being correct. Proteins 2010. © 2009 Wiley‐Liss, Inc.  相似文献   

10.
The three-dimensional structure of a protein is a key determinant of its biological function. Given the cost and time required to acquire this structure through experimental means, computational models are necessary to complement wet-lab efforts. Many computational techniques exist for navigating the high-dimensional protein conformational search space, which is explored for low-energy conformations that comprise a protein's native states. This work proposes two strategies to enhance the sampling of conformations near the native state. An enhanced fragment library with greater structural diversity is used to expand the search space in the context of fragment-based assembly. To manage the increased complexity of the search space, only a representative subset of the sampled conformations is retained to further guide the search towards the native state. Our results make the case that these two strategies greatly enhance the sampling of the conformational space near the native state. A detailed comparative analysis shows that our approach performs as well as state-of-the-art ab initio structure prediction protocols.  相似文献   

11.
Investigations into protein folding are largely dominated by studies on monomeric proteins. However, the transmembrane domain of an important group of membrane proteins is only formed upon multimerization. Here, we use in vitro translation-coupled folding and insertion into artificial liposomes to investigate kinetic steps in the assembly of one such protein, the outer membrane secretin PulD of the bacterial type II secretion system. Analysis of the folding kinetics, measured by the acquisition of distinct determinants of the native state, provides unprecedented evidence for a sequential multistep process initiated by membrane-driven oligomerization. The effects of varying the lipid composition of the liposomes indicate that PulD first forms a “prepore” structure that attains the native state via a conformational switch.  相似文献   

12.
Over the last few years we have developed an empirical potential function that solves the protein structure recognition problem: given the sequence for an n-residue globular protein and a collection of plausible protein conformations, including the native conformation for that sequence, identify the correct, native conformation. Having determined this potential on the basis of only some 6500 native/nonnative pairs of structures for 58 proteins, we find it recognizes the native conformation for essentially all compact, soluble, globular proteins having known native conformations in comparisons with 104 to 106 reasonable alternative conformations apiece. In this sense, the potential encodes nearly all the essential features of globular protein conformational preference. In addition it “knows” about many additional factors in protein folding, such as the stabilization of multimeric proteins, quaternary structure, the role of disulfide bridges and ligands, proproteins vs. processed proteins, and minimal strand lengths in globular proteins. Comparisons are made with other sorts of protein folding problems, and applications in protein conformational determination and prediction are discussed. © 1994 Wiley-Liss, Inc.  相似文献   

13.
Proteinase K (E.C. 3.4.21.64), a serine proteinase from fungus Tritirachium album, has been used as a model system to investigate the conformational changes induced by monohydric alcohols at low pH. Proteinase K belongs to α/β class of proteins and maintains structural integrity in the range of pH 7.0–3.0. Enzyme acquires partially unfolded conformation (UP) at pH 2.5 with lower activity, partial loss of tertiary structure and exposure of some hydrophobic patches. Proteinase K in stressed state at pH 2.5 is chosen and the conformational changes induced by alkyl alcohols (methanol/ethanol/isopropanol) are studied. At critical concentration of alcohol, conformational switch occurs in the protein structure from α/β to β-sheet driving the protein into O-state. Complete loss of tertiary contacts and proteolytic activity in O-sate emphasize the involvement of alpha regions in maintaining the active site of the enzyme. Moreover, isopropanol induced unfolding of proteinase K in UP state occurred in two steps with the formation of β state at low alcohol concentration followed by stabilization of β state at high alcohol concentration. GuHCl and temperature induced unfolding of proteinase K in O-state (in 50% isopropanol) is non-cooperative as the transition curves are biphasic. This suggests that the structure of proteinase K in O-state has melted alpha regions and stabilized beta regions and that these differentially stabilized regions unfold sequentially. Further, the O-state of proteinase K can be attained from complete unfolded protein by the addition of 50% isopropanol. Hence the alcohol-induced O-state is different from native state or completely unfolded state and shows characteristics of the molten globule-like state. Thus, this state may be functioning as an intermediary in the folding pathway of proteinase K.  相似文献   

14.
Thompson J  Baker D 《Proteins》2011,79(8):2380-2388
Prediction of protein structures from sequences is a fundamental problem in computational biology. Algorithms that attempt to predict a structure from sequence primarily use two sources of information. The first source is physical in nature: proteins fold into their lowest energy state. Given an energy function that describes the interactions governing folding, a method for constructing models of protein structures, and the amino acid sequence of a protein of interest, the structure prediction problem becomes a search for the lowest energy structure. Evolution provides an orthogonal source of information: proteins of similar sequences have similar structure, and therefore proteins of known structure can guide modeling. The relatively successful Rosetta approach takes advantage of the first, but not the second source of information during model optimization. Following the classic work by Andrej Sali and colleagues, we develop a probabilistic approach to derive spatial restraints from proteins of known structure using advances in alignment technology and the growth in the number of structures in the Protein Data Bank. These restraints define a region of conformational space that is high-probability, given the template information, and we incorporate them into Rosetta's comparative modeling protocol. The combined approach performs considerably better on a benchmark based on previous CASP experiments. Incorporating evolutionary information into Rosetta is analogous to incorporating sparse experimental data: in both cases, the additional information eliminates large regions of conformational space and increases the probability that energy-based refinement will hone in on the deep energy minimum at the native state.  相似文献   

15.
An important puzzle in structural biology is the question of how proteins are able to fold so quickly into their unique native structures. There is much evidence that protein folding is hierarchic. In that case, folding routes are not linear, but have a tree structure. Trees are commonly used to represent the grammatical structure of natural language sentences, and chart parsing algorithms efficiently search the space of all possible trees for a given input string. Here we show that one such method, the CKY algorithm, can be useful both for providing novel insight into the physical protein folding process, and for computational protein structure prediction. As proof of concept, we apply this algorithm to the HP lattice model of proteins. Our algorithm identifies all direct folding route trees to the native state and allows us to construct a simple model of the folding process. Despite its simplicity, our model provides an account for the fact that folding rates depend only on the topology of the native state but not on sequence composition.  相似文献   

16.
Convergence of the vast sequence space of proteins into a highly restricted fold/conformational space suggests a simple yet unique underlying mechanism of protein folding that has been the subject of much debate in the last several decades. One of the major challenges related to the understanding of protein folding or in silico protein structure prediction is the discrimination of non-native structures/decoys from the native structure. Applications of knowledge-based potentials to attain this goal have been extensively reported in the literature. Also, scoring functions based on accessible surface area and amino acid neighbourhood considerations were used in discriminating the decoys from native structures. In this article, we have explored the potential of protein structure network (PSN) parameters to validate the native proteins against a large number of decoy structures generated by diverse methods. We are guided by two principles: (a) the PSNs capture the local properties from a global perspective and (b) inclusion of non-covalent interactions, at all-atom level, including the side-chain atoms, in the network construction accommodates the sequence dependent features. Several network parameters such as the size of the largest cluster, community size, clustering coefficient are evaluated and scored on the basis of the rank of the native structures and the Z-scores. The network analysis of decoy structures highlights the importance of the global properties contributing to the uniqueness of native structures. The analysis also exhibits that the network parameters can be used as metrics to identify the native structures and filter out non-native structures/decoys in a large number of data-sets; thus also has a potential to be used in the protein ‘structure prediction’ problem.  相似文献   

17.
During CASP10 in summer 2012, we tested BCL::Fold for prediction of free modeling (FM) and template‐based modeling (TBM) targets. BCL::Fold assembles the tertiary structure of a protein from predicted secondary structure elements (SSEs) omitting more flexible loop regions early on. This approach enables the sampling of conformational space for larger proteins with more complex topologies. In preparation of CASP11, we analyzed the quality of CASP10 models throughout the prediction pipeline to understand BCL::Fold's ability to sample the native topology, identify native‐like models by scoring and/or clustering approaches, and our ability to add loop regions and side chains to initial SSE‐only models. The standout observation is that BCL::Fold sampled topologies with a GDT_TS score > 33% for 12 of 18 and with a topology score > 0.8 for 11 of 18 test cases de novo. Despite the sampling success of BCL::Fold, significant challenges still exist in clustering and loop generation stages of the pipeline. The clustering approach employed for model selection often failed to identify the most native‐like assembly of SSEs for further refinement and submission. It was also observed that for some β‐strand proteins model refinement failed as β‐strands were not properly aligned to form hydrogen bonds removing otherwise accurate models from the pool. Further, BCL::Fold samples frequently non‐natural topologies that require loop regions to pass through the center of the protein. Proteins 2015; 83:547–563. © 2015 Wiley Periodicals, Inc.  相似文献   

18.
Huang JT  Cheng JP 《Proteins》2007,68(1):218-222
Folding kinetics of proteins is governed by the free energy and position of transition states. But attempts to predict the position of folding transition state on reaction pathway from protein structure have been met with only limited success, unlike the folding-rate prediction. Here, we find that the folding transition-state position is related to the secondary structure content of native two-state proteins. We present a simple method for predicting the transition-state position from their alpha-helix, turn and polyproline secondary structures. The method achieves 81% correlation with experiment over 24 small, two-state proteins, suggesting that the local secondary structure content, especially for content of alpha-helix, is a determinant of the solvent accessibility of the transition state ensemble and size of folding nucleus.  相似文献   

19.
20.
The unfolded state of a protein is an ensemble of a large number of conformations ranging from fully extended to compact structures. To investigate the effects of the difference in the unfolded-state ensemble on protein folding, we have studied the structure, stability, and folding of "circular" dihydrofolate reductase (DHFR) from Escherichia coli in which the N and C-terminal regions are cross-linked by a disulfide bond, and compared the results with those of disulfide-reduced "linear" DHFR. Equilibrium studies by circular dichroism, difference absorption spectra, solution X-ray scattering, and size-exclusion chromatography show that whereas the native structures of both proteins are essentially the same, the unfolded state of circular DHFR adopts more compact conformations than the unfolded state of the linear form, even with the absence of secondary structure. Circular DHFR is more stable than linear DHFR, which may be due to the decrease in the conformational entropy of the unfolded state as a result of circularization. Kinetic refolding measurements by stopped-flow circular dichroism and fluorescence show that under the native conditions both proteins accumulate a burst-phase intermediate having the same structures and both fold by the same complex folding mechanism with the same folding rates. Thus, the effects of the difference in the unfolded state of circular and linear DHFRs on the refolding reaction are not observed after the formation of the intermediate. This suggests that for the proteins with close termini in the native structure, early compaction of a protein molecule to form a specific folding intermediate with the N and C-terminal regions in close proximity is a crucial event in folding. If there is an enhancement in the folding reflecting the reduction in the breadth of the unfolded-state ensemble for circular DHFR, this acceleration must occur in the sub-millisecond time-range.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号