首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
It is a challenging task to understand the relationship between sequences and folding rates of proteins. Previous studies are found that one of contact order (CO), long-range order (LRO), total contact distance (TCD), chain topology parameter (CTP), and effective length (Leff) has a significant correlation with folding rate of proteins. In this paper, we introduce a new parameter called n-order contact distance (nOCD) and use it to predict folding rate of proteins with two- and three-state folding kinetics. A good linear correlation between the folding rate logarithm lnkf and nOCD with n=1.2, alpha=0.6 is found for two-state folders (correlation coefficient is -0.809, P-value<0.0001) and n=2.8, alpha=1.5 for three-state folders (correlation coefficient is -0.816, P-value<0.0001). However, this correlation is completely absent for three-state folders with n=1.2, alpha=0.6 (correlation coefficient is 0.0943, P-value=0.661) and for two-state folders with n=2.8, alpha=1.5 (correlation coefficient is -0.235, P-value=0.2116). We also find that the average number of contacts per residue Pm in the interval of m for two-state folders is smaller than that for three-state folders. The probability distribution P(gamma) of residue having gamma pairs of contacts fits a Gaussian distribution for both two- and three-state folders. We observe that the correlations between square radius of gyration S2 and number of residues for two- and three-state folders are both good, and the correlation coefficient is 0.908 and 0.901, and the slope of the fitting line is 1.202 and 0.795, respectively. Maybe three-state folders are more compact than two-state folders. Comparisons with nTCD and nCTP are also made, and it is found that nOCD is the best one in folding rate prediction.  相似文献   

2.
3.
Huang JT  Cheng JP  Chen H 《Proteins》2007,67(1):12-17
We present a simple method for determining the folding rates of two- and three-state proteins from the number of residues in their secondary structures (secondary structure length). The method is based on the hypothesis that two- and three-state foldings share a common pattern. Three-state proteins first condense into metastable intermediates, subsequent forming of alpha-helices, turns, and beta-sheets at slow rate-limiting step. The folding rate of such proteins anticorrelate with the length of these beta-secondary structures. It is also assumed that in two-state folding, rapidly folded alpha-helices and turns may facilitate formation of fleeting unobservable intermediates and thus show two-state behavior. There is an inverse relationship between the folding rate and the length of beta-sheets and loops. Our study achieves 94.0 and 88.1% correlations with folding rates determined experimentally for 21 three- and 38 two-state proteins, respectively, suggesting that protein-folding rates are determined by the secondary structure length. The kinetic kinds are selected on the basis of a competitive formation of hydrophobic collapse and alpha-structure in early intermediates.  相似文献   

4.
Contact order revisited: influence of protein size on the folding rate   总被引:13,自引:0,他引:13       下载免费PDF全文
Guided by the recent success of empirical model predicting the folding rates of small two-state folding proteins from the relative contact order (CO) of their native structures, by a theoretical model of protein folding that predicts that logarithm of the folding rate decreases with the protein chain length L as L(2/3), and by the finding that the folding rates of multistate folding proteins strongly correlate with their sizes and have very bad correlation with CO, we reexamined the dependence of folding rate on CO and L in attempt to find a structural parameter that determines folding rates for the totality of proteins. We show that the Abs_CO = CO x L, is able to predict rather accurately folding rates for both two-state and multistate folding proteins, as well as short peptides, and that this Abs_CO scales with the protein chain length as L(0.70 +/- 0.07) for the totality of studied single-domain proteins and peptides.  相似文献   

5.
We have collected the kinetic folding data for non-two-state and two-state globular proteins reported in the literature, and investigated the relationships between the folding kinetics and the native three-dimensional structure of these proteins. The rate constants of formation of both the intermediate and the native state of non-two-state folders were found to be significantly correlated with protein chain length and native backbone topology, which is represented by the absolute contact order and sequence-distant native pairs. The folding rate of two-state folders, which is known to be correlated with the native backbone topology, apparently does not correlate significantly with protein chain length. On the basis of a comparison of the folding rates of the non-two-state and two-state folders, it was found that they are similarly dependent on the parameters that reflect the native backbone topology. This suggests that the mechanisms behind non-two-state and two-state folding are essentially identical. The present results lead us to propose a unified mechanism of protein folding, in which folding occurs in a hierarchical manner, reflecting the hierarchy of the native three-dimensional structure, as embodied in the case of non-two-state folding with an accumulation of the intermediate. Apparently, two-state folding is merely a simplified version of hierarchical folding caused either by an alteration in the rate-limiting step of folding or by destabilization of the intermediate.  相似文献   

6.
Dixit PD  Weikl TR 《Proteins》2006,64(1):193-197
The folding rates of two-state proteins have been found to correlate with simple measures of native-state topology. The most prominent among these measures is the relative contact order (CO), which is the average CO, or localness, of all contacts in the native protein structure, divided by the chain length. Here, we test whether such measures can be generalized to capture the effect of chain crosslinks on the folding rate. Crosslinks change the chain connectivity and therefore also the localness of some of the native contacts. These changes in localness can be taken into account by the graph-theoretical concept of effective contact order (ECO). The relative ECO, however, the natural extension of the relative CO for proteins with crosslinks, overestimates the changes in the folding rates caused by crosslinks. We suggest here a novel measure of native-state topology, the relative logCO, and its natural extension, the relative logECO. The relative logCO is the average value for the logarithm of the CO of all contacts, divided by the logarithm of the chain length. The relative log(E)CO reproduces the folding rates of a set of 26 two-state proteins without crosslinks with essentially the same high correlation coefficient as the relative CO. In addition, it also captures the folding rates of eight two-state proteins with crosslinks.  相似文献   

7.
Ma BG  Guo JX  Zhang HY 《Proteins》2006,65(2):362-372
Discovering the mechanism of protein folding, in molecular biology, is a great challenge. A key step to this end is to find factors that correlate with protein folding rates. Over the past few years, many empirical parameters, such as contact order, long-range order, total contact distance, secondary structure contents, have been developed to reflect the correlation between folding rates and protein tertiary or secondary structures. However, the correlation between proteins' folding rates and their amino acid compositions has not been explored. In the present work, we examined systematically the correlation between proteins' folding rates and their amino acid compositions for two-state and multistate folders and found that different amino acids contributed differently to the folding progress. The relation between the amino acids' molecular weight and degeneracy and the folding rates was examined, and the role of hydrophobicity in the protein folding process was also inspected. As a consequence, a new indicator called composition index was derived, which takes no structure factors into account and is merely determined by the amino acid composition of a protein. Such an indicator is found to be highly correlated with the protein's folding rate (r > 0.7). From the results of this work, three points of concluding remarks are evident. (1) Two-state folders and multistate folders have different rate-determining amino acids. (2) The main determining information of a protein's folding rate is largely reflected in its amino acid composition. (3) Composition index may be the best predictor for an ab initio protein folding rate prediction directly from protein sequence from the standpoint of practical application.  相似文献   

8.
Fernández A  Colubri A 《Proteins》2002,48(2):293-310
We generate ab initio folding pathways in two single-domain proteins, hyperthermophile variant of protein G domain (1gb4) and ubiquitin (1ubi), both presumed to be two-state folders. Both proteins are endowed with the same topology but, as shown in this work, rely to a different extent on large-scale context to find their native folds. First, we demonstrate a generic feature of two-state folders: A downsizing of structural fluctuations is achieved only when the protein reaches a stationary plateau maximizing the number of highly protected hydrogen bonds. This enables us to identify the folding nucleus and show that folding does not become expeditious until a topology is generated that is able to protect intramolecular hydrogen bonds from water attack. Pathway heterogeneity is shown to be dependent on the extent to which the protein relies on large-scale context to fold, rather than on contact order: Proteins that can only stabilize native secondary structure by packing it against scaffolding hydrophobic moieties are meant to have a heterogeneous transition-state ensemble if they are to become successful folders (otherwise, successful folding would be too fortuitous an event.) We estimate mutational Phi values as ensemble averages and deconvolute individual-route contributions to the averaged two-state kinetic picture. Our results find experimental corroboration in the well-studied chymotrypsin inhibitor (CI2), while leading to verifiable predictions for the other two study cases.  相似文献   

9.
We develop a simple model for computing the rates and routes of folding of two-state proteins from the contact maps of their native structures. The model is based on the graph-theoretical concept of effective contact order (ECO). The model predicts that proteins fold by "zipping up" in a sequence of small-loop-closure events, depending on the native chain fold. Using a simple equation, with a few physical rate parameters, we obtain a good correlation with the folding rates of 24 two-state folding proteins. The model rationalizes data from Phi-value analysis that have been interpreted in terms of delocalized or polarized transition states. This model indicates how much of protein folding may take place in parallel, not along a single reaction coordinate or with a single transition state.  相似文献   

10.
Protein folding speeds are known to vary over more than eight orders of magnitude. Plaxco, Simons, and Baker (see References) first showed a correlation of folding speed with the topology of the native protein. That and subsequent studies showed, if the native structure of a protein is known, its folding speed can be predicted reasonably well through a correlation with the "localness" of the contacts in the protein. In the present work, we develop a related measure, the geometric contact number, N (alpha), which is the number of nonlocal contacts that are well-packed, by a Voronoi criterion. We find, first, that in 80 proteins, the largest such database of proteins yet studied, N (alpha) is a consistently excellent predictor of folding speeds of both two-state fast folders and more complex multistate folders. Second, we show that folding rates can also be predicted from amino acid sequences directly, without the need to know the native topology or other structural properties.  相似文献   

11.
Kameda T 《Proteins》2003,53(3):616-628
Recent experimental and theoretical studies suggest that rates and pathways of protein folding are largely decided by topology of the native structures, at least for small proteins. However, some exceptions are known; for example, protein L and protein G have the same topology, but exhibit different characteristics of the TSE. Thus, folding pathways of some proteins are critically affected by detailed information on amino acid sequences. To investigate the sequence specificity, we calculate folding pathways of 20 small proteins using the perturbed Gaussian chain model developed by Portman et al. (Phys Rev Lett 1998;81:5237-5240; J Chem Phys 2001;114:5069-5081). Characteristics of the TSE predicted by the model are in good agreement with experimental phi-value data for many proteins at coarse-grained level. Especially, estimation of folding TSE for protein G and protein L based on both topology and additional sequence information are consistent with experimental phi-value data. With only topology information, however, the model predicts the TSE of protein G incorrectly. Moreover, the model that uses topology and sequence information describes free energy profiles of two-state and three-state folders consistently with experiment, whereas the topology only model predicts free energy profiles of some proteins incorrectly. This indicates that sequence specificity also has critical roles in determining the folding pathways for some proteins.  相似文献   

12.
Understanding protein folding rate is the primary key to unlock the fundamental physics underlying protein structure and its folding mechanism.Especially,the temperature dependence of the folding rate remains unsolved in the literature.Starting from the assumption that protein folding is an event of quantum transition between molecular conformations,we calculated the folding rate for all two-state proteins in a database and studied their temperature dependencies.The non-Arrhenius temperature relation for 16 proteins,whose experimental data had previously been available,was successfully interpreted by comparing the Arrhenius plot with the first-principle calculation.A statistical formula for the prediction of two-state protein folding rate was proposed based on quantum folding theory.The statistical comparisons of the folding rates for 65 two-state proteins were carried out,and the theoretical vs.experimental correlation coefficient was 0.73.Moreover,the maximum and the minimum folding rates given by the theory were consistent with the experimental results.  相似文献   

13.
A hydrophobic folding unit cutting algorithm, originally developed for dissecting single-chain proteins, has been applied to a dataset of dissimilar two-chain protein-protein interfaces. Rather than consider each individual chain separately, the two-chain complex has been treated as a single chain. The two-chain parsing results presented in this work show hydrophobicity to be a critical attribute of two-state versus three-state protein-protein complexes. The hydrophobic folding units at the interfaces of two-state complexes suggest that the cooperative nature of the two-chain protein folding is the outcome of the hydrophobic effect, similar to its being the driving force in a single-chain folding. In analogy to the protein-folding process, the two-chain, two-state model complex may correspond to the formation of compact, hydrophobic nuclei. On the other hand, the three-state model complex involves binding of already folded monomers, similar to the association of the hydrophobic folding units within a single chain. The similarity between folding entities in protein cores and in two-state protein-protein interfaces, despite the absence of some chain connectivities in the latter, indicates that chain linkage does not necessarily affect the native conformation. This further substantiates the notion that tertiary, non-local interactions play a critical role in protein folding. These compact, hydrophobic, two-chain folding units, derived from structurally dissimilar protein-protein interfaces, provide a rich set of data useful in investigations of the role played by chain connectivity and by tertiary interactions in studies of binding and of folding. Since they are composed of non-contiguous pieces of protein backbones, they may also aid in defining folding nuclei.  相似文献   

14.
Folding rates of small single-domain proteins that fold through simple two-state kinetics can be estimated from details of the three-dimensional protein structure. Previously, predictions of secondary structure had been exploited to predict folding rates from sequence. Here, we estimate two-state folding rates from predictions of internal residue-residue contacts in proteins of unknown structure. Our estimate is based on the correlation between the folding rate and the number of predicted long-range contacts normalized by the square of the protein length. It is well known that long-range order derived from known structures correlates with folding rates. The surprise was that estimates based on very noisy contact predictions were almost as accurate as the estimates based on known contacts. On average, our estimates were similar to those previously published from secondary structure predictions. The combination of these methods that exploit different sources of information improved performance. It appeared that the combined method reliably distinguished fast from slow two-state folders.  相似文献   

15.
Micheletti C 《Proteins》2003,51(1):74-84
A variety of experimental and theoretical studies have established that the folding process of monomeric proteins is strongly influenced by the topology of the native state. In particular, folding times have been shown to correlate well with the contact order, a measure of contact locality. Our investigation focuses on identifying additional topologic properties that correlate with experimentally measurable quantities, such as folding rates and transition-state placement, for both two- and three-state folders. The validation against data from 40 experiments shows that a particular topological property that measures the interdependence of contacts, termed cliquishness or clustering coefficient, can account with statistically significant accuracy both for the transition state placement and especially for folding rates. The observed correlations can be further improved by optimally combining the distinct topological information captured by cliquishness and contact order.  相似文献   

16.
Kuznetsov IB  Rackovsky S 《Proteins》2004,54(2):333-341
Small single-domain proteins that fold by simple two-state kinetics have been shown to exhibit a wide variation in their folding rates. It has been proposed that folding mechanisms in these proteins are largely determined by the native-state topology, and a significant correlation between folding rate and measures of the average topological complexity, such as relative contact order (RCO), has been reported. We perform a statistical analysis of folding rate and RCO in all three major structural classes (alpha, beta, and alpha/beta) of small two-state proteins and of RCO in groups of analogous and homologous small single-domain proteins with the same topology. We also study correlation between folding rate and the average physicochemical properties of amino acid sequences in two-state proteins. Our results indicate that 1) helical proteins have statistically distinguishable, class-specific folding rates; 2) RCO accounts for essentially all the variation of folding rate in helical proteins, but for only a part of the variation in beta-sheet-containing proteins; and 3) only a small fraction of the protein topologies studied show a topology-specific RCO. We also report a highly significant correlation between the folding rate and average intrinsic structural propensities of protein sequences. These results suggest that intrinsic structural propensities may be an important determinant of the rate of folding in small two-state proteins.  相似文献   

17.
Huang JT  Cheng JP 《Proteins》2008,72(1):44-49
Prediction of protein-folding rates follows different rules in two-state and multi-state kinetics. The prerequisite for the prediction is to recognize the folding kinetic pathway of proteins. Here, we use the logistic regression and support vector machine to discriminate between two-state and multi-state folding proteins. We find that chain length is sufficient to accurately recognize multi-state proteins. There is a transition boundary between two kinetic models. Protein folds with multi-state kinetics, if its length is larger than 112 residues. The logistic prediction from amino acid composition shows that the kinetic pathway of folding is closely related to amino acid volume. Small amino acids make two-state folding easier, and vice versa. However, cysteine, alanine, arginine, lysine, histidine, and methionine do not conform to this rule.  相似文献   

18.
19.
Akmal A  Muñoz V 《Proteins》2004,57(1):142-152
We introduce a simple procedure to analyze the temperature dependence of the folding and unfolding rates of two-state proteins. We start from the simple transition-state-like rate expression: k = D(eff)exp(-DeltaG(TS)/RT), in which upper and lower bounds for the intra-chain effective diffusion coefficient (D(eff)) are obtained empirically using the timescales of elementary processes in protein folding. From the changes in DeltaG(TS) as a function of temperature, we calculate enthalpies and heat capacities of activation, together with the more elusive entropies of activation. We then estimate the conformational entropy of the transition state by extrapolation to the temperature at which the solvation entropy vanishes by cancellation between polar and apolar terms. This approach is based on the convergence temperatures for the entropy of solvating apolar (approximately 385 K) and polar groups (approximately 335 K), the assumption that the structural properties of the transition state are somewhere in between the unfolded and folded states, and the established relationship between observed heat capacity and solvent accessibility.1 To circumvent the lack of structural information about transition states, we use the empirically determined heat capacities of activation as constraints to identify the extreme values of the transition state conformational entropy that are consistent with experiment. The application of this simple approach to six two-state folding proteins for which there is temperature-dependent data available in the literature provides important clues about protein folding. For these six proteins, we obtain an average equilibrium cost in conformational entropy of -4.3 cal x mol(-1)K(-1)per residue, which is in close agreement to previous empirical and computational estimates of the same quantity. Furthermore, we find that all these proteins have a conformationally diverse transition state, with more than half of the conformational entropy of the unfolded state. In agreement with predictions from theory and computer simulations, the transition state signals the change from a regime dominated by loss in conformational entropy to one driven by the gain in stabilization free energy (i.e., including protein interactions and solvation effects). Moreover, the height of the barrier is determined by how much stabilization free energy is realized at that point, which is related to the relative contribution of local versus non-local interactions. A remarkable observation is that the fraction of conformational entropy per residue that is present in the transition state is very similar for the six proteins in this study. Based on this commonality, we propose that the observed change in thermodynamic regime is connected to a change in the pattern of structure formation: from one driven by formation of pairwise interactions to one dominated by coupling of the networks of interactions involved in forming the protein core. In this framework, the barrier to two-state folding is crossed when the folding protein reaches a "critical native density" that allows expulsion of remaining interstitial water and consolidation of the core. The principle of critical native density should be general for all two-state proteins, but can accommodate different folding mechanisms depending on the particularities of the structure and sequence.  相似文献   

20.
Many single-domain proteins exhibit two-state folding kinetics, with folding rates that span more than six orders of magnitude. A quantity of much recent interest for such proteins is their contact order, the average separation in sequence between contacting residue pairs. Numerous studies have reached the surprising conclusion that contact order is well-correlated with the logarithm of the folding rate for these small, well-characterized molecules. Here, we investigate the physico-chemical basis for this finding by asking whether contact order is actually a composite number that measures the fraction of local secondary structure in the protein; viz. turns, helices, and hairpins. To pursue this question, we calculated the secondary structure content for 24 two-state proteins and obtained coefficients that predict their folding rates. The predicted rates correlate strongly with experimentally determined rates, comparable to the correlation with contact order. Further, these predicted folding rates are correlated strongly with contact order. Our results suggest that the folding rate of two-state proteins is a function of their local secondary structure content, consistent with the hierarchic model of protein folding. Accordingly, it should be possible to utilize secondary structure prediction methods to predict folding rates from sequence alone.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号