首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
We investigate the possibility that atomic burials, as measured by their distances from the structural geometrical center, contain sufficient information to determine the tertiary structure of globular proteins. We report Monte Carlo simulated annealing results of all-atom hard-sphere models in continuous space for four small proteins: the all-beta WW-domain 1E0L, the alpha/beta protein-G 1IGD, the all-alpha engrailed homeo-domain 1ENH, and the alpha + beta engineered monomeric form of the Cro protein 1ORC. We used as energy function the sum over all atoms, labeled by i, of |R(i) - R(i) (*)|, where R(i) is the atomic distance from the center of coordinates, or central distance, and R(i) (*) is the "ideal" central distance obtained from the native structure. Hydrogen bonds were taken into consideration by the assignment of two ideal distances for backbone atoms forming hydrogen bonds in the native structure depending on the formation of a geometrically defined bond, independently of bond partner. Lowest energy final conformations turned out to be very similar to the native structure for the four proteins under investigation and a strong correlation was observed between energy and distance root mean square deviation (DRMS) from the native in the case of all-beta 1E0L and alpha/beta 1IGD. For all alpha 1ENH and alpha + beta 1ORC the overall correlation between energy and DRMS among final conformations was not as high because some trajectories resulted in high DRMS but low energy final conformations in which alpha-helices adopted a non-native mutual orientation. Comparison between central distances and actual accessible surface areas corroborated the implicit assumption of correlation between these two quantities. The Z-score obtained with this native-centric potential in the discrimination of native 1ORC from a set of random compact structures confirmed that it contains a much smaller amount of native information when compared to a traditional contact Go potential but indicated that simple sequence-dependent burial potentials still need some improvement in order to attain a similar discriminability. Taken together, our results suggest that central distances, in conjunction to physically motivated hydrogen bond constraints, contain sufficient information to determine the native conformation of these small proteins and that a solution to the folding problem for globular proteins could arise from sufficiently accurate burial predictions from sequence followed by minimization of a burial-dependent energy function.  相似文献   

2.
3.
The physicochemical mechanism of protein folding has been elucidated by the island model, describing a growth type of folding. The folding pathway is closely related with nucleation on the polypeptide chain and thus the formation of small local structures or secondary structures at the earliest stage of folding is essential to all following steps. The island model is applicable to any protein, but a high precision of secondary structure prediction is indispensable to folding simulation. The secondary structures formed at the earliest stage of folding are supposed to be of standard form, but they are usually deformed during the folding process, especially at the last stage, although the degree of deformation is different for each protein. Ferredoxin is an example of a protein having this property. According to X-ray investigation (1FDX), ferredoxin is not supposed to have secondary structures. However, if we assumed that in ferredoxin all the residues are in a coil state, we could not attain the correct structure similar to the native one. Further, we found that some parts of the chain are not flexible, suggesting the presence of secondary structures, in agreement with the recent PDB data (1DUR). Assuming standard secondary structures (-helices and -strands) at the nonflexible parts at the early stage of folding, and deforming these at the final stage, a structure similar to the native one was obtained. Another peculiarity of ferredoxin is the absence of disulfide bonds, in spite of its having eight cysteines. The reason cysteines do not form disulfide bonds became clear by applying the lampshade criterion, but more importantly, the two groups of cysteines are ready to make iron complexes, respectively, at a rather later stage of folding. The reason for poor prediction accuracy of secondary structure with conventional methods is discussed.  相似文献   

4.
A method is described for the prediction of probable folding pathways of globular proteins, based on the analysis of distance maps. It is applicable to proteins of unknown spatial structure but known amino acid sequence as well as to proteins of known structure. It is based on an objective procedure for the determination of the boundary of compact regions that contain high densities of interresidue contacts on the distance map of a globular protein. The procedure can be used both with contact maps derived from a known three-dimensional protein structure and with predicted contact maps computed by means of a statistical procedure from the amino acid sequence alone. The computed contact map can also be used to predict the location of compact short-range structures, viz. -helices and -turns, thereby complementing other statistical predictive procedures. The method provides an objective basis for the derivation of a theoretically predicted pathway of protein folding, proposed by us earlier [Tanaka and Scheraga (1977) Macromolecules10, 291–304; Némethy and Scheraga (1979) Proc. Natl. Acad. Sci., U.S.A.76, 6050–6054].  相似文献   

5.
We perform a statistical analysis of atomic distributions as a function of the distance R from the molecular geometrical center in a nonredundant set of compact globular proteins. The number of atoms increases quadratically for small R, indicating a constant average density inside the core, reaches a maximum at a size-dependent distance R(max), and falls rapidly for larger R. The empirical curves turn out to be consistent with the volume increase of spherical concentric solid shells and a Fermi-Dirac distribution in which the distance R plays the role of an effective atomic energy epsilon(R) = R. The effective chemical potential mu governing the distribution increases with the number of residues, reflecting the size of the protein globule, while the temperature parameter beta decreases. Interestingly, betamu is not as strongly dependent on protein size and appears to be tuned to maintain approximately half of the atoms in the high density interior and the other half in the exterior region of rapidly decreasing density. A normalized size-independent distribution was obtained for the atomic probability as a function of the reduced distance, r = R/R(g), where R(g) is the radius of gyration. The global normalized Fermi distribution, F(r), can be reasonably decomposed in Fermi-like subdistributions for different atomic types tau, F(tau)(r), with Sigma(tau)F(tau)(r) = F(r), which depend on two additional parameters mu(tau) and h(tau). The chemical potential mu(tau) affects a scaling prefactor and depends on the overall frequency of the corresponding atomic type, while the maximum position of the subdistribution is determined by h(tau), which appears in a type-dependent atomic effective energy, epsilon(tau)(r) = h(tau)r, and is strongly correlated to available hydrophobicity scales. Better adjustments are obtained when the effective energy is not assumed to be necessarily linear, or epsilon(tau)*(r) = h(tau)*r(alpha,), in which case a correlation with hydrophobicity scales is found for the product alpha(tau)h(tau)*. These results indicate that compact globular proteins are consistent with a thermodynamic system governed by hydrophobic-like energy functions, with reduced distances from the geometrical center, reflecting atomic burials, and provide a conceptual framework for the eventual prediction from sequence of a few parameters from which whole atomic probability distributions and potentials of mean force can be reconstructed.  相似文献   

6.
The results of minimal model calculations indicate that the stability and the kinetic accessibility of the native state of small globular proteins are controlled by few "hot" sites. By means of molecular dynamics simulations around the native conformation, which describe the protein and the surrounding solvent at the all-atom level, an accurate and compact energetic map of the native state of the protein is generated. This map is further simplified by means of an eigenvalue decomposition. The components of the eigenvector associated with the lowest eigenvalue indicate which hot sites are likely to be responsible for the stability and for the rapid folding of the protein. The comparison of the results of the model with the findings of mutagenesis experiments performed for four small proteins show that the eigenvalue decomposition method is able to identify between 60% and 80% of these (hot) sites.  相似文献   

7.
Understanding, and ultimately predicting, how a 1-D protein chain reaches its native 3-D fold has been one of the most challenging problems during the last few decades. Data increasingly indicate that protein folding is a hierarchical process. Hence, the question arises as to whether we can use the hierarchical concept to reduce the practically intractable computational times. For such a scheme to work, the first step is to cut the protein sequence into fragments that form local minima on the polypeptide chain. The conformations of such fragments in solution are likely to be similar to those when the fragments are embedded in the native fold, although alternate conformations may be favored during the mutual stabilization in the combinatorial assembly process. Two elements are needed for such cutting: (1) a library of (clustered) fragments derived from known protein structures and (2) an assignment algorithm that selects optimal combinations to "cover" the protein sequence. The next two steps in hierarchical folding schemes, not addressed here, are the combinatorial assembly of the fragments and finally, optimization of the obtained conformations. Here, we address the first step in a hierarchical protein-folding scheme. The input is a target protein sequence and a library of fragments created by clustering building blocks that were generated by cutting all protein structures. The output is a set of cutout fragments. We briefly outline a graph theoretic algorithm that automatically assigns building blocks to the target sequence, and we describe a sample of the results we have obtained.  相似文献   

8.
Helices in membrane spanning regions are more tightly packed than the helices in soluble proteins. Thus, we introduce a method that uses a simple scale of burial propensity and a new algorithm to predict transmembrane helical (TMH) segments and a positive-inside rule to predict amino-terminal orientation. The method (the topology predictor of transmembrane helical proteins using mean burial propensity [THUMBUP]) correctly predicted the topology of 55 of 73 proteins (or 75%) with known three-dimensional structures (the 3D helix database). This level of accuracy can be reached by MEMSAT 1.8 (a 200-parameter model-recognition method) and a new HMM-based method (a 111-parameter hidden Markov model, UMDHMM(TMHP)) if they were retrained with the 73-protein database. Thus, a method based on a physiochemical property can provide topology prediction as accurate as those methods based on more complicated statistical models and learning algorithms for the proteins with accurately known structures. Commonly used HMM-based methods and MEMSAT 1.8 were trained with a combination of the partial 3D helix database and a 1D helix database of TMH proteins in which topology information were obtained by gene fusion and other experimental techniques. These methods provide a significantly poorer prediction for the topology of TMH proteins in the 3D helix database. This suggests that the 1D helix database, because of its inaccuracy, should be avoided as either a training or testing database. A Web server of THUMBUP and UMDHMM(TMHP) is established for academic users at http://www.smbs.buffalo.edu/phys_bio/service.htm. The 3D helix database is also available from the same Web site.  相似文献   

9.
The essential features of the in vitro refolding of myoglobin are expressed in a solvable physical model. Alpha helices are taken as the fundamental collective coordinates of the system, while the refolding is assumed to be mainly driven by solvent-induced hydrophobic forces. A quantitative model of these forces is developed and compared with experimental and theoretical results. The model is then tested by being employed in a simulation scheme designed to mimic solvent effects. Realistic dynamic trajectories of myoglobin are shown as it folds from an extended conformation to a close approximation of the native state. Various suggestive features of the process are discussed. The tenets of the model are further tested by folding the single-chain plant protein leghemoglobin. © 1994 Wiley-Liss, Inc.  相似文献   

10.
Globular proteins are assemblies of alpha-helices and beta-strands, interconnected by reverse turns and longer loops. Most short turns can be classified readily into a limited repertoire of discrete backbone conformations, but the physical-chemical determinants of these distinct conformational basins remain an open question. We investigated this question by exhaustive analysis of all backbone conformations accessible to short chain segments bracketed by either an alpha-helix or a beta-strand (i.e., alpha-segment-alpha, beta-segment-beta, alpha-segment-beta, and beta-segment-alpha) in a nine-state model. We find that each of these four secondary structure environments imposes its own unique steric and hydrogen-bonding constraints on the intervening segment, resulting in a limited repertoire of conformations. In greater detail, an exhaustive set of conformations was generated for short backbone segments having reverse-turn chain topology and bracketed between elements of secondary structure. This set was filtered, and only clash-free, hydrogen-bond-satisfied conformers having reverse-turn topology were retained. The filtered set includes authentic turn conformations, observed in proteins of known structure, but little else. In particular, over 99% of the alternative conformations failed to satisfy at least one criterion and were excluded from the filtered set. Furthermore, almost all of the remaining alternative conformations have close tolerances that would be too tight to accommodate side chains longer than a single beta-carbon. These results provide a molecular explanation for the observation that reverse turns between elements of regular secondary can be classified into a small number of discrete conformations.  相似文献   

11.
A critical step in the folding pathway of globular proteins is the formation of a tightly packed hydrophobic core. Several mutational studies have addressed the question of whether tight packing interactions are present during the rate-limiting step of folding. In some of these investigations, substituted side chains have been assumed to form native-like interactions in the transition state when the folding rates of mutant proteins correlate with their native-state stabilities. Alternatively, it has been argued that side chains participate in nonspecific hydrophobic collapse when the folding rates of mutant proteins correlate with side-chain hydrophobicity. In a reanalysis of published data, we have found that folding rates often correlate similarly well, or poorly, with both native-state stability and side-chain hydrophobicity, and it is therefore not possible to select an appropriate transition state model based on these one-parameter correlations. We show that this ambiguity can be resolved using a two-parameter model in which side chain burial and the formation of all other native-like interactions can occur asynchronously. Notably, the model agrees well with experimental data, even for positions where the one-parameter correlations are poor. We find that many side chains experience a previously unrecognized type of transition state environment in which specific, native-like interactions are formed, but hydrophobic burial dominates. Implications of these results to the design and analysis of protein folding studies are discussed.  相似文献   

12.
White SH 《FEBS letters》2003,555(1):116-121
Recent three-dimensional structures of helical membrane proteins present new challenges for the prediction of structure from amino acid sequence. Membrane proteins reside stably in a thermodynamic free energy minimum after release into the membrane's bilayer fabric from the translocon complex. This means that structure prediction is primarily a problem of physical chemistry. But the folding processes within the translocon must also be considered. A distilled overview of the physical principles of membrane protein stability is presented, and extended to encompass translocon-assisted folding.  相似文献   

13.
To gain insight into the free energy changes accompanying protein hydrophobic core formation, we have used computer simulations to study the formation of small clusters of nonpolar solutes in water. A barrier to association is observed at the largest solute separation that does not allow substantial solvent penetration. The barrier reflects an effective increase in the size of the cavity occupied by the expanded but water-excluding cluster relative to both the close-packed cluster and the fully solvated separated solutes; a similar effect may contribute to the barrier to protein folding/unfolding. Importantly for the simulation of protein folding without explicit solvent, we find that the interactions between nonpolar solutes of varying size and number can be approximated by a linear function of the molecular surface, but not the solvent-accessible surface of the solutes. Comparison of the free energy of cluster formation to that of dimer formation suggests that the assumption of pair additivity implicit in current protein database derived potentials may be in error.  相似文献   

14.
Physical principles determining the protein structure and protein folding are reviewed: (i) the molecular theory of protein secondary structure and the method of its prediction based on this theory; (ii) the existence of a limited set of thermodynamically favourable folding patterns of α- and β-regions in a compact globule which does not depend on the details of the amino acid sequence; (iii) the moderns approaches to the prediction of the folding patterns of α- and β-regions in concrete proteins; (iv) experimental approaches to the mechanism of protein folding. The review reflects theoretical and experimental works of the author and his collaborators as well as those of other groups.  相似文献   

15.
Haspel N  Tsai CJ  Wolfson H  Nussinov R 《Proteins》2003,51(2):203-215
We have previously presented a building block folding model. The model postulates that protein folding is a hierarchical top-down process. The basic unit from which a fold is constructed, referred to as a hydrophobic folding unit, is the outcome of combinatorial assembly of a set of "building blocks." Results obtained by the computational cutting procedure yield fragments that are in agreement with those obtained experimentally by limited proteolysis. Here we show that as expected, proteins from the same family give very similar building blocks. However, different proteins can also give building blocks that are similar in structure. In such cases the building blocks differ in sequence, stability, contacts with other building blocks, and in their 3D locations in the protein structure. This result, which we have repeatedly observed in many cases, leads us to conclude that while a building block is influenced by its environment, nevertheless, it can be viewed as a stand-alone unit. For small-sized building blocks existing in multiple conformations, interactions with sister building blocks in the protein will increase the population time of the native conformer. With this conclusion in hand, it is possible to develop an algorithm that predicts the building block assignment of a protein sequence whose structure is unknown. Toward this goal, we have created sequentially nonredundant databases of building block sequences. A protein sequence can be aligned against these, in order to be matched to a set of potential building blocks.  相似文献   

16.
Cooperative unfolding penalties are calculated by statistically evaluating an ensemble of denatured states derived from native structures. The ensemble of denatured states is determined by dividing the native protein into short contiguous segments and defining all possible combinations of native, i.e., interacting, and non-native, i.e., non-interacting, segments. We use a novel knowledge-based scoring function, derived from a set of non-homologous proteins in the Protein Data Bank, to describe the interactions among residues. This procedure is used for the structural identification of cooperative folding cores for four globular proteins: bovine pancreatic trypsin inhibitor, horse heart cytochrome c, French bean plastocyanin, and staphylococcal nuclease. The theoretical folding units are shown to correspond to regions that exhibit enhanced stability against denaturation as determined from experimental hydrogen exchange protection factors. Using a sequence similarity score for related sequences, we show that, in addition to residues necessary for enzymatic function, those amino acids comprising structurally important folding cores are also preferentially conserved during evolution. This implies that the identified folding cores may be part of an array of fundamental structural folding units.  相似文献   

17.
It has been shown that probable portions which form contacts in a protein can be predicted by means of an average distance map (ADM) as well as regular structures (-helices and -turns) defined as short-range compact regions (Kikuchiet al., 1988a,c). In this paper, we analyze the occurrence of those portions and short-range compact regions on ADMs for various proteins regarding their folding types. We have found out that each folding type of proteins shows characteristic distribution of such parts on ADMS. We also discuss the possibility of the prediction of folding types of proteins by ADMs.  相似文献   

18.
Franc Avbelj  John Moult 《Proteins》1995,23(2):129-141
Experimental evidence and theoretical models both suggest that protein folding begins by specific short regions of the polypeptide chain intermittently assuming conformations close to their final ones. The independent folding properties and small size of these folding initiation sites make them suitable subjects for computational methods aimed at deriving structure from sequence. We have used a torsion space Monte Carlo procedure together with an all-atom free energy function to investigate the folding of a set of such sites. The free energy function is derived by a potential of mean force analysis of experimental protein structures. The most important contributions to the total free energy are the local main chain electrostatics, main chain hydrogen bonds, and the burial of nonpolar area. Six proposed independent folding units and four control peptides 11–14 residues long have been investigated. Thirty Monte Carlo simulations were performed on each peptide, starting from different random conformations. Five of the six folding units adopted conformations close to the experimental ones in some of the runs. None of the controls did so, as expected. The generated conformations which are close to the experimental ones have among the lowest free energies encountered, although some less native like low free energy conformations were also found. The effectiveness of the method on these peptides, which have a wide variety of experimental conformations, is encouraging in two ways: First, it provides independent evidence that these regions of the sequences are able to adopt native like conformations early in folding, and therefore are most probably key components of the folding pathways. Second, it demonstrates that available simulation methods and free energy functions are able to produce reasonably accurate structures. Extensions of the methods to the folding of larger portions of proteins are suggested. © 1995 Wiley-Liss, Inc.  相似文献   

19.
We describe an algorithm to compute native structures of proteins from their primary sequences. The novel aspects of this method are: 1) The hydrophobic potential was set to be proportional to the nonpolar solvent accessible surface. To make computation feasible, we developed a new algorithm to compute the solvent accessible surface areas rapidly. 2) The supersecondary structures of each protein were predicted and used as restraints during the conformation searching processes. This algorithm was applied to five proteins. The overall fold of these proteins can be computed from their sequences, with deviations from crystal structures of 1.48–4.48 Å for Cα atoms. Proteins 31:247–257, 1998. © 1998 Wiley-Liss, Inc.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号