共查询到20条相似文献,搜索用时 0 毫秒
1.
We compiled and analyzed a data set of paired protein structures containing proteins for which multiple high-quality uncomplexed atomic structures were available in the Protein Data Bank. Side-chain flexibility was quantified, yielding a set of residue- and environment-specific confidence levels describing the range of motion around chi1 and chi2 angles. As expected, buried residues were inflexible, adopting similar conformations in different crystal structure analyses. Ile, Thr, Asn, Asp, and the large aromatics also showed limited flexibility when exposed on the protein surface, whereas exposed Ser, Lys, Arg, Met, Gln, and Glu residues were very flexible. This information is different from and complementary to the information available from rotamer surveys. The confidence levels are useful for assessing the significance of observed side-chain motion and estimating the extent of side-chain motion in protein structure prediction. We compare the performance of a simple 40 degrees threshold with these quantitative confidence levels in a critical evaluation of side-chain prediction with the program SCWRL. 相似文献
2.
Computational prediction of side‐chain conformation is an important component of protein structure prediction. Accurate side‐chain prediction is crucial for practical applications of protein structure models that need atomic‐detailed resolution such as protein and ligand design. We evaluated the accuracy of eight side‐chain prediction methods in reproducing the side‐chain conformations of experimentally solved structures deposited to the Protein Data Bank. Prediction accuracy was evaluated for a total of four different structural environments (buried, surface, interface, and membrane‐spanning) in three different protein types (monomeric, multimeric, and membrane). Overall, the highest accuracy was observed for buried residues in monomeric and multimeric proteins. Notably, side‐chains at protein interfaces and membrane‐spanning regions were better predicted than surface residues even though the methods did not all use multimeric and membrane proteins for training. Thus, we conclude that the current methods are as practically useful for modeling protein docking interfaces and membrane‐spanning regions as for modeling monomers. Proteins 2014; 82:1971–1984. © 2014 Wiley Periodicals, Inc. 相似文献
3.
A. J. Doig M. J. Sternberg 《Protein science : a publication of the Protein Society》1995,4(11):2247-2251
An important, but often neglected, contribution to the thermodynamics of protein folding is the loss of entropy that results from restricting the number of accessible side-chain conformers in the native structure. Conformational entropy changes can be found by comparing the number of accessible rotamers in the unfolded and folded states, or by estimating fusion entropies. Comparison of several sets of results using different techniques shows that the mean conformational free energy change (T delta S) is 1 kcal.mol-1 per side chain or 0.5 kcal.mol-1 per bond. Changes in vibrational entropy appear to be negligible compared to the entropy change resulting from the loss of accessible rotamers. Side-chain entropies can help rationalize alpha-helix propensities, predict protein/inhibitor complex structures, and account for the distribution of side chains on the protein surface or interior. 相似文献
4.
Recent advances in modeling protein structures at the atomic level have made it possible to tackle \"de novo\" computational protein design. Most procedures are based on combinatorial optimization using a scoring function that estimates the folding free energy of a protein sequence on a given main-chain structure. However, the computation of the conformational entropy in the folded state is generally an intractable problem, and its contribution to the free energy is not properly evaluated. In this article, we propose a new automated protein design methodology that incorporates such conformational entropy based on statistical mechanics principles. We define the free energy of a protein sequence by the corresponding partition function over rotamer states. The free energy is written in variational form in a pairwise approximation and minimized using the Belief Propagation algorithm. In this way, a free energy is associated to each amino acid sequence: we use this insight to rescore the results obtained with a standard minimization method, with the energy as the cost function. Then, we set up a design method that directly uses the free energy as a cost function in combination with a stochastic search in the sequence space. We validate the methods on the design of three superficial sites of a small SH3 domain, and then apply them to the complete redesign of 27 proteins. Our results indicate that accounting for entropic contribution in the score function affects the outcome in a highly nontrivial way, and might improve current computational design techniques based on protein stability. 相似文献
5.
Loss of side-chain conformational entropy is an important force opposing protein folding and the relative preferences of the amino acids for being buried or solvent exposed may be partially determined by which amino acids lose more side-chain entropy when placed in the core of a protein. To investigate these preferences, we have incorporated explicit modeling of side-chain entropy into the protein design algorithm, RosettaDesign. In the standard version of the program, the energy of a particular sequence for a fixed backbone depends only on the lowest energy side-chain conformations that can be identified for that sequence. In the new model, the free energy of a single amino acid sequence is calculated by evaluating the average energy and entropy of an ensemble of structures generated by Monte Carlo sampling of amino acid side-chain conformations. To evaluate the impact of including explicit side-chain entropy, sequences were designed for 110 native protein backbones with and without the entropy model. In general, the differences between the two sets of sequences are modest, with the largest changes being observed for the longer amino acids: methionine and arginine. Overall, the identity between the designed sequences and the native sequences does not increase with the addition of entropy, unlike what is observed when other key terms are added to the model (hydrogen bonding, Lennard-Jones energies, and solvation energies). These results suggest that side-chain conformational entropy has a relatively small role in determining the preferred amino acid at each residue position in a protein. 相似文献
6.
Keehyoung Joo Jinwoo Lee Joo‐Hyun Seo Kyoungrim Lee Byung‐Gee Kim Jooyoung Lee 《Proteins》2009,75(4):1010-1023
We have investigated the effect of rigorous optimization of the MODELLER energy function for possible improvement in protein all‐atom chain‐building. For this we applied the global optimization method called conformational space annealing (CSA) to the standard MODELLER procedure to achieve better energy optimization than what MODELLER provides. The method, which we call MODELLERCSA , is tested on two benchmark sets. The first is the 298 proteins taken from the HOMSTRAD multiple alignment set. By simply optimizing the MODELLER energy function, we observe significant improvement in side‐chain modeling, where MODELLERCSA provides about 10.7% (14.5%) improvement for χ1 (χ1 + χ2) accuracy compared to the standard MODELLER modeling. The improvement of backbone accuracy by MODELLERCSA is shown to be less prominent, and a similar improvement can be achieved by simply generating many standard MODELLER models and selecting lowest energy models. However, the level of side‐chain modeling accuracy by MODELLERCSA could not be matched either by extensive MODELLER strategies, side‐chain remodeling by SCWRL3, or copying unmutated rotamers. The identical procedure was successfully applied to 100 CASP7 template base modeling domains during the prediction season in a blind fashion, and the results are included here for comparison. From this study, we observe a good correlation between the MODELLER energy and the side‐chain accuracy. Our findings indicate that, when a good alignment between a target protein and its templates is provided, thorough optimization of the MODELLER energy function leads to accurate all‐atom models. Proteins 2009. © 2008 Wiley‐Liss, Inc. 相似文献
7.
Despite years of effort, the problem of predicting the conformations of protein side chains remains a subject of inquiry. This problem has three major issues, namely defining the conformations that a side chain may adopt within a protein, developing a sampling procedure for generating possible side‐chain packings, and defining a scoring function that can rank these possible packings. To solve the former of these issues, most procedures rely on a rotamer library derived from databases of known protein structures. We introduce an alternative method that is free of statistics. We begin with a rotamer library that is based only on stereochemical considerations; this rotamer library is then optimized independently for each protein under study. We show that this optimization step restores the diversity of conformations observed in native proteins. We combine this protein‐dependent rotamer library (PDRL) method with the self‐consistent mean field (SCMF) sampling approach and a physics‐based scoring function into a new side‐chain prediction method, SCMF–PDRL. Using two large test sets of 831 and 378 proteins, respectively, we show that this new method compares favorably with competing methods such as SCAP, OPUS‐Rota, and SCWRL4 for energy‐minimized structures. Proteins 2014; 82:2000–2017. © 2014 Wiley Periodicals, Inc. 相似文献
8.
The side‐chain dihedral angle distributions of all amino acids have been measured from myriad high‐resolution protein crystal structures. However, we do not yet know the dominant interactions that determine these distributions. Here, we explore to what extent the defining features of the side‐chain dihedral angle distributions of different amino acids can be captured by a simple physical model. We find that a hard‐sphere model for a dipeptide mimetic that includes only steric interactions plus stereochemical constraints is able to recapitulate the key features of the back‐bone dependent observed amino acid side‐chain dihedral angle distributions of Ser, Cys, Thr, Val, Ile, Leu, Phe, Tyr, and Trp. We find that for certain amino acids, performing the calculations with the amino acid of interest in the central position of a short α‐helical segment improves the match between the predicted and observed distributions. We also identify the atomic interactions that give rise to the differences between the predicted distributions for the hard‐sphere model of the dipeptide and that of the α‐helical segment. Finally, we point out a case where the hard‐sphere plus stereochemical constraint model is insufficient to recapitulate the observed side‐chain dihedral angle distribution, namely the distribution P(χ3) for Met. Proteins 2014; 82:2574–2584. © 2014 Wiley Periodicals, Inc. 相似文献
9.
10.
We describe an extensive test of Geocore, an ab initio peptide folding algorithm. We studied 18 short molecules for which there are structures in the Protein Data Bank; chains are up to 31 monomers long. Except for the very shortest peptides, an extremely simple energy function is sufficient to discriminate the true native state from more than 10(8) lowest energy conformations that are searched explicitly for each peptide. A high incidence of native-like structures is found within the best few hundred conformations generated by Geocore for each amino acid sequence. Predictions improve when the number of discrete phi/psi choices is increased. 相似文献
11.
12.
《Proteins》2018,86(5):581-591
We compare side chain prediction and packing of core and non‐core regions of soluble proteins, protein‐protein interfaces, and transmembrane proteins. We first identified or created comparable databases of high‐resolution crystal structures of these 3 protein classes. We show that the solvent‐inaccessible cores of the 3 classes of proteins are equally densely packed. As a result, the side chains of core residues at protein‐protein interfaces and in the membrane‐exposed regions of transmembrane proteins can be predicted by the hard‐sphere plus stereochemical constraint model with the same high prediction accuracies (>90%) as core residues in soluble proteins. We also find that for all 3 classes of proteins, as one moves away from the solvent‐inaccessible core, the packing fraction decreases as the solvent accessibility increases. However, the side chain predictability remains high (80% within ) up to a relative solvent accessibility, , for all 3 protein classes. Our results show that % of the interface regions in protein complexes are “core”, that is, densely packed with side chain conformations that can be accurately predicted using the hard‐sphere model. We propose packing fraction as a metric that can be used to distinguish real protein‐protein interactions from designed, non‐binding, decoys. Our results also show that cores of membrane proteins are the same as cores of soluble proteins. Thus, the computational methods we are developing for the analysis of the effect of hydrophobic core mutations in soluble proteins will be equally applicable to analyses of mutations in membrane proteins. 相似文献
13.
Based on available experimental data and using a theoretical model of protein folding, we demonstrate that there is an optimal ratio between the average conformational entropy and the average contact energy per residue for fast protein folding. A statistical analysis of the conformational entropy and the number of contacts per residue for 5829 protein domains from four main classes (α, β, α/β, α+β) shows that each class has its own characteristic average number of contacts per residue and average conformational entropy per residue. These class-specific characteristics determine the protein folding rates: α-proteins are the fastest to fold, β-proteins are the second fastest, α+β-proteins are the third, and α/β-proteins are the last to fold. 相似文献
14.
Using a recently developed protein folding algorithm, a prediction of the tertiary structure of the KIX domain of the CREB binding protein is described. The method incorporates predicted secondary and tertiary restraints derived from multiple sequence alignments in a reduced protein model whose conformational space is explored by Monte Carlo dynamics. Secondary structure restraints are provided by the PHD secondary structure prediction algorithm that was modified for the presence of predicted U-turns, i.e., regions where the chain reverses global direction. Tertiary restraints are obtained via a two-step process: First, seed side-chain contacts are identified from a correlated mutation analysis, and then, a threading-based algorithm expands the number of these seed contacts. Blind predictions indicate that the KIX domain is a putative three-helix bundle, although the chirality of the bundle could not be uniquely determined. The expected root-mean-square deviation for the correct chirality of the KIX domain is between 5.0 and 6.2 Å. This is to be compared with the estimate of 12.9 Å that would be expected by a random prediction, using the model of F. Cohen and M. Sternberg (J. Mol. Biol. 138:321–333, 1980). Proteins 30:287–294, 1998. © 1998 Wiley-Liss, Inc. 相似文献
15.
Hahnbeom Park Gyu Rie Lee David E. Kim Ivan Anishchenko Qian Cong David Baker 《Proteins》2019,87(12):1276-1282
Because proteins generally fold to their lowest free energy states, energy-guided refinement in principle should be able to systematically improve the quality of protein structure models generated using homologous structure or co-evolution derived information. However, because of the high dimensionality of the search space, there are far more ways to degrade the quality of a near native model than to improve it, and hence, refinement methods are very sensitive to energy function errors. In the 13th Critial Assessment of techniques for protein Structure Prediction (CASP13), we sought to carry out a thorough search for low energy states in the neighborhood of a starting model using restraints to avoid straying too far. The approach was reasonably successful in improving both regions largely incorrect in the starting models as well as core regions that started out closer to the correct structure. Models with GDT-HA over 70 were obtained for five targets and for one of those, an accuracy of 0.5 å backbone root-mean-square deviation (RMSD) was achieved. An important current challenge is to improve performance in refining oligomers and larger proteins, for which the search problem remains extremely difficult. 相似文献
16.
Side chain prediction is an integral component of computational antibody design and structure prediction. Current antibody modelling tools use backbone‐dependent rotamer libraries with conformations taken from general proteins. Here we present our antibody‐specific rotamer library, where rotamers are binned according to their immunogenetics (IMGT) position, rather than their local backbone geometry. We find that for some amino acid types at certain positions, only a restricted number of side chain conformations are ever observed. Using this information, we are able to reduce the breadth of the rotamer sampling space. Based on our rotamer library, we built a side chain predictor, position‐dependent antibody rotamer swapper (PEARS). On a blind test set of 95 antibody model structures, PEARS had the highest average χ1 and accuracy (78.7% and 64.8%) compared to three leading backbone‐dependent side chain predictors. Our use of IMGT position, rather than backbone ϕ/ψ, meant that PEARS was more robust to errors in the backbone of the model structure. PEARS also achieved the lowest number of side chain–side chain clashes. PEARS is freely available as a web application at http://opig.stats.ox.ac.uk/webapps/pears . 相似文献
17.
Akshaya K. Meher Sachiko I. Blaber Jihun Lee Ejiro Honjo Ryota Kuroki Michael Blaber 《Acta Crystallographica. Section F, Structural Biology Communications》2009,65(11):1136-1140
Large‐volume protein crystals are a prerequisite for neutron diffraction studies and their production represents a bottleneck in obtaining neutron structures. Many protein crystals that permit the collection of high‐resolution X‐ray diffraction data are inappropriate for neutron diffraction owing to a plate‐type morphology that limits the crystal volume. Human fibroblast growth factor 1 crystallizes in a plate morphology that yields atomic resolution X‐ray diffraction data but has insufficient volume for neutron diffraction. The thin physical dimension has been identified as corresponding to the b cell edge and the X‐ray structure identified a solvent‐mediated crystal contact adjacent to position Glu81 that was hypothesized to limit efficient crystal growth in this dimension. In this report, a series of mutations at this crystal contact designed to both reduce side‐chain entropy and replace the solvent‐mediated interface with direct side‐chain contacts are reported. The results suggest that improved crystal growth is achieved upon the introduction of direct crystal contacts, while little improvement is observed with side‐chain entropy‐reducing mutations alone. 相似文献
18.
We have developed an original method for global optimization of protein side-chain conformations, called the Fast and Accurate Side-Chain Topology and Energy Refinement (FASTER) method. The method operates by systematically overcoming local minima of increasing order. Comparison of the FASTER results with those of the dead-end elimination (DEE) algorithm showed that both methods produce nearly identical results, but the FASTER algorithm is 100-1000 times faster than the DEE method and scales in a stable and favorable way as a function of protein size. We also show that low-order local minima may be almost as accurate as the global minimum when evaluated against experimentally determined structures. In addition, the new algorithm provides significant information about the conformational flexibility of individual side-chains. We observed that strictly rigid side-chains are concentrated mainly in the core of the protein, whereas highly flexible side-chains are found almost exclusively among solvent-oriented residues. 相似文献
19.
Structural basis of hierarchical multiple substates of a protein. III: Side chain and main chain local conformations 总被引:2,自引:0,他引:2
An analysis is carried out of differences in the minimum energy conformations obtained in the previous paper by energy minimization starting from conformations sampled by a Monte Carlo simulation of conformational fluctuations in the native state of a globular protein, bovine pancreatic trypsin inhibitor. Main conformational differences in each pair of energy minima are found usually localized in several side chains and in a few local main chain segments. Such side chains and local main chain segments are found to take a few distinct local conformations in the minimum energy conformations. Energy minimum conformations can thus be described in terms of combinations of these multiple local conformations. 相似文献
20.
We describe an algorithm which enables us to search the conformational space of the side chains of a protein to identify the global minimum energy combination of side chain conformations as well as all other conformations within a specified energy cutoff of the global energy minimum. The program is used to explore the side chain conformational energy surface of a number of proteins, to investigate how this surface varies with the energy model used to describe the interactions within the system and the rotamer library. Enumeration of the rotamer combinations enables us to directly evaluate the partition function, and thus calculate the side chain contribution to the conformational entropy of the folded protein. An investigation of these conformations and the relationships between them shows that most of the conformations near to the global energy minimum arise from changes in side chain conformations that are essentially independent; very few result from a concerted change in conformation of two or more residues. Some of the limitations of the approach are discussed. Proteins 33:227–239, 1998. © 1998 Wiley-Liss, Inc. 相似文献