共查询到20条相似文献,搜索用时 15 毫秒
1.
The current work describes a simplified representation of protein structure with uses in the simulation of protein folding. The model assumes that a protein can be represented by a freely rotating rigid chain with a single atom approximating the effect of each side chain. Potentials describing the attraction or repulsion between different types of amino acids are determined directly from the distribution of amino acids in the database of known protein structures. The optimization technique of simulated annealing has been used to dynamically sample the conformations available to this simple model, allowing the protein to evolve from an extended, random coil into a compact globular structure. Many characteristics expected of true proteins, such as the sequence-dependent formation of secondary structure, the partitioning of hydrophobic residues, and specific disulfide pairing, are reproduced by the simulation, suggesting the model may accurately simulate the folding process. 相似文献
2.
Protein-small molecule docking algorithms provide a means to model the structure of protein-small molecule complexes in structural detail and play an important role in drug development. In recent years the necessity of simulating protein side-chain flexibility for an accurate prediction of the protein-small molecule interfaces has become apparent, and an increasing number of docking algorithms probe different approaches to include protein flexibility. Here we describe a new method for docking small molecules into protein binding sites employing a Monte Carlo minimization procedure in which the rigid body position and orientation of the small molecule and the protein side-chain conformations are optimized simultaneously. The energy function comprises van der Waals (VDW) interactions, an implicit solvation model, an explicit orientation hydrogen bonding potential, and an electrostatics model. In an evaluation of the scoring function the computed energy correlated with experimental small molecule binding energy with a correlation coefficient of 0.63 across a diverse set of 229 protein- small molecule complexes. The docking method produced lowest energy models with a root mean square deviation (RMSD) smaller than 2 A in 71 out of 100 protein-small molecule crystal structure complexes (self-docking). In cross-docking calculations in which both protein side-chain and small molecule internal degrees of freedom were varied the lowest energy predictions had RMSDs less than 2 A in 14 of 20 test cases. 相似文献
3.
Since determining the crystallographic structure of all peptide-MHC complexes is infeasible, an accurate prediction of the conformation is a critical computational problem. These models can be useful for determining binding energetics, predicting the structures of specific ternary complexes with T-cell receptors, and designing new molecules interacting with these complexes. The main difficulties are (1) adequate sampling of the large number of conformational degrees of freedom for the flexible peptide, (2) predicting subtle changes in the MHC interface geometry upon binding, and (3) building models for numerous MHC allotypes without known structures. Whereas previous studies have approached the sampling problem by dividing the conformational variables into different sets and predicting them separately, we have refined the Biased-Probability Monte Carlo docking protocol in internal coordinates to optimize a physical energy function for all peptide variables simultaneously. We also imitated the induced fit by docking into a more permissive smooth grid representation of the MHC followed by refinement and reranking using an all-atom MHC model. Our method was tested by a comparison of the results of cross-docking 14 peptides into HLA-A*0201 and 9 peptides into H-2K(b) as well as docking peptides into homology models for five different HLA allotypes with a comprehensive set of experimental structures. The surprisingly accurate prediction (0.75 A backbone RMSD) for cross-docking of a highly flexible decapeptide, dissimilar to the original bound peptide, as well as docking predictions using homology models for two allotypes with low average backbone RMSDs of less than 1.0 A illustrate the method's effectiveness. Finally, energy terms calculated using the predicted structures were combined with supervised learning on a large data set to classify peptides as either HLA-A*0201 binders or nonbinders. In contrast with sequence-based prediction methods, this model was also able to predict the binding affinity for peptides to a different MHC allotype (H-2K(b)), not used for training, with comparable prediction accuracy. 相似文献
4.
Using a newly developed Monte Carlo global optimization method called basin paving, we have performed an ab initio computation for the structure of Trp-cage based on the ECEPP/3 force field in vacuo. The lowest energy minimum has been located. Its corresponding configuration is comparable to the native structure of Trp-cage (PDB code 1L2Y) with a backbone root mean square deviation of 2.24 A. 相似文献
5.
Jorov A Zhorov BS Yang DS 《Protein science : a publication of the Protein Society》2004,13(6):1524-1537
Antifreeze proteins (AFPs) are synthesized by various organisms to enable their cells to survive subzero environment. These proteins bind to small ice crystals and inhibit their growth, which if left uncontrolled would be fatal to cells. The crystal structures of a number of AFPs have been determined; however, crystallographic analysis of AFP-ice complex is nearly impossible. Molecular modeling studies of AFPs' interaction with ice surface is therefore invaluable. Early models of AFP-ice interaction suggested H-bond as the primary driving force behind such interaction. Recent experimental evidence, however, suggested that hydrophobic interactions could be the main contributor to AFP-ice association. All computational studies published to date were carried out to verify the H-bond model, and no works attempting to verify the hydrophobic interaction model have been published. In this work, we Monte Carlo-minimized complexes of several AFPs with ice taking into account nonbonded interactions, H-bonds, and the hydration potential for proteins. Parameters of the hydration potential for ice were developed with the assumption that the free energy of the water-ice association should be close to zero at equilibrium melting temperature. Our calculations demonstrate that desolvation of hydrophobic groups in the AFPs upon their binding to the grooves at the ice surface is indeed the major stabilizing contributor to the free energy of AFP-ice binding. This study is consistent with available structural and mutation data on AFPs. In particular, it explains the paradoxical finding that substitution of Thr residues with Val does not affect the potency of winter flounder AFP whereas substitution with Ser abolished its antifreeze activity. 相似文献
6.
Human leukocyte antigens (HLA) or histocompatibility molecules are glycoproteins that play a pivotal role in the development of an effective immune response. An important function of the HLA molecules is the ability to bind and present antigen peptides to T lymphocytes. Presently there is no comprehensive way of predicting and energetically evaluating peptide binding on HLA molecules. To quantitatively determine the binding specificity of a class II HLA molecule interacting with peptides, a novel decomposition approach based on deterministic global optimization is proposed that takes advantage of the topography of HLA binding grove, and examined the interactions of the bound peptide with the five different pockets. In particular, the main focus of this paper is the study of pocket 1 of HLA DR1 (DRB1*0101 allele). The determination of the minimum energy conformation is based on the ECEPP/3 potential energy model that describes the energetics of the atomic interactions. The minimization of the total potential energy is formulated on the set of peptide dihedral angles, Euler angles, and translation variables to describe the relative position. The deterministic global optimization algorithm, αBB, which has been shown to be ϵ-convergent to the global minimum potential energy through the solution of a series of nonlinear convex optimization problems, is utilized. The PACK conformational energy model that utilizes the ECEPP/3 model but also allows the consideration of protein chain interactions is interfaced with αBB. MSEED, a program used to calculate the solvation contribution via the area accessible to the solvent, is also interfaced with αBB. Results are presented for the entire array of naturally occurring amino acids binding to pocket 1 of the HLA DR1 molecule and very good agreement with experimental binding assays is obtained. Proteins 29:87–102, 1997. © 1997 Wiley-Liss, Inc. 相似文献
7.
We present a computational approach for predicting structures of ligand-protein complexes and analyzing binding energy landscapes that combines Monte Carlo simulated annealing technique to determine the ligand bound conformation with the dead-end elimination algorithm for side-chain optimization of the protein active site residues. Flexible ligand docking and optimization of mobile protein side-chains have been performed to predict structural effects in the V32I/I47V/V82I HIV-1 protease mutant bound with the SB203386 ligand and in the V82A HIV-1 protease mutant bound with the A77003 ligand. The computational structure predictions are consistent with the crystal structures of these ligand-protein complexes. The emerging relationships between ligand docking and side-chain optimization of the active site residues are rationalized based on the analysis of the ligand-protein binding energy landscape. Proteins 33:295–310, 1998. © 1998 Wiley-Liss, Inc. 相似文献
8.
G M Verkhivker D Bouzida D K Gehlhaar P A Rejto L Schaffer S Arthurs A B Colson S T Freer V Larson B A Luty T Marrone P W Rose 《Proteins》2001,45(4):456-470
Computer simulations using the simplified energy function and simulated tempering dynamics have accurately determined the native structure of the pYVPML, SVLpYTAVQPNE, and SPGEpYVNIEF peptides in the complexes with SH2 domains. Structural and equilibrium aspects of the peptide binding with SH2 domains have been studied by generating temperature-dependent binding free energy landscapes. Once some native peptide-SH2 domain contacts are constrained, the underlying binding free energy profile has the funnel-like shape that leads to a rapid and consistent acquisition of the native structure. The dominant native topology of the peptide-SH2 domain complexes represents an extended peptide conformation with strong specific interactions in the phosphotyrosine pocket and hydrophobic interactions of the peptide residues C-terminal to the pTyr group. The topological features of the peptide-protein interface are primarily determined by the thermodynamically stable phosphotyrosyl group. A diversity of structurally different binding orientations has been observed for the amino-terminal residues to the phosphotyrosine. The dominant native topology for the peptide residues carboxy-terminal to the phosphotyrosine is tolerant to flexibility in this region of the peptide-SH2 domain interface observed in equilibrium simulations. The energy landscape analysis has revealed a broad, entropically favorable topology of the native binding mode for the bound peptides, which is robust to structural perturbations. This could provide an additional positive mechanism underlying tolerance of the SH2 domains to hydrophobic conservative substitutions in the peptide specificity region. 相似文献
9.
A revised version of the Conformational Space Annealing (CSA) global optimization method is developed, with three separate measures of structural similarity, in order to overcome the inability of a single distance measure to evaluate multiple-chain protein structures adequately. A second search method, Conformational Family Monte Carlo (CFMC), involving genetic-type moves, Monte Carlo-with-minimization perturbations, and explicit clustering of the population into conformational families, is adapted to treat multiple-chain proteins. These two methods are applied to two oligomeric proteins, the retro-GCN4 leucine zipper and the synthetic domain-swapped dimer. CFMC proves superior to CSA in its search for low-energy representatives of its conformational families, but both methods encounter difficulty in finding the native packing arrangements in the absence of native-like symmetry constraints, even when native monomers are present in the population. 相似文献
10.
Abstract A new Monte Carlo sampling scheme, namely the Modified Valley Restrained Monte Carlo procedure, is used to obtain the global energy minimum conformations for polypeptides, such as Met-enkephalin and Melittin. For each peptide, we found close agreement with previous results from both theoretical and experimental studies. The simple idea for controlling the step size according to the Valley Function, provides useful suggestions in searching the global energy minimum structures, and furthermore helps solve the multiple minima problem. 相似文献
11.
J. S. Evans S. I. Chan W. A. Goddard rd 《Protein science : a publication of the Protein Society》1995,4(10):2019-2031
Many interesting proteins possess defined sequence stretches containing negatively charged amino acids. At present, experimental methods (X-ray crystallography, NMR) have failed to provide structural data for many of these sequence domains. We have applied the dihedral probability grid-Monte Carlo (DPG-MC) conformational search algorithm to a series of N- and C-capped polyelectrolyte peptides, (Glu)20, (Asp)20, (PSer)20, and (PSer-Asp)10, that represent polyanionic regions in a number of important proteins, such as parathymosin, calsequestrin, the sodium channel protein, and the acidic biomineralization proteins. The atomic charges were estimated from charge equilibration and the valence and van der Waals parameters are from DREIDING. Solvation of the carboxylate and phosphate groups was treated using sodium counterions for each charged side chain (one Na+ for COO-; two Na for CO(PO3)-2) plus a distance-dependent (shielded) dielectric constant, epsilon = epsilon 0 R, to simulate solvent water. The structures of these polyelectrolyte polypeptides were obtained by the DPG-MC conformational search with epsilon 0 = 10, followed by calculation of solvation energies for the lowest energy conformers using the protein dipole-Langevin dipole method of Warshel. These calculations predict a correlation between amino acid sequence and global folded conformational minima: 1. Poly-L-Glu20, our structural benchmark, exhibited a preference for right-handed alpha-helix (47% helicity), which approximates experimental observations of 55-60% helicity in solution. 2. For Asp- and PSer-containing sequences, all conformers exhibited a low preference for right-handed alpha-helix formation (< or = 10%), but a significant percentage (approximately 20% or greater) of beta-strand and beta-turn dihedrals were found in all three sequence cases: (1) Aspn forms supercoil conformers, with a 2:1:1 ratio of beta-turn:beta-strand:alpha-helix dihedral angles; (2) PSer20 features a nearly 1:1 ratio of beta-turn:beta-sheet dihedral preferences, with very little preference for alpha-helical structure, and possesses short regions of strand and turn combinations that give rise to a collapsed bend or hairpin structure; (3) (PSer-Asp)10 features a 3:2:1 ratio of beta-sheet:beta-turn:alpha-helix and gives rise to a superturn or C-shaped structure. 相似文献
12.
Protein recognition is one of the most challenging and intriguing problems in structural biology. Despite all the available structural, sequence and biophysical information about protein-protein complexes, the physico-chemical patterns, if any, that make a protein surface likely to be involved in protein-protein interactions, remain elusive. Here, we apply protein docking simulations and analysis of the interaction energy landscapes to identify protein-protein interaction sites. The new protocol for global docking based on multi-start global energy optimization of an all-atom model of the ligand, with detailed receptor potentials and atomic solvation parameters optimized in a training set of 24 complexes, explores the conformational space around the whole receptor without restrictions. The ensembles of the rigid-body docking solutions generated by the simulations were subsequently used to project the docking energy landscapes onto the protein surfaces. We found that highly populated low-energy regions consistently corresponded to actual binding sites. The procedure was validated on a test set of 21 known protein-protein complexes not used in the training set. As much as 81% of the predicted high-propensity patch residues were located correctly in the native interfaces. This approach can guide the design of mutations on the surfaces of proteins, provide geometrical details of a possible interaction, and help to annotate protein surfaces in structural proteomics. 相似文献
13.
We describe an effective procedure for modeling the structures of simple transmembrane helix homo-oligomers. The method differs from many previous approaches in that the only structural constraint we use to help select the correct model is the oligomerization state of the protein. The method involves the following steps: (1) perform 100-250 independent Monte Carlo energy minimizations of helix pairs to produce a large collection of well-packed structures; (2) filter the minimized structures to find those that are consistent with the expected symmetry of the oligomer; (3) cluster the structures that pass the symmetry filter; and (4) select a representative of the most populous cluster as the final prediction. We applied the method to the transmembrane helices of five proteins and compare our results to the available experimental data. Our predictions of glycophorin A, neu, the M2 channel and phospholamban resulted in a single model for each protein that agreed with the experimental results. In the case of erbB-2, however, we obtained three structurally distinct clusters of approximately equal sizes, so it was not possible to identify a clearly favored structure. This may reflect a real heterogeneity of packing modes for erbB-2, which is known to interact with different receptor subunits. Our method should be useful for obtaining structural models of transmembrane domains, improving our understanding of structure/function relationships for particular membrane proteins. 相似文献
14.
15.
An atomically detailed potential for docking pairs of proteins is derived using mathematical programming. A refinement algorithm that builds atomically detailed models of the complex and combines coarse grained and atomic scoring is introduced. The refinement step consists of remodeling the interface side chains of the top scoring decoys from rigid docking followed by a short energy minimization. The refined models are then re‐ranked using a combination of coarse grained and atomic potentials. The docking algorithm including the refinement and re‐ranking, is compared favorably to other leading docking packages like ZDOCK, Cluspro, and PATCHDOCK, on the ZLAB 3.0 Benchmark and a test set of 30 novel complexes. A detailed analysis shows that coarse grained potentials perform better than atomic potentials for realistic unbound docking (where the exact structures of the individual bound proteins are unknown), probably because atomic potentials are more sensitive to local errors. Nevertheless, the atomic potential captures a different signal from the residue potential and as a result a combination of the two scores provides a significantly better prediction than each of the approaches alone. Proteins 2013. © 2012 Wiley Periodicals, Inc. 相似文献
16.
We have developed a method to both predict the geometry and the relative stability of point mutants that may be used for arbitrary mutations. The geometry optimization procedure was first tested on a new benchmark of 2141 ordered pairs of X-ray crystal structures of proteins that differ by a single point mutation, the largest data set to date. An empirical energy function, which includes terms representing the energy contributions of the folded and denatured proteins and uses the predicted mutant side chain conformation, was fit to a training set consisting of half of a diverse set of 1816 experimental stability values for single point mutations in 81 different proteins. The data included a substantial number of small to large residue mutations not considered by previous prediction studies. After removing 22 (approximately 2%) outliers, the stability calculation gave a standard deviation of 1.08 kcal/mol with a correlation coefficient of 0.82. The prediction method was then tested on the remaining half of the experimental data, giving a standard deviation of 1.10 kcal/mol and covariance of 0.66 for 97% of the test set. A regression fit of the energy function to a subset of 137 mutants, for which both native and mutant structures were available, gave a prediction error comparable to that for the complete training set with predicted side chain conformations. We found that about half of the variation is due to conformation-independent residue contributions. Finally, a fit to the experimental stability data using these residue parameters exclusively suggests guidelines for improving protein stability in the absence of detailed structure information. 相似文献
17.
G-protein coupled receptors (GPCRs) are the largest family of cell-surface receptors involved in signal transmission. Drugs associated with GPCRs represent more than one fourth of the 100 top-selling drugs and are the targets of more than half of the current therapeutic agents on the market. Our methodology based on the internal coordinate mechanics (ICM) program can accurately identify the ligand-binding pocket in the currently available crystal structures of seven transmembrane (7TM) proteins [bacteriorhodopsin (BR) and bovine rhodopsin (bRho)]. The binding geometry of the ligand can be accurately predicted by ICM flexible docking with and without the loop regions, a useful finding for GPCR docking because the transmembrane regions are easier to model. We also demonstrate that the native ligand can be identified by flexible docking and scoring in 1.5% and 0.2% (for bRho and BR, respectively) of the best scoring compounds from two different types of compound database. The same procedure can be applied to the database of available chemicals to identify specific GPCR binders. Finally, we demonstrate that even if the sidechain positions in the bRho binding pocket are entirely wrong, their correct conformation can be fully restored with high accuracy (0.28 A) through the ICM global optimization with and without the ligand present. These binding site adjustments are critical for flexible docking of new ligands to known structures or for docking to GPCR homology models. The ICM docking method has the potential to be used to \"de-orphanize\" orphan GPCRs (oGPCRs) and to identify antagonists-agonists for GPCRs if an accurate model (experimentally and computationally validated) of the structure has been constructed or when future crystal structures are determined. 相似文献
18.
Protein folding, stability, and function are usually influenced by pH. And free energy plays a fundamental role in analysis of such pH-dependent properties. Electrostatics-based theoretical framework using dielectric solvent continuum model and solving Poisson-Boltzmann equation numerically has been shown to be very successful in understanding the pH-dependent properties. However, in this approach the exact computation of pH-dependent free energy becomes impractical for proteins possessing more than several tens of ionizable sites (e.g.>30), because exact evaluation of the partition function requires a summation over a vast number of possible protonation microstates. Here we present a method which computes the free energy using the average energy and the protonation probabilities of ionizable sites obtained by the well-established Monte Carlo sampling procedure. The key feature is to calculate the entropy by using the protonation probabilities. We used this method to examine a well-studied protein (lysozyme) and produced results which agree very well with the exact calculations. Applications to the optimum pH of maximal stability of proteins and protein–DNA interactions have also resulted in good agreement with experimental data. These examples recommend our method for application to the elucidation of the pH-dependent properties of proteins. 相似文献
19.
Most structure prediction algorithms consist of initial sampling of the conformational space, followed by rescoring and possibly refinement of a number of selected structures. Here we focus on protein docking, and show that while decoupling sampling and scoring facilitates method development, integration of the two steps can lead to substantial improvements in docking results. Since decoupling is usually achieved by generating a decoy set containing both non‐native and near‐native docked structures, which can be then used for scoring function construction, we first review the roles and potential pitfalls of decoys in protein–protein docking, and show that some type of decoys are better than others for method development. We then describe three case studies showing that complete decoupling of scoring from sampling is not the best choice for solving realistic docking problems. Although some of the examples are based on our own experience, the results of the CAPRI docking and scoring experiments also show that performing both sampling and scoring generally yields better results than scoring the structures generated by all predictors. Next we investigate how the selection of training and decoy sets affects the performance of the scoring functions obtained. Finally, we discuss pathways to better alignment of the two steps, and show some algorithms that achieve a certain level of integration. Although we focus on protein–protein docking, our observations most likely also apply to other conformational search problems, including protein structure prediction and the docking of small molecules to proteins.Proteins 2013; 81:1874–1884. © 2013 Wiley Periodicals, Inc. 相似文献
20.
We present a knowledge‐based function to score protein decoys based on their similarity to native structure. A set of features is constructed to describe the structure and sequence of the entire protein chain. Furthermore, a qualitative relationship is established between the calculated features and the underlying electromagnetic interaction that dominates this scale. The features we use are associated with residue–residue distances, residue–solvent distances, pairwise knowledge‐based potentials and a four‐body potential. In addition, we introduce a new target to be predicted, the fitness score, which measures the similarity of a model to the native structure. This new approach enables us to obtain information both from decoys and from native structures. It is also devoid of previous problems associated with knowledge‐based potentials. These features were obtained for a large set of native and decoy structures and a back‐propagating neural network was trained to predict the fitness score. Overall this new scoring potential proved to be superior to the knowledge‐based scoring functions used as its inputs. In particular, in the latest CASP (CASP10) experiment our method was ranked third for all targets, and second for freely modeled hard targets among about 200 groups for top model prediction. Ours was the only method ranked in the top three for all targets and for hard targets. This shows that initial results from the novel approach are able to capture details that were missed by a broad spectrum of protein structure prediction approaches. Source codes and executable from this work are freely available at http://mathmed.org /#Software and http://mamiris.com/ . Proteins 2014; 82:752–759. © 2013 Wiley Periodicals, Inc. 相似文献