首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Huang SY  Zou X 《Proteins》2008,72(2):557-579
Using an efficient iterative method, we have developed a distance-dependent knowledge-based scoring function to predict protein-protein interactions. The function, referred to as ITScore-PP, was derived using the crystal structures of a training set of 851 protein-protein dimeric complexes containing true biological interfaces. The key idea of the iterative method for deriving ITScore-PP is to improve the interatomic pair potentials by iteration, until the pair potentials can distinguish true binding modes from decoy modes for the protein-protein complexes in the training set. The iterative method circumvents the challenging reference state problem in deriving knowledge-based potentials. The derived scoring function was used to evaluate the ligand orientations generated by ZDOCK 2.1 and the native ligand structures on a diverse set of 91 protein-protein complexes. For the bound test cases, ITScore-PP yielded a success rate of 98.9% if the top 10 ranked orientations were considered. For the more realistic unbound test cases, the corresponding success rate was 40.7%. Furthermore, for faster orientational sampling purpose, several residue-level knowledge-based scoring functions were also derived following the similar iterative procedure. Among them, the scoring function that uses the side-chain center of mass (SCM) to represent a residue, referred to as ITScore-PP(SCM), showed the best performance and yielded success rates of 71.4% and 30.8% for the bound and unbound cases, respectively, when the top 10 orientations were considered. ITScore-PP was further tested using two other published protein-protein docking decoy sets, the ZDOCK decoy set and the RosettaDock decoy set. In addition to binding mode prediction, the binding scores predicted by ITScore-PP also correlated well with the experimentally determined binding affinities, yielding a correlation coefficient of R = 0.71 on a test set of 74 protein-protein complexes with known affinities. ITScore-PP is computationally efficient. The average run time for ITScore-PP was about 0.03 second per orientation (including optimization) on a personal computer with 3.2 GHz Pentium IV CPU and 3.0 GB RAM. The computational speed of ITScore-PP(SCM) is about an order of magnitude faster than that of ITScore-PP. ITScore-PP and/or ITScore-PP(SCM) can be combined with efficient protein docking software to study protein-protein recognition.  相似文献   

2.
Ma XH  Wang CX  Li CH  Chen WZ 《Protein engineering》2002,15(8):677-681
Three useful variables from the interfaces of 20 protein-protein complexes were investigated. These variables are the side-chain accessible number (N(b)), the number of hydrophilic pairs (N(pair)) and buried a polar solvent accessible surface areas (DeltaDeltaASA(apol)). An empirical model based on the three variables was developed to describe the free energy of protein associations. As the results show, the side-chain accessible numbers characterize the loss of side-chain conformational entropy of protein interactions and the effective empirical function presented here has great capability for estimating the binding free energy. It was found that the variables of interface information capture most of the significant features of protein-protein association. Also, we applied the model based on the variables as a rescoring function to docking simulations and found that it has the potential to distinguish the 'true' binding mode. It is clear that the simple and empirical scale developed here is an attractive target function for calculating binding free energy for various biological processes to rational protein design.  相似文献   

3.
del Sol A  O'Meara P 《Proteins》2005,58(3):672-682
We show that protein complexes can be represented as small-world networks, exhibiting a relatively small number of highly central amino-acid residues occurring frequently at protein-protein interfaces. We further base our analysis on a set of different biological examples of protein-protein interactions with experimentally validated hot spots, and show that 83% of these predicted highly central residues, which are conserved in sequence alignments and nonexposed to the solvent in the protein complex, correspond to or are in direct contact with an experimentally annotated hot spot. The remaining 17% show a general tendency to be close to an annotated hot spot. On the other hand, although there is no available experimental information on their contribution to the binding free energy, detailed analysis of their properties shows that they are good candidates for being hot spots. Thus, highly central residues have a clear tendency to be located in regions that include hot spots. We also show that some of the central residues in the protein complex interfaces are central in the monomeric structures before dimerization and that possible information relating to hot spots of binding free energy could be obtained from the unbound structures.  相似文献   

4.
We have studied the effect of point mutations of the primary binding residue (P1) at the protein-protein interface in complexes of chymotrypsin and elastase with the third domain of the turkey ovomucoid inhibitor and in trypsin with the bovine pancreatic trypsin inhibitor, using molecular dynamics simulations combined with the linear interaction energy (LIE) approach. A total of 56 mutants have been constructed and docked into their host proteins. The free energy of binding could be reliably calculated for 52 of these mutants that could unambiguously be fitted into the binding sites. We find that the predicted binding free energies are in very good agreement with experimental data with mean unsigned errors between 0.50 and 1.03 kcal/mol. It is also evident that the standard LIE model used to study small drug-like ligand binding to proteins is not suitable for protein-protein interactions. Three different LIE models were therefore tested for each of the series of protein-protein complexes included, and the best models for each system turn out to be very similar. The difference in parameterization between small drug-like compounds and protein point mutations is attributed to the preorganization of the binding surface. Our results clearly demonstrate the potential of free energy calculations for probing the effect of point mutations at protein-protein interfaces and for exploring the principles of specificity of hot spots at the interface.  相似文献   

5.
Lu L  Lu H  Skolnick J 《Proteins》2002,49(3):350-364
In this postgenomic era, the ability to identify protein-protein interactions on a genomic scale is very important to assist in the assignment of physiological function. Because of the increasing number of solved structures involving protein complexes, the time is ripe to extend threading to the prediction of quaternary structure. In this spirit, a multimeric threading approach has been developed. The approach is comprised of two phases. In the first phase, traditional threading on a single chain is applied to generate a set of potential structures for the query sequences. In particular, we use our recently developed threading algorithm, PROSPECTOR. Then, for those proteins whose template structures are part of a known complex, we rethread on both partners in the complex and now include a protein-protein interfacial energy. To perform this analysis, a database of multimeric protein structures has been constructed, the necessary interfacial pairwise potentials have been derived, and a set of empirical indicators to identify true multimers based on the threading Z-score and the magnitude of the interfacial energy have been established. The algorithm has been tested on a benchmark set comprised of 40 homodimers, 15 heterodimers, and 69 monomers that were scanned against a protein library of 2478 structures that comprise a representative set of structures in the Protein Data Bank. Of these, the method correctly recognized and assigned 36 homodimers, 15 heterodimers, and 65 monomers. This protocol was applied to identify partners and assign quaternary structures of proteins found in the yeast database of interacting proteins. Our multimeric threading algorithm correctly predicts 144 interacting proteins, compared to the 56 (26) cases assigned by PSI-BLAST using a (less) permissive E-value of 1 (0.01). Next, all possible pairs of yeast proteins have been examined. Predictions (n = 2865) of protein-protein interactions are made; 1138 of these 2865 interactions have counterparts in the Database of Interacting Proteins. In contrast, PSI-BLAST made 1781 predictions, and 1215 have counterparts in DIP. An estimation of the false-negative rate for yeast-predicted interactions has also been provided. Thus, a promising approach to help assist in the assignment of protein-protein interactions on a genomic scale has been developed.  相似文献   

6.
Protein-protein interactions are governed by the change in free energy upon binding, ΔG = ΔH - TΔS. These interactions are often marginally stable, so one must examine the balance between the change in enthalpy, ΔH, and the change in entropy, ΔS, when investigating known complexes, characterizing the effects of mutations, or designing optimized variants. To perform a large-scale study into the contribution of conformational entropy to binding free energy, we developed a technique called GOBLIN (Graphical mOdel for BiomoLecular INteractions) that performs physics-based free energy calculations for protein-protein complexes under both side-chain and backbone flexibility. Goblin uses a probabilistic graphical model that exploits conditional independencies in the Boltzmann distribution and employs variational inference techniques that approximate the free energy of binding in only a few minutes. We examined the role of conformational entropy on a benchmark set of more than 700 mutants in eight large, well-studied complexes. Our findings suggest that conformational entropy is important in protein-protein interactions--the root mean square error (RMSE) between calculated and experimentally measured ΔΔGs decreases by 12% when explicit entropic contributions were incorporated. GOBLIN models all atoms of the protein complex and detects changes to the binding entropy along the interface as well as positions distal to the binding interface. Our results also suggest that a variational approach to entropy calculations may be quantitatively more accurate than the knowledge-based approaches used by the well-known programs FOLDX and Rosetta--GOBLIN's RMSEs are 10 and 36% lower than these programs, respectively.  相似文献   

7.
Protein-protein interactions were measured for ovalbumin and for lysozyme in aqueous salt solutions. Protein-protein interactions are correlated with a proposed potential of mean force equal to the free energy to desolvate the protein surface that is made inaccessible to the solvent due to the protein-protein interaction. This energy is calculated from the surface free energy of the protein that is determined from protein-salt preferential-interaction parameter measurements. In classical salting-out behavior, the protein-salt preferential interaction is unfavorable. Because addition of salt raises the surface free energy of the protein according to the surface-tension increment of the salt, protein-protein attraction increases, leading to a reduction in solubility. When the surface chemistry of proteins is altered by binding of a specific ion, salting-in is observed when the interactions between (kosmotrope) ion-protein complexes are more repulsive than those between the uncomplexed proteins. However, salting-out is observed when interactions between (chaotrope) ion-protein complexes are more attractive than those of the uncomplexed proteins.  相似文献   

8.
The protein-protein interaction energy of 12 nonhomologous serine protease-inhibitor and 15 antibody-antigen complexes is calculated using a molecular mechanics formalism and dissected in terms of the main-chain vs. side-chain contribution, nonrotameric side-chain contributions, and amino acid residue type involvement in the interface interaction. There are major differences in the interactions of the two types of protein-protein complex. Protease-inhibitor complexes interact predominantly through a main-chain-main-chain mechanism while antibody-antigen complexes interact predominantly through a side-chain-side-chain or a side-chain-main-chain mechanism. However, there is no simple correlation between the main-chain-main-chain interaction energy and the percentage of main-chain surface area buried on binding. The interaction energy is equally effected by the presence of nonrotameric side-chain conformations, which constitute approximately 20% of the interaction energy. The ability to reproduce the interface interaction energy of the crystal structure if original side-chain conformations are removed from the calculation is much greater in the protease-inhibitor complexes than the antibody-antigen complexes. The success of a rotameric model for protein-protein docking appears dependent on the extent of the main-chain-main-chain contribution to binding. Analysis of (1) residue type and (2) residue pair interactions at the interface show that antibody-antigen interactions are very restricted with over 70% of the antibody energy attributable to just six residue types (Tyr > Asp > Asn > Ser > Glu > Trp) in agreement with previous studies on residue propensity. However, it is found here that 50% of the antigen energy is attributable to just four residue types (Arg = Lys > Asn > Asp). On average just 12 residue pair interactions (6%) contribute over 40% of the favorable interaction energy in the antibody-antigen complexes, with charge-charge and charge/polar-tyrosine interactions being prominent. In contrast protease inhibitors use a diverse set of residue types and residue pair interactions.  相似文献   

9.
Clark LA  van Vlijmen HW 《Proteins》2008,70(4):1540-1550
A distance-dependent knowledge-based potential for protein-protein interactions is derived and tested for application in protein design. Information on residue type specific C(alpha) and C(beta) pair distances is extracted from complex crystal structures in the Protein Data Bank and used in the form of radial distribution functions. The use of only backbone and C(beta) position information allows generation of relative protein-protein orientation poses with minimal sidechain information. Further coarse-graining can be done simply in the same theoretical framework to give potentials for residues of known type interacting with unknown type, as in a one-sided interface design problem. Both interface design via pose generation followed by sidechain repacking and localized protein-protein docking tests are performed on 39 nonredundant antibody-antigen complexes for which crystal structures are available. As reference, Lennard-Jones potentials, unspecific for residue type and biasing toward varying degrees of residue pair separation are used as controls. For interface design, the knowledge-based potentials give the best combination of consistently designable poses, low RMSD to the known structure, and more tightly bound interfaces with no added computational cost. 77% of the poses could be designed to give complexes with negative free energies of binding. Generally, larger interface separation promotes designability, but weakens the binding of the resulting designs. A localized docking test shows that the knowledge-based nature of the potentials improves performance and compares respectably with more sophisticated all-atoms potentials.  相似文献   

10.
Absolute binding free energy calculations and free energy decompositions are presented for the protein-protein complexes H-Ras/C-Raf1 and H-Ras/RalGDS. Ras is a central switch in the regulation of cell proliferation and differentiation. In our study, we investigate the capability of the molecular mechanics (MM)-generalized Born surface area (GBSA) approach to estimate absolute binding free energies for the protein-protein complexes. Averaging gas-phase energies, solvation free energies, and entropic contributions over snapshots extracted from trajectories of the unbound proteins and the complexes, calculated binding free energies (Ras-Raf: -15.0(+/-6.3)kcal mol(-1); Ras-RalGDS: -19.5(+/-5.9)kcal mol(-1)) are in fair agreement with experimentally determined values (-9.6 kcal mol(-1); -8.4 kcal mol(-1)), if appropriate ionic strength is taken into account. Structural determinants of the binding affinity of Ras-Raf and Ras-RalGDS are identified by means of free energy decomposition. For the first time, computationally inexpensive generalized Born (GB) calculations are applied in this context to partition solvation free energies along with gas-phase energies between residues of both binding partners. For selected residues, in addition, entropic contributions are estimated by classical statistical mechanics. Comparison of the decomposition results with experimentally determined binding free energy differences for alanine mutants of interface residues yielded correlations with r(2)=0.55 and 0.46 for Ras-Raf and Ras-RalGDS, respectively. Extension of the decomposition reveals residues as far apart as 25A from the binding epitope that can contribute significantly to binding free energy. These "hotspots" are found to show large atomic fluctuations in the unbound proteins, indicating that they reside in structurally less stable regions. Furthermore, hotspot residues experience a significantly larger-than-average decrease in local fluctuations upon complex formation. Finally, by calculating a pair-wise decomposition of interactions, interaction pathways originating in the binding epitope of Raf are found that protrude through the protein structure towards the loop L1. This explains the finding of a conformational change in this region upon complex formation with Ras, and it may trigger a larger structural change in Raf, which is considered to be necessary for activation of the effector by Ras.  相似文献   

11.
In a previous paper, we described a novel empirical free energy function that was used to accurately predict experimental binding free energies for a diverse test set of 31 protein–protein complexes to within ≈ 1.0 kcal. Here, we extend that work and show that an updated version of the function can be used to (1) accurately predict native binding free energies and (2) rank crystallographic, native-like and non-native binding modes in a physically realistic manner. The modified function includes terms designed to capture some of the unfavorable interactions that characterize non-native interfaces. The function was used to calculate one-dimensional binding free energy surfaces for 21 protein complexes. In roughly 90% of the cases tested, the function was used to place native-like and crystallographic binding modes in global free energy minima. Our analysis further suggests that buried hydrogen bonds might provide the key to distinguishing native from non-native interactions. To the best of our knowledge our function is the only one of its kind, a single expression that can be used to accurately calculate native and non-native binding free energies for a large number of proteins. Given the encouraging results presented in this paper, future work will focus on improving the function and applying it to the protein–protein docking problem.  相似文献   

12.
Tuncbag N  Keskin O  Nussinov R  Gursoy A 《Proteins》2012,80(4):1239-1249
The similarity between folding and binding led us to posit the concept that the number of protein-protein interface motifs in nature is limited, and interacting protein pairs can use similar interface architectures repeatedly, even if their global folds completely vary. Thus, known protein-protein interface architectures can be used to model the complexes between two target proteins on the proteome scale, even if their global structures differ. This powerful concept is combined with a flexible refinement and global energy assessment tool. The accuracy of the method is highly dependent on the structural diversity of the interface architectures in the template dataset. Here, we validate this knowledge-based combinatorial method on the Docking Benchmark and show that it efficiently finds high-quality models for benchmark complexes and their binding regions even in the absence of template interfaces having sequence similarity to the targets. Compared to "classical" docking, it is computationally faster; as the number of target proteins increases, the difference becomes more dramatic. Further, it is able to distinguish binders from nonbinders. These features allow performing large-scale network modeling. The results on an independent target set (proteins in the p53 molecular interaction map) show that current method can be used to predict whether a given protein pair interacts. Overall, while constrained by the diversity of the template set, this approach efficiently produces high-quality models of protein-protein complexes. We expect that with the growing number of known interface architectures, this type of knowledge-based methods will be increasingly used by the broad proteomics community.  相似文献   

13.
We have made a comparative structure based analysis of the thermodynamics of lectin-carbohydrate (L-C) binding and protein folding. Examination of the total change in accessible surface area in those processes revealed a much larger decrease in free energy per unit of area buried in the case of L-C associations. According to our analysis, this larger stabilization of L-C interactions arises from a more favorable enthalpy of burying a unit of polar surface area, and from higher proportions of polar areas. Hydrogen bonds present at 14 L-C interfaces were identified, and their overall characteristics were compared to those reported before for hydrogen bonds in protein structures. Three major factors might explain why polar-polar interactions are stronger in L-C binding than in protein folding: (1) higher surface density of hydrogen bonds; (2) better hydrogen-bonding geometry; (3) larger proportion of hydrogen bonds involving charged groups. Theoretically, the binding entropy can be partitioned into three main contributions: entropy changes due to surface desolvation, entropy losses arising from freezing rotatable bonds, and entropic effects that result from restricting translation and overall rotation motions. These contributions were estimated from structural information and added up to give calculated binding entropies. Good correlation between experimental and calculated values was observed when solvation effects were treated according to a parametrization developed by other authors from protein folding studies. Finally, our structural parametrization gave calculated free energies that deviate from experimental values by 1.1 kcal/mol on the average; this amounts to an uncertainty of one order of magnitude in the binding constant.  相似文献   

14.
Protein recognition is one of the most challenging and intriguing problems in structural biology. Despite all the available structural, sequence and biophysical information about protein-protein complexes, the physico-chemical patterns, if any, that make a protein surface likely to be involved in protein-protein interactions, remain elusive. Here, we apply protein docking simulations and analysis of the interaction energy landscapes to identify protein-protein interaction sites. The new protocol for global docking based on multi-start global energy optimization of an all-atom model of the ligand, with detailed receptor potentials and atomic solvation parameters optimized in a training set of 24 complexes, explores the conformational space around the whole receptor without restrictions. The ensembles of the rigid-body docking solutions generated by the simulations were subsequently used to project the docking energy landscapes onto the protein surfaces. We found that highly populated low-energy regions consistently corresponded to actual binding sites. The procedure was validated on a test set of 21 known protein-protein complexes not used in the training set. As much as 81% of the predicted high-propensity patch residues were located correctly in the native interfaces. This approach can guide the design of mutations on the surfaces of proteins, provide geometrical details of a possible interaction, and help to annotate protein surfaces in structural proteomics.  相似文献   

15.
We have searched for intermolecular aromatic pairs in 77 protein-protein complexes of intrinsically disordered proteins (IDPs) to understand the role of π-π interactions in protein-protein interactions involving IDPs. We found that 40% of the complexes possess at least one intermolecular pair of aromatic residues. Analysis of composition, characteristics, location and the contribution to the free energy of binding showed that π-π interactions substantially contribute to binding by working as anchor residues, conformational locks, and ready-made recognition motifs required for specific binding. By using available experimental data we show that π-π interactions play a variety of roles that link binding of IDPs and their function in the cell. The results presented in this study pave the way to understand in atomic detail the inner workings of IDPs interaction networks.  相似文献   

16.
Predicting changes in protein binding affinity due to single amino acid mutations helps us better understand the driving forces underlying protein-protein interactions and design improved biotherapeutics. Here, we use the MM-GBSA approach with the OPLS2005 force field and the VSGB2.0 solvent model to calculate differences in binding free energy between wild type and mutant proteins. Crucially, we made no changes to the scoring model as part of this work on protein-protein binding affinity—the energy model has been developed for structure prediction and has previously been validated only for calculating the energetics of small molecule binding. Here, we compare predictions to experimental data for a set of 418 single residue mutations in 21 targets and find that the MM-GBSA model, on average, performs well at scoring these single protein residue mutations. Correlation between the predicted and experimental change in binding affinity is statistically significant and the model performs well at picking “hotspots,” or mutations that change binding affinity by more than 1 kcal/mol. The promising performance of this physics-based method with no tuned parameters for predicting binding energies suggests that it can be transferred to other protein engineering problems.  相似文献   

17.
We propose a self-consistent approach to analyze knowledge-based atom-atom potentials used to calculate protein-ligand binding energies. Ligands complexed to actual protein structures were first built using the SMoG growth procedure (DeWitte & Shakhnovich, 1996) with a chosen input potential. These model protein-ligand complexes were used to construct databases from which knowledge-based protein-ligand potentials were derived. We then tested several different modifications to such potentials and evaluated their performance on their ability to reconstruct the input potential using the statistical information available from a database composed of model complexes. Our data indicate that the most significant improvement resulted from properly accounting for the following key issues when estimating the reference state: (1) the presence of significant nonenergetic effects that influence the contact frequencies and (2) the presence of correlations in contact patterns due to chemical structure. The most successful procedure was applied to derive an atom-atom potential for real protein-ligand complexes. Despite the simplicity of the model (pairwise contact potential with a single interaction distance), the derived binding free energies showed a statistically significant correlation (approximately 0.65) with experimental binding scores for a diverse set of complexes.  相似文献   

18.
Molecular principles of the interactions of disordered proteins   总被引:6,自引:0,他引:6  
Thorough knowledge of the molecular principles of protein-protein recognition is essential to our understanding of protein function at the cellular level. Whereas interactions of ordered proteins have been analyzed in great detail, complexes of intrinsically unstructured/disordered proteins (IUPs) have hardly been addressed so far. Here, we have collected a database of 39 complexes of experimentally verified IUPs, and compared their interfaces with those of 72 complexes of ordered, globular proteins. The characteristic differences found between the two types of complexes suggest that IUPs represent a distinct molecular implementation of the principles of protein-protein recognition. The interfaces do not differ in size, but those of IUPs cover a much larger part of the surface of the protein than for their ordered counterparts. Moreover, IUP interfaces are significantly more hydrophobic relative to their overall amino acid composition, but also in absolute terms. They rely more on hydrophobic-hydrophobic than on polar-polar interactions. Their amino acids in the interface realize more intermolecular contacts, which suggests a better fit with the partner due to induced folding upon binding that results in a better adaptation to the partner. The two modes of interaction also differ in that IUPs usually use only a single continuous segment for partner binding, whereas the binding sites of ordered proteins are more segmented. Probably, all these features contribute to the increased evolutionary conservation of IUP interface residues. These noted molecular differences are also manifested in the interaction energies of IUPs. Our approximation of these by low-resolution force-fields shows that IUPs gain much more stabilization energy from intermolecular contacts, than from folding, i.e. they use their binding energy for folding. Overall, our findings provide a structural rationale to the prior suggestions that many IUPs are specialized for functions realized by protein-protein interactions.  相似文献   

19.
20.
For the first time, a statistical potential has been developed to quantitatively describe the CH.O hydrogen bonding interaction at the protein-protein interface. The calculated energies of the CH.O pair interaction show a favorable valley at approximately 3.3 A, exhibiting a feature typical of an H-bond and similar to the ab initio quantum calculation result (Scheiner, S., Kar, T., and Gu, Y. (2001) J. Biol. Chem. 276, 9832-9837). The potentials have been applied to a set of 469 protein-protein complexes to calculate the contribution of different types of interactions to each protein complex: the average energy contribution of a conventional H-bond is approximately 30%; that of a CH.O H-bond is 17%; and that of a hydrophobic interaction is 50%. In some protein-protein complexes, the contribution of the CH.O H-bond can reach as high as approximately 40-50%, indicating the importance of the CH.O H-bond at the protein interface. At the interfaces of these complexes, C(alpha)H.O H-bonds frequently occur between adjacent strands in both parallel and antiparallel orientations, having the obvious structural motif of bifurcated H-bonds. Our study suggests that the weak CH.O H-bond makes an important contribution to the association and stability of protein complexes and needs more attention in protein-protein interaction studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号