
1.

How different from random are docking predictions when ranked by scoring functions?

Elisenda Feliu Baldomero Oliva 《Proteins》2010,78(16):3376-3385

Docking algorithms predict the structure of protein–protein interactions. They sample the orientation of two unbound proteins to produce various predictions about their interactions, followed by a scoring step to rank the predictions. We present a statistical assessment of scoring functions used to rank near‐native orientations, applying our statistical analysis to a benchmark dataset of decoys of protein–protein complexes and assessing the statistical significance of the outcome in the Critical Assessment of PRedicted Interactions (CAPRI) scoring experiment. A P value was assigned that depended on the number of near‐native structures in the sampling. We studied the effect of filtering out redundant structures and tested the use of pair‐potentials derived using ZDock and ZRank. Our results show that for many targets, it is not possible to determine when a successful reranking performed by scoring functions results merely from random choice. This analysis reveals that changes should be made in the design of the CAPRI scoring experiment. We propose including the statistical assessment in this experiment either at the preprocessing or the evaluation step. Proteins 2010. © 2010 Wiley‐Liss, Inc.
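A simple hypergeometric calculation illustrates how chance success can depend on the number of near‐native structures in a decoy set. The sketch below is not the authors' procedure. It assumes that a "random" scoring function picks its top m decoys uniformly without replacement, and the function name and parameters are hypothetical.

```python
from math import comb


def p_hit_by_chance(n_decoys: int, n_near_native: int, top_m: int) -> float:
    """Probability that a uniformly random top-m selection contains
    at least one near-native decoy (hypergeometric model, assumed here)."""
    if not 0 <= n_near_native <= n_decoys:
        raise ValueError("n_near_native must lie between 0 and n_decoys")
    if not 0 <= top_m <= n_decoys:
        raise ValueError("top_m must lie between 0 and n_decoys")
    # P(no near-native among the m picks) = C(N - K, m) / C(N, m)
    p_miss = comb(n_decoys - n_near_native, top_m) / comb(n_decoys, top_m)
    return 1.0 - p_miss


if __name__ == "__main__":
    # Illustrative numbers only: 1000 decoys, top 10 examined.
    for k in (1, 5, 50):
        print(k, round(p_hit_by_chance(1000, k, 10), 4))
```

Under this model, the more near‐native decoys a set contains, the more likely a hit in the top m is to occur by chance alone, so a successful reranking on such a target carries less evidence that the scoring function is actually discriminating.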

2.

Total citations: 1; self-citations: 0; citations by others: 1

Tanggis Bohnuud Lingqi Luo Shoshana J. Wodak Alexandre M. J. J. Bonvin Zhiping Weng Sandor Vajda Ora Schueler‐Furman Dima Kozakov 《Proteins》2017,85(1):10-16

Protein docking procedures carry out the task of predicting the structure of a protein–protein complex starting from the known structures of the individual protein components. More often than not, however, the structure of one or both components is not known, but can be derived by homology modeling on the basis of known structures of related proteins deposited in the Protein Data Bank (PDB). Thus, the problem is to develop methods that optimally integrate homology modeling and docking with the goal of predicting the structure of a complex directly from the amino acid sequences of its component proteins. One possibility is to use the best available homology modeling and docking methods. However, the models built for the individual subunits often differ to a significant degree from the bound conformation in the complex, often much more so than the differences observed between free and bound structures of the same protein, and therefore additional conformational adjustments, both at the backbone and side chain levels need to be modeled to achieve an accurate docking prediction. In particular, even homology models of overall good accuracy frequently include localized errors that unfavorably impact docking results. The predicted reliability of the different regions in the model can also serve as a useful input for the docking calculations. Here we present a benchmark dataset that should help to explore and solve combined modeling and docking problems. This dataset comprises a subset of the experimentally solved ‘target’ complexes from the widely used Docking Benchmark from the Weng Lab (excluding antibody–antigen complexes). This subset is extended to include the structures from the PDB related to those of the individual components of each complex, and hence represent potential templates for investigating and benchmarking integrated homology modeling and docking approaches. 
Template sets can be dynamically customized by specifying ranges in sequence similarity and in PDB release dates, or using other filtering options, such as excluding sets of specific structures from the template list. Multiple sequence alignments, as well as structural alignments of the templates to their corresponding subunits in the target are also provided. The resource is accessible online or can be downloaded at http://cluspro.org/benchmark , and is updated on a weekly basis in synchrony with new PDB releases. Proteins 2016; 85:10–16. © 2016 Wiley Periodicals, Inc.

3.

Jacob Verburgt Daisuke Kihara 《Proteins》2022,90(1):83-95

Protein structure docking is the process in which the quaternary structure of a protein complex is predicted from individual tertiary structures of the protein subunits. Protein docking is typically performed in two main steps. The subunits are first docked while keeping them rigid to form the complex, which is then followed by structure refinement. Structure refinement is crucial for the practical use of computational protein docking models, as it aims to correct the conformations of interacting residues and atoms at the interface. Here, we benchmarked the performance of eight existing protein structure refinement methods in the refinement of protein complex models. We show that the fraction of native contacts between subunits is by far the most straightforward metric to improve. However, backbone‐dependent metrics based on the root mean square deviation proved more difficult to improve via refinement.
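The fraction of native contacts mentioned in this abstract can be illustrated with a short set‐based calculation. This is a minimal sketch, assuming that inter‐subunit contacts have already been extracted as pairs of residues. The contact definition (for example, a distance cutoff) is left out, and the type aliases and function name are hypothetical.

```python
from typing import Set, Tuple

Residue = Tuple[str, int]           # (chain ID, residue number)
Contact = Tuple[Residue, Residue]   # (residue on subunit 1, residue on subunit 2)


def fraction_native_contacts(native: Set[Contact], model: Set[Contact]) -> float:
    """Share of the native interface contacts that are also present in the model."""
    if not native:
        raise ValueError("native interface has no contacts")
    return len(native & model) / len(native)


if __name__ == "__main__":
    # Toy interface: four native contacts, two of them reproduced by the model.
    native = {
        (("A", 10), ("B", 5)),
        (("A", 11), ("B", 5)),
        (("A", 12), ("B", 7)),
        (("A", 30), ("B", 9)),
    }
    model = {
        (("A", 10), ("B", 5)),
        (("A", 12), ("B", 7)),
        (("A", 40), ("B", 2)),  # non-native contact, ignored by the metric
    }
    print(fraction_native_contacts(native, model))  # 0.5
```

Because the metric only counts recovered native contacts, local side‐chain and interface adjustments can raise it without moving the backbone, which is consistent with the abstract's observation that it is easier to improve than RMSD‐based metrics.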

4.

Efrat Mashiach Ruth Nussinov Haim J. Wolfson 《Proteins》2010,78(6):1503-1519


5.

Li C. Xue Rafael A. Jordan Yasser EL‐Manzalawy Drena Dobbs Vasant Honavar 《Proteins》2014,82(2):250-267

Selecting near‐native conformations from the immense number of conformations generated by docking programs remains a major challenge in molecular docking. We introduce DockRank, a novel approach to scoring docked conformations based on the degree to which the interface residues of the docked conformation match a set of predicted interface residues. DockRank uses interface residues predicted by partner‐specific sequence homology‐based protein–protein interface predictor (PS‐HomPPI), which predicts the interface residues of a query protein with a specific interaction partner. We compared the performance of DockRank with several state‐of‐the‐art docking scoring functions using Success Rate (the percentage of cases that have at least one near‐native conformation among the top m conformations) and Hit Rate (the percentage of near‐native conformations that are included among the top m conformations). In cases where it is possible to obtain partner‐specific (PS) interface predictions from PS‐HomPPI, DockRank consistently outperforms both (i) ZRank and IRAD, two state‐of‐the‐art energy‐based scoring functions (improving Success Rate by up to 4‐fold); and (ii) variants of DockRank that use predicted interface residues obtained from several protein interface predictors that do not take into account the binding partner in making interface predictions (improving Success Rate by up to 39‐fold). The latter result underscores the importance of using partner‐specific interface residues in scoring docked conformations. We show that DockRank, when used to re‐rank the conformations returned by ClusPro, improves upon the original ClusPro rankings in terms of both Success Rate and Hit Rate. DockRank is available as a server at http://einstein.cs.iastate.edu/DockRank/ . Proteins 2014; 82:250–267. © 2013 Wiley Periodicals, Inc.
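The two evaluation measures defined in this abstract translate directly into code. The following is a minimal sketch, assuming each docking case is represented as a list of booleans ordered from best to worst rank, where True marks a near‐native conformation. That representation and the function names are hypothetical, and the Hit Rate is computed here for a single case (how the authors aggregate it across cases is not stated in the abstract).

```python
from typing import List, Sequence


def success_rate(cases: Sequence[List[bool]], m: int) -> float:
    """Percentage of cases with at least one near-native conformation in the top m."""
    if not cases:
        raise ValueError("no docking cases given")
    successes = sum(1 for ranked in cases if any(ranked[:m]))
    return 100.0 * successes / len(cases)


def hit_rate(ranked: List[bool], m: int) -> float:
    """Percentage of a case's near-native conformations that appear in the top m."""
    total = sum(ranked)
    if total == 0:
        return 0.0  # convention assumed here: no near-natives means no hits
    return 100.0 * sum(ranked[:m]) / total


if __name__ == "__main__":
    cases = [
        [False, True, False, True],    # near-native at ranks 2 and 4
        [False, False, True, False],   # near-native at rank 3 only
        [True, False, False, False],   # near-native at rank 1
        [False, False, False, False],  # no near-native sampled
    ]
    print(success_rate(cases, m=2))  # 50.0
    print(hit_rate(cases[0], m=2))   # 50.0
```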

6.

Sandor Vajda David R. Hall Dima Kozakov 《Proteins》2013,81(11):1874-1884

Most structure prediction algorithms consist of initial sampling of the conformational space, followed by rescoring and possibly refinement of a number of selected structures. Here we focus on protein docking, and show that while decoupling sampling and scoring facilitates method development, integration of the two steps can lead to substantial improvements in docking results. Since decoupling is usually achieved by generating a decoy set containing both non‐native and near‐native docked structures, which can be then used for scoring function construction, we first review the roles and potential pitfalls of decoys in protein–protein docking, and show that some types of decoys are better than others for method development. We then describe three case studies showing that complete decoupling of scoring from sampling is not the best choice for solving realistic docking problems. Although some of the examples are based on our own experience, the results of the CAPRI docking and scoring experiments also show that performing both sampling and scoring generally yields better results than scoring the structures generated by all predictors. Next we investigate how the selection of training and decoy sets affects the performance of the scoring functions obtained. Finally, we discuss pathways to better alignment of the two steps, and show some algorithms that achieve a certain level of integration. Although we focus on protein–protein docking, our observations most likely also apply to other conformational search problems, including protein structure prediction and the docking of small molecules to proteins. Proteins 2013; 81:1874–1884. © 2013 Wiley Periodicals, Inc.

7.

Petr Popov David W. Ritchie Sergei Grudinin 《Proteins》2014,82(1):34-44

In spite of the abundance of oligomeric proteins within a cell, the structural characterization of protein–protein interactions is still a challenging task. In particular, many of these interactions involve heteromeric complexes, which are relatively difficult to determine experimentally. Hence there is growing interest in using computational techniques to model such complexes. However, assembling large heteromeric complexes computationally is a highly combinatorial problem. Nonetheless the problem can be simplified greatly by considering interactions between protein trimers. After dimers and monomers, triangular trimers (i.e. trimers with pair‐wise contacts between all three pairs of proteins) are the most frequently observed quaternary structural motifs according to the three‐dimensional (3D) complex database. This article presents DockTrina, a novel protein docking method for modeling the 3D structures of nonsymmetrical triangular trimers. The method takes as input pair‐wise contact predictions from a rigid body docking program. It then scans and scores all possible combinations of pairs of monomers using a very fast root mean square deviation test. Finally, it ranks the predictions using a scoring function which combines triples of pair‐wise contact terms and a geometric clash penalty term. The overall approach takes less than 2 min per complex on a modern desktop computer. The method is tested and validated using a benchmark set of 220 bound and seven unbound protein trimer structures. DockTrina will be made available at http://nano‐d.inrialpes.fr/software/docktrina . Proteins 2014; 82:34–44. © 2013 Wiley Periodicals, Inc.

8.

Panagiotis L. Kastritis Koen M. Visscher Aalt D. J. van Dijk Alexandre M. J. J. Bonvin 《Proteins》2013,81(3):510-518

HADDOCK is one of the few docking programs that can explicitly account for water molecules in the docking process. Its solvated docking protocol starts from hydrated molecules and a fraction of the resulting interfacial waters is subsequently removed in a biased Monte Carlo procedure based on water‐mediated contact probabilities. The latter were derived from an analysis of water contact frequencies from high‐resolution crystal structures. Here, we introduce a simple water‐mediated amino acid–amino acid contact probability scale derived from the Kyte‐Doolittle hydrophobicity scale and assess its performance on the largest high‐resolution dataset developed to date for solvated docking. Both scales yield high‐quality docking results. The novel and simple hydrophobicity scale, which should reflect better the physicochemical principles underlying contact propensities, leads to a performance improvement of around 10% in ranking, cluster quality and water recovery at the interface compared with the statistics‐based original solvated docking protocol. Proteins 2013. © 2012 Wiley Periodicals, Inc.

9.

Subunit-specific backbone NMR assignments of a 64 kDa trp repressor/DNA complex: A role for N-terminal residues in tandem binding

Xi Shan Kevin H. Gardner D.R. Muhandiram Lewis E. Kay Cheryl H. Arrowsmith 《Journal of biomolecular NMR》1998,11(3):307-318

Deuterium decoupled, triple resonance NMR spectroscopy was used to analyze complexes of ²H,¹⁵N,¹³C labelled intact and (des2–7) trp repressor (Δ2–7 trpR) from E. coli bound in tandem to an idealized 22 basepair trp operator DNA fragment and the corepressor 5-methyltryptophan. The DNA sequence used here binds two trpR dimers in tandem resulting in chemically nonequivalent environments for the two subunits of each dimer. Sequence- and subunit-specific NMR resonance assignments were made for backbone ¹HN, ¹⁵N, ¹³Cα positions in both forms of the protein and for ¹³Cβ in the intact repressor. The differences in backbone chemical shifts between the two subunits within each dimer of Δ2–7 trpR reflect dimer-dimer contacts involving the helix-turn-helix domains and N-terminal residues consistent with a previously determined crystal structure [Lawson and Carey (1993) Nature, 366, 178–182]. Comparison of the backbone chemical shifts of DNA-bound Δ2–7 trpR with those of DNA-bound intact trpR reveals significant changes for those residues involved in N-terminal-mediated interactions observed in the crystal structure. In addition, our solution NMR data contain three sets of resonances for residues 2–12 in intact trpR suggesting that the N-terminus has multiple conformations in the tandem complex. Analysis of Cα chemical shifts using a chemical shift index (CSI) modified for deuterium isotope effects has allowed a comparison of the secondary structure of intact and Δ2–7 trpR. Overall these data demonstrate that NMR backbone chemical shift data can be readily used to study specific structural details of large protein complexes.

10.

Myong‐Ho Chae Florian Krull Stephan Lorenzen Ernst‐Walter Knapp 《Proteins》2010,78(4):1026-1039

A major challenge of the protein docking problem is to define scoring functions that can distinguish near‐native protein complex geometries from a large number of non‐native geometries (decoys) generated with noncomplexed protein structures (unbound docking). In this study, we have constructed a neural network that employs the information from atom‐pair distance distributions of a large number of decoys to predict protein complex geometries. We found that docking prediction can be significantly improved using two different types of polar hydrogen atoms. To train the neural network, 2000 near‐native decoys of even distance distribution were used for each of the 185 considered protein complexes. The neural network normalizes the information from different protein complexes using an additional protein complex identity input neuron for each complex. The parameters of the neural network were determined such that they mimic a scoring funnel in the neighborhood of the native complex structure. The neural network approach avoids the reference state problem, which occurs in deriving knowledge‐based energy functions for scoring. We show that a distance‐dependent atom pair potential performs much better than a simple atom‐pair contact potential. We have compared the performance of our scoring function with other empirical and knowledge‐based scoring functions such as ZDOCK 3.0, ZRANK, ITScore‐PP, EMPIRE, and RosettaDock. In spite of the simplicity of the method and its functional form, our neural network‐based scoring function achieves a reasonable performance in rigid‐body unbound docking of proteins. Proteins 2010. © 2009 Wiley‐Liss, Inc.

11.

Lydie Vamparys Benoist Laurent Alessandra Carbone Sophie Sacquin‐Mora 《Proteins》2016,84(10):1408-1421

Protein–protein interactions play a key part in most biological processes and understanding their mechanism is a fundamental problem leading to numerous practical applications. The prediction of protein binding sites in particular is of paramount importance since proteins now represent a major class of therapeutic targets. Among other methods, docking simulations between two proteins known to interact can be a useful tool for the prediction of likely binding patches on a protein surface. From the analysis of the protein interfaces generated by a massive cross‐docking experiment using the 168 proteins of the Docking Benchmark 2.0, where all possible protein pairs, and not only experimental ones, have been docked together, we show that it is also possible to predict a protein's binding residues without having any prior knowledge regarding its potential interaction partners. Evaluating the performance of cross‐docking predictions using the area under the specificity‐sensitivity ROC curve (AUC) leads to an AUC value of 0.77 for the complete benchmark (compared to the 0.5 AUC value obtained for random predictions). Furthermore, a new clustering analysis performed on the binding patches that are scattered on the protein surface shows that their distribution and growth will depend on the protein's functional group. Finally, in several cases, the binding‐site predictions resulting from the cross‐docking simulations will lead to the identification of an alternate interface, which corresponds to the interaction with a biomolecular partner that is not included in the original benchmark. Proteins 2016; 84:1408–1421. © 2016 The Authors Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
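The AUC figures quoted in this abstract (0.77 versus 0.5 for random predictions) can be read through the rank interpretation of the ROC AUC: it is the probability that a randomly chosen true interface residue receives a higher score than a randomly chosen non‐interface residue, with ties counted as one half. The sketch below computes that quantity directly. It is a generic illustration rather than the authors' pipeline, and the per‐residue scores and labels are made‐up inputs.

```python
from typing import Sequence


def roc_auc(scores: Sequence[float], labels: Sequence[bool]) -> float:
    """ROC AUC via pairwise comparison of positives and negatives (ties count 0.5)."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Hypothetical per-residue binding propensities and true interface labels.
    scores = [0.9, 0.7, 0.6, 0.4, 0.3, 0.1]
    labels = [True, False, True, True, False, False]
    print(roc_auc(scores, labels))
```

The pairwise form is quadratic in the number of residues, which is fine for a single protein surface. For large benchmarks, a sort‐based implementation would be the usual choice.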

12.

Mateusz Kurcinski Aleksandra Badaczewska‐Dawid Michal Kolinski Andrzej Kolinski Sebastian Kmiecik 《Protein science : a publication of the Protein Society》2020,29(1):211-222

Molecular docking of peptides to proteins can be a useful tool in the exploration of the possible peptide binding sites and poses. CABS‐dock is a method for protein–peptide docking that features significant conformational flexibility of both the peptide and the protein molecules during the peptide search for a binding site. CABS‐dock has been made available as a web server and a standalone package. The web server is an easy‐to‐use tool with a simple web interface. The standalone package is a command‐line program dedicated to professional users. It offers a number of advanced features, analysis tools and support for large‐sized systems. In this article, we outline the current status of the CABS‐dock method, its recent developments, applications, and challenges ahead.

13.

Alberto Meseguer Lluis Dominguez Patricia M. Bota Joaquim Aguirre‐Plans Jaume Bonet Narcis Fernandez‐Fuentes Baldo Oliva 《Protein science : a publication of the Protein Society》2020,29(10):2112-2130

Protein–protein interactions (PPIs) are involved in all the molecular aspects that take place both inside and outside cells. However, determining experimentally the structure and affinity of PPIs is expensive and time consuming. Therefore, the development of computational tools, as a complement to experimental methods, is fundamental. Here, we present a computational suite, MODPIN, to model and predict the changes of binding affinity of PPIs. In this approach we use homology modeling to derive the structures of PPIs and score them using state‐of‐the‐art scoring functions. We explore the conformational space of PPIs by generating not a single structural model but a collection of structural models with different conformations based on several templates. We apply the approach to predict the changes in free energy upon mutations and splicing variants of large datasets of PPIs to statistically quantify the quality and accuracy of the predictions. As an example, we use MODPIN to study the effect of mutations in the interaction between colicin endonuclease 9 and colicin endonuclease 2 immune protein from Escherichia coli. Finally, we have compared our results with other state‐of‐the‐art methods.

14.

Ren Kong Ran-Ran Liu Xi-Ming Xu Da-Wei Zhang Xiao-Shuang Xu Hang Shi Shan Chang 《Proteins》2020,88(8):1100-1109

Integration of template-based modeling, global sampling and precise scoring is crucial for the development of molecular docking programs with improved accuracy. We combined template-based modeling and an ab-initio docking protocol as a hybrid docking strategy called CoDock for the docking and scoring experiments of the seventh CAPRI edition. For CAPRI rounds 38-45, we obtained acceptable or better models in the top 10 submissions for eight out of the 16 evaluated targets as predictors, and for nine out of the 16 targets as scorers. In particular, we submitted acceptable models for all of the evaluated protein-oligosaccharide targets. For the CASP13-CAPRI experiment (round 46), we obtained acceptable or better models in the top 5 submissions for 10 out of the 20 evaluated targets as predictors, and for 11 out of the 20 targets as scorers. The failed cases for our group were mainly the difficult targets and the protein-peptide systems in CAPRI and CASP13-CAPRI experiments. In summary, this CAPRI edition showed that our hybrid docking strategy can be efficiently adapted to the increasing variety of challenges in the field of molecular interactions.

15.

Jan H. Hoh 《Proteins》1998,32(2):223-228

It is proposed that the thermally driven motion of certain polypeptide chains, including those that are part of an otherwise stable folded protein, produces time-averaged three-dimensional domains that confer unique functions to a protein. These domains may be controlled by collapsing the polypeptide into an enthalpically favored structure, or extending it into an entropically dominated form. In the extended form, these domains occupy a relatively large space, which may be used to regulate protein–protein interactions and confer mechanical properties to proteins. This “entropic bristle” model makes several predictions about the structure and properties of these domains, and the predictions are used to reevaluate a range of biophysical studies on proteins. The outcome of the analysis suggests that the entropic bristle can be used to explain a wide range of disparate and apparently unrelated experimental observations. Proteins 32:223–228, 1998. © 1998 Wiley-Liss, Inc.

16.

Yaqin Hou Haihua Quan Weiwei Xu Yongli Bao Yuxin Li Yuan Fu Shuxue Zou 《Protein science : a publication of the Protein Society》2013,22(8):1060-1070

A plethora of both experimental and computational methods have been proposed in the past 20 years for the identification of hot spots at a protein–protein interface. The experimental determination of a protein–protein complex followed by alanine scanning mutagenesis, though able to determine hot spots with much precision, is expensive and has no guarantee of success while the accuracy of the current computational methods for hot‐spot identification remains low. Here, we present a novel structure‐based computational approach that accurately determines hot spots through docking into a set of proteins homologous to only one of the two interacting partners of a compound capable of disrupting the protein–protein interaction (PPI). This approach has been applied to identify the hot spots of human activin receptor type II (ActRII) critical for its binding toward Cripto‐I. The subsequent experimental confirmation of the computationally identified hot spots portends a potentially accurate method for hot‐spot determination in silico given a compound capable of disrupting the PPI in question. The hot spots of human ActRII first reported here may well become the focal points for the design of small molecule drugs that target the PPI. The determination of their interface may have significant biological implications in that it suggests that Cripto‐I plays an important role in both activin and nodal signal pathways.

17.

K. Lauren Hindle Jordi Bella Simon C. Lovell 《Proteins》2009,77(2):342-358

Leucine‐rich repeat (LRR) proteins form a large and diverse family. They have a wide range of functions most of which involve the formation of protein–protein interactions. All known LRR structures form curved solenoids, although there is large variation in their curvature. It is this curvature that determines the shape and dimensions of the inner space available for ligand binding. Unfortunately, large‐scale parameters such as the overall curvature of a protein domain are extremely difficult to predict. Here, we present a quantitative analysis of determinants of curvature of this family. Individual repeats typically range in length between 20 and 30 residues and have a variety of secondary structures on their convex side. The observed curvature of the LRR domains correlates poorly with the lengths of their individual repeats. We have, therefore, developed a scoring function based on the secondary structure of the convex side of the protein that allows prediction of the overall curvature with a high degree of accuracy. We also demonstrate the effectiveness of this method in selecting a suitable template for comparative modeling. We have developed an automated, quantitative protocol that can be used to predict accurately the curvature of leucine‐rich repeat proteins of unknown structure from sequence alone. This protocol is available as an online resource at http://www.bioinf.manchester.ac.uk/curlrr/ . Proteins 2009. © 2009 Wiley‐Liss, Inc.

18.

Gong X Liu B Chang S Li C Chen W Wang C 《Science China: Life Sciences (English Edition)》2010,53(9):1152-1161

A holistic protein-protein molecular docking approach, HoDock, was established, composed of such steps as binding site prediction, initial complex structure sampling, refined complex structure sampling, structure clustering, scoring and final structure selection. This article explains the detailed steps and applications for CAPRI Target 39. The CAPRI result showed that three predicted binding site residues, A191HIS, B512ARG and B531ARG, were correct, and there were five submitted structures with a high fraction of correct receptor-ligand interface residues, indicating that this docking approach may improve prediction accuracy for protein-protein complex structures.

19.

Mark Nicholas Wass Carles Pons Florencio Pazos Alfonso Valencia 《Molecular systems biology》2011,7(1)

Deciphering the whole network of protein interactions for a given proteome (‘interactome’) is the goal of many experimental and computational efforts in Systems Biology. Separately, the prediction of the structure of protein complexes by docking methods is a well‐established scientific area. To date, docking programs have not been used to predict interaction partners. We provide a proof of principle for such an approach. Using a set of protein complexes representing known interactors in their unbound form, we show that a standard docking program can distinguish the true interactors from a background of 922 non‐redundant potential interactors. We additionally show that true interactions can be distinguished from non‐likely interacting proteins within the same structural family. Our approach may be put in the context of the proposed ‘funnel‐energy model’; the docking algorithm may not find the native complex, but it distinguishes binding partners because of the higher probability of favourable models compared with a collection of non‐binders. The potential exists to develop this proof of principle into new approaches for predicting interaction partners and reconstructing biological networks.

20.

D. V. S. Ravikant Ron Elber 《Proteins》2010,78(2):400-419

Identifying correct binding modes in a large set of models is an important step in protein–protein docking. We identified a protein docking filter based on overlap area that significantly reduces the number of candidate structures that require detailed examination. We also developed potentials based on residue contacts and overlap areas using a comprehensive learning set of 640 two‐chain protein complexes with mathematical programming. Our potential showed substantially better recognition capacity compared to other publicly accessible protein docking potentials in discriminating between native and nonnative binding modes on a large test set of 84 complexes independent of our training set. We were able to rank a near‐native model at the top in 43 cases and within the top 10 in 51 cases. We also report an atomic potential that ranks a near‐native model at the top in 46 cases and within the top 10 in 58 cases. Our filter+potential is well suited for selecting a small set of models to be refined to atomic resolution. Proteins 2010. © 2009 Wiley‐Liss, Inc.