首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
Structural data as collated in the Protein Data Bank (PDB) have been widely applied in the study and prediction of protein-protein interactions. However, since the basic PDB Entries contain only the contents of the asymmetric unit rather than the biological unit, some key interactions may be missed by analysing only the PDB Entry. A total of 69,054 SCOP (Structural Classification of Proteins) domains were examined systematically to identify the number of additional novel interacting domain pairs and interfaces found by considering the biological unit as stored in the PQS (Protein Quaternary Structure) database. The PQS data adds 25,965 interacting domain pairs to those seen in the PDB Entries to give a total of 61,783 redundant interacting domain pairs. Redundancy filtering at the level of the SCOP family shows PQS to increase the number of novel interacting domain-family pairs by 302 (13.3%) from 2277, but only 16/302 (1.4%) of the interacting domain pairs have the two domains in different SCOP families. This suggests the biological units add little to the elucidation of novel biological interaction networks. However, when the orientation of the domain pairs is considered, the PQS data increases the number of novel domain-domain interfaces observed by 1455 (34.5%) to give 5677 non-redundant domain-domain interfaces. In all, 162/1455 novel domain-domain interfaces are between domains from different families, an increase of 8.9% over the PDB Entries. Overall, the PQS biological units provide a rich source of novel domain-domain interfaces that are not seen in the studied PDB Entries, and so PQS domain-domain interaction data should be exploited wherever possible in the analysis and prediction of protein-protein interactions.  相似文献   

3.
    
We present a novel partner‐specific protein–protein interaction site prediction method called PAIRpred. Unlike most existing machine learning binding site prediction methods, PAIRpred uses information from both proteins in a protein complex to predict pairs of interacting residues from the two proteins. PAIRpred captures sequence and structure information about residue pairs through pairwise kernels that are used for training a support vector machine classifier. As a result, PAIRpred presents a more detailed model of protein binding, and offers state of the art accuracy in predicting binding sites at the protein level as well as inter‐protein residue contacts at the complex level. We demonstrate PAIRpred's performance on Docking Benchmark 4.0 and recent CAPRI targets. We present a detailed performance analysis outlining the contribution of different sequence and structure features, together with a comparison to a variety of existing interface prediction techniques. We have also studied the impact of binding‐associated conformational change on prediction accuracy and found PAIRpred to be more robust to such structural changes than existing schemes. As an illustration of the potential applications of PAIRpred, we provide a case study in which PAIRpred is used to analyze the nature and specificity of the interface in the interaction of human ISG15 protein with NS1 protein from influenza A virus. Python code for PAIRpred is available at http://combi.cs.colostate.edu/supplements/pairpred/ . Proteins 2014; 82:1142–1155. © 2013 Wiley Periodicals, Inc.  相似文献   

4.
Rigid-body docking has become quite successful in predicting the correct conformations of binary protein complexes, at least when the constituent proteins do not undergo large conformational changes upon binding. However, determining whether two given proteins interact is a more difficult problem. Successful docking procedures often give equally good scores for proteins that do not interact experimentally. This is the case for the multiple minimization approach we use here. An analysis of the results where all proteins within a set are docked with all other proteins (complete cross-docking) shows that the predictions can be greatly improved if the location of the correct binding interface on each protein is known, since the experimental complexes are much more likely to bring these two interfaces into contact, at the same time as yielding good interaction energy scores. While various methods exist for identifying binding interfaces, it is shown that simply studying the interaction of all potential protein pairs within a data set can itself help to identify the correct interfaces.  相似文献   

5.
    
Cells are interactive living systems where proteins movements, interactions and regulation are substantially free from centralized management. How protein physico‐chemical and geometrical properties determine who interact with whom remains far from fully understood. We show that characterizing how a protein behaves with many potential interactors in a complete cross‐docking study leads to a sharp identification of its cellular/true/native partner(s). We define a sociability index, or S‐index, reflecting whether a protein likes or not to pair with other proteins. Formally, we propose a suitable normalization function that accounts for protein sociability and we combine it with a simple interface‐based (ranking) score to discriminate partners from non‐interactors. We show that sociability is an important factor and that the normalization permits to reach a much higher discriminative power than shape complementarity docking scores. The social effect is also observed with more sophisticated docking algorithms. Docking conformations are evaluated using experimental binding sites. These latter approximate in the best possible way binding sites predictions, which have reached high accuracy in recent years. This makes our analysis helpful for a global understanding of partner identification and for suggesting discriminating strategies. These results contradict previous findings claiming the partner identification problem being solvable solely with geometrical docking. Proteins 2016; 85:137–154. © 2016 Wiley Periodicals, Inc.  相似文献   

6.
    
A holistic protein-protein molecular docking approach, HoDock, was established, composed of such steps as binding site prediction, initial complex structure sampling, refined complex structure sampling, structure clustering, scoring and final structure selection. This article explains the detailed steps and applications for CAPRI Target 39. The CAPRI result showed that three predicted binding site residues, A191HIS, B512ARG and B531ARG, were correct, and there were five submitted structures with a high fraction of correct receptor-ligand interface residues, indicating that this docking approach may improve prediction accuracy for protein-protein complex structures.  相似文献   

7.
    
  相似文献   

8.
    
Protein functional sites control most biological processes and are important targets for drug design and protein engineering. To characterize them, the evolutionary trace (ET) ranks the relative importance of residues according to their evolutionary variations. Generally, top‐ranked residues cluster spatially to define evolutionary hotspots that predict functional sites in structures. Here, various functions that measure the physical continuity of ET ranks among neighboring residues in the structure, or in the sequence, are shown to inform sequence selection and to improve functional site resolution. This is shown first, in 110 proteins, for which the overlap between top‐ranked residues and actual functional sites rose by 8% in significance. Then, on a structural proteomic scale, optimized ET led to better 3D structure‐function motifs (3D templates) and, in turn, to enzyme function prediction by the Evolutionary Trace Annotation (ETA) method with better sensitivity of (40% to 53%) and positive predictive value (93% to 94%). This suggests that the similarity of evolutionary importance among neighboring residues in the sequence and in the structure is a universal feature of protein evolution. In practice, this yields a tool for optimizing sequence selections for comparative analysis and, via ET, for better predictions of functional site and function. This should prove useful for the efficient mutational redesign of protein function and for pharmaceutical targeting.  相似文献   

9.
The ability of certain Src homology 3 (SH3) domains to bind specifically both type I and type II polyproline ligands is perhaps the best characterized, but also the worst understood, example in the family of protein-interaction modules. A detailed analysis of the structural variations in SH3 domains, with respect to ligand-binding specificity, together with mutagenesis of SH3 Fyn tyrosine kinase, reveal the structural basis for types I and II binding specificity by SH3 domains. The conserved Trp in the SH3 binding pocket can adopt two different orientations that, in turn, determine the type of ligand (I or II) able to bind to the domain. The only exceptions are ligands with Leu at positions P(-1) and P(2), that deviate from standard poly-Pro angles. The motion of the conserved Trp depends on the presence of certain residues located in a key position (132 for Fyn), near the binding pocket. SH3 domains placing aromatic residues in this key position are promiscuous. By contrast, those presenting beta-branched or long aliphatic residues block the conserved Trp in one of the two possible orientations, preventing binding in a type I orientation. This is experimentally demonstrated by a single mutation in Fyn SH3 (Y132I) that abolishes type I ligand binding, while preserving binding to type II ligands. Thus, simple conformational changes, governed by simple rules, can have profound effects on protein-protein interactions, highlighting the importance of structural details to predict protein-protein interactions.  相似文献   

10.
11.
We compare the geometric and physical-chemical properties of interfaces involved in specific and non-specific protein-protein interactions in crystal structures reported in the Protein Data Bank. Specific interactions are illustrated by 70 protein-protein complexes and by subunit contacts in 122 homodimeric proteins; non-specific interactions are illustrated by 188 pairs of monomeric proteins making crystal-packing contacts selected to bury more than 800 A2 of protein surface. A majority of these pairs have 2-fold symmetry and form crystal dimers that cannot be distinguished from real dimers on the basis of the interface size or symmetry. The chemical and amino acid compositions of the large crystal-packing interfaces resemble the protein solvent-accessible surface. These interfaces are less hydrophobic than in homodimers and contain much fewer fully buried atoms. We develop a residue propensity score and a hydrophobic interaction score to assess preferences seen in the chemical and amino acid compositions of the different types of interfaces, and we derive indexes to evaluate the atomic packing, which we find to be less compact at non-specific than at specific interfaces. We test the capacity of these parameters to identify homodimeric proteins in crystal structures, and show that a simple combination of the non-polar interface area and the fraction of buried interface atoms assigns the quaternary structure of 88% of the homodimers and 77% of the monomers in our data set correctly. These success rates increase to 93-95% when the residue propensity score of the interfaces is taken into consideration.  相似文献   

12.
In order to identify the amino acids that determine protein structure and function it is useful to rank them by their relative importance. Previous approaches belong to two groups; those that rely on statistical inference, and those that focus on phylogenetic analysis. Here, we introduce a class of hybrid methods that combine evolutionary and entropic information from multiple sequence alignments. A detailed analysis in insulin receptor kinase domain and tests on proteins that are well-characterized experimentally show the hybrids' greater robustness with respect to the input choice of sequences, as well as improved sensitivity and specificity of prediction. This is a further step toward proteome scale analysis of protein structure and function.  相似文献   

13.
蛋白质-蛋白质相互作用及其抑制剂研究进展   总被引:1,自引:0,他引:1  
赵亚雪  唐赟 《生命科学》2007,19(5):506-511
蛋白质-蛋白质相互作用在细胞活动和生命过程中扮演着非常重要的角色。基因调节、免疫应答、信号转导、细胞组装等等都离不开蛋白质-蛋白质的相互作用。近几年,靶向蛋白质-蛋白质相互作用及其抑制剂研究也逐渐成为研究的热点;但是蛋白质复合物相互作用界面的一些特点和性质,如相互作用界面较大、结合界面较为平坦等,使蛋白质-蛋白质相互作用及其抑制剂研究充满了挑战。本文主要总结了蛋白质-蛋白质相互作用界面的一些性质和特点,分析了界面特性与其抑制剂设计的关系,并讨论了蛋白质-蛋白质相互作用的理论预测方法及其抑制剂的类型和特点,最后又通过实例说明了如何进行蛋白质-蛋白质相互作用抑制剂的设计。  相似文献   

14.
Mannose is an abundant cell surface monosaccharide and has an important role in many biochemical processes. It binds to a greatdiversity of receptor proteins. In this study we have employed Random Forest for prediction of mannose binding sites. Mannosebindingsite is taken to be a sphere around the centroid of the ligand and the sphere is subdivided into different layers and atomwise and residue wise features were extracted for each layer. The method achieves 95.59 % of accuracy using Random Forest with10 fold cross validation. Prediction of mannose binding site analysis will be quite useful in drug design.  相似文献   

15.
Hetero dimer (different monomers) interfaces are involved in catalysis and regulation through the formation of interface active sites. This is critical in cell and molecular biology events. The physical and chemical factors determining the formation of the interface active sites is often large in numbers. The combined role of interacting features is frequently combinatorial and additive in nature. Therefore, it is important to determine the physical and chemical features of such interactions. A number of such features have been documented in literature since 1975. However, the use of such interaction features in the prediction of interaction partners and sites given their sequences is still a challenge. In a non-redundant dataset of 156 hetero-dimer structures determined by X-ray crystallography, the interacting partners are often varying in size and thus, size variation between subunits is an important factor in determining the mode of interface formation. The size of protein subunits interacting are either small-small, largelarge, medium-medium, large-small, large-medium and small-medium. It should also be noted that the interface formed between subunits have physical interactions at N terminal (N), C terminal (C) and middle (M) region of the protein with reference to their sequences in one dimension. These features are believed to have application in the prediction of interaction partners and sites from sequences. However, the use of such features for interaction prediction from sequence is not currently clear.  相似文献   

16.
    
  相似文献   

17.
    
Yun SH  Ji SC  Jeon HJ  Wang X  Lee Y  Choi BS  Lim HM 《Molecules and cells》2012,33(2):211-216
Cnu is a small 71-amino acid protein that complexes with H-NS and binds to a specific sequence in the replication origin of the E. coli chromosome. To understand the mechanism of interaction between Cnu and H-NS, we used bacterial genetics to select and analyze Cnu variants that cannot complex with H-NS. Out of 2,000 colonies, 40 Cnu variants were identified. Most variants (82.5%) had a single mutation, but a few variants (17.5%) had double amino acid changes. An in vitro assay was used to identify Cnu variants that were truly defective in H-NS binding. The changes in these defective variants occurred exclusively at charged amino acids (Asp, Glu, or Lys) on the surface of the protein. We propose that the attractive force that governs the Cnu-H-NS interaction is an ionic bond, unlike the hydrophobic interaction that is the major attractive force in most proteins.  相似文献   

18.
    
Cell-surface-anchored immunoglobulin superfamily (IgSF) proteins are widespread throughout the human proteome, forming crucial components of diverse biological processes including immunity, cell-cell adhesion, and carcinogenesis. IgSF proteins generally function through protein-protein interactions carried out between extracellular, membrane-bound proteins on adjacent cells, known as trans-binding interfaces. These protein-protein interactions constitute a class of pharmaceutical targets important in the treatment of autoimmune diseases, chronic infections, and cancer. A molecular-level understanding of IgSF protein-protein interactions would greatly benefit further drug development. A critical step toward this goal is the reliable identification of IgSF trans-binding interfaces. We propose a novel combination of structure and sequence information to identify trans-binding interfaces in IgSF proteins. We developed a structure-based binding interface prediction approach that can identify broad regions of the protein surface that encompass the binding interfaces and suggests that IgSF proteins possess binding supersites. These interfaces could theoretically be pinpointed using sequence-based conservation analysis, with performance approaching the theoretical upper limit of binding interface prediction accuracy, but achieving this in practice is limited by the current ability to identify an appropriate multiple sequence alignment for conservation analysis. However, an important contribution of combining the two orthogonal methods is that agreement between these approaches can estimate the reliability of the predictions. This approach was benchmarked on the set of 22 IgSF proteins with experimentally solved structures in complex with their ligands. Additionally, we provide structure-based predictions and reliability scores for the 62 IgSF proteins with known structure but yet uncharacterized binding interfaces.  相似文献   

19.
    
Natural evolution is driven by random mutations that improve fitness. In vitro evolution mimics this process, however, on a short time-scale and is driven by the given bait. Here, we used directed in vitro evolution of a random mutant library of Uracil glycosylase (eUNG) displayed on yeast surface to select for binding to chaperones GroEL, DnaK + DnaJ + ATP (DnaKJ) or E. coli cell extract (CE), using binding to the eUNG inhibitor Ugi as probe for native fold. The CE selected population was further divided to Ugi binders (+U) or non-binders (?U). The aim here was to evaluate the sequence space and physical state of the evolved protein binding the different baits. We found that GroEL, DnaKJ and CE-U select and enrich for mutations causing eUNG to misfold, with the three being enriched in mutations in buried and conserved positions, with a tendency to increase positive charge. Still, each selection had its own trajectory, with GroEL and CE-U selecting mutants highly sensitive to protease cleavage while DnaKJ selected partially structured misfolded species with a tendency to refold, making them less sensitive to proteases. More general, our results show that GroEL has a higher tendency to purge promiscuous misfolded protein mutants from the system, while DnaKJ binds misfolding-prone mutant species that are, upon chaperone release, more likely to natively refold. CE-U shares some of the properties of GroEL- and DnaKJ-selected populations, while harboring also unique properties that can be explained by the presence of additional chaperones in CE, such as Trigger factor, HtpG and ClpB.  相似文献   

20.
    
Mihalek I  Res I  Lichtarge O 《Proteins》2006,63(1):87-99
  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号