首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
MOTIVATION: In recent years, the Protein Data Bank (PDB) has experienced rapid growth. To maximize the utility of the high resolution protein-protein interaction data stored in the PDB, we have developed PIBASE, a comprehensive relational database of structurally defined interfaces between pairs of protein domains. It is composed of binary interfaces extracted from structures in the PDB and the Probable Quaternary Structure server using domain assignments from the Structural Classification of Proteins and CATH fold classification systems. RESULTS: PIBASE currently contains 158,915 interacting domain pairs between 105,061 domains from 2125 SCOP families. A diverse set of geometric, physiochemical and topologic properties are calculated for each complex, its domains, interfaces and binding sites. A subset of the interface properties are used to remove interface redundancy within PDB entries, resulting in 20,912 distinct domain-domain interfaces. The complexes are grouped into 989 topological classes based on their patterns of domain-domain contacts. The binary interfaces and their corresponding binding sites are categorized into 18,755 and 30,975 topological classes, respectively, based on the topology of secondary structure elements. The utility of the database is illustrated by outlining several current applications. AVAILABILITY: The database is accessible via the world wide web at http://salilab.org/pibase SUPPLEMENTARY INFORMATION: http://salilab.org/pibase/suppinfo.html.  相似文献   

2.
Proteins fold into a well-defined structure as a result of the collapse of the polypeptide chain, while transient protein-complex formation mainly is a result of binding of two folded individual monomers. Therefore, a protein-protein interface does not resemble the core of monomeric proteins, but has a more polar nature. Here, we address the question of whether the physico-chemical characteristics of intraprotein versus interprotein bonds differ, or whether interfaces are different from folded monomers only in the preference for certain types of interactions. To address this question we assembled a high resolution, nonredundant, protein-protein interaction database consisting of 1374 homodimer and 572 heterodimer complexes, and compared the physico-chemical properties of these interactions between protein interfaces and monomers. We performed extensive statistical analysis of geometrical properties of interatomic interactions of different types: hydrogen bonds, electrostatic interactions, and aromatic interactions. Our study clearly shows that there is no significant difference in the chemistry, geometry, or packing density of individual interactions between interfaces and monomeric structures. However, the distribution of different bonds differs. For example, side-chain-side-chain interactions constitute over 62% of all interprotein interactions, while they make up only 36% of the bonds stabilizing a protein structure. As on average, properties of backbone interactions are different from those of side chains, a quantitative difference is observed. Our findings clearly show that the same knowledge-based potential can be used for protein-binding sites as for protein structures. However, one has to keep in mind the different architecture of the interfaces and their unique bond preference.  相似文献   

3.
Block P  Weskamp N  Wolf A  Klebe G 《Proteins》2007,68(1):170-186
Since protein-protein interactions play a pivotal role in the communication on the molecular level in virtually every biological system and process, the search and design for modulators of such interactions is of utmost importance. In recent years many inhibitors for specific protein-protein interactions have been developed, however, in only a few cases, small and druglike molecules are able to interfere in the complex formation of proteins. On the other hand, there are several small molecules known to modulate protein-protein interactions by means of stabilizing an already assembled complex. To achieve this goal, a ligand is binding to a pocket, which is located rim-exposed at the interface of the interacting proteins, for example as the phytotoxin Fusicoccin, which stabilizes the interaction of plant H+-ATPase and 14-3-3 protein by nearly a factor of 100. To suggest alternative leads, we performed a virtual screening campaign to discover new molecules putatively stabilizing this complex. Furthermore, we screen a dataset of 198 transient recognition protein-protein complexes for cavities, which are located rim-exposed at their interfaces. We provide evidence for high similarity between such rim-exposed cavities and usual ligands accommodating active sites of enzymes. This analysis suggests that rim-exposed cavities at protein-protein interfaces are druggable binding sites. Therefore, the principle of stabilizing protein-protein interactions seems to be a promising alternative to the approach of the competitive inhibition of such interactions by small molecules.  相似文献   

4.
Protein heterodimer complexes are often involved in catalysis, regulation, assembly, immunity and inhibition. This involves the formation of stable interfaces between the interacting partners. Hence, it is of interest to describe heterodimer interfaces using known structural complexes. We use a non-redundant dataset of 192 heterodimer complex structures from the protein databank (PDB) to identify interface residues and describe their interfaces using amino-acids residue property preference. Analysis of the dataset shows that the heterodimer interfaces are often abundant in polar residues. The analysis also shows the presence of two classes of interfaces in heterodimer complexes. The first class of interfaces (class A) with more polar residues than core but less than surface is known. These interfaces are more hydrophobic than surfaces, where protein-protein binding is largely hydrophobic. The second class of interfaces (class B) with more polar residues than core and surface is shown. These interfaces are more polar than surfaces, where binding is mainly polar. Thus, these findings provide insights to the understanding of protein-protein interactions.  相似文献   

5.
To elucidate the structural basis of the diversity and universality in protein-protein interactions, an exhaustive all-against-all structural comparison of all known protein interfaces in the Protein Data Bank was performed at atomic resolution. After similar interfaces were clustered, approximately 20,000 structural motifs with at least two members were identified, out of which 3678 motifs consisted of at least 10 interfaces. Except for some trivial interfaces involving single α helices, almost all motifs were found to be confined within single protein families. Furthermore, the interaction partners of each motif were found to be very limited, and, accordingly, the interaction networks of the motifs tend to be small and are much more restricted than the binding sites for small ligand molecules. These findings suggest that, at the level of atomic structures, protein-protein interactions are precisely designed; hence, protein interfaces with multiple interacting partners should involve incompletely overlapping multiple interfaces and/or accommodate structural changes upon binding to their targets.  相似文献   

6.
7.
Structural data as collated in the Protein Data Bank (PDB) have been widely applied in the study and prediction of protein-protein interactions. However, since the basic PDB Entries contain only the contents of the asymmetric unit rather than the biological unit, some key interactions may be missed by analysing only the PDB Entry. A total of 69,054 SCOP (Structural Classification of Proteins) domains were examined systematically to identify the number of additional novel interacting domain pairs and interfaces found by considering the biological unit as stored in the PQS (Protein Quaternary Structure) database. The PQS data adds 25,965 interacting domain pairs to those seen in the PDB Entries to give a total of 61,783 redundant interacting domain pairs. Redundancy filtering at the level of the SCOP family shows PQS to increase the number of novel interacting domain-family pairs by 302 (13.3%) from 2277, but only 16/302 (1.4%) of the interacting domain pairs have the two domains in different SCOP families. This suggests the biological units add little to the elucidation of novel biological interaction networks. However, when the orientation of the domain pairs is considered, the PQS data increases the number of novel domain-domain interfaces observed by 1455 (34.5%) to give 5677 non-redundant domain-domain interfaces. In all, 162/1455 novel domain-domain interfaces are between domains from different families, an increase of 8.9% over the PDB Entries. Overall, the PQS biological units provide a rich source of novel domain-domain interfaces that are not seen in the studied PDB Entries, and so PQS domain-domain interaction data should be exploited wherever possible in the analysis and prediction of protein-protein interactions.  相似文献   

8.
Protein-protein interfaces are regions between 2 polypeptide chains that are not covalently connected. Here, we have created a nonredundant interface data set generated from all 2-chain interfaces in the Protein Data Bank. This data set is unique, since it contains clusters of interfaces with similar shapes and spatial organization of chemical functional groups. The data set allows statistical investigation of similar interfaces, as well as the identification and analysis of the chemical forces that account for the protein-protein associations. Toward this goal, we have developed I2I-SiteEngine (Interface-to-Interface SiteEngine) [Data set available at http://bioinfo3d.cs.tau.ac.il/Interfaces; Web server: http://bioinfo3d.cs.tau.ac.il/I2I-SiteEngine]. The algorithm recognizes similarities between protein-protein binding surfaces. I2I-SiteEngine is independent of the sequence or the fold of the proteins that comprise the interfaces. In addition to geometry, the method takes into account both the backbone and the side-chain physicochemical properties of the interacting atom groups. Its high efficiency makes it suitable for large-scale database searches and classifications. Below, we briefly describe the I2I-SiteEngine method. We focus on the classification process and the obtained nonredundant protein-protein interface data set. In particular, we analyze the biological significance of the clusters and present examples which illustrate that given constellations of chemical groups in protein-protein binding sites may be preferred, and are observed in proteins with different structures and different functions. We expect that these would yield further information regarding the forces stabilizing protein-protein interactions.  相似文献   

9.
MOTIVATION: Large-scale experiments reveal pairs of interacting proteins but leave the residues involved in the interactions unknown. These interface residues are essential for understanding the mechanism of interaction and are often desired drug targets. Reliable identification of residues that reside in protein-protein interface typically requires analysis of protein structure. Therefore, for the vast majority of proteins, for which there is no high-resolution structure, there is no effective way of identifying interface residues. RESULTS: Here we present a machine learning-based method that identifies interacting residues from sequence alone. Although the method is developed using transient protein-protein interfaces from complexes of experimentally known 3D structures, it never explicitly uses 3D information. Instead, we combine predicted structural features with evolutionary information. The strongest predictions of the method reached over 90% accuracy in a cross-validation experiment. Our results suggest that despite the significant diversity in the nature of protein-protein interactions, they all share common basic principles and that these principles are identifiable from sequence alone.  相似文献   

10.
A long-standing question in molecular biology is whether interfaces of protein-protein complexes are more conserved than the rest of the protein surfaces. Although it has been reported that conservation can be used as an indicator for predicting interaction sites on proteins, there are recent reports stating that the interface regions are only slightly more conserved than the rest of the protein surfaces, with conservation signals not being statistically significant enough for predicting protein-protein binding sites. In order to properly address these controversial reports we have studied a set of 28 well resolved hetero complex structures of proteins that consists of transient and non-transient complexes. The surface positions were classified into four conservation classes and the conservation index of the surface positions was quantitatively analyzed. The results indicate that the surface density of highly conserved positions is significantly higher in the protein-protein interface regions compared with the other regions of the protein surface. However, the average conservation index of the patches in the interface region is not significantly higher compared with other surface regions of the protein structures. This finding demonstrates that the number of conserved residue positions is a more appropriate indicator for predicting protein-protein binding sites than the average conservation index in the interacting region. We have further validated our findings on a set of 59 benchmark complex structures. Furthermore, an analysis of 19 complexes of antigen-antibody interactions shows that there is no conservation of amino acid positions in the interacting regions of these complexes, as expected, with the variable region of the immunoglobulins interacting mostly with the antigens. Interestingly, antigen interacting regions also have a higher number of non-conserved residue positions in the interacting region than the rest of the protein surface.  相似文献   

11.
Is the whole protein surface available for interaction with other proteins, or are specific sites pre-assigned according to their biophysical and structural character? And if so, is it possible to predict the location of the binding site from the surface properties? These questions are answered quantitatively by probing the surfaces of proteins using spheres of radius of 10 A on a database (DB) of 57 unique, non-homologous proteins involved in heteromeric, transient protein-protein interactions for which the structures of both the unbound and bound states were determined. In structural terms, we found the binding site to have a preference for beta-sheets and for relatively long non-structured chains, but not for alpha-helices. Chemically, aromatic side-chains show a clear preference for binding sites. While the hydrophobic and polar content of the interface is similar to the rest of the surface, hydrophobic and polar residues tend to cluster in interfaces. In the crystal, the binding site has more bound water molecules surrounding it, and a lower B-factor already in the unbound protein. The same biophysical properties were found to hold for the unbound and bound DBs. All the significant interface properties were combined into ProMate, an interface prediction program. This was followed by an optimization step to choose the best combination of properties, as many of them are correlated. During optimization and prediction, the tested proteins were not used for data collection, to avoid over-fitting. The prediction algorithm is fully automated, and is used to predict the location of potential binding sites on unbound proteins with known structures. The algorithm is able to successfully predict the location of the interface for about 70% of the proteins. The success rate of the predictor was equal whether applied on the unbound DB or on the disjoint bound DB. A prediction is assumed correct if over half of the predicted continuous interface patch is indeed interface. The ability to predict the location of protein-protein interfaces has far reaching implications both towards our understanding of specificity and kinetics of binding, as well as in assisting in the analysis of the proteome.  相似文献   

12.
One possible path towards understanding the biological function of a target protein is through the discovery of how it interfaces within protein-protein interaction networks. The goal of this study was to create a virtual protein-protein interaction model using the concepts of orthologous conservation (or interologs) to elucidate the interacting networks of a particular target protein. POINT (the prediction of interactome database) is a functional database for the prediction of the human protein-protein interactome based on available orthologous interactome datasets. POINT integrates several publicly accessible databases, with emphasis placed on the extraction of a large quantity of mouse, fruit fly, worm and yeast protein-protein interactions datasets from the Database of Interacting Proteins (DIP), followed by conversion of them into a predicted human interactome. In addition, protein-protein interactions require both temporal synchronicity and precise spatial proximity. POINT therefore also incorporates correlated mRNA expression clusters obtained from cell cycle microarray databases and subcellular localization from Gene Ontology to further pinpoint the likelihood of biological relevance of each predicted interacting sets of protein partners.  相似文献   

13.
Hetero dimer (different monomers) interfaces are involved in catalysis and regulation through the formation of interface active sites. This is critical in cell and molecular biology events. The physical and chemical factors determining the formation of the interface active sites is often large in numbers. The combined role of interacting features is frequently combinatorial and additive in nature. Therefore, it is important to determine the physical and chemical features of such interactions. A number of such features have been documented in literature since 1975. However, the use of such interaction features in the prediction of interaction partners and sites given their sequences is still a challenge. In a non-redundant dataset of 156 hetero-dimer structures determined by X-ray crystallography, the interacting partners are often varying in size and thus, size variation between subunits is an important factor in determining the mode of interface formation. The size of protein subunits interacting are either small-small, largelarge, medium-medium, large-small, large-medium and small-medium. It should also be noted that the interface formed between subunits have physical interactions at N terminal (N), C terminal (C) and middle (M) region of the protein with reference to their sequences in one dimension. These features are believed to have application in the prediction of interaction partners and sites from sequences. However, the use of such features for interaction prediction from sequence is not currently clear.  相似文献   

14.
Protein-protein complexes that dissociate and associate readily, often depending on the physiological condition or environment, play an important role in many biological processes. In order to characterise these "transient" protein-protein interactions, two sets of complexes were collected and analysed. The first set consists of 16 experimentally validated "weak" transient homodimers, which are known to exist as monomers and dimers at physiological concentration, with dissociation constants in the micromolar range. A set of 23 functionally validated transient (i.e. intracellular signalling) heterodimers comprise the second set. This set includes complexes that are more stable, with nanomolar binding affinities, and require a molecular trigger to form and break the interaction. In comparison to more stable homodimeric complexes, the weak homodimers demonstrate smaller contact areas between protomers and the interfaces are more planar and polar on average. The physicochemical and geometrical properties of these weak homodimers more closely resemble those of non-obligate hetero-oligomeric complexes, whose components can exist either as monomers or as complexes in vivo. In contrast to the weak transient dimers, "strong" transient dimers often undergo large conformational changes upon association/dissociation and are characterised with larger, less planar and sometimes more hydrophobic interfaces. From sequence alignments we find that the interface residues of the weak transient homodimers are generally more conserved than surface residues, consistent with being constrained to maintain the protein-protein interaction during evolution. Protein families that include members with different oligomeric states or structures are identified, and found to exhibit a lower sequence conservation at the interface. The results are discussed in terms of the physiological function and evolution of protein-protein interactions.  相似文献   

15.
MOTIVATION: Much research has been devoted to the characterization of interaction interfaces found in complexes with known structure. In this context, the interactions of non-homologous domains at equivalent binding sites are of particular interest, as they can reveal convergently evolved interface motifs. Such motifs are an important source of information to formulate rules for interaction specificity and to design ligands based on the common features shared among diverse partners. RESULTS: We develop a novel method to identify non-homologous structural domains which bind at equivalent sites when interacting with a common partner. We systematically apply this method to all pairs of interactions with known structure and derive a comprehensive database for these interactions. Of all non-homologous domains, which bind with a common interaction partner, 4.2% use the same interface of the common interaction partner (excluding immunoglobulins and proteases). This rises to 16% if immunoglobulin and proteases are included. We demonstrate two applications of our database: first, the systematic screening for viral protein interfaces, which can mimic native interfaces and thus interfere; and second, structural motifs in enzymes and its inhibitors. We highlight several cases of virus protein mimicry: viral M3 protein interferes with a chemokine dimer interface. The virus has evolved the motif SVSPLP, which mimics the native SSDTTP motif. A second example is the regulatory factor Nef in HIV which can mimic a kinase when interacting with SH3. Among others the virus has evolved the kinase's PxxP motif. Further, we elucidate motif resemblances in Baculovirus p35 and HIV capsid proteins. Finally, chymotrypsin is subject to scrutiny wrt. its structural similarity to subtilisin and wrt. its inhibitor's similar recognition sites. SUPPLEMENTARY INFORMATION: A database is online at scoppi.biotec.tu-dresden.de/abac/.  相似文献   

16.
17.
Molecular principles of the interactions of disordered proteins   总被引:6,自引:0,他引:6  
Thorough knowledge of the molecular principles of protein-protein recognition is essential to our understanding of protein function at the cellular level. Whereas interactions of ordered proteins have been analyzed in great detail, complexes of intrinsically unstructured/disordered proteins (IUPs) have hardly been addressed so far. Here, we have collected a database of 39 complexes of experimentally verified IUPs, and compared their interfaces with those of 72 complexes of ordered, globular proteins. The characteristic differences found between the two types of complexes suggest that IUPs represent a distinct molecular implementation of the principles of protein-protein recognition. The interfaces do not differ in size, but those of IUPs cover a much larger part of the surface of the protein than for their ordered counterparts. Moreover, IUP interfaces are significantly more hydrophobic relative to their overall amino acid composition, but also in absolute terms. They rely more on hydrophobic-hydrophobic than on polar-polar interactions. Their amino acids in the interface realize more intermolecular contacts, which suggests a better fit with the partner due to induced folding upon binding that results in a better adaptation to the partner. The two modes of interaction also differ in that IUPs usually use only a single continuous segment for partner binding, whereas the binding sites of ordered proteins are more segmented. Probably, all these features contribute to the increased evolutionary conservation of IUP interface residues. These noted molecular differences are also manifested in the interaction energies of IUPs. Our approximation of these by low-resolution force-fields shows that IUPs gain much more stabilization energy from intermolecular contacts, than from folding, i.e. they use their binding energy for folding. Overall, our findings provide a structural rationale to the prior suggestions that many IUPs are specialized for functions realized by protein-protein interactions.  相似文献   

18.
19.

Background

The study and comparison of protein-protein interfaces is essential for the understanding of the mechanisms of interaction between proteins. While there are many methods for comparing protein structures and protein binding sites, so far no methods have been reported for comparing the geometry of non-covalent interactions occurring at protein-protein interfaces.

Methodology/Principal Findings

Here we present a method for aligning non-covalent interactions between different protein-protein interfaces. The method aligns the vector representations of van der Waals interactions and hydrogen bonds based on their geometry. The method has been applied to a dataset which comprises a variety of protein-protein interfaces. The alignments are consistent to a large extent with the results obtained using two other complementary approaches. In addition, we apply the method to three examples of protein mimicry. The method successfully aligns respective interfaces and allows for recognizing conserved interface regions.

Conclusions/Significance

The Galinter method has been validated in the comparison of interfaces in which homologous subunits are involved, including cases of mimicry. The method is also applicable to comparing interfaces involving non-peptidic compounds. Galinter assists users in identifying local interface regions with similar patterns of non-covalent interactions. This is particularly relevant to the investigation of the molecular basis of interaction mimicry.  相似文献   

20.
Rahat O  Yitzhaky A  Schreiber G 《Proteins》2008,71(2):621-630
Protein-protein interactions networks has come to be a buzzword associated with nets containing edges that represent a pair of interacting proteins (e.g. hormone-receptor, enzyme-inhibitor, antigen-antibody, and a subset of multichain biological machines). Yet, each such interaction composes its own unique network, in which vertices represent amino acid residues, and edges represent atomic contacts. Recent studies have shown that analyses of the data encapsulated in these detailed networks may impact predictions of structure-function correlation. Here, we study homologous families of protein-protein interfaces, which share the same fold but vary in sequence. In this context, we address what properties of the network are shared among relatives with different sequences (and hence different atomic interactions) and which are not. Herein, we develop the general mathematical framework needed to compare the modularity of homologous networks. We then apply this analysis to the structural data of a few interface families, including hemoglobin alpha-beta, growth hormone-receptor, and Serine protease-inhibitor. Our results suggest that interface modularity is an evolutionarily conserved property. Hence, protein-protein interfaces can be clustered down to a few modules, with the boundaries being evolutionarily conserved along homologous complexes. This suggests that protein engineering of protein-protein binding sites may be simplified by varying each module, but retaining the overall modularity of the interface.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号