首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 328 毫秒
1.
Studying similarities in protein molecules has become a fundamental activity in much of biology and biomedical research, for which methods such as multiple sequence alignments are widely used. Most methods available for such comparisons cater to studying proteins which have clearly recognizable evolutionary relationships but not to proteins that recognize the same or similar ligands but do not share similarities in their sequence or structural folds. In many cases, proteins in the latter class share structural similarities only in their binding sites. While several algorithms are available for comparing binding sites, there are none for deriving structural motifs of the binding sites, independent of the whole proteins. We report the development of SiteMotif, a new algorithm that compares binding sites from multiple proteins and derives sequence-order independent structural site motifs. We have tested the algorithm at multiple levels of complexity and demonstrate its performance in different scenarios. We have benchmarked against 3 current methods available for binding site comparison and demonstrate superior performance of our algorithm. We show that SiteMotif identifies new structural motifs of spatially conserved residues in proteins, even when there is no sequence or fold-level similarity. We expect SiteMotif to be useful for deriving key mechanistic insights into the mode of ligand interaction, predict the ligand type that a protein can bind and improve the sensitivity of functional annotation.  相似文献   

2.
MOTIVATION: An approach for identifying similarities of protein-protein binding sites is presented. The geometric shape of a binding site is described by computing a feature vector based on moment invariants. In order to search for similarities, feature vectors of binding sites are compared. Similar feature vectors indicate binding sites with similar shapes. RESULTS: The approach is validated on a representative set of protein-protein binding sites, extracted from the SCOPPI database. When querying binding sites from a representative set, we search for known similarities among 2819 binding sites. A median area under the ROC curve of 0.98 is observed. For half of the queries, a similar binding site is identified among the first two of 2819 when sorting all binding sites according the proposed similarity measure. Typical examples identified by this method are analyzed and discussed. The nitrogenase iron protein-like SCOP family is clustered hierarchically according to the proposed similarity measure as a case study. AVAILABILITY: Python code is available on request from the authors.  相似文献   

3.
The recognition of cryptic small-molecular binding sites in protein structures is important for understanding off-target side effects and for recognizing potential new indications for existing drugs. Current methods focus on the geometry and detailed chemical interactions within putative binding pockets, but may not recognize distant similarities where dynamics or modified interactions allow one ligand to bind apparently divergent binding pockets. In this paper, we introduce an algorithm that seeks similar microenvironments within two binding sites, and assesses overall binding site similarity by the presence of multiple shared microenvironments. The method has relatively weak geometric requirements (to allow for conformational change or dynamics in both the ligand and the pocket) and uses multiple biophysical and biochemical measures to characterize the microenvironments (to allow for diverse modes of ligand binding). We term the algorithm PocketFEATURE, since it focuses on pockets using the FEATURE system for characterizing microenvironments. We validate PocketFEATURE first by showing that it can better discriminate sites that bind similar ligands from those that do not, and by showing that we can recognize FAD-binding sites on a proteome scale with Area Under the Curve (AUC) of 92%. We then apply PocketFEATURE to evolutionarily distant kinases, for which the method recognizes several proven distant relationships, and predicts unexpected shared ligand binding. Using experimental data from ChEMBL and Ambit, we show that at high significance level, 40 kinase pairs are predicted to share ligands. Some of these pairs offer new opportunities for inhibiting two proteins in a single pathway.  相似文献   

4.
Ligand binding: functional site location,similarity and docking   总被引:3,自引:0,他引:3  
Computational methods for the detection and characterisation of protein ligand-binding sites have increasingly become an area of interest now that large amounts of protein structural information are becoming available prior to any knowledge of protein function. There have been particularly interesting recent developments in the following areas: first, functional site detection, whereby protein evolutionary information has been used to locate binding sites on the protein surface; second, functional site similarity, whereby structural similarity and three-dimensional templates can be used to compare and classify and potentially locate new binding sites; and third, ligand docking, which is being used to find and validate functional sites, in addition to having more conventional uses in small-molecule lead discovery.  相似文献   

5.
6.
A bioinformatics method was developed to identify the protein surface around the functional site and to estimate the biochemical function, using a newly constructed molecular surface database named the eF-site (electrostatic surface of Functional site. Molecular surfaces of protein molecules were computed based on the atom coordinates, and the eF-site database was prepared by adding the physical properties on the constructed molecular surfaces. The electrostatic potential on each molecular surface was individually calculated solving the Poisson–Boltzmann equation numerically for the precise continuum model, and the hydrophobicity information of each residue was also included. The eF-site database is accessed by the internet (http://pi.protein.osaka-u.ac.jp/eF-site/). We have prepared four different databases, eF-site/antibody, eF-site/prosite, eF-site/P-site, and eF-site/ActiveSite, corresponding to the antigen binding sites of antibodies with the same orientations, the molecular surfaces for the individual motifs in PROSITE database, the phosphate binding sites, and the active site surfaces for the representatives of the individual protein family, respectively. An algorithm using the clique detection method as an applied graph theory was developed to search of the eF-site database, so as to recognize and discriminate the characteristic molecular surfaces of the proteins. The method identifies the active site having the similar function to those of the known proteins.  相似文献   

7.
The identification of protein biochemical functions based on their three-dimensional structures is strongly required in the post-genome-sequencing era. We have developed a new method to identify and predict protein biochemical functions using the similarity information of molecular surface geometries and electrostatic potentials on the surfaces. Our prediction system consists of a similarity search method based on a clique search algorithm and the molecular surface database eF-site (electrostatic surface of functional-site in proteins). Using this system, functional sites similar to those of phosphoenoylpyruvate carboxy kinase were detected in several mononucleotide-binding proteins, which have different folds. We also applied our method to a hypothetical protein, MJ0226 from Methanococcus jannaschii, and detected the mononucleotide binding site from the similarity to other proteins having different folds.  相似文献   

8.
Lectins form a diverse group of protein families that have in common their ability to specifically recognize certain carbohydrates. Crystal structures of members of the different animal and plant lectin families have revealed a wide variety of lectin folds and carbohydrate binding site architectures. Despite this large variability, a number of interesting cases of convergent as well as divergent evolution among animal and plant lectin families can be noted. These similarities exist at the levels of the protein fold, the architecture of the binding site as well as quaternary structure and may be derived from similar functional needs.  相似文献   

9.
10.
Protein mapping distributes many copies of different molecular probes on the surface of a target protein in order to determine binding hot spots, regions that are highly preferable for ligand binding. While mapping of X-ray structures by the FTMap server is inherently static, this limitation can be overcome by the simultaneous analysis of multiple structures of the protein. FTMove is an automated web server that implements this approach. From the input of a target protein, by PDB code, the server identifies all structures of the protein available in the PDB, runs mapping on them, and combines the results to form binding hot spots and binding sites. The user may also upload their own protein structures, bypassing the PDB search for similar structures. Output of the server consists of the consensus binding sites and the individual mapping results for each structure - including the number of probes located in each binding site, for each structure. This level of detail allows the users to investigate how the strength of a binding site relates to the protein conformation, other binding sites, and the presence of ligands or mutations. In addition, the structures are clustered on the basis of their binding properties. The use of FTMove is demonstrated by application to 22 proteins with known allosteric binding sites; the orthosteric and allosteric binding sites were identified in all but one case, and the sites were typically ranked among the top five. The FTMove server is publicly available at https://ftmove.bu.edu.  相似文献   

11.
12.
13.
Elucidating the mechanisms of specific small‐molecule (ligand) recognition by proteins is a long‐standing conundrum. While the structures of these molecules, proteins and ligands, have been extensively studied, protein–ligand interactions, or binding modes, have not been comprehensively analyzed. Although methods for assessing similarities of binding site structures have been extensively developed, the methods for the computational treatment of binding modes have not been well established. Here, we developed a computational method for encoding the information about binding modes as graphs, and assessing their similarities. An all‐against‐all comparison of 20,040 protein–ligand complexes provided the landscape of the protein–ligand binding modes and its relationships with protein‐ and chemical spaces. While similar proteins in the same SCOP Family tend to bind relatively similar ligands with similar binding modes, the correlation between ligand and binding similarities was not very high (R2 = 0.443). We found many pairs with novel relationships, in which two evolutionally distant proteins recognize dissimilar ligands by similar binding modes (757,474 pairs out of 200,790,780 pairs were categorized into this relationship, in our dataset). In addition, there were an abundance of pairs of homologous proteins binding to similar ligands with different binding modes (68,217 pairs). Our results showed that many interesting relationships between protein–ligand complexes are still hidden in the structure database, and our new method for assessing binding mode similarities is effective to find them.  相似文献   

14.
The IS 1-encoded protein InsA binds specifically to both ends of IS1, and acts as a repressor of IS1 gene expression and may be a direct inhibitor of the transposition process. We show here, using DNasel 'foot-printing' and gel retardation, that the InsA binding sites are located within the 24/25 bp minimal active ends of IS1 and that InsA induces DNA bending upon binding. Conformational modification of the ends of IS1 as a result of binding of the host protein integration host factor (IHF) to its site within the minimal ends has been previously observed. Using a collection of synthetic mutant ends we have mapped some of the nucleotide sequence requirements for InsA binding and for transposition activity. We show that sequences necessary for InsA binding are also essential for transposition activity. We demonstrate that InsA and IHF binding sites overlap since some sequence determinants are shared by both InsA and IHF. The data suggest that these ends contain two functional domains: one for binding of InsA and IHF, and the other for transposition activity. A third region, when present, may enhance transposition activity with an intact right end. This 'architecture' of the ends of IS1 is remarkably similar to that of IS elements IS10, IS50 and IS903.  相似文献   

15.

Background  

Recognizing similarities and deriving relationships among protein molecules is a fundamental requirement in present-day biology. Similarities can be present at various levels which can be detected through comparison of protein sequences or their structural folds. In some cases similarities obscure at these levels could be present merely in the substructures at their binding sites. Inferring functional similarities between protein molecules by comparing their binding sites is still largely exploratory and not as yet a routine protocol. One of the main reasons for this is the limitation in the choice of appropriate analytical tools that can compare binding sites with high sensitivity. To benefit from the enormous amount of structural data that is being rapidly accumulated, it is essential to have high throughput tools that enable large scale binding site comparison.  相似文献   

16.
17.
Conclusions Autoantibodies to chromatin-associated proteins are frequently present in sera from patients with SLE, and related disorders. Autoantibodies to conformational epitopes may constitute the majority of the immune response to chromatin-associated antigens, suggesting that intact chromatin may be the immunogen in SLE as well as in certain forms of drug-induced lupus (eg. in procainamide-induced lupus). The preferential reactivity of autoantibodies to histones, PCNA, and Ku with antigenic determinants that are exposed on the surface of the native antigens is consistent with this interpretation.Strikingly, autoantibodies to these antigens frequently bind within or near active or functional sites, such as the DNA binding site of Ku [29], the site of PCNA critical for its role in enhancing DNA synthesis by polymerase delta [52], the posttranslational modification sites of the histones [68], and the catalytic site of poly(ADP-ribose) polymerase [69]. The explanation for the frequent observation that autoantibodies inhibit function is not yet known. It is possible that this phenomenon is related to the generation of autoantibodies by molecular mimicry, and that the functional sites of foreign antigens may crossreact with self antigens having similar functional sites [9]. Alternatively, the targeting of functional sites by autoantibodies may reflect merely a similar requirement for active sites and antibody-recognition sites to be exposed on surface. Features that make a site suitable for interacting with other proteins (eg. enzymes) or nucleic acids (eg DNA binding sites) may also make it more easily recognized by antibodies.The amino acids critical for autoantibody binding have not, in any of these cases, been shown to be critical to function. Further mapping and/ or mutagenesis studies will be necessary to determine the significance of the targeting of active or functional sites by autoantibodies.This work was supported by Public Health Service grant AR40391 from the National Institutes of Health  相似文献   

18.
We present here the first detailed biochemical analysis of an archaeal restriction enzyme. PspGI shows sequence similarity to SsoII, EcoRII, NgoMIV and Cfr10I, which recognize related DNA sequences. We demonstrate here that PspGI, like SsoII and unlike EcoRII or NgoMIV and Cfr10I, interacts with and cleaves DNA as a homodimer and is not stimulated by simultaneous binding to two recognition sites. PspGI and SsoII differ in their basic biochemical properties, viz. stability against chemical denaturation and proteolytic digestion, DNA binding and the pH, MgCl(2) and salt-dependence of their DNA cleavage activity. In contrast, the results of mutational analyses and cross-link experiments show that PspGI and SsoII have a very similar DNA binding site and catalytic center as NgoMIV and Cfr10I (whose crystal structures are known), and presumably also as EcoRII, in spite of the fact that these enzymes, which all recognize variants of the sequence -/CC-GG- (/ denotes the site of cleavage), are representatives of different subgroups of type II restriction endonucleases. A sequence comparison of all known restriction endonuclease sequences, furthermore, suggests that several enzymes recognizing other DNA sequences also share amino acid sequence similarities with PspGI, SsoII and EcoRII in the region of the presumptive active site. These results are discussed in an evolutionary context.  相似文献   

19.
Predicting off-targets by computational methods is getting increasing importance in early drug discovery stages. We herewith present a computational method based on binding site three-dimensional comparisons, which prompted us to investigate the cross-reaction of protein kinase inhibitors with synapsin I, an ATP-binding protein regulating neurotransmitter release in the synapse. Systematic pair-wise comparison of the staurosporine-binding site of the proto-oncogene Pim-1 kinase with 6,412 druggable protein-ligand binding sites suggested that the ATP-binding site of synapsin I may recognize the pan-kinase inhibitor staurosporine. Biochemical validation of this hypothesis was realized by competition experiments of staurosporine with ATP-γ35S for binding to synapsin I. Staurosporine, as well as three other inhibitors of protein kinases (cdk2, Pim-1 and casein kinase type 2), effectively bound to synapsin I with nanomolar affinities and promoted synapsin-induced F-actin bundling. The selective Pim-1 kinase inhibitor quercetagetin was shown to be the most potent synapsin I binder (IC50  = 0.15 µM), in agreement with the predicted binding site similarities between synapsin I and various protein kinases. Other protein kinase inhibitors (protein kinase A and chk1 inhibitor), kinase inhibitors (diacylglycerolkinase inhibitor) and various other ATP-competitors (DNA topoisomerase II and HSP-90α inhibitors) did not bind to synapsin I, as predicted from a lower similarity of their respective ATP-binding sites to that of synapsin I. The present data suggest that the observed downregulation of neurotransmitter release by some but not all protein kinase inhibitors may also be contributed by a direct binding to synapsin I and phosphorylation-independent perturbation of synapsin I function. More generally, the data also demonstrate that cross-reactivity with various targets may be detected by systematic pair-wise similarity measurement of ligand-annotated binding sites.  相似文献   

20.
Functional annotation is seldom straightforward with complexities arising due to functional divergence in protein families or functional convergence between non‐homologous protein families, leading to mis‐annotations. An enzyme may contain multiple domains and not all domains may be involved in a given function, adding to the complexity in function annotation. To address this, we use binding site information from bound cognate ligands and catalytic residues, since it can help in resolving fold‐function relationships at a finer level and with higher confidence. A comprehensive database of 2,020 fold‐function‐binding site relationships has been systematically generated. A network‐based approach is employed to capture the complexity in these relationships, from which different types of associations are deciphered, that identify versatile protein folds performing diverse functions, same function associated with multiple folds and one‐to‐one relationships. Binding site similarity networks integrated with fold, function, and ligand similarity information are generated to understand the depth of these relationships. Apart from the observed continuity in the functional site space, network properties of these revealed versatile families with topologically different or dissimilar binding sites and structural families that perform very similar functions. As a case study, subtle changes in the active site of a set of evolutionarily related superfamilies are studied using these networks. Tracing of such similarities in evolutionarily related proteins provide clues into the transition and evolution of protein functions. Insights from this study will be helpful in accurate and reliable functional annotations of uncharacterized proteins, poly‐pharmacology, and designing enzymes with new functional capabilities. Proteins 2017; 85:1319–1335. © 2017 Wiley Periodicals, Inc.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号