共查询到20条相似文献,搜索用时 10 毫秒
1.
We show that protein complexes can be represented as small-world networks, exhibiting a relatively small number of highly central amino-acid residues occurring frequently at protein-protein interfaces. We further base our analysis on a set of different biological examples of protein-protein interactions with experimentally validated hot spots, and show that 83% of these predicted highly central residues, which are conserved in sequence alignments and nonexposed to the solvent in the protein complex, correspond to or are in direct contact with an experimentally annotated hot spot. The remaining 17% show a general tendency to be close to an annotated hot spot. On the other hand, although there is no available experimental information on their contribution to the binding free energy, detailed analysis of their properties shows that they are good candidates for being hot spots. Thus, highly central residues have a clear tendency to be located in regions that include hot spots. We also show that some of the central residues in the protein complex interfaces are central in the monomeric structures before dimerization and that possible information relating to hot spots of binding free energy could be obtained from the unbound structures. 相似文献
2.
Using an efficient iterative method, we have developed a distance-dependent knowledge-based scoring function to predict protein-protein interactions. The function, referred to as ITScore-PP, was derived using the crystal structures of a training set of 851 protein-protein dimeric complexes containing true biological interfaces. The key idea of the iterative method for deriving ITScore-PP is to improve the interatomic pair potentials by iteration, until the pair potentials can distinguish true binding modes from decoy modes for the protein-protein complexes in the training set. The iterative method circumvents the challenging reference state problem in deriving knowledge-based potentials. The derived scoring function was used to evaluate the ligand orientations generated by ZDOCK 2.1 and the native ligand structures on a diverse set of 91 protein-protein complexes. For the bound test cases, ITScore-PP yielded a success rate of 98.9% if the top 10 ranked orientations were considered. For the more realistic unbound test cases, the corresponding success rate was 40.7%. Furthermore, for faster orientational sampling purpose, several residue-level knowledge-based scoring functions were also derived following the similar iterative procedure. Among them, the scoring function that uses the side-chain center of mass (SCM) to represent a residue, referred to as ITScore-PP(SCM), showed the best performance and yielded success rates of 71.4% and 30.8% for the bound and unbound cases, respectively, when the top 10 orientations were considered. ITScore-PP was further tested using two other published protein-protein docking decoy sets, the ZDOCK decoy set and the RosettaDock decoy set. In addition to binding mode prediction, the binding scores predicted by ITScore-PP also correlated well with the experimentally determined binding affinities, yielding a correlation coefficient of R = 0.71 on a test set of 74 protein-protein complexes with known affinities. ITScore-PP is computationally efficient. The average run time for ITScore-PP was about 0.03 second per orientation (including optimization) on a personal computer with 3.2 GHz Pentium IV CPU and 3.0 GB RAM. The computational speed of ITScore-PP(SCM) is about an order of magnitude faster than that of ITScore-PP. ITScore-PP and/or ITScore-PP(SCM) can be combined with efficient protein docking software to study protein-protein recognition. 相似文献
3.
Water-mediated hydrogen bonds play critical roles at protein-protein and protein-nucleic acid interfaces, and the interactions formed by discrete water molecules cannot be captured using continuum solvent models. We describe a simple model for the energetics of water-mediated hydrogen bonds, and show that, together with knowledge of the positions of buried water molecules observed in X-ray crystal structures, the model improves the prediction of free-energy changes upon mutation at protein-protein interfaces, and the recovery of native amino acid sequences in protein interface design calculations. We then describe a "solvated rotamer" approach to efficiently predict the positions of water molecules, at protein-protein interfaces and in monomeric proteins, that is compatible with widely used rotamer-based side-chain packing and protein design algorithms. Finally, we examine the extent to which the predicted water molecules can be used to improve prediction of amino acid identities and protein-protein interface stability, and discuss avenues for overcoming current limitations of the approach. 相似文献
4.
pK(a) values of ionizable residues have been calculated using the PROPKA method and structures of 75 protein-protein complexes and their corresponding free forms. These pK(a) values were used to compute changes in protonation state of individual residues, net changes in protonation state of the complex relative to the uncomplexed proteins, and the correction to a binding energy calculated assuming standard protonation states at pH 7. For each complex, two different structures for the uncomplexed form of the proteins were used: the X-ray structures determined for the proteins in the absence of the other protein and the individual protein structures taken from the structure of the complex (referred to as unbound and bound structures, respectively). In 28 and 77% of the cases considered here, protein-protein binding is accompanied by a complete (>95%) or significant (>50%) change in protonation state of at least one residue using unbound structures. Furthermore, in 36 and 61% of the cases, protein-protein binding is accompanied by a complete or significant net change in protonation state of the complex relative to the separated monomers. Using bound structures, the corresponding values are 12, 51, 20, and 48%. Comparison to experimental data suggest that using unbound and bound structures lead to over- and underestimation of binding-induced protonation state changes, respectively. Thus, we conclude that protein-protein binding is often associated with changes in protonation state of amino acid residues and with changes in the net protonation state of the proteins. The pH-dependent correction to the binding energy contributes at least one order of magnitude to the binding constant in 45 and 23%, using unbound and bound structures, respectively. 相似文献
5.
In this article we introduce a new method for the identification and the accurate characterization of protein surface cavities. The method is encoded in the program SCREEN (Surface Cavity REcognition and EvaluatioN). As a first test of the utility of our approach we used SCREEN to locate and analyze the surface cavities of a nonredundant set of 99 proteins cocrystallized with drugs. We find that this set of proteins has on average about 14 distinct cavities per protein. In all cases, a drug is bound at one (and sometimes more than one) of these cavities. Using cavity size alone as a criterion for predicting drug-binding sites yields a high balanced error rate of 15.7%, with only 71.7% coverage. Here we characterize each surface cavity by computing a comprehensive set of 408 physicochemical, structural, and geometric attributes. By applying modern machine learning techniques (Random Forests) we were able to develop a classifier that can identify drug-binding cavities with a balanced error rate of 7.2% and coverage of 88.9%. Only 18 of the 408 cavity attributes had a statistically significant role in the prediction. Of these 18 important attributes, almost all involved size and shape rather than physicochemical properties of the surface cavity. The implications of these results are discussed. A SCREEN Web server is available at http://interface.bioc.columbia.edu/screen. 相似文献
6.
The number of structures of protein-protein complexes deposited to the Protein Data Bank is growing rapidly. These structures embed important information for predicting structures of new protein complexes. This motivated us to develop the PPISP method for predicting interface residues in protein-protein complexes. In PPISP, sequence profiles and solvent accessibility of spatially neighboring surface residues were used as input to a neural network. The network was trained on native interface residues collected from the Protein Data Bank. The prediction accuracy at the time was 70% with 47% coverage of native interface residues. Now we have extensively improved PPISP. The training set now consisted of 1156 nonhomologous protein chains. Test on a set of 100 nonhomologous protein chains showed that the prediction accuracy is now increased to 80% with 51% coverage. To solve the problem of over-prediction and under-prediction associated with individual neural network models, we developed a consensus method that combines predictions from multiple models with different levels of accuracy and coverage. Applied on a benchmark set of 68 proteins for protein-protein docking, the consensus approach outperformed the best individual models by 3-8 percentage points in accuracy. To demonstrate the predictive power of cons-PPISP, eight complex-forming proteins with interfaces characterized by NMR were tested. These proteins are nonhomologous to the training set and have a total of 144 interface residues identified by chemical shift perturbation. cons-PPISP predicted 174 interface residues with 69% accuracy and 47% coverage and promises to complement experimental techniques in characterizing protein-protein interfaces. . 相似文献
7.
Protein-protein interactions can be altered by mutating one or more "hot spots," the subset of residues that account for most of the interface's binding free energy. The identification of hot spots requires a significant experimental effort, highlighting the practical value of hot spot predictions. We present two knowledge-based models that improve the ability to predict hot spots: K-FADE uses shape specificity features calculated by the Fast Atomic Density Evaluation (FADE) program, and K-CON uses biochemical contact features. The combined K-FADE/CON (KFC) model displays better overall predictive accuracy than computational alanine scanning (Robetta-Ala). In addition, because these methods predict different subsets of known hot spots, a large and significant increase in accuracy is achieved by combining KFC and Robetta-Ala. The KFC analysis is applied to the calmodulin (CaM)/smooth muscle myosin light chain kinase (smMLCK) interface, and to the bone morphogenetic protein-2 (BMP-2)/BMP receptor-type I (BMPR-IA) interface. The results indicate a strong correlation between KFC hot spot predictions and mutations that significantly reduce the binding affinity of the interface. 相似文献
8.
Verspoor K Cohn J Mniszewski S Joslyn C 《Protein science : a publication of the Protein Society》2006,15(6):1544-1549
Automated function prediction (AFP) methods increasingly use knowledge discovery algorithms to map sequence, structure, literature, and/or pathway information about proteins whose functions are unknown into functional ontologies, typically (a portion of) the Gene Ontology (GO). While there are a growing number of methods within this paradigm, the general problem of assessing the accuracy of such prediction algorithms has not been seriously addressed. We present first an application for function prediction from protein sequences using the POSet Ontology Categorizer (POSOC) to produce new annotations by analyzing collections of GO nodes derived from annotations of protein BLAST neighborhoods. We then also present hierarchical precision and hierarchical recall as new evaluation metrics for assessing the accuracy of any predictions in hierarchical ontologies, and discuss results on a test set of protein sequences. We show that our method provides substantially improved hierarchical precision (measure of predictions made that are correct) when applied to the nearest BLAST neighbors of target proteins, as compared with simply imputing that neighborhood's annotations to the target. Moreover, when our method is applied to a broader BLAST neighborhood, hierarchical precision is enhanced even further. In all cases, such increased hierarchical precision performance is purchased at a modest expense of hierarchical recall (measure of all annotations that get predicted at all). 相似文献
9.
Protein-protein docking (PPD) is a computational process that predicts the structure of a complex of two interacting proteins from their unbound structures. The accuracy of PPD predictions is low, but can be greatly enhanced if experimentally determined distance data are available for incorporation into the prediction. However, the specific effects of distance constraints on PPD predictions are largely uncharacterized. In this study, we systematically simulated the effects of using distance constraints both on a new distance constraint-driven PPD approach, called DPPD, and also, by re-ranking, on a well-established grid-based global search approach. Our results for a PPD benchmark dataset of 84 protein complexes of known structures showed that near 100% docking success rates could be obtained when the number of distance constraints exceeded six, the degrees of freedom of the system, but the success rate was significantly reduced by long distance constraints, large binding-induced conformational changes, and large errors in the distance data. Our results also showed that, under most conditions simulated, even two or three distance constraints were sufficient to achieve a much better success rate than those using a sophisticated physicochemical function to re-rank the results of the global search. Our study provides guidelines for the practical incorporation of experimental distance data to aid PPD predictions. 相似文献
10.
PIER: protein interface recognition for structural proteomics 总被引:1,自引:0,他引:1
Recent advances in structural proteomics call for development of fast and reliable automatic methods for prediction of functional surfaces of proteins with known three-dimensional structure, including binding sites for known and unknown protein partners as well as oligomerization interfaces. Despite significant progress the problem is still far from being solved. Most existing methods rely, at least partially, on evolutionary information from multiple sequence alignments projected on protein surface. The common drawback of such methods is their limited applicability to the proteins with a sparse set of sequential homologs, as well as inability to detect interfaces in evolutionary variable regions. In this study, the authors developed an improved method for predicting interfaces from a single protein structure, which is based on local statistical properties of the protein surface derived at the level of atomic groups. The proposed Protein IntErface Recognition (PIER) method achieved the overall precision of 60% at the recall threshold of 50% at the residue level on a diverse benchmark of 490 homodimeric, 62 heterodimeric, and 196 transient interfaces (compared with 25% precision at 50% recall expected from random residue function assignment). For 70% of proteins in the benchmark, the binding patch residues were successfully detected with precision exceeding 50% at 50% recall. The calculation only took seconds for an average 300-residue protein. The authors demonstrated that adding the evolutionary conservation signal only marginally influenced the overall prediction performance on the benchmark; moreover, for certain classes of proteins, using this signal actually resulted in a deteriorated prediction. Thorough benchmarking using other datasets from literature showed that PIER yielded improved performance as compared with several alignment-free or alignment-dependent predictions. The accuracy, efficiency, and dependence on structure alone make PIER a suitable tool for automated high-throughput annotation of protein structures emerging from structural proteomics projects. 相似文献
11.
Orly Razgour John B. Taggart Stephanie Manel Javier Juste Carlos Ibáñez Hugo Rebelo Antton Alberdi Gareth Jones Kirsty Park 《Molecular ecology resources》2018,18(1):18-31
Climate change is a major threat to global biodiversity that will produce a range of new selection pressures. Understanding species responses to climate change requires an interdisciplinary perspective, combining ecological, molecular and environmental approaches. We propose an applied integrated framework to identify populations under threat from climate change based on their extent of exposure, inherent sensitivity due to adaptive and neutral genetic variation and range shift potential. We consider intraspecific vulnerability and population‐level responses, an important but often neglected conservation research priority. We demonstrate how this framework can be applied to vertebrates with limited dispersal abilities using empirical data for the bat Plecotus austriacus. We use ecological niche modelling and environmental dissimilarity analysis to locate areas at high risk of exposure to future changes. Combining outlier tests with genotype–environment association analysis, we identify potential climate‐adaptive SNPs in our genomic data set and differences in the frequency of adaptive and neutral variation between populations. We assess landscape connectivity and show that changing environmental suitability may limit the future movement of individuals, thus affecting both the ability of populations to shift their distribution to climatically suitable areas and the probability of evolutionary rescue through the spread of adaptive genetic variation among populations. Therefore, a better understanding of movement ecology and landscape connectivity is needed for predicting population persistence under climate change. Our study highlights the importance of incorporating genomic data to determine sensitivity, adaptive potential and range shift potential, instead of relying solely on exposure to guide species vulnerability assessments and conservation planning. 相似文献
12.
Despite the increasing number of successful determinations of complex protein structures the understanding of their dynamics properties is still rather limited. Using X-ray crystallography, we demonstrate that ribonuclease A (RNase A) undergoes significant domain motions upon ligand binding. In particular, when cytidine 2'-monophosphate binds to RNase A, the structure of the enzyme becomes more compact. Interestingly, our data also show that these structural alterations are fully reversible in the crystal state. These findings provide structural bases for the dynamic behavior of RNase A in the binding of the substrate shown by Petsko and coworkers (Rasmussen et al. Nature 1992;357:423-424). These subtle domain motions may assume functional relevance for more complex system and may play a significant role in the cooperativity of oligomeric enzymes. 相似文献
13.
Management of biological systems involves the application of ecological and evolutionary principles within a decision theory framework. In the present review, we focus on epidemiology and pest (insect and plant) control. By studying their similarities and differences, it is possible to outline a broad conceptual background for the antagonistic interactions involved, and hence to begin to develop a classification system of predictive value in management situations. In particular, we address issues of scale in space and time, and relate these to contrasting practical problems associated with deploying biocontrol agents and disease control. Additionally, we discuss three areas where an integrated approach to natural enemies, using ecological and evolutionary insights, and decision theory have much to offer: (i) management and resistance, (ii) the problems of emerging diseases, and (iii) the links between disease and behaviour. 相似文献
14.
An equimolar mixture of avian pancreatic polypeptide (aPP) fragments aPP(1-11)-NH2 and Ac-aPP(12-36) had an electronic circular dichroism (ECD) spectrum that was similar to that of whole aPP in H2O and even more so in 30% (v/v) trifluoroethanol (TFE) in 15 mM Na2HPO4, but was different from the sum of the spectra of the individual fragments. The vibrational circular dichroism (VCD) spectrum of the combined fragments in 30% (v/v) TFE in 15 mM Na2HPO4 in D2O was also similar to that of the intact aPP and unlike the sum of the VCD spectra of the fragments. The interaction of these fragments is thus sufficient to support the conformation of whole aPP. This study demonstrates that VCD, in combination with ECD, is useful for the study of protein-protein interactions. 相似文献
15.
DE Runcie DA Garfield CC Babbitt JA Wygoda S Mukherjee GA Wray 《Molecular ecology》2012,21(18):4547-4562
Stress responses play an important role in shaping species distributions and robustness to climate change. We investigated how stress responses alter the contribution of additive genetic variation to gene expression during development of the purple sea urchin, Strongylocentrotus purpuratus, under increased temperatures that model realistic climate change scenarios. We first measured gene expression responses in the embryos by RNA‐seq to characterize molecular signatures of mild, chronic temperature stress in an unbiased manner. We found that an increase from 12 to 18 °C caused widespread alterations in gene expression including in genes involved in protein folding, RNA processing and development. To understand the quantitative genetic architecture of this response, we then focused on a well‐characterized gene network involved in endomesoderm and ectoderm specification. Using a breeding design with wild‐caught individuals, we measured genetic and gene–environment interaction effects on 72 genes within this network. We found genetic or maternal effects in 33 of these genes and that the genetic effects were correlated in the network. Fourteen network genes also responded to higher temperatures, but we found no significant genotype–environment interactions in any of the genes. This absence may be owing to an effective buffering of the temperature perturbations within the network. In support of this hypothesis, perturbations to regulatory genes did not affect the expression of the genes that they regulate. Together, these results provide novel insights into the relationship between environmental change and developmental evolution and suggest that climate change may not expose large amounts of cryptic genetic variation to selection in this species. 相似文献
16.
Khamir Mehta Adam D. Hoppe Raghunandan Kainkaryam Peter J. Woolf Jennifer J. Linderman 《Proteomics》2009,9(23):5371-5383
Fluorescence resonance energy transfer (FRET) microscopy can measure the spatial distribution of protein interactions inside live cells. Such experiments give rise to complex data sets with many images of single cells, motivating data reduction and abstraction. In particular, determination of the value of the equilibrium dissociation constant (Kd) will provide a quantitative measure of protein–protein interactions, which is essential to reconstructing cellular signaling networks. Here, we investigate the feasibility of using quantitative FRET imaging of live cells to estimate the local value of Kd for two interacting labeled molecules. An algorithm is developed to infer the values of Kd using the intensity of individual voxels of 3‐D FRET microscopy images. The performance of our algorithm is investigated using synthetic test data, both in the absence and in the presence of endogenous (unlabeled) proteins. The influence of optical blurring caused by the microscope (confocal or wide field) and detection noise on the accuracy of Kd inference is studied. We show that deconvolution of images followed by analysis of intensity data at local level can improve the estimate of Kd. Finally, the performance of this algorithm using cellular data on the interaction between yellow fluorescent protein‐Rac and cyan fluorescent protein‐PBD in mammalian cells is shown. 相似文献
17.
Cancer-associated mutations in the BRCT domain of BRCA1 (BRCA1-BRCT) abolish its tumor suppressor function by disrupting interactions with other proteins such as BACH1. Many cancer-related mutations do not cause sufficient destabilization to lead to global unfolding under physiological conditions, and thus abrogation of function probably is due to localized structural changes. To explore the reasons for mutation-induced loss of function, the authors performed molecular dynamics simulations on three cancer-associated mutants, A1708E, M1775R, and Y1853ter, and on the wild type and benign M1652I mutant, and compared the structures and fluctuations. Only the cancer-associated mutants exhibited significant backbone structure differences from the wild-type crystal structure in BACH1-binding regions, some of which are far from the mutation sites. Backbone differences of the A1708E mutant from the liganded wild type structure in these regions are much larger than those of the unliganded wild type X-ray or molecular dynamics structures. These BACH1-binding regions of the cancer-associated mutants also exhibited increases in their fluctuation magnitudes compared with the same regions in the wild type and M1562I mutant, as quantified by quasiharmonic analysis. Several of the regions of increased fluctuation magnitude correspond to correlated motions of residues in contact that provide a continuous path of fluctuating amino acids in contact from the A1708E and Y1853ter mutation sites to the BACH1-binding sites with altered structure and dynamics. The increased fluctuations in the disease-related mutants suggest an increase in vibrational entropy in the unliganded state that could result in a larger entropy loss in the disease-related mutants upon binding BACH1 than in the wild type. To investigate this possibility, vibrational entropies of the A1708E and wild type in the free state and bound to a BACH1-derived phosphopeptide were calculated using quasiharmonic analysis, to determine the binding entropy difference DeltaDeltaS between the A1708E mutant and the wild type. DeltaDeltaS was determined to be -4.0 cal mol(-1) K(-1), with an uncertainty of 2 cal mol(-1) K(-1); that is, the entropy loss upon binding the peptide is 4.0 cal mol(-1) K(-1) greater for the A1708E mutant, corresponding to an entropic contribution to the DeltaDeltaG of binding (-TDeltaDeltaS) 1.1 kcal mol(-1) more positive for the mutant. The observed differences in structure, flexibility, and entropy of binding likely are responsible for abolition of BACH1 binding, and illustrate that many disease- related mutations could have very long-range effects. The methods described here have potential for identifying correlated motions responsible for other long-range effects of deleterious mutations. 相似文献
18.
19.
An integrated approach to patient-specific predictive modeling for single ventricle heart palliation
Chiara Corsini Catriona Baker Ethan Kung Silvia Schievano Gregory Arbia Alessia Baretta 《Computer methods in biomechanics and biomedical engineering》2014,17(14):1572-1589
In patients with congenital heart disease and a single ventricle (SV), ventricular support of the circulation is inadequate, and staged palliative surgery (usually 3 stages) is needed for treatment. In the various palliative surgical stages individual differences in the circulation are important and patient-specific surgical planning is ideal. In this study, an integrated approach between clinicians and engineers has been developed, based on patient-specific multi-scale models, and is here applied to predict stage 2 surgical outcomes. This approach involves four distinct steps: (1) collection of pre-operative clinical data from a patient presenting for SV palliation, (2) construction of the pre-operative model, (3) creation of feasible virtual surgical options which couple a three-dimensional model of the surgical anatomy with a lumped parameter model (LPM) of the remainder of the circulation and (4) performance of post-operative simulations to aid clinical decision making. The pre-operative model is described, agreeing well with clinical flow tracings and mean pressures. Two surgical options (bi-directional Glenn and hemi-Fontan operations) are virtually performed and coupled to the pre-operative LPM, with the hemodynamics of both options reported. Results are validated against postoperative clinical data. Ultimately, this work represents the first patient-specific predictive modeling of stage 2 palliation using virtual surgery and closed-loop multi-scale modeling. 相似文献
20.
In multi‐domain proteins, the domains typically run end‐to‐end, that is, one domain follows the C‐terminus of another domain. However, approximately 10% of multi‐domain proteins are formed by insertion of one domain sequence into that of another domain. Detecting such insertions within protein sequences is a fundamental challenge in structural biology. The haloacid dehalogenase superfamily (HADSF) serves as a challenging model system wherein a variable cap domain (~5–200 residues in length) accessorizes the ubiquitous Rossmann‐fold core domain, with variations in insertion site and topology corresponding to different classes of cap types. Herein, we describe a comprehensive computational strategy, CapPredictor, for determining large, variable domain insertions in protein sequences. Using a novel sequence‐alignment algorithm in conjunction with a structure‐guided sequence profile from 154 core‐domain‐only structures, more than 40,000 HADSF member sequences were assigned cap types. The resulting data set afforded insight into HADSF evolution. Notably, a similar distribution of cap‐type classes across different phyla was observed, indicating that all cap types existed in the last universal common ancestor. In addition, comparative analyses of the predicted cap‐type and functional assignments showed that different cap types carry out similar chemistries. Thus, while cap domains play a role in substrate recognition and chemical reactivity, cap‐type does not strictly define functional class. Through this example, we have shown that CapPredictor is an effective new tool for the study of form and function in protein families where domain insertion occurs. Proteins 2014; 82:1896–1906. © 2014 Wiley Periodicals, Inc. 相似文献