Similar articles
1.
2.
MOTIVATION: Protein-protein docking algorithms typically generate large numbers of possible complex structures, only a few of which resemble the native structure. Recently (Duan et al., Protein Sci, 14:316-328, 2005), it was observed that the surface density of conserved residue positions is high at the interface regions of interacting protein surfaces, except for antibody-antigen complexes, where fewer conserved positions than average are observed at the interface regions. Using this observation, we identified putative interacting regions on the surface of interacting partners and significantly improved docking results by assigning top ranks to near-native complex structures. In this paper, we combine the residue conservation information with a widely used shape complementarity algorithm to generate candidate complex structures with a higher percentage of near-native structures (hits). What is new in this work is that the conservation information is used early in the generation stage and not only in the ranking stage of the docking algorithm. This results in a significantly larger number of generated hits and an improved predictive ability in identifying the native structure of protein-protein complexes. RESULTS: We report on results from 48 well-characterized protein complexes with sufficient residue conservation information, drawn from the same 59 benchmark complexes used in our previous work. We compute conservation indices of residue positions on the surfaces of interacting proteins using available homologous sequences from UNIPROT and calculate the solvent accessible surface area. We combine this information with shape-complementarity scores to generate candidate protein-protein complex structures. When compared with the pure shape-complementarity algorithm of FTDock, our method results in significantly more hits, with the improvement being over 100% in many instances. We demonstrate that residue conservation information is useful not only in the refinement and scoring of docking solutions, but also in the enrichment of near-native structures during the generation of candidate geometries of complex structures.

3.
Although atomic-resolution crystal structures of the conserved C-terminal domain of several species of TBP and their complexes with DNA have been determined, little information is available concerning the structure in solution of full-length TBP containing both the conserved C-terminal and nonconserved N-terminal domains. Quantitation of the amino acid side chain oxidation products generated by synchrotron X-ray radiolysis by mass spectrometry has been used to determine the solvent accessibility of individual residues in monomeric Saccharomyces cerevisiae TATA binding protein (TBP) free in solution and in the TBP-DNA complex. Amino acid side chains within the C-terminal domain of unliganded full-length TBP that are predicted to be accessible from crystal structures of the isolated domain are protected from oxidation. Residues within the N-terminal domain are also protected from oxidation in both the absence and presence of DNA. Some residues within the DNA-binding "saddle" of the C-terminal domain are protected upon formation of a TBP-DNA complex as expected, while others are protected in both the absence and presence of bound DNA. In addition, residues on the upper side of the beta-sheets undergo reactivity changes as a function of DNA binding. These data suggest that the DNA-binding saddle of monomeric unliganded yeast TBP is only partially accessible to solvent, the N-terminal domain is partially structured, and the N- and C-terminal domains form a different set of contacts in the free and DNA-bound protein. The functional implications of these results are discussed.

4.
Multiprotein systems mediate most regulatory processes in living organisms. Although the structures of the individual proteins are often defined, less is known of the structures of multiprotein systems. Computational methods for predicting interfaces, using evolutionary conservation and/or physicochemical data, have been developed. Here we consider the use of solvent accessibility, residue propensity, and hydrophobicity, in conjunction with secondary structure data, as prediction parameters. We analyze the influence of residue type and secondary structure on solvent accessibility and define a measure of "relative exposedness." Clustering abnormally high scoring residues provides a basis for predicting interaction sites. The analysis is extended to investigate abnormally exposed secondary structure elements, particularly beta-sheet strands. We show that surface-exposed beta-strands lacking protective features are more likely to be found at protein-protein interfaces, allowing us to create an algorithm with approximately 68% and approximately 75% accuracy in differentiating between interacting and edge strands in isolated beta-strands and beta-sheet strands, respectively. These methods of identifying abnormally exposed surface regions are combined in an algorithm, which, on a data set of 77 unbound and disjoint (single chain extracted from complex) structures, predicts 79% of the protein-protein interfaces correctly. If enzyme-inhibitor complexes, where the inhibitor mimics a nonprotein substrate, are excluded, the accuracy increases to 85%.

5.
We combine a new, extremely fast technique to generate a library of low energy structures of an oligopeptide (by using mutually orthogonal Latin squares to sample its conformational space) with a genetic algorithm to predict protein structures. The protein sequence is divided into oligopeptides, and a structure library is generated for each. These libraries are used in a newly defined mutation operator that, together with variation, crossover, and diversity operators, is used in a modified genetic algorithm to make the prediction. Application to five small proteins has yielded near native structures.
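
The sampling idea named in this abstract, mutually orthogonal Latin squares (MOLS), can be illustrated with a short sketch. This is not the authors' implementation: it uses the textbook construction for prime order p, where square k has entries (k*i + j) mod p, and the helper names `mols`, `are_orthogonal`, and `sample_points` are hypothetical. Each cell (i, j) is read as one sample point, so p*p points are drawn from a grid of p**n_params level combinations while every pair of parameters still sees each pair of levels exactly once.

```python
from itertools import combinations, product


def mols(p):
    """Return the p-1 mutually orthogonal Latin squares of prime order p.

    Square k (k = 1 .. p-1) has entry (k*i + j) mod p at row i, column j.
    This construction is only valid when p is prime.
    """
    if p < 2 or any(p % d == 0 for d in range(2, int(p ** 0.5) + 1)):
        raise ValueError("this construction assumes a prime order")
    return [[[(k * i + j) % p for j in range(p)] for i in range(p)]
            for k in range(1, p)]


def are_orthogonal(a, b):
    """Two squares are orthogonal if superimposing them yields every ordered pair once."""
    n = len(a)
    pairs = {(a[i][j], b[i][j]) for i, j in product(range(n), repeat=2)}
    return len(pairs) == n * n


def sample_points(p, n_params):
    """Return p*p sample points; parameter t takes the level squares[t][i][j]."""
    if n_params > p - 1:
        raise ValueError("at most p-1 parameters can be sampled with order p")
    squares = mols(p)[:n_params]
    return [tuple(sq[i][j] for sq in squares)
            for i in range(p) for j in range(p)]


if __name__ == "__main__":
    squares = mols(5)
    print(all(are_orthogonal(a, b) for a, b in combinations(squares, 2)))
    # Levels could be mapped to torsion-angle values (an assumption, for illustration).
    print(len(sample_points(5, 3)), "points instead of", 5 ** 3)
```

In a conformational search, each level index would be mapped to a concrete value of one parameter (for example a torsion angle), and the energy would be evaluated only at the sampled points.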

6.
Molecular dynamics effects on protein electrostatics
Electrostatic calculations have been carried out on a number of structural conformers of tuna cytochrome c. Conformers were generated using molecular dynamics simulations with a range of solvent-simulating macroscopic dielectric formalisms, and one solvent model that explicitly included solvent water molecules. Structures generated using the lowest dielectric models were relatively tight, with side chains collapsed on the surface, while those from the higher dielectric models had more internal and external fluidity, with surface side chains exploring a fuller range of conformational space. The average structure generated with the explicitly solvated model corresponded most closely with the crystal structure. Individual pK values, overall titration curves, and electrostatic potential surfaces were calculated for average structures and structures along each simulation. Differences between structural conformers within each simulation give rise to substantial changes in calculated local electrostatic interactions, resulting in pK value fluctuations for individual sites in the protein that vary by 0.3-2.0 pK units from the calculated time average. These variations are due to the thermal side chain reorientations that produce fluctuations in charge site separations. Properties like overall titration curves and pH-dependent stability are not as sensitive to side chain fluctuations within a simulation, but there are substantial effects between simulations due to marked differences in average side chain behavior. These findings underscore the importance of proper dielectric formalism in molecular dynamics simulations when used to generate alternate solution structures from a crystal structure, and suggest that conformers significantly removed from the average structure have altered electrostatic properties that may prove important in episodic protein properties such as catalysis.

7.
Amino acid-specific covalent labeling is well suited to probe protein structure and macromolecular interactions, especially for macromolecules and their complexes that are difficult to examine by alternative means, due to size, complexity, or instability. Here we present a detailed account of carbodiimide-based covalent labeling (with GEE tagging) applied to a glycosylated monoclonal antibody therapeutic, which represents an important class of biologic drugs. Characterization of such proteins and their antigen complexes is essential to development of new biologic-based medicines. In this study, the experiments were optimized to preserve the structural integrity of the protein, and experimental conditions were varied and replicated to establish the reproducibility and precision of the technique. Homology-based models were generated and used to compare the solvent accessibility of the labeled residues, which include D, E, and the C-terminus, against the experimental surface accessibility data in order to understand the accuracy of the approach in providing an unbiased assessment of structure. Data from the protein were also compared to reactivity measures of several model peptides to explain sequence or structure-based variations in reactivity. The results highlight several advantages of this approach. These include: the ease of use at the bench top, the linearity of the dose response plots at high levels of labeling (indicating that the label does not significantly perturb the structure of the protein), the high reproducibility of replicate experiments (<2 % variation in modification extent), the similar reactivity of the 3 target probe residues (as suggested by analysis of model peptides), and the overall positive and significant correlation of reactivity and solvent accessible surface area (the latter values predicted by the homology modeling). 
Attenuation of reactivity, in otherwise solvent accessible probes, is documented as arising from the effects of positive charge or bond formation between adjacent amine and carboxyl groups, the latter accompanied by observed water loss. The results are also compared with data from hydroxyl radical-mediated oxidative footprinting on the same protein, showing that complementary information is gained from the 2 approaches, although the number of target residues in carbodiimide/GEE labeling is smaller. Overall, this approach is an accurate and precise method for assessing protein structure of biologic drugs.

8.
MABS-AUSTIN, 2013, 5(6): 1486-1499
Amino acid-specific covalent labeling is well suited to probe protein structure and macromolecular interactions, especially for macromolecules and their complexes that are difficult to examine by alternative means, due to size, complexity, or instability. Here we present a detailed account of carbodiimide-based covalent labeling (with GEE tagging) applied to a glycosylated monoclonal antibody therapeutic, which represents an important class of biologic drugs. Characterization of such proteins and their antigen complexes is essential to development of new biologic-based medicines. In this study, the experiments were optimized to preserve the structural integrity of the protein, and experimental conditions were varied and replicated to establish the reproducibility and precision of the technique. Homology-based models were generated and used to compare the solvent accessibility of the labeled residues, which include D, E, and the C-terminus, against the experimental surface accessibility data in order to understand the accuracy of the approach in providing an unbiased assessment of structure. Data from the protein were also compared to reactivity measures of several model peptides to explain sequence or structure-based variations in reactivity. The results highlight several advantages of this approach. These include: the ease of use at the bench top, the linearity of the dose response plots at high levels of labeling (indicating that the label does not significantly perturb the structure of the protein), the high reproducibility of replicate experiments (<2 % variation in modification extent), the similar reactivity of the 3 target probe residues (as suggested by analysis of model peptides), and the overall positive and significant correlation of reactivity and solvent accessible surface area (the latter values predicted by the homology modeling). 
Attenuation of reactivity, in otherwise solvent accessible probes, is documented as arising from the effects of positive charge or bond formation between adjacent amine and carboxyl groups, the latter accompanied by observed water loss. The results are also compared with data from hydroxyl radical-mediated oxidative footprinting on the same protein, showing that complementary information is gained from the 2 approaches, although the number of target residues in carbodiimide/GEE labeling is smaller. Overall, this approach is an accurate and precise method for assessing protein structure of biologic drugs.

9.
The reaction of hydroxyl and other oxygen-based radicals with the side chains of proteins on millisecond timescales has been used to probe the structure of proteins, their dynamics in solution and interactions with other macromolecules. Radicals are generated in high flux within microseconds from synchrotron radiation and discharge sources and react with proteins on timescales that are less than those often attributed to structural reorganisation and folding. The oxygen-based radicals generated in aqueous solution react with proteins to effect limited oxidation at specific amino acids throughout the sequence of the protein. The extent of oxidation at these residue markers is highly influenced by the accessibility of the reaction site to the bulk solvent. The extent of oxidation allows protection levels to be measured based on the degree to which a reaction occurs. A map of a protein's three-dimensional structure is subsequently assembled as in a footprinting experiment. Protein solutions that contain various concentrations of substrates that either promote or disrupt structural transitions can be investigated to facilitate site-specific equilibrium and time-resolved studies of protein folding. The radical-based strategies can also be employed in the study of protein-protein interactions to provide a new avenue for investigating protein complexes and assemblies with high structural resolution. The urea-induced unfolding of apomyoglobin, and the binding domains within the ribonuclease S and calmodulin-melittin protein-peptide complexes are presented to illustrate the approach.

10.
Water molecules immobilized on a protein or DNA surface are known to play an important role in intramolecular and intermolecular interactions. Comparative analysis of related three-dimensional (3D) structures makes it possible to predict the locations of such water molecules on the protein surface. We have developed and implemented the algorithm WLAKE, which detects "conserved" water molecules, i.e. those located in almost the same positions in a set of superimposed structures of related proteins or macromolecular complexes. The problem is reduced to finding maximal cliques in a certain graph. Despite the exponential complexity of the algorithm, the program runs acceptably fast for dozens of superimposed structures. WLAKE was used to predict functionally significant water molecules in enzyme active sites (transketolases) as well as in intermolecular (ETS-DNA complexes) and intramolecular (thiol-disulfide interchange protein) interactions. The program is available online at http://monkey.belozersky.msu.ru/~evgeniy/wLake/wLake.html.
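
The reduction to maximal cliques mentioned in this abstract can be sketched as follows. The abstract does not describe how WLAKE builds its graph, so the construction here is an assumption made for illustration: each water molecule is a vertex, and two waters from different superimposed structures are joined when they lie within a distance cutoff. The cutoff value and the function names `build_graph` and `maximal_cliques` are hypothetical. Cliques are enumerated with the standard Bron-Kerbosch algorithm with pivoting, which is exponential in the worst case.

```python
from itertools import combinations
from math import dist


def build_graph(waters, cutoff=1.0):
    """Build an adjacency map over water molecules.

    waters: list of (structure_id, (x, y, z)) in a common, superimposed frame.
    Two waters are adjacent if they come from different structures and lie
    within `cutoff` (same length unit as the coordinates) of each other.
    """
    adj = {i: set() for i in range(len(waters))}
    for i, j in combinations(range(len(waters)), 2):
        (si, pi), (sj, pj) = waters[i], waters[j]
        if si != sj and dist(pi, pj) <= cutoff:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def maximal_cliques(adj):
    """Enumerate all maximal cliques (Bron-Kerbosch with pivoting)."""
    cliques = []

    def expand(r, p, x):
        if not p and not x:
            cliques.append(sorted(r))
            return
        # Pivot on the vertex with most neighbours in p to prune branches.
        pivot = max(p | x, key=lambda v: len(adj[v] & p))
        for v in list(p - adj[pivot]):
            expand(r | {v}, p & adj[v], x & adj[v])
            p.remove(v)
            x.add(v)

    expand(set(), set(adj), set())
    return cliques


if __name__ == "__main__":
    waters = [
        ("A", (0.0, 0.0, 0.0)), ("B", (0.3, 0.0, 0.0)), ("C", (0.0, 0.4, 0.0)),
        ("A", (5.0, 5.0, 5.0)), ("B", (5.2, 5.0, 5.0)), ("C", (9.0, 9.0, 9.0)),
    ]
    cliques = maximal_cliques(build_graph(waters))
    # Keep only positions seen in at least two structures.
    print(sorted(c for c in cliques if len(c) >= 2))
```

A clique that contains one water from most of the input structures would then be reported as a conserved position; the threshold on clique size is a free choice in this sketch.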

11.
Multiple-solvent crystal structure determination (MSCS) allows the position and orientation of bound solvent fragments to be identified by determining the structure of protein crystals soaked in organic solvents. We have extended this technique by the determination of high-resolution crystal structures of thermolysin (TLN), generated from crystals soaked in 2% to 100% isopropanol. The procedure causes only minor changes to the conformation of the protein, and an increasing number of isopropanol interaction sites could be identified as the solvent concentration is increased. Isopropanol occupies all four of the main subsites in the active site, although this was only observed at very high concentrations of isopropanol for three of the four subsites. Analysis of the isopropanol positions shows little correlation with interaction energy computed using a molecular mechanics force field, but the experimentally determined positions of isopropanol are consistent with the structures of known protein-ligand complexes of TLN.

12.
Li CH, Ma XH, Chen WZ, Wang CX. Protein engineering, 2003, 16(4): 265-269
An efficient 'soft docking' algorithm is described to assist the prediction of protein-protein association using three-dimensional structures of molecules. The basic tools are the 'simplified protein' model and the docking algorithm of Wodak and Janin. The side chain flexibility of Arg, Lys, Asp, Glu and Met residues at the protein surface is taken into account. The complex type-dependent filtering technique on the basis of the geometric matching, hydrophobicity and electrostatic complementarity is used to select candidate binding modes. Subsequently, we calculate a scoring function which includes electrostatic and desolvation energy terms. In the 44 complexes tested including enzyme-inhibitor, antibody-antigen and other complexes, native-like structures were all found, of which 30 were ranked in the top 20. Thus, our soft docking algorithm has the potential to predict protein-protein recognition.

13.
14.
Fan Chao, Liu Diwei, Huang Rui, Chen Zhigang, Deng Lei. BMC bioinformatics, 2016, 17(1): 85-95
Protein solvent accessibility prediction is a pivotal intermediate step towards modeling protein tertiary structures directly from one-dimensional sequences. It also plays an important part in identifying protein folds and domains. Although several methods for protein solvent accessibility prediction have been proposed in recent years, their performance is far from satisfactory. In this work, we propose PredRSA, a computational method that can accurately predict the relative solvent accessible surface area (RSA) of residues by exploring various local and global sequence features which have been observed to be associated with solvent accessibility. Based on these features, a novel and efficient approach, Gradient Boosted Regression Trees (GBRT), is adopted for the first time to predict RSA. Experimental results obtained from 5-fold cross-validation on the Manesh-215 dataset show that the mean absolute error (MAE) and the Pearson correlation coefficient (PCC) of PredRSA are 9.0% and 0.75, respectively, which are better than those of existing methods. Moreover, we evaluate the performance of PredRSA using an independent test set of 68 proteins. Compared with the state-of-the-art approaches (SPINE-X and ASAquick), PredRSA achieves a significant improvement in prediction quality. Our experimental results show that the Gradient Boosted Regression Trees algorithm and the novel feature combination are quite effective in relative solvent accessibility prediction. The proposed PredRSA method could be useful in assisting the prediction of protein structures by applying the predicted RSA as restraints.
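
The two evaluation measures quoted in this abstract, MAE and PCC, and the RSA quantity itself can be written down in a few lines. The sketch below is only an illustration of the standard definitions, not code from PredRSA; the function names are hypothetical, RSA is taken as the accessible surface area of a residue divided by a reference maximum for that residue type (expressed as a percentage), and the reference maximum is left as an input because it depends on the scale chosen.

```python
from math import sqrt


def relative_accessibility(asa, max_asa):
    """RSA in percent: observed ASA over a reference maximum for the residue type."""
    return 100.0 * asa / max_asa


def mean_absolute_error(predicted, observed):
    """Mean of |predicted - observed| over all residues."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - o) for p, o in zip(predicted, observed)) / len(predicted)


def pearson_correlation(predicted, observed):
    """Pearson correlation coefficient between predicted and observed values."""
    if len(predicted) != len(observed) or len(predicted) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_p = sum(predicted) / len(predicted)
    mean_o = sum(observed) / len(observed)
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    return cov / sqrt(var_p * var_o)


if __name__ == "__main__":
    # Toy numbers, made up purely to exercise the functions.
    predicted = [10.0, 20.0, 35.0, 60.0]
    observed = [15.0, 10.0, 40.0, 55.0]
    print(mean_absolute_error(predicted, observed))
    print(pearson_correlation(predicted, observed))
```

With RSA expressed in percent, an MAE of 9.0% means the predicted and observed RSA values differ by nine percentage points on average.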

15.
Computational methods are used to determine the three-dimensional structure of the Agitoxin (AgTx2)-Shaker complex. In a first stage, a large number of models of the complex are generated using high temperature molecular dynamics, accounting for side chain flexibility with distance restraints deduced from thermodynamic analysis of double mutant cycles. Four plausible binding mode candidates are found using this procedure. In a second stage, the quality and validity of the resulting complexes are assessed by examining the stability of the binding modes during molecular dynamics simulations with explicit water molecules and by calculating the binding free energies of mutant proteins using a continuum solvent representation and comparing with experimental data. The docking protocol and the continuum solvent model are validated using the Barstar-Barnase and the lysozyme-antibody D1.2 complexes, for which there are high-resolution structures as well as double mutant data. This combination of computational methods permits the identification of two possible structural models of AgTx2 in complex with the Shaker K+ channel, additional structural analysis providing further evidence in favor of a single model. In this final complex, the toxin is bound to the extracellular entrance of the channel along the pore axis via a combination of hydrophobic, hydrogen bonding, and electrostatic interactions. The magnitude of the buried solvent accessible area corresponding to the protein-protein contact is on the order of 1000 Å², with roughly similar contributions from each of the four subunits. Some side chains of the toxin adopt a different conformation than in the experimental solution structure, indicating the importance of an induced fit upon the formation of the complex. In particular, the side chain of Lys-27, a residue highly conserved among scorpion toxins, points deep into the pore with its positively charged amino group positioned at the outer binding site for K+. Specific site-directed mutagenesis experiments are suggested to verify and confirm the structure of the toxin-channel complex.

16.
Protein interactions are often accompanied by significant changes in conformation. We have analyzed the relationships between protein structures and the conformational changes they undergo upon binding. Based upon this, we introduce a simple measure, the relative solvent accessible surface area, which can be used to predict the magnitude of binding-induced conformational changes from the structures of either monomeric proteins or bound subunits. Applying this to a large set of protein complexes suggests that large conformational changes upon binding are common. In addition, we observe considerable enrichment of intrinsically disordered sequences in proteins predicted to undergo large conformational changes. Finally, we demonstrate that the relative solvent accessible surface area of monomeric proteins can be used as a simple proxy for protein flexibility. This reveals a powerful connection between the flexibility of unbound proteins and their binding-induced conformational changes, consistent with the conformational selection model of molecular recognition.

17.
The tertiary structures of protein complexes provide crucial insight into the molecular mechanisms that regulate their functions and assembly. However, solving protein complex structures by experimental methods is often more difficult than solving single protein structures. Here, we have developed a novel computational multiple protein docking algorithm, Multi‐LZerD, that builds models of multimeric complexes by effectively reusing pairwise docking predictions of component proteins. A genetic algorithm is applied to explore the conformational space, followed by a structure refinement procedure. A benchmark on eleven hetero‐multimeric complexes resulted in near‐native conformations for all but one of them (a root mean square deviation smaller than 2.5 Å). We also show that our method copes with unbound docking cases well, outperforming the methodology that can be directly compared with our approach. Multi‐LZerD was able to predict near‐native structures for multimeric complexes of various topologies. Proteins 2012; © 2012 Wiley Periodicals, Inc.

18.
Protein complex prediction via cost-based clustering
MOTIVATION: Understanding principles of cellular organization and function can be enhanced if we detect known and predict still undiscovered protein complexes within the cell's protein-protein interaction (PPI) network. Such predictions may be used as an inexpensive tool to direct biological experiments. The increasing amount of available PPI data necessitates an accurate and scalable approach to protein complex identification. RESULTS: We have developed the Restricted Neighborhood Search Clustering Algorithm (RNSC) to efficiently partition networks into clusters using a cost function. We applied this cost-based clustering algorithm to PPI networks of Saccharomyces cerevisiae, Drosophila melanogaster and Caenorhabditis elegans to identify and predict protein complexes. We have determined functional and graph-theoretic properties of true protein complexes from the MIPS database. Based on these properties, we defined filters to distinguish between identified network clusters and true protein complexes. CONCLUSIONS: Our application of the cost-based clustering algorithm provides an accurate and scalable method of detecting and predicting protein complexes within a PPI network.

19.
Wang T, Wade RC. Proteins, 2003, 50(1): 158-169
The suitability of three implicit solvent models for flexible protein-protein docking by procedures using molecular dynamics simulation is investigated. The three models are (i) the generalized Born (GB) model implemented in the program AMBER6.0; (ii) a distance-dependent dielectric (DDD) model; and (iii) a surface area-dependent model that we have parameterized and call the NPSA model. This is a distance-dependent dielectric model modified by neutralizing the ionizable side-chains and adding a surface area-dependent solvation term. These solvent models were first tested in molecular dynamics simulations at 300 K of the native structures of barnase, barstar, segment B1 of protein G, and three WW domains. These protein structures display a range of secondary structure contents and stabilities. Then, to investigate the performance of the implicit solvent models in protein docking, molecular dynamics simulations of barnase/barstar complexation, as well as PIN1 WW domain/peptide complexation, were conducted, starting from separated unbound structures. The simulations show that the NPSA model has significant advantages over the DDD and GB models in maintaining the native structures of the proteins and providing more accurate docked complexes.
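
To make the distance-dependent dielectric (DDD) idea in this abstract concrete, here is a minimal sketch of a pairwise Coulomb energy in which the dielectric grows linearly with distance. The abstract does not give the functional form used by the authors, so the form eps(r) = scale * r, the default scale of 4, and the function name `ddd_energy` are assumptions chosen for illustration. The conversion factor of about 332.06 kcal·Å/(mol·e²) is the usual approximate value for charges in units of e and distances in Å.

```python
from itertools import combinations
from math import dist

# Approximate Coulomb conversion factor, kcal*Angstrom/(mol*e^2).
COULOMB_CONSTANT = 332.06


def ddd_energy(atoms, scale=4.0):
    """Electrostatic energy with a distance-dependent dielectric eps(r) = scale * r.

    atoms: list of (charge_in_e, (x, y, z)) with coordinates in Angstrom.
    Each pair contributes k * qi * qj / (eps(r) * r) = k * qi * qj / (scale * r**2),
    so interactions fall off faster than in a constant-dielectric model.
    """
    total = 0.0
    for (qi, pi), (qj, pj) in combinations(atoms, 2):
        r = dist(pi, pj)
        if r == 0.0:
            raise ValueError("two atoms share the same coordinates")
        total += COULOMB_CONSTANT * qi * qj / (scale * r * r)
    return total


if __name__ == "__main__":
    # A made-up ion pair 4 Angstrom apart.
    pair = [(+1.0, (0.0, 0.0, 0.0)), (-1.0, (4.0, 0.0, 0.0))]
    print(ddd_energy(pair))
```

The NPSA model described in the abstract adds a surface area-dependent solvation term and neutralizes ionizable side chains on top of such a dielectric; those parts are not sketched here because the abstract gives no parameters for them.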

20.
The coverage and reliability of protein-protein interactions determined by high-throughput experiments still need to be improved, especially for higher organisms. The question therefore persists of how interactions can be verified and predicted by computational approaches using available data on protein structural complexes. Recently we developed an approach called IBIS (Inferred Biomolecular Interaction Server) to predict and annotate protein-protein binding sites and interaction partners, which is based on the assumption that the structural location and sequence patterns of protein-protein binding sites are conserved between close homologs. In this study, we first confirmed the high accuracy of our method and found that its accuracy depends critically on the use of all available data on structures of homologous complexes, compared to approaches where only a non-redundant set of complexes is employed. Second, we showed that there is a trade-off between specificity and sensitivity if the prediction employs only evolutionarily conserved binding site clusters or clusters supported by only one observation (singletons). Finally, we addressed the question of identifying the biologically relevant interactions using the homology inference approach and demonstrated that a large majority of crystal packing interactions can be correctly identified and filtered by our algorithm. At the same time, about half of the biological interfaces that are not present in the protein crystallographic asymmetric unit can be reconstructed by IBIS from homologous complexes without prior knowledge of the crystal parameters of the query protein.
