首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 93 毫秒
1.
A number of computational approaches have been developed to reengineer promising chimeric proteins one at a time through targeted point mutations. In this article, we introduce the computational procedure IPRO (iterative protein redesign and optimization procedure) for the redesign of an entire combinatorial protein library in one step using energy-based scoring functions. IPRO relies on identifying mutations in the parental sequences, which when propagated downstream in the combinatorial library, improve the average quality of the library (e.g., stability, binding affinity, specific activity, etc.). Residue and rotamer design choices are driven by a globally convergent mixed-integer linear programming formulation. Unlike many of the available computational approaches, the procedure allows for backbone movement as well as redocking of the associated ligands after a prespecified number of design iterations. IPRO can also be used, as a limiting case, for the redesign of a single or handful of individual sequences. The application of IPRO is highlighted through the redesign of a 16-member library of Escherichia coli/Bacillus subtilis dihydrofolate reductase hybrids, both individually and through upstream parental sequence redesign, for improving the average binding energy. Computational results demonstrate that it is indeed feasible to improve the overall library quality as exemplified by binding energy scores through targeted mutations in the parental sequences.  相似文献   

2.
Despite years of effort, the problem of predicting the conformations of protein side chains remains a subject of inquiry. This problem has three major issues, namely defining the conformations that a side chain may adopt within a protein, developing a sampling procedure for generating possible side‐chain packings, and defining a scoring function that can rank these possible packings. To solve the former of these issues, most procedures rely on a rotamer library derived from databases of known protein structures. We introduce an alternative method that is free of statistics. We begin with a rotamer library that is based only on stereochemical considerations; this rotamer library is then optimized independently for each protein under study. We show that this optimization step restores the diversity of conformations observed in native proteins. We combine this protein‐dependent rotamer library (PDRL) method with the self‐consistent mean field (SCMF) sampling approach and a physics‐based scoring function into a new side‐chain prediction method, SCMF–PDRL. Using two large test sets of 831 and 378 proteins, respectively, we show that this new method compares favorably with competing methods such as SCAP, OPUS‐Rota, and SCWRL4 for energy‐minimized structures. Proteins 2014; 82:2000–2017. © 2014 Wiley Periodicals, Inc.  相似文献   

3.
Scansite identifies short protein sequence motifs that are recognized by modular signaling domains, phosphorylated by protein Ser/Thr- or Tyr-kinases or mediate specific interactions with protein or phospholipid ligands. Each sequence motif is represented as a position-specific scoring matrix (PSSM) based on results from oriented peptide library and phage display experiments. Predicted domain-motif interactions from Scansite can be sequentially combined, allowing segments of biological pathways to be constructed in silico. The current release of Scansite, version 2.0, includes 62 motifs characterizing the binding and/or substrate specificities of many families of Ser/Thr- or Tyr-kinases, SH2, SH3, PDZ, 14-3-3 and PTB domains, together with signature motifs for PtdIns(3,4,5)P(3)-specific PH domains. Scansite 2.0 contains significant improvements to its original interface, including a number of new generalized user features and significantly enhanced performance. Searches of all SWISS-PROT, TrEMBL, Genpept and Ensembl protein database entries are now possible with run times reduced by approximately 60% when compared with Scansite version 1.0. Scansite 2.0 allows restricted searching of species-specific proteins, as well as isoelectric point and molecular weight sorting to facilitate comparison of predictions with results from two-dimensional gel electrophoresis experiments. Support for user-defined motifs has been increased, allowing easier input of user-defined matrices and permitting user-defined motifs to be combined with pre-compiled Scansite motifs for dual motif searching. In addition, a new series of Sequence Match programs for non-quantitative user-defined motifs has been implemented. Scansite is available via the World Wide Web at http://scansite.mit.edu.  相似文献   

4.
Hartmann C  Antes I  Lengauer T 《Proteins》2009,74(3):712-726
We describe a scoring and modeling procedure for docking ligands into protein models that have either modeled or flexible side-chain conformations. Our methodical contribution comprises a procedure for generating new potentials of mean force for the ROTA scoring function which we have introduced previously for optimizing side-chain conformations with the tool IRECS. The ROTA potentials are specially trained to tolerate small-scale positional errors of atoms that are characteristic of (i) side-chain conformations that are modeled using a sparse rotamer library and (ii) ligand conformations that are generated using a docking program. We generated both rigid and flexible protein models with our side-chain prediction tool IRECS and docked ligands to proteins using the scoring function ROTA and the docking programs FlexX (for rigid side chains) and FlexE (for flexible side chains). We validated our approach on the forty screening targets of the DUD database. The validation shows that the ROTA potentials are especially well suited for estimating the binding affinity of ligands to proteins. The results also show that our procedure can compensate for the performance decrease in screening that occurs when using protein models with side chains modeled with a rotamer library instead of using X-ray structures. The average runtime per ligand of our method is 168 seconds on an Opteron V20z, which is fast enough to allow virtual screening of compound libraries for drug candidates.  相似文献   

5.
We have recently completed systematic molecular dynamics simulations of 807 different proteins representing 95% of the known autonomous protein folds in an effort we refer to as Dynameomics. Here we focus on the analysis of side chain conformations and dynamics to create a dynamic rotamer library. Overall this library is derived from 31,000 occurrences of each of 86,217 different residues, or 2.7 × 10(9) rotamers. This dynamic library has 74% overlap of rotamer distributions with rotamer libraries derived from static high-resolution crystal structures. Seventy-five percent of the residues had an assignable primary conformation, and 68% of the residues had at least one significant alternate conformation. The average correlation time for switching between rotamers ranged from 22 ps for Met to over 8 ns for Cys; this time decreased 20-fold on the surface of the protein and modestly for dihedral angles further from the main chain. Side chain S(2) axis order parameters were calculated and they correlated well with those derived from NMR relaxation experiments (R = 0.9). Relationships relating the S(2) axis order parameters to rotamer occupancy were derived. Overall the Dynameomics rotamer library offers a comprehensive depiction of side chain rotamer preferences and dynamics in solution, and more realistic distributions for dynamic proteins in solution at ambient temperature than libraries derived from crystal structures, in particular charged surface residues are better represented. Details of the rotamer library are presented here and the library itself can be downloaded at http://www.dynameomics.org.  相似文献   

6.
Rotamer libraries are used in protein structure determination, prediction, and design. The backbone-dependent rotamer library consists of rotamer frequencies, mean dihedral angles, and variances as?a function of the backbone dihedral angles. Structure prediction and design methods that employ backbone flexibility would strongly benefit from smoothly varying probabilities and angles. A new version of the?backbone-dependent rotamer library has been developed using adaptive kernel density estimates for the rotamer frequencies and adaptive kernel regression for the mean dihedral angles and variances. This formulation allows for evaluation of the rotamer probabilities, mean angles, and variances as?a smooth and continuous function of phi and psi. Continuous probability density estimates for the nonrotameric degrees of freedom of amides, carboxylates, and aromatic side chains have been modeled as a function of the backbone dihedrals and rotamers of the remaining degrees of freedom. New backbone-dependent rotamer libraries at varying levels of smoothing are available from http://dunbrack.fccc.edu.  相似文献   

7.
Side chain prediction is an integral component of computational antibody design and structure prediction. Current antibody modelling tools use backbone‐dependent rotamer libraries with conformations taken from general proteins. Here we present our antibody‐specific rotamer library, where rotamers are binned according to their immunogenetics (IMGT) position, rather than their local backbone geometry. We find that for some amino acid types at certain positions, only a restricted number of side chain conformations are ever observed. Using this information, we are able to reduce the breadth of the rotamer sampling space. Based on our rotamer library, we built a side chain predictor, position‐dependent antibody rotamer swapper (PEARS). On a blind test set of 95 antibody model structures, PEARS had the highest average χ1 and accuracy (78.7% and 64.8%) compared to three leading backbone‐dependent side chain predictors. Our use of IMGT position, rather than backbone ϕ/ψ, meant that PEARS was more robust to errors in the backbone of the model structure. PEARS also achieved the lowest number of side chain–side chain clashes. PEARS is freely available as a web application at http://opig.stats.ox.ac.uk/webapps/pears .  相似文献   

8.
Accurate prediction of the placement and comformations of protein side chains given only the backbone trace has a wide range of uses in protein design, structure prediction, and functional analysis. Prediction has most often relied on discrete rotamer libraries so that rapid fitness of side-chain rotamers can be assessed against some scoring function. Scoring functions are generally based on experimental parameters from small-molecule studies or empirical parameters based on determined protein structures. Here, we describe the NCN algorithm for predicting the placement of side chains. A predominantly first-principles approach was taken to develop the potential energy function incorporating van der Waals and electrostatics based on the OPLS parameters, and a hydrogen bonding term. The only empirical knowledge used is the frequency of rotameric states from the PDB. The rotamer library includes nearly 50,000 rotamers, and is the most extensive discrete library used to date. Although the computational time tends to be longer than most other algorithms, the overall accuracy exceeds all algorithms in the literature when placing rotamers on an accurate backbone trace. Considering only the most buried residues, 80% of the total residues tested, the placement accuracy reaches 92% for chi(1), and 83% for chi(1 + 2), and an overall RMS deviation of 1 A. Additionally, we show that if information is available to restrict chi(1) to one rotamer well, then this algorithm can generate structures with an average RMS deviation of 1.0 A for all heavy side-chains atoms and a corresponding overall chi(1 + 2) accuracy of 85.0%.  相似文献   

9.
10.
De novo design of the hydrophobic cores of proteins.   总被引:22,自引:17,他引:5       下载免费PDF全文
We have developed and experimentally tested a novel computational approach for the de novo design of hydrophobic cores. A pair of computer programs has been written, the first of which creates a "custom" rotamer library for potential hydrophobic residues, based on the backbone structure of the protein of interest. The second program uses a genetic algorithm to globally optimize for a low energy core sequence and structure, using the custom rotamer library as input. Success of the programs in predicting the sequences of native proteins indicates that they should be effective tools for protein design. Using these programs, we have designed and engineered several variants of the phage 434 cro protein, containing five, seven, or eight sequence changes in the hydrophobic core. As controls, we have produced a variant consisting of a randomly generated core with six sequence changes but equal volume relative to the native core and a variant with a "minimalist" core containing predominantly leucine residues. Two of the designs, including one with eight core sequence changes, have thermal stabilities comparable to the native protein, whereas the third design and the minimalist protein are significantly destabilized. The randomly designed control is completely unfolded under equivalent conditions. These results suggest that rational de novo design of hydrophobic cores is feasible, and stress the importance of specific packing interactions for the stability of proteins. A surprising aspect of the results is that all of the variants display highly cooperative thermal denaturation curves and reasonably dispersed NMR spectra. This suggests that the non-core residues of a protein play a significant role in determining the uniqueness of the folded structure.  相似文献   

11.
Intelligent materials that can undergo physical gelation in response to environmental stimuli have potential impacts in the bioengineering and biomedical fields where the entrapment of cellular or molecular species is desired. Here, we utilize atomic force microscopy (AFM) to perform molecular level investigations of designer artificial proteins that undergo physical gelation. These are engineered as triblock copolymers with independent interchain binding and solvent retention functions, namely, two terminal leucine zipper-like peptide sequences and a central alanylglycine rich sequence, respectively. AFM force measurements between probes and surfaces functionalized with molecules of this triblock protein revealed adhesive interactions that increased in average force and frequency as the pH was lowered from pH 11.2 to 7.4 to 4.5, reflecting an increase in the numbers of interacting molecular strands. In bulk solution, lowering the pH results in a viscous liquid to gel transition. The modular design of the triblock protein was also exploited for single molecule force spectroscopy investigations, which revealed altered intramolecular interactions in response to changes in pH. An increased understanding of the inter- and intramolecular forces involved in biomolecule driven gelation processes is not only of great fundamental interest in the study of the biomolecular systems involved but may also prove key in enabling the rational design of new generations of intelligent hydrogel systems.  相似文献   

12.
We present a Bayesian statistical analysis of the conformations of side chains in proteins from the Protein Data Bank. This is an extension of the backbone-dependent rotamer library, and includes rotamer populations and average chi angles for a full range of phi, psi values. The Bayesian analysis used here provides a rigorous statistical method for taking account of varying amounts of data. Bayesian statistics requires the assumption of a prior distribution for parameters over their range of possible values. This prior distribution can be derived from previous data or from pooling some of the present data. The prior distribution is combined with the data to form the posterior distribution, which is a compromise between the prior distribution and the data. For the chi 2, chi 3, and chi 4 rotamer prior distributions, we assume that the probability of each rotamer type is dependent only on the previous chi rotamer in the chain. For the backbone-dependence of the chi 1 rotamers, we derive prior distributions from the product of the phi-dependent and psi-dependent probabilities. Molecular mechanics calculations with the CHARMM22 potential show a strong similarity with the experimental distributions, indicating that proteins attain their lowest energy rotamers with respect to local backbone-side-chain interactions. The new library is suitable for use in homology modeling, protein folding simulations, and the refinement of X-ray and NMR structures.  相似文献   

13.
Pendley SS  Yu YB  Cheatham TE 《Proteins》2009,74(3):612-629
The alpha-helical coiled-coil is one of the most common oligomerization motifs found in both native and engineered proteins. To better understand the stability and dynamics of the coiled-coil motifs, including those modified by fluorination, several fluorinated and nonfluorinated parallel dimeric coiled-coil protein structures were designed and modeled. We also attempt to investigate how changing the length and geometry of the important stabilizing salt bridges influences the coiled-coil protein structure. Molecular dynamics (MD) and free energy simulations with AMBER used a particle mesh Ewald treatment of the electrostatics in explicit TIP3P solvent with balanced force field treatments. Preliminary studies with legacy force fields (ff94, ff96, and ff99) show a profound instability of the coiled-coil structures in short MD simulation. Significantly, better behavior is evident with the more balanced ff99SB and ff03 protein force fields. Overall, the results suggest that the coiled-coil structures can readily accommodate the larger acidic arginine or S-2,7-diaminoheptanedoic acid mutants in the salt bridge, whereas substitution of the smaller L-ornithine residue leads to rapid disruption of the coiled-coil structure on the MD simulation time scale. This structural distortion of the secondary structure allows both the formation of large hydration pockets proximal to the charged groups and within the hydrophobic core. Moreover, the increased structural fluctuations and movement lead to a decrease in the water occupancy lifetimes in the hydration pockets. In contrast, analysis of the hydration in the stable dimeric coiled-coils shows high occupancy water sites along the backbone residues with no water occupancy in the hydrophobic core, although transitory water interactions with the salt bridge residues are evident. The simulations of the fluorinated coiled-coils suggest that in some cases fluorination electrostatically stabilizes the intermolecular coiled-coil salt bridges. Structural analyses also reveal different side chain rotamer preferences for leucine when compared with 5,5,5,5',5',5'-hexafluoroleucine mutants. These observed differences in the side chain rotamer populations suggest differential changes in the side chain conformational entropy upon coiled-coil formation when the protein is fluorinated. The free energy of hydration of the isolated 5,5,5,5',5',5'-hexafluoroleucine amino acid is calculated to be 1.1 kcal/mol less stable than leucine; this hydrophobic penalty in the monomer may provide a driving force for coiled-coil dimer formation. Estimation of the ellipticity at 222 nm from a series of snapshots from the MD simulations with DicroCalc shows distinct increases in the ellipticity when the coiled-coil is fluorinated, which suggests that the helicity in the folded coiled-coils is greater when fluorinated.  相似文献   

14.
Side-chain modeling with an optimized scoring function   总被引:1,自引:0,他引:1       下载免费PDF全文
Modeling side-chain conformations on a fixed protein backbone has a wide application in structure prediction and molecular design. Each effort in this field requires decisions about a rotamer set, scoring function, and search strategy. We have developed a new and simple scoring function, which operates on side-chain rotamers and consists of the following energy terms: contact surface, volume overlap, backbone dependency, electrostatic interactions, and desolvation energy. The weights of these energy terms were optimized to achieve the minimal average root mean square (rms) deviation between the lowest energy rotamer and real side-chain conformation on a training set of high-resolution protein structures. In the course of optimization, for every residue, its side chain was replaced by varying rotamers, whereas conformations for all other residues were kept as they appeared in the crystal structure. We obtained prediction accuracy of 90.4% for chi(1), 78.3% for chi(1 + 2), and 1.18 A overall rms deviation. Furthermore, the derived scoring function combined with a Monte Carlo search algorithm was used to place all side chains onto a protein backbone simultaneously. The average prediction accuracy was 87.9% for chi(1), 73.2% for chi(1 + 2), and 1.34 A rms deviation for 30 protein structures. Our approach was compared with available side-chain construction methods and showed improvement over the best among them: 4.4% for chi(1), 4.7% for chi(1 + 2), and 0.21 A for rms deviation. We hypothesize that the scoring function instead of the search strategy is the main obstacle in side-chain modeling. Additionally, we show that a more detailed rotamer library is expected to increase chi(1 + 2) prediction accuracy but may have little effect on chi(1) prediction accuracy.  相似文献   

15.
Motivation. Protein design aims to identify sequences compatible with a given protein fold but incompatible to any alternative folds. To select the correct sequences and to guide the search process, a design scoring function is critically important. Such a scoring function should be able to characterize the global fitness landscape of many proteins simultaneously. RESULTS: To find optimal design scoring functions, we introduce two geometric views and propose a formulation using a mixture of non-linear Gaussian kernel functions. We aim to solve a simplified protein sequence design problem. Our goal is to distinguish each native sequence for a major portion of representative protein structures from a large number of alternative decoy sequences, each a fragment from proteins of different folds. Our scoring function discriminates perfectly a set of 440 native proteins from 14 million sequence decoys. We show that no linear scoring function can succeed in this task. In a blind test of unrelated proteins, our scoring function misclassfies only 13 native proteins out of 194. This compares favorably with about three-four times more misclassifications when optimal linear functions reported in the literature are used. We also discuss how to develop protein folding scoring function.  相似文献   

16.
Laederach A  Reilly PJ 《Proteins》2005,60(4):591-597
We have a limited understanding of the details of molecular recognition of carbohydrates by proteins, which is critical to a multitude of biological processes. Furthermore, carbohydrate-modifying proteins such as glycosyl hydrolases and phosphorylases are of growing importance as potential drug targets. Interactions between proteins and carbohydrates have complex thermodynamics, and in general the specific positioning of only a few hydroxyl groups determines their binding affinities. A thorough understanding of both carbohydrate and protein structures is thus essential to predict these interactions. An atomic-level view of carbohydrate recognition through structures of carbohydrate-active enzymes complexed with transition-state inhibitors reveals some of the distinctive molecular features unique to protein-carbohydrate complexes. However, the inherent flexibility of carbohydrates and their often water-mediated hydrogen bonding to proteins makes simulation of their complexes difficult. Nonetheless, recent developments such as the parameterization of specific force fields and docking scoring functions have greatly improved our ability to predict protein-carbohydrate interactions. We review protein-carbohydrate complexes having defined molecular requirements for specific carbohydrate recognition by proteins, providing an overview of the different computational techniques available to model them.  相似文献   

17.
Braun P  Goldberg E  Negron C  von Jan M  Xu F  Nanda V  Koder RL  Noy D 《Proteins》2011,79(2):463-476
The cyclic tetrapyrroles, viz. chlorophylls (Chl), their bacterial analogs bacteriochlorophylls, and hemes are ubiquitous cofactors of biological catalysis that are involved in a multitude of reactions. One systematic approach for understanding how Nature achieves functional diversity with only this handful of cofactors is by designing de novo simple and robust protein scaffolds with heme and/or (bacterio)chlorophyll [(B)Chls]-binding sites. This strategy is currently mostly implemented for heme-binding proteins. To gain more insight into the factors that determine heme-/(B)Chl-binding selectivity, we explored the geometric parameters of (B)Chl-binding sites in a nonredundant subset of natural (B)Chl protein structures. Comparing our analysis to the study of a nonredundant database of heme-binding helical histidines by Negron et al. (Proteins 2009;74:400-416), we found a preference for the m-rotamer in (B)Chl-binding helical histidines, in contrast to the preferred t-rotamer in heme-binding helical histidines. This may be used for the design of specific heme- or (B)Chl-binding sites in water-soluble helical bundles, because the rotamer type defines the positioning of the bound cofactor with respect to the helix interface and thus the protein-binding site. Consensus sequences for (B)Chl binding were identified by combining a computational and database-derived approach and shown to be significantly different from the consensus sequences recommended by Negron et al. (Proteins 2009;74:400-416) for heme-binding helical proteins. The insights gained in this work on helix- (B)Chls-binding pockets provide useful guidelines for the construction of reasonable (B)Chl-binding protein templates that can be optimized by computational tools.  相似文献   

18.
Detecting similarities between local binding surfaces can facilitate identification of enzyme binding sites and prediction of enzyme functions, and aid in our understanding of enzyme mechanisms. Constructing a template of local surface characteristics for a specific enzyme function or binding activity is a challenging task, as the size and shape of the binding surfaces of a biochemical function often vary. Here we introduce the concept of signature binding pockets, which captures information on preserved and varied atomic positions at multiresolution levels. For proteins with complex enzyme binding and activity, multiple signatures arise naturally in our model, forming a signature basis set that characterizes this class of proteins. Both signatures and signature basis sets can be automatically constructed by a method called SOLAR (Signature Of Local Active Regions). This method is based on a sequence-order-independent alignment of computed binding surface pockets. SOLAR also provides a structure-based multiple sequence fragment alignment to facilitate the interpretation of computed signatures. By studying a family of evolutionarily related proteins, we show that for metzincin metalloendopeptidase, which has a broad spectrum of substrate binding, signature and basis set pockets can be used to discriminate metzincins from other enzymes, to predict the subclass of metzincins functions, and to identify specific binding surfaces. Studying unrelated proteins that have evolved to bind to the same NAD cofactor, we constructed signatures of NAD binding pockets and used them to predict NAD binding proteins and to locate NAD binding pockets. By measuring preservation ratio and location variation, our method can identify residues and atoms that are important for binding affinity and specificity. In both cases, we show that signatures and signature basis set reveal significant biological insight.  相似文献   

19.
Patterns of receptor-ligand interaction can be conserved in functionally equivalent proteins even in the absence of sequence homology. Therefore, structural comparison of ligand-binding pockets and their pharmacophoric features allow for the characterization of so-called "orphan" proteins with known three-dimensional structure but unknown function, and predict ligand promiscuity of binding pockets. We present an algorithm for rapid pocket comparison (PoLiMorph), in which protein pockets are represented by self-organizing graphs that fill the volume of the cavity. Vertices in these three-dimensional frameworks contain information about the local ligand-receptor interaction potential coded by fuzzy property labels. For framework matching, we developed a fast heuristic based on the maximum dispersion problem, as an alternative to techniques utilizing clique detection or geometric hashing algorithms. A sophisticated scoring function was applied that incorporates knowledge about property distributions and ligand-receptor interaction patterns. In an all-against-all virtual screening experiment with 207 pocket frameworks extracted from a subset of PDBbind, PoLiMorph correctly assigned 81% of 69 distinct structural classes and demonstrated sustained ability to group pockets accommodating the same ligand chemotype. We determined a score threshold that indicates "true" pocket similarity with high reliability, which not only supports structure-based drug design but also allows for sequence-independent studies of the proteome.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号