首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
This paper provides an unbiased comparison of four commercially available programs for loop sampling, Prime, Modeler, ICM, and Sybyl, each of which uses a different modeling protocol. The study assesses the quality of results and examines the relative strengths and weaknesses of each method. The set of loops to be modeled varied in length from 4-12 amino acids. The approaches used for loop modeling can be classified into two methodologies: ab initio loop generation (Modeler and Prime) and database searches (Sybyl and ICM). Comparison of the modeled loops to the native structures was used to determine the accuracy of each method. All of the protocols returned similar results for short loop lengths (four to six residues), but as loop length increased, the quality of the results varied among the programs. Prime generated loops with RMSDs <2.5 A for loops up to 10 residues, while the other three methods met the 2.5 A criteria at seven-residue loops. Additionally, the ability of the software to utilize disulfide bonds and X-ray crystal packing influenced the quality of the results. In the final analysis, the top-ranking loop from each program was rarely the loop with the lowest RMSD with respect to the native template, revealing a weakness in all programs to correctly rank the modeled loops.  相似文献   

2.
3.
When researchers build high-quality models of protein structure from sequence homology, it is today common to use several alternative target-template alignments. Several methods can, at least in theory, utilize information from multiple templates, and many examples of improved model quality have been reported. However, to our knowledge, thus far no study has shown that automatic inclusion of multiple alignments is guaranteed to improve models without artifacts. Here, we have carried out a systematic investigation of the potential of multiple templates to improving homology model quality. We have used test sets consisting of targets from both recent CASP experiments and a larger reference set. In addition to Modeller and Nest, a new method (Pfrag) for multiple template-based modeling is used, based on the segment-matching algorithm from Levitt's SegMod program. Our results show that all programs can produce multi-template models better than any of the single-template models, but a large part of the improvement is simply due to extension of the models. Most of the remaining improved cases were produced by Modeller. The most important factor is the existence of high-quality single-sequence input alignments. Because of the existence of models that are worse than any of the top single-template models, the average model quality does not improve significantly. However, by ranking models with a model quality assessment program such as ProQ, the average quality is improved by approximately 5% in the CASP7 test set.  相似文献   

4.
Stumpff-Kane AW  Maksimiak K  Lee MS  Feig M 《Proteins》2008,70(4):1345-1356
Protein structure refinement from comparative models with the goal of predicting structures at near-experimental accuracy remains an unsolved problem. Structure refinement might be achieved with an iterative protocol where the most native-like structure from a set of decoys generated from an initial model in one cycle is used as the starting structure for the next cycle. Conformational sampling based on the coarse-grained SICHO model, atomic level of detail molecular dynamics simulations, and normal-mode analysis is compared in the context of such a protocol. All of the sampling methods can achieve significant refinement close to experimental structures, although the distribution of structures and the ability to reach native-like structures differs greatly. Implications for the practical application of such sampling methods and the requirements for scoring functions in an iterative refinement protocol are analyzed in the context of theoretical predictions for the distribution of protein-like conformations with a random sampling protocol.  相似文献   

5.
We describe a database of protein structure alignments as well as methods and tools that use this database to improve comparative protein modeling. The current version of the database contains 105 alignments of similar proteins or protein segments. The database comprises 416 entries, 78,495 residues, 1,233 equivalent entry pairs, and 230,396 pairs of equivalent alignment positions. At present, the main application of the database is to improve comparative modeling by satisfaction of spatial restraints implemented in the program MODELLER (?ali A, Blundell TL, 1993, J Mol Biol 234:779–815). To illustrate the usefulness of the database, the restraints on the conformation of a disulfide bridge provided by an equivalent disulfide bridge in a related structure are derived from the alignments; the prediction success of the disulfide dihedral angle classes is increased to approximately 80%, compared to approximately 55% for modeling that relies on the stereochemistry of disulfide bridges alone. The second example of the use of the database is the derivation of the probability density function for comparative modeling of the cis/trans isomerism of the proline residues; the prediction success is increased from 0% to 82.9% for cis-proline and from 93.3% to 96.2% for trans-proline. The database is available via electronic mail.  相似文献   

6.
We propose a realistic coarse-grained protein model and a technique to "anchor" the model to available experimental data. We apply this procedure to characterize the effect of multiple mutations on the folding mechanism of protein S6. We show that the mutation of a few "gatekeeper" residues triggers significant changes on the folding landscape of S6. These results suggest that gatekeeper residues control the flexibility of critical regions of S6, that in turn regulates the delicate balance between folding and aggregation. Although obtained with a minimalist protein model, these results are fully consistent with experimental evidence and offer a clue to understand the interplay between folding and aggregation in protein S6.  相似文献   

7.
The protein structures of six comparative modeling targets were predicted in a procedure that relied on improved energy minimization, without empirical rules, to position all new atoms. The structures of human nucleoside diphosphate kinase NM23-H2, HPr from Mycoplasma capricolum, 2Fe-2S ferredoxin from Haloarcula marismortui, eosinophil-derived neurotoxin (EDN), mouse cellular retinoic acid protein I (CRABP1), and P450eryf were predicted with root mean square deviations on Cα atoms of 0.69, 0.73, 1.11, 1.48, 1.69, and 1.73 Å, respectively, compared to the target crystal structures. These differences increased as the sequence similarity between the target and parent proteins decreased from about 60 to 20% identity. More residues were predicted than form the common region shared by the two crystal structures. In most cases insertions or deletions between the target and the related protein of known structure were not correctly positioned. One two residue insertion in CRABP1 was predicted in the correct conformation, while a nine residue insertion in EDN was predicted in the correct spatial region, although not in the correct conformation. The positions of common cofactors and their binding sites were predicted correctly, even when overall sequence similarity was low. © 1995 Wiley-Liss, Inc.  相似文献   

8.
The structure of a chaperonin caging a substrate protein is not quite clear. We made engineered group II chaperonins fused with a guest protein and analyzed their structural and functional features. Thermococcus sp. KS-1 chaperonin alpha-subunit (TCP) which forms an eightfold symmetric double-ring structure was used. Expression plasmids were constructed which carried two or four TCP genes ligated head to tail in phase and a target protein gene at the 3' end of the linked TCP genes. Electron microscopy showed that the expressed gene products with the molecular sizes of ~120 kDa (di-TCP) and ~230 kDa (tetra-TCP) formed double-ring complexes similar to those of wild-type TCP. The tetra-TCP retained ATPase activity and its thermostability was significantly higher than that of the wild type. A 260-kDa fusion protein of tetra-TCP and green fluorescent protein (GFP, 27 kDa) was able to form the double-ring complexes with green fluorescence. Image analyses indicated that the GFP moiety of tetra-TCP/GFP fusion protein was accommodated in the central cavity, and tetra-TCP/GFP formed the closed-form similar to that crystallographically resolved in group II chaperonins. Furthermore, it was suggested that caging GFP expanded the cavity around the bottom. Using this tetra-TCP fusion strategy, two virus structural proteins (21-25 kDa) toxic to host cells or two antibody fragments (25-36 kDa) prone to aggregate were well expressed in the soluble fraction of Escherichia coli. These fusion products also assembled to double-ring complexes, suggesting encapsulation of the guest proteins. The antibody fragments liberated by site-specific protease digestion exhibited ligand-binding activities.  相似文献   

9.
Over the last few years we have developed an empirical potential function that solves the protein structure recognition problem: given the sequence for an n-residue globular protein and a collection of plausible protein conformations, including the native conformation for that sequence, identify the correct, native conformation. Having determined this potential on the basis of only some 6500 native/nonnative pairs of structures for 58 proteins, we find it recognizes the native conformation for essentially all compact, soluble, globular proteins having known native conformations in comparisons with 104 to 106 reasonable alternative conformations apiece. In this sense, the potential encodes nearly all the essential features of globular protein conformational preference. In addition it “knows” about many additional factors in protein folding, such as the stabilization of multimeric proteins, quaternary structure, the role of disulfide bridges and ligands, proproteins vs. processed proteins, and minimal strand lengths in globular proteins. Comparisons are made with other sorts of protein folding problems, and applications in protein conformational determination and prediction are discussed. © 1994 Wiley-Liss, Inc.  相似文献   

10.
利用生物信息学方法分析了杉木CCoAOMT蛋白的氨基酸组成、等电点、疏水/亲水区、二级结构等蛋白质性质,并同欧洲云杉、拟南芥和水稻的蛋白进行了对比,并用生物信息学软件对其空间结构进行了模拟,同时对模建结果进行了结构质量的分析与检测。结果表明:该蛋白共有255个氨基酸,等电点为5.56,二级结构中α螺旋占37.65%,β折叠片占19.61%,无规卷曲占42.75%。三维结构检测表明此模型的结构符合立体化学规则。  相似文献   

11.
Physical principles determining the protein structure and protein folding are reviewed: (i) the molecular theory of protein secondary structure and the method of its prediction based on this theory; (ii) the existence of a limited set of thermodynamically favourable folding patterns of α- and β-regions in a compact globule which does not depend on the details of the amino acid sequence; (iii) the moderns approaches to the prediction of the folding patterns of α- and β-regions in concrete proteins; (iv) experimental approaches to the mechanism of protein folding. The review reflects theoretical and experimental works of the author and his collaborators as well as those of other groups.  相似文献   

12.
Hu C  Koehl P  Max N 《Proteins》2011,79(10):2828-2843
The three‐dimensional structure of a protein is organized around the packing of its secondary structure elements. Predicting the topology and constructing the geometry of structural motifs involving α‐helices and/or β‐strands are therefore key steps for accurate prediction of protein structure. While many efforts have focused on how to pack helices and on how to sample exhaustively the topologies and geometries of multiple strands forming a β‐sheet in a protein, there has been little progress on generating native‐like packings of helices on sheets. We describe a method that can generate the packing of multiple helices on a given β‐sheet for αβα sandwich type protein folds. This method mines the results of a statistical analysis of the conformations of αβ2 motifs in protein structures to provide input values for the geometric attributes of the packing of a helix on a sheet. It then proceeds with a geometric builder that generates multiple arrangements of the helices on the sheet of interest by sampling through these values and performing consistency checks that guarantee proper loop geometry between the helices and the strands, minimal number of collisions between the helices, and proper formation of a hydrophobic core. The method is implemented as a module of ProteinShop. Our results show that it produces structures that are within 4–6 Å RMSD of the native one, regardless of the number of helices that need to be packed, though this number may increase if the protein has several helices between two consecutive strands in the sequence that pack on the sheet formed by these two strands. Proteins 2011; Published 2011 Wiley‐Liss, Inc.  相似文献   

13.
NMR offers the possibility of accurate secondary structure for proteins that would be too large for structure determination. In the absence of an X-ray crystal structure, this information should be useful as an adjunct to protein fold recognition methods based on low resolution force fields. The value of this information has been tested by adding varying amounts of artificial secondary structure data and threading a sequence through a library of candidate folds. Using a literature test set, the threading method alone has only a one-third chance of producing a correct answer among the top ten guesses. With realistic secondary structure information, one can expect a 60-80% chance of finding a homologous structure. The method has then been applied to examples with published estimates of secondary structure. This implementation is completely independent of sequence homology, and sequences are optimally aligned to candidate structures with gaps and insertions allowed. Unlike work using predicted secondary structure, we test the effect of differing amounts of relatively reliable data.  相似文献   

14.
Mark E. Snow 《Proteins》1993,15(2):183-190
A novel scheme for the parameterization of a type of “potential energy” function for protein molecules is introduced. The function is parameterized based on the known conformations of previously determined protein structures and their sequence similarity to a molecule whose conformation is to be calculated. Once parameterized, minima of the potential energy function can be located using a version of simulated annealing which has been previously shown to locate global and near-global minima with the given functional form. As a test problem, the potential was parameterized based on the known structures of the rubredoxins from Desulfovibrio vulgaris, Desulfovibrio desulfuricans, and Clostridium pasteurianum, which vary from 45 to 54 amino acids in length, and the sequence alignments of these molecules with the rubredoxin sequence from Desulfovibrio gigas. Since the Desulfovibrio gigas rubredeoxin conformation has also been determined, it is possible to check the accuracy of the results. Ten simulated-annealing runs from random starting conformations were performed. Seven of the 10 resultant conformations have an all-Cα rms deviation from the crystallographically determined conformation of less than 1.7 Å. For five of the structures, the rms deviation is less than 0.8 Å. Four of the structures have conformations which are virtually identical to each other except for the position of the carboxy-terminal residue. This is also the conformation which is achieved if the determined crystal structure is minimized with the same potential. The all-Cα rms difference between the crystal and minimized crystal structures is 0.6 Å. It is further observed that the “energies” of the structures according to the potential function exhibit a strong correlation with rms deviation from the native structure. The conformations of the individual model structures and the computational aspects of the modeling procedure are discussed. © 1993 Wiley-Liss, Inc.  相似文献   

15.
Protein structure prediction is based mainly on the modeling of proteins by homology to known structures; this knowledgebased approach is the most promising method to date. Although it is used in the whole area of protein research, no general rules concerning the quality and applicability of concepts and procedures used in homology modeling have been put forward yet. Therefore, the main goal of the present work is to provide tools for the assessment of accuracy of modeling at a given level of sequence homology. A large set of known structures from different conformational and functional classes, but various degrees of homology was selected. Pairwise structure superpositions were performed. Starting with the definition of the structurally conserved regions and determination of topologically correct sequence alignments, we correlated geometrical properties with sequence homology (defined by the 250 PAM Dayhoff Matrix) and identity. It is shown that both the topological differences of the protein backbones and the relative positions of corresponding side chains diverge with decreasing sequence identity. Below 50% identity, the deviation in regions that are structurally not conserved continually increases, thus implying that with decreasing sequence identity modeling has to take into account more and more structurally diverging loop regions that are difficult to predict. © 1993 Wiley-Liss, Inc.  相似文献   

16.
We developed a method for structure characterization of assembly components by iterative comparative protein structure modeling and fitting into cryo-electron microscopy (cryoEM) density maps. Specifically, we calculate a comparative model of a given component by considering many alternative alignments between the target sequence and a related template structure while optimizing the fit of a model into the corresponding density map. The method relies on the previously developed Moulder protocol that iterates over alignment, model building, and model assessment. The protocol was benchmarked using 20 varied target-template pairs of known structures with less than 30% sequence identity and corresponding simulated density maps at resolutions from 5A to 25A. Relative to the models based on the best existing sequence profile alignment methods, the percentage of C(alpha) atoms that are within 5A of the corresponding C(alpha) atoms in the superposed native structure increases on average from 52% to 66%, which is half-way between the starting models and the models from the best possible alignments (82%). The test also reveals that despite the improvements in the accuracy of the fitness function, this function is still the bottleneck in reducing the remaining errors. To demonstrate the usefulness of the protocol, we applied it to the upper domain of the P8 capsid protein of rice dwarf virus that has been studied by cryoEM at 6.8A. The C(alpha) root-mean-square deviation of the model based on the remotely related template, bluetongue virus VP7, improved from 8.7A to 6.0A, while the best possible model has a C(alpha) RMSD value of 5.3A. Moreover, the resulting model fits better into the cryoEM density map than the initial template structure. The method is being implemented in our program MODELLER for protein structure modeling by satisfaction of spatial restraints and will be applicable to the rapidly increasing number of cryoEM density maps of macromolecular assemblies.  相似文献   

17.
A key concept in template‐based modeling (TBM) is the high correlation between sequence and structural divergence, with the practical consequence that homologous proteins that are similar at the sequence level will also be similar at the structural level. However, conformational diversity of the native state will reduce the correlation between structural and sequence divergence, because structural variation can appear without sequence diversity. In this work, we explore the impact that conformational diversity has on the relationship between structural and sequence divergence. We find that the extent of conformational diversity can be as high as the maximum structural divergence among families. Also, as expected, conformational diversity impairs the well‐established correlation between sequence and structural divergence, which is nosier than previously suggested. However, we found that this noise can be resolved using a priori information coming from the structure‐function relationship. We show that protein families with low conformational diversity show a well‐correlated relationship between sequence and structural divergence, which is severely reduced in proteins with larger conformational diversity. This lack of correlation could impair TBM results in highly dynamical proteins. Finally, we also find that the presence of order/disorder can provide useful beforehand information for better TBM performance.  相似文献   

18.
Structural proteomics aims to understand the structural basis of protein interactions and functions. A prerequisite for this is the availability of 3D protein structures that mediate the biochemical interactions. The explosion in the number of available gene sequences set the stage for the next step in genome-scale projects – to obtain 3D structures for each protein. To achieve this ambitious goal, the slow and costly structure determination experiments are supplemented with theoretical approaches. The current state and recent advances in structure modeling approaches are reviewed here, with special emphasis on comparative protein structure modeling techniques.  相似文献   

19.
利用Modeller7v7软件对米根霉(Rhizopus oryzoe)富马酸酶(fumarase)进行了三级结构的同源建模并对结果的空间和能量上的合理性进行了验证,进一步对酶的结构域和催化活性位点进行了研究。结果表明:富马酸酶由三个结构域组成,中心区域为一个由五个几乎平行的α螺旋组成的独特的束型结构,其催化活性位点是由三个亚基上的氨基酸相互靠近共同组成的。为以后有针对性的进行富马酸酶的定点突变提高富马酸产量提供分子水平上的理论指导。  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号