首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 696 毫秒
1.
Rapid progress in structural modeling of proteins and their interactions is powered by advances in knowledge-based methodologies along with better understanding of physical principles of protein structure and function. The pool of structural data for modeling of proteins and protein–protein complexes is constantly increasing due to the rapid growth of protein interaction databases and Protein Data Bank. The GWYRE (Genome Wide PhYRE) project capitalizes on these developments by advancing and applying new powerful modeling methodologies to structural modeling of protein–protein interactions and genetic variation. The methods integrate knowledge-based tertiary structure prediction using Phyre2 and quaternary structure prediction using template-based docking by a full-structure alignment protocol to generate models for binary complexes. The predictions are incorporated in a comprehensive public resource for structural characterization of the human interactome and the location of human genetic variants. The GWYRE resource facilitates better understanding of principles of protein interaction and structure/function relationships. The resource is available at http://www.gwyre.org.  相似文献   

2.
For many years it has been accepted that the sequence of a protein can specify its three-dimensional structure. However, there has been limited progress in explaining how the sequence dictates its fold and no attempt to do this computationally without the use of specific structural data has ever succeeded for any protein larger than 100 residues. We describe a method that can predict complex folds up to almost 200 residues using only basic principles that do not include any elements of sequence homology. The method does not simulate the folding chain but generates many thousands of models based on an idealized representation of structure. Each rough model is scored and the best are refined. On a set of five proteins, the correct fold score well and when tested on a set of larger proteins, the correct fold was ranked highest for some proteins more than 150 residues, with others being close topological variants. All other methods that approach this level of success rely on the use of templates or fragments of known structures. Our method is unique in using a database of ideal models based on general packing rules that, in spirit, is closer to an ab initio approach.  相似文献   

3.
Predicting protein sequences that fold into specific native three-dimensional structures is a problem of great potential complexity. Although the complete solution is ultimately rooted in understanding the physical chemistry underlying the complex interactions between amino acid residues that determine protein stability, recent work shows that empirical information about these first principles is embedded in the statistics of protein sequence and structure databases. This review focuses on the use of 'knowledge-based' potentials derived from these databases in designing proteins. In addition, the data suggest how the study of these empirical potentials might impact our fundamental understanding of the energetic principles of protein structure.  相似文献   

4.
5.
Prediction of protein tertiary structures from amino acid sequence and understanding the mechanisms of how proteins fold, collectively known as “the protein folding problem,” has been a grand challenge in molecular biology for over half a century. Theories have been developed that provide us with an unprecedented understanding of protein folding mechanisms. However, computational simulation of protein folding is still difficult, and prediction of protein tertiary structure from amino acid sequence is an unsolved problem. Progress toward a satisfying solution has been slow due to challenges in sampling the vast conformational space and deriving sufficiently accurate energy functions. Nevertheless, several techniques and algorithms have been adopted to overcome these challenges, and the last two decades have seen exciting advances in enhanced sampling algorithms, computational power and tertiary structure prediction methodologies. This review aims at summarizing these computational techniques, specifically conformational sampling algorithms and energy approximations that have been frequently used to study protein-folding mechanisms or to de novo predict protein tertiary structures. We hope that this review can serve as an overview on how the protein-folding problem can be studied computationally and, in cases where experimental approaches are prohibitive, help the researcher choose the most relevant computational approach for the problem at hand. We conclude with a summary of current challenges faced and an outlook on potential future directions.  相似文献   

6.
An important puzzle in structural biology is the question of how proteins are able to fold so quickly into their unique native structures. There is much evidence that protein folding is hierarchic. In that case, folding routes are not linear, but have a tree structure. Trees are commonly used to represent the grammatical structure of natural language sentences, and chart parsing algorithms efficiently search the space of all possible trees for a given input string. Here we show that one such method, the CKY algorithm, can be useful both for providing novel insight into the physical protein folding process, and for computational protein structure prediction. As proof of concept, we apply this algorithm to the HP lattice model of proteins. Our algorithm identifies all direct folding route trees to the native state and allows us to construct a simple model of the folding process. Despite its simplicity, our model provides an account for the fact that folding rates depend only on the topology of the native state but not on sequence composition.  相似文献   

7.
Subbian E  Yabuta Y  Shinde U 《Biochemistry》2004,43(45):14348-14360
Subtilisin E (SbtE) is a member of the ubiquitous superfamily of serine proteases called subtilases and serves as a model for understanding propeptide-mediated protein folding mechanisms. Unlike most proteins that adopt thermodynamically stable conformations, the native state of SbtE is trapped into a kinetically stable conformation. While kinetic stability offers distinct functional advantages to the native state, the constraints that dictate the selection between kinetic and thermodynamic folding and stability remain unknown. Using highly conserved subtilases, we demonstrate that adaptive evolution of sequence dictates selection of folding pathways. Intracellular and extracellular serine proteases (ISPs and ESPs, respectively) constitute two subfamilies within the family of subtilases that have highly conserved sequences, structures, and catalytic activities. Our studies on the folding pathways of subtilisin E (SbtE), an ESP, and its homologue intracellular serine protease 1 (ISP1), an ISP, show that although topology, contact order, and hydrophobicity that drive protein folding reactions are conserved, ISP1 and SbtE fold through significantly different pathways and kinetics. While SbtE absolutely requires the propeptide to fold into a kinetically trapped conformer, ISP1 folds to a thermodynamically stable state more than 1 million times faster and independent of a propeptide. Furthermore, kinetics establish that ISP1 and SbtE fold through different intermediate states. An evolutionary analysis of folding constraints in subtilases suggests that observed differences in folding pathways may be mediated through positive selection of specific residues that map mostly onto the protein surface. Together, our results demonstrate that closely related subtilases can fold through distinct pathways and mechanisms, and suggest that fine sequence details can dictate the choice between kinetic and thermodynamic folding and stability.  相似文献   

8.
There is a fundamental conflict between two different views of how proteins fold. Kinetic experiments and theoretical calculations are often interpreted in terms of different population fractions folding through different intermediates in independent unrelated pathways (IUP model). However, detailed structural information indicates that all of the protein population folds through a sequence of intermediates predetermined by the foldon substructure of the target protein and a sequential stabilization principle. These contrary views can be resolved by a predetermined pathway--optional error (PPOE) hypothesis. The hypothesis is that any pathway intermediate can incorporate a chance misfolding error that blocks folding and must be reversed for productive folding to continue. Different fractions of the protein population will then block at different steps, populate different intermediates, and fold at different rates, giving the appearance of multiple unrelated pathways. A test of the hypothesis matches the two models against extensive kinetic folding results for hen lysozyme which have been widely cited in support of independent parallel pathways. The PPOE model succeeds with fewer fitting constants. The fitted PPOE reaction scheme leads to known folding behavior, whereas the IUP properties are contradicted by experiment. The appearance of a conflict with multipath theoretical models seems to be due to their different focus, namely on multitrack microscopic behavior versus cooperative macroscopic behavior. The integration of three well-documented principles in the PPOE model (cooperative foldons, sequential stabilization, optional errors) provides a unifying explanation for how proteins fold and why they fold in that way.  相似文献   

9.
Protein folding and misfolding: mechanism and principles   总被引:1,自引:0,他引:1  
Two fundamentally different views of how proteins fold are now being debated. Do proteins fold through multiple unpredictable routes directed only by the energetically downhill nature of the folding landscape or do they fold through specific intermediates in a defined pathway that systematically puts predetermined pieces of the target native protein into place? It has now become possible to determine the structure of protein folding intermediates, evaluate their equilibrium and kinetic parameters, and establish their pathway relationships. Results obtained for many proteins have serendipitously revealed a new dimension of protein structure. Cooperative structural units of the native protein, called foldons, unfold and refold repeatedly even under native conditions. Much evidence obtained by hydrogen exchange and other methods now indicates that cooperative foldon units and not individual amino acids account for the unit steps in protein folding pathways. The formation of foldons and their ordered pathway assembly systematically puts native-like foldon building blocks into place, guided by a sequential stabilization mechanism in which prior native-like structure templates the formation of incoming foldons with complementary structure. Thus the same propensities and interactions that specify the final native state, encoded in the amino-acid sequence of every protein, determine the pathway for getting there. Experimental observations that have been interpreted differently, in terms of multiple independent pathways, appear to be due to chance misfolding errors that cause different population fractions to block at different pathway points, populate different pathway intermediates, and fold at different rates. This paper summarizes the experimental basis for these three determining principles and their consequences. Cooperative native-like foldon units and the sequential stabilization process together generate predetermined stepwise pathways. Optional misfolding errors are responsible for 3-state and heterogeneous kinetic folding.  相似文献   

10.
Several fold recognition algorithms are compared to each other in terms of prediction accuracy and significance. It is shown that on standard benchmarks, hybrid methods, which combine scoring based on sequence-sequence and sequence-structure matching, surpass both sequence and threading methods in the number of accurate predictions. However, the sequence similarity contributes most to the prediction accuracy. This strongly argues that most examples of apparently nonhomologous proteins with similar folds are actually related by evolution. While disappointing from the perspective of the fundamental understanding of protein folding, this adds a new significance to fold recognition methods as a possible first step in function prediction. Despite hybrid methods being more accurate at fold prediction than either the sequence or threading methods, each of the methods is correct in some cases where others have failed. This partly reflects a different perspective on sequence/structure relationship embedded in various methods. To combine predictions from different methods, estimates of significance of predictions are made for all methods. With the help of such estimates, it is possible to develop a "jury" method, which has accuracy higher than any of the single methods. Finally, building full three-dimensional models for all top predictions helps to eliminate possible false positives where alignments, which are optimal in the one-dimensional sequences, lead to unsolvable sterical conflicts for the full three-dimensional models.  相似文献   

11.
The influence of native connectivity of secondary structure elements (SSE) on folding is studied using coarse-grained models of proteins with mixed alpha and beta structure and the analysis of the structural database of wild-type proteins. We found that the distribution of SSE along a sequence determines the diversity of folding pathways. If alpha and beta SSE are localized in different parts of a sequence, the diversity of folding pathways is restricted. An even (symmetric) distribution of alpha and beta SSE with respect to sequence midpoint favors multiple folding routes. Simulations are supplemented by the database analysis of the distribution of SSE in wild-type protein sequences. On an average, two-thirds of wild-type proteins with mixed alpha and beta structure have symmetric distribution of alpha and beta SSE. The propensity for symmetric distribution of SSE is especially evident for large proteins with the number of SSE > or = 10. We suggest that symmetric SSE distribution in protein sequences may arise due to nearly random allocation of alpha and beta structure along wild-type sequences. The tendency of long sequences to misfold is perhaps compensated by the enhanced pathway diversity. In addition, folding pathways are shown to progress via hierarchic assembly of SSE in accordance with their proximity along a sequence. We demonstrate that under mild denaturation conditions folding and unfolding pathways are similar. However, the reversibility of folding/unfolding pathways is shown to depend on the distribution of SSE. If alpha and beta SSE are localized in different parts of a sequence, folding and unfolding pathways are likely to coincide.  相似文献   

12.
BACKGROUND: Are folding pathways conserved in protein families? To test this explicitly and ask to what extent structure specifies folding pathways requires comparison of proteins with a common fold. Our strategy is to choose members of a highly diverse protein family with no conservation of function and little or no sequence identity, but with structures that are essentially the same. The immunoglobulin-like fold is one of the most common structural families, and is subdivided into superfamilies with no detectable evolutionary or functional relationship. RESULTS: We compared the folding of a number of immunoglobulin-like proteins that have a common structural core and found a strong correlation between folding rate and stability. The results suggest that the folding pathways of these immunoglobulin-like proteins share common features. CONCLUSIONS: This study is the first to compare the folding of structurally related proteins that are members of different superfamilies. The most likely explanation for the results is that interactions that are important in defining the structure of immunoglobulin-like proteins are also used to guide folding.  相似文献   

13.
The use of simple theoretical models has provided a considerable contribution to our present understanding of the means by which proteins adopt their native fold from the plethora of available unfolded states. A common assumption in building computationally tractable models has been the neglect of stabilizing non-native interactions in the class of models described as "Gō-like." The focus of this study is the characterization of the folding of a number of proteins via a Gō-like model, which aims to map a maximal amount of information reflecting the protein sequence onto a "minimalist" skeleton. This model is shown to contain sufficient information to reproduce the folding transition states of a number of proteins, including topologically analogous proteins that fold via different transition states. Remarkably, these models also demonstrate consistency with the general features of folding transition states thought to be stabilized by non-native interactions. This suggests that native interactions are the primary determinant of most protein folding transition states, and that non-native interactions lead only to local structural perturbations. A prediction is also included for an asymmetrical folding transition state of bacteriophage lambda protein W, which has yet to be subjected to experimental characterization.  相似文献   

14.
蛋白质的二级结构预测研究进展   总被引:1,自引:0,他引:1  
唐媛  李春花  张瑗  尚进  邹凌云  李立奇 《生物磁学》2013,(26):5180-5182
认识蛋白质的二级结构是了解蛋白质的折叠模式和三级结构的基础,并为研究蛋白质的功能以及它们之间的相互作用模式提供结构基础,同时还可以为新药研发提供帮助。故研究蛋白质的二级结构具有重要的意义。随着后基因组时代的到来,越来越多的蛋白质序列不断被发现,给蛋白质的二级结构研究带来巨大的挑战和研究空间。而依靠传统的实验方法很难获取大规模蛋白质的二级结构信息。目前,采用生物信息学手段仍然是获得大部分蛋白质二级结构的途径。近年来,许多研究者通过构建用于二级结构预测的蛋白质数据集,计算、提取蛋白质的各种特征信息,并采用不同的预测算法预测蛋白质的二级结构得到了快速的发展。本文拟从蛋白质的特征信息的提取与筛选、预测算法以及预测效果的检验方法等方面进行综述,介绍蛋白质二级结构预测领域的研究进展。相信随着基因组学、蛋白质组学和生物信息学的不断发展,蛋白质二级结构预测会不断取得新突破。  相似文献   

15.
MOTIVATION: Experimentalists have amassed extensive evidence over the past four decades that proteins appear to fold during production by the ribosome. Protein structure prediction methods, however, do not incorporate this property of folding. A thorough study to find the fingerprint of such sequential folding is the first step towards using it in folding algorithms, so assisting structure prediction. RESULTS: We explore computationally the existence of evidence for cotranslational folding, based on large sets of experimentally determined structures in the PDB. Our perspective is that cotranslational folding is the norm, but that the effect is masked in most classes. We show that it is most evident in alpha/beta proteins, confirming recent findings. We also find mild evidence that older proteins may fold cotranslationally. A tool is provided for determining, within a protein, where cotranslation is most evident.  相似文献   

16.
In this article we review current understanding of basic principles for the folding of membrane proteins, focusing on the more abundant alpha-helical class. Membrane proteins, vital to many biological functions and implicated in numerous diseases, fold into their active conformations in the complex environment of the cell bilayer membrane. While many membrane proteins rely on the translocon and chaperone proteins to fold correctly, others can achieve their functional form in the absence of any translation apparatus or other aides. Nevertheless, the spontaneous folding process is not well understood at the molecular level. Recent findings suggest that helix fraying and loop formation may be important for overall structure, dynamics and regulation of function. Several types of membrane helices with ionizable amino acids change their topology with pH. Additionally we note that some peptides, including many that are rich in arginine, and a particular analogue of gramicidin, are able passively to translocate across cell membranes. The findings indicate that a final protein structure in a lipid-bilayer membrane is sequence-based, with lipids contributing to stability and regulation. While much progress has been made toward understanding the folding process for alpha-helical membrane proteins, it remains a work in progress. This article is part of a Special Issue entitled: Emergence of Complex Behavior in Biomembranes edited by Marjorie Longo.  相似文献   

17.
D J Thomas 《FEBS letters》1992,307(1):10-13
Certain concepts and misconceptions in the field of protein folding are discussed from the viewpoint of a theoretical physicist. It is argued that there can be no protein folding code and that perceived correlations between sequence or composition and three-dimensional structure are more likely to be an artefact of a limited database than a real result. Attempts at using molecular dynamics algorithms are also likely to produce artefactual results because results depend critically on the unknown hamiltonian energy function. Correct calculations of configurational entropy are thought to be the most likely next step in understanding how and why proteins fold.  相似文献   

18.
MOTIVATION: A method for recognizing the three-dimensional fold from the protein amino acid sequence based on a combination of hidden Markov models (HMMs) and secondary structure prediction was recently developed for proteins in the Mainly-Alpha structural class. Here, this methodology is extended to Mainly-Beta and Alpha-Beta class proteins. Compared to other fold recognition methods based on HMMs, this approach is novel in that only secondary structure information is used. Each HMM is trained from known secondary structure sequences of proteins having a similar fold. Secondary structure prediction is performed for the amino acid sequence of a query protein. The predicted fold of a query protein is the fold described by the model fitting the predicted sequence the best. RESULTS: After model cross-validation, the success rate on 44 test proteins covering the three structural classes was found to be 59%. On seven fold predictions performed prior to the publication of experimental structure, the success rate was 71%. In conclusion, this approach manages to capture important information about the fold of a protein embedded in the length and arrangement of the predicted helices, strands and coils along the polypeptide chain. When a more extensive library of HMMs representing the universe of known structural families is available (work in progress), the program will allow rapid screening of genomic databases and sequence annotation when fold similarity is not detectable from the amino acid sequence. AVAILABILITY: FORESST web server at http://absalpha.dcrt.nih.gov:8008/ for the library of HMMs of structural families used in this paper. FORESST web server at http://www.tigr.org/ for a more extensive library of HMMs (work in progress). CONTACT: valedf@tigr.org; munson@helix.nih.gov; garnier@helix.nih.gov  相似文献   

19.
Ultrafast folding proteins have served an important role in benchmarking molecular dynamics simulations and testing protein folding theories. These proteins are simple enough and fold fast enough that realistic simulations are possible, which facilitates the direct comparison of absolute folding rates and folding mechanisms with those observed experimentally. Such comparisons have achieved remarkable success, but have also revealed the shortcomings that remain in experiment, theory and simulation alike. Some ultrafast folding proteins may fold without encountering an activation barrier (downhill folding), allowing the exploration of the molecular timescale of folding and the roughness of the energy landscape. The biological significance of ultrafast folding remains uncertain, but its practical significance is crucial to progress in understanding how proteins fold.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号