首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 60 毫秒
1.
Understanding the parameters influencing the formation of transition state structures in proteins is an important problem in protein folding and kinetics. In this work, we have analyzed the structure-based parameters, surrounding hydrophobicity, secondary structure, solvent accessibility, number of medium- and long-range contacts, and surrounding residues for understanding the transition state structures of 15 proteins. The analysis of Φ-values shows that 29% of the studied 378 mutants have a Φ-value of more than 0.5. The combination of different structure-based parameters could discriminate the residues that have a Φ-value cutoff of more than 0.5 with a 5-fold cross-validation accuracy of 68%, which indicates that the surrounding residues and contacts play important roles in the formation of transition state structures. Systematic analysis on different proteins reveals that the proteins azurin, cold shock protein, and C-terminal domain of ribosomal protein L9 are influenced by the number of medium- and long-range proteins, whereas barnase, FK506 binding protein, and IM9 are influenced by surrounding residues. The discrimination accuracy lies in the ranges of 81–95% and 74–85% for these respective classes of protein. Furthermore, the combination of surrounding residues and contacts improved the accuracy up to 24% in other considered proteins. We suggest that the structure-based parameters along with noncovalent interactions and conservation of residues may aid in identifying the potential residues in the formation of transition state structures in proteins.  相似文献   

2.
3.
Selvaraj S  Gromiha MM 《Proteins》2004,55(4):1023-1035
Understanding the folding pathways of proteins is a challenging task. The Phi value approach provides a detailed understanding of transition-state structures of folded proteins. In this work, we have computed the hydrophobicity associated with each residue in the folded state of 16 two-state proteins and compared the Phi values of each mutant residue. We found that most of the residues with high Phi value coincide with local maximum in surrounding hydrophobicity, or have nearby residues that show such maximum in hydrophobicity, indicating the importance of hydrophobic interactions in the transition state. We have tested our approach to different structural classes of proteins, such as alpha-helical, SH3 domains of all-beta proteins, beta-sandwich, and alpha/beta proteins, and we observed a good agreement with experimental results. Further, we have proposed a hydrophobic contact network pattern to relate the Phi values with long-range contacts, which will be helpful to understand the transition-state structures of folded proteins. The present approach could be used to identify potential hydrophobic clusters that may form through long-range contacts during the transition state.  相似文献   

4.
We have been interested in whether three proteins that share a five-stranded beta-barrel "OB-fold" structural motif but no detectable sequence homology fold by similar mechanisms. Here we describe native-state hydrogen exchange experiments as a function of urea for SN (staphylococcal nuclease), a protein with an OB-fold motif and additional nonconserved elements of structure. The regions of structure with the largest stability and unfolding cooperativity are contained within the conserved OB-fold portion of SN, consistent with previous results for CspA (cold shock protein A) and LysN (anticodon binding domain of lysyl tRNA synthetase). The OB-fold also has the subset of residues with the slowest unfolding rates in the three proteins, as determined by hydrogen exchange experiments in the EX1 limit. Although the protein folding hierarchy is maintained at the level of supersecondary structure, it is not evident for individual residues as might be expected if folding depended on obligatory nucleation sites. Rather, the site-specific stability profiles appear to be linked to sequence hydrophobicity and to the density of long-range contacts at each site in the three-dimensional structures of the proteins. We discuss the implications of the correlation between stability to unfolding and conservation of structure for mechanisms of protein structure evolution.  相似文献   

5.
Detailed analyses of protein structures provide an opportunity to understand conformation and function in terms of amino acid sequence and composition. In this work, we have systematically analyzed the characteristic features of the amino acid residues found in alpha-helical coiled-coils and, in so doing, have developed indices for their properties, conformational parameters, surrounding hydrophobicity and flexibility. As expected, there is preference for hydrophobic (Ala, Leu), positive (Lys, Arg) and negatively (Glu) charged residues in coiled-coil domains. However, the surrounding hydrophobicity of residues in coiled-coil domains is significantly less than that for residues in other regions of coiled-coil proteins. The analysis of temperature factors in coiled-coil proteins shows that the residues in these domains are more stable than those in other regions. Further, we have delineated the medium- and long-range contacts in coiled-coil domains and compared the results with those obtained for other (non-coiled-coil) parts of the same proteins and non-coiled-coil helical segments of globular proteins. The residues in coiled-coil domains are largely influenced by medium-range contacts, whereas long-range interactions play a dominant role in other regions of these same proteins as well as in non-coiled-coil helices. We have also revealed the preference of amino acid residues to form cation-pi interactions and we found that Arg is more likely to form such interactions than Lys. The parameters developed in this work can be used to understand the folding and stability of coiled-coil proteins in general.  相似文献   

6.
The contact order is believed to be an important factor for understanding protein folding mechanisms. In our earlier work, we have shown that the long-range interactions play a vital role in protein folding. In this work, we analyzed the contribution of long-range contacts to determine the folding rate of two-state proteins. We found that the residues that are close in space and are separated by at least ten to 15 residues in sequence are important determinants of folding rates, suggesting the presence of a folding nucleus at an interval of approximately 25 residues. A novel parameter "long-range order" has been proposed to predict protein folding rates. This parameter shows as good a relationship with the folding rate of two-state proteins as contact order. Further, we examined the minimum limit of residue separation to determine the long-range contacts for different structural classes. We observed an excellent correlation between long-range order and folding rate for all classes of globular proteins. We suggest that in mixed-class proteins, a larger number of residues can serve as folding nuclei compared to all-alpha and all-beta proteins. A simple statistical method has been developed to predict the folding rates of two-state proteins using the long-range order that produces an agreement with experimental results that is better or comparable to other methods in the literature.  相似文献   

7.
In nature, 1 out of every 10 proteins has an (alpha/beta)(8) (TIM)-barrel fold, and in most cases, pairwise comparisons show no sequence similarity between them. Hence, delineating the key residues that induce very different sequences to share a common fold is important for understanding the folding and stability of TIM-barrel domains. In this work, we propose a new consensus approach for locating these stabilizing residues based on long-range interactions, hydrophobicity, and conservation of amino acid residues. We have identified 957 stabilizing residues in 63 proteins from a nonredundant set of 71 TIM-barrel domains. Most of these residues are located in the 8-stranded beta-sheet, with nearly one half of them oriented toward the interior of the barrel and the other half oriented toward the surrounding alpha-helices. Several stabilizing residues are found in the N- and C-terminal loops, whereas very few appear in the alpha-helices that surround the internal beta-sheet. Further, these 957 residues are placed in 434 stabilizing segments of various sizes, and each domain contains 1-10 of these segments. We found that 8 segments per domain is the most abundant one, and two thirds of the proteins have 7-9 stabilizing segments. Finally, we verified the identified residues with experimental temperature factors and found that these residues are among the ones with less mobility in the considered proteins. We suggest that our new protocol serves as a powerful tool to identify the stabilizing residues in TIM-barrel domains, which can be used as potential candidates for studying protein folding and stability by means of protein engineering experiments.  相似文献   

8.
Plaxco KW  Simons KT  Ruczinski I  Baker D 《Biochemistry》2000,39(37):11177-11183
The fastest simple, single domain proteins fold a million times more rapidly than the slowest. Ultimately this broad kinetic spectrum is determined by the amino acid sequences that define these proteins, suggesting that the mechanisms that underlie folding may be almost as complex as the sequences that encode them. Here, however, we summarize recent experimental results which suggest that (1) despite a vast diversity of structures and functions, there are fundamental similarities in the folding mechanisms of single domain proteins and (2) rather than being highly sensitive to the finest details of sequence, their folding kinetics are determined primarily by the large-scale, redundant features of sequence that determine a protein's gross structural properties. That folding kinetics can be predicted using simple, empirical, structure-based rules suggests that the fundamental physics underlying folding may be quite straightforward and that a general and quantitative theory of protein folding rates and mechanisms (as opposed to unfolding rates and thus protein stability) may be near on the horizon.  相似文献   

9.
Recognition of protein fold from amino acid sequence is a challenging task. The structure and stability of proteins from different fold are mainly dictated by inter-residue interactions. In our earlier work, we have successfully used the medium- and long-range contacts for predicting the protein folding rates, discriminating globular and membrane proteins and for distinguishing protein structural classes. In this work, we analyze the role of inter-residue interactions in commonly occurring folds of globular proteins in order to understand their folding mechanisms. In the medium-range contacts, the globin fold and four-helical bundle proteins have more contacts than that of DNA-RNA fold although they all belong to all-alpha class. In long-range contacts, only the ribonuclease fold prefers 4-10 range and the other folding types prefer the range 21-30 in alpha/beta class proteins. Further, the preferred residues and residue pairs influenced by these different folds are discussed. The information about the preference of medium- and long-range contacts exhibited by the 20 amino acid residues can be effectively used to predict the folding type of each protein.  相似文献   

10.
Folding rates of small single-domain proteins that fold through simple two-state kinetics can be estimated from details of the three-dimensional protein structure. Previously, predictions of secondary structure had been exploited to predict folding rates from sequence. Here, we estimate two-state folding rates from predictions of internal residue-residue contacts in proteins of unknown structure. Our estimate is based on the correlation between the folding rate and the number of predicted long-range contacts normalized by the square of the protein length. It is well known that long-range order derived from known structures correlates with folding rates. The surprise was that estimates based on very noisy contact predictions were almost as accurate as the estimates based on known contacts. On average, our estimates were similar to those previously published from secondary structure predictions. The combination of these methods that exploit different sources of information improved performance. It appeared that the combined method reliably distinguished fast from slow two-state folders.  相似文献   

11.

Background  

The functional selection and three-dimensional structural constraints of proteins in nature often relates to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins, provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated, structural alignments for evolutionary related, sequentially distant proteins.  相似文献   

12.
PUF proteins are a conserved group of sequence specific RNA-binding proteins that bind to RNA in a modular fashion. The RNA-binding domain of PUF proteins typically consists of eight clustered Puf repeats. Plant genomes code for large families of PUF proteins that show significant variability in their predicted Puf repeat number, organization, and amino acid sequence. Here we sought to determine whether the observed variability in the RNA-binding domains of four plant PUFs results in a preference for nonclassical PUF RNA target sequences. We report the identification of a novel RNA binding sequence for a nucleolar Arabidopsis PUF protein that contains an atypical RNA-binding domain. The Arabidopsis PUM23 (APUM23) binding sequence was 10 nucleotides in length, contained a centrally located UUGA core element, and had a preferred cytosine at nucleotide position 8. These RNA sequence characteristics differ from those of other PUF proteins, because all natural PUFs studied to date bind to RNAs that contain a conserved UGU sequence at their 5′ end and lack specificity for cytosine. Gel mobility shift assays validated the identity of the APUM23 binding sequence and supported the location of 3 of the 10 predicted Puf repeats in APUM23, including the cytosine-binding repeat. The preferred 10-nucleotide sequence bound by APUM23 is present within the 18S rRNA sequence, supporting the known role of APUM23 in 18S rRNA maturation. This work also reveals that APUM23, an ortholog of yeast Nop9, could provide an advanced structural backbone for Puf repeat engineering and target-specific regulation of cellular RNAs.  相似文献   

13.
Cation-pi interactions play an important role to the stability of protein structures. In our earlier work, we have analyzed the influence and energetic contribution of cation-pi interactions in three-dimensional structures of membrane proteins. In this work, we investigate the characteristic features of residues that are involved in cation-pi interactions. We have computed several parameters, such as surrounding hydrophobicity, number of long-range contacts, conservation score and normalized B-factor for all these residues and identified their location, whether in the membrane or at surface. We found that the cation-pi interactions are mainly formed by long-range interactions. The cationic residues involved in cation-pi interactions have higher surrounding hydrophobicity than their average values in the whole dataset and an opposite trend is observed for aromatic residues. In transmembrane helical proteins, except Phe, all other residues that are responsible for cation-pi interactions are highly conserved with other related protein sequences whereas in transmembrane strand proteins, an appreciable conservation is observed only for Arg. The analysis on the flexibility of residues reveals that the cation-pi interaction forming residues are more stable than other residues. The results obtained in the present study would be helpful to understand the role of cation-pi interactions in the structure and folding of membrane proteins.  相似文献   

14.
Folding landscapes of ankyrin repeat proteins: experiments meet theory   总被引:5,自引:0,他引:5  
Nearly 6% of eukaryotic protein sequences contain ankyrin repeat (AR) domains, which consist of several repeats and often function in binding. AR proteins show highly cooperative folding despite a lack of long-range contacts. Both theory and experiment converge to explain that formation of the interface between elements is more favorable than formation of any individual repeat unit. IkappaBalpha and Notch both undergo partial folding upon binding perhaps influencing the binding free energy. The simple architecture, combined with identification of consensus residues that are important for stability, has enabled systematic perturbation of the energy landscape by single point mutations that affect stability or by addition of consensus repeats. The folding energy landscapes appear highly plastic, with small perturbations re-routing folding pathways.  相似文献   

15.
Due to Plaxco, Simons, Baker and others, it is now well known that the two-state single domain protein folding rate is fairly well predicted from knowledge of the topology of the native structure. Plaxco et al found that the folding rates of two-state proteins correlate with the average degree to which native contacts are 'local' within the chain sequence: fast-folders usually have mostly local structures. Here, we dissected the native topology further by focusing on non-local and local contacts using lower and upper bounds of allowable sequence separation in computing the average contact order. We analyzed non-local and local contacts of 82 two-state proteins whose experimental folding rates span over six orders of magnitude. We observed that both the number of non-local contacts and the average sequence separation of non-local contacts (non-local CO) are both negatively correlated with the folding rate, showing that the non-local contacts dominate the barrier-crossing process. Surprisingly, the local contact orders of the proteins also correlate with the folding rates. However, this correlation shows a strong positive trend indicating the role of a diffusive search in the denatured basin.  相似文献   

16.
Many proteins consist of subdomains that can fold and function independently. We investigate here the interaction between the two high mobility group (HMG) box subdomains of the nuclear protein rHMG1. An HMG box is a conserved amino acid sequence of approximately 80 amino acids rich in basic, aromatic and proline side chains that is active in binding DNA in a sequence or structure-specific manner. In the case of HMG1, each box can bind structural DNA substrates including four-way junctions (4WJs) and branched or kinked DNA duplexes. Since proteins containing up to six HMG boxes are known, the question arises whether linking subdomains together influences the folding or function of individual boxes. In an effort to understand interactions between individual DNA-binding domains in HMG1, we created new fusion proteins: one is an inversion of the order of the AB di-domain in HMG1 (BA); in the second, we added a third A domain C-terminal to the AB di-domain (ABA). Pairs of boxes, AB or BA, behave similarly and are functionally active. By contrast, the ABA triple subdomain construct is partially unfolded and is less active than individual boxes or di-domains. Thus, long-range inter-domain effects can influence the activity of HMG boxes.  相似文献   

17.
Potato type II serine proteinase inhibitors are proteins that consist of multiple sequence repeats, and exhibit a multidomain structure. The structural domains are circular permutations of the repeat sequence, as a result of intramolecular domain swapping. Structural studies give indications for the origins of this folding behaviour, and the evolution of the inhibitor family.  相似文献   

18.
Divergence in function of homologous proteins is based on both sequence and structural changes. Overall enzyme function has been reported to diverge earlier (50% sequence identity) than overall structure (35%). We herein study the functional conservation of enzymes and non-enzyme sequences using the protein domain families in CATH-Gene3D. Despite the rapid increase in sequence data since the last comprehensive study by Tian and Skolnick, our findings suggest that generic thresholds of 40% and 60% aligned sequence identity are still sufficient to safely inherit third-level and full Enzyme Commission numbers, respectively. This increases to 50% and 70% on the domain level, unless the multi-domain architecture matches. Assignments from the Kyoto Encyclopedia of Genes and Genomes and the Munich Information Center for Protein Sequences Functional Catalogue seem to be less conserved with sequence, probably due to a more pathway-centric view: 80% domain sequence identity is required for safe function transfer. Comparing domains (more pairwise relationships) and the use of family-specific thresholds (varying evolutionary speeds) yields the highest coverage rates when transferring functions to model proteomes. An average twofold increase in enzyme annotations is seen for 523 proteomes in Gene3D. As simple ‘rules of thumb’, sequence identity thresholds do not require a bioinformatics background. We will provide and update this information with future releases of CATH-Gene3D.  相似文献   

19.
Rat micro class glutathione transferases M1-1 and M2-2 are homodimers that share a 78% sequence identity but display differences in stability. M1-1 is more stable at the secondary and tertiary structural levels, whereas its quaternary structure is less stable. Each subunit in these proteins consists of two structurally distinct domains with intersubunit contacts occurring between domain 1 of one subunit and domain 2 of the other subunit. The chimeric subunit variants M(12), which has domain 1 of M1 and domain 2 of M2, and its complement M(21), were used to investigate the conformational stability of the chimeric homodimers M(12)-(12) and M(21)-(21) to determine the contribution of each domain toward stability. Exchanging entire domains between class micro GSTs is accommodated by the GST fold. Urea-induced equilibrium unfolding data indicate that whereas the class micro equilibrium unfolding mechanism (i.e., N(2) <--> 2I <--> 2U) is not altered, domain exchanges impact significantly on the conformational stability of the native dimers and monomeric folding intermediates. Data for the wild-type and chimeric proteins indicate that the order of stability for the native dimer (N(2)) is M2-2 > M(12)-(12) M1-1 approximately M(21)-(21), and that the order of stability of the monomeric intermediate (I) is M1 > M2 approximately M(12) > M(21). Interactions involving Arg 77, which is topologically conserved in GSTs, appear to play an important role in the stability of both the native dimeric and folding monomeric structures.  相似文献   

20.
The caspase recruitment domain (CARD) of Apaf-1 binds to the CARD of caspase-9 to trigger a proteolytic cascade that leads to apoptotic cell death. We report the crystal structure of the Apaf-1 CARD at 1. 3 A resolution, solved in a two-element multiwavelength anomalous dispersion (MAD) X-ray diffraction experiment. This CARD adopts a six-helix bundle fold with Greek key topology surrounding an extensive hydrophobic core. This fold, which we call the "death fold", is found in other domains that mediate interactions in apoptotic signaling despite very low sequence identity. From a structure-based alignment, we identify conserved patterns that characterize the death fold and its subclasses. Like the Ig-fold, it provides a rigid structural scaffold upon which diverse recognition surfaces are assembled.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号