首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The PE_PGRS family of proteins unique to mycobacteria is demonstrated to contain multiple calcium-binding and glycine-rich sequence motifs GGXGXD/NXUX. This sequence repeat constitutes a calcium-binding parallel beta-roll or parallel beta-helix structure and is found in RTX toxins secreted by many Gram-negative bacteria. It is predicted that the highly homologous PE PGRS proteins containing multiple copies of the nona-peptide motif could fold into similar calcium-binding structures. The implication of the predicted calcium-binding property of PE PGRS proteins in the light of macrophage-pathogen interaction and pathogenesis is presented.  相似文献   

2.
Knowledge of the fold class of a protein is valuable because fold class gives an indication of protein function and evolution. Fold class can be accurately determined from a crystal structure or NMR structure, though these methods are expensive, time-consuming, and inapplicable to all proteins. In contrast, vibrational spectra [infra-red, Raman, or Raman optical activity (ROA)] are rapidly obtained for proteins under wide range of biological molecules under diverse experimental and physiological conditions. Here, we show that the fold class of a protein can be determined from Raman or ROA spectra by converting a spectrum into data of 10 cm−1 bin widths and applying the random forest machine learning algorithm. Spectral data from 605 and 1785 cm−1 were analyzed, as well as the amide I, II, and III regions in isolation and in combination. ROA amide II and III data gave the best performance, with 33 of 44 proteins assigned to one of the correct four top-level structural classification of proteins (SCOP) fold class (all α, all β, α and β, and disordered). The method also shows which spectral regions are most valuable in assigning fold class.  相似文献   

3.
Ichimaru T  Kikuchi T 《Proteins》2003,51(4):515-530
It is a general notion that proteins with very similar three-dimensional structures would show very similar folding kinetics. However, recent studies reveal that the folding kinetic properties of some proteins contradict this thought (i.e., the members in a same protein family fold through different pathways). For example, it has been reported that some beta-proteins in the intracellular lipid-binding protein family fold through quite different pathways (Burns et al., Proteins 1998;33:107-118). Similar differences in folding kinetics are also observed in the members of the globin family (Nishimura et al., Nat Struct Biol 2000;7:679-686). In our study, we examine the possibility of predicting qualitative differences in folding kinetics of the intracellular lipid-binding proteins and two globin proteins (i.e., myoglobin and leghemoglobin). The problem is tackled by means of a contact map based on the average distance statistics between residues, the Average Distance Map (ADM), as constructed from sequence. The ADMs for the three proteins show overall similarity, but some local differences among maps are also observed. Our results demonstrate that some properties of the protein folding kinetics are consistent with local differences in the ADMs. We also discuss the general possibility of predicting folding kinetics from sequence information.  相似文献   

4.
We used sequence and structural comparisons to determine the fold for eukaryotic ornithine decarboxylase, which we found is related to alanine racemase. These enzymes have no detectable sequence identity with any protein of known structure, including three pyridoxal phosphate-utilizing enzymes. Our studies suggest that the N-terminal domain of ornithine decarboxylase folds into a beta/alpha-barrel. Through the analysis of known barrel structures we developed a topographic model of the pyridoxal phosphate-binding domain of ornithine decarboxylase, which predicts that the Schiff base lysine and a conserved glycine-rich sequence both map to the C-termini of the beta-strands. Other residues in this domain that are likely to have essential roles in catalysis, substrate, and cofactor binding were also identified, suggesting that this model will be a suitable guide to mutagenic analysis of the enzyme mechanism.  相似文献   

5.
We studied the evolutionary relationships between gamma-carbonic anhydrase (gamma-CA) and a very diverse group of proteins that share the sequence motif characteristic of the left-handed parallel beta-helix (LbetaH) fold. This sequence motif is characterized by the imperfect tandem repetition of short hexapeptide units, which makes it difficult to obtain a reliable alignment based on sequence information alone. To solve this problem, we used a structural alignment of three members of the group with known crystallographic structures as a seed to obtain a reliable sequence alignment. Then, we applied protein maximum-parsimony and maximum-likelihood phylogenetic inference methods to this alignment. We found that gamma-CA belongs to a diverse superfamily of proteins that share the LbetaH domain. This superfamily is composed mainly of acyltransferases. The most remarkable feature of the phylogenetic tree obtained is that its main branches group together functionally related proteins, so that the coarse topology can be rather easily explained in terms of functional diversification. Regarding the main branch of the tree containing gamma-CA, we found that, in addition to the group of its closest relatives that had already been studied, gamma-CA is closely related to the tetrahydrodipicolinate N-succinyltransferases.  相似文献   

6.
Our recently developed off-lattice bead model capable of simulating protein structures with mixed alpha/beta content has been extended to model the folding of a ubiquitin-like protein and provides a means for examining the more complex kinetics involved in the folding of larger proteins. Using trajectories generated from constant-temperature Langevin dynamics simulations and sampling with the multiple multi-histogram method over five-order parameters, we are able to characterize the free energy landscape for folding and find evidence for folding through compact intermediates. Our model reproduces the observation that the C-terminus loop structure in ubiquitin is the last to fold in the folding process and most likely plays a spectator role in the folding kinetics. The possibility of a productive metastable intermediate along the folding pathway consisting of collapsed states with no secondary structure, and of intermediates or transition structures involving secondary structural elements occurring early in the sequence, is also supported by our model. The kinetics of folding remain multi-exponential below the folding temperature, with glass-like kinetics appearing at T/T(f) approximately 0.86. This new physicochemical model, designed to be predictive, helps validate the value of modeling protein folding at this level of detail for genomic-scale studies, and motivates further studies of other protein topologies and the impact of more complex energy functions, such as the addition of solvation forces.  相似文献   

7.
Imamura H  Chen JZ 《Proteins》2007,67(2):459-468
We present a minimal model for proteins, which is able to capture the structural conversion between the alpha-helix and beta-hairpin. In most regimes of the parameter space, the model produces a stable structure at a low temperature; in a few limited regimes of the parameter space, the model displays an beta-hairpin transition as the physical conditions vary. These variations include a perturbation on hydrogen bonding propensity at the middle of the modeled chain, or the change of the hydrophobicity of a designated pair along the chain. Using Monte Carlo simulations, we demonstrate the structural conversion by means of state diagrams, heat capacity maps, and free energy maps.  相似文献   

8.
A simple approach to estimate the number of alpha-helical and beta-strand segments from protein circular dichroism spectra is described. The alpha-helix and beta-sheet conformations in globular protein structures, assigned by DSSP and STRIDE algorithms, were divided into regular and distorted fractions by considering a certain number of terminal residues in a given alpha-helix or beta-strand segment to be distorted. The resulting secondary structure fractions for 29 reference proteins were used in the analyses of circular dichroism spectra by the SELCON method. From the performance indices of the analyses, we determined that, on an average, four residues per alpha-helix and two residues per beta-strand may be considered distorted in proteins. The number of alpha-helical and beta-strand segments and their average length in a given protein were estimated from the fraction of distorted alpha-helix and beta-strand conformations determined from the analysis of circular dichroism spectra. The statistical test for the reference protein set shows the high reliability of such a classification of protein secondary structure. The method was used to analyze the circular dichroism spectra of four additional proteins and the predicted structural characteristics agree with the crystal structure data.  相似文献   

9.
Sun L  Warncke K 《Proteins》2006,64(2):308-319
The structure of the EutB protein from Salmonella typhimurium, which contains the active site of the coenzyme B12 (adenosylcobalamin)-dependent enzyme, ethanolamine ammonia-lyase, has been predicted by using structural proteomics techniques of comparative modelling. The 453-residue EutB protein displays no significant sequence identity with proteins of known structure. Therefore, secondary structure prediction and fold recognition algorithms were used to identify templates. Multiple three-dimensional template matching (threading) servers identified predominantly beta8alpha8, TIM-barrel proteins, and in particular, the large subunits of diol dehydratase (PDB: 1eex:A, 1dio:A) and glycerol dehydratase (PDB: 1mmf:A), as templates. Consistent with this identification, the dehydratases are, like ethanolamine ammonia-lyase, Class II coenzyme B12-dependent enzymes. Model building was performed by using MODELLER. Models were evaluated by using different programs, including PROCHECK and VERIFY3D. The results identify a beta8alpha8, TIM-barrel fold for EutB. The beta8alpha8, TIM-barrel fold is consistent with a central role of the alpha/beta-barrel structures in radical catalysis conducted by the coenzyme B12- and S-adenosylmethionine-dependent (radical SAM) enzyme superfamilies. The EutB model and multiple sequence alignment among ethanolamine ammonia-lyase, diol dehydratase, and glycerol dehydratase from different species reveal the following protein structural features: (1) a "cap" loop segment that closes the N-terminal region of the barrel, (2) a common cobalamin cofactor binding topography at the C-terminal region of the barrel, and (3) a beta-barrel-internal guanidinium group from EutB R160 that overlaps the position of the active-site potassium ion found in the dehydratases. R160 is proposed to have a role in substrate binding and radical catalysis.  相似文献   

10.
Based on previous studies of interleukin-1beta (IL-1beta) and both acidic and basic fibroblast growth factors (FGFs), it has been suggested that the folding of beta-trefoil proteins is intrinsically slow and may occur via the formation of essential intermediates. Using optical and NMR-detected quenched-flow hydrogen/deuterium exchange methods, we have measured the folding kinetics of hisactophilin, another beta-trefoil protein that has < 10% sequence identity and unrelated function to IL-1beta and FGFs. We find that hisactophilin can fold rapidly and with apparently two-state kinetics, except under the most stabilizing conditions investigated where there is evidence for formation of a folding intermediate. The hisactophilin intermediate has significant structural similarities to the IL-1beta intermediate that has been observed experimentally and predicted theoretically using a simple, topology-based folding model; however, it appears to be different from the folding intermediate observed experimentally for acidic FGF. For hisactophilin and acidic FGF, intermediates are much less prominent during folding than for IL-1beta. Considering the structures of the different beta-trefoil proteins, it appears that differences in nonconserved loops and hydrophobic interactions may play an important role in differential stabilization of the intermediates for these proteins.  相似文献   

11.
The N-terminal 17 residues of ubiquitin have been shown by 1H NMR to fold autonomously into a beta-hairpin structure in aqueous solution. This structure has a specific, native-like register, though side-chain contacts differ in detail from those observed in the intact protein. An autonomously folding hairpin has previously been identified in the case of streptococcal protein G, which is structurally homologous with ubiquitin, but remarkably, the two are not in topologically equivalent positions in the fold. This suggests that the organization of folding may be quite different for proteins sharing similar tertiary structures. Two smaller peptides have also been studied, corresponding to the isolated arms of the N-terminal hairpin of ubiquitin, and significant differences from simple random coil predictions observed in the spectra of these subfragments, suggestive of significant limitation of the backbone conformational space sampled, presumably as a consequence of the strongly beta-structure favoring composition of the sequences. This illustrates the ability of local sequence elements to express a propensity for beta-structure even in the absence of actual sheet formation. Attempts were made to estimate the population of the folded state of the hairpin, in terms of a simple two-state folding model. Using published "random coil" values to model the unfolded state, and values derived from native ubiquitin for the putative unique, folded state, it was found that the apparent population varied widely for different residues and with different NMR parameters. Use of the spectra of the subfragment peptides to provide a more realistic model of the unfolded state led to better agreement in the estimates that could be obtained from chemical shift and coupling constant measurements, while making it clear that some other approaches to population estimation could not give meaningful results, because of the tendency to populate the beta-region of conformational space even in the absence of the hairpin structure.  相似文献   

12.
An important step in understanding how a protein folds is to determine those regions of the sequence that are critical to both its stability and its folding pathway. We chose phosphoribosyl anthranilate isomerase from Escherichia coli, which is a monomeric representative of the (beta alpha)8 barrel family of proteins, to construct a variant that carries an internal tandem duplication of the fifth beta alpha module. This (beta alpha)9 variant was enzymically active and therefore must have a wild-type (beta alpha)8 core. It had a choice a priori to fold to three different folding frames, which are distinguished by carrying the duplicated segment as an insert into one out of three different loops. Steady-state kinetic constants, the fluorescence properties of a crucial tryptophan residue, and limited proteolysis showed that the stable (beta alpha)9 variant carries the insertion between beta-strand 5 and alpha-helix 5. This preference can be explained by the important role of loops between alpha helices and beta strands in stabilizing the structure of the enzyme.  相似文献   

13.
Silva PJ 《Proteins》2008,70(4):1588-1594
Hydrophobic cluster analysis (HCA) has long been used as a tool to detect distant homologies between protein sequences, and to classify them into different folds. However, it relies on expert human intervention, and is sensitive to subjective interpretations of pattern similarities. In this study, we describe a novel algorithm to assess the similarity of hydrophobic amino acid distributions between two sequences. Our algorithm correctly identifies as misattributions several HCA-based proposals of structural similarity between unrelated proteins present in the literature. We have also used this method to identify the proper fold of a large variety of sequences, and to automatically select the most appropriate structure for homology modeling of several proteins with low sequence identity to any other member of the protein data bank. Automatic modeling of the target proteins based on these templates yielded structures with TM-scores (vs. experimental structures) above 0.60, even without further refinement. Besides enabling a reliable identification of the correct fold of an unknown sequence and the choice of suitable templates, our algorithm also shows that whereas most structural classes of proteins are very homogeneous in hydrophobic cluster composition, a tenth of the described families are compatible with a large variety of hydrophobic patterns. We have built a browsable database of every major representative hydrophobic cluster pattern present in each structural class of proteins, freely available at http://www2.ufp.pt/ pedros/HCA_db/index.htm.  相似文献   

14.
A central question in protein folding is the relative importance of locally encoded structure and cooperative interactions among residues distant in sequence. We have been exploring this question in a predominantly β-sheet protein, since β-structure formation clearly relies on both local and global sequence information. We present evidence that a 24-residue peptide corresponding to two linked hairpins of cellular retinoic acid-binding protein I (CRABP I) adopts significant native structure in aqueous solution. Prior work from our laboratory showed that the two turns contained in this fragment (turns III and IV) had the highest tendency of any of the eight turns in this anti-parallel β-barrel to fold into native turns. In addition, the primary sequence of these two turns is well conserved throughout the structural family to which CRABP I belongs, and residues in the turns and their associated hairpins participate in a network of conserved long-range interactions. We propose that the strong local-sequence biases within the chain segment comprising turns III and IV favor longer-range interactions that are crucial to the folding and native-state stability of CRABP I, and may play a similar role in related intracellular lipid-binding proteins (iLBPs).  相似文献   

15.
J Hargbo  A Elofsson 《Proteins》1999,36(1):68-76
There are many proteins that share the same fold but have no clear sequence similarity. To predict the structure of these proteins, so called "protein fold recognition methods" have been developed. During the last few years, improvements of protein fold recognition methods have been achieved through the use of predicted secondary structures (Rice and Eisenberg, J Mol Biol 1997;267:1026-1038), as well as by using multiple sequence alignments in the form of hidden Markov models (HMM) (Karplus et al., Proteins Suppl 1997;1:134-139). To test the performance of different fold recognition methods, we have developed a rigorous benchmark where representatives for all proteins of known structure are matched against each other. Using this benchmark, we have compared the performance of automatically-created hidden Markov models with standard-sequence-search methods. Further, we combine the use of predicted secondary structures and multiple sequence alignments into a combined method that performs better than methods that do not use this combination of information. Using only single sequences, the correct fold of a protein was detected for 10% of the test cases in our benchmark. Including multiple sequence information increased this number to 16%, and when predicted secondary structure information was included as well, the fold was correctly identified in 20% of the cases. Moreover, if the correct secondary structure was used, 27% of the proteins could be correctly matched to a fold. For comparison, blast2, fasta, and ssearch identifies the fold correctly in 13-17% of the cases. Thus, standard pairwise sequence search methods perform almost as well as hidden Markov models in our benchmark. This is probably because the automatically-created multiple sequence alignments used in this study do not contain enough diversity and because the current generation of hidden Markov models do not perform very well when built from a few sequences.  相似文献   

16.
Delineating structures of the transition states in protein folding reactions has provided great insight into the mechanisms by which proteins fold. The most common method for obtaining this information is Φ-value analysis, which is carried out by measuring the changes in the folding and unfolding rates caused by single amino acid substitutions at various positions within a given protein. Canonical Φ-values range between 0 and 1, and residues displaying high values within this range are interpreted to be important in stabilizing the transition state structure, and to elicit this stabilization through native-like interactions. Although very successful in defining the general features of transition state structures, Φ-value analysis can be confounded when non-native interactions stabilize this state. In addition, direct information on backbone conformation within the transition state is not provided. In the work described here, we have investigated structure formation at a conserved β-bulge (with helical conformation) in the Fyn SH3 domain by characterizing the effects of substituting all natural amino acids at one position within this structural motif. By comparing the effects on folding rates of these substitutions with database-derived local structure propensity values, we have determined that this position adopts a non-native backbone conformation in the folding transition state. This result is surprising because this position displays a high and canonical Φ-value of 0.7. This work emphasizes the potential role of non-native conformations in folding pathways and demonstrates that even positions displaying high and canonical Φ-values may, nevertheless, adopt a non-native conformation in the transition state.  相似文献   

17.
Vibrational circular dichroism (VCD) spectra for the glycoproteins alpha1-acid glycoprotein (AGP) and bovine submaxillary mucin (BSM), have been measured in D2O solutions and for the films prepared from aqueous (H2O) buffer solutions in the 1800 to 900 cm(-1) region. The solution VCD results revealed that AGP has beta-sheet structure, along with a significant amount of alpha-helix as evidenced from a W pattern in the amide I region. The VCD of BSM solution suggested a polyproline II type structure, characterized by the appearance of strong negative couplet in the amide I region. The film VCD results on AGP and BSM suggested that the secondary structures of polypeptide fold in the film state are similar to those in the solution. The absence of any significant film VCD in the low frequency region (1200-900 cm(-1)), suggested that the dominant linkage for carbohydrate residues is likely to be a beta linkage. VCD spectroscopy gains importance in the secondary structural analysis of polypeptide fold in glycoproteins due to the absence of interfering VCD from the carbohydrate residues in the conformationally sensitive amide I region. Also, film VCD studies permit measurements in the low wavenumber region (1200-900 cm(-1)) that reveal the dominant type of linkage for carbohydrate residues. Such clear structural information is unlike that from ECD, where ECD bands of acylated amino sugar residues interfere with those of polypeptide backbone in the conformationally sensitive far-UV region.  相似文献   

18.
Several de novo designed ionic peptides that are able to undergo conformational change under the influence of temperature and pH were studied. These peptides have two distinct surfaces with regular repeats of alternating hydrophilic and hydrophobic side chains. This permits extensive ionic and hydrophobic interactions resulting in the formation of stable beta-sheet assemblies. The other defining characteristic of this type of peptide is a cluster of negatively charged aspartic or glutamic acid residues located toward the N-terminus and positively charged arginine or lysine residues located toward the C-terminus. This arrangement of charge balances the alpha-helical dipole moment (C --> N), resulting in a strong tendency to form stable alpha-helices as well. Therefore, these peptides can form both stable alpha-helices and beta-sheets. They are also able to undergo abrupt structural transformations between these structures induced by temperature and pH changes. The amino acid sequence of these peptides permits both stable beta-sheet and alpha-helix formation, resulting in a balance between these two forms as governed by the environment. Some segments in proteins may also undergo conformational changes in response to environmental changes. Analyzing the plasticity and dynamics of this type of peptide may provide insight into amyloid formation. Since these peptides have dynamic secondary structure, they will serve to refine our general understanding of protein structure.  相似文献   

19.
Wang Y  Xue Z  Xu J 《Proteins》2006,65(1):49-54
We have developed a novel method named AlphaTurn to predict alpha-turns in proteins based on the support vector machine (SVM). The prediction was done on a data set of 469 nonhomologous proteins containing 967 alpha-turns. A great improvement in prediction performance was achieved by using multiple sequence alignment generated by PSI-BLAST as input instead of the single amino acid sequence. The introduction of secondary structure information predicted by PSIPRED also improved the prediction performance. Moreover, we handled the very uneven data set by combining the cost factor j with the "state-shifting" rule. This further promoted the prediction quality of our method. The final SVM model yielded a Matthews correlation coefficient (MCC) of 0.25 by a 10-fold cross-validation. To our knowledge, this MCC value is the highest obtained so far for predicting alpha-turns. An online Web server based on this method has been developed and can be freely accessed at http://bmc.hust.edu.cn/bioinformatics/ or http://210.42.106.80/.  相似文献   

20.
Huang Y  Xiao Y 《Proteins》2007,68(1):267-272
Protein folds may evolve from short peptide ancestors via gene duplication and fusion. For proteins with internal structural symmetry, this means that their sequences should be made up of identical repeats. However, many of these repeat signals can only be seen at the structural level yet. Motivated by the fact that proteins may have similar structures if their sequences have more than 25% identical amino acids, we suggest a method to detect the sequence repeats of proteins directly from their sequences. Using this method, we show that the internal repetitions of the immunoglobulin folds could be identified directly at the sequence level.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号