首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The conservation of residues in columns of a multiple sequence alignment (MSA) reflects the importance of these residues for maintaining the structure and function of a protein. To date, many scores have been suggested for quantifying residue conservation, but none has achieved the full rigor both in biology and statistics. In this paper, we present a new approach for measuring the evolutionary conservation at aligned positions. Our conservation measure is related to the logarithmic probabilities for aligned positions, and combines the physicochemical properties and the frequencies of amino acids. Such a measure is both biologically and statistically meaningful. For testing the relationship between an amino acid's evolutionary conservation and its role in the Phi-value defined protein folding kinetics, our results indicate that the folding nucleus residues may not be significantly more conserved than other residues by using the biological-relevance weighted statistical scoring method suggested in this paper as an alternative to entropy-based procedures.  相似文献   

2.
In the family of acyl-coenzyme A binding proteins, a subset of 26 sequence sites are identical in all eukaryotes and conserved throughout evolution of the eukaryotic kingdoms. In the context of the bovine protein, the importance of these 26 sequence positions for structure, function, stability, and folding has been analyzed using single-site mutations. A total of 28 mutant proteins were analyzed which covered 17 conserved sequence positions and three nonconserved positions. As a first step, the influence of the mutations on the protein folding reaction has been probed, revealing a folding nucleus of eight hydrophobic residues formed between the N- and C-terminal helices [Kragelund, B. B., et al. (1999) Nat. Struct. Biol. (In press)]. To fully analyze the role of the conserved residues, the function and the stability have been measured for the same set of mutant proteins. Effects on function were measured by the extent of binding of the ligand dodecanoyl-CoA using isothermal titration calorimetry, and effects on protein stability were measured with chemical denaturation followed by intrinsic tryptophan and tyrosine fluorescence. The sequence sites that have been conserved for direct functional purposes have been identified. These are Phe5, Tyr28, Tyr31, Lys32, Lys54, and Tyr73. Binding site residues are mainly polar or charged residues, and together, four of these contribute approximately 8 kcal mol-1 of the total free energy of binding of 11 kcal mol-1. The sequence sites conserved for stability of the structure have likewise been identified and are Phe5, Ala9, Val12, Leu15, Leu25, Tyr28, Lys32, Gln33, Tyr73, Val77, and Leu80. Essentially, all of the conserved residues that maintain the stability are hydrophobic residues at the interface of the helices. Only one conserved polar residue, Gln33, is involved in stability. The results indicate that conservation of residues in homologous proteins may result from a summed optimization of an effective folding reaction, a stable native protein, and a fully active binding site. This is important in protein design strategies, where optimization of one of these parameters, typically function or stability, may influence any of the others markedly.  相似文献   

3.
To what extent does natural selection act to optimize the details of protein folding kinetics? In an effort to address this question, the relationship between an amino acid's evolutionary conservation and its role in protein folding kinetics has been investigated intensively. Despite this effort, no consensus has been reached regarding the degree to which residues involved in native-like transition state structure (the folding nucleus) are conserved. Here we report the results of an exhaustive, systematic study of sequence conservation among residues known to participate in the experimentally (Phi-value) defined folding nuclei of all of the appropriately characterized proteins reported to date. We observe no significant evidence that these residues exhibit any anomalous sequence conservation. We do observe, however, a significant bias in the existing kinetic data: the mean sequence conservation of the residues that have been the subject of kinetic characterization is greater than the mean sequence conservation of all residues in 13 of 14 proteins studied. This systematic experimental bias gives rise to the previous observation that the median conservation of residues reported to participate in the folding nucleus is greater than the median conservation of all of the residues in a protein. When this bias is corrected (by comparing, for example, the conservation of residues known to participate in the folding nucleus with that of other, kinetically characterized residues) the previously reported preferential conservation is effectively eliminated. In contrast to well-established theoretical expectations, both poorly and highly conserved residues are apparently equally likely to participate in the protein-folding nucleus.  相似文献   

4.
When an amino-acid sequence cannot be optimized for both folding and function, folding can get compromised in favor of function. To understand this tradeoff better, we devise a novel method for extracting the “function-less” folding-motif of a protein fold from a set of structurally similar but functionally diverse proteins. We then obtain the β-trefoil folding-motif, and study its folding using structure-based models and molecular dynamics simulations. CompariA protein sequence serves two purpson with the folding of wild-type β-trefoil proteins shows that function affects folding in two ways: In the slower folding interleukin-1β, binding sites make the fold more complex, increase contact order and slow folding. In the faster folding hisactophilin, residues which could have been part of the folding-motif are used for function. This reduces the density of native contacts in functional regions and increases folding rate. The folding-motif helps identify subtle structural deviations which perturb folding. These may then be used for functional annotation. Further, the folding-motif could potentially be used as a first step in the sequence design of function-less scaffold proteins. Desired function can then be engineered into these scaffolds.  相似文献   

5.
6.
Protein is the working molecule of the cell, and evolution is the hallmark of life. It is important to understand how protein folding and evolution influence each other. Several studies correlating experimental measurement of residue participation in folding nucleus and sequence conservation have reached different conclusions. These studies are based on assessment of sequence conservation at folding nucleus sites using entropy or relative entropy measurement derived from multiple sequence alignment. Here we report analysis of conservation of folding nucleus using an evolutionary model alternative to entropy-based approaches. We employ a continuous time Markov model of codon substitution to distinguish mutation fixed by evolution and mutation fixed by chance. This model takes into account bias in codon frequency, bias-favoring transition over transversion, as well as explicit phylogenetic information. We measure selection pressure using the ratio omega of synonymous versus non-synonymous substitution at individual residue site. The omega-values are estimated using the PAML method, a maximum-likelihood estimator. Our results show that there is little correlation between the extent of kinetic participation in protein folding nucleus as measured by experimental phi-value and selection pressure as measured by omega-value. In addition, two randomization tests failed to show that folding nucleus residues are significantly more conserved than the whole protein, or the median omega value of all residues in the protein. These results suggest that at the level of codon substitution, there is no indication that folding nucleus residues are significantly more conserved than other residues. We further reconstruct candidate ancestral residues of the folding nucleus and suggest possible test tube mutation studies for testing folding behavior of ancient folding nucleus.  相似文献   

7.
Scott KA  Daggett V 《Biochemistry》2007,46(6):1545-1556
The problem of how a protein folds from a linear chain of amino acids to the three-dimensional structure necessary for function is often investigated using proteins with a low degree of sequence identity that adopt different folds. The design of pairs of proteins with a high degree of sequence identity but different folds offers the opportunity for a complementary study; in two highly similar sequences, which residues are the most important in directing folding to a particular structure? Here we use molecular dynamics simulations to characterize the folding-unfolding pathways of a pair of proteins designed by Bryan and co-workers [Alexander, P. A., et al. (2005) Biochemistry 44, 14045-14054; He, Y. N., et al. (2005) Biochemistry 44, 14055-14061]. Despite being 59% identical, the two protein sequences fold to two different structures. The first sequence folds to the alpha+beta protein G structure and the second to the all-alpha-helical protein A structure. We show that the final protein structure is determined early along the folding pathway. In folding to the protein G structure, the single alpha-helix (alpha1) and the beta3-beta4 turn fold early. Formation of the hairpin turn essentially prevents folding to helical structure in this region of the protein. This early structure is then consolidated by formation of long-range hydrophobic interactions between alpha1 and the beta3-beta4 turn. The protein A sequence differs both in the residues that form the beta3-beta4 turn and also in many of the residues that form the early hydrophobic interactions in the protein G structure. Instead, in the protein A sequence, a more hierarchical mechanism is observed, with helices folding before many of the tertiary interactions are formed. We find that small, but critical, sequence differences determine the topology of the protein early along the folding pathway, which help to explain the process by which one fold can evolve into another.  相似文献   

8.
Detailed analysis of the CuZn superoxide dismutase (SOD) structure provides new results concerning the significance and molecular basis for sequence conservation, intron-exon boundary locations, gene duplication, and Greek key beta-barrel evolution. Using 15 aligned sequences, including a new mouse sequence, specific roles have been assigned to all 23 invariant residues and additional residues exhibiting functional equivalence. Sequence invariance is dominated by 15 residues that form the active site stereochemistry, supporting a primary biological function of superoxide dismutation. The beta-strands have no sequence insertions and deletions, whereas insertions occur within the loops connecting the beta-strands and at both termini. Thus, the beta-barrel with only four invariant residues is apparently over-determined, but dependent on multiple cooperative side chain interactions. The regions encoded by exon I, a proposed nucleation site for protein folding, and exon III, the Zn loop involved in stability and catalysis, are the major structural subdomains not included in the internal twofold axis of symmetry passing near the catalytic Cu ion. This provides strong confirmatory evidence for gene evolution by duplication and fusion followed by the addition of these two exons. The proposed evolutionary pathway explains the structural versatility of the Greek key beta-barrel through functional specialization and subdomain insertions in new loop connections, and provides a rationale for the size of the present day enzyme.  相似文献   

9.
Protein engineering studies suggest that the transition state for the folding of ubiquitin is highly polarised towards the N-terminal part of the sequence and involves a nucleus of residues within the beta-hairpin (residues 1-17) and main alpha-helix (residues 23-34). In contrast, the observation of small phi-values for residues in the C-terminal portion of the sequence (residues 35-76), coupled with a folding topology that results in a much higher contact order, suggests that fast folding of ubiquitin is dependent upon configurational flexibility in the C-terminal part of the polypeptide chain to ensure passage down a relatively smooth folding funnel to the native state. We show that the introduction of a small mini-hairpin motif as an extension of the native 43-50 hairpin stabilises local interactions in the C-terminal part of the sequence, resulting largely in a deceleration of the unfolding kinetics without perturbing the apparent two-state folding mechanism. However, a single-point Leu-->Phe substitution within the engineered hairpin sequence leads to the premature collapse of the denatured ensemble through the stabilisation of non-native interactions and the population of a compact intermediate. Non-linear effects in the kinetic data at low concentrations of denaturant suggest that the collapsed state, which is further stabilised in the presence of cosmotropic salts, may subsequently fold directly to the native state through a "triangular" reaction scheme involving internal rearrangement rather than unfolding and refolding.  相似文献   

10.
Cooperative unfolding penalties are calculated by statistically evaluating an ensemble of denatured states derived from native structures. The ensemble of denatured states is determined by dividing the native protein into short contiguous segments and defining all possible combinations of native, i.e., interacting, and non-native, i.e., non-interacting, segments. We use a novel knowledge-based scoring function, derived from a set of non-homologous proteins in the Protein Data Bank, to describe the interactions among residues. This procedure is used for the structural identification of cooperative folding cores for four globular proteins: bovine pancreatic trypsin inhibitor, horse heart cytochrome c, French bean plastocyanin, and staphylococcal nuclease. The theoretical folding units are shown to correspond to regions that exhibit enhanced stability against denaturation as determined from experimental hydrogen exchange protection factors. Using a sequence similarity score for related sequences, we show that, in addition to residues necessary for enzymatic function, those amino acids comprising structurally important folding cores are also preferentially conserved during evolution. This implies that the identified folding cores may be part of an array of fundamental structural folding units.  相似文献   

11.
Some asparagine and glutamine residues in proteins undergo deamidation to aspartate and glutamate with rates that depend upon the sequence and higher-order structure of the protein. Functional groups within the protein can catalyze this reaction, acting as general acids, bases, or stabilizers of the transition state. Information from specific proteins that deamidate and analysis of protein sequence and structure data bases suggest that asparagine and glutamine lability has been a selective pressure in the evolution of protein sequence and folding. Asparagine and glutamine deamidation can affect protein structure and function in natural and engineered mutant sequences, and may play a role in the regulation of protein folding, protein breakdown, and aging.  相似文献   

12.
The acyl-coenzyme A-binding proteins (ACBPs) contain 26 highly conserved sequence positions. The majority of these have been mutated in the bovine protein, and their influence on the rate of two-state folding and unfolding has been measured. The results identify eight sequence positions, out of 24 probed, that are critical for fast productive folding. The residues are all hydrophobic and located in the interface between the N- and C-terminal helices. The results suggest that one specific site dominated by conserved hydrophobic residues forms the structure of the productive rate-determining folding step and that a sequential framework model can describe the protein folding reaction.  相似文献   

13.
Cyanogen bromide (CNBr) cleavage of total rat liver histone H1 generates a C-terminal peptide which originates from a methionine-containing subfraction. This subfraction comprises approx. 20% of the whole rat liver H1 population, resembles calf thymus CTL-1 in size but contains methionine and histidine, higher proportions of serine and less alanine and proline. Edman degradation established the N-terminal sequence of the CNBr peptide as Arg-Arg-Lys-Ala-Ser-Gly-Pro-Pro-Val-Glu. By alignment with calf thymus CTL-1, methionine was identified as residue 30 replacing alanine in a non-conservative replacement. Residue 40 is deleted but sequence homology near the double proline sequence in the G-domain is retained. The CNBr peptide is estimated at 177-181 residues and comprises the complete G- and C-domain and two arginines from the basic cluster in the N-domain. Removal from H1 of all but two residues of the N-domain does not abolish secondary and tertiary folding. This GC-peptide opens new approaches to the study of the function of H1 in chromatin.  相似文献   

14.
15.
Patterns of hydrophobic and hydrophilic residues play a major role in protein folding and function. Long, predominantly hydrophobic strings of 20-22 amino acids each are associated with transmembrane helices and have been used to identify such sequences. Much less attention has been paid to hydrophobic sequences within globular proteins. In prior work on computer simulations of the competition between on-pathway folding and off-pathway aggregate formation, we found that long sequences of consecutive hydrophobic residues promoted aggregation within the model, even controlling for overall hydrophobic content. We report here on an analysis of the frequencies of different lengths of contiguous blocks of hydrophobic residues in a database of amino acid sequences of proteins of known structure. Sequences of three or more consecutive hydrophobic residues are found to be significantly less common in actual globular proteins than would be predicted if residues were selected independently. The result may reflect selection against long blocks of hydrophobic residues within globular proteins relative to what would be expected if residue hydrophobicities were independent of those of nearby residues in the sequence.  相似文献   

16.
The profile method, for detecting distantly related proteins by sequence comparison, has been extended to incorporate secondary structure information from known X-ray structures. The sequence of a known structure is aligned to sequences of other members of a given folding class. From the known structure, the secondary structure (alpha-helix, beta-strand or "other") is assigned to each position of the aligned sequences. As in the standard profile method, a position-dependent scoring table, termed a profile, is calculated from the aligned sequences. However, rather than using the standard Dayhoff mutation table in calculating the profile, we use distinct amino acid mutation tables for residues in alpha-helices, beta-strands or other secondary structures to calculate the profile. In addition, we also distinguish between internal and external residues. With this new secondary structure-based profile method, we created a profile for eight-stranded, antiparallel beta barrels of the insecticyanin folding class. It is based on the sequences of retinol-binding protein, insecticyanin and beta-lactoglobulin. Scanning the sequence database with this profile, it was possible to detect the sequence of avidin. The structure of streptavidin is known, and it appears to be distantly related to the antiparallel beta barrels. Also detected is the sequence of complement component C8, which we therefore predict to be a member of this folding class.  相似文献   

17.
To understand the folding behavior of proteins is an important and challenging problem in modern molecular biology. In the present investigation, a large number of features representing protein sequences were developed based on sequence autocorrelation weighted by properties of amino acid residues. Genetic algorithm (GA) combined with multiple linear regression (MLR) was employed to select significant features related to protein folding rates, and to build global predictive model. Moreover, local lazy regression (LLR) method was also used to predict the protein folding rates. The obtained results indicated that LLR performed much better than the global MLR model. The important properties of amino acid residues affecting protein folding rates were also analyzed. The results of this study will be helpful to understand the mechanism of protein folding. Our results also demonstrate that the features of amino acid sequence autocorrelation is effective in representing the relationship between protein sequence and folding rates, and the local method is a powerful tool to predict the protein folding rates.  相似文献   

18.
19.
20.
The importance of chain connectivity in determining protein function and stability can be examined by breaking the peptide backbone using a technique such as circular permutation. Cleavage at certain positions results in a complete loss of the ability of the protein to fold. When such cleavage sites occur sequentially in the primary structure, we call the region a 'folding element', a new concept that could assist in our understanding of the protein folding problem. The folding elements of dihydrofolate reductase have been assigned by conducting a systematic circular permutation analysis in which the peptide backbone was sequentially broken between every pair of residues in the protein. The positions of folding elements do not appear to correspond to secondary structure motifs, substrate or coenzyme binding sites, or accessible surface area. However, almost all of the amino acid residues known to be involved in early folding events are located within the folding elements.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号