首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 125 毫秒
1.
We have performed a statistical analysis of unstructured amino acid residues in protein structures available in the databank of protein structures. Data on the occurrence of disordered regions at the ends and in the middle part of protein chains have been obtained: in the regions near the ends (at distance less than 30 residues from the N- or C-terminus), there are 66% of unstructured residues (38% are near the N-terminus and 28% are near the C-terminus), although these terminal regions include only 23% of the amino acid residues. The frequencies of occurrence of unstructured residues have been calculated for each of 20 types in different positions in the protein chain. It has been shown that relative frequencies of occurrence of unstructured residues of 20 types at the termini of protein chains differ from the ones in the middle part of the protein chain; amino acid residues of the same type have different probabilities to be unstructured in the terminal regions and in the middle part of the protein chain. The obtained frequencies of occurrence of unstructured residues in the middle part of the protein chain have been used as a scale for predicting disordered regions from amino acid sequence using the method (FoldUnfold) previously developed by us. This scale of frequencies of occurrence of unstructured residues correlates with the contact scale (previously developed by us and used for the same purpose) at a level of 95%. Testing the new scale on a database of 427 unstructured proteins and 559 completely structured proteins has shown that this scale can be successfully used for the prediction of disordered regions in protein chains.  相似文献   

2.
There is indirect evidence that the amino acid composition of proteins depends on their dimension. The amino acid composition of a nonredundant set of about 550,000 proteins was determined and it was observed that, in the range of 50-200 residues, the percentage of occurrence of most of the residue types significantly depends on protein dimension. This result should prove useful in analyzing protein sequences and genomics.  相似文献   

3.
It is a well known phenomenon that the occurrence of several distinct amino acids at the C-terminus of proteins is non-random. We have analysed all Saccharomyces cerevisiae proteins predicted by computer databases and found lysine to be the most frequent residue both at the last (-1) and at the penultimate amino acid (-2) positions. To test the hypothesis that C-terminal basic residues efficiently bind to phospholipids we randomly expressed GST-fusion proteins from a yeast genomic library. Fifty-four different peptide fragments were found to bind phospholipids and 40% of them contained lysine/arginine residues at the (-1) or (-2) positions. One peptide showed high sequence similarity with the yeast protein Sip18p. Mutational analysis revealed that both C-terminal lysine residues of Sip18p are essential for phospholipid-binding in vitro. We assume that basic amino acid residues at the (-1) and (-2) positions in C-termini are suitable to attach the C-terminus of a given protein to membrane components such as phospholipids, thereby stabilizing the spatial structure of the protein or contributing to its subcellular localization. This mechanism could be an additional explanation for the C-terminal amino acid bias observed in proteins of several species.  相似文献   

4.
Approaching a complete classification of protein secondary structure   总被引:2,自引:0,他引:2  
A complete classification of types of the protein secondary structure is developed on the basis of computer analysis of the crystallographic structural data deposited in the protein Data Bank. The majority of amino acid residues fall into five conformation types. A conclusion is drawn that the number of sequence variants of torsion angles phi, psi in globular proteins is limited and is essentially less than the number of possible amino acid sequences for this chain length. Along with alpha-helix and beta-structure, the distribution analysis assigning every maximum of distribution of amino acid conformations on Ramachandran map to a certain type of the secondary structure exposed a third type of the secondary structure that was previously neglected. This type of the structure is extended left-handed helical conformation, designated as mobile (M-) conformation. A full set of M-conformation fragments that seems to play a major role in protein globule dynamics has been obtained, a small radius of correlation for the polypeptide chain in M-conformation is demonstrated. It explains a prevalence of short segments of mobile conformation revealed in globular proteins. For secondary structure types, the frequency of occurrence of amino acid residues has been computed.  相似文献   

5.
We have created a database of two-domain proteins with homology less than 25% (452 proteins). Based on one half of this set of proteins statistics of appearance of amino acid residues on the domain boundaries of multiple domain proteins has been obtained. Small and hydrophilic amino acids (proline, glycine, asparagine, glutamic acid, arginine and others) appear on the domain boundaries more often than in the whole protein. Opposite, hydrophobic amino acid residues (tryptophane, methionine, phenylalanine and others) appear on the domain boundaries more rarely. The obtained scales of the appearance of amino acid residues on the boundary regions from the statistics have been used for calculation of domain boundaries in the proteins of the second half of the database. The probability scale obtained by averaging the appearance of amino acid residues on the domain boundary region including 8 residues (+/-4 residues from the real domain boundary) gives the best result: for 57% of proteins the predicted boundary was closer than 40 residues to the boundary assigned from three-dimensional structures, for 41% it was closer than 20 residues from the real boundary. The probability scale was used to predict domain boundaries for proteins with unknown three-dimensional structure (international competition CASP6).  相似文献   

6.
It has been shown that malignant activation of ras proto-oncogenes was mediated by point mutations which resulted in the single amino acid conversions at positions 12, 13 or 61 of the ras gene products (p21 proteins). By analyzing randomly mutated ras genes, it has been demonstrated that amino acid substitutions at residues 12, 13, 59 and 63 activated p21. Furthermore, it has been shown that residues 16, 116 and 119 in p21 played critical roles in the guanine nucleotide binding and, consequently, the ability of the protein to induce changes characteristic of cellular transformation. By using the protein conformational prediction method of Chou and Fasman, the present work predicts that these critical amino acids, except glutamic acid at position 63, are located within beta-turns. The major "hot spots" for ras activation are codons 12 and 61. The author has predicted in an earlier paper that the single amino acid conversions at positions 12 and 61 would occur at beta-turn conformation consisting of residues 10-13 and 58-61, respectively. In the present study, probabilities of beta-turn occurrence at residues 10-13 or 58-61 of the p21 proteins encoded by various ras genes are compared. The probability for the normal p21 containing glycine as residue 12 is greatest, and the cancer-associated variants show less probabilities. The single amino acid substitutions at position 61 do not cause so decreased probabilities of beta-turn potential at residues 58-61, except the replacement by histidine. Histidine at position 61 is not predicted as occurring within a beta-turn.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

7.
A database of 452 two-domain proteins with less than 25% homology was constructed. One half of the database was used to obtain statistics on the appearance of amino acid residues at domain boundaries. Small and hydrophilic residues (proline, glycine, asparagine, glutamic acid, arginine, etc.) occurred more often at domain boundaries than in total proteins. Hydrophobic residues (tryptophan, methionine, phenylalanine, etc.) were rarer at domain boundaries than in total proteins. Probability scales of amino acid appearance in boundary-flanking regions were constructed with these statistics and used to predict the domain boundaries in proteins of the other half of the database. The probability scale obtained by averaging the appearance of amino acids over an 8-residue region (±4 residues from the real domain boundaries) yielded the best results: domain boundaries were predicted within 40 residues of the real boundary in 57% of proteins and within 20 residues of the real boundary in 41% of proteins. The probability scale was used to predict the domain boundaries in proteins with unknown structures (CASP6).  相似文献   

8.
Discriminating outer membrane (OM) proteins from globular proteins is an important task. The structural analysis of β-strands dominating globular (all-β) proteins and OM proteins provides useful insight to distinguish between them. In this work, we analyze the characteristic features of the 20 amino acid residues in all-β and OM proteins. We set up numerical indices for several properties of amino acid residues, such as, conformational parameters, surrounding hydrophobicity, accessible surface area and reduction in accessibility, and inter-residue contacts. We found that all the aromatic residues prefer to be in β-strands of both globular and OM proteins. The surrounding hydrophobicity of aromatic and non-polar amino acid residues in globular proteins is significantly higher than that of OM proteins. The residues Trp, Arg, Phe and Gln show a remarkable difference of reduction in accessibility between all-β globular (βG) and OM proteins. The positively charged residues, Lys and Arg in the membrane part of OM proteins have more number of contacts than globular proteins. Further, the behavior of the 20 amino acid residues in β-strand segments of globular and OM proteins have been discussed. The parameters developed in this work can be used for identifying transmembrane β-strands in OM proteins and for discriminating βG proteins from OM proteins.  相似文献   

9.
A measure of similarity between amino acid residues based on the analysis of the surroundings of each residue in primary structures of native proteins is proposed. The statistical data used for this purpose were obtained from the analysis of 168,808 protein sequences, which comprise the Protein Identification Research database (release 63). Using various threshold values of the proposed measure, amino acid residues were classified into several groups. The classification elaborated differs essentially from groupings previously used. The numerical measure of amino acid residues similarity can be used in site-directed mutagenesis studies for the prediction of probability of local spatial rearrangements in proteins.  相似文献   

10.
Isogai Y 《Biochemistry》2006,45(8):2488-2492
Hydrophobic core mutants of sperm whale apomyoglobin were constructed to investigate the amino acid sequence features that determine the folding properties. Replacements of all of the Ile residues with Leu and of all of the Ile and Val residues with Leu decreased the thermodynamic stability of the folded states against the unfolded states but increased the stability of the folding intermediates against the unfolded states, indicating that the amino acid composition of the protein core is important for the protein stability and folding cooperativity. To examine the effect of the arrangement of these hydrophobic residues, mutant proteins were further constructed: 12 sites out of the 18 Leu, 9 Ile, and 8 Val residues of the wild-type myoglobin were randomly replaced with each other so that the amino acid compositions were similar to that of the wild-type protein. Four mutant proteins were obtained without selection of the protein properties. These residue replacements similarly resulted in the stabilization of both the intermediate and folded states against the unfolded states, as compared to the wild-type protein. Thus, the arrangements of the hydrophobic residues in the native amino acid sequence are selected to destabilize the folding intermediate rather than to stabilize the folded state. The present results suggest that the two-state transition of protein folding or the transient formation of the unstable intermediate, which seems to be required for effective production of the functional proteins, has been a major driving force in the molecular evolution of natural globular proteins.  相似文献   

11.
The hydration isotherms of alpha-chymotrypsin, lysozyme, pork insulin, pork pepsin and serum albumin were obtained by means of dynamic method. The values of BET-monolayers for processes of water sorption leads to (h) and desorption comes from (h) do not depend on the static or dynamic way of achieving of hydration equilibrium in spite of difference in the shape of isotherms. The values of comes from h for proteins with known tertiary structure (alpha-chymotrypsin, lysozyme and insulin) coinside with the number of exposed polar amino acid side chains. The lowering of leads to h values in comparison with comes from h is correlated with inability of omega-amido groups of Asn and Gln residues and of ion pair-forming residues to take part in the formation of sorptive BET-monolayer. These rules for the interpretation of hydration isotherms were used to evaluate the numbers of exposed and buried polar side chains in proteins with unknown tertiary structure--pepsin and serum albumin.  相似文献   

12.
An extensive search for internal regularities in amino acid sequences has been made, using both the genetic code and the relative frequencies of amino acid alternatives in homologous proteins. The two methods give very similar results and strongly suggest the occurrence of significant linear and inverted repetitions (similar sequences of opposite polarity) in several proteins. A hypothesis is developed to explain the occurrence of such internal regularities in proteins. This hypothesis is based on a process of duplication of an ancestral loop in which a symmetrical arrangement of amino acid allows stabilization by interaction between the amino acid side chains.  相似文献   

13.
The occurrence and relative positions of cysteine residues were investigated in proteins of various species. Considering random mathematical occurrence for an amino acid coded by two codons (3. 28%), cysteine is underrepresented in all organisms investigated. Representation of cysteine appears to correlate positively with the complexity of the organism, ranging between 2.26% in mammals and 0. 5% in some members of the Archeabacteria order. This observation, together with the results obtained from comparison of cysteine content of various ribosomal proteins, indicates that evolution takes advantage of increased use of cysteine residues. In all organisms studied except plants, two cysteines are frequently found two amino acid residues apart (C-(X)(2)-C motif). Such a motif is known to be present in a variety of metal-binding proteins and oxidoreductases. Remarkably, more than 21% of all of cysteines were found within the C-(X)(2)-C motifs in ARCHEA.: This observation may indicate that cysteine appeared in ancient metal-binding proteins first and was introduced into other proteins later.  相似文献   

14.
The amino acid composition and sequence in primary structure of 180 proteins have been studied. It is shown that the distribution of amino acid residues is near to a random one, i.e. it is determined by the amino acid composition. The ratio between statistical and unique character of protein primary structures has been discussed. The amino acid sequence is suggested to be unique in fibrous proteins. In contrast the amino acid sequence in globular proteins is a statistical one. The statistical character of amino acids distribution in globular proteins explains the possibility of sensible text generation under the frame shift mutations, deletions and insertions.  相似文献   

15.
The dynamic differential equation model developed and tested for bovine pancreatic trypsin inhibitor and tuna ferrocytochrome c in Ponnuswamy, P.K. & Bhaskaran, R. (Int. J. Peptide Protein Res. 24, 168-179, 1984) is extended for 17 more protein crystals in this work. Average displacements are computed for 20 amino acid residues observed in 19 proteins. Detailed information on the dynamic behaviour of the individual proteins and individual residues is presented. The effect of atomic packing on the fluctuations of the amino acid residues in alpha-chymotrypsin is illustrated. A number of general points on the dynamic characteristics of globular protein molecules are presented.  相似文献   

16.
The complete amino acid sequence of a structural protein isolated from pharate cuticle of the locust Locusta migratoria was determined. The protein has an unusual amino acid composition: 42% of the residues are alanine and only 14 of the 20 common amino acid residues are present. The primary structure consists of regions enriched in particular amino acid residues. The N-terminal region and a region close to the C-terminus are enriched in glycine. The rest of the protein is dominated by alanine, except for two short regions enriched in hydrophilic residues. Almost all the proline residues are situated in the alanine-rich regions in a conserved sequence 'A-A-P-A/V'. An internal duplication has taken place covering most of the protein except for the glycine-rich regions. Owing to the unusual features of the protein a combination of automated Edman degradations and plasma-desorption m.s. was used to determine the complete sequence. The protein does not show sequence homology to other proteins, but proteins divided into regions enriched in the same kind of amino acid residues have been isolated from other insect structures.  相似文献   

17.
The orientation of many membrane proteins is determined by the asymmetric distribution of positively charged amino acid residues in cytoplasmic and translocated loops. The positive-inside rule states that loops with large amounts of these residues tend to have cytoplasmic locations. Orientations of constructs derived from the inner membrane protein leader peptidase from Escherichia coli were found to depend on the anionic phospholipid content of the membrane. Lowering the contents of anionic phospholipids facilitated membrane passage of positively charged loops. On the other hand, elevated contents of acidic phospholipids in the membrane rendered translocation more sensitive to positively charged residues. The results demonstrate that anionic lipids are determinants of membrane protein topology and suggest that interactions between negatively charged phospholipids and positively charged amino acid residues contribute to the orientation of membrane proteins.  相似文献   

18.
19.
This study views each protein structure as a network of noncovalent connections between amino acid side chains. Each amino acid in a protein structure is a node, and the strength of the noncovalent interactions between two amino acids is evaluated for edge determination. The protein structure graphs (PSGs) for 232 proteins have been constructed as a function of the cutoff of the amino acid interaction strength at a few carefully chosen values. Analysis of such PSGs constructed on the basis of edge weights has shown the following: 1), The PSGs exhibit a complex topological network behavior, which is dependent on the interaction cutoff chosen for PSG construction. 2), A transition is observed at a critical interaction cutoff, in all the proteins, as monitored by the size of the largest cluster (giant component) in the graph. Amazingly, this transition occurs within a narrow range of interaction cutoff for all the proteins, irrespective of the size or the fold topology. And 3), the amino acid preferences to be highly connected (hub frequency) have been evaluated as a function of the interaction cutoff. We observe that the aromatic residues along with arginine, histidine, and methionine act as strong hubs at high interaction cutoffs, whereas the hydrophobic leucine and isoleucine residues get added to these hubs at low interaction cutoffs, forming weak hubs. The hubs identified are found to play a role in bringing together different secondary structural elements in the tertiary structure of the proteins. They are also found to contribute to the additional stability of the thermophilic proteins when compared to their mesophilic counterparts and hence could be crucial for the folding and stability of the unique three-dimensional structure of proteins. Based on these results, we also predict a few residues in the thermophilic and mesophilic proteins that can be mutated to alter their thermal stability.  相似文献   

20.
At present, accuracies of secondary structural prediction scarcely go beyond 70-75%. Secondary structural comparison is carried out among sequence-identified proteins. The results show natural wobble between different secondary structural types is possible in homologous families, and the best prediction accuracy will rarely be 100%. Besides shortcoming of the prediction approaches, secondary structural wobble is found to be responsible for nearly all secondary structural prediction limits. Only average 73.2% of amino acid residue is conserved in secondary structural types. The wobble allows alpha-class/coil and beta-class/coil transitions but not direct alpha-class/beta-class transition. Propensity values representing the statistical occurrence of 20 amino acid residues in secondary structural wobbles are given.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号