Similar literature
 Found 20 similar documents; search took 15 ms
1.
We describe a general, modular method for developing protocols to identify the amino acid residues that most likely define the division of a protein superfamily into two subsets. As one possibility, we use PROBE to gather superfamily members and perform an ungapped alignment. We then use a modified BLOSUM62 substitution matrix to determine the discriminating power of each column of aligned residues. The overall method is particularly useful for predicting amino acids responsible for substrate or binding specificity when no structures are available. We apply our method to three pairs of protein classes in three different superfamilies, and present our results, some of which have been experimentally verified. This approach may accelerate the elucidation of enzymic substrate specificity, which is critical for both mechanistic insights into biocatalysis and ultimate application.

2.
Large hydrophobic residues (LHR) such as phenylalanine, isoleucine, leucine, methionine and valine play an important role in protein structure and activity. We describe the role of LHR in the complete sets of protein sequences of 15 different species; that is, we report the distribution of LHR across the proteins of each species. Proteins tend to contain about 27% large hydrophobic residues, both in total and all along the sequence. Proteins also accumulate more LHR in their active sites. A window analysis of these protein sequences shows that the 27% LHR content is more frequent at a window length of 45 amino acids. The influenza virus and P. falciparum show a random distribution of LHR in their proteins compared with other model organisms.
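The window analysis described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names and the step size are assumptions, and only the residue set (F, I, L, M, V) and the 45-residue window length come from the abstract.

```python
# Large hydrophobic residues named in the abstract:
# phenylalanine, isoleucine, leucine, methionine, valine.
LHR = set("FILMV")


def lhr_fraction(sequence):
    """Fraction of large hydrophobic residues in the whole sequence."""
    seq = sequence.upper()
    if not seq:
        return 0.0
    return sum(res in LHR for res in seq) / len(seq)


def lhr_window_fractions(sequence, window=45, step=1):
    """LHR fraction of every window of length `window` (hypothetical helper).

    Returns an empty list when the sequence is shorter than the window.
    """
    seq = sequence.upper()
    fractions = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        fractions.append(sum(res in LHR for res in chunk) / window)
    return fractions
```

Collecting the window fractions over many sequences and histogramming them for several window lengths would show which length most often gives a value near 0.27.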

3.
The structural organization of the B1 domain of streptococcal protein G (PGA) has been probed using molecular dynamics simulations, with a particular emphasis on the role of the solvent exposed Ile6 residue. In addition to the native protein (WT-PGA), three single-mutants (I6G-PGA, I6F-PGA, and I6T-PGA), one double-mutant (I6T,T53G-PGA), and three isolated peptide fragments (corresponding to the helix and the two beta-hairpins) were studied in the presence of explicit water molecules. Comparative analysis of the various systems showed that the level of perturbation was directly related to the hydrophobicity and the size of the side chain of residue 6, the internal rigidity of the proteins decreasing in the order I6T-PGA > I6G-PGA > WT-PGA > I6F-PGA. The results emphasized the importance of residue 6 in controlling both the integrity of the sheet's surface and the orientation of the helix in relation to the sheet by modulation of surface/core interactions. The effects of mutations were delocalized across the structure, and glycine residues, in particular, absorbed most of the introduced strain. A qualitative structural decomposition of the native fold into elementary building-blocks was achieved using principal component analysis and mechanical response matrices. Within this framework, internal motions of the protein were described as coordinated articulations of these structural units, mutations affecting mostly the amplitude of the motions rather than the structure/location of the building-blocks. Analysis of the isolated peptidic fragments suggested that packing did not play a determinant role in defining the elementary building-blocks, but that chain topology was mostly responsible.  相似文献   

4.
Patterns of alternation of hydrophobic and polar residues are a profound aspect of amino acid sequences, but a feature not easily interpreted for soluble proteins. Here we report statistics of hydrophobicity patterns in proteins of known structure in a current protein database as compared with results from earlier, more limited structure sets. Previous studies indicated that long hydrophobic runs, common in membrane proteins, are underrepresented in soluble proteins. Long runs of hydrophobic residues remain significantly underrepresented in soluble proteins, with none longer than 16 residues observed. These long runs most commonly occur as buried alpha helices, with extended hydrophobic strands less common. Avoiding aggregation of partially folded intermediates during intracellular folding remains a viable explanation for the rarity of long hydrophobic runs in soluble proteins. Comparison between database editions reveals robustness of statistics on aqueous proteins despite an approximately twofold increase in nonredundant sequences. The expanded database does now allow us to explain several deviations of hydrophobicity statistics from models of random sequence in terms of requirements of specific secondary structure elements. Comparison to prior membrane-bound protein sequences, however, shows significant qualitative changes, with the average hydrophobicity and frequency of long runs of hydrophobic residues noticeably increasing between the database editions. These results suggest that the aqueous proteins of solved structure may represent an essentially complete sample of the universe of aqueous sequences, while the membrane proteins of known structure are not yet representative of the universe of membrane-associated proteins, even by relatively simple measures of hydrophobic patterns.  相似文献   
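A run-length count of the kind these statistics rely on might look like the sketch below. The set of residues treated as hydrophobic is an assumption made for illustration (the abstract does not state the classification used), and the function names are hypothetical.

```python
from collections import Counter
from itertools import groupby

# ASSUMPTION: which residues count as hydrophobic is not given in the abstract.
HYDROPHOBIC = frozenset("AVLIMFWC")


def hydrophobic_run_lengths(sequence, hydrophobic=HYDROPHOBIC):
    """Map run length -> number of uninterrupted hydrophobic runs of that length."""
    runs = Counter()
    flags = (res in hydrophobic for res in sequence.upper())
    for is_hydrophobic, group in groupby(flags):
        if is_hydrophobic:
            runs[sum(1 for _ in group)] += 1
    return runs


def longest_hydrophobic_run(sequence, hydrophobic=HYDROPHOBIC):
    """Length of the longest hydrophobic run, or 0 if there is none."""
    runs = hydrophobic_run_lengths(sequence, hydrophobic)
    return max(runs) if runs else 0
```

Summing the counters over a set of soluble and a set of membrane sequences would give the two run-length distributions that the abstract compares.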

5.
The location measure of a residue in a globular protein is defined as the number of Cα atoms surrounding the residue within a sphere of radius 14 Å. This quantity is a measure of the exposure of a residue to solvent, and is closely related to the distance from the center of mass of the protein. In this work, the experimental value for each residue of a protein is obtained from X-ray crystallographic data, and the quantity is also calculated from the amino acid sequence by applying an empirical parameter set to it. The correlation between the experimental and computed quantities is as high as 0.50 on average over 92 proteins of known three-dimensional structure. Therefore, the location measure of every residue in a globular protein is predictable with good accuracy from the sequence.
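The structure-derived ("experimental") location measure defined above is simple to compute from Cα coordinates. The sketch below is illustrative only: the function name is hypothetical, and excluding the residue's own Cα atom from the count is an assumption, since the abstract does not say whether it is included.

```python
import math


def location_measures(ca_coords, radius=14.0):
    """For each residue, count the other C-alpha atoms within `radius` angstroms.

    `ca_coords` is a list of (x, y, z) tuples, one per residue.
    ASSUMPTION: the residue's own C-alpha atom is not counted.
    """
    counts = []
    for i, a in enumerate(ca_coords):
        neighbours = 0
        for j, b in enumerate(ca_coords):
            if i != j and math.dist(a, b) <= radius:
                neighbours += 1
        counts.append(neighbours)
    return counts
```

Buried residues near the center of the protein get high counts, while exposed residues near the surface get low ones, which is why the quantity tracks solvent exposure.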

6.
Balaji S  Aruna S  Srinivasan N 《Proteins》2003,53(4):783-791
Occurrence and accommodation of charged amino acid residues in proteins that are structurally equivalent to buried non-polar residues in homologues have been investigated. Using a dataset of 1,852 homologous pairs of crystal structures of proteins available at 2 Å or better resolution, 14,024 examples of apolar residues in the structurally conserved regions replaced by charged residues in homologues have been identified. Out of 2,530 cases of buried apolar residues, 1,677 of the equivalent charged residues in homologues are exposed and the rest of the charged residues are buried. These drastic substitutions are most often observed in homologous protein pairs with low sequence identity (<30%) and in large protein domains (>300 residues). Such buried charged residues in the large proteins are often located in the interface of sub-domains or in the interface of structural repeats. Beyond 7 Å of residue depth of buried apolar residues, or at less than 4% solvent accessibility, almost all the substituting charged residues are buried. It is also observed that acidic sidechains have a higher preference to be buried than positively charged residues. Buried charged residues tend to be accommodated in the interior by forming hydrogen bonds with another sidechain rather than with the main chain. The sidechains interacting with a buried charged residue are most often located in the structurally conserved regions of the alignment. About 50% of the observations involving a hydrogen bond between a buried charged sidechain and another sidechain correspond to salt bridges. Among the buried charged residues interacting with the main chain, positively charged sidechains commonly form hydrogen bonds with main chain carbonyls, while the negatively charged residues are accommodated by hydrogen bonding with the main chain amides. These carbonyls and amides are usually located in the loops that are structurally variable among homologous proteins.

7.
Mishra R  Gara SK  Mishra S  Prakash B 《Proteins》2005,59(2):332-338
Ras superfamily GTP-binding proteins regulate important signaling events in the cell. Ras, which often serves as a prototype, efficiently hydrolyzes GTP in conjunction with its regulator GAP. A conserved glutamine plays a vital role in GTP hydrolysis in most GTP-binding proteins. Mutating this glutamine in Ras has oncogenic effects, since it disrupts GTP hydrolysis. The analysis presented here is of GTP-binding proteins that are a paradox to oncogenic Ras, since they have the catalytic glutamine (Glncat) substituted by a hydrophobic amino acid, yet can hydrolyze GTP efficiently. We term these proteins HAS-GTPases. Analysis of the amino acid sequences of HAS-GTPases reveals prominent presence of insertions around the GTP-binding pocket. Homology modeling studies suggest an interesting means to achieve catalysis despite the drastic hydrophobic substitution replacing the key Glncat of Ras-like GTPases. The substituted hydrophobic residue adopts a "retracted conformation," where it is positioned away from the GTP, as its role in catalysis would be unproductive. This conformation is further stabilized by interactions with hydrophobic residues in its vicinity. These interacting residues are strongly conserved and hydrophobic in all HAS-GTPases, and correspond to residues Asp92 and Tyr96 of Ras. An experimental support for the "retracted conformation" of Switch II arises from the crystal structures of Ylqf and hGBP1. This conformation allows us to hypothesize that, unlike in classical GTPases, catalytic residues could be supplied by regions other than the Switch II (i.e., either the insertions or a neighboring domain).  相似文献   

8.
Previous experiments on the cystic fibrosis transmembrane conductance regulator suggested that non-native polar residues within membrane domains can compromise protein structure/function. However, depending on context, replacement of a native residue by a non-native residue can result either in genetic disease or in benign effects (e.g., polymorphisms). Knowledge of missense mutations that frequently cause protein malfunction and subsequent disease can accordingly reveal information as to the impact of these residues in local protein environments. We exploited this concept by performing a statistical comparison of disease-causing mutations in protein membrane-spanning domains versus soluble domains. Using the Human Gene Mutation Database of 240 proteins (including 80 membrane proteins) associated with human disease, we compared the relative phenotypic propensity to cause disease of the 20 naturally occurring amino acids when removed from, or inserted into, native protein sequences. We found that in transmembrane domains (TMDs), mutations involving polar residues, and ionizable residues in particular (notably arginine), are more often associated with protein malfunction than in soluble proteins. To further test the hypothesis that interhelical cross-links formed by membrane-embedded polar residues stabilize TMDs, we compared the occurrence of such residues in the TMDs of mesophilic and thermophilic prokaryotes. Results showed a significantly higher proportion of ionizable residues in thermophilic organisms, reinforcing the notion that membrane-embedded electrostatic interactions play critical roles in TMD stability.

9.
A database of 452 two-domain proteins with less than 25% homology was constructed. One half of the database was used to obtain statistics on the appearance of amino acid residues at domain boundaries. Small and hydrophilic residues (proline, glycine, asparagine, glutamic acid, arginine, etc.) occurred more often at domain boundaries than in total proteins. Hydrophobic residues (tryptophan, methionine, phenylalanine, etc.) were rarer at domain boundaries than in total proteins. Probability scales of amino acid appearance in boundary-flanking regions were constructed with these statistics and used to predict the domain boundaries in proteins of the other half of the database. The probability scale obtained by averaging the appearance of amino acids over an 8-residue region (±4 residues from the real domain boundaries) yielded the best results: domain boundaries were predicted within 40 residues of the real boundary in 57% of proteins and within 20 residues of the real boundary in 41% of proteins. The probability scale was used to predict the domain boundaries in proteins with unknown structures (CASP6).  相似文献   
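The prediction step described above (averaging a per-residue probability scale over an 8-residue region, ±4 residues around a candidate boundary) could be sketched as follows. The function names are hypothetical and the scale passed in must be supplied by the caller; the abstract does not list the actual scale values, so any numbers used with this sketch are toy values.

```python
def boundary_scores(sequence, scale, flank=4):
    """Score each candidate boundary by the mean scale value of the
    `flank` residues on either side (2 * flank residues in total).

    A candidate boundary `cut` lies between residues cut-1 and cut (0-based).
    Residues missing from `scale` contribute 0.0 (an assumption of this sketch).
    """
    values = [scale.get(res, 0.0) for res in sequence.upper()]
    scores = {}
    for cut in range(flank, len(values) - flank + 1):
        window = values[cut - flank:cut + flank]
        scores[cut] = sum(window) / len(window)
    return scores


def predict_boundary(sequence, scale, flank=4):
    """Return the highest-scoring candidate boundary, or None if the
    sequence is too short to hold a full window."""
    scores = boundary_scores(sequence, scale, flank)
    return max(scores, key=scores.get) if scores else None
```

A prediction would then count as a hit in the abstract's sense if it falls within 20 or 40 residues of the annotated boundary.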

10.
Much attention is being paid to protein databases as an important information source for proteome research. Although used extensively for similarity searches, protein databases themselves have not fully been characterized. In a systematic attempt to reveal protein-database characters that could contribute to revealing how protein chains are constructed, frequency distributions of all possible combinatorial sets of three, four, and five amino acids ("triplets," "quartets," and "pentats"; collectively called constituent sequences) have been examined in the nonredundant (nr) protein database, demonstrating the existence of nonrandom bias in their "availability" at the population level. Nonexistent short sequences of pentats were found that showed low availability in biological proteins against their expected probabilities of occurrence. Among them, six representative ones were successfully synthesized as peptides with reasonably high yields in a conventional Fmoc method, excluding the possibility that a putative physicochemical energy barrier in forming them could be a direct cause for the low availability. They were also expressed as soluble fusion proteins in a conventional Escherichia coli BL21Star(DE3) system with reasonably high yield, again excluding a possible difficulty in their biological synthesis. Together, these results suggest that information on three-dimensional structures and functions of proteins exists in the context of connections of short constituent sequences, and that proteins are composed of evolutionarily selected constituent sequences, which are reflected in their availability differences in the database. These results may have biological implications for protein structural studies.  相似文献   
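The comparison between observed "availability" and expected probability of occurrence can be illustrated with a small counting sketch. The null model used here (residues drawn independently according to their overall frequencies) is an assumption for illustration; the abstract does not specify how the expected probabilities were computed, and the function names are hypothetical.

```python
from collections import Counter
from math import prod


def kmer_counts(sequences, k):
    """Count every overlapping k-residue constituent sequence (k=5: pentats)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def expected_count(kmer, sequences):
    """Expected count of `kmer` if residues were drawn independently
    with their overall database frequencies (ASSUMED null model)."""
    residues = Counter("".join(sequences))
    total = sum(residues.values())
    k = len(kmer)
    positions = sum(max(len(s) - k + 1, 0) for s in sequences)
    return positions * prod(residues[r] / total for r in kmer)
```

Constituent sequences whose observed count is zero or far below the expected count are the low-availability candidates the abstract refers to.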

11.
An abundant cytoplasmic 43-kDa protein from Mycoplasma synoviae, a major pathogen from poultry, was identified as elongation factor Tu. The N-terminal amino acid sequence (AKLDFDRSKEHVNVGTIGHV) has 90% identity with the sequence of the Mycoplasma hominis elongation factor Tu protein. Monoclonal antibodies reacting with the M. synoviae elongation factor Tu protein also reacted with 43-kDa proteins from the avian Mycoplasma species Mycoplasma gallinarum, Mycoplasma gallinaceum, Mycoplasma pullorum, Mycoplasma cloacale, Mycoplasma iners and Mycoplasma meleagridis, but not with the proteins from Mycoplasma gallisepticum, Mycoplasma imitans or Mycoplasma iowae. In addition, two groups of phase variable integral membrane proteins, pMSA and pMSB, associated with hemadherence and pathogenicity of M. synoviae strains AAY-4 and ULB925 were identified. The cleavage of a larger hemagglutinating protein encoded by a gene homologous to the vlhA gene of M. synoviae generates pMSB1 and pMSA1 proteins defined by mAb 125 and by hemagglutination inhibiting mAb 3E10, respectively. The N-terminal amino acid sequences of pMSA proteins (SENKLI ... and SENETQ ...) probably indicate the cleavage site of the M. synoviae strain ULB 925 hemagglutinin.  相似文献   

12.
With the rapid growth of sequence databases, there is an increasing need for reliable functional characterisation and annotation of newly predicted proteins. To cope with such large data volumes, faster and more effective means of protein sequence characterisation and annotation are required. One promising approach is automatic large-scale functional characterisation and annotation, which is generated with limited human interaction. However, such an approach is heavily dependent on reliable data sources. The SWISS-PROT protein sequence database plays an essential role here owing to its high level of functional information.

13.
An algorithm was developed to predict the degree of exposure of amino acid residues in globular proteins. This algorithm, combined with standard discriminant analysis methods, was used to evaluate the accessibility of Lys and Arg residues to attack by trypsin-like proteases. The procedure can be useful for the computer-aided design of prolonged-action protein drug preparations.

14.
The tumor suppressor function of the wild-type p53 protein is transdominantly inhibited by tumor-derived mutant p53 proteins. Such transdominant inhibition limits the prospects for gene therapy approaches that aim to introduce wild-type p53 into cancer cells. The molecular mechanism for transdominant inhibition involves sequestration of wild-type p53 subunits into inactive wild-type/mutant hetero-tetramers. Thus, p53 proteins, whose oligomerization specificity is altered so they cannot interact with tumor-derived mutant p53, would escape transdominant inhibition. Aided by the known three-dimensional structure of the p53 tetramerization domain and by trial and error we designed a novel domain with seven amino acid substitutions in the hydrophobic core. A full-length p53 protein bearing this novel domain formed homo-tetramers and had tumor suppressor function, but did not hetero-oligomerize with tumor-derived mutant p53 and resisted transdominant inhibition. Thus, hydrophobic core residues influence the oligomerization specificity of the p53 tetramerization domain.

15.
Discriminating outer membrane (OM) proteins from globular proteins is an important task. The structural analysis of β-strands dominating globular (all-β) proteins and OM proteins provides useful insight for distinguishing between them. In this work, we analyze the characteristic features of the 20 amino acid residues in all-β and OM proteins. We set up numerical indices for several properties of amino acid residues, such as conformational parameters, surrounding hydrophobicity, accessible surface area and reduction in accessibility, and inter-residue contacts. We found that all the aromatic residues prefer to be in β-strands of both globular and OM proteins. The surrounding hydrophobicity of aromatic and non-polar amino acid residues in globular proteins is significantly higher than that in OM proteins. The residues Trp, Arg, Phe and Gln show a remarkable difference in reduction in accessibility between all-β globular (βG) and OM proteins. The positively charged residues Lys and Arg have more contacts in the membrane part of OM proteins than in globular proteins. Further, the behavior of the 20 amino acid residues in β-strand segments of globular and OM proteins is discussed. The parameters developed in this work can be used for identifying transmembrane β-strands in OM proteins and for discriminating βG proteins from OM proteins.

16.
Experiments carried out to determine the amino acid requirement in growing animals are often based on the premise that the amino acid composition of body protein is constant. However, there are indications that this assumption may not be correct. The objective of this study was to test the effect of feeding piglets a diet deficient or not in total sulfur amino acids (TSAA; Met + Cys) on nitrogen retention and amino acid composition of proteins in different body compartments. Six blocks of three pigs each were used in a combined comparative slaughter and nitrogen balance study. One piglet in each block was slaughtered at 42 days of age, whereas the other piglets received a diet deficient or not in TSAA for 19 days and were slaughtered thereafter. Two diets were formulated to provide either 0.20% Met and 0.45% TSAA (on a standardized ileal digestible basis) or 0.46% Met and 0.70% TSAA. Diets were offered approximately 25% below ad libitum intake. At slaughter, the whole animal was divided into carcass, blood, intestines, liver, and the combined head, tail, feet and other organs (HFTO), which were analyzed for nitrogen and amino acid contents. Samples of the longissimus muscle (LM) were analyzed for myosin heavy chain (MyHC) and actin contents. Nitrogen retention was 20% lower in piglets receiving the TSAA-deficient diet (P < 0.01). In these piglets, the nitrogen content in tissue gain was lower in the empty body, carcass, LM and blood (P < 0.05) or tended to be lower in HFTO (P < 0.10), but was not different in the intestines and liver. The Met content in retained protein was lower in the empty body, LM and blood (P < 0.05), and tended to be lower in the carcass (P < 0.10). The Cys content was lower in LM, but higher in blood of piglets receiving the TSAA-deficient diet (P < 0.05). Skeletal muscle appeared to be affected most by the TSAA deficiency. In LM, the Met content in retained protein was reduced by 12% and total Met retention by more than 60%. The MyHC and actin contents in LM were not affected by the TSAA content of the diet. These results show that a deficient TSAA supply affects the amino acid composition of different body proteins. This questions the use of a constant ideal amino acid profile to express dietary amino acid requirements, but also illustrates the plasticity of the animal to cope with nutritional challenges.

17.
A measure of similarity between amino acid residues based on the analysis of the surroundings of each residue in primary structures of native proteins is proposed. The statistical data used for this purpose were obtained from the analysis of 168,808 protein sequences, which comprise the Protein Identification Research database (release 63). Using various threshold values of the proposed measure, amino acid residues were classified into several groups. The resulting classification differs essentially from groupings previously used. The numerical measure of amino acid residue similarity can be used in site-directed mutagenesis studies to predict the probability of local spatial rearrangements in proteins.

18.
A key question associated with topology predictions for membrane proteins is whether there is sufficient variation in the biophysical properties of residues at the membrane interface to enable identification of TM spans in a robust and efficient manner using relatively simple methods of analysis. Here, a test for the homogeneity of multinomial populations is used to identify statistical differences between the residue compositions of windows within datasets of aligned non-homologous TM α-helices. Using this approach, the accuracy and robustness of the predicted boundaries for datasets of uncleaved signal (US) sequences and stop transfer (ST) sequences are tested. The validity of the 21-residue length, which is generally assumed for TM spans in membrane protein topology prediction, is also investigated, and it is suggested that ST sequences may be better represented by a length of 22 residues.
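One standard test for the homogeneity of multinomial populations is Pearson's chi-square test on a contingency table of residue counts. The sketch below computes that statistic for two or more windows; it is an assumption that this is the variant the authors used, and the function name and table layout are hypothetical.

```python
def chi_square_homogeneity(table):
    """Pearson chi-square statistic and degrees of freedom for homogeneity.

    `table` has one row per window (population) and one column per residue
    category; each cell is an observed count. Cells whose expected count is
    zero are skipped (a choice made for this sketch).
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for r, row in enumerate(table):
        for c, observed in enumerate(row):
            expected = row_totals[r] * col_totals[c] / grand_total
            if expected > 0:
                statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(col_totals) - 1)
    return statistic, dof
```

A large statistic relative to the chi-square distribution with `dof` degrees of freedom indicates that adjacent windows differ in residue composition, which is the signal used to place a TM boundary.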

19.
20.
To detect the latent periodicity of protein families responsible for various biological functions, the methods of information decomposition, cyclic profile alignment, and noise decomposition were used. Latent periodicity specific to a particular family was recognized in 94 of 110 analyzed protein families. Family-specific periodicity was found for more than 70% of the amino acid sequences in each of these families. Based on such sequences, the characteristic profile of the latent periodicity has been deduced for each family. A possible relationship between the recognized latent periodicity, the evolution of proteins, and their structural organization is discussed.
