Similar articles
 20 similar articles found
1.
One of the well-known observations of proteins from thermophilic bacteria is the bias of the amino acid composition in which charged residues are present in large numbers, and polar residues are scarce. On the other hand, it has been reported that the molecular surfaces of proteins are adapted to their subcellular locations, in terms of the amino acid composition. Thus, it would be reasonable to expect that the differences in the amino acid compositions between proteins of thermophilic and mesophilic bacteria would be much greater on the protein surface than in the interior. We performed systematic comparisons between proteins from thermophilic bacteria and mesophilic bacteria, in terms of the amino acid composition of the protein surface and the interior, as well as the entire amino acid chains, by using sequence information from the genome projects. The biased amino acid composition of thermophilic proteins was confirmed, and the differences from those of mesophilic proteins were most obvious in the compositions of the protein surface. In contrast to the surface composition, the interior composition was not distinctive between the thermophilic and mesophilic proteins. The frequency of the amino acid pairs that are closely located in the space was also analyzed to show the same trend of the single amino acid compositions. Interestingly, extracellular proteins from mesophilic bacteria showed an inverse trend against thermophilic proteins (i.e. a reduced number of charged residues and rich in polar residues). Nuclear proteins from eukaryotes, which are known to be abundant in positive charges, showed different compositions as a whole from the thermophiles. These results suggest that the bias of the amino acid composition of thermophilic proteins is due to the residues on the protein surfaces, which may be constrained by the extreme environment.  相似文献   
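The charged-versus-polar bias described in this abstract can be illustrated with a small composition calculation. The following is a minimal sketch, not the authors' method: the grouping of Asp, Glu, Lys, and Arg as "charged" and Asn, Gln, Ser, and Thr as "polar" is an assumption made for illustration, and the function name `charged_polar_fractions` is hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"
CHARGED = "DEKR"   # assumed grouping: Asp, Glu, Lys, Arg
POLAR = "NQST"     # assumed grouping: Asn, Gln, Ser, Thr


def charged_polar_fractions(sequence):
    """Return (charged fraction, polar fraction) over the standard residues."""
    counts = Counter(sequence.upper())
    total = sum(counts[aa] for aa in STANDARD)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    charged = sum(counts[aa] for aa in CHARGED) / total
    polar = sum(counts[aa] for aa in POLAR) / total
    return charged, polar
```

Applying the same function separately to surface and interior residues would require a solvent-accessibility assignment, which this sketch does not attempt.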

2.
It is known that in thermophiles the G+C content of ribosomal RNA linearly correlates with growth temperature, while that of genomic DNA does not. Although the G+C contents (singlet) of the genomic DNAs of thermophiles and mesophiles do not differ significantly, the dinucleotide (doublet) compositions of the two bacterial groups clearly do. The average amino acid compositions of proteins of the two groups are also distinct. Based on these facts, we here analyzed the DNA and protein compositions of various bacteria in terms of the optimal growth temperature (OGT). Regression analyses of the sequence data for thermophilic, mesophilic and psychrophilic bacteria revealed good linear relationships between OGT and the dinucleotide compositions of DNA, and between OGT and the amino acid compositions of proteins. Together with the above-mentioned linear relationship between ribosomal RNA and OGT, the DNA and protein compositions can be regarded as thermostability measures for RNA, DNA and proteins, covering a wide range of temperatures. Both the DNA and proteins of psychrophiles apparently exhibit characteristics diametrically opposite to those of thermophiles. The physicochemical parameters of dinucleotides suggested that supercoiling of DNA is relevant to its thermostability. Protein stability in thermophiles is realized primarily through global changes that increase charged residues (i.e., Glu, Arg, and Lys) on the molecular surface of all proteins. This kind of global change is attainable through a change in the amino acid composition coupled with alterations in the DNA base composition. The general strategies of thermophiles and psychrophiles for adaptation to higher and lower temperatures, respectively, that are suggested by the present study are discussed.

3.
Tekaia F  Yeramian E  Dujon B 《Gene》2002,297(1-2):51-60
Can we infer the lifestyle of an organism from the characteristic properties of its genome? More precisely, what are the relations between easily quantifiable properties from genomic sequences, such as amino-acid compositions, and more subtle characteristics concerning for example lifestyles or evolutionary trends? Here, we seek a global picture for such properties, based on a large number (56) of complete genomes, including significant numbers of representatives from the three domains of life. We consider the amino acid compositions of the predicted proteomes, and we use correspondence analysis, as a multivariate method to extract the relevant information from the large-scale data. From these analyses we derive a series of conclusions, concerning lifestyles, as well as physico-chemical and evolutionary trends: (1) correspondence analysis of the amino acid compositions permits discrimination between the three known lifestyles (mesophily/thermophily/hyperthermophily). (2) For various organisms, amino-acid composition properties are essentially driven by GC content, and to a significantly lesser extent by growth temperatures associated with lifestyles. Roughly speaking, the respective contributions of these two components are 57 and 20%. It is notable that these proportions are essentially unchanged with respect to a previous analysis (Nature 393 (1998) 537), which involved only 15 genomes, available at the time. (3) In terms of amino acid compositional biases, two specific 'signatures' for thermophily (in a broad sense, including hyperthermophily) can be detected. First, thermophilic species display a relative abundance in glutamic acid (Glu), concomitantly with the depletion in glutamine. Second, in thermophilic species, the relative abundance in Glu (negative charge) is significantly correlated (Pearson correlation coefficient r=0.83 with P<0.0001), with the increase in the lumped 'pool' lysine+arginine (positive charges). 
This correlation (absent in mesophiles) could be interpreted on a physico-chemical basis, relevant to the thermostability of proteins. (4) Statistically significant differences are observed between the average lengths of the genes in the surveyed species, which follow their distribution between the three domains of life. Also a significant difference is observed between the average lengths of thermophilic (283.0±5.8) versus mesophilic (340±9.4) genes. It is thus possible that the 'general' shortening of the primary sequences in thermophilic proteins plays a role in thermostability. (5) Considering various combinations of conservation properties (genes conserved exclusively in eukaryotes, in archaea, in bacteria, in combinations of two domains, etc.) correspondence analysis reveals a trend towards thermophilic-hyperthermophilic profiles for the most conserved subset of genes (ancient genes). (6) When limited to the subset of species-specific genes, correspondence analysis leads to a different picture for the clustering of genomes following amino-acid compositions: for example, the 'core' specific part of a genome can bear lifestyle signatures different from those of the complete genome. Various results are discussed both on methodological and biological grounds. The evolutionary perspectives opened by our analyses are noted.

4.
A prerequisite for the survival of (micro)organisms at high temperatures is an adaptation of protein stability to extreme environmental conditions. In contrast to soluble proteins, where many factors have already been identified, the mechanisms by which the thermostability of membrane proteins is enhanced are almost unknown. The hydrophobic membrane environment constrains possible stabilizing factors for transmembrane domains, so that a difference might be expected between soluble and membrane proteins. Here we present sequence analysis of predicted transmembrane helices of the genomes from eight thermophilic and 12 mesophilic organisms. A comparison of the amino acid compositions indicates that more polar residues can be found in the transmembrane helices of thermophilic organisms. Particularly, the amino acids aspartic acid and glutamic acid replace the corresponding amides. Cysteine residues are found to be significantly decreased by about 70% in thermophilic membrane domains suggesting a non-specific function of most cysteine residues in transmembrane domains of mesophilic organisms. By a pair-motif analysis of the two sets of transmembrane helices, we found that the small residues glycine and serine contribute more to transmembrane helix-helix interactions in thermophilic organisms. This may result in a tighter packing of the helices allowing more hydrogen bond formation.  相似文献   

5.
Nakariyakul S  Liu ZP  Chen L 《Amino acids》2012,42(5):1947-1953
Detecting thermophilic proteins is an important task in engineering proteins that are stable at the temperatures of interest. In this work, we develop a simple but efficient method to classify thermophilic proteins from mesophilic ones using the amino acid and dipeptide compositions. Since most of the amino acid and dipeptide compositions are redundant, we propose a new forward floating selection technique to select only a useful subset of these compositions as features for support vector machine-based classification. We test the proposed method on a benchmark data set of 915 thermophilic and 793 mesophilic proteins. The results show that our method using 28 amino acid and dipeptide compositions achieves an accuracy rate of 93.3% evaluated by the jackknife cross-validation test, which is higher not only than the existing methods but also than using all amino acid and dipeptide compositions.
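The amino acid and dipeptide compositions used as classifier inputs in this abstract can be sketched as a 420-dimensional feature vector (20 single-residue frequencies followed by 400 dipeptide frequencies). This is a minimal illustration under stated assumptions: the function name `composition_features`, the alphabetical ordering of features, and the handling of non-standard residues (skipped) are choices made here, not details taken from the paper. The feature selection and SVM steps are not shown.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def composition_features(sequence):
    """Return 20 amino acid frequencies followed by 400 dipeptide frequencies."""
    seq = sequence.upper()
    aa_counts = dict.fromkeys(AMINO_ACIDS, 0)
    dp_counts = dict.fromkeys(DIPEPTIDES, 0)
    for residue in seq:
        if residue in aa_counts:
            aa_counts[residue] += 1
    # Overlapping adjacent pairs; pairs with a non-standard letter are skipped.
    for first, second in zip(seq, seq[1:]):
        pair = first + second
        if pair in dp_counts:
            dp_counts[pair] += 1
    n_aa = sum(aa_counts.values())
    n_dp = sum(dp_counts.values())
    aa_part = [aa_counts[a] / n_aa if n_aa else 0.0 for a in AMINO_ACIDS]
    dp_part = [dp_counts[d] / n_dp if n_dp else 0.0 for d in DIPEPTIDES]
    return aa_part + dp_part
```

A vector like this could then be passed to any feature-selection routine and classifier; selecting a small subset, as the abstract reports, reduces redundancy among the 420 values.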

6.
H Nakashima  K Nishikawa  T Ooi 《Proteins》1990,8(2):173-178
A compact mitochondrial gene contains all essential information about the synthesis of mitochondrial proteins which play their roles in a small compartment of the mitochondrium. Almost no noncoding regions have been found through the gene, but a necessary set of tRNAs for the 20 amino acids is provided for biosynthesis, some of them coding different amino acids from those in a usual cell. Since the gene is so compact that the produced proteins would have some characteristic aspects for the mitochondrium, amino acid compositions of mitochondrial proteins (mt-proteins) were examined in the 20-dimensional composition space. The results show that compositions of proteins translated from the mitochondrial genes have a distinct character having more hydrophobic content than others, which is illustrated by a clustered distribution in the multidimensional composition space. The cluster is located at the tail edge of the global distribution pattern of a Gaussian shape for other various kinds of proteins in the space. The mt-proteins are rich in hydrophobic amino acids as is a membrane protein, but are different from other membrane proteins in a lesser content of Val. A good correlation found between the base and amino acid compositions for the mitochondria was examined in comparison to those of organisms such as thermophilic bacterium having an extreme G-C-rich base composition.  相似文献   

7.
Starting from two datasets of codon usage in coding sequences from mesophilic and thermophilic bacteria, we used internal correspondence analysis to study the variability of codon usage within and between species, and within and between amino acids. The first dataset included 18,958,458 codons from 58,482 coding sequences from completely sequenced genomes of 25 species, along with 6,793,581 dinucleotides from 21,876 intergenic spaces. The second dataset, with partially sequenced genomes, included 97,095,873 codons from 293 bacterial species. Results were consistent between the two datasets. The trend for the amino-acid composition of thermophilic proteins was found to be under the control of a pressure at the nucleic acid level, not a selection at the protein level. This effect was not present in intergenic spaces, ruling out a pressure at the DNA level. The pattern at the mRNA level was more complex than a simple purine enrichment of the sense strand of coding sequences. Outliers in the partial genome dataset introduced a note of caution about the interpretation of temperature as the direct determinant of the trend observed in thermophiles. The surprising lack of selection on the amino-acid content of thermophilic proteins suggests that the amino-acid repertoire was set up in a hot environment.  相似文献   

8.
A database of 392 homologous pairs of proteins from thermophilic and mesophilic organisms was created. Using this database, we found that proteins from thermophilic organisms contain more atom-atom contacts per residue than their mesophilic homologues. The increase in the number of contacts comes from the exterior, solvent-accessible amino acid residues. The amino acid compositions of the interior (solvent-inaccessible) and exterior residues of proteins from thermophilic and mesophilic organisms were analyzed. We found that the exterior residues of proteins from thermophilic organisms contain more Lys, Arg, and Glu and fewer Ala, Asp, Asn, Gln, Ser, and Thr than those of proteins from mesophilic organisms. The amino acid compositions of the interior residues do not differ between the two groups.

9.
Liang HK  Huang CM  Ko MT  Hwang JK 《Proteins》2005,59(1):58-63
Structural analysis is useful in elucidating structural features responsible for enhanced thermal stability of proteins. However, due to the rapid increase of sequenced genomic data, there are far more protein sequences than the corresponding three-dimensional (3D) structures. The usual sequence-based amino acid composition analysis provides useful but simplified clues about the amino acid types related to thermal stability of proteins. In this work, we developed a statistical approach to identify the significant amino acid coupling sequence patterns in thermophilic proteins. The amino acid coupling sequence pattern is defined as any 2 types of amino acids separated by 1 or more amino acids. Using this approach, we construct the rho profiles for the coupling patterns. The rho value gives a measure of the relative occurrence of a coupling pattern in thermophiles compared with mesophiles. We found that thermophiles and mesophiles exhibit significant bias in their amino acid coupling patterns. We showed that such bias is mainly due to temperature adaptation instead of species or GC content variations. Though no single outstanding coupling pattern can adequately account for protein thermostability, we can use a group of amino acid coupling patterns having strong statistical significance (p values < 10⁻⁷) to distinguish between thermophilic and mesophilic proteins. We found a good correlation between the optimal growth temperatures of the genomes and the occurrences of the coupling patterns (the correlation coefficient is 0.89). Furthermore, we can separate the thermophilic proteins from their mesophilic orthologs using the amino acid coupling patterns. These results may be useful in the study of the enhanced stability of proteins from thermophiles, especially when structural information is scarce.
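The coupling pattern defined in this abstract (two amino acid types separated by one or more residues) can be counted directly from a sequence. The sketch below only counts occurrences for a fixed separation; the abstract does not give the formula for the rho value, so it is deliberately not computed here. The function name `coupling_pattern_counts` and the `gap` parameter are hypothetical.

```python
from collections import Counter


def coupling_pattern_counts(sequence, gap):
    """Count ordered residue pairs (X, Y) separated by exactly `gap` residues."""
    if gap < 1:
        raise ValueError("gap must be at least 1 intervening residue")
    seq = sequence.upper()
    step = gap + 1  # distance between the two coupled positions
    return Counter((seq[i], seq[i + step]) for i in range(len(seq) - step))
```

Counts obtained this way for a thermophilic and a mesophilic sequence set could then be compared with whatever relative-occurrence statistic is appropriate.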

10.
The influence of dipeptide composition on protein thermostability   Cited by: 5 (self-citations: 0, citations by others: 5)
Ding Y  Cai Y  Zhang G  Xu W 《FEBS letters》2004,569(1-3):284-288
In this work, the influence of dipeptide composition on protein thermostability was studied. After comparing the normalized dipeptide composition between mesophilic proteins and (hyper)thermophilic proteins, we concluded that when organism optimal growth temperature increased, for archaeal proteins, the compositions of VK, KI, YK, IK, KV, KY, and EV increased significantly and the compositions of DA, AD, TD, DD, DT, HD, DH, DR, and DG decreased significantly; and for bacterial proteins, the compositions of KE, EE, EK, YE, VK, KV, KK, LK, EI, EV, RK, EF, KY, VE, KI, KG, EY, FK, KF, FE, KR, VY, MK, WK, and WE increased significantly and the compositions of WQ, AA, QA, MQ, AW, QW, QQ, RQ, QH, HQ, AD, AQ, WL, QL, HA, and DA decreased significantly. These characteristic dipeptides are therefore correlated with protein thermostability. At the same time, the influence of single amino acid composition on protein thermostability was also studied for comparison. We found that the influence of single amino acid composition could be deduced from the influence of dipeptide composition. We therefore suggest that the influence of dipeptide composition on protein thermostability is larger than the influence of amino acid composition. The characteristic dipeptides not only describe the dipeptides that influence protein thermostability significantly but also show the relationship among significant single amino acids that influence protein thermostability.

11.
A number of studies have addressed the environmental temperatures experienced by ancient life. Computational studies using a nonhomogeneous evolution model have estimated ancestral G + C contents of ribosomal RNAs and the amino acid compositions of ancestral proteins, generating hypotheses regarding the mesophilic last universal common ancestor. In contrast, our previous study computationally reconstructed ancestral amino acid sequences of nucleoside diphosphate kinases using a homogeneous model and then empirically resurrected the ancestral proteins. The thermal stabilities of these ancestral proteins were equivalent to or greater than those of extant homologous thermophilic proteins, supporting the thermophilic universal ancestor theory. In this study, we reinferred ancestral sequences using a dataset from which hyperthermophilic sequences were excluded. We also reinferred ancestral sequences using a nonhomogeneous evolution model. The newly reconstructed ancestral proteins are still thermally stable, further supporting the hypothesis that the ancient organisms contained thermally stable proteins and therefore that they were thermophilic.  相似文献   

12.
MOTIVATION: Knowledge of how proteomic amino acid composition has changed over time is important for constructing realistic models of protein evolution and increasing our understanding of molecular evolutionary history. The proteomic amino acid composition of the Last Universal Ancestor (LUA) of life is of particular interest, since that might provide insight into the early evolution of proteins and the nature of the LUA itself. RESULTS: We introduce a method to estimate ancestral amino acid composition that is based on expectation-maximization. On simulated data, the approach was found to be very effective in estimating ancestral amino acid composition, with accuracy improving as the number of residues in the dataset was increased. The method was then used to infer the amino acid composition of a set of proteins in the LUA. In general, as compared with the modern protein set, LUA proteins were found to be richer in amino acids that are believed to have been most abundant in the prebiotic environment and poorer in those believed to have been unavailable or scarce. Additionally, we found the inferred amino acid composition of this protein set in the LUA to be more similar to the observed composition of the same set in extant thermophilic species than in extant mesophilic species, supporting the idea that the LUA lived in a thermophilic environment. AVAILABILITY: The program is available at http://compbio.cs.princeton.edu/ancestralaa  相似文献   

13.
Base composition, codon usages and amino acid usages have been analyzed by taking 529 orthologous sequences of Aquifex aeolicus and Bacillus subtilis, having different optimal growth temperatures. These two bacteria do not have significant difference in overall GC composition, but GC(1+2) and GC3 levels were found to vary significantly. Significant increments in purine content and GC3 composition have been observed in the coding sequences of Aquifex aeolicus than its Bacillus subtilis counterparts. Correspondence analyses on codon and amino acid usages reveal that variation in base composition actually influences their codon and amino acid usages. Two selection pressures acting on the nucleotide level (GC3 and purine enrichment), causes variation in the amino acid usage differently in different protein secondary structures. Our results suggest that adaptation of amino acid usages in coil structure of Aquifex aeolicus proteins is under the control of both purine increment and GC3 composition, whereas the adaptation of the amino acids in the helical region of thermophilic bacteria is strongly influenced by the purine content. Evolutionary perspectives concerning the temperature adaptation of DNA and protein molecules of these two bacteria have been discussed on the basis of these results.  相似文献   
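The nucleotide-level quantities named in this abstract, GC(1+2), GC3, and purine content, can be computed from a coding sequence as follows. This is a minimal sketch assuming an in-frame DNA sequence whose length is a multiple of three; whether the original study excluded start or stop codons is not stated, so no codons are removed here. The function name `codon_position_stats` is hypothetical.

```python
def codon_position_stats(cds):
    """Return GC content at codon positions 1+2, at position 3, and purine content."""
    seq = cds.upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("expected a non-empty in-frame coding sequence")
    pos12 = [base for i, base in enumerate(seq) if i % 3 != 2]
    pos3 = seq[2::3]

    def gc_fraction(bases):
        return sum(base in "GC" for base in bases) / len(bases)

    return {
        "GC12": gc_fraction(pos12),
        "GC3": gc_fraction(pos3),
        # Purines are A and G on the given (sense) strand.
        "purine": sum(base in "AG" for base in seq) / len(seq),
    }
```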

14.
Archaea, bacteria and eukaryotes represent the main kingdoms of life. Is there any trend for amino acid compositions of proteins found in full genomes of species of different kingdoms? What is the percentage of totally unstructured proteins in various proteomes? We obtained amino acid frequencies for different taxa using 195 known proteomes and all annotated sequences from the Swiss-Prot database. Investigation of the two databases (proteomes and Swiss-Prot) shows that the amino acid compositions of proteins differ substantially for different kingdoms of life, and this difference is larger between different proteomes than between different kingdoms of life. Our data demonstrate that there is a surprisingly small selection for the amino acid composition of proteins for higher organisms (eukaryotes) and their viruses in comparison with the "random" frequency following from a uniform usage of codons of the universal genetic code. On the contrary, lower organisms (bacteria and especially archaea) demonstrate an enhanced selection of amino acids. Moreover, according to our estimates, 12%, 3% and 2% of the proteins in eukaryotic, bacterial and archaean proteomes are totally disordered, and long (> 41 residues) disordered segments are found to occur in 16% of archaean, 20% of eubacterial and 43% of eukaryotic proteins for 19 archaean, 159 bacterial and 17 eukaryotic proteomes, respectively. Correlation analysis of the amino acid compositions of proteins from various taxa shows that the highest correlation is observed between eukaryotes and their viruses (the correlation coefficient is 0.98), and bacteria and their viruses (the correlation coefficient is 0.96), while the correlation between eukaryotes and archaea is only 0.85.
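The "random" amino acid frequency that this abstract uses as a baseline follows from the number of codons assigned to each amino acid in the standard genetic code, assuming every sense codon is used equally often. A minimal sketch (the function name `uniform_codon_frequencies` is hypothetical):

```python
# Number of sense codons per amino acid in the standard genetic code.
CODONS_PER_AA = {
    "L": 6, "S": 6, "R": 6,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "I": 3,
    "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "H": 2, "K": 2, "F": 2, "Y": 2,
    "M": 1, "W": 1,
}


def uniform_codon_frequencies():
    """Expected amino acid frequencies if all 61 sense codons are equally used."""
    total = sum(CODONS_PER_AA.values())
    return {aa: n / total for aa, n in CODONS_PER_AA.items()}
```

Under this baseline, Leu, Ser, and Arg are each expected at 6/61 (about 9.8%), while Met and Trp are each expected at 1/61 (about 1.6%).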

15.
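Comparison of the amino acid composition of proteins from thermophilic and mesophilic microorganisms   Cited by: 11 (self-citations: 0, citations by others: 11)
The thermophilic character of thermophilic microorganisms is closely tied to the high thermal stability of their proteins. To explore the mechanism of thermostability in thermophilic proteins and to compare the amino acid composition of proteins from thermophilic and mesophilic microorganisms, 110 pairs of homologous protein sequences were collected, one member of each pair from a thermophile and the other from a mesophile. The two groups were compared with respect to the content of each amino acid, the hydrophobic amino acid composition, the hydrophobicity index, and the charged amino acid composition. Small but statistically significant differences were found in the content of several amino acids, and thermophilic proteins had a higher average hydrophobicity and a higher proportion of charged amino acids than mesophilic proteins. Analysis of the "aliphatic index" of the two groups showed that the higher aliphatic index of thermophilic proteins is due to their higher leucine content and is unrelated to the other amino acids that affect the index; the significance of this index is therefore questionable. Comparing the amino acid compositions of large numbers of homologous thermophilic and mesophilic proteins can reveal some general rules of protein thermostability.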

16.
The factors contributing to the thermal stability of proteins from thermophilic origins are matters of intense debate and investigation. Thermophilic proteins are thought to possess better packed interiors than their mesophilic counterparts, leading to lesser overall flexibility and a corresponding reduction in surface-to-volume ratio. These observations prompted an analysis of B values reported in high-resolution X-ray crystal structures of mesophilic and thermophilic proteins. In this analysis, the following aspects were addressed: (1) frequency distribution of normalized B values (B' factors) over all the proteins and for individual amino acids; (2) amino acid compositions in high B value regions of polypeptide chains; (3) variation in the B values from core to the surface of proteins in terms of their radius of gyration; and (4) degree of dispersion of normalized B values in spheres around the Cα atoms. The analysis revealed that (1) Ser and Thr have lesser flexibility in thermophiles than in mesophiles, (2) the proportion of Glu and Lys in high B value regions of thermophiles is higher and that of Ser and Thr is lower and (3) the dispersion of B values within spheres at Cα atoms is similar in mesophiles and thermophiles. These observations reflect plausible differences in the dynamics of thermophilic and mesophilic proteins and suggest amino acid substitutions that are likely to change thermal stability.
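The abstract does not say how the B values were normalized. One common convention, assumed here purely for illustration, is a per-structure z-score: subtract the mean B value of the structure and divide by its standard deviation, so that values from different crystal structures become comparable. The function name `normalize_b_factors` is hypothetical.

```python
from statistics import mean, pstdev


def normalize_b_factors(b_values):
    """Z-score B values within one structure (assumed normalization scheme)."""
    mu = mean(b_values)
    sigma = pstdev(b_values)  # population standard deviation of the structure
    if sigma == 0:
        raise ValueError("all B values are identical; cannot normalize")
    return [(b - mu) / sigma for b in b_values]
```

Residues with a normalized value well above zero would then fall in the "high B value regions" whose composition the abstract compares.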

17.
We have cloned and sequenced the gene for DNA ligase from Thermus thermophilus. A comparison of this sequence and those of other ligases reveals significant homology only with that of Escherichia coli. The overall amino acid composition of the thermophilic ligase and the pattern of amino acid substitutions between the two proteins are consistent with compositional biases in other thermophilic enzymes. We have engineered the expression of the T. thermophilus gene in Escherichia coli, and we show that E. coli proteins may be substantially removed from the thermostable ligase by a simple heat precipitation step.  相似文献   

18.
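To study the effect of primary structure on protein thermostability, the amino acid content of 32 protein sequences from 16 families was analyzed with the DNAMAN software, and the effect of amino acid composition on thermostability was assessed statistically. Comparing the high- and low-temperature protein sequences within each family, and all high- and low-temperature sequences across the 16 families, indicates that, going from low to high temperature, the Ser and Cys content decreases significantly, while the Arg, Ile, and Pro content increases significantly. High-temperature proteins thus tend to contain hydrophobic amino acids and to avoid hydrophilic ones.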

19.
Zhou XX  Wang YB  Pan YJ  Li WF 《Amino acids》2008,34(1):25-33
Summary. Thermophilic proteins show substantially higher intrinsic thermal stability than their mesophilic counterparts. Amino acid composition is believed to alter the intrinsic stability of proteins. Several investigations and mutagenesis experiments have been carried out to understand how amino acid composition contributes to the thermostability of proteins. This review presents some generalized features of amino acid composition found in thermophilic proteins, including an increase in residue hydrophobicity, a decrease in uncharged polar residues, an increase in charged residues, an increase in aromatic residues, certain amino acid coupling patterns and amino acid preferences for thermophilic proteins. The differences in amino acid composition between thermophilic and mesophilic proteins are related to some properties of amino acids. These features provide guidelines for engineering mesophilic proteins into thermophilic proteins. Authors’ addresses: Yuan-Jiang Pan, Institute of Chemical Biology and Pharmaceutical Chemistry, Zhejiang University, Zhejiang University Road 38, Hangzhou 310027, China; Wei-Fen Li, Microbiology Division, College of Animal Science, Zhejiang University, Hangzhou 310029, China

20.
The molecular weight of malate synthase purified from a thermophilic Bacillus was determined to be 62,000 by sedimentation equilibrium methods, confirming the value obtained earlier by the gel filtration technique. This enzyme and its homologs from other bacteria, which are all monomeric proteins with molecular weights of approximately 60,000, therefore differ from the considerably larger and multimeric malate synthases from yeast, Neurospora crassa, and other eucaryotic microorganisms and plants. Amino acid analysis reveals the thermophile synthase to be relatively rich in glutamic acid and to have a higher content of arginine in comparison with the yeast enzyme. The Bacillus enzyme is an acidic protein with an isoelectric pH of 4.6 and has two sulfhydryl groups titratable with 5,5′-dithiobis(2-nitrobenzoic acid). The parameters indicative of its overall hydrophobicity and of its levels of helicity and turn, which were deduced from the amino acid composition, lie well within the range recorded for a number of mesophile and thermophile enzymes. However, the level of β-sheet structure is considerably lower than that calculated for the yeast synthase; this supports a trend recently observed for certain other thermophile proteins. The synthase isolated from the thermophilic Bacillus appears to be homogeneous by several criteria, although upon electrophoresis in the native state in polyacrylamide it yields two protein bands that are both enzymatically active. Several kinetic characteristics of this enzyme are also reported.
