首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Singer GA  Hickey DA 《Gene》2003,317(1-2):39-47
A number of recent studies have shown that thermophilic prokaryotes have distinguishable patterns of both synonymous codon usage and amino acid composition, indicating the action of natural selection related to thermophily. On the other hand, several other studies of whole genomes have illustrated that nucleotide bias can have dramatic effects on synonymous codon usage and also on the amino acid composition of the encoded proteins. This raises the possibility that the thermophile-specific patterns observed at both the codon and protein levels are merely reflections of a single underlying effect at the level of nucleotide composition. Moreover, such an effect at the nucleotide level might be due entirely to mutational bias. In this study, we have compared the genomes of thermophiles and mesophiles at three levels: nucleotide content, codon usage and amino acid composition. Our results indicate that the genomes of thermophiles are distinguishable from mesophiles at all three levels and that the codon and amino acid frequency differences cannot be explained simply by the patterns of nucleotide composition. At the nucleotide level, we see a consistent tendency for the frequency of adenine to increase at all codon positions within the thermophiles. Thermophiles are also distinguished by their pattern of synonymous codon usage for several amino acids, particularly arginine and isoleucine. At the protein level, the most dramatic effect is a two-fold decrease in the frequency of glutamine residues among thermophiles. These results indicate that adaptation to growth at high temperature requires a coordinated set of evolutionary changes affecting (i) mRNA thermostability, (ii) stability of codon-anticodon interactions and (iii) increased thermostability of the protein products. We conclude that elevated growth temperature imposes selective constraints at all three molecular levels: nucleotide content, codon usage and amino acid composition. In addition to these multiple selective effects, however, the genomes of both thermophiles and mesophiles are often subject to superimposed large changes in composition due to mutational bias.  相似文献   

2.
We studied the evolution of thermophily in prokaryotes using the phylogenetic relationships between 279 bacteria and archaea and their thermophilic amino acid composition signature. Our findings suggest several examples in which the capacity of thermophilic adaptation has been gained or lost over relatively short evolutionary periods throughout the evolution of prokaryotes.  相似文献   

3.
Basak S  Roy S  Ghosh TC 《FEBS letters》2007,581(30):5825-5830
Synonymous codon usage analysis between thermophilic and mesophilic prokaryotes has gained wide attention in recent years. Although it is known that thermophilic and mesophilic prokaryotes use different subset of synonymous codons, no reason for this difference is known so far. In the present communication, by analyzing a large number of thermophilic and mesophilic prokaryotes, we provide evidence that bias in the selection of synonymous codons between thermophilic and mesophilic prokaryotes is related to differential folding pattern of mRNA secondary structures. Moreover, we observe that error-minimizing property has significant influence in differentiating the synonymous codon usage between thermophilic and mesophilic prokaryotes. Biological implications of these results are discussed.  相似文献   

4.
Zhou XX  Wang YB  Pan YJ  Li WF 《Amino acids》2008,34(1):25-33
Summary. Thermophilic proteins show substantially higher intrinsic thermal stability than their mesophilic counterparts. Amino acid composition is believed to alter the intrinsic stability of proteins. Several investigations and mutagenesis experiment have been carried out to understand the amino acid composition for the thermostability of proteins. This review presents some generalized features of amino acid composition found in thermophilic proteins, including an increase in residue hydrophobicity, a decrease in uncharged polar residues, an increase in charged residues, an increase in aromatic residues, certain amino acid coupling patterns and amino acid preferences for thermophilic proteins. The differences of amino acids composition between thermophilic and mesophilic proteins are related to some properties of amino acids. These features provide guidelines for engineering mesophilic protein to thermophilic protein. Authors’ addresses: Yuan-Jiang Pan, Institute of Chemical Biology and Pharmaceutical Chemistry, Zhejiang University, Zhejiang University Road 38, Hangzhou 310027, China; Wei-Fen Li, Microbiology Division, College of Animal Science, Zhejiang University, Hangzhou 310029, China  相似文献   

5.
Thermophily is thought to be a primitive trait, characteristic of early forms of life on Earth, that has been gradually lost over evolutionary time. The genus Bacillus provides an ideal model for studying the evolution of thermophily as it is an ancient taxon and its contemporary species inhabit a range of thermal environments. The thermostability of reconstructed ancestral proteins has been used as a proxy for ancient thermal adaptation. The reconstruction of ancestral "enzymes" has the added advantages of demonstrable activity, which acts as an internal control for accurate inference, and providing insights into the evolution of enzymatic catalysis. Here, we report the reconstruction of the structurally complex core metabolic enzyme LeuB (3-isopropylmalate dehydrogenase, E. C. 1.1.1.85) from the last common ancestor (LCA) of Bacillus using both maximum likelihood (ML) and Bayesian inference. ML LeuB from the LCA of Bacillus shares only 76% sequence identity with its closest contemporary homolog, yet it is fully functional, thermophilic, and exhibits high values for k(cat), k(cat)/K(M), and ΔG(?) for unfolding. The Bayesian version of this enzyme is also thermophilic but exhibits anomalous catalytic kinetics. We have determined the 3D structure of the ML enzyme and found that it is more closely aligned with LeuB from deeply branching bacteria, such as Thermotoga maritima, than contemporary Bacillus species. To investigate the evolution of thermophily, three descendents of LeuB from the LCA of Bacillus were also reconstructed. They reveal a fluctuating trend in thermal evolution, with a temporal adaptation toward mesophily followed by a more recent return to thermophily. Structural analysis suggests that the determinants of thermophily in LeuB from the LCA of Bacillus and the most recent ancestor are distinct and that thermophily has arisen in this genus at least twice via independent evolutionary paths. Our results add significant fluctuations to the broad trend in thermal adaptation previously proposed and demonstrate that thermophily is not exclusively a primitive trait, as it can be readily gained as well as lost. Our findings also demonstrate that reconstruction of complex functional Precambrian enzymes is possible and can provide empirical access to the evolution of ancient phenotypes and metabolisms.  相似文献   

6.
Summary The maintenance of a proper distribution of charged amino acid residues might be expected to be an important factor in protein evolution. We therefore compared the inferred changes in charge during the evolution of 43 protein families with the changes expected on the basis of random base substitutions. It was found that certain proteins, like the eye lens crystallins and most histones, display an extreme avoidance of changes in charge. Other proteins, like phospholipase A2 and ferredoxin, apparently have sustained more charged replacements than expected, suggesting a positive selection for changes in charge. Depending on function and structure of a protein, charged residues apparently can be important targets for selective forces in protein evolution. It appears that actual biased codon usage tends to decrease the proportion of charged amino acid replacements. The influence of nonrandomness of mutations is more equivocal. Genes that use the mitochondrial instead of the universal code lower the probability that charge changes will occur in the encoded proteins.  相似文献   

7.
Starting from two datasets of codon usage in coding sequences from mesophilic and thermophilic bacteria, we used internal correspondence analysis to study the variability of codon usage within and between species, and within and between amino acids. The first dataset included 18,958,458 codons from 58,482 coding sequences from completely sequenced genomes of 25 species, along with 6,793,581 dinucleotides from 21,876 intergenic spaces. The second dataset, with partially sequenced genomes, included 97,095,873 codons from 293 bacterial species. Results were consistent between the two datasets. The trend for the amino-acid composition of thermophilic proteins was found to be under the control of a pressure at the nucleic acid level, not a selection at the protein level. This effect was not present in intergenic spaces, ruling out a pressure at the DNA level. The pattern at the mRNA level was more complex than a simple purine enrichment of the sense strand of coding sequences. Outliers in the partial genome dataset introduced a note of caution about the interpretation of temperature as the direct determinant of the trend observed in thermophiles. The surprising lack of selection on the amino-acid content of thermophilic proteins suggests that the amino-acid repertoire was set up in a hot environment.  相似文献   

8.
It is important and meaningful to understand the codon usage pattern and the factors that shape codon usage of maize. In this study, trends in synonymous codon usage in maize have been firstly examined through the multivariate statistical analysis on 7402 cDNA sequences. The results showed that the genes positions on the primary axis were strongly negatively correlated with GC3s, GC content of individual gene and gene expression level assessed by the codon adaptation index (CAI) values, which indicated that nucleotide composition and gene expression level were the main factors in shaping the codon usage of maize, and the variation in codon usage among genes may be due to mutational bias at the DNA level and natural selection acting at the level of mRNA translation. At the same time, CDS length and the hydrophobicity of each protein were, respectively, significantly correlated with the genes locations on the primary axis, GC3s and CAI values. We infer that genes length and the hydrophobicity of the encoded protein may play minor role in shaping codon usage bias. Additional 28 codons ending with a G or C base have been defined as “optimal codons”, which may provide useful information for maize gene-transformation and gene prediction.  相似文献   

9.
Miller SR 《Molecular ecology》2003,12(5):1237-1246
Determining the molecular basis of enzyme adaptation is central to understanding the evolution of environmental tolerance but is complicated by the fact that not all amino acid differences between ecologically divergent taxa are adaptive. Analysing patterns of nucleotide sequence evolution can potentially guide the investigation of protein adaptation by identifying candidate codon sites on which diversifying selection has been operating. Here, I test whether there is evidence for molecular adaptation of the carbon fixation gene rbcL for a clade of hot spring cyanobacteria in the genus Synechococcus that has diverged in thermotolerance. Amino acid replacements during Synechococcus radiation have resulted in an increase in the number of hydrophobic residues in the RbcLs of more thermotolerant strains. A similar increase in hydrophobicity has been observed for many thermostable proteins. Maximum likelihood models which allow for heterogeneity among codon sites in the ratio of nonsynonymous to synonymous nucleotide substitutions estimated a class of amino acid sites as a target of positive selection. Depending on the model, a single amino acid site that interacts with a flexible element involved in the opening and closing of the active site was estimated with either low or moderate support to be a member of this class. Site-directed mutagenesis approaches are being explored in order to directly test its adaptive significance.  相似文献   

10.
Across all kingdoms of biological life, protein-coding genes exhibit unequal usage of synonymous codons. Although alternative theories abound, translational selection has been accepted as an important mechanism that shapes the patterns of codon usage in prokaryotes and simple eukaryotes. Here we analyze patterns of codon usage across 74 diverse bacteriophages that infect E. coli, P. aeruginosa, and L. lactis as their primary host. We use the concept of a “genome landscape,” which helps reveal non-trivial, long-range patterns in codon usage across a genome. We develop a series of randomization tests that allow us to interrogate the significance of one aspect of codon usage, such as GC content, while controlling for another aspect, such as adaptation to host-preferred codons. We find that 33 phage genomes exhibit highly non-random patterns in their GC3-content, use of host-preferred codons, or both. We show that the head and tail proteins of these phages exhibit significant bias towards host-preferred codons, relative to the non-structural phage proteins. Our results support the hypothesis of translational selection on viral genes for host-preferred codons, over a broad range of bacteriophages.  相似文献   

11.
Evolutionary traces of thermophilic adaptation are manifest, on the whole-genome level, in compositional biases toward certain types of amino acids. However, it is sometimes difficult to discern their causes without a clear understanding of underlying physical mechanisms of thermal stabilization of proteins. For example, it is well-known that hyperthermophiles feature a greater proportion of charged residues, but, surprisingly, the excess of positively charged residues is almost entirely due to lysines but not arginines in the majority of hyperthermophilic genomes. All-atom simulations show that lysines have a much greater number of accessible rotamers than arginines of similar degree of burial in folded states of proteins. This finding suggests that lysines would preferentially entropically stabilize the native state. Indeed, we show in computational experiments that arginine-to-lysine amino acid substitutions result in noticeable stabilization of proteins. We then hypothesize that if evolution uses this physical mechanism as a complement to electrostatic stabilization in its strategies of thermophilic adaptation, then hyperthermostable organisms would have much greater content of lysines in their proteomes than comparably sized and similarly charged arginines. Consistent with that, high-throughput comparative analysis of complete proteomes shows extremely strong bias toward arginine-to-lysine replacement in hyperthermophilic organisms and overall much greater content of lysines than arginines in hyperthermophiles. This finding cannot be explained by genomic GC compositional biases or by the universal trend of amino acid gain and loss in protein evolution. We discovered here a novel entropic mechanism of protein thermostability due to residual dynamics of rotamer isomerization in native state and demonstrated its immediate proteomic implications. Our study provides an example of how analysis of a fundamental physical mechanism of thermostability helps to resolve a puzzle in comparative genomics as to why amino acid compositions of hyperthermophilic proteomes are significantly biased toward lysines but not similarly charged arginines.  相似文献   

12.
Due to their inherent stability, thermophilic bacteria and archaea serve as important resources for biochemical and biophysical analyses of many biological processes. Unfortunately, scientists characterizing eukaryote-specific processes, such as nuclear pre-mRNA splicing, are unable to take advantage of these sources of thermostable proteins. To identify and provide a source of thermostable eukaryotic proteins, we are characterizing splicing factors in the thermotolerant deep-sea vent polychaete, Alvinella pompejana. This worm, also known as the Pompeii worm, is found in the extreme environment of deep-sea hydrothermal vents, and is one of the most thermotolerant eukaryotic organisms known. We report on detailed analyses of U2AF65, the large subunit of the U2 small nuclear ribonucleoprotein auxiliary factor, an essential splicing factor important for intron definition and alternative splicing. The cloning and characterization of Pompeii U2AF65 show it is highly similar to human U2AF65 in sequence and function and is more thermostable than the human protein when bound to RNA in vitro. Notably, Pompeii U2AF65 can restore splicing in a human extract depleted of human U2AF. We also determine that the general splicing mechanisms and signal sequences are conserved in the Pompeii worm, an annelid which has previously been uncharacterized in terms of splicing factors and signals.  相似文献   

13.
Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome—between 5% and 33%, depending on genome size—while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl–tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.  相似文献   

14.
Serine hydroxymethyltransferase (SHMT) catalyzes the reversible cleavage of serine to form glycine and monocarbonic groups, essential in several biosynthetic pathways. The availability of crystallographic structures of SHMT from mesophilic organisms and information produced by the genomic projects prompted the analysis of the adaptation of SHMT to "extreme" environments, such as high temperatures, by exploitation of structural data from thermophilic organisms. The sequences of 10 thermophilic/hyperthermophilic SHMTs were multiply aligned to 53 mesophilic homologs and analyzed by a comparative approach, examining the amino acid compositions and preferred residue exchanges between mesophiles and extremophiles. The structural basis of the observed exchanges was further investigated through the application of homology modeling to the 10 extremophilic SHMTs. The results of this study indicate that, in SHMT, thermal stability can be achieved mainly through three strategies: (i) increased number of charged residues at the protein surface; (ii) increased hydrophobicity of the protein core; and (iii) substitution of thermolabile residues exposed to the solvent. Additional features of the archaeal SHMTs, for which no structural data are available yet, were also investigated to explain their quaternary assemblage and the interaction with modified folates.  相似文献   

15.
Asymmetric substitution patterns in the two DNA strands of bacteria   总被引:35,自引:10,他引:25  
  相似文献   

16.
The causes of the variation between genomes in their guanine (G) and cytosine (C) content is one of the central issues in evolutionary genomics. The thermal adaptation hypothesis conjectures that, as G:C pairs in DNA are more thermally stable than adenonine:thymine pairs, high GC content may he a selective response to high temperature. A compilation of data on genomic GC content and optimal growth temperature for numerous prokaryotes failed to demonstrate the predicted correlation. By contrast, the GC content of Structural RNAs is higher at high temperatures. The issue that we address here is whether more freely evolving sites in exons (i.e. codonic third positions) evolve in the same manner as genomic DNA as a whole, Showing no correlated response, or like structural RNAs showing a strong correlation. The latter pattern would provide strong support for the thermal adaptation hypothesis, as the variation in GC content between orthologous genes is typically most profoundly seen at codon third sites (GC3). Simple analysis of completely sequenced prokaryotic genomes shows that GC3, but not genomic GC, is higher on average in thermophilic species. This demonstrates, if nothing else, that the results from the two measures cannot be presumed to be the same. A proper analysis, however, requires phylogenetic control. Here, therefore, we report the results of a comparative analysis of GC composition and optimal growth temperature for over 100 prokaryotes. Comparative analysis fails to show, in either Archea or Eubacteria, any hint of connection between optimal growth temperature and GC content in the genome as a whole, in protein-coding regions or, more crucially at GC. Conversely, comparable analysis confirms that GC content of structural RNA is strongly correlated with optimal temperature. Against the expectations of the thermal adaptation hypothesis, within prokaryotes GC content in protein-coding genies, even at relatively freely evolving sites, cannot be considered an adaptation to the thermal environment.  相似文献   

17.
Lin YS 《Proteins》2008,73(1):53-62
Factors that are related to thermostability of proteins have been extensively studied in recent years, especially by comparing thermophiles and mesophiles. However, most of them are global characters. It is still not clear how to identify specific residues or fragments which may be more relevant to protein thermostability. Moreover, some of the differences among the thermophiles and mesophiles may be due to phylogenetic differences instead of thermal adaptation. To resolve these problems, I adopted a strategy to identify residue substitutions evolved convergently in thermophiles or mesophiles. These residues may therefore be responsible for thermal adaptation. Four classes of genomes were utilized in this study, including thermophilic archaea, mesophilic archaea, thermophilic bacteria, and mesophilic bacteria. For most clusters of orthologous groups (COGs) with sequences from all of these four classes of genomes, I can identify specific residues or fragments that may potentially be responsible for thermal adaptation. Functional or structural constraints (represented as sequence conservation) were suggested to have higher impact on thermal adaptation than secondary structure or solvent accessibility does. I further compared thermophilic archaea and mesophilic bacteria, and found that the most diverged fragments may not necessarily correspond to the thermostability-determining ones. The usual approach to compare thermophiles and mesophiles without considering phylogenetic relationships may roughly identify sequence features contributing to thermostability; however, to specifically identify residue substitutions responsible for thermal adaptation, one should take sequence evolution into consideration.  相似文献   

18.
Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon-anticodon interaction, all consistent with more efficient translation.  相似文献   

19.
Palidwor GA  Perkins TJ  Xia X 《PloS one》2010,5(10):e13431

Background

In spite of extensive research on the effect of mutation and selection on codon usage, a general model of codon usage bias due to mutational bias has been lacking. Because most amino acids allow synonymous GC content changing substitutions in the third codon position, the overall GC bias of a genome or genomic region is highly correlated with GC3, a measure of third position GC content. For individual amino acids as well, G/C ending codons usage generally increases with increasing GC bias and decreases with increasing AT bias. Arginine and leucine, amino acids that allow GC-changing synonymous substitutions in the first and third codon positions, have codons which may be expected to show different usage patterns.

Principal Findings

In analyzing codon usage bias in hundreds of prokaryotic and plant genomes and in human genes, we find that two G-ending codons, AGG (arginine) and TTG (leucine), unlike all other G/C-ending codons, show overall usage that decreases with increasing GC bias, contrary to the usual expectation that G/C-ending codon usage should increase with increasing genomic GC bias. Moreover, the usage of some codons appears nonlinear, even nonmonotone, as a function of GC bias. To explain these observations, we propose a continuous-time Markov chain model of GC-biased synonymous substitution. This model correctly predicts the qualitative usage patterns of all codons, including nonlinear codon usage in isoleucine, arginine and leucine. The model accounts for 72%, 64% and 52% of the observed variability of codon usage in prokaryotes, plants and human respectively. When codons are grouped based on common GC content, 87%, 80% and 68% of the variation in usage is explained for prokaryotes, plants and human respectively.

Conclusions

The model clarifies the sometimes-counterintuitive effects that GC mutational bias can have on codon usage, quantifies the influence of GC mutational bias and provides a natural null model relative to which other influences on codon bias may be measured.  相似文献   

20.
Actin was purified from calf thymus, bovine brain and SV40-transformed mouse 3T3 cells grown in tissue culture. Isoelectric focusing analysis showed the presence of the two actin polypeptides beta and gamma typical for non-muscle actins in all three actins. Tryptic and thermolytic peptides accounting for the complete amino-acid sequence of the cytoplasmic actins were separated and isolated by preparative fingerprint techniques. All peptides were characterized by amino-acid analysis and compared with the corresponding peptides from rabbit skeletal muscle actin. Peptides which differed in amino-acid composition from the corresponding skeletal muscle actin peptides were subjected to sequence analysis in order to localize the amino-acid replacement. The results obtained show that all three mammalian cytoplasmic actins studied contain the same amino-acid exchanges indicating that mammalian cytoplasmic actins are very similar if not identical in amino-acid sequence. The presence of two different isoelectric species beta and gamma in cytoplasmic actins from higher vertebrates is acccounted for by the isolation of two very similar but not identical amino-terminal peptides in all three actin preparations. The nature of the amino-acid replacements in these two peptides not only accounts for the different isoelectric forms but also shows that beta and gamma cytoplasmic actins are the products of two different structural genes expressed in the same cell. The total number of amino-acid replacements so far detected in the comparison of these cytoplasmic actins and skeletal muscle actin is 25 for the beta chain and 24 for the gamma chain. With the exception of the amino-terminal three or four residues, which are responsible for the isoelectric differences, the replacements do not involve charged amino acids. The exchanges are not randomly distributed. No replacements were detected in regions 18--75 and 299--356 while the regions between residues 2--17 and 259--298 show a high number of replacements. In addition documentation for a few minor revisions of the amino acid sequence of rabbit skeletal muscle actin is provided.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号