首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
The relative quantities of 26 known transfer RNAs of Escherichia coli have been measured previously (Ikemura, 1981). Based on this relative abundance, the usage of cognate codons in E. coli genes as well as in transposon and coliphage genes was examined. A strong positive correlation between tRNA content and the occurrence of respective codons was found for most E. coli genes that had been sequenced, although the correlation was less significant for transposon and phage genes. The dependence of the usage of isoaccepting tRNA, in E. coli genes encoding abundant proteins, on tRNA content was especially noticeable and was greater than that expected from the proportional relationship between the two variables, i.e. these genes selectively use codons corresponding to major tRNAs but almost completely avoid using codons of minor tRNAs. Therefore, codon choice in E. coli genes was considered to be largely constrained by tRNA availability and possibly by translational efficiency. Based on the content of isoaccepting tRNA and the nature of codon-anticodon interaction, it was then possible to predict for most amino acids the order of preference among synonymous codons. The synonymous codon predicted in this way to be the most preferred codon was thought to be optimized for the E. coli translational system and designated as the “Optimal codon”. E. coli genes encoding abundant protein species use the optimal codons selectively, and other E. coli genes, such as amino acid synthesizing genes, use optimal and “non-optimal” codons to a roughly equal degree. The finding that the frequency of usage of optimal codons is closely correlated with the production levels of individual genes was discussed from an evolutionary viewpoint.  相似文献   

3.
In a lacZ expression vector (pMC1403Plac), all 64 codons were introduced immediately 3' from the AUG initiation codon. The expression of the second codon variants was measured by immunoprecipitation of the plasmid-coded fusion proteins. A 15-fold difference in expression was found among the codon variants. No distinct correlation could be made with the level of tRNA corresponding to the codons and large differences were observed between synonymous codons that use the same tRNA. Therefore the effect of the second codon is likely to be due to the influence of its composing nucleotides, presumably on the structure of the ribosomal binding site. An analysis of the known sequences of a large number of Escherichia coli genes shows that the use of codons in the second position deviates strongly from the overall codon usage in E. coli. It is proposed that codon selection at the second position is not based on requirements of the gene product (a protein) but is determined by factors governing gene regulation at the initiation step of translation.  相似文献   

4.
5.
Does the 'non-coding' strand code?   总被引:3,自引:2,他引:1       下载免费PDF全文
The hypothesis that DNA strands complementary to the coding strand contain in phase coding sequences has been investigated. Statistical analysis of the 50 genes of bacteriophage T7 shows no significant correlation between patterns of codon usage on the coding and non-coding strands. In Bacillus and yeast genes the correlation observed is not different from that expected with random synonymous codon usage, while a high correlation seen in 52 E. coli genes can be explained in terms of an excess of RNY codons. A deficiency of UUA, CUA and UCA codons (complementary to termination) seems to be restricted to the E. coli genes, and may be due to low abundance of the relevant cognate tRNA species. Thus the analysis shows that the non-coding strand has the properties expected of a sequence complementary to a coding strand, with no indications that it encodes, or may have encoded, proteins.  相似文献   

6.
As shown in the accompanying paper (5), the oligonucleotide composition of the E. coli genome is highly asymmetric for sequences up to 6 bp in length when ranked from highest to lowest abundance. We show here that this largely reflects codon usage because heavily used codons were found in the highly abundant oligomers whereas rarely used codons, with some exceptions, occurred in sequences in low abundance. Furthermore, linear regression analysis revealed a strong correlation between the frequencies of each trinucleotide and its usage as a codon. Dinucleotides are also not randomly distributed across each codon position and the dinucleotide composition of genes that are transcribed but not translated (rRNA and tRNA genes) was highly related to that seen in genes encoding polypeptides. However, 45 tetra-, 8 penta-, and 6 hexanucleotides were significantly over- or underabundant by Markov chain analysis and could not be accounted for by codon usage. Of these underrepresented sequences, many were palindromes, including the Dam methylation site.  相似文献   

7.
It has often been suggested that differential usage of codons recognized by rare tRNA species, i.e. "rare codons", represents an evolutionary strategy to modulate gene expression. In particular, regulatory genes are reported to have an extraordinarily high frequency of rare codons. From E. coli we have compiled codon usage data for highly expressed genes, moderately/lowly expressed genes, and regulatory genes. We have identified a clear and general trend in codon usage bias, from the very high bias seen in very highly expressed genes and attributed to selection, to a rather low bias in other genes which seems to be more influenced by mutation than by selection. There is no clear tendency for an increased frequency of rare codons in the regulatory genes, compared to a large group of other moderately/lowly expressed genes with low codon bias. From this, as well as a consideration of evolutionary rates of regulatory genes, and of experimental data on translation rates, we conclude that the pattern of synonymous codon usage in regulatory genes reflects primarily the relaxation of natural selection.  相似文献   

8.
Proteins from hyperthermophilic microorganisms are attractive candidates for novel biocatalysts because of their high resistance to temperature extremes. However, archaeal genes are usually poorly expressed in Escherichia coli because of differences in codon usage. Genes from the thermoacidophilic archaea Sulfolobus solfataricus and Thermoplasma acidophilum contain high proportions of rare codons for arginine, isoleucine, and leucine, which are recognized by the tRNAs encoded by the argU, ileY, and leuW genes, respectively, and which are rarely used in E. coli. To examine the effects of these rare codons on heterologous expression, we expressed the Sso_gnaD and Tac_gnaD genes from S. solfataricus and T. acidophilum, respectively, in E. coli. The Sso_gnaD product was expressed at very low levels when the open reading frame (ORF) was cloned in pRSET and expressed in E. coli BL21(DE3), and was expressed at much higher levels in the E. coli BL21(DE3)-CodonPlus RIL strain, which contains extra copies of the argU, ileY, and leuW tRNA genes. In contrast, Tac_gnaD was expressed at similar levels in both E. coli strains. Comparison of the Sso_gnaD and Tac_gnaD gene sequences revealed that the 5'-end of the Sso_gnaD sequence was rich in AGA(arg) and ATA(Ile) codons. These codons were replaced with the codons commonly used in E. coli by polymerase chain reaction-mediated site-directed mutagenesis. The results of expression studies showed that a non-tandem repeat of rare codons is critical in the observed interference in heterologous expression of this gene. We concluded that the level of heterologous expression of Sso_gnaD in E. coli was limited by the clustering of the rare codons in the ORF, rather than on the rare codon frequency.  相似文献   

9.
Genetic code redundancy allows most amino acids to be encoded by multiple codons that are non-randomly distributed along coding sequences. An accepted theory explaining the biological significance of such non-uniform codon selection is that codons are translated at different speeds. Thus, varying codon placement along a message may confer variable rates of polypeptide emergence from the ribosome, which may influence the capacity to fold toward the native state. Previous studies report conflicting results regarding whether certain codons correlate with particular structural or folding properties of the encoded protein. This is partly due to different criteria traditionally utilized for predicting translation speeds of codons, including their usage frequencies and the concentration of tRNA species capable of decoding them, which do not always correlate. Here, we developed a metric to predict organism-specific relative translation rates of codons based on the availability of tRNA decoding mechanisms: Watson-Crick, non-Watson-Crick or both types of interactions. We determine translation rates of messages by pulse-chase analyses in living Escherichia coli cells and show that sequence engineering based on these concepts predictably modulates translation rates in a manner that is superior to codon usage frequency, which occur during the elongation phase, and significantly impacts folding of the encoded polypeptide. Finally, we demonstrate that sequence harmonization based on expression host tRNA pools, designed to mimic ribosome movement of the original organism, can significantly increase the folding of the encoded polypeptide. These results illuminate how genetic code degeneracy may function to specify properties beyond amino acid encoding, including folding.  相似文献   

10.
X Xia 《Genetics》1998,149(1):37-44
The optimization of the translational machinery in cells requires the mutual adaptation of codon usage and tRNA concentration, and the adaptation of tRNA concentration to amino acid usage. Two predictions were derived based on a simple deterministic model of translation which assumes that elongation of the peptide chain is rate-limiting. The highest translational efficiency is achieved when the codon recognized by the most abundant tRNA reaches the maximum frequency. For each codon family, the tRNA concentration is optimally adapted to codon usage when the concentration of different tRNA species matches the square-root of the frequency of their corresponding synonymous codons. When tRNA concentration and codon usage are well adapted to each other, the optimal content of all tRNA species carrying the same amino acid should match the square-root of the frequency of the amino acid. These predictions are examined against empirical data from Escherichia coli, Salmonella typhimurium, and Saccharomyces cerevisiae.  相似文献   

11.
Insects, the most biodiverse taxonomic group, have high AT content in their mitochondrial genomes. Although codon usage tends to be AT-rich, base composition and codon usage of mitochondrial genomes may vary among taxa. Thus, we compare base composition and codon usage patterns of 49 insect mitochondrial genomes. For protein coding genes, AT content is as high as 80% in the Hymenoptera and Lepidoptera and as low as 72% in the Orthopotera. The AT content is high at positions 1 and 3, but A content is low at position 2. A close correlation occurs between codon usage and tRNA abundance in nuclear genomes. Optimal codons can pair well with the antr codons of the most abundant tRNAs. One tRNA gene translates a synonymous codon family in vertebrate mitochondrial genomes and these tRNA anticodons can pair with optimal codons. However, optimal codons cannot pair with anticodons in mtDNA ofCochiiomyia hominivorax (Dipteral: CaLliphoridae). Ten optimal codons cannot pair with tRNA anticodons in all 49 insect mitochondrial genomes; non-optimal codon-anticodon usage is common and codon usage is not influenced by tRNA abundance.  相似文献   

12.
基因表达水平与同义密码子使用关系的初步研究   总被引:3,自引:0,他引:3  
提出一个预测基因表达水平和同义密码子使用的自洽信息聚类方法。将同义密码子分成最适密码子、非最适密码子和稀有密码子,认为三者的使用频率是调控基因表达水平的主要因素。基于这一观点,对Ecoli和Yeast两类生物的基因表达水平和密码子的使用,用自洽信息聚类方法进行了预测。发现高低表达基因明显分开,基因表达水平被分为四级;甚高表达基因(VH)、高表达基因(H)、较低表达基因(LM)和低表达基因(LL);  相似文献   

13.
Stenström CM  Jin H  Major LL  Tate WP  Isaksson LA 《Gene》2001,263(1-2):273-284
The codon that follows the AUG initiation triplet (+2 codon) affects gene expression in Escherichia coli. We have extended this analysis using two model genes lacking any apparent Shine-Dalgarno sequence. Depending on the identity of the +2 codon a difference in gene expression up to 20-fold could be obtained. The effects did not correlate with the levels of intracellular pools of cognate tRNA for the +2 codon, with putative secondary mRNA structures, or with mRNA stability. However, most +2 iso-codons that were decoded by the same species of tRNA gave pairwise similar effects, suggesting that the effect on gene expression was associated with the decoding tRNA. High adenine content of the +2 codon was associated with high gene expression. Of the fourteen +2 codons that mediated the highest efficiency, all except two had an adenine as the first base of the codon. Analysis of the 3540 E. coli genes from the TransTerm database revealed that codons associated with high gene expression in the two expression systems are over-represented at the +2 position in natural genes. Codons that are associated with low gene expression are under-represented. The data suggest that evolution has favored codons at the +2 position that give high translation initiation.  相似文献   

14.
15.
Codon usage data of bacteriophage T4 genes were compiled and synonymous codon preferences were investigated in comparison with tRNA availabilities in an infected cell. Since the genome of T4 is highly AT rich and its codon usage pattern is significantly different from that of its host Escherichia coli, certain codons of T4 genes need to be translated by appropriate host transfer RNAs present in minor amounts. To avoid this predicament, T4 phage seems to direct the synthesis of its own tRNA molecules and these phage tRNAs are suggested to supplement the host tRNA population with isoacceptors that are normally present in minor amounts. A positive correlation was found in that the frequency of E. coli optimal codons in T4 genes increases as the number of protein monomers per phage particle increases. A negative correlation was also found between the number of protein monomers per phage and the frequency of "T4 optimal codons", which are defined as those codons that are efficiently recognized by T4 tRNAs. From these observations it was proposed that tRNAs from the host are predominantly used for translation of highly expressed T4 genes while tRNAs from T4 tend to be used for translation of weakly expressed T4 genes. This distinctive tRNA-usage in T4 may be an optimization of translational efficiency, and an adjustment of T4-encoded tRNAs to the synonymous codon preferences, which are largely influenced by the high genomic AT-content, would have occurred during evolution.  相似文献   

16.
17.
18.
Ribosome-mediated translational pause and protein domain organization.   总被引:26,自引:0,他引:26       下载免费PDF全文
Because regions on the messenger ribonucleic acid differ in the rate at which they are translated by the ribosome and because proteins can fold cotranslationally on the ribosome, a question arises as to whether the kinetics of translation influence the folding events in the growing nascent polypeptide chain. Translationally slow regions were identified on mRNAs for a set of 37 multidomain proteins from Escherichia coli with known three-dimensional structures. The frequencies of individual codons in mRNAs of highly expressed genes from E. coli were taken as a measure of codon translation speed. Analysis of codon usage in slow regions showed a consistency with the experimentally determined translation rates of codons; abundant codons that are translated with faster speeds compared with their synonymous codons were found to be avoided; rare codons that are translated at an unexpectedly higher rate were also found to be avoided in slow regions. The statistical significance of the occurrence of such slow regions on mRNA spans corresponding to the oligopeptide domain termini and linking regions on the encoded proteins was assessed. The amino acid type and the solvent accessibility of the residues coded by such slow regions were also examined. The results indicated that protein domain boundaries that mark higher-order structural organization are largely coded by translationally slow regions on the RNA and are composed of such amino acids that are stickier to the ribosome channel through which the synthesized polypeptide chain emerges into the cytoplasm. The translationally slow nucleotide regions on mRNA possess the potential to form hairpin secondary structures and such structures could further slow the movement of ribosome. The results point to an intriguing correlation between protein synthesis machinery and in vivo protein folding. Examination of available mutagenic data indicated that the effects of some of the reported mutations were consistent with our hypothesis.  相似文献   

19.
In all, 238 and 155 transfer (t)RNA genes were predicted from the genomes of Phytophthora sojae and P. ramorum, respectively. After omitting pseudogenes and undetermined types of tRNA genes, there remained 208 P. sojae tRNA genes and 140 P. ramorum tRNA genes. There were 45 types of tRNA genes, with distinct anticodons, in each species. Fourteen common anticodon types of tRNAs are missing altogether from the genome in the two species; however, these appear to be compensated by wobbling of other tRNA anticodons in a manner which is tied to the codon bias in Phytophthora genes. The most abundant tRNA class was arginine in both P. sojae and P. ramorum. A codon usage table was generated for these two organisms from a total of 9,803,525 codons in P. sojae and 7,496,598 codons in P. ramorum. The most abundant codon type detected from the codon usage tables was GAG (encoding glutamic acid), whereas the most numerous tRNA gene had a methionine anticodon (CAT). The correlation between the frequencies of tRNA genes and the codon frequencies in protein-coding genes was very low (0.12 in P. sojae and 0.19 in P. ramorum); however, the correlation between amino acid tRNA gene frequency and the corresponding amino acid codon frequency in P. sojae and P. ramorum was substantially higher (0.53 in P. sojae and 0.77 in P. ramorum). The codon usage frequencies of P. sojae and P ramorum were very strongly correlated (0.99), as were tRNA gene frequencies (0.77). Approximately 60% of orthologous tRNA gene pairs in P sojae and P. ramorum are located in regions that have conserved synteny in the two species.  相似文献   

20.
In Escherichia coli, the isoleucine codon AUA occurs at a frequency of about 0.4% and is the fifth rarest codon in E. coli mRNA. Since there is a correlation between the frequency of codon usage and the level of its cognate tRNA, translational problems might be expected when the mRNA contains high levels of AUA codons. When a hemagglutinin from the influenza virus, a 304-amino-acid protein with 12 (3.9%) AUA codons and 1 tandem codon, and a mupirocin-resistant isoleucyl tRNA synthetase, a 1,024-amino-acid protein, with 33 (3.2%) AUA codons and 2 tandem codons, were expressed in E. coli, product accumulation was highly variable and dependent to some degree on the growth medium. In rich medium, the flu antigen represented about 16% of total cell protein, whereas in minimal medium, it was only 2 to 3% of total cell protein. In the presence of the cloned ileX, which encodes the cognate tRNA for AUA, however, the antigen was 25 to 30% of total cell protein in cells grown in minimal medium. Alternatively, the isoleucyl tRNA synthetase did not accumulate to detectable levels in cells grown in Luria broth unless the ileX tRNA was coexpressed when it accounted for 7 to 9% of total cell protein. These results indicate that the rare isoleucine AUA codon, like the rare arginine codons AGG and AGA, can interfere with the efficient expression of cloned proteins.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号