首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The occurrence of nucleotides of the 3' side of codons has been determined in highly and weakly expressed genes from Escherichia coli. It was found that the usage of some amino acid codons in highly expressed genes was site specific, depending on the base 3' to the codon. The role of the 3' nucleotide as a modulator of codon translation effectiveness is discussed. The rules of synonymous codon usage in relation to the 3' flanking nucleotide have been established for highly expressed genes. For example, if a triplet next to the lysine codon starts with guanosine, lysine is preferably encoded by AAA and not by AAG (P less than 10(-8), while of cytidine is 3' to the lysine codon, AAG is preferred over AAA (P less than 0.001). These rules are observed in highly and absent in weakly expressed mRNAs and can be used in the chemical synthesis of genes designed for expression in E. coli.  相似文献   

2.
The constraints on nucleotide sequences of highly and weakly expressed genes from Escherichia coli have been analysed and compared. Differences in synonymous codon spectra in highly and weakly expressed genes lead to different frequencies of nucleotides (in the first and third codon positions) and dinucleotides in the two groups of genes. It has been found that the choice of synonymous codons in highly expressed genes depends on the nucleotides adjacent to the codon. For example, lysine is preferably encoded by the AAA codon if guanosine is 3' to the lysine codon (AAA-G, P less than 10(-9)). And, on the contrary, AAG is used more often than AAA (P less than 0.001) if cytidine is 3' adjacent to lysine. Guanosine occurs more frequently than adenosine 5' to all the lysine codons (AAR, P less than 10(-5), i.e. NNG codons are preferred over the synonymous NNA codons 5' to the positions of lysine in the genes. The context effect was observed in nonsense and missense suppression experiments. Therefore, a hypothesis has been suggested that the efficiency of translation of some codons (for which the constraints on the adjacent nucleotides were found) can be modulated by the codon context. The rules for preferable synonymous codon choice in highly expressed genes depending on the nucleotides surrounding the codon are presented. These rules can be used in the chemical synthesis of genes designed for expression in E. coli.  相似文献   

3.
4.
A novel bias in codon third-letter usage was found in Escherichia coli genes with low fractions of "optimal codons", by comparing intact sequences with control random sequences. Third-letter usage has been found to be biased according to preference in codon usage and to doublet preference from the following first letter. The present study examines third-letter usage in the context of the nucleotide sequence when these preferences are considered. In order to exclude any influence by these factors, the random sequences were generated such that the amino acid sequence, codon usage, and the doublet frequency in each gene were all preserved. Comparison of intact sequences with these randomly generated sequences reveals that third letters of codons show a strong preference for the purine/pyrimidine pattern of the next codons: purine (R) is preferred to pyrimidine (Y) at the third site when followed by an R-Y-R codon, and pyrimidine is preferred when followed by an R-R-Y, an R-Y-Y or a Y-R-Y codon. This bias is probably related to interactions of tRNA molecules in the ribosome.  相似文献   

5.
H Grosjean  W Fiers 《Gene》1982,18(3):199-209
By considering the nucleotide sequence of several highly expressed coding regions in bacteriophage MS2 and mRNAs from Escherichia coli, it is possible to deduce some rules which govern the selection of the most appropriate synonymous codons NNU or NNC read by tRNAs having GNN, QNN or INN as anticodon. The rules fit with the general hypothesis that an efficient in-phase translation is facilitated by proper choice of degenerate codewords promoting a codon-anticodon interaction with intermediate strength (optimal energy) over those with very strong or very weak interaction energy. Moreover, codons corresponding to minor tRNAs are clearly avoided in these efficiently expressed genes. These correlations are clearcut in the normal reading frame but not in the corresponding frameshift sequences +1 and +2. We hypothesize that both the optimization of codon-anticodon interaction energy and the adaptation of the population to codon frequency or vice versa in highly expressed mRNAs of E. coli are part of a strategy that optimizes the efficiency of translation. Conversely, codon usage in weakly expressed genes such as repressor genes follows exactly the opposite rules. It may be concluded that, in addition to the need for coding an amino acid sequence, the energetic consideration for codon-anticodon pairing, as well as the adaptation of codons to the tRNA population, may have been important evolutionary constraints on the selection of the optimal nucleotide sequence.  相似文献   

6.
E P Rocha  A Danchin    A Viari 《Nucleic acids research》1999,27(17):3567-3576
We analysed the Bacillus subtilis protein coding sequences termini, and compared it to other genomes. The analysis focused on signals, com-positional biases of nucleotides, oligonucleotides, codons and amino acids and mRNA secondary structure. AUG is the preferred start codon in all genomes, independent of their G+C content, and seems to induce less stable mRNA structures. However, it is not conserved between homologous genes neither is it preferred in highly expressed genes. In B.subtilis the ribosome binding site is very strong. We found that downstream boxes do not seem to exist either in Escherichia coli or in B.subtilis. UAA stop codon usage is correlated with the G+C content and is strongly selected in highly expressed genes. We found less stable mRNA structures at both termini, which we related to mRNA-ribosome and mRNA-release-factor interactions. This pattern seems to impose a peculiar A-rich nucleotide and codon usage bias in these regions. Finally the analysis of all proteins from B.subtilis revealed a similar amino acid bias near both termini of proteins consisting of over-representation of hydrophilic residues. This bias near the stop codon is partially release-factor specific.  相似文献   

7.
The nucleotide sequence of the rat cytoplasmic beta-actin gene.   总被引:120,自引:23,他引:97       下载免费PDF全文
U Nudel  R Zakut  M Shani  S Neuman  Z Levy    D Yaffe 《Nucleic acids research》1983,11(6):1759-1771
The nucleotide sequence of the rat beta-actin gene was determined. The gene codes for a protein identical to the bovine beta-actin. It has a large intron in the 5' untranslated region 6 nucleotides upstream from the initiator ATG, and 4 introns in the coding region at codons specifying amino acids 41/42, 121/122, 267, and 327/328. Unlike the skeletal muscle actin gene and many other actin genes, the beta-actin gene lacks the codon for Cys between the initiator ATG and the codon for the N-terminal amino acid of the mature protein. The usage of synonymous codons in the beta-actin gene is nonrandom, and is similar to that in the rat skeletal muscle and other vertebrate actin genes, but differs from the codon usage in yeast and soybean actin genes.  相似文献   

8.
Codon usage in Pseudomonas aeruginosa.   总被引:83,自引:2,他引:81       下载免费PDF全文
We have generated a codon usage table for Pseudomonas aeruginosa. Codon usage in P. aeruginosa is extremely biased. In contrast to E. coli and yeast, P. aeruginosa preferentially uses those codons within a synonymous codon group with the strongest predicted codon-anticodon interaction. We were unable to correlate a particular codon usage pattern with predicted levels of mRNA expressivity. The choice of a third base reflects the high guanine plus cytosine content of the P. aeruginosa genome (67.2%) and cytosine is the preferred nucleotide for the third codon position.  相似文献   

9.
We examined the codon usages in well-conserved and less-well-conserved regions of vertebrate protein genes and found them to be similar. Despite this similarity, there is a statistically significant decrease in codon bias in the less-well-conserved regions. Our analysis suggests that although those codon changes initially fixed under amino acid replacements tend to follow the overall codon usage pattern, they also reduce the bias in codon usage. This decrease in codon bias leads one to predict that the rate of change of synonymous codons should be greater in those regions that are less well conserved at the amino acid level than in the better-conserved regions. Our analysis supports this prediction. Furthermore, we demonstrate a significantly elevated rate of change of synonymous codons among the adjacent codons 5' to amino acid replacement positions. This provides further support for the idea that there are contextual constraints on the choice of synonymous codons in eukaryotes.  相似文献   

10.
The nucleotide frequencies 5' and 3' to the sense codons in highly and weakly expressed genes have been investigated by the chi-squares method. A comparison between the experimental and computer-generated random nucleotide sequences (in which each codon is substituted by a random synonymous one) was made. It was shown that the choice of a particular codon among the synonymous ones in a given position of the gene depends on the three nucleotides 3' and 5' adjacent to the codon in highly expressed genes (the triplet 3' and a single nucleotide 5' to the codons in weakly expressed genes). Concrete patterns for the preferable choice of synonymous codons depending on their contexts are presented. It is suggested that these constraints are related to the efficiency of messenger translation. The constraints on the amino acid sequences of encoded proteins also lead to statistically significant bases in nucleotide frequencies around the sense codons. The biological role of these constraints is discussed.  相似文献   

11.
We present the nucleotide sequence of the tolC gene of Escherichia coli K12, and the amino acid sequence of the TolC protein (an outer membrane protein) as deduced from it. The mature TolC protein comprises 467 amino acid residues, and, as previously reported (1), a signal sequence of 22 amino acid residues is attached to the N-terminus. The C-terminus of the gene is followed by a stem-loop structure (8 base pair stem, 4 base loop) which may be a rho-independent termination signal. The codon usage of the gene is nonrandom; the major isoaccepting species of tRNA are preferentially utilised, or, among synonomous codons recognized by the same tRNA, those codons are used which can interact better with the anticodon (2,3). In contrast to the codon usage for other outer membrane proteins of E. coli (4) the rare arginine codons AGA and AGG are used once and twice respectively.  相似文献   

12.
A Muto  Y Kawauchi  F Yamao    S Osawa 《Nucleic acids research》1984,12(21):8209-8217
The nucleotide sequence of the 1.3 kilobase-pair DNA segment, which contains the genes for ribosomal proteins S8 and L6, and a part of L18 of Mycoplasma capricolum, has been determined and compared with the corresponding sequence in Escherichia coli (Cerretti et al., Nucl. Acids Res. 11, 2599, 1983). Identities of the predicted amino acid sequences of S8 and L6 between the two organisms are 54% and 42%, respectively. The A + T content of the M. capricolum genes is 71%, which is much higher than that of E. coli (49%). Comparisons of codon usage between the two organisms have revealed that M. capricolum preferentially uses A- and U-rich codons. More than 90% of the codon third positions and 57% of the first positions in M. capricolum is either A or U, whereas E. coli uses A or U for the third and the first positions at a frequency of 51% and 36%, respectively. The biased choice of the A- and U-rich codons in this organism has been also observed in the codon replacements for conservative amino acid substitutions between M. capricolum and E. coli. These facts suggest that the codon usage of M. capricolum is strongly influenced by the high A + T content of the genome.  相似文献   

13.
The 5' context of 671 Escherichia coli stop codons UGA and UAA has been compared with the context of stop-like codons (UAC, UAU and CAA for UAA; UGG, UGC, UGU and CGA for UGA). We have observed highly significant deviations from the expected nucleotide distribution: adenine is over-represented whereas pyrimidines are under-represented in position -2 upstream from UAA. Uridine is over-represented in position -3 upstream from UGA. Lysine codons are preferable immediately prior to UAA. A complete set of codons for serine and the phenylalanine UUC codon are preferable immediately 5' to UGA. This non-random codon distribution before stop codons could be considered as a molecular device for modulation of translation termination. We have found that certain fragment of E. coli release factor 2 (RF2) (amino acids 93-114) is similar to the amino acid sequences of seryl-tRNA synthetase (positions 10-19 and 80-93) and of beta (small) subunit (positions 72-94) of phenylalanyl-tRNA synthetase from E. coli. Three-dimensional structure of E. coli seryl-tRNA synthetase is known [1]: Its N-terminus represents an antiparallel alpha-helical coiled-coil domain and contains a region homologous to RF2. On the basis of the above-mentioned results we assume that a specific interaction between RF2 and the last peptidyl-tRNA(Ser/Phe) occurs during polypeptide chain termination in prokaryotic ribosomes.  相似文献   

14.
15.
This study aimed at measuring the nucleotide non-randomness in the region downstream of start codons in bacterial genes and to see if the non-randomness differs between biased and unbiased genes, in terms of the effective number of codons (Nc) and the codon adaptation index (CAI). In Escherichia coli, there was a marked elevation in nucleotide conservation for the genes having low Nc-values compared to the genes having high Nc-values, i.e the more biased genes showed a higher level of non-randomness. Likewise, the genes displaying high CAI-values showed stronger nucleotide conservation than the genes of low CAI-values. This elevated conservation is visible up to approximately 15-17 nucleotides downstream of the start codon, after which there is little difference. This indicates that there may be distinct selectional mechanisms acting upon the first 5-6 codons within genes in E. coli. In B. subtilis, these effects are less pronounced, if present at all. Furthermore, analyses of codons used in this region were not in support of the hypothesis that the elevation in nucleotide non-randomness is a question of selection for certain optimal codons.  相似文献   

16.
The relative quantities of 26 known transfer RNAs of Escherichia coli have been measured previously (Ikemura, 1981). Based on this relative abundance, the usage of cognate codons in E. coli genes as well as in transposon and coliphage genes was examined. A strong positive correlation between tRNA content and the occurrence of respective codons was found for most E. coli genes that had been sequenced, although the correlation was less significant for transposon and phage genes. The dependence of the usage of isoaccepting tRNA, in E. coli genes encoding abundant proteins, on tRNA content was especially noticeable and was greater than that expected from the proportional relationship between the two variables, i.e. these genes selectively use codons corresponding to major tRNAs but almost completely avoid using codons of minor tRNAs. Therefore, codon choice in E. coli genes was considered to be largely constrained by tRNA availability and possibly by translational efficiency. Based on the content of isoaccepting tRNA and the nature of codon-anticodon interaction, it was then possible to predict for most amino acids the order of preference among synonymous codons. The synonymous codon predicted in this way to be the most preferred codon was thought to be optimized for the E. coli translational system and designated as the “Optimal codon”. E. coli genes encoding abundant protein species use the optimal codons selectively, and other E. coli genes, such as amino acid synthesizing genes, use optimal and “non-optimal” codons to a roughly equal degree. The finding that the frequency of usage of optimal codons is closely correlated with the production levels of individual genes was discussed from an evolutionary viewpoint.  相似文献   

17.
Selection Intensity for Codon Bias   总被引:26,自引:7,他引:19       下载免费PDF全文
D. L. Hartl  E. N. Moriyama    S. A. Sawyer 《Genetics》1994,138(1):227-234
The patterns of nonrandom usage of synonymous codons (codon bias) in enteric bacteria were analyzed. Poisson random field (PRF) theory was used to derive the expected distribution of frequencies of nucleotides differing from the ancestral state at aligned sites in a set of DNA sequences. This distribution was applied to synonymous nucleotide polymorphisms and amino acid polymorphisms in the gnd and putP genes of Escherichia coli. For the gnd gene, the average intensity of selection against disfavored synonymous codons was estimated as approximately 7.3 X 10(-9); this value is significantly smaller than the estimated selection intensity against selectively disfavored amino acids in observed polymorphisms (2.0 X 10(-8)), but it is approximately of the same order of magnitude. The selection coefficients for optimal synonymous codons estimated from PRF theory were consistent with independent estimates based on codon usage for threonine and glycine. Across 118 genes in E. coli and Salmonella typhimurium, the distribution of estimated selection coefficients, expressed as multiples of the effective population size, has a mean and standard deviation of 0.5 +/- 0.4. No significant differences were found in the degree of codon bias between conserved positions and replacement positions, suggesting that translational misincorporation is not an important selective constraint among synonymous polymorphic codons in enteric bacteria. However, across the first 100 codons of the genes, conserved amino acids with identical codons have significantly greater codon bias than of either synonymous or nonidentical codons, suggesting that there are unique selective constraints, perhaps including mRNA secondary structures, in this part of the coding region.  相似文献   

18.
同义密码子用语的位置依赖   总被引:4,自引:0,他引:4  
研究了在大肠杆菌编码区不同位置上的同底密码子用语,发现许多氨基酸的密码子用语在转译起始区有显著的变化,仅有少数氨基酸在转译区有较弱的变化,由于密码子用语与基因表达关系密切。这些结果与实验发现的编码区5‘端密码子用对表达的重要性是一致的。更进一步的结果还暗示了哪些密码子在特定位置的使用可能会影响基因表达。  相似文献   

19.
Strongly biased codon usage is common in unicellular organisms, particularly in highly expressed genes. The bias is most simply explained as a balance between selection and mutation, with selection favouring those codons which are more efficiently translated. In a review Ikemura (1985) has proposed four rules for predicting which codons will be preferred, based on the properties of the transfer RNAs responsible for translating messenger RNA into protein. In this paper codon usage in E. coli and yeast is re-examined using the recent compilation of Maruyama et al. (1986). The codon adaptation index of Sharp and Li (1986a) is used as a measure of gene expression to investigate the importance of this factor. It is found that Ikemura's rules successfully predict preferred codons for yeast, but that two of them work less well for E. coli, and it is suggested that some of the apparent bias in weakly expressed genes of E. coli may be due to contextual effects on mutation rates.  相似文献   

20.
The signal for the termination of protein synthesis in procaryotes.   总被引:24,自引:14,他引:10       下载免费PDF全文
The sequences around the stop codons of 862 Escherichia coli genes have been analysed to identify any additional features which contribute to the signal for the termination of protein synthesis. Highly significant deviations from the expected nucleotide distribution were observed, both before and after the stop codon. Immediately prior to UAA stop codons in E. coli there is a preference for codons of the form NAR (any base, adenine, purine), and in particular those that code for glutamine or the basic amino acids. In contrast, codons for threonine or branched nonpolar amino acids were under-represented. Uridine was over-represented in the nucleotide position immediately following all three stop codons, whereas adenine and cytosine were under-represented. This pattern is accentuated in highly expressed genes, but is not as marked in either lowly expressed genes or those that terminate in UAG, the codon specifically recognised by polypeptide chain release factor-1. These observations suggest that for the efficient termination of protein synthesis in E. coli, the 'stop signal' may be a tetranucleotide, rather than simply a tri-nucleotide codon, and that polypeptide chain release factor-2 recognises this extended signal. The sequence following stop codons was analysed in genes from several other procaryotes and bacteriophages. Salmonella typhimurium, Bacillus subtilis, bacteriophages and the methanogenic archaebacteria showed a similar bias to E. coli.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号