首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Ribosome-mediated translational pause and protein domain organization.   总被引:26,自引:0,他引:26       下载免费PDF全文
Because regions on the messenger ribonucleic acid differ in the rate at which they are translated by the ribosome and because proteins can fold cotranslationally on the ribosome, a question arises as to whether the kinetics of translation influence the folding events in the growing nascent polypeptide chain. Translationally slow regions were identified on mRNAs for a set of 37 multidomain proteins from Escherichia coli with known three-dimensional structures. The frequencies of individual codons in mRNAs of highly expressed genes from E. coli were taken as a measure of codon translation speed. Analysis of codon usage in slow regions showed a consistency with the experimentally determined translation rates of codons; abundant codons that are translated with faster speeds compared with their synonymous codons were found to be avoided; rare codons that are translated at an unexpectedly higher rate were also found to be avoided in slow regions. The statistical significance of the occurrence of such slow regions on mRNA spans corresponding to the oligopeptide domain termini and linking regions on the encoded proteins was assessed. The amino acid type and the solvent accessibility of the residues coded by such slow regions were also examined. The results indicated that protein domain boundaries that mark higher-order structural organization are largely coded by translationally slow regions on the RNA and are composed of such amino acids that are stickier to the ribosome channel through which the synthesized polypeptide chain emerges into the cytoplasm. The translationally slow nucleotide regions on mRNA possess the potential to form hairpin secondary structures and such structures could further slow the movement of ribosome. The results point to an intriguing correlation between protein synthesis machinery and in vivo protein folding. Examination of available mutagenic data indicated that the effects of some of the reported mutations were consistent with our hypothesis.  相似文献   

2.
Stabilization of secondary structure elements by specific combinations of hydrophobic and hydrophilic amino acids has been studied by the way of analysis of pentapeptide fragments from twelve partial bacterial proteomes. PDB files describing structures of proteins from species with extremely high and low genomic GC-content, as well as with average G + C were included in the study. Amino acid residues in 78,009 pentapeptides from alpha helices, beta strands and coil regions were classified into hydrophobic and hydrophilic ones. The common propensity scale for 32 possible combinations of hydrophobic and hydrophilic amino acid residues in pentapeptide has been created: specific pentapeptides for helix, sheet and coil were described. The usage of pentapeptides preferably forming alpha helices is decreasing in alpha helices of partial bacterial proteomes with the increase of the average genomic GC-content in first and second codon positions. The usage of pentapeptides preferably forming beta strands is increasing in coil regions and in helices of partial bacterial proteomes with the growth of the average genomic GC-content in first and second codon positions. Due to these circumstances the probability of coil-sheet and helix-sheet transitions should be increased in proteins encoded by GC-rich genes making them prone to form amyloid in certain conditions. Possible causes of the described fact that importance of alpha helix and coil stabilization by specific combinations of hydrophobic and hydrophilic amino acids is growing with the decrease of genomic GC-content have been discussed.  相似文献   

3.
H Grosjean  W Fiers 《Gene》1982,18(3):199-209
By considering the nucleotide sequence of several highly expressed coding regions in bacteriophage MS2 and mRNAs from Escherichia coli, it is possible to deduce some rules which govern the selection of the most appropriate synonymous codons NNU or NNC read by tRNAs having GNN, QNN or INN as anticodon. The rules fit with the general hypothesis that an efficient in-phase translation is facilitated by proper choice of degenerate codewords promoting a codon-anticodon interaction with intermediate strength (optimal energy) over those with very strong or very weak interaction energy. Moreover, codons corresponding to minor tRNAs are clearly avoided in these efficiently expressed genes. These correlations are clearcut in the normal reading frame but not in the corresponding frameshift sequences +1 and +2. We hypothesize that both the optimization of codon-anticodon interaction energy and the adaptation of the population to codon frequency or vice versa in highly expressed mRNAs of E. coli are part of a strategy that optimizes the efficiency of translation. Conversely, codon usage in weakly expressed genes such as repressor genes follows exactly the opposite rules. It may be concluded that, in addition to the need for coding an amino acid sequence, the energetic consideration for codon-anticodon pairing, as well as the adaptation of codons to the tRNA population, may have been important evolutionary constraints on the selection of the optimal nucleotide sequence.  相似文献   

4.
Rare codons cluster   总被引:1,自引:0,他引:1  
Clarke TF  Clark PL 《PloS one》2008,3(10):e3412
Most amino acids are encoded by more than one codon. These synonymous codons are not used with equal frequency: in every organism, some codons are used more commonly, while others are more rare. Though the encoded protein sequence is identical, selective pressures favor more common codons for enhanced translation speed and fidelity. However, rare codons persist, presumably due to neutral drift. Here, we determine whether other, unknown factors, beyond neutral drift, affect the selection and/or distribution of rare codons. We have developed a novel algorithm that evaluates the relative rareness of a nucleotide sequence used to produce a given protein sequence. We show that rare codons, rather than being randomly scattered across genes, often occur in large clusters. These clusters occur in numerous eukaryotic and prokaryotic genomes, and are not confined to unusual or rarely expressed genes: many highly expressed genes, including genes for ribosomal proteins, contain rare codon clusters. A rare codon cluster can impede ribosome translation of the rare codon sequence. These results indicate additional selective pressures govern the use of synonymous codons, and specifically that local pauses in translation can be beneficial for protein biogenesis.  相似文献   

5.
Although human DNA polymerase beta (DNA pol beta) shows 96% identity with rat DNA pol beta at the amino acid level, it is weakly expressed in Escherichia (E.) coli relative to the rat enzyme. The mechanism of this suppression was investigated. Pulse-chase protein labeling and steady state mRNA analysis showed that mature human DNA pol beta protein is relatively stable in E. coli and the levels of human and rat DNA pol beta mRNA were comparable indicating that the human DNA pol beta expression is suppressed at the translational level. By systematic expression analysis of a number of chimeric genes composed of human and rat cDNAs, two strong translational suppression regions were mapped in the human DNA pol beta mRNA; one was named TSR-1, corresponding to CGG encoding arginine (arg) at position 4 and the other, termed TSR-2, is located between codons 153 and 199. Since substitution of the rat Arg-4 codon with synonymous codons showed strong effects upon the expression level, we propose that the arg codon at the N-terminal coding region plays a role in modulating expression.  相似文献   

6.
The occurrence of nucleotides of the 3' side of codons has been determined in highly and weakly expressed genes from Escherichia coli. It was found that the usage of some amino acid codons in highly expressed genes was site specific, depending on the base 3' to the codon. The role of the 3' nucleotide as a modulator of codon translation effectiveness is discussed. The rules of synonymous codon usage in relation to the 3' flanking nucleotide have been established for highly expressed genes. For example, if a triplet next to the lysine codon starts with guanosine, lysine is preferably encoded by AAA and not by AAG (P less than 10(-8), while of cytidine is 3' to the lysine codon, AAG is preferred over AAA (P less than 0.001). These rules are observed in highly and absent in weakly expressed mRNAs and can be used in the chemical synthesis of genes designed for expression in E. coli.  相似文献   

7.
从GenBank获得大肠杆菌K-12MG1655株的全基因组序列,计算了与基因密码子偏好性相关的多个参数(Nc、CAI、GC、GC3s),对其mRNA编码区长度、形成二级结构倾向与密码子偏好性之间的关系进行了统计学分析,发现虽然翻译效率(包括翻译速度和翻译精度)是制约大肠杆菌高表达基因的密码子偏好性的主要因素,同时,mRNA编码区长度及其形成二级结构的倾向也是形成这种偏好性的不可忽略的原因,而且对偏好性有一定程度的削弱。另外对mRNA编码区形成二级结构倾向的生物学意义进行了讨论分析。  相似文献   

8.
Biased usage of synonymous codons has been elucidated under the perspective of cellular tRNA abundance for quite a long time now. Taking advantage of publicly available gene expression data for Saccharomyces cerevisiae, a systematic analysis of the codon and amino acid usages in two different coding regions corresponding to the regular (helix and strand) as well as the irregular (coil) protein secondary structures, have been performed. Our analyses suggest that apart from tRNA abundance, mRNA folding stability is another major evolutionary force in shaping the codon and amino acid usage differences between the highly and lowly expressed genes in S. cerevisiae genome and surprisingly it depends on the coding regions corresponding to the secondary structures of the encoded proteins. This is obviously a new paradigm in understanding the codon usage in S. cerevisiae. Differential amino acid usage between highly and lowly expressed genes in the regions coding for the irregular protein secondary structure in S. cerevisiae is expounded by the stability of the mRNA folded structure. Irrespective of the protein secondary structural type, the highly expressed genes always tend to encode cheaper amino acids in order to reduce the overall biosynthetic cost of production of the corresponding protein. This study supports the hypothesis that the tRNA abundance is a consequence of and not a reason for the biased usage of amino acid between highly and lowly expressed genes.  相似文献   

9.
Translation of mRNA into protein is a unidirectional information flow process. Analysing the input (mRNA) and output (protein) of translation, we find that local protein structure information is encoded in the mRNA nucleotide sequence. The Coding Sequence and Structure (CSandS) database developed in this work provides a detailed mapping between over 4000 solved protein structures and their mRNA. CSandS facilitates a comprehensive analysis of codon usage over many organisms. In assigning translation speed, we find that relative codon usage is less informative than tRNA concentration. For all speed measures, no evidence was found that domain boundaries are enriched with slow codons. In fact, genes seemingly avoid slow codons around structurally defined domain boundaries. Translation speed, however, does decrease at the transition into secondary structure. Codons are identified that have structural preferences significantly different from the amino acid they encode. However, each organism has its own set of ‘significant codons’. Our results support the premise that codons encode more information than merely amino acids and give insight into the role of translation in protein folding.  相似文献   

10.
Highly expressed plastid genes display codon adaptation, which is defined as a bias toward a set of codons which are complementary to abundant tRNAs. This type of adaptation is similar to what is observed in highly expressed Escherichia coli genes and is probably the result of selection to increase translation efficiency. In the current work, the codon adaptation of plastid genes is studied with regard to three specific features that have been observed in E. coli and which may influence translation efficiency. These features are (1) a relatively low codon adaptation at the 5′ end of highly expressed genes, (2) an influence of neighboring codons on codon usage at a particular site (codon context), and (3) a correlation between the level of codon adaptation of a gene and its amino acid content. All three features are found in plastid genes. First, highly expressed plastid genes have a noticeable decrease in codon adaptation over the first 10–20 codons. Second, for the twofold degenerate NNY codon groups, highly expressed genes have an overall bias toward the NNC codon, but this is not observed when the 3′ neighboring base is a G. At these sites highly expressed genes are biased toward NNT instead of NNC. Third, plastid genes that have higher codon adaptations also tend to have an increased usage of amino acids with a high G + C content at the first two codon positions and GNN codons in particular. The correlation between codon adaptation and amino acid content exists separately for both cytosolic and membrane proteins and is not related to any obvious functional property. It is suggested that at certain sites selection discriminates between nonsynonymous codons based on translational, not functional, differences, with the result that the amino acid sequence of highly expressed proteins is partially influenced by selection for increased translation efficiency. Received: 21 July 1999 / Accepted: 5 November 1999  相似文献   

11.
The levels of synonymous codon bias is shown to be positively correlated to gene length in Escherichia coli genes which are thought to be expressed at similar levels; these are genes whose products are present in multimeric proteins in equimolar amounts. It is argued that the positive correlation could be caused by selection to avoid missense errors during translation. Since the cost of producing a protein is proportional to its length, selection in favor of codons which increase accuracy should be greater in longer genes, and long genes should therefore have higher synonymous codon bias. It is also shown that there is variation in synonymous codon use which is independent of either expression level, gene length, amino acid composition, or chromosomal location. This variation is consistent with selection for translational accuracy but may have other origins.   相似文献   

12.
In a lacZ expression vector (pMC1403Plac), all 64 codons were introduced immediately 3' from the AUG initiation codon. The expression of the second codon variants was measured by immunoprecipitation of the plasmid-coded fusion proteins. A 15-fold difference in expression was found among the codon variants. No distinct correlation could be made with the level of tRNA corresponding to the codons and large differences were observed between synonymous codons that use the same tRNA. Therefore the effect of the second codon is likely to be due to the influence of its composing nucleotides, presumably on the structure of the ribosomal binding site. An analysis of the known sequences of a large number of Escherichia coli genes shows that the use of codons in the second position deviates strongly from the overall codon usage in E. coli. It is proposed that codon selection at the second position is not based on requirements of the gene product (a protein) but is determined by factors governing gene regulation at the initiation step of translation.  相似文献   

13.
Synonymous codon replacement can change protein structure and function, indicating that protein structure depends on DNA sequence. During heterologous protein expression, low expression or formation of insoluble aggregates may be attributable to differences in synonymous codon usage between expression and natural hosts. This discordance may be particularly important during translation of the domain boundaries (link/end segments) that separate elements of higher ordered structure. Within such regions, ribosomal progression slows as the ribosome encounters clusters of infrequently used codons that preferentially encode a subset of amino acids. To replicate the modulation of such localized translation rates during heterologous expression, we used known relationships between codon usage frequencies and secondary protein structure to develop an algorithm ("codon harmonization") for identifying regions of slowly translated mRNA that are putatively associated with link/end segments. It then recommends synonymous replacement codons having usage frequencies in the heterologous expression host that are less than or equal to the usage frequencies of native codons in the native expression host. For protein regions other than these putative link/end segments, it recommends synonymous substitutions with codons having usage frequencies matched as nearly as possible to the native expression system. Previous application of this algorithm facilitated E. coli expression, manufacture and testing of two Plasmodium falciparum vaccine candidates. Here we describe the algorithm in detail and apply it to E. coli expression of three additional P. falciparum proteins. Expression of the "recoded" genes exceeded that of the native genes by 4- to 1,000-fold, representing levels suitable for vaccine manufacture. The proteins were soluble and reacted with a variety of functional conformation-specific mAbs suggesting that they were folded properly and had assumed native conformation. Codon harmonization may further provide a general strategy for improving the expression of soluble functional proteins during heterologous expression in hosts other than E. coli.  相似文献   

14.
Synonymous codons encode the same amino acid, but differ in other biophysical properties. The evolutionary selection of codons whose properties are optimal for a cell generates the phenomenon of codon bias. Although recent studies have shown strong effects of codon usage changes on protein expression levels and cellular physiology, no translational control mechanism is known that links codon usage to protein expression levels. Here, we demonstrate a novel translational control mechanism that responds to the speed of ribosome movement immediately after the start codon. High initiation rates are only possible if start codons are liberated sufficiently fast, thus accounting for the observation that fast codons are overrepresented in highly expressed proteins. In contrast, slow codons lead to slow liberation of the start codon by initiating ribosomes, thereby interfering with efficient translation initiation. Codon usage thus evolved as a means to optimise translation on individual mRNAs, as well as global optimisation of ribosome availability.  相似文献   

15.
16.
In many organisms, selection acts on synonymous codons to improve translation. However, the precise basis of this selection remains unclear in the majority of species. Selection could be acting to maximize the speed of elongation, to minimize the costs of proofreading, or to maximize the accuracy of translation. Using several data sets, we find evidence that codon use in Escherichia coli is biased to reduce the costs of both missense and nonsense translational errors. Highly conserved sites and genes have higher codon bias than less conserved ones, and codon bias is positively correlated to gene length and production costs, both indicating selection against missense errors. Additionally, codon bias increases along the length of genes, indicating selection against nonsense errors. Doublet mutations or replacement substitutions do not explain our observations. The correlations remain when we control for expression level and for conflicting selection pressures at the start and end of genes. Considering each amino acid by itself confirms our results. We conclude that selection on synonymous codon use in E. coli is largely due to selection for translational accuracy, to reduce the costs of both missense and nonsense errors.  相似文献   

17.
The Selection-Mutation-Drift Theory of Synonymous Codon Usage   总被引:69,自引:11,他引:58       下载免费PDF全文
M. Bulmer 《Genetics》1991,129(3):897-907
It is argued that the bias in synonymous codon usage observed in unicellular organisms is due to a balance between the forces of selection and mutation in a finite population, with greater bias in highly expressed genes reflecting stronger selection for efficiency of translation. A population genetic model is developed taking into account population size and selective differences between synonymous codons. A biochemical model is then developed to predict the magnitude of selective differences between synonymous codons in unicellular organisms in which growth rate (or possibly growth yield) can be equated with fitness. Selection can arise from differences in either the speed or the accuracy of translation. A model for the effect of speed of translation on fitness is considered in detail, a similar model for accuracy more briefly. The model is successful in predicting a difference in the degree of bias at the beginning than in the rest of the gene under some circumstances, as observed in Escherichia coli, but grossly overestimates the amount of bias expected. Possible reasons for this discrepancy are discussed.  相似文献   

18.
J Heider  C Baron    A Bck 《The EMBO journal》1992,11(10):3759-3766
Incorporation of selenocysteine into proteins is directed by specifically 'programmed' UGA codons. The determinants for recognition of the selenocysteine codon have been investigated by analysing the effect of mutations in fdhF, the gene for formate dehydrogenase H of Escherichia coli, on selenocysteine incorporation. It was found that selenocysteine was also encoded when the UGA codon was replaced by UAA and UAG, provided a proper codon-anticodon interaction was possible with tRNA(Sec). This indicates that none of the three termination codons can function as efficient translational stop signals in that particular mRNA position. The discrimination of the selenocysteine 'sense' codon from a regular stop codon has previously been shown to be dependent on an RNA secondary structure immediately 3' of the UGA codon in the fdhF mRNA. It is demonstrated here that the correct folding of this structure as well as the existence of primary sequence elements located within the loop portion at an appropriate distance to the UGA codon are absolutely required. A recognition sequence can be defined which mediates specific translation of a particular codon inside an mRNA with selenocysteine and a model is proposed in which translation factor SELB interacts with this recognition sequence, thus forming a quaternary complex at the mRNA together with GTP and selenocysteyl-tRNA(Sec).  相似文献   

19.
Does the 'non-coding' strand code?   总被引:3,自引:2,他引:1       下载免费PDF全文
The hypothesis that DNA strands complementary to the coding strand contain in phase coding sequences has been investigated. Statistical analysis of the 50 genes of bacteriophage T7 shows no significant correlation between patterns of codon usage on the coding and non-coding strands. In Bacillus and yeast genes the correlation observed is not different from that expected with random synonymous codon usage, while a high correlation seen in 52 E. coli genes can be explained in terms of an excess of RNY codons. A deficiency of UUA, CUA and UCA codons (complementary to termination) seems to be restricted to the E. coli genes, and may be due to low abundance of the relevant cognate tRNA species. Thus the analysis shows that the non-coding strand has the properties expected of a sequence complementary to a coding strand, with no indications that it encodes, or may have encoded, proteins.  相似文献   

20.
Tetanus toxin fragment C had been previously expressed in Escherichia coli at 3-4% cell protein. The codon bias for tetanus toxin in Clostridium tetani is very different from that of highly expressed homologous genes in E. coli, resulting in the presence of many rare E. coli codons in the sequence encoding fragment C. We have replaced the coding sequence by sequence optimized for codon usage in E. coli, and show that the expression of fragment C is increased. Although the level of mRNA also increased this appeared to be a secondary consequence of more efficient translation. Complete sequence replacement increased expression to approximately 11-14% cell protein but only after the promoter strength had been improved.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号