Similar articles
 20 similar articles found
1.
Synonymous codon replacement can change protein structure and function, indicating that protein structure depends on DNA sequence. During heterologous protein expression, low expression or formation of insoluble aggregates may be attributable to differences in synonymous codon usage between expression and natural hosts. This discordance may be particularly important during translation of the domain boundaries (link/end segments) that separate elements of higher ordered structure. Within such regions, ribosomal progression slows as the ribosome encounters clusters of infrequently used codons that preferentially encode a subset of amino acids. To replicate the modulation of such localized translation rates during heterologous expression, we used known relationships between codon usage frequencies and secondary protein structure to develop an algorithm ("codon harmonization") for identifying regions of slowly translated mRNA that are putatively associated with link/end segments. It then recommends synonymous replacement codons having usage frequencies in the heterologous expression host that are less than or equal to the usage frequencies of native codons in the native expression host. For protein regions other than these putative link/end segments, it recommends synonymous substitutions with codons having usage frequencies matched as nearly as possible to the native expression system. Previous application of this algorithm facilitated E. coli expression, manufacture and testing of two Plasmodium falciparum vaccine candidates. Here we describe the algorithm in detail and apply it to E. coli expression of three additional P. falciparum proteins. Expression of the "recoded" genes exceeded that of the native genes by 4- to 1,000-fold, representing levels suitable for vaccine manufacture. The proteins were soluble and reacted with a variety of functional conformation-specific mAbs suggesting that they were folded properly and had assumed native conformation. 
Codon harmonization may further provide a general strategy for improving the expression of soluble functional proteins during heterologous expression in hosts other than E. coli.
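
The codon-selection rule described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the usage tables `NATIVE_LEU` and `HOST_LEU` hold made-up frequencies, the function name `harmonize_codon` is hypothetical, and the tie-breaking and fallback behavior are assumptions.

```python
# Hypothetical relative usage frequencies for the six leucine codons.
# Values are illustrative only, not measured data for any organism.
NATIVE_LEU = {"TTA": 0.45, "TTG": 0.20, "CTT": 0.10,
              "CTC": 0.05, "CTA": 0.12, "CTG": 0.08}
HOST_LEU = {"TTA": 0.13, "TTG": 0.13, "CTT": 0.10,
            "CTC": 0.10, "CTA": 0.04, "CTG": 0.50}


def harmonize_codon(native_codon, native_usage, host_usage, slow_region=False):
    """Pick the host codon whose usage frequency best matches the native one.

    In a putative slowly translated (link/end) region, only host codons with
    a frequency less than or equal to the native frequency are considered.
    """
    target = native_usage[native_codon]
    candidates = dict(host_usage)
    if slow_region:
        candidates = {c: f for c, f in host_usage.items() if f <= target}
        if not candidates:
            # Assumption: if no host codon satisfies the bound,
            # fall back to the rarest host codon.
            return min(host_usage, key=lambda c: (host_usage[c], c))
    # Closest frequency wins; ties are broken alphabetically (an assumption).
    return min(candidates, key=lambda c: (abs(candidates[c] - target), c))
```

With these illustrative tables, a frequently used native codon such as TTA maps to the frequent host codon CTG outside link/end segments, but inside such a segment the choice is restricted to host codons that are no more frequent than the native one.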

2.
3.
Corynebacteria codon usage exhibits an overall GC content of 67%, and a wobble-position GC content of 88%. Escherichia coli, on the other hand, has an overall GC content of 51%, and a wobble-position GC content of 55%. The high GC content of Corynebacteria genes results in an unfavorable codon preference for heterologous expression, and can present difficulties for polymerase-based manipulations due to secondary-structure effects. Since these characteristics are due primarily to base composition at the wobble position, synthetic genes can, in principle, be designed to eliminate these problems and retain the wild-type amino acid sequence. Such genes would obviate the need for special additives or bases during in vitro polymerase-based manipulation and for mutant host strains containing uncommon tRNAs for heterologous expression. We have evaluated synthetic genes with reduced wobble-position G/C content using two variants of the enzyme 2,5-diketo-D-gluconic acid reductase (2,5-DKGR A and B) from Corynebacterium. The wild-type genes are refractory to polymerase-based manipulations and exhibit poor heterologous expression in enteric bacteria. The results indicate that a subset of codons for five amino acids (alanine, arginine, glutamate, glycine and valine) makes the greatest contribution to the reduction in G/C content at the wobble position. Furthermore, changes in codons for two amino acids (leucine and proline) enhance bias for expression in enteric bacteria without affecting the overall G/C content. The synthetic genes are readily amplified using polymerase-based methodologies, and exhibit high levels of heterologous expression in E. coli.
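
The two quantities compared in this abstract, overall GC content and wobble-position (third codon position) GC content, can be computed as in the sketch below. The function names are hypothetical, and the code assumes an in-frame coding sequence made up of A, C, G and T only.

```python
def gc_content(seq):
    """Fraction of G or C bases in the whole sequence."""
    seq = seq.upper()
    return sum(base in "GC" for base in seq) / len(seq)


def wobble_gc_content(cds):
    """Fraction of G or C bases at the third position of each codon.

    Assumes `cds` starts in frame; trailing bases that do not form a
    complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    third_positions = cds[2:usable:3]
    return sum(base in "GC" for base in third_positions) / len(third_positions)
```

For example, the alanine run GCG GCC GCA has a wobble GC content of 2/3, while the synonymous run GCT GCA GCA encodes the same amino acids with a wobble GC content of 0.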

4.
Proteins from hyperthermophilic microorganisms are attractive candidates for novel biocatalysts because of their high resistance to temperature extremes. However, archaeal genes are usually poorly expressed in Escherichia coli because of differences in codon usage. Genes from the thermoacidophilic archaea Sulfolobus solfataricus and Thermoplasma acidophilum contain high proportions of rare codons for arginine, isoleucine, and leucine, which are recognized by the tRNAs encoded by the argU, ileY, and leuW genes, respectively, and which are rarely used in E. coli. To examine the effects of these rare codons on heterologous expression, we expressed the Sso_gnaD and Tac_gnaD genes from S. solfataricus and T. acidophilum, respectively, in E. coli. The Sso_gnaD product was expressed at very low levels when the open reading frame (ORF) was cloned in pRSET and expressed in E. coli BL21(DE3), and was expressed at much higher levels in the E. coli BL21(DE3)-CodonPlus RIL strain, which contains extra copies of the argU, ileY, and leuW tRNA genes. In contrast, Tac_gnaD was expressed at similar levels in both E. coli strains. Comparison of the Sso_gnaD and Tac_gnaD gene sequences revealed that the 5'-end of the Sso_gnaD sequence was rich in AGA (Arg) and ATA (Ile) codons. These codons were replaced with the codons commonly used in E. coli by polymerase chain reaction-mediated site-directed mutagenesis. The results of expression studies showed that a non-tandem repeat of rare codons is critical in the observed interference in heterologous expression of this gene. We concluded that the level of heterologous expression of Sso_gnaD in E. coli was limited by the clustering of the rare codons in the ORF, rather than by the rare codon frequency.

5.
Codon usage and gene expression.   Cited by: 36 (self-citations: 16; citations by others: 20)
L Holm. Nucleic Acids Research, 1986, 14(7): 3075-3087
The hypothesis that codon usage regulates gene expression at the level of translation is tested. Codon usage of Escherichia coli and phage lambda is compared by correspondence analysis, and the basis of this hypothesis is examined by connecting codon and tRNA distributions to polypeptide elongation kinetics. Both approaches indicate that if codon usage were random, tRNA limitation would affect only the rarest tRNA species. General discrimination against their cognate codons indicates that polypeptide elongation rates are maintained constant. Thus, differences in expression of E. coli genes are not a consequence of their variable codon usage. The preference of codons recognized by the most abundant tRNAs in E. coli genes encoding abundant proteins is explained by a constraint on the cost of proof-reading.

6.
We have constructed an expression system for heterologous proteins which uses the molecular machinery responsible for the high level production of bacteriorhodopsin in Halobacterium salinarum. Cloning vectors were assembled that fused sequences of the bacterio-opsin gene (bop) to coding sequences of heterologous genes and generated DNA fragments with cloning sites that permitted transfer of fused genes into H. salinarum expression vectors. Gene fusions include: (i) carboxyl-terminal-tagged bacterio-opsin; (ii) a carboxyl-terminal fusion with the catalytic subunit of the Escherichia coli aspartate transcarbamylase; (iii) the human muscarinic receptor, subtype M1; (iv) the human serotonin receptor, type 5HT2c; and (v) the yeast alpha mating factor receptor, Ste2. Characterization of the expression of these fusions revealed that the bop gene coding region contains previously undescribed molecular determinants which are critical for high level expression. For example, introduction of immunogenic and purification tag sequences into the C-terminal coding region significantly decreased bop gene mRNA and protein accumulation. The bacteriorhodopsin-aspartate transcarbamylase fusion protein was expressed at 7 mg per liter of culture, demonstrating that E. coli codon usage bias did not limit the system's potential for high level expression. The work presented describes initial efforts in the development of a novel heterologous protein expression system, which may have unique advantages for producing multiple milligram quantities of membrane-associated proteins.

7.
8.
High-level expression of a particular heterologous gene in Escherichia coli generally requires the optimization of codon usage. Genes encoding Hepatitis C virus core protein (HCcAg) and human interferon alpha subtypes 2 and 8 (HUIFNalpha2 and HUIFNalpha8) show a high content of AGA/AGG codons. These codons are decoded by the product of the dnaY gene in E. coli. The proteins used in this work have a high therapeutic value and were used as models for studying the effects of these rare codons on the efficiency of heterologous gene expression in E. coli. Expression plasmids were constructed to express any of these proteins and the dnaY gene product simultaneously in E. coli. After dnaY gene expression, HCcAg and HUIFNalpha2 expression levels increased 5 and 3 times, respectively. However, HUIFNalpha8 expression was barely detected whether or not the additional dnaY gene product was supplied. These results suggest that the high frequency of AGA/AGG codons present in the HCcAg and HUIFNalpha2 genes could be one of the factors limiting their expression in E. coli. Nevertheless, for HUIFNalpha8 it seems that other factors prevail over the lack of dnaY product. Data presented here for HCcAg and HUIFNalpha2 expression proved the value of this approach to obtain therapeutic proteins in E. coli.

9.
The frequencies of occurrence of nucleotides at the 5' side of codons have been determined in highly and weakly expressed genes from E. coli. Significant constraints on the nucleotide 5' to some codons were found in highly expressed genes. Certain rules of synonymous codon usage depending on the amino acid 3' of the codon were established. For example, codons possessing guanosine in the third position (NNG) are preferred over NNA if the next amino acid is lysine (P < 10^-5). On the other hand, rules of synonymous codon usage in relation to the 5' flanking nucleotide were found. For example, when coding for aspartic acid, the GAC codon is preferred over GAU (P < 0.001) if uridine is 5' to the codon and, on the contrary, GAU is favoured (P < 0.0001) if guanosine is at the 5' side of the aspartic acid codon. These rules can be used in the chemical synthesis of genes designed for expression in E. coli.

10.
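A preliminary study of the relationship between gene expression level and synonymous codon usage   Cited by: 3 (self-citations: 0; citations by others: 3)
A self-consistent information clustering method is proposed for predicting gene expression level and synonymous codon usage. Synonymous codons are divided into optimal codons, non-optimal codons, and rare codons, and the usage frequencies of these three classes are regarded as the main factors regulating gene expression level. On this basis, gene expression levels and codon usage in E. coli and yeast were predicted with the self-consistent information clustering method. Highly and weakly expressed genes were found to separate clearly, and gene expression levels fell into four grades: very highly expressed genes (VH), highly expressed genes (H), moderately low expressed genes (LM), and lowly expressed genes (LL).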

11.
The Rickettsia prowazekii ATP/ADP translocase (Tlc) is the first member of a new family of ATP/ADP exchangers that includes both prokaryotic and eukaryotic proteins. We optimized the codon usage for expression of tlc in Escherichia coli by means of gene synthesis, expressed the synthetic gene in E. coli, and purified a modified Tlc that contained a C-terminal tag of 10 consecutive histidine residues by immobilized metal affinity chromatography. Although codon usage in R. prowazekii is very different from that in E. coli, the optimization of the codon usage by itself was insufficient to improve expression. However, the change of the cloning vector from pET11a to pT7-5 led to a 3- to 10-fold increase in the specific ATP transport rate by cells expressing the synthetic construct. The authenticity of the purified protein was confirmed by N-terminal amino acid sequencing and matrix-assisted laser desorption/ionization mass spectrometry.

12.
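The relationship between codon usage frequency and gene expression level was studied in nucleic acid sequences from Escherichia coli (115 genes) and Saccharomyces yeast (97 genes). Synonymous codons were divided, according to their usage-frequency statistics, into three classes: optimal codons (H), non-optimal codons (L), and rare codons (R). For the coding region of each gene sequence, the probabilities of occurrence P(H), P(L), and P(R) were calculated. With P(H) and P(R) as indices and clustering by a graph-theoretic method, highly and weakly expressed genes of each organism were found to separate clearly, and gene expression levels fell into four grades: very highly expressed genes (VH), highly expressed genes (H), moderately low expressed genes (LM), and lowly expressed genes (LL). The expression level of each gene class correlated well with experimental results and agreed well with the available data for E. coli and yeast.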
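
The per-gene probabilities P(H), P(L) and P(R) described in this abstract can be sketched as below. The class table `CODON_CLASS` is a hypothetical assignment for a handful of codons; the abstract derives its classes from usage-frequency statistics that are not reproduced here, and the graph-theoretic clustering step is not shown.

```python
from collections import Counter

# Hypothetical class assignment: H = optimal, L = non-optimal, R = rare.
# These labels are illustrative and are not taken from the paper.
CODON_CLASS = {
    "CTG": "H", "AAA": "H", "GAA": "H",
    "CTC": "L", "AAG": "L", "GAG": "L",
    "CTA": "R", "AGA": "R", "AGG": "R",
}


def class_probabilities(cds, codon_class=CODON_CLASS):
    """Return P(H), P(L) and P(R) for an in-frame coding sequence.

    Codons missing from the class table (an assumption of this sketch)
    are skipped and do not count toward the total.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codon_class[c] for c in codons if c in codon_class)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no classified codons in sequence")
    return {label: counts[label] / total for label in "HLR"}
```

Each gene is then represented by the pair (P(H), P(R)), which is the input the abstract describes for clustering genes into expression grades.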

13.
14.
Gu W, Zhou T, Ma J, Sun X, Lu Z. Bio Systems, 2004, 73(2): 89-97
The role of the silent position of the codon in protein structure is an interesting and as yet unclear problem. In this paper, 563 Homo sapiens genes and 417 Escherichia coli genes coding for proteins with four different folding types have been analyzed using variance analysis, a multivariate analysis method newly used in codon usage analysis, to find the correlation between amino acid composition, synonymous codon usage, and protein structure in different organisms. It has been found that in E. coli, both amino acid composition in differently folded proteins and synonymous codon usage in the gene classes coding for differently folded proteins are significantly different. It was also found that only amino acid composition differs among protein classes in H. sapiens. There is no universal correlation between synonymous codon usage and protein structure in these two organisms. Further analysis has shown that GC content at the second codon position can distinguish genes coding for differently folded proteins in both organisms.

15.
The occurrence of nucleotides at the 3' side of codons has been determined in highly and weakly expressed genes from Escherichia coli. It was found that the usage of some amino acid codons in highly expressed genes was site specific, depending on the base 3' to the codon. The role of the 3' nucleotide as a modulator of codon translation effectiveness is discussed. The rules of synonymous codon usage in relation to the 3' flanking nucleotide have been established for highly expressed genes. For example, if a triplet next to the lysine codon starts with guanosine, lysine is preferably encoded by AAA and not by AAG (P < 10^-8), while if cytidine is 3' to the lysine codon, AAG is preferred over AAA (P < 0.001). These rules are observed in highly expressed mRNAs and absent in weakly expressed ones, and can be used in the chemical synthesis of genes designed for expression in E. coli.
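
The lysine example in this abstract can be turned into a small recoding sketch for synthetic gene design. Only the two rules stated above (AAA before G, AAG before C) are encoded; the names `LYS_RULES` and `recode_lysines` are hypothetical, and leaving the codon unchanged before A or T is an assumption, since the abstract gives no rule for those cases.

```python
# 3' context rules for lysine taken from the abstract:
# next triplet starts with G -> AAA, next triplet starts with C -> AAG.
LYS_RULES = {"G": "AAA", "C": "AAG"}


def recode_lysines(cds):
    """Apply the 3'-context rules to every lysine codon in an in-frame CDS.

    Lysine codons followed by A or T, and a lysine codon at the very end,
    are left unchanged (an assumption of this sketch).
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    recoded = []
    for index, codon in enumerate(codons):
        if codon in ("AAA", "AAG") and index + 1 < len(codons):
            next_base = codons[index + 1][0]
            codon = LYS_RULES.get(next_base, codon)
        recoded.append(codon)
    # Keep any trailing bases that do not form a complete codon.
    return "".join(recoded) + cds[usable:]
```

Because AAA and AAG both encode lysine, the recoded sequence specifies the same protein as the input.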

16.
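Position dependence of synonymous codon usage   Cited by: 4 (self-citations: 0; citations by others: 4)
Synonymous codon usage at different positions within Escherichia coli coding regions was studied. The codon usage of many amino acids was found to change significantly in the translation initiation region, whereas only a few amino acids showed weaker changes in the translated region. Because codon usage is closely related to gene expression, these results are consistent with the experimentally observed importance of codon usage at the 5' end of the coding region for expression. Further results also suggest which codons, when used at particular positions, may affect gene expression.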

17.
The tryptophanase structural gene, tnaA, of Escherichia coli K-12 was cloned and sequenced. The size, amino acid composition, and sequence of the protein predicted from the nucleotide sequence agree with protein structure data previously acquired by others for the tryptophanase of E. coli B. Physiological data indicated that the region controlling expression of tnaA was present in the cloned segment. Sequence data suggested that a second structural gene of unknown function was located distal to tnaA and may be in the same operon. The pattern of codon usage in tnaA was intermediate between codon usage in four of the ribosomal protein structural genes and the structural genes for three of the tryptophan biosynthetic proteins.

18.
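The expression vector pRELPN, which carries a synthetic elastin-like polypeptide (ELP) gene, ELP60, promotes high-level expression of foreign genes in Escherichia coli. When ELP60 is cloned into the multiple cloning site of the E. coli expression vector pET28a, ELP60 itself is expressed at a low level and does not form an ELP fusion protein with the target gene; instead, it promotes independent high-level expression of a foreign target gene that carries its own ATG start codon and is cloned downstream of the ELP60 gene. The foreign target protein accounts for 20% to 60% of host protein, 2- to 10-fold higher than the expression obtained with the pET28a vector. pRELPN-type vectors are suitable for independent high-level expression of foreign genes encoding antibodies, antigens, enzymes, recombinant proteins, peptides, and ELP fusion proteins. These results indicate that pRELPN is an effective expression vector that can help overcome the barrier to industrial production caused by low or absent expression of foreign genes from conventional vectors in prokaryotic expression.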

19.
Synonymous codon usage is a commonly used means for estimating gene expression levels of Escherichia coli genes and has also been used for predicting highly expressed genes for a number of prokaryotic genomes. By comparison of expression level-dependent features in codon usage with protein abundance data from two proteome studies of exponentially growing E. coli and Bacillus subtilis cells, we try to evaluate whether the implicit assumption of this approach can be confirmed with experimental data. Log-odds ratio scores are used to model differences in codon usage between highly expressed genes and the genomic average. Using these, the strength and significance of expression level-dependent features in codon usage were determined for the genes of the Escherichia coli, Bacillus subtilis and Haemophilus influenzae genomes. The comparison of codon usage features with protein abundance data confirmed that such a relationship exists, although exceptions, possibly related to functional context, were found. For species with expression level-dependent features in their codon usage, the applied methodology could be used to improve in silico simulations of the outcome of two-dimensional gel electrophoretic experiments.
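
A log-odds codon score of the kind mentioned in this abstract can be sketched as below. The abstract does not give the exact formula, so the base-2 logarithm, the per-codon averaging in `gene_score`, and the function names are all assumptions; codon frequencies are expected as plain dictionaries supplied by the caller.

```python
import math


def log_odds_scores(high_freq, genome_freq):
    """Per-codon log2 ratio of usage in highly expressed genes vs. the genome.

    Both arguments map codon -> frequency. Codons with a missing or zero
    frequency in either table are skipped (an assumption of this sketch).
    """
    return {
        codon: math.log2(high_freq[codon] / genome_freq[codon])
        for codon in high_freq
        if high_freq[codon] > 0 and genome_freq.get(codon, 0) > 0
    }


def gene_score(cds, scores):
    """Mean log-odds score over the scored codons of an in-frame CDS."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    scored = [scores[c] for c in codons if c in scores]
    return sum(scored) / len(scored) if scored else 0.0
```

A positive gene score means the gene leans toward codons that are over-represented in the highly expressed set; a negative score means it leans toward codons that are under-represented there.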

20.
Escherichia coli has long been regarded as a model organism in the study of codon usage bias (CUB). However, most studies in this organism regarding this topic have been computational or, when experimental, restricted to small datasets; particularly little attention has been given to genes with low CUB. In this work, correspondence analysis on codon usage is used to classify E. coli genes into three groups, and the relationship between them and expression levels from microarray experiments is studied. These groups are: group 1, highly biased genes; group 2, moderately biased genes; and group 3, AT-rich genes with low CUB. It is shown that, surprisingly, there is a negative correlation between codon bias and expression levels for group 3 genes, i.e. genes with extremely low codon adaptation index (CAI) values are highly expressed, while group 2 genes show the lowest average expression levels and group 1 genes show the usual expected positive correlation between CAI and expression. This trend is maintained over all functional gene groups, seeming to contradict the E. coli-yeast paradigm on CUB. It is argued that these findings are still compatible with the mutation-selection balance hypothesis of codon usage and that E. coli genes form a dynamic system shaped by these factors.
