Similar articles
 19 similar articles found (search time: 296 ms)
1.
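A preliminary study of the relationship between gene expression level and synonymous codon usage   Total citations: 3, self-citations: 0, citations by others: 3
A self-consistent information clustering method is proposed for predicting gene expression level and synonymous codon usage. Synonymous codons are divided into optimal, non-optimal, and rare codons, and the usage frequencies of these three classes are taken to be the main factor regulating gene expression level. On this basis, the gene expression levels and codon usage of two organisms, E. coli and yeast, were predicted with the self-consistent information clustering method. Highly and lowly expressed genes were found to separate clearly, and gene expression levels fell into four classes: very highly expressed genes (VH), highly expressed genes (H), moderately low expressed genes (LM), and lowly expressed genes (LL);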

2.
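Analysis of factors influencing codon usage in the genome of Streptococcus pneumoniae   Total citations: 5, self-citations: 2, citations by others: 5
Hou Zhuocheng  Yang Ning 《遗传学报》 (Acta Genetica Sinica) 2002,29(8):747-752
The complete genome sequence of Streptococcus pneumoniae has been determined and was recently published. The genome sequence was analyzed in detail to study the codon usage pattern and the factors that influence codon usage. Highly expressed genes use cytosine (C) at the third codon position significantly more often than lowly expressed genes, whereas lowly expressed genes tend to use purines at the third codon position. Gene expression level is significantly correlated with the first axis of the correspondence analysis (R=0.86). Comparing the codon usage patterns of the highly and lowly expressed gene groups shows that expression level has a significant effect on codon usage. The G+C composition of genes is significantly correlated with gene expression level (R=0.44) and with the first axis of the correspondence analysis (R=0.5), and it significantly affects both expression level and codon usage. Analysis of GC skew, protein hydrophobicity, and gene length shows that genes of different lengths differ in expression level, GC content, and GC3s. The results indicate that natural selection at the level of expression and the base composition of genes are the main factors influencing codon usage in S. pneumoniae genes, and that gene length has some effect on codon usage.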

3.
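Genome-wide analysis of codon usage in Wolbachia pipientis wMel   Total citations: 1, self-citations: 0, citations by others: 1
This study examined the codon usage pattern of the genome of Wolbachia pipientis wMel, an obligate parasitic bacterium of Drosophila melanogaster, and inferred the factors that influence its codon composition. A total of 478 protein-coding genes were selected for study; their GC content is relatively low, about 0.282 to 0.432. The results show that context-dependent mutation is the main factor influencing codon composition in the W. pipientis wMel genome. The Nc-plot also shows that the codon composition of the genome is influenced by nucleotide composition bias. Correspondence analysis shows that gene length (R=0.123, P<0.01) and gene expression level (R=-0.312, P<0.01) also contribute to the codon usage bias of the genome. Thus context-dependent mutation, nucleotide composition, gene length, and gene expression level all influence codon usage bias in the genome. Several features of the Wolbachia genome are highly similar to those of animal mitochondrial genomes, and there are reports suggesting that the two share a common origin, but this requires further verification.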

4.
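Taking plant potassium outward-rectifier channel (K+ channel outward rectifier, KCO) genes as the object of study, the codon usage patterns of 75 plant KCO genes were analyzed with the CodonW software to explore the codon usage pattern and the possible factors that influence codon usage. The results show that differences in base composition (r=0.961, P<0.01) and natural selection (r=0.568, P<0.01) are the main factors influencing codon usage, and that highly expressed genes strongly prefer codons ending in G or C. Twenty-six codons, including UUC and CUC and all ending in G/C, were identified as the optimal codons for high expression of plant KCO genes.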

5.
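Regulation of translation initiation is a key step in the regulation of gene expression. Using chicken as the study material, this paper compares the base sequences upstream and downstream of the translation initiation codon between highly and lowly expressed genes in the chicken genome, with the aim of finding specific regulatory sites that influence chicken gene expression level. The complete mRNA sequences of all 3,020 genes with a single splice form, together with well-annotated 5'UTR sequences, were downloaded from Ensembl. A computer program was written to read the base at each position upstream and downstream of the initiation codon of each gene's mRNA. The study found that positions -3 and -2 upstream of the initiation codon may be key sites for correct recognition of the initiation codon in chicken genes. Base composition analysis around the initiation codon found that both highly and lowly expressed genes tend to use (G+C) upstream of the initiation codon, and that this bias is especially strong in highly expressed genes. Sequence comparison found that highly expressed genes are significantly biased toward G at positions -9, -6, -3, and +4 and toward C at positions -1, -2, -4, and -5. Lowly expressed genes use A and U upstream of the initiation codon significantly more often than highly expressed genes, with a strong bias toward A at position -19 and toward U at positions +1, +11, and +14.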
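
The abstract above mentions a program that reads the base at each position around the initiation codon. The authors' program is not shown, so what follows is only a minimal Python sketch of that kind of tally. The function name `position_base_counts`, the input format, and the numbering convention (the A of AUG is +1, the base just upstream is -1, there is no position 0) are all assumptions made for illustration; the abstract's own numbering may differ.

```python
from collections import Counter, defaultdict


def position_base_counts(records, upstream=9, downstream=4):
    """Tally bases at each position relative to the start codon.

    records: iterable of (mrna, start) pairs, where start is the 0-based
    index of the A of the AUG start codon in the mRNA string.
    Assumed convention: A of AUG is +1, the preceding base is -1,
    and position 0 does not exist.
    """
    counts = defaultdict(Counter)
    for mrna, start in records:
        seq = mrna.upper().replace("T", "U")
        for pos in range(-upstream, downstream + 1):
            if pos == 0:
                continue  # no position 0 in this convention
            idx = start + pos if pos < 0 else start + pos - 1
            if 0 <= idx < len(seq):  # skip positions outside the mRNA
                counts[pos][seq[idx]] += 1
    return counts


if __name__ == "__main__":
    # Two toy mRNAs (hypothetical data), each with AUG starting at index 6.
    toy = [("GCCACCAUGGCG", 6), ("UUUAAAAUGUCC", 6)]
    table = position_base_counts(toy, upstream=3, downstream=4)
    for pos in sorted(table):
        print(pos, dict(table[pos]))
```

Running the tally separately on a highly expressed and a lowly expressed gene set, then comparing the per-position frequencies, would be one way to look for the kind of site-specific differences the abstract reports.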

6.
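The cDNA sequences of all microchromosome genes of the zebra finch were downloaded from the NCBI database (http://www.ncbi.nlm.nih.gov/projects/mapview/map), and the CDS sequences of 1,586 genes were finally included in the statistical analysis. Codon bias analysis was performed with CodonW (1.4.2), and 27 codons, including UUC, UCC, and UCG, were preliminarily identified as the "optimal" codons for expression of zebra finch microchromosome genes. Correspondence analysis showed that the main factors influencing codon usage in zebra finch microchromosome genes are GC3s, the GC content of the CDS, and the expression abundance of the genes. Codon usage in these genes is significantly influenced by base composition, and their codon bias results from the combined action of base composition, selection, and other factors. The aim of this study was to investigate systematically the codon usage of zebra finch microchromosome genes and to explore the molecular mechanisms that regulate avian gene expression.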

7.
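Analysis of codon preference in the SARS coronavirus   Total citations: 14, self-citations: 0, citations by others: 14
The aim was to analyze the codon preference of the SARS (Severe Acute Respiratory Syndrome) coronavirus and so provide a reference for choosing a host system for expressing SARS coronavirus genes. The CHIPS (Codon Heterozygosity in a Protein-coding Sequence) and CUSP (Create a codon usage table) programs of EMBOSS (The European Molecular Biology Open Software Suite) were used to analyze the genes of six encoded proteins of the SARS coronavirus, and the six coding sequences were concatenated for a whole-genome codon preference analysis. The results were compared with the codon preferences of E. coli, yeast, and human. The CHIPS analysis gave an Nc (effective number of codons) value of 53.338 for the SARS coronavirus; the Nc values for the S, E, M, and N proteins, the 3CL protease, and the RNA polymerase were 45.733, 61.000, 59.040, 46.618, 46.924, and 51.902, respectively. The usage frequencies of the different codons for amino acids such as A, P, R, S, T, and L differ considerably in the SARS coronavirus. E. coli has 25 codons, yeast 12, and human 20 whose usage preference differs markedly from that of the SARS coronavirus. It can therefore be concluded that the codons encoding SARS coronavirus amino acids occur at fairly uniform frequencies. The codon preference of the SARS coronavirus is closer to that of eukaryotes and farther from that of prokaryotes, so expressing its genes in eukaryotic systems such as yeast may be more appropriate.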

8.
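To determine the relationship between vaccinia virus codon bias and gene expression, and its role in the interaction between vaccinia virus and host cells, the HIV-1 gag gene was modified according to the preferred codons of vaccinia virus, and the expression levels of the synthetic gene and the wild-type HIV-1 gag gene in the vaccinia virus vector system were studied. The results show that: ① each target gene was inserted in the forward orientation downstream of the 7.5k promoter in the TK region of vaccinia virus; ② immunofluorescence showed that the gag gene was well expressed in vaccinia virus both before and after modification; ③ Western blot showed that, at the same infectious dose, the modified gag gene had a higher expression level; ④ flow cytometry showed that the codon-modified gag gene was expressed at a level about 17% higher than the wild-type gag gene. These results indicate that modifying foreign genes according to the preferred codons of vaccinia virus can serve as a strategy for increasing foreign gene expression in vaccinia virus, and suggest that codon bias is an important regulatory factor in the interaction between vaccinia virus and host cells.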

9.
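Objective: To examine the expression of insulin-like growth factor-1 receptor (IGF-IR) in breast cancer tissue and its clinical significance. Methods: Semi-quantitative RT-PCR was used to analyze IGF-IR mRNA expression in 84 cases of breast cancer and adjacent normal breast tissue, and the relationship between expression and patients' clinicopathological features and prognosis was analyzed. Results: IGF-IR mRNA expression in breast cancer tissue was significantly higher than in adjacent breast tissue (P<0.001). IGF-IR mRNA expression in breast cancer tissue was significantly associated with tumor differentiation, TNM stage, and lymph node metastasis (P = 0.005, 0.025, and 0.041, respectively). In addition, the five-year overall survival rate of patients with high IGF-IR expression (38.3%) was significantly lower than that of patients with low IGF-IR expression (49.7%; P=0.009). Multivariate Cox model analysis showed that IGF-IR mRNA expression level is an independent prognostic factor for breast cancer patients (HR=2.78, 95% CI: 1.94-3.94, P=0.041). Conclusion: Upregulation of IGF-IR gene expression plays an important role in the development of breast cancer. IGF-IR mRNA expression level may become an important molecular marker for judging prognosis in clinical breast cancer patients.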

10.
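A preliminary analysis of synonymous codon usage in poplar   Total citations: 1, self-citations: 0, citations by others: 1
Poplar is one of the most widely cultivated and important afforestation tree species in the world and has become a model plant for forest tree genetic engineering research. Using 314 protein-coding genes of poplar, the effects of several important factors on poplar codon usage were examined by correspondence analysis and ENC-plot analysis. On the first axis, which has the greatest influence, the coordinates of genes are highly significantly negatively correlated with gene expression level (CAI) (r=-0.94**), followed by highly significant correlations with GC3S and gene length (r=0.86** and r=-0.57**). This indicates that gene expression level is the main factor shaping codon usage, followed by the base composition of the coding region and gene length. ENC-plot analysis confirms this. Relative synonymous codon usage (RSCU) values show that highly expressed genes strongly prefer codons ending in A or T, and 10 codons, including TTA and ATA, were identified as the main preferred codons of poplar. Comparing the codon usage frequencies of poplar with those of model organisms such as Arabidopsis, rice, E. coli, and human shows that the codon preference of poplar is most similar to that of Arabidopsis, which is also a dicot, and differs considerably from those of human and E. coli.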
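
The abstract above relies on relative synonymous codon usage (RSCU). As a rough illustration only, the sketch below computes RSCU in its commonly used form: the observed count of a codon divided by the mean count across all synonymous codons for the same amino acid. It assumes the standard genetic code, and the function name `rscu` and the toy sequence are hypothetical; this is not the software used in the study.

```python
from collections import Counter

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def rscu(cds):
    """Return {codon: RSCU} for amino acids with two or more codons.

    Stop codons, Met, and Trp are skipped, as are amino acids that do
    not occur in the sequence (their RSCU is undefined).
    """
    seq = cds.upper().replace("U", "T")
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or len(family) < 2 or total == 0:
            continue
        mean = total / len(family)  # expected count under equal usage
        for c in family:
            values[c] = counts[c] / mean
    return values


if __name__ == "__main__":
    # Toy sequence: AAA AAA AAG GAA
    for codon, value in sorted(rscu("AAAAAAAAGGAA").items()):
        print(codon, round(value, 3))
```

An RSCU above 1 means the codon is used more often than expected under equal synonymous usage; values below 1 mean it is used less often.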

11.
Preferential usage of some minor codons in bacteria   Total citations: 2, self-citations: 0, citations by others: 2
Ohno H  Sakai H  Washio T  Tomita M 《Gene》2001,276(1-2):107-115
In many bacterial species, such as Deinococcus radiodurans, Haemophilus influenzae, and Methanobacterium thermoautotrophicum, some minor codons are preferentially used near the initiation codon. Among these, some minor codons show a stronger preference for the initiation site in the high codon adaptation index (CAI) group (comprising highly expressed genes) than in the low CAI group (comprising lowly expressed genes). In the present study, codon usage at the initiation site and in the rest of the gene was systematically compared in 27 complete bacterial genomes and the Saccharomyces cerevisiae genome. Furthermore, we classified genes into two groups according to their CAI values and conducted the same analysis for each group. Our results suggest that some minor codons at the initiation site play a role in regulating translation in many bacteria. We have summarized the codons that are preferentially used at the initiation site and probably play a role in regulating gene expression in these organisms.

12.
The constraints on nucleotide sequences of highly and weakly expressed genes from Escherichia coli have been analysed and compared. Differences in synonymous codon spectra in highly and weakly expressed genes lead to different frequencies of nucleotides (in the first and third codon positions) and dinucleotides in the two groups of genes. It has been found that the choice of synonymous codons in highly expressed genes depends on the nucleotides adjacent to the codon. For example, lysine is preferably encoded by the AAA codon if guanosine is 3' to the lysine codon (AAA-G, P < 10^-9). Conversely, AAG is used more often than AAA (P < 0.001) if cytidine is 3' adjacent to lysine. Guanosine occurs more frequently than adenosine 5' to all the lysine codons (AAR, P < 10^-5), i.e., NNG codons are preferred over the synonymous NNA codons 5' to the positions of lysine in the genes. The context effect was observed in nonsense and missense suppression experiments. Therefore, a hypothesis has been suggested that the efficiency of translation of some codons (for which the constraints on the adjacent nucleotides were found) can be modulated by the codon context. The rules for preferable synonymous codon choice in highly expressed genes depending on the nucleotides surrounding the codon are presented. These rules can be used in the chemical synthesis of genes designed for expression in E. coli.
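
The lysine example in the abstract above (AAA versus AAG depending on the 3' neighbour) can be illustrated with a simple tally. The sketch below only counts; it does not reproduce the significance tests behind the reported P values, which would need a separate contingency-table test. The function name `lysine_context_counts` and the toy sequence are hypothetical.

```python
from collections import Counter


def lysine_context_counts(cds):
    """Count the nucleotide immediately 3' of each in-frame lysine codon.

    Returns {"AAA": Counter, "AAG": Counter}, where each Counter maps the
    3' neighbouring base to the number of times it follows that codon.
    A lysine codon at the very end of the sequence has no 3' neighbour
    and is ignored.
    """
    seq = cds.upper().replace("U", "T")
    table = {"AAA": Counter(), "AAG": Counter()}
    for i in range(0, len(seq) - 3, 3):  # i + 3 must be a valid index
        codon = seq[i:i + 3]
        if codon in table:
            table[codon][seq[i + 3]] += 1
    return table


if __name__ == "__main__":
    # Toy reading frame: ATG AAA GGC AAG CTT AAA GAA
    result = lysine_context_counts("ATGAAAGGCAAGCTTAAAGAA")
    print(dict(result["AAA"]))  # 3' neighbours of AAA
    print(dict(result["AAG"]))  # 3' neighbours of AAG
```

Summing these tables over a set of highly expressed genes gives the 2 x 4 table (codon by 3' base) on which a context preference such as AAA-G could then be tested.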

13.
In all, 238 and 155 transfer (t)RNA genes were predicted from the genomes of Phytophthora sojae and P. ramorum, respectively. After omitting pseudogenes and undetermined types of tRNA genes, there remained 208 P. sojae tRNA genes and 140 P. ramorum tRNA genes. There were 45 types of tRNA genes, with distinct anticodons, in each species. Fourteen common anticodon types of tRNAs are missing altogether from the genome in the two species; however, these appear to be compensated by wobbling of other tRNA anticodons in a manner which is tied to the codon bias in Phytophthora genes. The most abundant tRNA class was arginine in both P. sojae and P. ramorum. A codon usage table was generated for these two organisms from a total of 9,803,525 codons in P. sojae and 7,496,598 codons in P. ramorum. The most abundant codon type detected from the codon usage tables was GAG (encoding glutamic acid), whereas the most numerous tRNA gene had a methionine anticodon (CAT). The correlation between the frequencies of tRNA genes and the codon frequencies in protein-coding genes was very low (0.12 in P. sojae and 0.19 in P. ramorum); however, the correlation between amino acid tRNA gene frequency and the corresponding amino acid codon frequency in P. sojae and P. ramorum was substantially higher (0.53 in P. sojae and 0.77 in P. ramorum). The codon usage frequencies of P. sojae and P. ramorum were very strongly correlated (0.99), as were tRNA gene frequencies (0.77). Approximately 60% of orthologous tRNA gene pairs in P. sojae and P. ramorum are located in regions that have conserved synteny in the two species.

14.
We have analyzed factors affecting the codon usage pattern of the chloroplast genomes of representative species of the pooid grass family. Correspondence analysis of relative synonymous codon usage (RSCU) showed that the positions of genes on the secondary axis were correlated with their GC3S values (all r > 0.3, p < 0.05), indicating mutational bias as an important selective force that shaped the variation in codon usage among chloroplast genes. The Nc-plot showed that although a majority of the points with low Nc values were lying below the expected curve, a few genes lay on the expected curve. The Nc-plot clearly showed that mutational bias plays a major role in codon biology across the monocot plastomes. The hydrophobicity and aromaticity of the encoded proteins of each species were found to be other factors in codon usage variation. In light of the above, besides natural selection, several other factors are also likely to be involved in determining the selective constraints on codon bias in the plastomes of pooid grass genomes. In addition, five codons (B. distachyon), seven codons (H. vulgare), and four codons (T. aestivum) were identified as optimal codons of the three grass chloroplasts. To identify genes evolving under positive selection, rates of nonsynonymous substitutions (Ka) and synonymous substitutions (Ks) were computed for all groups of orthologous gene pairs.

15.
Xia X 《PloS one》2007,2(2):e188
The optimal context for translation initiation in mammalian species is GCCRCCaugG (where R = purine and "aug" is the initiation codon), with the -3R and +4G being particularly important. The presence of +4G has been interpreted as necessary for efficient translation initiation. Accumulated experimental and bioinformatic evidence has suggested an alternative explanation based on amino acid constraint on the second codon, i.e., amino acid Ala or Gly is needed as the second amino acid in the nascent peptide for the cleavage of the initiator Met, and the consequent overuse of Ala and Gly codons (GCN and GGN) leads to the +4G consensus. I performed a critical test of these alternative hypotheses on +4G based on 34169 human protein-coding genes and published gene expression data. The result shows that the prevalence of +4G is not related to translation initiation. Among the five G-starting codons, only alanine codons (GCN), and glycine codons (GGN) to a much smaller extent, are overrepresented at the second codon, whereas the other three codons are not overrepresented. While highly expressed genes have more +4G than lowly expressed genes, the difference is caused by GCN and GGN codons at the second codon. These results are inconsistent with +4G being needed for efficient translation initiation, but consistent with the amino acid constraint hypothesis.

16.
《Gene》1998,215(2):405-413
Biases in the codon usage and base compositions at three codon sites in different genes of the A+T-rich Gram-negative bacterium Haemophilus influenzae and the G+C-rich Gram-positive bacterium Mycobacterium tuberculosis have been examined to address the following questions: (1) whether the synonymous codon usage in organisms having highly skewed base compositions is totally dictated by the mutational bias as reported previously (Sharp, P.M., Devine, K.M., 1989. Codon usage and gene expression level in Dictyostelium discoideum: highly expressed genes do `prefer' optimal codons. Nucleic Acids Res. 17, 5029–5039), or is also controlled by translational selection; (2) whether preference of G in the first codon positions by highly expressed genes, as reported in Escherichia coli (Gutierrez, G., Marquez, L., Marin, A., 1996. Preference for guanosine at first codon position in highly expressed Escherichia coli genes. A relationship with translational efficiency. Nucleic Acids Res. 24, 2525–2527), is true in other bacteria; and (3) whether the usage of bases in three codon positions is species-specific. The results presented here show that even in organisms with high mutational bias, translational selection plays an important role in dictating the synonymous codon usage, though the set of optimal codons is chosen in accordance with the mutational pressure. The frequencies of G-starting codons are positively correlated with the level of expression of genes, as estimated by their Codon Adaptation Index (CAI) values, in M. tuberculosis as well as in H. influenzae, in spite of the latter having an A+T-rich genome. The present study on the codon preferences of two organisms with oppositely skewed base compositions thus suggests that the preference of G-starting codons by highly expressed genes might be a general feature of bacteria, irrespective of their overall G+C contents. The ranges of variations in the frequencies of individual bases at the first and second codon positions of genes of both H. influenzae and M. tuberculosis are similar to those of E. coli, implying that though the composition of all three codon positions is governed by a selection-mutation balance, the mutational pressure has little influence in the choice of bases at the first two codon positions, even in organisms with highly biased base compositions.
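
The abstract above uses Codon Adaptation Index (CAI) values as a proxy for expression level. As a hedged sketch of the general idea, the code below derives relative adaptiveness weights from a reference set of genes (each codon's count divided by the count of the most used synonymous codon) and takes the geometric mean of those weights over a query gene. The standard genetic code, the 0.5 pseudocount for unseen codons, the function names, and the toy sequences are assumptions for illustration; the reference set used in the study is not specified here.

```python
import math
from collections import Counter

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codons_of(cds):
    """Split a coding sequence into in-frame codons, dropping invalid ones."""
    seq = cds.upper().replace("U", "T")
    triplets = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return [t for t in triplets if t in CODON_TABLE]


def relative_adaptiveness(reference_cds_list, pseudocount=0.5):
    """Weights w = count / max synonymous count, from a reference gene set.

    Codons never seen in the reference get `pseudocount` (an assumption)
    so that no weight is exactly zero. Stop, Met, and Trp are excluded.
    """
    counts = Counter()
    for cds in reference_cds_list:
        counts.update(codons_of(cds))

    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    weights = {}
    for aa, family in families.items():
        if aa == "*" or len(family) < 2:
            continue
        adjusted = {c: counts[c] or pseudocount for c in family}
        top = max(adjusted.values())
        for c in family:
            weights[c] = adjusted[c] / top
    return weights


def cai(cds, weights):
    """Geometric mean of the weights of the codons in `cds`."""
    logs = [math.log(weights[c]) for c in codons_of(cds) if c in weights]
    return math.exp(sum(logs) / len(logs)) if logs else float("nan")


if __name__ == "__main__":
    w = relative_adaptiveness(["AAAAAAAAAAAG"])  # toy reference: 3 AAA, 1 AAG
    print(round(cai("AAAAAG", w), 4))
```

The index ranges from just above 0 to 1, with 1 meaning the gene uses only the most frequent synonymous codon of the reference set at every position.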

17.
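Analysis of codon usage frequencies in different species across sections of poplar   Total citations: 1, self-citations: 0, citations by others: 1
Zhou Meng  Tong Chunfa  Shi Jisen 《遗传学报》 (Acta Genetica Sinica) 2007,34(6):555-561
The degeneracy of the genetic code means that different species prefer different codons. Knowing the codon usage characteristics of a species can guide the modification of a foreign gene before it is introduced and so help achieve efficient expression of that gene. Poplar is one of the most widely cultivated and important afforestation tree species in the world and has become a model plant for forest tree genetic engineering research. In this study, high-frequency codon analysis was applied to the protein-coding sequences (CDS) of four poplar species: quaking aspen P. tremuloides, Chinese white poplar P. tomentosa, eastern cottonwood P. deltoides, and black cottonwood P. trichocarpa. The relative frequency of synonymous codons (RFSC) was calculated and the high-frequency codons of the four species were determined. Although codon usage differs somewhat among poplar species, the differences in preferred codons are small, and shared codons make up the great majority. Only a few amino acids, such as Pro, Thr, and Cys, have different preferred codons. This commonality suggests that a foreign gene designed with the preferred codons of any one poplar species can also be used in other poplars.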

18.
The nucleotide frequencies 5' and 3' to the sense codons in highly and weakly expressed genes have been investigated by the chi-square method. A comparison between the experimental and computer-generated random nucleotide sequences (in which each codon is substituted by a random synonymous one) was made. It was shown that the choice of a particular codon among the synonymous ones in a given position of the gene depends on the three nucleotides 3' and 5' adjacent to the codon in highly expressed genes (the triplet 3' and a single nucleotide 5' to the codons in weakly expressed genes). Concrete patterns for the preferable choice of synonymous codons depending on their contexts are presented. It is suggested that these constraints are related to the efficiency of messenger translation. The constraints on the amino acid sequences of encoded proteins also lead to statistically significant biases in nucleotide frequencies around the sense codons. The biological role of these constraints is discussed.

19.
With the three-letter alphabet [R,Y,N] (R = purine, Y = pyrimidine, N = R or Y), there are 26 codons (NNN being excluded): RNN,...,NNY (six codons with two unspecified bases N), RRN,...,NYY (12 codons with one unspecified base N), RRR,...,YYY (eight specified codons). A statistical methodology that uses the codon frequency and a reduced centered variable leads to similar results for a codon occurrence study, regardless of gene function and regardless of a particular protein coding gene taxonomic population. Therefore, this variable can be considered a new codon usage index, whose use removes certain nonsignificant results found with the frequency statistic. This methodology identifies the common and rare codons (i.e., the codons having the highest and lowest occurrence) and leads to a model of codon evolution at three successive states: RNN, then RNY, and finally RYY. Some biological relations between this model and the YRY(N)6YRY preferential occurrence are also presented.
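
The count of 26 codons in the abstract above (six with two N, twelve with one N, eight fully specified) can be checked by direct enumeration. The short sketch below does that and also maps an ordinary codon onto the R/Y alphabet; the function names `ryn_codons` and `to_ry` are hypothetical and are not part of the cited methodology.

```python
from itertools import product

# Purines (A, G) map to R; pyrimidines (C, T, U) map to Y.
RY_CLASS = {"A": "R", "G": "R", "C": "Y", "T": "Y", "U": "Y"}


def ryn_codons():
    """Group the codons over {R, Y, N} by their number of N, excluding NNN."""
    groups = {0: [], 1: [], 2: []}
    for letters in product("RYN", repeat=3):
        codon = "".join(letters)
        n_count = codon.count("N")
        if n_count < 3:  # NNN carries no information and is excluded
            groups[n_count].append(codon)
    return groups


def to_ry(codon):
    """Translate a nucleotide codon such as 'AUG' into its R/Y form."""
    return "".join(RY_CLASS[base] for base in codon.upper())


if __name__ == "__main__":
    groups = ryn_codons()
    for n_count in (2, 1, 0):
        print(n_count, len(groups[n_count]), groups[n_count])
    print(to_ry("AUG"))
```

The three groups have 6, 12, and 8 members, which sum to the 26 codons stated in the abstract.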
