Similar literature
1.
2.
In recent years, the amount of molecular sequencing data from Tetrahymena thermophila has dramatically increased. We analyzed G + C content, codon usage, initiator codon context and stop codon sites in the extremely A + T rich genome of this ciliate. Average G + C content was 38% for protein coding regions, 21% for 5' non-coding sequences, 19% for 3' non-coding sequences, 15% for introns, 19% for micronuclear limited sequences and 17% for macronuclear retained sequences flanking micronuclear specific regions. The 75 available T. thermophila protein coding sequences favored codons ending in T and, where possible, avoided those with G in the third position. Highly expressed genes were relatively G + C-rich and exhibited an extremely biased pattern of codon usage while developmentally regulated genes were more A + T-rich and showed less codon usage bias. Regions immediately preceding Tetrahymena translation initiator codons were generally A-rich. For the 60 stop codons examined, the frequency of G in the end + 1 site was much higher than expected whereas C never occupied this position.

3.
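Analysis of codon usage bias in the alfalfa chloroplast genome   Cited by: 1 (self-citations: 0, citations by others: 1)
Yu Feng (喻凤), Han Ming (韩明). Guihaia (《广西植物》), 2021, 41(12): 2069-2076
To analyze the codon usage pattern of the alfalfa chloroplast genome, 49 protein-coding sequences screened from the genome were studied, and their codon usage patterns and bias were examined using CodonW, CUSP, CHIPS, SPSS and other software. The results showed that: (1) the average GC content at the third codon position of alfalfa chloroplast genes was 26.44%, and the effective number of codons (ENC) ranged from 40.6 to 51.41, indicating that the bias of most codons was weak. (2) Relative synonymous codon usage (RSCU) analysis identified 30 codons with RSCU > 1, 29 of which end in A or U, indicating that A and U occur at high frequency in the alfalfa chloroplast genome. (3) Neutrality analysis found no significant correlation between GC3 and GC12, indicating that codon bias is shaped mainly by natural selection; ENC-plot analysis found that some genes fall below or around the expected curve, indicating that mutation also contributed to part of the codon bias. In addition, 17 codons were identified as optimal codons of the alfalfa chloroplast genome. Codon usage bias in the alfalfa chloroplast genome may therefore result from the combined action of natural selection and mutation. This study lays a foundation for chloroplast genetic engineering and for the genetic improvement of target traits in alfalfa.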
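The RSCU statistic used in this abstract can be illustrated with a short sketch. It is not the CodonW implementation; the function name `rscu` is hypothetical, and the standard genetic code is assumed (its amino acid assignments match those of the plastid translation table). RSCU is taken here as the observed count of a codon divided by the mean count of all synonymous codons for the same amino acid.

```python
from collections import Counter, defaultdict

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codon -> one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def rscu(cds):
    """Return {codon: RSCU} for the sense codons of an in-frame coding sequence."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))

    # Group the sense codons into synonymous families, one per amino acid.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    values = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue  # amino acid absent from the sequence: RSCU undefined
        for c in codons:
            # observed count / (family total / family size)
            values[c] = counts[c] * len(codons) / total
    return values
```

A value above 1 means the codon is used more often than expected under equal synonymous usage, which is the criterion the abstract applies when it counts 30 codons with RSCU > 1.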

4.
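This paper examined the codon usage pattern of the elongation factor 1 alpha gene in representative groups of Ascomycota. The results showed that codon usage bias in this gene is closely related to nucleotide base composition and is also affected by other selective pressures. Statistical analysis revealed the codon composition and coding characteristics of this gene across the ascomycete groups; in the pattern of synonymous codon choice, members of the Saccharomycetes showed a distinctive bias. Cluster analysis based on codon usage divergence reflected the taxonomic position of most groups reasonably well, but the degree of variation in codon bias differed within the individual classes.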

5.
Two species of the DNA virus Torque teno sus virus (TTSuV), TTSuV1 and TTSuV2, have become widely distributed in pig-farming countries in recent years. In this study, we performed a comprehensive analysis of synonymous codon usage bias in 41 available TTSuV2 coding sequences (CDS), and compared the codon usage patterns of TTSuV2 and TTSuV1. TTSuV codon usage patterns were found to be phylogenetically conserved. Values for the effective number of codons (ENC) indicated that the overall extent of codon usage bias in both TTSuV2 and TTSuV1 was not significant; the most frequently occurring codons had an A or C at the third codon position. Correspondence analysis (COA) was performed, and TTSuV2 and TTSuV1 sequences were located in different quadrants of the first two major axes. A plot of the ENC revealed that compositional constraint was the major factor determining the codon usage bias for TTSuV2. In addition, hierarchical cluster analysis of 41 TTSuV2 isolates based on relative synonymous codon usage (RSCU) values suggested that there was no association between geographic distribution and codon bias of TTSuV2 sequences. Finally, the comparison of RSCU for TTSuV2, TTSuV1 and the corresponding host sequence indicated that the codon usage pattern of TTSuV2 was similar to that of TTSuV1. However, the similarity was low for each virus and its host. These conclusions provide important insight into the synonymous codon usage pattern of TTSuV2, as well as a better understanding of the molecular evolution of TTSuV2 genomes.

6.
The hepatitis C virus (HCV) infects at least 3% of people worldwide, and continues to cause substantial morbidity and mortality. To better understand the phylodynamics and molecular evolution of HCV 1a, a phylogenetic analysis of 186 full-length genomic sequences isolated from five countries between 1977 and 2009 was conducted in this study. Nucleotide substitution rates and molecular epidemiology were assessed by Bayesian coalescent analysis using time-stamped entire coding sequences. We showed that the substitution rates of ten genomic regions are diverse and higher than previously estimated. The coalescent analysis indicated that the transmission of subtype 1a probably started in the second half of the twentieth century and that an explosion of epidemics occurred between the 1960s and 1980s. Selection analysis suggested that HCV 1a evolves under purifying selection. However, a total of 58 positively selected sites were detected, and further analysis suggested that these sites may play an important role in the adaptive evolution of HCV 1a strains. In addition, the codon usage of HCV 1a and the factors shaping its codon usage pattern were investigated to evaluate the dynamics of the virus evolution. Surveys of codon usage variation showed that mutational pressure and selection pressure account for the HCV 1a codon usage pattern.

7.
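To analyze differences in codon usage characteristics between the mitochondrial genomes of cultivated and wild soybean, the coding sequences of both mitochondrial genomes were studied, and the factors shaping their codon bias and its evolution were compared. The results showed that: (1) the GC content of the mitochondrial coding regions was 44.56% in cultivated soybean and 44.58% in wild soybean, indicating that the mitochondrial coding genes of both are rich in A/T bases. (2) In both cultivated and wild soybean, the correlation between the mean GC content at the first and second codon positions and the GC content at the third position was highly significant, indicating that the role of mutation in shaping codon bias cannot be ignored; PR2-plot analysis showed that, at the third position of synonymous codons, purines were used less frequently than pyrimidines; in the Nc-plot analysis, genes with an Nc ratio between -0.1 and 0.2 accounted for more than 95% of all genes; multiple factors, including mutation and selection, jointly shaped codon usage bias in the soybean mitochondrial coding sequences. (3) Twenty and 21 codons were identified as optimal codons of the mitochondrial coding sequences of cultivated and wild soybean, respectively, all of which end in A or T except the serine codon TCC. Taken together, the results suggest that selection has a greater influence on mitochondrial codon bias in cultivated soybean than in wild soybean, which may be a consequence of the long-term cultivation and domestication of cultivated soybean from wild soybean.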

8.
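To date, studies of the effect of synonymous codon usage bias on protein folding have used sample proteins drawn from different species. Because synonymous codon usage bias differs among species, nucleoproteins of Bacillus subtilis were chosen as the study objects. First, each nucleoprotein was divided by secondary structure into α-helix segments, β-sheet segments and random coil (α-β mixed) segments, and the protein folding rate of each segment was calculated. Then, the corresponding nucleic acid sequence information for each segment was compiled and its synonymous codon usage was calculated. On this basis, the correlation between synonymous codon usage bias and protein folding rate in B. subtilis nucleoproteins was analyzed. For peptide segments of each secondary structure type, the usage bias of some codons correlated significantly with the folding rate of the corresponding peptide. Further analysis found that the great majority of codons significantly correlated with segment folding rate were the most frequently used codon within each synonymous group in the B. subtilis whole-genome sequences or nucleoprotein sequences. The results indicate that synonymous codon usage bias in B. subtilis plays an important role in protein folding.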

9.
10.
Summary: This paper reports the cloning and characterization of a gene encoding galactoside acetyltransferase from a strain of Lactococcus lactis. A PstI library of L. lactis strain ATCC7962 DNA was constructed in plasmid pUC18. A clone harbouring a 10 kbp DNA fragment containing part of the lac operon was isolated using a labelled probe generated by PCR. DNA sequence analysis revealed the presence of a gene encoding a protein with 64.5% similarity to the galactoside acetyltransferase from Escherichia coli. The codon usage pattern of this gene was not typical of lactococcal genes. The lactococcal lac operon organization appears to be different from that of other organisms.

11.
12.
Structural analysis of 55 nearly full-length cDNA clones of rat liver alkaline phosphatase mRNAs revealed the presence of two totally different sequence stretches at the 5'-distal region starting from the position 88 nucleotides upstream of the initiation codon ATG. Since each of these two sequences, E1 and E2, was assigned on the rat genome about 36 kilobase pairs (kbp) and 10 kbp upstream of the common exon E3, respectively, they are presumably used as alternatively spliced exons. The distances between these sequences and E3 were unusually long, as compared with other intronic distances (0.4-4 kbp) observed between successive pairs of the eleven exons which are common to both types of mRNAs. The relative ratio of E1-containing mRNA to E2-mRNA was about three in the liver after bile-duct ligation and colchicine treatment.

13.
Codon usage in Clonorchis sinensis was analyzed using 12,515 codons from 38 coding sequences. Total GC content was 49.83%, and GC1, GC2 and GC3 contents were 56.32%, 43.15% and 50.00%, respectively. The effective number of codons converged at 51-53 codons. When plotted against total GC content or GC3, codon usage was distributed in relation to GC3 biases. Relative synonymous codon usage for each codon revealed a single major trend, which was highly correlated with GC content at the third position when codons began with A or U at the first two positions. In codons beginning with G or C base at the first two positions, the G or C base rarely occurred at the third position. These results suggest that codon usage is shaped by a bias towards G or C at the third base, and that this is affected by the first and second bases.
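The positional GC values quoted above (GC1, GC2, GC3) can be computed along the following lines. This is a minimal sketch rather than the authors' procedure; the function name `gc_by_position` is hypothetical, and the input is assumed to be an in-frame coding sequence.

```python
def gc_by_position(cds):
    """Return (GC1, GC2, GC3) in percent for an in-frame coding sequence."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    if usable == 0:
        raise ValueError("sequence must contain at least one complete codon")

    percentages = []
    for position in range(3):
        # Every third base, starting at codon position 1, 2 or 3.
        bases = seq[position:usable:3]
        gc_count = sum(base in "GC" for base in bases)
        percentages.append(100.0 * gc_count / len(bases))
    return tuple(percentages)
```

Because each codon position contributes the same number of bases, the total GC content of the coding sequence equals the mean of the three positional values; the figures reported in the abstract (56.32%, 43.15% and 50.00% against a total of 49.83%) agree with this to within rounding.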

14.
The cyclic nucleotide phosphodiesterase (phosphodiesterase) plays essential roles throughout the development of Dictyostelium discoideum. It is crucial to cellular aggregation and to postaggregation morphogenesis. The phosphodiesterase gene is transcribed into three mRNAs, containing the same coding sequence connected to different 5' untranslated sequences, that accumulate at different times during the life cycle. A 1.9-kilobase (kb) mRNA is specific for growth, a 2.4-kb mRNA is specific for aggregation, and a 2.2-kb mRNA is specific for late development and is only expressed in prestalk cells. Hybridization of RNA isolated from cells at various stages of development with different upstream regions of the gene indicated separate promoters for each of the three mRNAs. The existence of specific promoters was confirmed by fusing the three putative promoter regions to the chloramphenicol acetyltransferase reporter gene, and the analysis of transformants containing these constructs. The three promoters are scattered within a 4.1-kilobase pair (kbp) region upstream of the initiation codon. The late promoter is proximal to the coding sequence, the growth-specific promoter has an initiation site that is 1.9 kbp upstream of the ATG codon, and the aggregation-specific promoter has an initiation site 3 kbp upstream.

15.
16.
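Analysis of differences in ORF1a codon usage bias among PRRSV strains   Cited by: 1 (self-citations: 0, citations by others: 1)
CodonW, ClustalX and TreeView, together with the online tools EMBOSS (The European Molecular Biology Open Software Suite) and CIMminer, were used for cluster analysis of codon usage bias in the ORF1a genes of 29 selected PRRSV strains. Correlation analysis of CAI, CBI, Fop, Nc, GC3s, GC content and gene length showed that the codon usage bias of ORF1a differed among PRRSV strains, with Lelystad virus, LV4.2.1, VR-2332 and RespPRRS MLV differing markedly from the highly pathogenic PRRSV variants isolated in China. Cluster analysis of codon usage frequencies showed that CC-1, NVSL 97-7895, CH-1a, RespPRRS MLV, LV4.2.1 and Lelystad virus were distant from the highly pathogenic PRRSV variants, whereas the Chinese isolates clustered relatively close to one another. This result was largely consistent with the phylogenetic tree built from amino acid sequence alignment. Differences in ORF1a codon usage bias are therefore closely related to the genetic diversity of the virus.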

17.
A novel bias in codon third-letter usage was found in Escherichia coli genes with low fractions of "optimal codons", by comparing intact sequences with control random sequences. Third-letter usage has been found to be biased according to preference in codon usage and to doublet preference from the following first letter. The present study examines third-letter usage in the context of the nucleotide sequence when these preferences are considered. In order to exclude any influence by these factors, the random sequences were generated such that the amino acid sequence, codon usage, and the doublet frequency in each gene were all preserved. Comparison of intact sequences with these randomly generated sequences reveals that third letters of codons show a strong preference for the purine/pyrimidine pattern of the next codons: purine (R) is preferred to pyrimidine (Y) at the third site when followed by an R-Y-R codon, and pyrimidine is preferred when followed by an R-R-Y, an R-Y-Y or a Y-R-Y codon. This bias is probably related to interactions of tRNA molecules in the ribosome.

18.
Hepatitis C virus (HCV) infection is increasing alarmingly worldwide; it causes chronic hepatitis, liver cirrhosis and hepatocellular carcinoma, so there is an urgent need to develop an effective vaccine in sufficient quantity. The HCV envelope protein E2 is the main target for vaccine development. Recombinant proteins are now used successfully as vaccines for many diseases. However, it is challenging to produce sufficient quantities of many recombinant proteins from their expression hosts. One of the main factors affecting the success of expression of foreign genes in heterologous hosts is the divergence of codon usage of the target gene from that used in the expression system. In this study, we optimized the E2 genes of various HCV genotypes according to the codon usage of Pichia pastoris and predicted the expression level. The adaptation of E2 synonymous codon usage to that of P. pastoris was estimated using the relative synonymous codon usage (RSCU) value, the codon adaptation index (CAI) and the effective number of codons (ENC). The CAI of the optimized HCV E2 sequences increased from 0.638 to 0.833 and the GC content decreased from 56.05% to 44.05%; these values differed significantly (p < 0.01) from those of the native sequences. Codons with an RSCU value of less than one were replaced with the most preferred synonymous codons. The ENC values of the optimized HCV E2 sequences varied from 47.00 to 47.50, with a mean value of 47.15 and an SD of 0.14. Our study suggests that, judging from the predicted expression levels, the codon-optimized HCV E2 protein could be produced in sufficient quantity in the expression host; knowledge of the codon usage patterns of E2 of various genotypes should facilitate the production of a promising vaccine candidate for HCV.
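The codon adaptation index mentioned in this abstract can be sketched as follows. This is an illustration of the general definition (geometric mean of relative adaptiveness weights), not the tool or reference set used in the study. The function names, the 0.5 pseudocount for codons absent from the reference set, and the use of the standard genetic code are all assumptions; the reference sequences would have to be supplied by the reader, for example coding sequences of highly expressed host genes.

```python
import math
from collections import Counter, defaultdict

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codon -> one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codons_of(cds):
    """Split an in-frame coding sequence into codons, dropping a partial tail."""
    seq = cds.upper().replace("U", "T")
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def relative_adaptiveness(reference_cds, pseudocount=0.5):
    """Weight each codon by its count relative to the most used synonym."""
    counts = Counter(c for cds in reference_cds for c in codons_of(cds))
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa not in "*MW":  # skip stops and the single-codon amino acids
            families[aa].append(codon)
    weights = {}
    for codons in families.values():
        # Assumption: unseen codons get a small pseudocount to avoid log(0).
        adjusted = {c: counts[c] or pseudocount for c in codons}
        top = max(adjusted.values())
        for c in codons:
            weights[c] = adjusted[c] / top
    return weights


def cai(cds, weights):
    """Geometric mean of the weights of the scored codons in the sequence."""
    logs = [math.log(weights[c]) for c in codons_of(cds) if c in weights]
    if not logs:
        raise ValueError("sequence contains no scorable codons")
    return math.exp(sum(logs) / len(logs))
```

Replacing a rare codon with the most used synonym raises its weight to 1, so the index moves towards 1 as a sequence is optimized, which is the direction of the change from 0.638 to 0.833 reported above.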

19.
Codon usage in the mitochondrial genomes of six different plants was analyzed to find general patterns of codon usage in plant mitochondrial genomes. The neutrality analysis indicated that the codon usage patterns of mitochondrial genes were more conserved in GC content and that there was no correlation between GC12 and GC3. T- and A-ending codons were detected as the preferred codons in plant mitochondrial genomes. The Parity Rule 2 plot analysis showed that T was used more frequently than A. The ENC-plot showed that, although a majority of the points with low ENC values lay below the expected curve, a few genes lay on the expected curve. Correspondence analysis of relative synonymous codon usage yielded a first axis that explained only part of the variation in codon usage. These findings suggest that natural selection is likely to play a large role in codon usage bias in plant mitochondrial genomes, but that several other factors are also likely to be involved in determining the selective constraints on codon bias. In addition, 1 codon (P. patens), 6 codons (Z. mays), 9 codons (T. aestivum), 15 codons (A. thaliana), 15 codons (M. polymorpha) and 15 codons (N. tabacum) were defined as the preferred codons of the six plant mitochondrial genomes.
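The Parity Rule 2 comparison of A and T usage can be sketched as below. The abstract does not say which codons were included; this sketch assumes the common choice of third positions in the eight four-fold degenerate codon families of the standard genetic code, and the function name `pr2_point` is hypothetical.

```python
# First two bases of the eight four-fold degenerate families in the standard
# genetic code (Leu CTN, Val GTN, Ser TCN, Pro CCN, Thr ACN, Ala GCN,
# Arg CGN, Gly GGN).
FOURFOLD_PREFIXES = {"CT", "GT", "TC", "CC", "AC", "GC", "CG", "GG"}


def pr2_point(cds):
    """Return (G3/(G3+C3), A3/(A3+T3)) over four-fold degenerate codons."""
    seq = cds.upper().replace("U", "T")
    third = dict.fromkeys("ACGT", 0)
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        # Count only unambiguous third bases of four-fold degenerate codons.
        if codon[:2] in FOURFOLD_PREFIXES and codon[2] in third:
            third[codon[2]] += 1

    if third["A"] + third["T"] == 0 or third["G"] + third["C"] == 0:
        raise ValueError("not enough four-fold degenerate codons to form ratios")
    x = third["G"] / (third["G"] + third["C"])
    y = third["A"] / (third["A"] + third["T"])
    return x, y
```

A gene with no strand-specific bias plots at (0.5, 0.5). A vertical coordinate below 0.5 corresponds to T being used more often than A at the third position, which is the pattern the abstract reports.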

20.
The definition of a typical sec-dependent bacterial signal peptide contains a positive charge at the N-terminus, thought to be required for membrane association. In this study the amino acid distribution of all Escherichia coli secretory proteins was analysed. This revealed that there was a statistically significant bias for lysine at the second codon position (P2), consistent with a role for the positive charge in secretion. Removal of the positively charged residue at P2 in two different model systems revealed that a positive charge is not required for protein export. A well-characterized feature of large amino acids like lysine at P2 is inhibition of N-terminal methionine removal by methionyl amino-peptidase (MAP). Substitution of lysine at P2 with other large or small amino acids did not affect protein export. Analysis of codon usage revealed that there was a bias for the AAA lysine codon at P2, suggesting that a non-coding function for the AAA codon may be responsible for the strong bias for lysine at P2 of secretory signal sequences. We conclude that the selection for high translation initiation efficiency may be the selective pressure that has led to codon and consequent amino acid usage at P2 of secretory proteins.
