首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 968 毫秒
1.
In this study codon usage bias of all experimentally known genes of Lactococcus lactis has been analyzed. Since Lactococcus lactis is an AT rich organism, it is expected to occur A and/or T at the third position of codons and detailed analysis of overall codon usage data indicates that A and/or T ending codons are predominant in this organism. However, multivariate statistical analyses based both on codon count and on relative synonymous codon usage (RSCU) detect a large number of genes, which are supposed to be highly expressed are clustered at one end of the first major axis, while majority of the putatively lowly expressed genes are clustered at the other end of the first major axis. It was observed that in the highly expressed genes C and T ending codons are significantly higher than the lowly expressed genes and also it was observed that C ending codons are predominant in the duets of highly expressed genes, whereas the T endings codons are abundant in the quartets. Abundance of C and T ending codons in the highly expressed genes suggest that, besides, compositional biases, translational selection are also operating in shaping the codon usage variation among the genes in this organism as observed in other compositionally skewed organisms. The second major axis generated by correspondence analysis on simple codon counts differentiates the genes into two distinct groups according to their hydrophobicity values, but the same analysis computed with relative synonymous codon usage values could not discriminate the genes according to the hydropathy values. This suggests that amino acid composition exerts constraints on codon usage in this organism. On the other hand the second major axis produced by correspondence analysis on RSCU values differentiates the genes into two groups according to the synonymous codon usage for cysteine residues (rarest amino acids in this organism), which is nothing but a artifactual effect induced by the RSCU values. Other factors such as length of the genes and the positions of the genes in the leading and lagging strand of replication have practically no influence in the codon usage variation among the genes in this organism.  相似文献   

2.
Studies on codon usage in Entamoeba histolytica   总被引:13,自引:0,他引:13  
Codon usage bias of Entamoeba histolytica, a protozoan parasite, was investigated using the available DNA sequence data. Entamoeba histolytica having AT rich genome, is expected to have A and/or T at the third position of codons. Overall codon usage data analysis indicates that A and/or T ending codons are strongly biased in the coding region of this organism. However, multivariate statistical analysis suggests that there is a single major trend in codon usage variation among the genes. The genes which are supposed to be highly expressed are clustered at one end, while the majority of the putatively lowly expressed genes are clustered at the other end. The codon usage pattern is distinctly different in these two sets of genes. C ending codons are significantly higher in the putatively highly expressed genes suggesting that C ending codons are translationally optimal in this organism. In the putatively lowly expressed genes A and/or T ending codons are predominant, which suggests that compositional constraints are playing the major role in shaping codon usage variation among the lowly expressed genes. These results suggest that both mutational bias and translational selection are operational in the codon usage variation in this organism.  相似文献   

3.
Liu Q 《Bio Systems》2006,85(2):99-106
The main factors shaping codon usage bias in the Deinococcus radiodurans genome were reported. Correspondence analysis (COA) was carried out to analyze synonymous codon usage bias. The results showed that the main trend was strongly correlated with gene expression level assessed by the "Codon Adaptation Index" (CAI) values, a result that was confirmed by the distribution of genes along the first axis. The results of correlation analysis, variance analysis and neutrality plot indicated that gene nucleotide composition was clearly contributed to codon bias. CDS length was also key factor in dictating codon usage variation. A general tendency of more biased codon usage of genes with longer CDS length to higher expression level was found. Further, the hydrophobicity of each protein also played a role in shaping codon usage in this organism, which could be confirmed by the significant correlation between the positions of genes placed on the first axis and the hydrophobicity values (r=-0.100, P<0.01). In summary, gene expression level played a crucial role, nucleotide mutational bias, CDS length and the hydrophobicity of each protein just in a minor way in shaping the codon usage pattern of D. radiodurans. Notably, 19 codons firstly defined as "optimal codons" may provide useful clues for molecular genetic engineering and evolutionary studying.  相似文献   

4.
Compositional distributions in three different codon positions as well as codon usage biases of all available DNA sequences of Buchnera aphidicola genome have been analyzed. It was observed that GC levels among the three codon positions is I>II>III as observed in other extremely high AT rich organisms. B. aphidicola being an AT rich organism is expected to have A and/or T at the third positions of codons. Overall codon usage analyses indicate that A and/or T ending codons are predominant in this organism and some particular amino acids are abundant in the coding region of genes. However, multivariate statistical analysis indicates two major trends in the codon usage variation among the genes; one being strongly correlated with the GC contents at the third synonymous positions of codons, and the other being associated with the expression level of genes. Moreover, codon usage biases of the highly expressed genes are almost identical with the overall codon usage biases of all the genes of this organism. These observations suggest that mutational bias is the main factor in determining the codon usage variation among the genes in B. aphidicola.  相似文献   

5.
Synonymous codon usage of 53 protein coding genes in chloroplast genome of Coffea arabica was analyzed for the first time to find out the possible factors contributing codon bias. All preferred synonymous codons were found to use A/T ending codons as chloroplast genomes are rich in AT. No difference in preference for preferred codons was observed in any of the two strands, viz., leading and lagging strands. Complex correlations between total base compositions (A, T, G, C, GC) and silent base contents (A3, T3, G3, C3, GC3) revealed that compositional constraints played crucial role in shaping the codon usage pattern of C. arabica chloroplast genome. ENC Vs GC3 plot grouped majority of the analyzed genes on or just below the left side of the expected GC3 curve indicating the influence of base compositional constraints in regulating codon usage. But some of the genes lie distantly below the continuous curve confirmed the influence of some other factors on the codon usage across those genes. Influence of compositional constraints was further confirmed by correspondence analysis as axis 1 and 3 had significant correlations with silent base contents. Correlation of ENC with axis 1, 4 and CAI with 1, 2 prognosticated the minor influence of selection in nature but exact separation of highly and lowly expressed genes could not be seen. From the present study, we concluded that mutational pressure combined with weak selection influenced the pattern of synonymous codon usage across the genes in the chloroplast genomes of C. arabica.  相似文献   

6.
In the present study, major constraints for codon and amino acid usage of Sulfolobus acidocaldarius, Sulfolobus solfataricus, Sulfolobus tokodali, Sulfolobus islandis and 6 other isolates from islandicus species of genus Sulfolobus were investigated. Correspondence analysis revealed high significant correlation between the major trend of synonymous codon usage and gene expression level, as assessed by the “Codon Adaptation Index” (CAI). There is a significant negative correlation between Nc (Effective number of codons) and CAI demonstrating role of codon bias as an important determinant of codon usage. The significant correlation between major trend of synonymous codon usage and GC3s (G + C at third synonymous position) indicated dominant role of mutational bias in codon usage pattern. The result was further supported from SCUO (synonymous codon usage order) analysis. The amino acid usage was found to be significantly influenced by aromaticity and hydrophobicity of proteins. However, translational selection which causes a preference for codons that are most rapidly translated by current tRNA with multiple copy numbers was not found to be highly dominating for all studied isolates. Notably, 26 codons that were found to be optimally used by genes of S. acidocaldarius at higher expression level and its comparative analysis with 9 other isolates may provide some useful clues for further in vivo genetic studies on this genus.  相似文献   

7.
Adaptive codon usage provides evidence of natural selection in one of its most subtle forms: a fitness benefit of one synonymous codon relative to another. Codon usage bias is evident in the coding sequences of a broad array of taxa, reflecting selection for translational efficiency and/or accuracy as well as mutational biases. Here, we quantify the magnitude of selection acting on alternative codons in genes of the nematode Caenorhabditis remanei, an outcrossing relative of the model organism C. elegans, by fitting the expected mutation-selection-drift equilibrium frequency distribution of preferred and unpreferred codon variants to the empirical distribution. This method estimates the intensity of selection on synonymous codons in genes with high codon bias as N(e)s = 0.17, a value significantly greater than zero. In addition, we demonstrate for the first time that estimates of ongoing selection on codon usage among genes, inferred from nucleotide polymorphism data, correlate strongly with long-term patterns of codon usage bias, as measured by the frequency of optimal codons in a gene. From the pattern of polymorphisms in introns, we also infer that these findings do not result from the operation of biased gene conversion toward G or C nucleotides. We therefore conclude that coincident patterns of current and ancient selection are responsible for shaping biased codon usage in the C. remanei genome.  相似文献   

8.
Chromohalobacter salexigens, a Gammaproteobacterium belonging to the family Halomonadaceae, shows a broad salinity range for growth. In order to reveal the factors influencing architecture of protein coding genes in C. salexigens, pattern of synonymous codon usage bias has been investigated. Overall codon usage analysis of the microorganism revealed that C and G ending codons are predominantly used in all the genes which are indicative of mutational bias. Multivariate statistical analysis showed that the genes are separated along the first major explanatory axis according to their expression levels and their genomic GC content at the synonymous third positions of the codons. Both NC plot and correspondence analysis on Relative Synonymous Codon Usage (RSCU) indicates that the variation in codon usage among the genes may be due to mutational bias at the DNA level and natural selection acting at the level of mRNA translation. Gene length and the hydrophobicity of the encoded protein also influence the codon usage variation of genes to some extent. A comparison of the relative synonymous codon usage between 10% each of highly and lowly expressed genes determines 23 optimal codons, which are statistically over represented in the former group of genes and may provide useful information for salt-stressed gene prediction and gene-transformation. Furthermore, genes for regulatory functions; mobile and extrachromosomal element functions; and cell envelope are observed to be highly expressed. The study could provide insight into the gene expression response of halophilic bacteria and facilitate establishment of effective strategies to develop salt-tolerant crops of agronomic value.  相似文献   

9.
Enterogenic Escherichia coli (ETEC) F18 strains are the main pathogenic bacteria causing severe diarrhea in humans and domestic animals. However, the information about synonymous codon usage pattern of ETEC F18 genome remains unclear. We conducted a genome-wide analysis of synonymous codon usage patterns in the ETEC F18 strain SRA: SAMN02471895. After filtering of the complete genome sequence, 4327 coding sequences were analyzed using multivariate statistical methods to calculate synonymous codon usage patterns and to evaluate the influence of various factors in shaping the codon usage. The mean GC content was 51.38%, with a slight preference for G/C-ending codons. Twenty-two codons were determined as ‘‘optimal codons”. ENC plots showed some of the genes were on or close to the expected curve, while only points with low-ENC values were below the curve. PR2 analysis showed that GC and AT were not used proportionally, suggesting major roles for mutational pressure and natural selection in shaping usage. Neutrality plots showed a significant correlation between GC12 and GC3, suggesting that mutational pressure is responsible for nucleotide composition in shaping the strength of codon usage. Translational selection was the main factor shaping the codon usage pattern of ETEC F18 genome, while other factors such as protein length, GRAVY and ARO values also influenced codon usage to some extent. We analyzed the codon usage pattern systematically and identified the factors shaping codon usage bias in the ETEC F18 genome. Such information further elucidates the mechanisms of synonymous codon usage bias and provides the basis of molecular genetic engineering and evolutionary studies.  相似文献   

10.
Chlamydia trachomatis (C.t) is a Gram-negative obligate intracellular bacteria and is a major causative of infectious blindness and sexually transmitted diseases. Among the varied serovars of this organism, A, B and C are reported as prominent ocular pathogens. Genomic studies of these strains shall aid in deciphering potential drug targets and genomic influence on pathogenesis. Hence, in this study we performed deep statistical profiling of codon usage in these serovars. The overall base composition analysis reveals that these serovars are over biased to AU than GC. Similarly, relative synonymous codon usage also showed preference towards A/U ending codons. Parity Rule 2 analysis inferred unequal distribution of AT and GC, indicative of other unknown factors acting along with mutational pressure to influence codon usage bias (CUB). Moreover, absolute quantification of CUB also revealed lower bias across these serovars. The effect of natural selection on CUB was also confirmed by neutrality plot, reinforcing natural selection under mutational pressure turned to be a pivotal role in shaping the CUB in the strains studied. Correspondence analysis (COA) clarified that, C.t C/TW-3 to show a unique trend in codon usage variation. Host influence analysis on shaping the codon usage pattern also inferred some speculative relativity. In a nutshell, our finding suggests that mutational pressure is the dominating factor in shaping CUB in the strains studied, followed by natural selection. We also propose potential drug targets based on cumulative analysis of strand bias, CUB and human non-homologue screening.  相似文献   

11.
Liu Q  Feng Y  Xue Q 《Mitochondrion》2004,4(4):313-320
In this paper, the main factors shaping codon usage in the mitochondrion genome of rice were reported. Correspondence analysis, a commonly used multivariate statistical approach, was carried out to analyze synonymous codon usage bias. The results showed that the main trend was strongly correlated with the gene expression level assessed by the 'Codon Adaptation Index' value, a result that was confirmed by the distribution of genes along the first axis. From the results that there were two significant correlations between axis 1 coordinates and the GC, GC3s content at silent sites of each sequence, and clearly significant correlations between the 'Effective Number of Codons' values and GC, GC3s content, we inferred that codon usage bias was affected by gene nucleotide composition also. In addition, the hydrophobicity of each protein also played some roles in shaping codon usage in this organelle, which could be confirmed by the significant correlation between the positions of genes placed on the first axis and the hydrophobicity value of each protein. In summary, natural selection played a crucial role, nucleotide mutational bias and amino acid composition only in a minor way, in shaping codon usage in the mitochondrion genome of rice. Notably, 21 codons defined firstly as 'optimal codons' might provide some more useful information for gene engineering and/or evolution studying.  相似文献   

12.
Sau K  Gupta SK  Sau S  Mandal SC  Ghosh TC 《Bio Systems》2006,85(2):107-113
Synonymous codon and amino acid usage biases have been investigated in 903 Mimivirus protein-coding genes in order to understand the architecture and evolution of Mimivirus genome. As expected for an AT-rich genome, third codon positions of the synonymous codons of Mimivirus carry mostly A or T bases. It was found that codon usage bias in Mimivirus genes is dictated both by mutational pressure and translational selection. Evidences show that four factors such as mean molecular weight (MMW), hydropathy, aromaticity and cysteine content are mostly responsible for the variation of amino acid usage in Mimivirus proteins. Based on our observation, we suggest that genes involved in translation, DNA repair, protein folding, etc., have been laterally transferred to Mimivirus a long ago from living organism and with time these genes acquire the codon usage pattern of other Mimivirus genes under selection pressure.  相似文献   

13.

Background

Synonymous codon usage varies widely between genomes, and also between genes within genomes. Although there is now a large body of data on variations in codon usage, it is still not clear if the observed patterns reflect the effects of positive Darwinian selection acting at the level of translational efficiency or whether these patterns are due simply to the effects of mutational bias. In this study, we have included both intra-genomic and inter-genomic comparisons of codon usage. This allows us to distinguish more efficiently between the effects of nucleotide bias and translational selection.

Results

We show that there is an extreme degree of heterogeneity in codon usage patterns within the rice genome, and that this heterogeneity is highly correlated with differences in nucleotide content (particularly GC content) between the genes. In contrast to the situation observed within the rice genome, Arabidopsis genes show relatively little variation in both codon usage and nucleotide content. By exploiting a combination of intra-genomic and inter-genomic comparisons, we provide evidence that the differences in codon usage among the rice genes reflect a relatively rapid evolutionary increase in the GC content of some rice genes. We also noted that the degree of codon bias was negatively correlated with gene length.

Conclusion

Our results show that mutational bias can cause a dramatic evolutionary divergence in codon usage patterns within a period of approximately two hundred million years.The heterogeneity of codon usage patterns within the rice genome can be explained by a balance between genome-wide mutational biases and negative selection against these biased mutations. The strength of the negative selection is proportional to the length of the coding sequences. Our results indicate that the large variations in synonymous codon usage are not related to selection acting on the translational efficiency of synonymous codons.
  相似文献   

14.
It is important and meaningful to understand the codon usage pattern and the factors that shape codon usage of maize. In this study, trends in synonymous codon usage in maize have been firstly examined through the multivariate statistical analysis on 7402 cDNA sequences. The results showed that the genes positions on the primary axis were strongly negatively correlated with GC3s, GC content of individual gene and gene expression level assessed by the codon adaptation index (CAI) values, which indicated that nucleotide composition and gene expression level were the main factors in shaping the codon usage of maize, and the variation in codon usage among genes may be due to mutational bias at the DNA level and natural selection acting at the level of mRNA translation. At the same time, CDS length and the hydrophobicity of each protein were, respectively, significantly correlated with the genes locations on the primary axis, GC3s and CAI values. We infer that genes length and the hydrophobicity of the encoded protein may play minor role in shaping codon usage bias. Additional 28 codons ending with a G or C base have been defined as “optimal codons”, which may provide useful information for maize gene-transformation and gene prediction.  相似文献   

15.
Multivariate analysis of codon and amino acid usage was performed for three Leishmania species, including L. donovani, L. infantum and L. major. It was revealed that all three species are under mutational bias and translational selection. Lower GC 12 and higher GC 3S in all three parasites suggests that the ancestral highly expressed genes (HEGs), compared to lowly expressed genes (LEGs), might have been rich in AT-content. This also suggests that there must have been a faster rate of evolution under GC-bias in LEGs. It was observed from the estimation of synonymous/non-synonymous substitutions in HEGs that the HEG dataset of L. donovani is much closer to L. major evolutionarily. This is also supported by the higher d N value as compared to d S between L. donovani and L. major, suggesting the conservation of synonymous codon positions between these two species and the role of translational selection in shaping the composition of protein-coding genes.  相似文献   

16.
以普通野生稻(Oryza rufipogon Griff.)线粒体基因组为对象,分析其蛋白质编码基因的密码子使用特征及与亚洲栽培稻(O. sativa L.)的差异,探讨其密码子偏性形成的影响因素和进化过程。结果显示:普通野生稻线粒体基因组编码序列第1、第2和第3位碱基的GC含量依次为49.18%、42.67%和40.86%;有效密码子数(Nc)分布于45.32~61.00之间,其密码子偏性较弱; Nc值仅与GC_3呈显著相关,密码子第3位的碱基组成对密码子偏性影响较大;第1向量轴上显示9.91%的差异,其与GC3s、Nc、密码子偏好指数(CBI)和最优密码子使用频率(Fop)的相关性均达到显著水平;而GC_3和GC12的相关性未达到显著水平。因此,普通野生稻线粒体基因组密码子的使用偏性主要受自然选择压力影响而形成。本研究确定了21个普通野生稻线粒体基因组的最优密码子,大多以A或T结尾,与叶绿体密码子具有趋同进化,但是与核基因组具有不同的偏好性。同义密码子相对使用度(RSCU)、PR2偏倚分析和中性绘图分析显示,普通野生稻线粒体基因功能和其密码子使用密切相关,且线粒体密码子使用在普通野生稻、粳稻(O. sativa L. subsp. japonica Kato)和籼稻(O. sativa L. subsp.indica Kato)内具有同质性。  相似文献   

17.
In this study, major factors shaping codon and amino acid usage variation Lactobacillus sakei 23K were investigated. It included 13 other Lactobacillus species for a comparative analysis. The correspondence analysis (COA) showed that in 13 species the major trend of synonymous codon usage was highly correlated with gene expression level as assessed by the “Codon Adaptation Index” (CAI) values. In addition, Nc (effective number of codons) plot, SCUO (synonymous codon usage order) plot and correlation analyses showed that the base composition and mutational bias have dominant role in the codon usage variation. However, the translational selection for genes at higher expression level, where more frequent synonymous codons correspond to more abundant cognate transfer RNAs (tRNAs), was not found to be similar in all species. The study also showed that the amino acid usage in these species was significantly (P < 0.01) influenced by hydrophobicity and aromaticity of proteins. Furthermore, 24 codons that were found to be optimally used by L. sakei and its comparative study with 13 Lactobacillus species might provide some useful information in their further study of molecular evolution and genetic engineering.  相似文献   

18.
Codon usage in mitochondrial genome of the six different plants was analyzed to find general patterns of codon usage in plant mitochondrial genomes. The neutrality analysis indicated that the codon usage patterns of mitochondrial genes were more conserved in GC content and no correlation between GC12 and GC3. T and A ending codons were detected as the preferred codons in plant mitochondrial genomes. The Parity Rule 2 plot analysis showed that T was used more frequently than A. The ENC-plot showed that although a majority of the points with low ENC values were lying below the expected curve, a few genes lied on the expected curve. Correspondence analysis of relative synonymous codon usage yielded a first axis that explained only a partial amount of variation of codon usage. These findings suggest that natural selection is likely to be playing a large role in codon usage bias in plant mitochondrial genomes, but not only natural selection but also other several factors are likely to be involved in determining the selective constraints on codon bias in plant mitochondrial genomes. Meantime, 1 codon (P. patens), 6 codons (Z. mays), 9 codons (T. aestivum), 15 codons (A. thaliana), 15 codons (M. polymorpha) and 15 codons (N. tabacum) were defined as the preferred codons of the six plant mitochondrial genomes.  相似文献   

19.
Codon usage in Clonorchis sinensis was analyzed using 12,515 codons from 38 coding sequences. Total GC content was 49.83%, and GC1, GC2 and GC3 contents were 56.32%, 43.15% and 50.00%, respectively. The effective number of codons converged at 51-53 codons. When plotted against total GC content or GC3, codon usage was distributed in relation to GC3 biases. Relative synonymous codon usage for each codon revealed a single major trend, which was highly correlated with GC content at the third position when codons began with A or U at the first two positions. In codons beginning with G or C base at the first two positions, the G or C base rarely occurred at the third position. These results suggest that codon usage is shaped by a bias towards G or C at the third base, and that this is affected by the first and second bases.  相似文献   

20.
Rao Y  Wu G  Wang Z  Chai X  Nie Q  Zhang X 《DNA research》2011,18(6):499-512
Synonymous codons are used with different frequencies both among species and among genes within the same genome and are controlled by neutral processes (such as mutation and drift) as well as by selection. Up to now, a systematic examination of the codon usage for the chicken genome has not been performed. Here, we carried out a whole genome analysis of the chicken genome by the use of the relative synonymous codon usage (RSCU) method and identified 11 putative optimal codons, all of them ending with uracil (U), which is significantly departing from the pattern observed in other eukaryotes. Optimal codons in the chicken genome are most likely the ones corresponding to highly expressed transfer RNA (tRNAs) or tRNA gene copy numbers in the cell. Codon bias, measured as the frequency of optimal codons (Fop), is negatively correlated with the G + C content, recombination rate, but positively correlated with gene expression, protein length, gene length and intron length. The positive correlation between codon bias and protein, gene and intron length is quite different from other multi-cellular organism, as this trend has been only found in unicellular organisms. Our data displayed that regional G + C content explains a large proportion of the variance of codon bias in chicken. Stepwise selection model analyses indicate that G + C content of coding sequence is the most important factor for codon bias. It appears that variation in the G + C content of CDSs accounts for over 60% of the variation of codon bias. This study suggests that both mutation bias and selection contribute to codon bias. However, mutation bias is the driving force of the codon usage in the Gallus gallus genome. Our data also provide evidence that the negative correlation between codon bias and recombination rates in G. gallus is determined mostly by recombination-dependent mutational patterns.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号