Similar articles
 Found 20 similar articles (search time: 31 ms)
1.
Synonymous codon usage bias is a broadly observed phenomenon in bacteria, plants, and invertebrates and may result from selection. However, the role of selective pressures in shaping codon bias is still controversial in vertebrates, particularly for mammals. The myosin heavy-chain (MyHC) gene family comprises multiple isoforms of the major force-producing contractile protein in cardiac and skeletal muscles. Slow and fast genes are tandemly arrayed on separate chromosomes, and have distinct patterns of functionality and expression in muscle. We analyze both full-length MyHC genes (~5400 bp) and a larger collection of partial sequences at the 3' end (~500 bp). The MyHC isoforms are an interesting system in which to study codon usage bias because of their length, expression, and critical importance to organismal mobility. Codon bias and GC content differ among MyHC genes with regard to functional type, isoform, and position within the gene. Codon bias even varies by isoform within a species. We find evidence in favor of both chromosomal influences on nucleotide composition and selection against nonsense errors (SANE) acting on codon usage in MyHC genes. Intragenic variation in codon bias and elongation rate is significant, with a strong trend for increasing codon bias and elongation rate towards the 3' end of the gene, although the trend is dependent upon the degeneracy class of the codons. Therefore, patterns of codon usage in MyHC genes are consistent with models supporting SANE as a major force shaping codon usage.

2.
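The complete genome sequence of Escherichia coli K-12 strain MG1655 was obtained from GenBank, and several parameters related to codon usage bias (Nc, CAI, GC, GC3s) were calculated. The relationships between codon usage bias and both the length of the mRNA coding region and its propensity to form secondary structure were analyzed statistically. Although translation efficiency (including translation speed and translation accuracy) is the main factor constraining the codon usage bias of highly expressed E. coli genes, the length of the mRNA coding region and its propensity to form secondary structure are also non-negligible contributors to this bias, and they weaken it to some extent. The biological significance of the propensity of mRNA coding regions to form secondary structure is also discussed.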
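
Two of the parameters named in this abstract, GC and GC3s, can be illustrated with a minimal Python sketch. It assumes the standard genetic code and takes GC3s to mean the G+C fraction at the third position of synonymous codons (Met, Trp, and stop codons excluded); the function names `gc_content` and `gc3s` are hypothetical and are not taken from the study.

```python
from itertools import product

# Standard genetic code (an assumption; other translation tables differ).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def split_codons(cds):
    """Split a coding sequence into complete codons, dropping any trailing bases."""
    cds = cds.upper()
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


def gc_content(cds):
    """Fraction of G and C bases over the whole sequence."""
    cds = cds.upper()
    return sum(base in "GC" for base in cds) / len(cds)


def gc3s(cds):
    """G+C fraction at the third position of synonymous codons only."""
    # Skip Met (M), Trp (W), stop codons (*), and codons with ambiguous bases.
    synonymous = [c for c in split_codons(cds)
                  if CODON_TABLE.get(c, "*") not in "MW*"]
    if not synonymous:
        raise ValueError("no synonymous codons in sequence")
    return sum(c[2] in "GC" for c in synonymous) / len(synonymous)
```

For example, `gc3s("ATGGCCAAATGGTTTTAA")` considers only GCC, AAA, and TTT, so one of three third positions is G or C.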

3.
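The button mushroom Agaricus bisporus is one of the most widely cultivated edible fungi in the world. This study analyzed codon usage bias in the A. bisporus genome to explore the factors that influence codon bias and its effect on gene expression. Based on A. bisporus genome and transcriptome data, we analyzed genomic genes, high expression genes (HEG), and low expression genes (low expre...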

4.
The molecular evolution of the histone multigene family was studied by cloning and determining the nucleotide sequences of the histone 3 genes in seven Drosophila species, D. takahashii, D. lutescens, D. ficusphila, D. persimilis, D. pseudoobscura, D. americana and D. immigrans. CT repeats, a TATA box and an AGTG motif in the 5' region, and a hairpin loop and purine-rich motifs (CAA(T/G)GAGA) in the 3' region were conserved even in distantly related species. In D. hydei and D. americana, the GC content at the third codon position in the protein coding region was relatively low (49% and 45%), while in D. takahashii and D. lutescens it was relatively high (64% and 65%). The non-significant correlation between the GC contents in the 3' region and at the third codon position, as well as the evidence of less constraint in the 3' region, suggested that mutational bias may not be the major mechanism responsible for the biased nucleotide change at the third codon position or for codon usage bias.

5.
To understand the synonymous codon usage pattern in the mitochondrial genome of Antheraea assamensis, we analyzed the 13 mitochondrial protein-coding genes of this species using a bioinformatic approach, as no such work had been reported. The nucleotide composition analysis suggested that the percentages of A, T, G, and C were 33.73, 46.39, 9.7 and 10.17, respectively, and the overall GC content was 19.86%, that is, lower than 50%, so the genes were AT rich. The mean effective number of codons of the mitochondrial protein-coding genes was 36.30, which indicated low codon usage bias (CUB). Relative synonymous codon usage analysis suggested overrepresented and underrepresented codons in each gene, and the pattern of codon usage differed among genes. Neutrality plot analysis revealed a narrow range of distribution for GC content at the third codon position, and some points were diagonally distributed, suggesting that both mutation pressure and natural selection influenced the CUB.
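
The neutrality plot mentioned in this abstract is commonly drawn by plotting, for each gene, the mean GC content of the first and second codon positions (GC12) against GC3 and fitting a regression line. The Python sketch below shows that general idea only; it is not the authors' pipeline, the helper names are hypothetical, and for simplicity it counts every codon, including start and stop codons.

```python
def gc_by_position(cds):
    """Return (GC1, GC2, GC3): the G+C fraction at each codon position."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3          # ignore an incomplete final codon
    counts = [0, 0, 0]
    for i in range(usable):
        if cds[i] in "GC":
            counts[i % 3] += 1
    n_codons = usable // 3
    return tuple(c / n_codons for c in counts)


def neutrality_slope(genes):
    """Least-squares slope of GC12 (y) regressed on GC3 (x) across genes."""
    points = []
    for cds in genes:
        gc1, gc2, gc3 = gc_by_position(cds)
        points.append((gc3, (gc1 + gc2) / 2))
    mean_x = sum(x for x, _ in points) / len(points)
    mean_y = sum(y for _, y in points) / len(points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    if sxx == 0:
        raise ValueError("GC3 does not vary across genes")
    return sxy / sxx
```

Under the usual reading of such plots, a slope close to 1 points toward mutation pressure acting on all codon positions alike, while a slope close to 0 points toward selective constraint on positions 1 and 2.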

6.
In some Drosophila species, there are two types of greatly diverged amylase (Amy) genes (Amy clusters 1 and 2), each encoding active amylase isozymes. Cluster 1 is located at the middle of its chromosomal arm, and the region has a normal local recombination rate. However, cluster 2 is near the centromere, and this region is known to have a reduced recombination rate. Although nonsynonymous substitutions follow a molecular clock, synonymous substitutions were accelerated in cluster 2 after gene duplications. This resulted in a higher GC content at the third codon position (GC3) and codon usage bias in cluster 1, and lower GC3 content and codon usage bias in cluster 2. However, no systematic difference in GC content was observed in the first and second codon positions or the 3'-flanking regions. Therefore, differences in local recombination rate rather than mutation bias might explain the divergence at synonymous sites between the two Amy clusters within species (Hill-Robertson effect). Alternatively, the different patterns and levels of expression between the two clusters may imply that the reduced expression level in cluster 2 caused by chromatin potentiation decreased the codon bias. Both of these hypotheses imply the importance of the genomic background as a driving force of divergence between non-tandemly duplicated genes.

7.
Divergence in codon usage of Lactobacillus species.   Cited by: 3 (self-citations: 0; citations by others: 3)
We have analyzed codon usage patterns of 70 sequenced genes from different Lactobacillus species. Codon usage in lactobacilli is highly biased. Both inter-species and intra-species heterogeneity of codon usage bias was observed. Codon usage in L. acidophilus is similar to that in L. helveticus, but dissimilar to that in L. bulgaricus, L. casei, L. pentosus and L. plantarum. Codon usage in the latter three organisms is not significantly different, but is different from that in L. bulgaricus. Inter-species differences in codon usage can, at least in part, be explained by differences in mutational drift. L. bulgaricus shows GC drift, whereas all other species show AT drift. L. acidophilus and L. helveticus rarely use NNG in family-box (a set of synonymous) codons, in contrast to all other species. This result may be explained by assuming that L. acidophilus and L. helveticus, but not other species examined, use a single tRNA species for translation of family-box codons. Differences in expression level of genes are positively correlated with codon usage bias. Highly expressed genes show highly biased codon usage, whereas weakly expressed genes show much less biased codon usage. Codon usage patterns at the 5'-end of Lactobacillus genes are not significantly different from those of entire genes. The GC content of codons 2-6 is significantly reduced compared with that of the remainder of the gene. The possible implications of a reduced GC content for the control of translation efficiency are discussed.

8.
Analysis of synonymous codon usage bias in Chlamydia   Cited by: 9 (self-citations: 0; citations by others: 9)
Chlamydiae are obligate intracellular bacterial pathogens that cause ocular and sexually transmitted diseases, and are associated with cardiovascular diseases. The analysis of codon usage may improve our understanding of the evolution and pathogenesis of Chlamydia and allow reengineering of target genes to improve their expression for gene therapy. Here, we analyzed the codon usage of C. muridarum, C. trachomatis (here indicating biovar trachoma and LGV), C. pneumoniae, and C. psittaci using the codon usage database and the CUSP (Create a codon usage table) program of EMBOSS (The European Molecular Biology Open Software Suite). The results show that the four genomes have similar codon usage patterns, with a strong bias towards the codons with A and T at the third codon position. Compared with Homo sapiens, the four chlamydial species show seven or eight discordant preferred codons. The ENC (effective number of codons used in a gene)-plot reveals that the genetic heterogeneity in Chlamydia is constrained by the G+C content, while translational selection and gene length exert relatively weaker influences. Moreover, mutational pressure appears to be the major determinant of the codon usage variation among the chlamydial genes. In addition, we compared the codon preferences of C. trachomatis with those of E. coli, yeast, adenovirus and Homo sapiens. There are 23 codons showing distinct usage differences between C. trachomatis and E. coli, 24 between C. trachomatis and adenovirus, 21 between C. trachomatis and Homo sapiens, but only six codons between C. trachomatis and yeast. Therefore, the yeast system may be more suitable for the expression of chlamydial genes. Finally, we compared the codon preferences of C. trachomatis with those of six eukaryotes, eight prokaryotes and 23 viruses. There is a strong positive correlation between the differences in coding GC content and the variations in codon bias (r=0.905, P<0.001). We conclude that the variation of codon bias between C. trachomatis and other organisms is much less influenced by phylogenetic lineage and primarily determined by the extent of disparities in GC content.

9.
Base composition varies among and within eukaryote genomes. Although mutational bias and selection have initially been invoked, more recently GC-biased gene conversion (gBGC) has been proposed to play a central role in shaping nucleotide landscapes, especially in yeast, mammals, and birds. gBGC is a kind of meiotic drive in favor of G and C alleles, associated with recombination. Previous studies have also suggested that gBGC could be at work in grass genomes. However, these studies were carried out on third codon positions that can undergo selection on codon usage. As most preferred codons end in G or C in grasses, gBGC and selection can be confounded. Here we investigated further the forces that might drive GC content evolution in the rice genus using both coding and noncoding sequences. We found that recombination rates correlate positively with equilibrium GC content and that selfing species (Oryza sativa and O. glaberrima) have significantly lower equilibrium GC content compared with more outcrossing species. As recombination is less efficient in selfing species, these results suggest that recombination drives GC content. We also detected a positive relationship between expression levels and GC content in third codon positions, suggesting that selection favors codons ending with G or C bases. However, the correlation between GC content and recombination cannot be explained by selection on codon usage alone as it was also observed in noncoding positions. Finally, analyses of polymorphism data ruled out the hypothesis that genomic variation in GC content is due to mutational processes. Our results suggest that both gBGC and selection on codon usage affect GC content in the Oryza genus and likely in other grass species.

10.
11.
Background: Synonymous codon usage bias has typically been correlated with, and attributed to, translational efficiency. However, there are other pressures on genomic sequence composition that can affect codon usage patterns, such as mutational biases. This study provides an analysis of the codon usage patterns in Arabidopsis thaliana in relation to gene expression levels, codon volatility, mutational biases and selective pressures. Results: We have performed synonymous codon usage and codon volatility analyses for all genes in the A. thaliana genome. In contrast to reports for species from other kingdoms, we find that neither codon usage nor volatility is correlated with selection pressure (as measured by dN/dS), nor with gene expression levels on a genome-wide level. Our results show that codon volatility and usage are not synonymous; rather, they are correlated with the abundance of G and C at the third codon position (GC3). Conclusions: Our results indicate that while the A. thaliana genome shows evidence for synonymous codon usage bias, this is not related to the expression levels of its constituent genes. Neither codon volatility nor codon usage is correlated with expression levels or selective pressures but, because they are directly related to the composition of G and C at the third codon position, they are the result of mutational bias. Therefore, in A. thaliana codon volatility and usage do not result from selection for translation efficiency or protein functional shift as measured by positive selection.

12.
Ubiquitin is ubiquitous in all eukaryotes and its amino acid sequence shows extreme conservation. Ubiquitin genes comprise direct repeats of the ubiquitin coding unit with no spacers. The nucleotide sequences coding for 13 ubiquitin genes from 11 species reported so far have been compiled and analyzed. The G+C content of the codon third base reveals a positive linear correlation with the genome G+C content of the corresponding species. The slope strongly suggests that the overall G+C content of codons of polyubiquitin genes clearly reflects the genome G+C content by AT/GC substitutions at the codon third position. The G+C content of the ubiquitin codon third base also shows a positive linear correlation with the overall G+C content of coding regions of compiled genes, indicating that the codon choices among synonymous codons reflect the average codon usage pattern of the corresponding species. On the other hand, the monoubiquitin gene, which is different from the polyubiquitin gene in gene organization, gene expression, and function of the encoded protein, shows a different codon usage pattern compared with that of the polyubiquitin gene. From comparisons of the levels of synonymous substitutions among ubiquitin repeats and the homology of the amino acid sequence of the tail of monomeric ubiquitin genes, we propose that the molecular evolution of ubiquitin genes occurred as follows: Plural primitive ubiquitin sequences were dispersed across the genome in ancestral eukaryotes. Some of them situated in a particular environment fused with the tail sequence to produce monomeric ubiquitin genes that were maintained across species. After divergence of species, polyubiquitin genes were formed by duplication of the other primitive ubiquitin sequences on different chromosomes. Differences in the environments in which ubiquitin genes are embedded are reflected in the differences in codon choice and in gene expression pattern between poly- and monomeric ubiquitin genes.

13.
Analysis of the synonymous codon usage pattern in the genome of a thermophilic cyanobacterium, Thermosynechococcus elongatus BP-1, using multivariate statistical analysis revealed a single major explanatory axis accounting for codon usage variation in the organism. This axis is correlated with the GC content at the third base of synonymous codons (GC3s) in correspondence analysis of T. elongatus genes. A negative correlation was observed between the effective number of codons (Nc) and GC3s. Results suggested a mutational bias as the major factor in shaping codon usage in this cyanobacterium. In comparison to the lowly expressed genes, highly expressed genes of this organism possess a significantly higher proportion of pyrimidine-ending codons, suggesting that, besides mutational bias, translational selection also influenced codon usage variation in T. elongatus. Correspondence analysis of relative synonymous codon usage (RSCU) with A, T, G, C at third positions (A3s, T3s, G3s, C3s, respectively) also supported this fact, and expression levels of genes and gene length also influenced codon usage. A role of translational accuracy was identified in dictating the codon usage variation of this genome. Results indicated that although mutational bias is the major factor in shaping codon usage in T. elongatus, factors like translational selection, translational accuracy and gene expression level also influenced codon usage variation.
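
Relative synonymous codon usage (RSCU), the quantity fed into the correspondence analysis above, is conventionally the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The Python sketch below follows that textbook definition under the standard genetic code; the function name `rscu` is hypothetical and the code is an illustration, not the authors' software.

```python
from collections import Counter
from itertools import product

# Standard genetic code (an assumption; bacterial table 11 uses the same
# codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds_list):
    """Return {codon: RSCU} pooled over a list of coding sequences."""
    counts = Counter()
    for cds in cds_list:
        cds = cds.upper()
        for i in range(0, len(cds) - len(cds) % 3, 3):
            codon = cds[i:i + 3]
            if codon in CODON_TABLE:          # skip codons with ambiguous bases
                counts[codon] += 1

    # Group codons into synonymous families, one family per amino acid.
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:     # stops, Met, Trp carry no choice
            continue
        total = sum(counts[c] for c in family)
        if total == 0:                        # amino acid absent from the input
            continue
        for codon in family:
            # observed / (total / family size)
            values[codon] = counts[codon] * len(family) / total
    return values
```

An RSCU of 1 means a codon is used exactly as often as expected under equal usage; values above 1 mark overrepresented codons and values below 1 mark underrepresented ones.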

14.
Salim HM, Ring KL, Cavalcanti AR. Protist, 2008, 159(2): 283-298
We used the recently sequenced genomes of the ciliates Tetrahymena thermophila and Paramecium tetraurelia to analyze the codon usage patterns in both organisms; we have analyzed codon usage bias, Gln codon usage, GC content and the nucleotide contexts of initiation and termination codons in Tetrahymena and Paramecium. We also studied how these trends change along the length of the genes and in a subset of highly expressed genes. Our results corroborate some of the trends previously described in Tetrahymena, but also negate some specific observations. In both genomes we found a strong bias toward codons with low GC content; however, in highly expressed genes this bias is smaller and codons ending in GC tend to be more frequent. We also found that codon bias increases along gene segments and in highly expressed genes, and that the contexts surrounding initiation and termination codons are always AT rich. Our results also suggest differences in the efficiency of translation of the reassigned stop codons between the two species and between the reassigned codons. Finally, we discuss some of the possible causes for such translational efficiency differences.

15.
It is important to understand the codon usage pattern of maize and the factors that shape it. In this study, trends in synonymous codon usage in maize were examined for the first time through multivariate statistical analysis of 7402 cDNA sequences. The results showed that gene positions on the primary axis were strongly negatively correlated with GC3s, the GC content of individual genes, and gene expression level as assessed by codon adaptation index (CAI) values. This indicated that nucleotide composition and gene expression level were the main factors shaping the codon usage of maize, and that the variation in codon usage among genes may be due to mutational bias at the DNA level and natural selection acting at the level of mRNA translation. At the same time, CDS length and the hydrophobicity of each protein were each significantly correlated with gene positions on the primary axis, GC3s, and CAI values. We infer that gene length and the hydrophobicity of the encoded protein may play a minor role in shaping codon usage bias. In addition, 28 codons ending with a G or C base were defined as "optimal codons", which may provide useful information for maize gene transformation and gene prediction.
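
The codon adaptation index (CAI) used here as an expression proxy is usually computed as the geometric mean of per-codon relative adaptiveness values, where each value is a codon's count in a reference set of highly expressed genes divided by the count of the most used synonymous codon. The Python sketch below follows that general definition; the reference set, the function names, and the 0.5 stand-in for codons never seen in the reference are all assumptions made for illustration, not details from the maize study.

```python
import math
from collections import Counter
from itertools import product

# Standard genetic code (an assumption).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def split_codons(cds):
    """Split a coding sequence into complete codons."""
    cds = cds.upper()
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


def relative_adaptiveness(reference_cds):
    """Per-codon weights w from a (hypothetical) highly expressed reference set."""
    counts = Counter(c for cds in reference_cds for c in split_codons(cds))
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)
    weights = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:     # stops, Met, Trp are excluded
            continue
        # Assumption: unseen codons get a count of 0.5 so log(w) stays finite.
        adjusted = {c: counts[c] or 0.5 for c in family}
        top = max(adjusted.values())
        for codon in family:
            weights[codon] = adjusted[codon] / top
    return weights


def cai(cds, weights):
    """Geometric mean of w over the scorable codons of one gene."""
    logs = [math.log(weights[c]) for c in split_codons(cds) if c in weights]
    if not logs:
        raise ValueError("no scorable codons in sequence")
    return math.exp(sum(logs) / len(logs))
```

A gene built only from the most used codon of each family scores 1.0, and the score falls as rarer synonymous codons are substituted in.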

16.
Eck S, Stephan W. Gene, 2008, 424(1-2): 102-107
There are several sequence-dependent factors regulating gene expression. Some of them have been extensively studied; among the most prominent are GC content and codon usage bias. Other factors hypothesized to have an impact on gene expression are gene length and the thermodynamic stability of mRNA secondary structure. In this work, we analyzed two different microarray datasets of Drosophila melanogaster gene expression and one dataset of Escherichia coli. To investigate the relationship between gene expression, codon usage bias, GC content of the first, second and third codon positions, gene length and mRNA stability, we employed a multiple regression analysis using a comprehensive linear model. It is shown that codon usage bias and GC content of the first, second and third codon positions have a significant influence on gene expression, whereas no significant effect of mRNA secondary structure stability is observed.

17.
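Codon usage bias patterns in the grape genome   Cited by: 2 (self-citations: 0; citations by others: 2)
Based on the complete genome sequence, multivariate statistical analysis and correspondence analysis were used to investigate the codon usage pattern of the whole grape genome and the possible factors influencing codon usage. The results show that codon usage bias in grape is shaped jointly by base composition differences (r=0.925) and natural selection (r=0.193); mutational pressure is the dominant factor, and natural selection plays a smaller role. Gene length and protein hydrophobicity also have some influence on codon usage bias. Twenty optimal codons were identified for grape.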

18.
The "expression measure" of a gene, E(g), is a statistic devised to predict the level of gene expression from codon usage bias. E(g) has been used extensively to analyze prokaryotic genome sequences. We discuss two problems with this approach. First, the formulation of E(g) is such that genes with the strongest selected codon usage bias are not likely to have the highest predicted expression levels; indeed, the correlation between E(g) and expression level is weak among moderately to highly expressed genes. Second, in some species, highly expressed genes do not have unusual codon usage, and so codon usage cannot be used to predict expression levels. We outline a simple approach, first to check whether a genome shows evidence of selected codon usage bias and then to assess the strength of bias in genes as a guide to their likely expression level; we illustrate this with an analysis of Shewanella oneidensis.

19.
20.
Compositional distributions in the three codon positions as well as codon usage biases of all available DNA sequences of the Buchnera aphidicola genome have been analyzed. It was observed that the order of GC levels among the three codon positions is I>II>III, as observed in other extremely AT-rich organisms. B. aphidicola, being an AT-rich organism, is expected to have A and/or T at the third positions of codons. Overall codon usage analyses indicate that A- and/or T-ending codons are predominant in this organism and that some particular amino acids are abundant in the coding regions of genes. However, multivariate statistical analysis indicates two major trends in the codon usage variation among the genes: one is strongly correlated with the GC content at the third synonymous positions of codons, and the other is associated with the expression level of genes. Moreover, codon usage biases of the highly expressed genes are almost identical to the overall codon usage biases of all the genes of this organism. These observations suggest that mutational bias is the main factor in determining the codon usage variation among the genes in B. aphidicola.
