首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 21 毫秒
1.
Analysis of synonymous codon usage pattern in the genome of a thermophilic cyanobacterium, Thermosynechococcus elongatus BP-1 using multivariate statistical analysis revealed a single major explanatory axis accounting for codon usage variation in the organism. This axis is correlated with the GC content at third base of synonymous codons (GC3s) in correspondence analysis taking T. elongatus genes. A negative correlation was observed between effective number of codons i.e. Nc and GC3s. Results suggested a mutational bias as the major factor in shaping codon usage in this cyanobacterium. In comparison to the lowly expressed genes, highly expressed genes of this organism possess significantly higher proportion of pyrimidine-ending codons suggesting that besides, mutational bias, translational selection also influenced codon usage variation in T. elongatus. Correspondence analysis of relative synonymous codon usage (RSCU) with A, T, G, C at third positions (A3s, T3s, G3s, C3s, respectively) also supported this fact and expression levels of genes and gene length also influenced codon usage. A role of translational accuracy was identified in dictating the codon usage variation of this genome. Results indicated that although mutational bias is the major factor in shaping codon usage in T. elongatus, factors like translational selection, translational accuracy and gene expression level also influenced codon usage variation.  相似文献   

2.
3.
The codon usage in the Vibrio cholerae genome is analyzed in this paper. Although there are much more genes on the chromosome 1 than on chromosome 2, the codon usage patterns of genes on the two chromosomes are quite similar, indicating that the two chromosomes may have coexisted in the same cell for a very long history. Unlike the base frequency pattern observed in other genomes, the G+C content at the third codon position of the V. cholerae genome varies in a rather small interval. The most notable feature of codon usage of V. cholerae genome is that there is a fraction of genes show significant bias in base choice at the second codon position. The 2,006 known genes can be classified into two clusters according to the base frequencies at this position. The smaller cluster contains 227 genes, most of which code for proteins involved in transport and binding functions. The encoding products of these genes have significant bias in amino acids composition as compared with other genes. The codon usage patterns for the 1,836 function unknown ORFs are also analyzed, which is useful to study their functions.  相似文献   

4.
以普通野生稻(Oryza rufipogon Griff.)线粒体基因组为对象,分析其蛋白质编码基因的密码子使用特征及与亚洲栽培稻(O. sativa L.)的差异,探讨其密码子偏性形成的影响因素和进化过程。结果显示:普通野生稻线粒体基因组编码序列第1、第2和第3位碱基的GC含量依次为49.18%、42.67%和40.86%;有效密码子数(Nc)分布于45.32~61.00之间,其密码子偏性较弱; Nc值仅与GC_3呈显著相关,密码子第3位的碱基组成对密码子偏性影响较大;第1向量轴上显示9.91%的差异,其与GC3s、Nc、密码子偏好指数(CBI)和最优密码子使用频率(Fop)的相关性均达到显著水平;而GC_3和GC12的相关性未达到显著水平。因此,普通野生稻线粒体基因组密码子的使用偏性主要受自然选择压力影响而形成。本研究确定了21个普通野生稻线粒体基因组的最优密码子,大多以A或T结尾,与叶绿体密码子具有趋同进化,但是与核基因组具有不同的偏好性。同义密码子相对使用度(RSCU)、PR2偏倚分析和中性绘图分析显示,普通野生稻线粒体基因功能和其密码子使用密切相关,且线粒体密码子使用在普通野生稻、粳稻(O. sativa L. subsp. japonica Kato)和籼稻(O. sativa L. subsp.indica Kato)内具有同质性。  相似文献   

5.
Burkholderia pseudomallei is a recognized biothreat agent and the causative agent of melioidosis. Codon usage biases of all protein-coding genes (length greater than or equal to 300 bp) from the complete genome of B. pseudomallei K96243 have been analyzed. As B. pseudomallei is a GC-rich organism (68.5%), overall codon usage data analysis indicates that indeed codons ending in G and/or C are predominant in this organism. But multivariate statistical analysis indicates that there is a single major trend in the codon usage variation among the genes in this organism, which has a strong positively correlation with the expressivities of the genes. The majority of the lowly expressed genes are scattered towards the negative end of the major axis whereas the highly expressed genes are clustered towards the positive end. At the same time, from the results that there were two significant correlations between axis 1 coordinates and the GC, GC3s content at silent sites of each sequence, and clearly significant negatively correlations between the ‘Effective Number of Codons’ values and GC, GC3s content, we inferred that codon usage bias was affected by gene nucleotide composition also. In addition, some other factors such as the lengths of the genes as well as the hydrophobicity of genes also influence the codon usage variation among the genes in this organism in a minor way. At the same time, notably, 21 codons have been defined as ‘optimal codons’ of the B. pseudomallei. In summary, our work have provided a basic understanding of the mechanisms for codon usage bias and some more useful information for improving the expression of target genes in vivo and in vitro. Sheng Zhao and Qin Zhang contributed equally to this work.  相似文献   

6.
Synonymous codon usage of 53 protein coding genes in chloroplast genome of Coffea arabica was analyzed for the first time to find out the possible factors contributing codon bias. All preferred synonymous codons were found to use A/T ending codons as chloroplast genomes are rich in AT. No difference in preference for preferred codons was observed in any of the two strands, viz., leading and lagging strands. Complex correlations between total base compositions (A, T, G, C, GC) and silent base contents (A3, T3, G3, C3, GC3) revealed that compositional constraints played crucial role in shaping the codon usage pattern of C. arabica chloroplast genome. ENC Vs GC3 plot grouped majority of the analyzed genes on or just below the left side of the expected GC3 curve indicating the influence of base compositional constraints in regulating codon usage. But some of the genes lie distantly below the continuous curve confirmed the influence of some other factors on the codon usage across those genes. Influence of compositional constraints was further confirmed by correspondence analysis as axis 1 and 3 had significant correlations with silent base contents. Correlation of ENC with axis 1, 4 and CAI with 1, 2 prognosticated the minor influence of selection in nature but exact separation of highly and lowly expressed genes could not be seen. From the present study, we concluded that mutational pressure combined with weak selection influenced the pattern of synonymous codon usage across the genes in the chloroplast genomes of C. arabica.  相似文献   

7.
Mitogen activated protein kinase (MAPK) genes provide resistance to various biotic and abiotic stresses. Codon usage profiling of the genes reveals the characteristic features of the genes like nucleotide composition, gene expressivity, optimal codons etc. The present study is a comparative analysis of codon usage patterns for different MAPK genes in three organisms, viz. Arabidopsis thaliana, Glycine max (soybean) and Oryza sativa (rice). The study has revealed a high AT content in MAPK genes of Arabidopsis and soybean whereas in rice a balanced AT-GC content at the third synonymous position of codon. The genes show a low bias in codon usage profile as reflected in the higher values (50.83 to 56.55) of effective number of codons (Nc). The prediction of gene expression profile in the MAPK genes revealed that these genes might be under the selective pressure of translational optimization as reflected in the low codon adaptation index (CAI) values ranging from 0.147 to 0.208.  相似文献   

8.
Microbial communities represent the largest portion of the Earth’s biomass. Metagenomics projects use high-throughput sequencing to survey these communities and shed light on genetic capabilities that enable microbes to inhabit every corner of the biosphere. Metagenome studies are generally based on (i) classifying and ranking functions of identified genes; and (ii) estimating the phyletic distribution of constituent microbial species. To understand microbial communities at the systems level, it is necessary to extend these studies beyond the species’ boundaries and capture higher levels of metabolic complexity. We evaluated 11 metagenome samples and demonstrated that microbes inhabiting the same ecological niche share common preferences for synonymous codons, regardless of their phylogeny. By exploring concepts of translational optimization through codon usage adaptation, we demonstrated that community-wide bias in codon usage can be used as a prediction tool for lifestyle-specific genes across the entire microbial community, effectively considering microbial communities as meta-genomes. These findings set up a ‘functional metagenomics’ platform for the identification of genes relevant for adaptations of entire microbial communities to environments. Our results provide valuable arguments in defining the concept of microbial species through the context of their interactions within the community.  相似文献   

9.
A correspondence analysis of codon usage in human genes revealed, as expected, that the first axis is strongly correlated with the base composition at synonymous third codon positions. At one extreme of the second axis were localized genes with a high frequency of NCG and CGN codons. The great majority of these sequences were embedded in CpG islands, while the opposite is true for the genes placed at the other extreme. The two main conclusions of this paper are: (1) the influence of CpG islands on codon usage, and (2) since the second axis is orthogonal (and therefore independent) of the first, GC3-rich genes are not necessarily associated with CpG islands.  相似文献   

10.
Naya H  Romero H  Carels N  Zavala A  Musto H 《FEBS letters》2001,501(2-3):127-130
In unicellular species codon usage is determined by mutational biases and natural selection. Among prokaryotes, the influence of these factors is different if the genome is skewed towards AT or GC, since in AT-rich organisms translational selection is absent. On the other hand, in AT-rich unicellular eukaryotes the two factors are present. In order to understand if GC-rich genomes display a similar behavior, the case of Chlamydomonas reinhardtii was studied. Since we found that translational selection strongly influences codon usage in this species, we conclude that there is not a common pattern among unicellular organisms.  相似文献   

11.
Summary Ubiquitin is ubiquitous in all eukaryotes and its amino acid sequence shows extreme conservation. Ubiquitin genes comprise direct repeats of the ubiquitin coding unit with no spacers. The nucleotide sequences coding for 13 ubiquitin genes from 11 species reported so far have been compiled and analyzed. The G+C content of codon third base reveals a positive linear correlation with the genome G+C content of the corresponding species. The slope strongly suggests that the overall G+C content of codons of polyubiquitin genes clearly reflects the genome G+C content by AT/GC substitutions at the codon third position. The G+C content of ubiquitin codon third base also shows a positive linear correlation with the overall G+C content of coding regions of compiled genes, indicating the codon choices among synonymous codons reflect the average codon usage pattern of corresponding species. On the other hand, the monoubiquitin gene, which is different from the polyubiquitin gene in gene organization, gene expression, and function of the encoding protein, shows a different codon usage pattern compared with that of the polyubiquitin gene. From comparisons of the levels of synonymous substitutions among ubiquitin repeats and the homology of the amino acid sequence of the tail of monomeric ubiquitin genes, we propose that the molecular evolution of ubiquitin genes occurred as follows: Plural primitive ubiquitin sequences were dispersed on genome in ancestral eukaryotes. Some of them situated in a particular environment fused with the tail sequence to produce monomeric ubiquitin genes that were maintained across species. After divergence of species, polyubiquitin genes were formed by duplication of the other primitive ubiquitin sequences on different chromosomes. Differences in the environments in which ubiquitin genes are embedded reflect the differences in codon choice and in gene expression pattern between poly- and monomeric ubiquitin genes.  相似文献   

12.
糜子叶绿体基因组密码子使用偏性的分析   总被引:2,自引:0,他引:2       下载免费PDF全文
密码子使用偏性(CUB)是生物体重要的进化特征,对研究物种进化、基因功能以及外源基因表达等具有重要科学意义。本研究利用糜子(Panicum miliaceum L.)叶绿体基因组中筛选出的53条蛋白编码序列,对其密码子使用模式及偏性进行了分析。结果表明,糜子叶绿体基因的有效密码子数(ENC)在37.14~61之间,多数密码子的偏性较弱。相对同义密码子使用度(RSCU)分析发现,RSCU > 1的密码子有32个,其中28个以A、U结尾,表明第3位密码子偏好使用A和U碱基。中性分析发现,GC3与GC12的相关性不显著,回归曲线斜率为0.2129,表明密码子偏性主要受到自然选择的影响;而ENC-plot分析发现大部分基因落在曲线的上方及周围,表明突变也影响了密码子偏性的形成。进一步的对应性分析发现,第1轴为主要影响因素,解释了17.92%的差异,其与ENC、GC3S值的相关性均达到显著水平,但与CBI、GCall不相关。最后,9个密码子被鉴定为糜子叶绿体基因组的最优密码子,糜子叶绿体基因组的密码子使用偏性可能受选择和突变共同作用。  相似文献   

13.
Summary We searched the complete 39,936 base DNA sequence of bacteriophage T7 for nonrandomness that might be attributed to natural selection. Codon usage in the 50 genes of T7 is nonrandom, both over the whole code and among groups of synonymous codons. There is a great excess of purineany base-pyrimidine (RNY) codons. Codon usage varies between genes, but from the pooled data for the whole genome (12,145 codons) certain putative selective constraints can be identified. Codon usage appears to be influenced by host tRNA abundance (particularly in highly expressed genes), tRNA-mRNA interactions (one such interaction being perhaps responsible for maintaining the excess of RNY codons) and a lack of short palindromes. This last constraint is probably due to selection against host restriction enzyme recognition sites; this is the first report of an effect of this kind on codon usage. Selection against susceptibility to mutational damage does not appear to have been involved.  相似文献   

14.
A role for tRNA modifications in genome structure and codon usage   总被引:1,自引:0,他引:1  
Transfer RNA (tRNA) gene content is a differentiating feature of genomes that contributes to the efficiency of the translational apparatus, but the principles shaping tRNA gene copy number and codon composition are poorly understood. Here, we report that the emergence of two specific tRNA modifications shaped the structure and composition of all extant genomes. Through the analysis of more than 500 genomes, we identify two kingdom-specific tRNA modifications as major contributors that separated archaeal, bacterial, and eukaryal genomes in terms of their tRNA gene composition. We show that, contrary to prior observations, genomic codon usage and tRNA gene frequencies correlate in all kingdoms if these two modifications are taken into account and that presence or absence of these modifications explains patterns of gene expression observed in previous studies. Finally, we experimentally demonstrate that human gene expression levels correlate well with genomic codon composition if these identified modifications are considered.  相似文献   

15.
Liu Q  Xue Q 《Bio Systems》2004,77(1-3):33-39
Using an approach based on the Readthrough Candidate Extraction System (RCES), we extracted 111 candidates from 9620 gene sequences of rice. The results of homology search and sequence analysis demonstrated that these candidates included actual readthrough genes that would be important for further investigating the mechanism of translation termination regulated by readthrough event, and could also give some useful clues for functional genome annotation. Between the candidates and non-candidates of gene sequences in rice, there exist significant base biases at the positions surrounding the stop codons. These positions, especially both -1 and +4, are referred to as part of an extended stop signal. In candidates, G at position -1, and G or C at position +4 are much more favored than that in non-candidates. Both stop sequence patterns, GUAGC and GUGAG, might drive high readthrough efficiency in rice. Secondary structure analysis revealed that the -1 and +1 amino acids around the first stop codon of candidates have a strong bias toward arginine, particularly the +1 position (20.7%), which indicated that the amino acids at the readthrough region being frequently located in the hydrophilic region of beta-turn might be a determinant for efficient translation termination or not.  相似文献   

16.

Background

The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori.

Results

A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans).

Conclusions

The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the “optimal codons” of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1596-z) contains supplementary material, which is available to authorized users.  相似文献   

17.
Rao Y  Wu G  Wang Z  Chai X  Nie Q  Zhang X 《DNA research》2011,18(6):499-512
Synonymous codons are used with different frequencies both among species and among genes within the same genome and are controlled by neutral processes (such as mutation and drift) as well as by selection. Up to now, a systematic examination of the codon usage for the chicken genome has not been performed. Here, we carried out a whole genome analysis of the chicken genome by the use of the relative synonymous codon usage (RSCU) method and identified 11 putative optimal codons, all of them ending with uracil (U), which is significantly departing from the pattern observed in other eukaryotes. Optimal codons in the chicken genome are most likely the ones corresponding to highly expressed transfer RNA (tRNAs) or tRNA gene copy numbers in the cell. Codon bias, measured as the frequency of optimal codons (Fop), is negatively correlated with the G + C content, recombination rate, but positively correlated with gene expression, protein length, gene length and intron length. The positive correlation between codon bias and protein, gene and intron length is quite different from other multi-cellular organism, as this trend has been only found in unicellular organisms. Our data displayed that regional G + C content explains a large proportion of the variance of codon bias in chicken. Stepwise selection model analyses indicate that G + C content of coding sequence is the most important factor for codon bias. It appears that variation in the G + C content of CDSs accounts for over 60% of the variation of codon bias. This study suggests that both mutation bias and selection contribute to codon bias. However, mutation bias is the driving force of the codon usage in the Gallus gallus genome. Our data also provide evidence that the negative correlation between codon bias and recombination rates in G. gallus is determined mostly by recombination-dependent mutational patterns.  相似文献   

18.
The immergence and dissemination of multidrug-resistant strains of Staphylococcus aureus in recent years have expedited the research on the discovery of novel anti-staphylococcal agents promptly. Bacteriophages have long been showing tremendous potentialities in curing the infections caused by various pathogenic bacteria including S. aureus. Thus far, only a few virulent bacteriophages, which do not carry any toxin-encoding gene but are capable of eradicating staphylococcal infections, were reported. Based on the codon usage analysis of sixteen S. aureus phages, previously three phages were suggested to be useful as the anti-staphylococcal agents. To search for additional S. aureus phages suitable for phage therapy, relative synonymous codon usage bias has been investigated in the protein-coding genes of forty new staphylococcal phages. All phages appeared to carry A and T ending codons. Several factors such as mutational pressure, translational selection and gene length seemed to be responsible for the codon usage variation in the phages. Codon usage indeed varied phage to phage. Of the phages, phages G1, Twort, 66 and Sap-2 may be extremely lytic in nature as majority of their genes possess high translational efficiency, indicating that these phages may be employed in curing staphylococcal infections.  相似文献   

19.
Analysis of synonymous codon usage bias in Chlamydia   总被引:9,自引:0,他引:9  
Chlamydiae are obligate intracellular bacterial pathogens that cause ocular and sexuallytransmitted diseases,and are associated with cardiovascular diseases.The analysis of codon usage mayimprove our understanding of the evolution and pathogenesis of Chlamydia and allow reengineering of targetgenes to improve their expression for gene therapy.Here,we analyzed the codon usage of C.muridarum,C.trachomatis(here indicating biovar trachoma and LGV),C.pneumoniae,and C.psittaci using the codonusage database and the CUSP(Create a codon usage table)program of EMBOSS(The European MolecularBiology Open Software Suite).The results show that the four genomes have similar codon usage patterns,with a strong bias towards the codons with A and T at the third codon position.Compared with Homosapiens,the four chlamydial species show discordant seven or eight preferred codons.The ENC(effectivenumber of codons used in a gene)-plot reveals that the genetic heterogeneity in Chlamydia is constrained bythe G+C content,while translational selection and gene length exert relatively weaker influences.Moreover,mutational pressure appears to be the major determinant of the codon usage variation among the chlamydialgenes.In addition,we compared the codon preferences of C.trachomatis with those of E.coli,yeast,adenovirus and Homo sapiens.There are 23 codons showing distinct usage differences between C.trachomatisand E.coli,24 between C.trachomatis and adenovirus,21 between C.trachomatis and Homo sapiens,butonly six codons between C.trachomatis and yeast.Therefore,the yeast system may be more suitable for theexpression of chlamydial genes.Finally,we compared the codon preferences of C.trachomatis with those ofsix eukaryotes,eight prokaryotes and 23 viruses.There is a strong positive correlation between the differ-ences in coding GC content and the variations in codon bias(r=0.905,P<0,001).We conclude that thevariation of codon bias between C.trachomatis and other organisms is much less influenced by phylogeneticlineage and primarily determined by the extent of disparities in GC content.  相似文献   

20.
Myosins play an important role in various developmental processes in plants. We have identified 14 myosin genes in rice (Oryza sativa cv. Nipponbare) genome using sequence information available in public databases. Phylogenetic analysis of these sequences with other plant and non-plant myosins revealed that two of the predicted sequences belonged to class VIII and the others to class XI. All of these genes were distributed on seven chromosomes in the rice genome. Domain searches on these sequences indicated that a typical rice myosin consisted of Myosin_N, head domain, neck (IQ motifs), tail, and dilute (DIL) domain. Based on the sequence information obtained from predicted myosins, we isolated and sequenced two full-length cDNAs, OsMyoVIIIA and OsMyoXIE, representing each of the two classes of myosins. These two cDNAs isolated from different organs existed in isoforms due to differential splicing and showed minor differences from the predicted myosin in exon organization. Out of 14 myosin genes 11 were expressed in three major organs: leaves, panicles, and roots, among which three myosins exhibited different expression levels. On the other hand, three of the total myosin sequences showed organ-specific expression. The existence of different myosin genes and their isoforms in different organs or tissues indicates the diversity of myosin functions in rice.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号