Similar literature
20 similar documents found (search time: 15 ms)
1.
The 'effective number of codons' revisited   Total citations: 1, self-citations: 0, citations by others: 1
Frank Wright [Gene 87 (1990) 23] derived a formula for calculation of a quantity termed the 'effective number of codons' (Nc) based on codon homozygosities. This quantity is a number between 20 and 61 and tells to what degree the codon usage in a gene is biased, i.e., it approaches 20 codons for the extremely biased genes, and approaches 61 for the genes where all possible codons are used with no preference. Among the different measures of codon bias Nc is considered the most useful and has found widespread use in papers dealing with codon usage phenomena. In this paper, the mathematical behaviours of codon homozygosities and Nc are evaluated, using Escherichia coli as the model organism. The results indicate that the classical formula for calculation of Nc could appropriately be substituted under circumstances where there is bias discrepancy, i.e., when one amino acid (or more) within a degeneracy group is associated with strong codon bias while at the same time others in the same degeneracy group have little bias. An alternative estimator, termed Nc, is proposed and tested against Nc, and performs better when there is such bias discrepancy.

2.
The 'effective number of codons' used in a gene   Total citations: 64, self-citations: 0, citations by others: 64
F Wright 《Gene》1990,87(1):23-29
A simple measure is presented that quantifies how far the codon usage of a gene departs from equal usage of synonymous codons. This measure of synonymous codon usage bias, the 'effective number of codons used in a gene', Nc, can be easily calculated from codon usage data alone, and is independent of gene length and amino acid (aa) composition. Nc can take values from 20, in the case of extreme bias where one codon is exclusively used for each aa, to 61 when the use of alternative synonymous codons is equally likely. Nc thus provides an intuitively meaningful measure of the extent of codon preference in a gene. Codon usage patterns across genes can be investigated by the Nc-plot: a plot of Nc vs. G + C content at synonymous sites. Nc-plots are produced for Homo sapiens, Saccharomyces cerevisiae, Escherichia coli, Bacillus subtilis, Dictyostelium discoideum, and Drosophila melanogaster. A FORTRAN77 program written to calculate Nc is available on request.
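
The calculation described in this abstract can be illustrated with a short Python sketch. It is not the FORTRAN77 program mentioned above; it assumes the commonly cited form of Wright's estimator, in which a homozygosity F = (n Σp² − 1)/(n − 1) is computed per amino acid, averaged within each degeneracy class, and combined as Nc = 2 + 9/F2 + 1/F3 + 5/F4 + 3/F6. The function name and the decision to return None when a degeneracy class is absent are simplifications made for this example.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def effective_number_of_codons(seq):
    """Sketch of Wright's Nc for one in-frame coding sequence.

    Returns None when any degeneracy class has no usable amino acid
    (a simplification assumed here, not part of the abstract).
    """
    seq = seq.upper().replace("U", "T")
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    counts = Counter(c for c in codons if CODON_TABLE.get(c, "*") != "*")

    # Group codon counts by the amino acid they encode.
    by_aa = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            by_aa[aa].append(counts[codon])

    # Codon homozygosity F per amino acid, collected by degeneracy class.
    f_by_class = defaultdict(list)
    for family in by_aa.values():
        k, n = len(family), sum(family)
        if k == 1 or n < 2:
            continue  # Met and Trp carry no synonymous choice
        f = (n * sum((c / n) ** 2 for c in family) - 1) / (n - 1)
        if f > 0:
            f_by_class[k].append(f)

    # 9 twofold, 1 threefold, 5 fourfold and 3 sixfold amino acids.
    nc = 2.0
    for k, n_amino_acids in ((2, 9), (3, 1), (4, 5), (6, 3)):
        fs = f_by_class.get(k)
        if not fs:
            return None
        nc += n_amino_acids / (sum(fs) / len(fs))
    return min(nc, 61.0)  # values above 61 are capped at 61
```

With one codon used exclusively for every amino acid, each F equals 1 and the sum is 2 + 9 + 1 + 5 + 3 = 20; with equal usage the sum reaches the cap of 61, matching the range stated in the abstract.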

3.
Analysis of synonymous codon usage pattern in the genome of a thermophilic cyanobacterium, Thermosynechococcus elongatus BP-1 using multivariate statistical analysis revealed a single major explanatory axis accounting for codon usage variation in the organism. This axis is correlated with the GC content at third base of synonymous codons (GC3s) in correspondence analysis taking T. elongatus genes. A negative correlation was observed between effective number of codons, i.e. Nc, and GC3s. Results suggested a mutational bias as the major factor in shaping codon usage in this cyanobacterium. In comparison to the lowly expressed genes, highly expressed genes of this organism possess a significantly higher proportion of pyrimidine-ending codons, suggesting that besides mutational bias, translational selection also influenced codon usage variation in T. elongatus. Correspondence analysis of relative synonymous codon usage (RSCU) with A, T, G, C at third positions (A3s, T3s, G3s, C3s, respectively) also supported this fact, and expression levels of genes and gene length also influenced codon usage. A role of translational accuracy was identified in dictating the codon usage variation of this genome. Results indicated that although mutational bias is the major factor in shaping codon usage in T. elongatus, factors like translational selection, translational accuracy and gene expression level also influenced codon usage variation.
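
Two of the quantities used in this abstract, RSCU and GC3s, are simple enough to sketch in Python. The definitions assumed here are the usual ones: RSCU is the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally, and GC3s is the fraction of G or C at the third position of codons that have synonymous alternatives (Met, Trp and stop codons excluded). The function names are made up for this example and are not taken from the paper.

```python
from collections import Counter
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def split_codons(seq):
    """Split an in-frame sequence into codons, dropping a trailing partial codon."""
    seq = seq.upper().replace("U", "T")
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def rscu(seq):
    """RSCU per codon, for amino acids that occur in the sequence."""
    counts = Counter(split_codons(seq))
    values = {}
    for aa in set(AMINO) - {"*"}:
        family = [c for c, a in CODON_TABLE.items() if a == aa]
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent: RSCU undefined
        expected = total / len(family)  # count under equal synonymous usage
        for c in family:
            values[c] = counts[c] / expected
    return values


def gc3s(seq):
    """G+C fraction at third positions of synonymous codons, or None if there are none."""
    synonymous = [c for c in split_codons(seq)
                  if CODON_TABLE.get(c, "*") not in "*MW"]
    if not synonymous:
        return None
    return sum(c[2] in "GC" for c in synonymous) / len(synonymous)
```

An RSCU above 1 marks a codon used more often than expected under equal usage, and a value below 1 marks one used less often.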

4.
In the present study, major constraints for codon and amino acid usage of Sulfolobus acidocaldarius, Sulfolobus solfataricus, Sulfolobus tokodaii, Sulfolobus islandicus and 6 other isolates from islandicus species of genus Sulfolobus were investigated. Correspondence analysis revealed a highly significant correlation between the major trend of synonymous codon usage and gene expression level, as assessed by the “Codon Adaptation Index” (CAI). There is a significant negative correlation between Nc (effective number of codons) and CAI, demonstrating the role of codon bias as an important determinant of codon usage. The significant correlation between the major trend of synonymous codon usage and GC3s (G + C at third synonymous position) indicated a dominant role of mutational bias in the codon usage pattern. The result was further supported by SCUO (synonymous codon usage order) analysis. The amino acid usage was found to be significantly influenced by aromaticity and hydrophobicity of proteins. However, translational selection, which causes a preference for codons that are most rapidly translated by current tRNA with multiple copy numbers, was not found to be highly dominating for all studied isolates. Notably, 26 codons that were found to be optimally used by genes of S. acidocaldarius at higher expression level and its comparative analysis with 9 other isolates may provide some useful clues for further in vivo genetic studies on this genus.
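
The Codon Adaptation Index used in this abstract as a proxy for expression level can be sketched as follows. This is a minimal illustration under the usual definition (Sharp and Li): each codon gets a relative adaptiveness w equal to its count in a reference set of highly expressed genes divided by the count of the most used synonymous codon, and CAI is the geometric mean of w over the codons of a gene. The reference set, the function names, and the 0.5 pseudocount for unseen codons are assumptions of this example, not details from the paper.

```python
import math
from collections import Counter
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def split_codons(seq):
    """Split an in-frame sequence into codons, dropping a trailing partial codon."""
    seq = seq.upper().replace("U", "T")
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def relative_adaptiveness(reference_seqs):
    """Weights w for every synonymous codon, from a reference gene set."""
    counts = Counter(c for s in reference_seqs for c in split_codons(s))
    weights = {}
    for aa in set(AMINO) - {"*", "M", "W"}:
        family = [c for c, a in CODON_TABLE.items() if a == aa]
        # Unseen codons get a pseudocount of 0.5 so that w never equals zero.
        adjusted = {c: counts[c] or 0.5 for c in family}
        top = max(adjusted.values())
        for c in family:
            weights[c] = adjusted[c] / top
    return weights


def cai(seq, weights):
    """Geometric mean of w over the synonymous codons of seq, or None if there are none."""
    logs = [math.log(weights[c]) for c in split_codons(seq) if c in weights]
    return math.exp(sum(logs) / len(logs)) if logs else None
```

A gene built only from the most used codon of each family scores 1.0; heavier use of rare codons pulls the score toward 0.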

5.
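Analysis of differences in codon usage bias of the ORF1a gene among different PRRSV strains   Total citations: 1, self-citations: 0, citations by others: 1
CodonW, ClustalX, and TreeView software, together with the online tools EMBOSS (The European Molecular Biology Open Software Suite) and CIMminer, were used to perform a cluster analysis of codon usage bias in the ORF1a genes of 29 selected PRRSV strains. Correlation analysis of CAI, CBI, Fop, Nc, GC3s, GC content, and gene length showed that codon usage bias of the ORF1a gene differs among PRRSV strains; Lelystad virus, LV4.2.1, VR-2332, and RespPRRS MLV differed considerably from the highly pathogenic PRRSV variants isolated in China. Cluster analysis of codon usage probability showed that CC.1, NVSL.97.7895, CH-1a, RespPRRS MLV, LV4.2.1, and Lelystad virus were distant from the highly pathogenic PRRSV variants, whereas the isolates from China clustered close to one another. This result is largely consistent with the phylogenetic tree built from amino acid sequence alignment. Differences in codon usage bias of the PRRSV ORF1a gene are therefore closely related to the genetic diversity of the virus.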

6.
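Taking the mitochondrial genome of common wild rice (Oryza rufipogon Griff.) as the subject, this study analyzed the codon usage characteristics of its protein-coding genes and their differences from Asian cultivated rice (O. sativa L.), and explored the factors shaping its codon bias and the evolutionary process. The results show that the GC content at the first, second, and third codon positions of the coding sequences is 49.18%, 42.67%, and 40.86%, respectively; the effective number of codons (Nc) ranges from 45.32 to 61.00, indicating weak codon bias. Nc is significantly correlated only with GC3, so base composition at the third codon position has a large effect on codon bias. The first axis accounts for 9.91% of the variation and is significantly correlated with GC3s, Nc, the codon bias index (CBI), and the frequency of optimal codons (Fop), whereas the correlation between GC3 and GC12 is not significant. The codon usage bias of the common wild rice mitochondrial genome is therefore shaped mainly by natural selection pressure. This study identified 21 optimal codons for the common wild rice mitochondrial genome, most of them ending in A or T, showing convergent evolution with chloroplast codons but a preference different from that of the nuclear genome. Relative synonymous codon usage (RSCU), PR2 bias analysis, and neutrality plot analysis show that mitochondrial gene function is closely related to codon usage in common wild rice, and that mitochondrial codon usage is homogeneous across common wild rice, japonica rice (O. sativa L. subsp. japonica Kato), and indica rice (O. sativa L. subsp. indica Kato).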

7.
The effective number of codons used in a gene is a commonly used measure of codon usage. It varies between 20 and 61 (standard genetic code) and indicates to which degree the entire genetic code is used. It is a drawback of this method that it does not take background composition into account. This led Novembre to introduce a variant called Nc' (Novembre JA. 2002. Accounting for background nucleotide composition when measuring codon usage bias. Mol Biol Evol 19:1390-4). In this letter, its properties are under the loupe, with special emphasis on phenomena relating to codon homozygosity. A theoretical misunderstanding regarding this estimator is explained in detail, notably Nc varies between 0 and 61 instead of 20 and 61 (with the standard genetic code). Practical examples from the genome of Pseudomonas aeruginosa are given which demonstrate that the problem is not just theoretical.

8.
A comparative genomic analysis of three species of the soil bacterium Arthrobacter was undertaken with specific emphasis on genes involved in important and core energy metabolism pathways like glycolysis and amino acid metabolism. During the course of this study, it was revealed that codon bias of a particular species, namely Arthrobacter aurescens TC1, is significantly lower than that of the other two species A. chlorophenolicus A6 and Arthrobacter sp. FB24. The codon bias was also found to be negatively correlated with gene expression level, which is determined by computing the codon adaptation index of the genes. Uniformity in codon usage pattern among the three species is evident in terms of genes that have high codon bias and a multifunctional nature. Further, it was observed that this trend is present amongst the genes of important metabolic pathways, such as glycolysis and amino acid metabolism. The evolutionary divergence of the pathway gene sequences was calculated and was found to be equivalent in nature in the case of Arthrobacter sp. FB24 and Arthrobacter chlorophenolicus A6, but turned out to be dissimilar in the case of Arthrobacter aurescens TC1. A strong correlation between synonymous substitution rate and effective codon number or Nc was also observed. These observations clearly point out that the genes having low bias in Arthrobacter aurescens TC1, even those that are part of highly conserved metabolic pathways like glycolysis and amino acid ensemble pathways, have undergone a different type of evolution and might be subjected to positive selection pressure in comparison with Arthrobacter sp. FB24 and Arthrobacter chlorophenolicus A6.

9.
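Taking the mitochondrial genome of common wild rice (Oryza rufipogon Griff.) as the subject, this study analyzed the codon usage characteristics of its protein-coding genes and their differences from Asian cultivated rice (O. sativa L.), and explored the factors shaping its codon bias and the evolutionary process. The results show that the GC content at the first, second, and third codon positions of the coding sequences is 49.18%, 42.67%, and 40.86%, respectively; the effective number of codons (Nc) ranges from 45.32 to 61.00, indicating weak codon bias. Nc is significantly correlated only with GC3, so base composition at the third codon position has a large effect on codon bias. The first axis accounts for 9.91% of the variation and is significantly correlated with GC3s, Nc, the codon bias index (CBI), and the frequency of optimal codons (Fop), whereas the correlation between GC3 and GC12 is not significant. The codon usage bias of the common wild rice mitochondrial genome is therefore shaped mainly by natural selection pressure. This study identified 21 optimal codons for the common wild rice mitochondrial genome, most of them ending in A or T, showing convergent evolution with chloroplast codons but a preference different from that of the nuclear genome. Relative synonymous codon usage (RSCU), PR2 bias analysis, and neutrality plot analysis show that mitochondrial gene function is closely related to codon usage in common wild rice, and that mitochondrial codon usage is homogeneous across common wild rice, japonica rice (O. sativa L. subsp. japonica Kato), and indica rice (O. sativa L. subsp. indica Kato).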

10.
Despite the degeneracy of the genetic code, whereby different codons encode the same amino acid, alternative codons and amino acids are utilized nonrandomly within and between genomes. Such biases in codon and amino acid usage have been demonstrated extensively in prokaryote genomes and likely reflect a balance between the action of mutation, selection, and genetic drift. Here, we quantify the effects of selection and mutation drift as causes of codon and amino acid-usage bias in a large collection of nematode partial genomes from 37 species spanning approximately 700 Myr of evolution, as inferred from expressed sequence tag (EST) measures of gene expression and from base composition variation. Average G + C content at silent sites among these taxa ranges from 10% to 63%, and EST counts range more than 100-fold, underlying marked differences between the identities of major codons and optimal codons for a given species as well as influencing patterns of amino acid abundance among taxa. Few species in our sample demonstrate a dominant role of selection in shaping intragenomic codon-usage biases, and these are principally free living rather than parasitic nematodes. This suggests that deviations in effective population size among species, with small effective sizes among parasites, are partly responsible for species differences in the extent to which selection shapes patterns of codon usage. Nevertheless, a consensus set of optimal codons emerges that is common to most taxa, indicating that, with some notable exceptions, selection for translational efficiency and accuracy favors similar sets of codons regardless of the major codon-usage trends defined by base compositional properties of individual nematode genomes.

11.
Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of the favored codon pair (amino acid pair) varies between genes and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.

12.
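To analyze differences in the codon usage characteristics of the mitochondrial genomes of cultivated and wild soybean, this paper took their mitochondrial coding sequences as the subject and compared the factors shaping their codon bias and the evolutionary process. The results show: (1) The GC content of the mitochondrial coding regions of cultivated and wild soybean is 44.56% and 44.58%, respectively, indicating that the mitochondrial coding genes of both are rich in A/T bases. (2) In both, the correlation between the mean GC content at the first and second codon positions and the GC content at the third position is highly significant, indicating that the role of mutation in shaping codon bias cannot be neglected; PR2-plot analysis shows that, at the third position of synonymous codons, purines are used less often than pyrimidines; in Nc-plot analysis, genes with an Nc ratio in the range -0.1 to 0.2 account for more than 95% of all genes; multiple factors, including mutation and selection, jointly shaped the codon usage bias of soybean mitochondrial coding sequences. (3) 20 and 21 codons were identified as optimal codons for the mitochondrial coding sequences of cultivated and wild soybean, respectively, all ending in A or T except the serine codon TCC. Taken together, the results suggest that selection has influenced the formation of mitochondrial codon bias more in cultivated soybean than in wild soybean, which may be a result of the long-term domestication of cultivated soybean from wild soybean.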
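
The Nc-plot analysis in this abstract compares each gene's observed Nc with the value expected from its GC3s alone. A minimal Python sketch follows, assuming the expected curve usually attributed to Wright, Nc = 2 + s + 29 / (s² + (1 − s)²) with s = GC3s, and assuming the Nc ratio is defined as (expected − observed) / expected. The abstract does not state the ratio formula, so that definition and the function names are assumptions of this example.

```python
def expected_nc(gc3s):
    """Expected Nc when codon usage is determined by GC3s alone (assumed Wright curve)."""
    return 2.0 + gc3s + 29.0 / (gc3s ** 2 + (1.0 - gc3s) ** 2)


def nc_ratio(nc_observed, gc3s):
    """(expected - observed) / expected; this ratio definition is an assumption."""
    nc_expected = expected_nc(gc3s)
    return (nc_expected - nc_observed) / nc_expected
```

Under this definition a ratio near 0 places a gene on the expected curve, which is usually read as codon usage explained by base composition, while a clearly positive ratio means the gene is more biased than GC3s alone predicts.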

13.
Hepatitis C virus (HCV) infection is among the leading causes of hepatocellular carcinoma and liver cirrhosis globally, with a high economic burden. The disease progression is well established, but less is known about spontaneous clearance of HCV infection. This study tries to establish the relationship between codon bias and expression of HCV clearance candidate genes in normal and HCV-infected liver tissues. A total of 112 coding sequences comprising 151 679 codons were subjected to the computation of codon indices, namely relative synonymous codon usage, effective number of codons (Nc), frequency of optimal codons, codon adaptation index, codon bias index, and base compositions. The codon indices GC3s, GC12, hydropathicity, and aromaticity implicate both mutational and translational selection in the candidate gene set. This was further correlated with the differentially expressed genes among the selected genes using BioGPS. A significant correlation is observed between the gene expression of normal liver and cancerous liver tissues and codon bias (Nc). Gene expression is also correlated with relative codon bias values, indicating that the CCL5, APOA2, CD28, IFITM1, and TNFSF4 genes have higher expression. These results are quite encouraging for selecting the highly responsive genes in HCV clearance. However, there could be additional genes that also orchestrate clearance together with the above-mentioned first line of defensive genes.

14.
Li Y  Wang C  Cheng X  Wu T  Zhang C 《Bio Systems》2011,104(1):42-47
Three very virulent infectious bursal disease virus (vvIBDV) strains were isolated from a single farm and shown to be phylogenetically related to the vvIBDV isolate UK661. In this study, a comparative analysis of the synonymous codon usage in the hypervariable region of the VP2 (vVP2) gene of the vvIBDV strains was done on viruses serially passaged in chicken embryos. Sequencing demonstrated that codons change during the serial passage in the vVP2 gene of the viruses. Nine codon mutations resulted in amino acid changes. The amino acid changes were I256V, I296L in isolate XA1989, A222P, I242V, Q253H, I256V in isolate XA1998, and Q253H, I256V, I296L in isolate XA2004. Three of the nine amino acid changes occurred at residue 256. The codons of the amino acids A232, N233, I234, T269, T283 and H338 changed to synonymous codons in XA1989 after the 16th passage, in XA1998 after the 24th passage and in XA2004 after the 22nd passage. These mutations change the key amino acid residues Q253H and I256V in the domains which are essential for its virulence, and the synonymous codons were observed compared to classical virulent IBDV. The results indicated that the codon changes during the serial passage comprised synonymous codon usage in the vVP2 gene of IBDV, and this synonymous codon bias was correlated with pathotypes. The extent of synonymous codon usage bias in the IBDV vVP2 gene may influence the gene expression level and secondary structure of the protein as well as hydrophobicity; the results therefore provide useful perspectives for understanding the evolution and pathogenesis of IBDV.

15.
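The complete genome sequence of Escherichia coli K-12 strain MG1655 was obtained from GenBank, and several parameters related to codon usage bias (Nc, CAI, GC, GC3s) were calculated. The relationships between codon bias and both mRNA coding-region length and the tendency to form secondary structure were analyzed statistically. Although translation efficiency (including translation speed and accuracy) is the main factor constraining the codon bias of highly expressed E. coli genes, mRNA coding-region length and the tendency to form secondary structure are also non-negligible contributors, and they weaken the bias to some degree. The biological significance of the tendency of mRNA coding regions to form secondary structure is also discussed.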

16.

Background

The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori.

Results

A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans).

Conclusions

The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the “optimal codons” of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1596-z) contains supplementary material, which is available to authorized users.

17.
The BRCA1 gene is located on human chromosome 17q21.31 and plays an important role in biological processes. The aminoacyl-tRNA synthetases (AARS) are a family of heterogeneous enzymes responsible for protein synthesis, whose secondary functions include a role in autoimmune myositis. Our findings reveal that compositional constraint and a preference for A/T-ending codons determine the codon usage patterns in the BRCA1 gene, while more G/C-ending codons influence the codon usage pattern of the AARS gene among mammals. The codon usage bias in the BRCA1 and AARS genes is low. The codon CGC encoding arginine and the codon TTA encoding leucine were uniformly distributed in the BRCA1 and AARS genes, respectively, in mammals including human. Natural selection might have played a major role, while mutation pressure might have played a minor role, in shaping the codon usage pattern of the BRCA1 and AARS genes.

18.
In this study, major factors shaping codon and amino acid usage variation in Lactobacillus sakei 23K were investigated. It included 13 other Lactobacillus species for a comparative analysis. The correspondence analysis (COA) showed that in 13 species the major trend of synonymous codon usage was highly correlated with gene expression level as assessed by the “Codon Adaptation Index” (CAI) values. In addition, Nc (effective number of codons) plot, SCUO (synonymous codon usage order) plot and correlation analyses showed that base composition and mutational bias have a dominant role in the codon usage variation. However, the translational selection for genes at higher expression level, where more frequent synonymous codons correspond to more abundant cognate transfer RNAs (tRNAs), was not found to be similar in all species. The study also showed that the amino acid usage in these species was significantly (P < 0.01) influenced by hydrophobicity and aromaticity of proteins. Furthermore, 24 codons that were found to be optimally used by L. sakei and its comparative study with 13 Lactobacillus species might provide some useful information for further study of molecular evolution and genetic engineering.

19.
20.
Wall DP  Herbeck JT 《Journal of molecular evolution》2003,56(6):673-88; discussion 689-90
In this study we reconstruct the evolution of codon usage bias in the chloroplast gene rbcL using a phylogeny of 92 green-plant taxa. We employ a measure of codon usage bias that accounts for chloroplast genomic nucleotide content, as an attempt to limit plausible explanations for patterns of codon bias evolution to selection- or drift-based processes. This measure uses maximum likelihood-ratio tests to compare the performance of two models, one in which a single codon is overrepresented and one in which two codons are overrepresented. The measure allowed us to analyze both the extent of bias in each lineage and the evolution of codon choice across the phylogeny. Despite predictions based primarily on the low G + C content of the chloroplast and the high functional importance of rbcL, we found large differences in the extent of bias, suggesting differential molecular selection that is clade specific. The seed plants and simple leafy liverworts each independently derived a low level of bias in rbcL, perhaps indicating relaxed selectional constraint on molecular changes in the gene. Overrepresentation of a single codon was typically plesiomorphic, and transitions to overrepresentation of two codons occurred commonly across the phylogeny, possibly indicating biochemical selection. The total codon bias in each taxon, when regressed against the total bias of each amino acid, suggested that twofold amino acids play a strong role in inflating the level of codon usage bias in rbcL, despite the fact that twofolds compose a minority of residues in this gene. Those amino acids that contributed most to the total codon usage bias of each taxon are known through amino acid knockout and replacement to be of high functional importance. This suggests that codon usage bias may be constrained by particular amino acids and, thus, may serve as a good predictor of what residues are most important for protein fitness.
