首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
3.
Rao Y  Wu G  Wang Z  Chai X  Nie Q  Zhang X 《DNA research》2011,18(6):499-512
Synonymous codons are used with different frequencies both among species and among genes within the same genome and are controlled by neutral processes (such as mutation and drift) as well as by selection. Up to now, a systematic examination of the codon usage for the chicken genome has not been performed. Here, we carried out a whole genome analysis of the chicken genome by the use of the relative synonymous codon usage (RSCU) method and identified 11 putative optimal codons, all of them ending with uracil (U), which is significantly departing from the pattern observed in other eukaryotes. Optimal codons in the chicken genome are most likely the ones corresponding to highly expressed transfer RNA (tRNAs) or tRNA gene copy numbers in the cell. Codon bias, measured as the frequency of optimal codons (Fop), is negatively correlated with the G + C content, recombination rate, but positively correlated with gene expression, protein length, gene length and intron length. The positive correlation between codon bias and protein, gene and intron length is quite different from other multi-cellular organism, as this trend has been only found in unicellular organisms. Our data displayed that regional G + C content explains a large proportion of the variance of codon bias in chicken. Stepwise selection model analyses indicate that G + C content of coding sequence is the most important factor for codon bias. It appears that variation in the G + C content of CDSs accounts for over 60% of the variation of codon bias. This study suggests that both mutation bias and selection contribute to codon bias. However, mutation bias is the driving force of the codon usage in the Gallus gallus genome. Our data also provide evidence that the negative correlation between codon bias and recombination rates in G. gallus is determined mostly by recombination-dependent mutational patterns.  相似文献   

4.
The analysis on codon usage bias of GPAT gene of Camellia sinensis (L.) O. Kuntze may provide a basis for understanding the evolution relationship of C. sinensis and for selecting appropriate host expression systems to improve the expression of target genes. In the present study, the coding sequence of CsGPAT was analyzed with CodonW, CHIPS and CUSP programs, and compared with the genome of C. sinensis and GPAT genes of other 9 plant species. Our results showed that the cluster tree based on CDs could reveal the evolutional relations among the 10 plant species, whereas the cluster tree based on relative synonymous codon usage (RSCU) could not. There were 31 codons showing distinct usage differences between CsGPAT and genome of Escherichia coli, 21 between CsGPAT and yeast, but 13 between CsGPAT and Arabidopsis thaliana. But there were slightly fewer differences in codon usage between CsGPAT and A. thaliana. Therefore, the A. thaliana expression system may be more suitable for the expression of CsGPAT. These results may improve our understanding of the codon usage bias and functional studies of CsGPAT.  相似文献   

5.

Background

The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori.

Results

A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans).

Conclusions

The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the “optimal codons” of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1596-z) contains supplementary material, which is available to authorized users.  相似文献   

6.
Marine unicellular cyanobacteria, represented by Synechococcus and Prochlorococcus, dominate the total phytoplankton biomass and production in oligotrophic ocean. In this study, we employed comparative genomics approaches to extensively investigate synonymous codon usage bias and evolutionary rates in a large number of closely related species of marine unicellular cyanobacteria. Although these two groups of marine cyanobacteria have a close phylogenetic relationship, we find that they are highly divergent not only in codon usage patterns but also in the driving forces behind the diversification. It is revealed that in Prochlorococcus, mutation and genome compositional constraints are the main forces contributing to codon usage bias, whereas in Synechococcus, translational selection. In addition, nucleotide substitution rate analysis indicates that they are not evolving at a constant rate after the divergence and that the average dN/dS values of core genes in Synechococcus are significantly higher than those in Prochlorococcus. Our evolutionary genomic analysis provides the first insight into codon usage, evolutionary genetic mechanisms and environmental adaptation of Synechococcus and Prochlorococcus after divergence.  相似文献   

7.
Hepatitis C virus (HCV) infection is among the leading causes of hepatocellular carcinoma and liver cirrhosis globally, with a high economic burden. The disease progression is well established, but less is known about the spontaneous HCV infection clearance. This study tries to establish the relationship between codon biasness and expression of HCV clearance candidate genes in normal and HCV infected liver tissues. A total of 112 coding sequences comprising 151 679 codons were subjected to the computation of codon indices, namely relative synonymous codon usage, an effective number of codon (Nc), frequency of optimal codon, codon adaptation index, codon bias index, and base compositions. Codon indices report of GC3s, GC12, hydropathicity, and aromaticity implicates both mutational and translational selection in the candidate gene set. This was further correlated with the differentially expressed genes among the selected genes using BioGPS. A significant correlation is observed between the gene expression of normal liver and cancerous liver tissues with codon bias (Nc). Gene expression is also correlated with relative codon bias values, indicating that CCL5, APOA2, CD28, IFITM1, and TNFSF4 genes have higher expression. These results are quite encouraging in selecting the high responsive genes in HCV clearance. However, there could be additional genes which could also orchestrate the clearance role with the above mentioned first line of defensive genes.  相似文献   

8.
The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal that how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.  相似文献   

9.
落叶松-杨栅锈菌基因组密码子使用偏好分析   总被引:1,自引:0,他引:1  
周显臻  曹支敏  于丹 《菌物学报》2020,39(2):289-297
为了解落叶松‐杨栅锈菌密码子使用模式,并探究影响其密码子偏好形成的因素,本研究利用CondonW对落叶松‐杨栅锈菌标准菌株98AG31基因组中14 650个基因进行分析,计算基因的有效密码子数,及64个密码子的相对使用度等偏好性参数。结果表明,落叶松‐杨栅锈菌全基因组水平的密码子偏好程度较低,只有少数基因呈现出高偏好性。落叶松‐杨栅锈菌的高频密码子多以A或T结尾,而最优密码子则倾向以G或C结尾。PR2-plot分析及ENC-plot曲线与中性绘图分析显示,落叶松‐杨栅锈菌基因密码子使用模式受到选择压力和突变压力等多重因素的影响,相较于选择压力,落叶松‐杨栅锈菌基因密码子的偏好更多地受到突变压力的影响。相关性分析表明,密码子碱基组成会对密码子偏好性产生影响,其他因素如序列长度等均不会影响密码子偏好性。  相似文献   

10.
Suo G  Chen B  Zhang J  Duan Z  He Z  Yao W  Yue C  Dai J 《Plant cell reports》2006,25(7):689-697
Bone morphogenetic protein 2 (BMP2) has great potential in therapeutic applications. We are working on generating transgenic plants as a bioreactor to produce BMP2. We have studied the effects of codon optimization on the expression of human BMP2 (hBMP2) in tobacco plants. Three modified hBMP2 genes were transformed into tobacco under the control of either cauliflower mosaic virus 35S (CaMV35S) promoter or double-CaMV35S promoter plus alfalfa mosaic virus (AMV) enhancer. The fused β-glucuronidase (GUS) reporter gene was used to facilitate the assay of protein expression. The results indicated that codon optimization could increase the protein expression level obviously under CaMV35S promoter. However, under relatively stronger initiation condition (double-CaMV35S promoter plus AMV enhancer), only the gene with the lowest degree of codon optimization could increase the protein expression level. Our findings suggest that the action of codon optimization may be influenced by the factors of promoter strength and A+T content in tobacco plants.  相似文献   

11.
Codon usage: nature's roadmap to expression and folding of proteins   总被引:1,自引:0,他引:1  
  相似文献   

12.
Esophageal cancer involves multiple genetic alternations. A systematic codon usage bias analysis was completed to investigate the bias among the esophageal cancer responsive genes. GC-rich genes were low (average effective number of codon value was 49.28). CAG and GTA are over-represented and under-represented codons, respectively. Correspondence analysis, neutrality plot, and parity rule 2 plot analysis confirmed the dominance over mutation pressure in modulating the codon usage pattern of genes linked with esophageal cancer.  相似文献   

13.
Studying phage codon adaptation is important not only for understanding the process of translation elongation, but also for reengineering phages for medical and industrial purposes. To evaluate the effect of mutation and selection on phage codon usage, we developed an index to measure selection imposed by host translation machinery, based on the difference in codon usage between all host genes and highly expressed host genes. We developed linear and nonlinear models to estimate the C→T mutation bias in different phage lineages and to evaluate the relative effect of mutation and host selection on phage codon usage. C→T-biased mutations occur more frequently in single-stranded DNA (ssDNA) phages than in double-stranded DNA (dsDNA) phages and affect not only synonymous codon usage, but also nonsynonymous substitutions at second codon positions, especially in ssDNA phages. The host translation machinery affects codon adaptation in both dsDNA and ssDNA phages, with a stronger effect on dsDNA phages than on ssDNA phages. Strand asymmetry with the associated local variation in mutation bias can significantly interfere with codon adaptation in both dsDNA and ssDNA phages.  相似文献   

14.
To understand the synonymous codon usage pattern in mitochondrial genome of Antheraea assamensis, we analyzed the 13 mitochondrial protein‐coding genes of this species using a bioinformatic approach as no work was reported yet. The nucleotide composition analysis suggested that the percentages of A, T, G,and C were 33.73, 46.39, 9.7 and 10.17, respectively and the overall GC content was 19.86, that is, lower than 50% and the genes were AT rich. The mean effective number of codons of mitochondrial protein‐coding genes was 36.30 and it indicated low codon usage bias (CUB). Relative synonymous codon usage analysis suggested overrepresented and underrepresented codons in each gene and the pattern of codon usage was different among genes. Neutrality plot analysis revealed a narrow range of distribution for GC content at the third codon position and some points were diagonally distributed, suggesting both mutation pressure and natural selection influenced the CUB.  相似文献   

15.
为确定瑶药紫九牛叶绿体基因组密码子的使用模式及其成因,该研究以紫九牛叶绿体基因组50条蛋白质编码序列为研究对象,利用Codon W 1.4.2和在线软件CUSP和Chips分析其密码子偏好性。结果表明:(1)RSCU>1的密码子有29个,其中有28个以A/U结尾,说明叶绿体基因组的同义密码子中偏好以A/U结尾。(2)紫九牛叶绿体基因组密码子的GC含量GC1(47.38%)>GC2(39.81%)>GC3(29.60%),ENC值大于45的有40个,说明紫九牛叶绿体基因组存在较弱的偏性。(3)中性绘图分析和ENC-plot分析说明了紫九牛叶绿体基因组密码子的偏好性既受到选择的作用,又受到突变因素的影响。(4)通过构建的高低基因表达库最终确定了15个最优密码子,分别为UUG、AUU、GUU、GUA、UCU、 CCU、ACU、ACA、GCU、CAA、AAC、GAA、UGU、CGU和GGU。该研究为紫九牛叶绿体基因组的确定以及遗传多样性分析提供了依据。  相似文献   

16.
Genetic variation was estimated in ten samples populations of Aedes aegypti from the Brazilian Amazon, by using a 380 bp fragment of the mitochocondrial NADH dehydrogenase subunit 4 (ND4) gene. A total of 123 individuals were analyzed, whereby 13 haplotypes were found. Mean genetic diversity was slightly high (h = 0.666 ± 0.029; π = 0.0115 ± 0.0010). Two AMOVA analyses indicated that most of the variation (~70%-72%) occurred within populations. The variation found among and between populations within the groups disclosed lower, but even so, highly significant values. F(ST) values were not significant in most of the comparisons, except for the samples from Pacaraima and Rio Branco. The isolation by distance (IBD) model was not significant (r = 0.2880; p = 0.097) when the samples from Pacaraima and Rio Branco were excluded from the analyses, this indicating that genetic distance is not related to geographic distance. This result may be explained either by passive dispersal patterns (via human migrations and commercial exchange) or be due to the recent expansion of this mosquito in the Brazilian Amazon. Phylogenetic relationship analysis showed two genetically distinct groups (lineages) within the Brazilian Amazon, each sharing haplotypes with populations from West Africa and Asia.  相似文献   

17.
Codon usage and base composition in sequences from the A + T-rich genome ofRickettsia prowazekii, a member of the alpha Proteobacteria, have been investigated. Synonymous codon usage patterns are roughly similar among genes, even though the data set includes genes expected to be expressed at very different levels, indicating that translational selection has been ineffective in this species. However, multivariate statistical analysis differentiates genes according to their G + C contents at the first two codon positions. To study this variation, we have compared the amino acid composition patterns of 21R. prowazekii proteins with that of a homologous set of proteins fromEscherichia coli. The analysis shows that individual genes have been affected by biased mutation rates to very different extents: genes encoding proteins highly conserved among other species being the least affected. Overall, protein coding and intergenic spacer regions have G + C content values of 32.5% and 21.4%, respectively. Extrapolation from these values suggests thatR. prowazekii has around 800 genes and that 60–70% of the genome may be coding. Correspondence to: S.G.E. Andersson  相似文献   

18.
In the present study, we examined the codon usage bias between pseudorabies virus (PRV) US1 gene and the US1-like genes of 20 reference alphaherpesviruses. Comparative analysis showed noticeable disparities of the synonymous codon usage bias in the 21 alphaherpesviruses, indicated by codon adaptation index, effective number of codons (ENc) and GC3s value. The codon usage pattern of PRV US1 gene was phylogenetically conserved and similar to that of the US1-like genes of the genus Varicellovirus of alphaherpesvirus, with a strong bias towards the codons with C and G at the third codon position. Cluster analysis of codon usage pattern of PRV US1 gene with its reference alphaherpesviruses demonstrated that the codon usage bias of US1-like genes of 21 alphaherpesviruses had a very close relation with their gene functions. ENc-plot revealed that the genetic heterogeneity in PRV US1 gene and the 20 reference alphaherpesviruses was constrained by G+C content, as well as the gene length. In addition, comparison of codon preferences in the US1 gene of PRV with those of E. coli, yeast and human revealed that there were 50 codons showing distinct usage differences between PRV and yeast, 49 between PRV and human, but 48 between PRV and E. coli. Although there were slightly fewer differences in codon usages between E.coli and PRV, the difference is unlikely to be statistically significant, and experimental studies are necessary to establish the most suitable expression system for PRV US1. In conclusion, these results may improve our understanding of the evolution, pathogenesis and functional studies of PRV, as well as contributing to the area of herpesvirus research or even studies with other viruses.  相似文献   

19.
In this study we reconstruct the evolution of codon usage bias in the chloroplast gene rbcL using a phylogeny of 92 green-plant taxa. We employ a measure of codon usage bias that accounts for chloroplast genomic nucleotide content, as an attempt to limit plausible explanations for patterns of codon bias evolution to selection- or drift-based processes. This measure uses maximum likelihood-ratio tests to compare the performance of two models, one in which a single codon is overrepresented and one in which two codons are overrepresented. The measure allowed us to analyze both the extent of bias in each lineage and the evolution of codon choice across the phylogeny. Despite predictions based primarily on the low G+C content of the chloroplast and the high functional importance of rbcL, we found large differences in the extent of bias, suggesting differential molecular selection that is clade specific. The seed plants and simple leafy liverworts each independently derived a low level of bias in rbcL, perhaps indicating relaxed selectional constraint on molecular changes in the gene. Overrepresentation of a single codon was typically plesiomorphic, and transitions to overrepresentation of two codons occurred commonly across the phylogeny, possibly indicating biochemical selection. The total codon bias in each taxon, when regressed against the total bias of each amino acid, suggested that twofold amino acids play a strong role in inflating the level of codon usage bias in rbcL, despite the fact that twofolds compose a minority of residues in this gene. Those amino acids that contributed most to the total codon usage bias of each taxon are known through amino acid knockout and replacement to be of high functional importance. This suggests that codon usage bias may be constrained by particular amino acids and, thus, may serve as a good predictor of what residues are most important for protein fitness. Present address (Joshua T. Herbeck): JBP Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA 02543, USA  相似文献   

20.
Sequences of the complete protein-coding portions of the mitochondrial (mt) genome were analysed for 6 species of cestodes (including hydatid tapeworms and the pork tapeworm) and 5 species of trematodes (blood flukes and liver- and lung-flukes). A near-complete sequence was also available for an additional trematode (the blood fluke Schistosoma malayensis). All of these parasites belong to a large flatworm taxon named the Neodermata. Considerable variation was found in the base composition of the protein-coding genes among these neodermatans. This variation was reflected in statistically-significant differences in numbers of each inferred amino acid between many pairs of species. Both convergence and divergence in nucleotide, and hence amino acid, composition was noted among groups within the Neodermata. Considerable variation in skew (unequal representation of complementary bases on the same strand) was found among the species studied. A pattern is thus emerging of diversity in the mt genome in neodermatans that may cast light on evolution of mt genomes generally.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号