首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 515 毫秒
1.
Background: Mitochondrial ND gene, which encodes NADH dehydrogenase, is the first enzyme of the mitochondrial electron transport chain. Leigh syndrome, a neurodegenerative disease caused by mutation in the ND2 gene (T4681C), is associated with bilateral symmetric lesions in basal ganglia and subcortical brain regions. Therefore, it is of interest to analyze mitochondrial DNA to glean information for evolutionary relationship. This study highlights on the analysis of compositional dynamics and selection pressure in shaping the codon usage patterns in the coding sequence of MT-ND2 gene across pisces, aves and mammals by using bioinformatics tools like effective number of codons (ENC), codon adaptation index (CAI), relative synonymous codon usage (RSCU) etc. Results: We observed a low codon usage bias as reflected by high ENC values in MT-ND2 gene among pisces, aves and mammals. The most frequently used codons were ending with A/C at the 3rd position of codon and the gene was AT rich in all the three classes. The codons TCA, CTA, CGA and TGA were over represented in all three classes. The F1 correspondence showed significant positive correlation with G, T3 and CAI while the F2 axis showed significant negative correlation with A and T but significant positive correlation with G, C, G3, C3, ENC, GC, GC1, GC2 and GC3. Conclusions: The codon usage bias in MTND2 gene is not associated with expression level. Mutation pressure and natural selection affect the codon usage pattern in MT-ND 2 gene.  相似文献   

2.
Liu Q 《Bio Systems》2006,85(2):99-106
The main factors shaping codon usage bias in the Deinococcus radiodurans genome were reported. Correspondence analysis (COA) was carried out to analyze synonymous codon usage bias. The results showed that the main trend was strongly correlated with gene expression level assessed by the "Codon Adaptation Index" (CAI) values, a result that was confirmed by the distribution of genes along the first axis. The results of correlation analysis, variance analysis and neutrality plot indicated that gene nucleotide composition was clearly contributed to codon bias. CDS length was also key factor in dictating codon usage variation. A general tendency of more biased codon usage of genes with longer CDS length to higher expression level was found. Further, the hydrophobicity of each protein also played a role in shaping codon usage in this organism, which could be confirmed by the significant correlation between the positions of genes placed on the first axis and the hydrophobicity values (r=-0.100, P<0.01). In summary, gene expression level played a crucial role, nucleotide mutational bias, CDS length and the hydrophobicity of each protein just in a minor way in shaping the codon usage pattern of D. radiodurans. Notably, 19 codons firstly defined as "optimal codons" may provide useful clues for molecular genetic engineering and evolutionary studying.  相似文献   

3.
Codon usage in Clonorchis sinensis was analyzed using 12,515 codons from 38 coding sequences. Total GC content was 49.83%, and GC1, GC2 and GC3 contents were 56.32%, 43.15% and 50.00%, respectively. The effective number of codons converged at 51-53 codons. When plotted against total GC content or GC3, codon usage was distributed in relation to GC3 biases. Relative synonymous codon usage for each codon revealed a single major trend, which was highly correlated with GC content at the third position when codons began with A or U at the first two positions. In codons beginning with G or C base at the first two positions, the G or C base rarely occurred at the third position. These results suggest that codon usage is shaped by a bias towards G or C at the third base, and that this is affected by the first and second bases.  相似文献   

4.
Synonymous codon usage of 53 protein coding genes in chloroplast genome of Coffea arabica was analyzed for the first time to find out the possible factors contributing codon bias. All preferred synonymous codons were found to use A/T ending codons as chloroplast genomes are rich in AT. No difference in preference for preferred codons was observed in any of the two strands, viz., leading and lagging strands. Complex correlations between total base compositions (A, T, G, C, GC) and silent base contents (A3, T3, G3, C3, GC3) revealed that compositional constraints played crucial role in shaping the codon usage pattern of C. arabica chloroplast genome. ENC Vs GC3 plot grouped majority of the analyzed genes on or just below the left side of the expected GC3 curve indicating the influence of base compositional constraints in regulating codon usage. But some of the genes lie distantly below the continuous curve confirmed the influence of some other factors on the codon usage across those genes. Influence of compositional constraints was further confirmed by correspondence analysis as axis 1 and 3 had significant correlations with silent base contents. Correlation of ENC with axis 1, 4 and CAI with 1, 2 prognosticated the minor influence of selection in nature but exact separation of highly and lowly expressed genes could not be seen. From the present study, we concluded that mutational pressure combined with weak selection influenced the pattern of synonymous codon usage across the genes in the chloroplast genomes of C. arabica.  相似文献   

5.
It is well known that an amino acid can be encoded by more than one codon, called synonymous codons. The preferential use of one particular codon for coding an amino acid is referred to as codon usage bias (CUB). A quantitative analytical method, CUB and a related tool, Codon Adaptative Index have been applied to comparatively study whole genomes of a few pathogenic Trypanosomatid species. This quantitative attempt is of direct help in the comparison of qualitative features like mutational and translational selection. Pathogens of the Leishmania and Trypanosoma genus cause debilitating disease and suffering in human beings and animals. Of these, whole genome sequences are available for only five species. The complete coding sequences (CDS), highly expressed, essential and low expressed genes have all been studied for their CUB signature. The codon usage bias of essential genes and highly expressed genes show distribution similar to codon usage bias of all CDSs in Trypanosomatids. Translational selection is the dominant force selecting the preferred codon, and selection due to mutation is negligible. In contrast to an earlier study done on these pathogens, it is found in this work that CUB and CAI may be used to distinguish the Trypanosomatid genomes at the sub-genus level. Further, CUB may effectively be used as a signature of the species differentiation by using Principal Component Analysis (PCA).

Abbreviations

CUB - Codon Usage Bias, CAI - Codon Adaptative Index, CDS - Coding sequences, t-RNA - Transfer RNA, PCA - Principal Component Analysis.  相似文献   

6.
Mitogen activated protein kinase (MAPK) genes provide resistance to various biotic and abiotic stresses. Codon usage profiling of the genes reveals the characteristic features of the genes like nucleotide composition, gene expressivity, optimal codons etc. The present study is a comparative analysis of codon usage patterns for different MAPK genes in three organisms, viz. Arabidopsis thaliana, Glycine max (soybean) and Oryza sativa (rice). The study has revealed a high AT content in MAPK genes of Arabidopsis and soybean whereas in rice a balanced AT-GC content at the third synonymous position of codon. The genes show a low bias in codon usage profile as reflected in the higher values (50.83 to 56.55) of effective number of codons (Nc). The prediction of gene expression profile in the MAPK genes revealed that these genes might be under the selective pressure of translational optimization as reflected in the low codon adaptation index (CAI) values ranging from 0.147 to 0.208.  相似文献   

7.
Analysis of synonymous codon usage pattern in the genome of a thermophilic cyanobacterium, Thermosynechococcus elongatus BP-1 using multivariate statistical analysis revealed a single major explanatory axis accounting for codon usage variation in the organism. This axis is correlated with the GC content at third base of synonymous codons (GC3s) in correspondence analysis taking T. elongatus genes. A negative correlation was observed between effective number of codons i.e. Nc and GC3s. Results suggested a mutational bias as the major factor in shaping codon usage in this cyanobacterium. In comparison to the lowly expressed genes, highly expressed genes of this organism possess significantly higher proportion of pyrimidine-ending codons suggesting that besides, mutational bias, translational selection also influenced codon usage variation in T. elongatus. Correspondence analysis of relative synonymous codon usage (RSCU) with A, T, G, C at third positions (A3s, T3s, G3s, C3s, respectively) also supported this fact and expression levels of genes and gene length also influenced codon usage. A role of translational accuracy was identified in dictating the codon usage variation of this genome. Results indicated that although mutational bias is the major factor in shaping codon usage in T. elongatus, factors like translational selection, translational accuracy and gene expression level also influenced codon usage variation.  相似文献   

8.

Background

The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori.

Results

A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans).

Conclusions

The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the “optimal codons” of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1596-z) contains supplementary material, which is available to authorized users.  相似文献   

9.
In many unicellular organisms, invertebrates, and plants, synonymous codon usage biases result from a coadaptation between codon usage and tRNAs abundance to optimize the efficiency of protein synthesis. However, it remains unclear whether natural selection acts at the level of the speed or the accuracy of mRNAs translation. Here we show that codon usage can improve the fidelity of protein synthesis in multicellular species. As predicted by the model of selection for translational accuracy, we find that the frequency of codons optimal for translation is significantly higher at codons encoding for conserved amino acids than at codons encoding for nonconserved amino acids in 548 genes compared between Caenorhabditis elegans and Homo sapiens. Although this model predicts that codon bias correlates positively with gene length, a negative correlation between codon bias and gene length has been observed in eukaryotes. This suggests that selection for fidelity of protein synthesis is not the main factor responsible for codon biases. The relationship between codon bias and gene length remains unexplained. Exploring the differences in gene expression process in eukaryotes and prokaryotes should provide new insights to understand this key question of codon usage. Received: 18 June 2000 / Accepted: 10 November 2000  相似文献   

10.
11.
Abstract

Norovirus GII.4 variants, a genotype in genogroup II belonging to the genus Norovirus, is a single-strand positive sense RNA containing three open reading frames (ORF1, ORF2 and ORF3) and is the most important pathogen causing nonbacterial gastroenteritis outbreaks. By using bioinformatic softwares such as Codon W, SPSS and so on, a total of 292 strains of the viruses isolated from 1974 to 2016 were analyzed for nucleotide composition and synonymous codon usage in each ORF. The result shows that it is enriched for A over the other bases in nucleotide composition, G behind the other bases in the 3rd site of all synonymous codons in the three ORFs. The patterns of nucleotide composition and codon bias of ORF2 are similar to those of ORF3 and different from those of ORF1. There are generally UpA motif and CpG motif in the codons with the lowest proportion. Correspondence analysis indicates that the codon usage may be changing over a certain time period for ORF1 in 2006 and 2012, ORF2 in 2012, and ORF3 in 2013. ENC (effective number of codons) plot and other analyses indicate that both natural selection and mutational pressure play partly roles in the ORFs, but natural selection is more important for ORF2 and ORF3. Besides, we also found all optimal codons in the ORFs. The study provides a basic understanding of the mechanism for norovirus GII.4 codon usage bias. Abbreviations ORF Open Reading Frame

ENC Effective Number of Codons

COA correspondence analysis

RSCU Relative Synonymous Codon Usage

CAI Codon Adaptation Index

CBI Codon Bias Index

Fop frequency of optimal codons

L_sym number of synonymous codons

L_aa length amino acids

GRAVY grand average of hydropathicity

Aroma aromaticity

Communicated by Ramaswamy H. Sarma  相似文献   

12.
Human cytomegalovirus (HCMV) infection, a worldwide contagion, causes a serious disorder in infected individuals. Analysis of codon usage can reveal much molecular information about this virus. The effective number of codon (ENC) values, relative synonymous codon usage (RSCU) values, codon adaptation index (CAI), and nucleotide contents was investigated in approximately 160 coding sequences (CDS) among 17 human cytomegalovirus genomes using the software CodonW. Linear regression analysis and logistic regression were performed to explore the preliminary data. The results showed that, overall, HCMV genomes had low codon usage bias (mean ENC = 47.619). However, the ENC of individual CDS varied widely and was distributed unevenly between host-related genes and viral-self-function genes (P = 0.002, odds ratio (OR) = 3.194), as did the GC content (P = 0.016, OR = 2.178). The ENC values correlated with CAI, GC content, and the nucleotide composing at the 3rd codon position (GC3s) (P < 0.001). There was a significant variation in the codon preference that depended on the RSCU data. The predicted ENC curve suggested that mutational pressure, rather than natural selection, was one of the main factors that determined the codon usage bias in HCMV. Among 123 genes with known function, the genes related to viral self-replication and viral–host interaction showed different ENC and CAI values, and GC and GC3s contents. In conclusion, the detailed codon usage bias theoretically revealed information concerning HCMV evolution and could be a valuable additional parameter for HCMV gene function research.  相似文献   

13.
In the present study, we examined the codon usage bias between pseudorabies virus (PRV) US1 gene and the US1-like genes of 20 reference alphaherpesviruses. Comparative analysis showed noticeable disparities of the synonymous codon usage bias in the 21 alphaherpesviruses, indicated by codon adaptation index, effective number of codons (ENc) and GC3s value. The codon usage pattern of PRV US1 gene was phylogenetically conserved and similar to that of the US1-like genes of the genus Varicellovirus of alphaherpesvirus, with a strong bias towards the codons with C and G at the third codon position. Cluster analysis of codon usage pattern of PRV US1 gene with its reference alphaherpesviruses demonstrated that the codon usage bias of US1-like genes of 21 alphaherpesviruses had a very close relation with their gene functions. ENc-plot revealed that the genetic heterogeneity in PRV US1 gene and the 20 reference alphaherpesviruses was constrained by G+C content, as well as the gene length. In addition, comparison of codon preferences in the US1 gene of PRV with those of E. coli, yeast and human revealed that there were 50 codons showing distinct usage differences between PRV and yeast, 49 between PRV and human, but 48 between PRV and E. coli. Although there were slightly fewer differences in codon usages between E.coli and PRV, the difference is unlikely to be statistically significant, and experimental studies are necessary to establish the most suitable expression system for PRV US1. In conclusion, these results may improve our understanding of the evolution, pathogenesis and functional studies of PRV, as well as contributing to the area of herpesvirus research or even studies with other viruses.  相似文献   

14.
Codon Usage Bias and tRNA Abundance in Drosophila   总被引:5,自引:0,他引:5  
Codon usage bias of 1,117 Drosophila melanogaster genes, as well as fewer D. pseudoobscura and D. virilis genes, was examined from the perspective of relative abundance of isoaccepting tRNAs and their changes during development. We found that each amino acid contributes about equally and highly significantly to overall codon usage bias, with the exception of Asp which had very low contribution to overall bias. Asp was also the only amino acid that did not show a clear preference for one of its synonymous codons. Synonymous codon usage in Drosophila was consistent with ``optimal' codons deduced from the isoaccepting tRNA availability. Interestingly, amino acids whose major isoaccepting tRNAs change during development did not show as strong bias as those with developmentally unchanged tRNA pools. Asp is the only amino acid for which the major isoaccepting tRNAs change between larval and adult stages. We conclude that synonymous codon usage in Drosophila is well explained by tRNA availability and is probably influenced by developmental changes in relative abundance. Received: 5 December 1996 / Accepted: 14 June 1997  相似文献   

15.
Chromohalobacter salexigens, a Gammaproteobacterium belonging to the family Halomonadaceae, shows a broad salinity range for growth. In order to reveal the factors influencing architecture of protein coding genes in C. salexigens, pattern of synonymous codon usage bias has been investigated. Overall codon usage analysis of the microorganism revealed that C and G ending codons are predominantly used in all the genes which are indicative of mutational bias. Multivariate statistical analysis showed that the genes are separated along the first major explanatory axis according to their expression levels and their genomic GC content at the synonymous third positions of the codons. Both NC plot and correspondence analysis on Relative Synonymous Codon Usage (RSCU) indicates that the variation in codon usage among the genes may be due to mutational bias at the DNA level and natural selection acting at the level of mRNA translation. Gene length and the hydrophobicity of the encoded protein also influence the codon usage variation of genes to some extent. A comparison of the relative synonymous codon usage between 10% each of highly and lowly expressed genes determines 23 optimal codons, which are statistically over represented in the former group of genes and may provide useful information for salt-stressed gene prediction and gene-transformation. Furthermore, genes for regulatory functions; mobile and extrachromosomal element functions; and cell envelope are observed to be highly expressed. The study could provide insight into the gene expression response of halophilic bacteria and facilitate establishment of effective strategies to develop salt-tolerant crops of agronomic value.  相似文献   

16.
Rao Y  Wu G  Wang Z  Chai X  Nie Q  Zhang X 《DNA research》2011,18(6):499-512
Synonymous codons are used with different frequencies both among species and among genes within the same genome and are controlled by neutral processes (such as mutation and drift) as well as by selection. Up to now, a systematic examination of the codon usage for the chicken genome has not been performed. Here, we carried out a whole genome analysis of the chicken genome by the use of the relative synonymous codon usage (RSCU) method and identified 11 putative optimal codons, all of them ending with uracil (U), which is significantly departing from the pattern observed in other eukaryotes. Optimal codons in the chicken genome are most likely the ones corresponding to highly expressed transfer RNA (tRNAs) or tRNA gene copy numbers in the cell. Codon bias, measured as the frequency of optimal codons (Fop), is negatively correlated with the G + C content, recombination rate, but positively correlated with gene expression, protein length, gene length and intron length. The positive correlation between codon bias and protein, gene and intron length is quite different from other multi-cellular organism, as this trend has been only found in unicellular organisms. Our data displayed that regional G + C content explains a large proportion of the variance of codon bias in chicken. Stepwise selection model analyses indicate that G + C content of coding sequence is the most important factor for codon bias. It appears that variation in the G + C content of CDSs accounts for over 60% of the variation of codon bias. This study suggests that both mutation bias and selection contribute to codon bias. However, mutation bias is the driving force of the codon usage in the Gallus gallus genome. Our data also provide evidence that the negative correlation between codon bias and recombination rates in G. gallus is determined mostly by recombination-dependent mutational patterns.  相似文献   

17.
Unequal use of synonymous codons has been found in several prokaryotic and eukaryotic genomes. This bias has been associated with translational efficiency. The prevalence of this bias across lineages is currently unknown. Here, a new method (GCB) to measure codon usage bias is presented. It uses an iterative approach for the determination of codon scores and allows the computation of an index of codon bias suitable for interspecies comparison. A server to calculate GCB-values of individual genes as well as a list of compiled results are available at . The method was applied to complete bacterial genomes. The relation of codon usage bias with amino acid composition and the choice of stop codons were determined and discussed.  相似文献   

18.
Summary We examined the codon usages in wellconserved and less-well-conserved regions of vertebrate protein genes and found them to be similar. Despite this similarity, there is a statistically significant decrease in codon bias in the less-well-conserved regions. Our analysis suggests that although those codon changes initially fixed under amino acid replacements tend to follow the overall codon usage pattern, they also reduce the bias in codon usage. This decrease in codon bias leads one to predict that the rate of change of synonymous codons should be greater in those regions that are less well conserved at the amino acid level than in the better-conserved regions. Our analysis supports this prediction. Furthermore, we demonstrate a significantly elevated rate of change of synonymous codons among the adjacent codons 5 to amino acid replacement positions. This provides further support for the idea that there are contextual constraints on the choice of synonymous codons in eukaryotes.  相似文献   

19.
In this study, major factors shaping codon and amino acid usage variation Lactobacillus sakei 23K were investigated. It included 13 other Lactobacillus species for a comparative analysis. The correspondence analysis (COA) showed that in 13 species the major trend of synonymous codon usage was highly correlated with gene expression level as assessed by the “Codon Adaptation Index” (CAI) values. In addition, Nc (effective number of codons) plot, SCUO (synonymous codon usage order) plot and correlation analyses showed that the base composition and mutational bias have dominant role in the codon usage variation. However, the translational selection for genes at higher expression level, where more frequent synonymous codons correspond to more abundant cognate transfer RNAs (tRNAs), was not found to be similar in all species. The study also showed that the amino acid usage in these species was significantly (P < 0.01) influenced by hydrophobicity and aromaticity of proteins. Furthermore, 24 codons that were found to be optimally used by L. sakei and its comparative study with 13 Lactobacillus species might provide some useful information in their further study of molecular evolution and genetic engineering.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号