首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Analysis of synonymous codon usage pattern in the genome of a thermophilic cyanobacterium, Thermosynechococcus elongatus BP-1 using multivariate statistical analysis revealed a single major explanatory axis accounting for codon usage variation in the organism. This axis is correlated with the GC content at third base of synonymous codons (GC3s) in correspondence analysis taking T. elongatus genes. A negative correlation was observed between effective number of codons i.e. Nc and GC3s. Results suggested a mutational bias as the major factor in shaping codon usage in this cyanobacterium. In comparison to the lowly expressed genes, highly expressed genes of this organism possess significantly higher proportion of pyrimidine-ending codons suggesting that besides, mutational bias, translational selection also influenced codon usage variation in T. elongatus. Correspondence analysis of relative synonymous codon usage (RSCU) with A, T, G, C at third positions (A3s, T3s, G3s, C3s, respectively) also supported this fact and expression levels of genes and gene length also influenced codon usage. A role of translational accuracy was identified in dictating the codon usage variation of this genome. Results indicated that although mutational bias is the major factor in shaping codon usage in T. elongatus, factors like translational selection, translational accuracy and gene expression level also influenced codon usage variation.  相似文献   

2.
Basak S  Roy S  Ghosh TC 《FEBS letters》2007,581(30):5825-5830
Synonymous codon usage analysis between thermophilic and mesophilic prokaryotes has gained wide attention in recent years. Although it is known that thermophilic and mesophilic prokaryotes use different subset of synonymous codons, no reason for this difference is known so far. In the present communication, by analyzing a large number of thermophilic and mesophilic prokaryotes, we provide evidence that bias in the selection of synonymous codons between thermophilic and mesophilic prokaryotes is related to differential folding pattern of mRNA secondary structures. Moreover, we observe that error-minimizing property has significant influence in differentiating the synonymous codon usage between thermophilic and mesophilic prokaryotes. Biological implications of these results are discussed.  相似文献   

3.
4.
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.  相似文献   

5.
6.
Fungal xylanases has important applications in food, baking, pulp and paper industries in addition to various other industries. Xylanases are produced extensively by both bacterial and fungal sources and has tremendous potential of being active at extremes of temperature and pH. In the present study an effort has been made to explore the codon bias perspective of this potential enzyme using bioinformatics tools. Multivariate analysis has been used as a tool to study codon bias perspectives of xylanases. It was further observed that the codon usage of xylanases genes from different fungal sources is not similar and to reveal this phenomenon the relative synonymous codon usage (RSCU) and base composition variation in fungal xylanase genes were also studied. The codon biasing data like GC content at third position (GC3S), effective codon number (NC), codon adaptive index (CAI) were further analyzed with statistical softwares like Sigma1plot 9.0 and Systat 11.0. Furthermore, study of translation selection was also performed to verify the influences of codon usage variation among the 94 xylanase genes. In the present study xylanase gene from 12 organisms were analyzed and codon usages of all xylanases from each organism were compared separately. Analysis indicates biased codon among all 12 fungi taken for study with Aspergillus nidulans, Chaetomium globosum, Aspergillus terreus and Aspergillus clavatus showing maximum biasing. NC plot and correspondence analysis on relative synonymous codon usage indicate that mutation bias and translation selection influences codon usage variation in fungal xylanase gene. To reveal the relative synonymous codon usage and base composition variation in xylanase, 94 genes from 12 fungi were used as model system.  相似文献   

7.
We examined the codon usages in well-conserved and less-well-conserved regions of vertebrate protein genes and found them to be similar. Despite this similarity, there is a statistically significant decrease in codon bias in the less-well-conserved regions. Our analysis suggests that although those codon changes initially fixed under amino acid replacements tend to follow the overall codon usage pattern, they also reduce the bias in codon usage. This decrease in codon bias leads one to predict that the rate of change of synonymous codons should be greater in those regions that are less well conserved at the amino acid level than in the better-conserved regions. Our analysis supports this prediction. Furthermore, we demonstrate a significantly elevated rate of change of synonymous codons among the adjacent codons 5' to amino acid replacement positions. This provides further support for the idea that there are contextual constraints on the choice of synonymous codons in eukaryotes.  相似文献   

8.
《Genomics》2020,112(2):1319-1329
NKX-2.5 gene is responsible for cardiac development and its targeted disruption apprehends cardiac development at the linear heart tube stage. Bioinformatic analysis was employed to investigate the codon usage pattern and dN/dS of mammalian NKX-2.5 gene. The relative synonymous codon usage analysis revealed variation in codon usage and two synonymous codons namely ATA (Ile) and GTA (Val) were absent in NKX-2.5 gene across selected mammalian species suggesting that these two codons were possibly selected against during evolution. Parity rule 2 analysis of two and four fold amino acids showed CT bias whereas six-fold amino acids revealed GA bias. Neutrality analysis suggests that selection played a prominent role while mutation had a minor role. The dN/dS analysis suggests synonymous substitution played a significant role and it negatively correlated with p-distance of the gene. Purifying natural selection played a dominant role in the genetic evolution of NKX-2.5 gene in mammals.  相似文献   

9.
Rao Y  Wu G  Wang Z  Chai X  Nie Q  Zhang X 《DNA research》2011,18(6):499-512
Synonymous codons are used with different frequencies both among species and among genes within the same genome and are controlled by neutral processes (such as mutation and drift) as well as by selection. Up to now, a systematic examination of the codon usage for the chicken genome has not been performed. Here, we carried out a whole genome analysis of the chicken genome by the use of the relative synonymous codon usage (RSCU) method and identified 11 putative optimal codons, all of them ending with uracil (U), which is significantly departing from the pattern observed in other eukaryotes. Optimal codons in the chicken genome are most likely the ones corresponding to highly expressed transfer RNA (tRNAs) or tRNA gene copy numbers in the cell. Codon bias, measured as the frequency of optimal codons (Fop), is negatively correlated with the G + C content, recombination rate, but positively correlated with gene expression, protein length, gene length and intron length. The positive correlation between codon bias and protein, gene and intron length is quite different from other multi-cellular organism, as this trend has been only found in unicellular organisms. Our data displayed that regional G + C content explains a large proportion of the variance of codon bias in chicken. Stepwise selection model analyses indicate that G + C content of coding sequence is the most important factor for codon bias. It appears that variation in the G + C content of CDSs accounts for over 60% of the variation of codon bias. This study suggests that both mutation bias and selection contribute to codon bias. However, mutation bias is the driving force of the codon usage in the Gallus gallus genome. Our data also provide evidence that the negative correlation between codon bias and recombination rates in G. gallus is determined mostly by recombination-dependent mutational patterns.  相似文献   

10.
We have analyzed factors affecting the codon usage pattern of the chloroplasts genomes of representative species of pooid grass family. Correspondence analysis of relative synonymous codon usages (RSCU) showed that genes on secondary axis were correlated with their GC3S values (all r > 0.3, p < 0.05), indicating mutational bias as an important selective force that shaped the variation in the codon usage among chloroplast genes. The Nc-plot showed that although a majority of the points with low-Nc values were lying below the expected curve, a few genes lied on the expected curve. Nc plot clearly showed that mutational bias plays a major role in codon biology across the monocot plastomes. The hydrophobicity and aromaticity of encoded proteins of each species were found to be other factors of codon usage variation. In the view of above light, besides natural selection, several other factors also likely to be involved in determining the selective constraints on codon bias in plastomes of pooid grass genomes. In addition, five codons (B. distachyon), seven codons (H. vulgare), and four codons (T. aestivum) were identified as optimal codons of the three grass chloroplasts. To identify genes evolving under positive selection, rates of nonsynonymous substitutions (Ka) and synonymous substitutions (Ks) were computed for all groups of orthologous gene pairs.  相似文献   

11.
Summary We examined the codon usages in wellconserved and less-well-conserved regions of vertebrate protein genes and found them to be similar. Despite this similarity, there is a statistically significant decrease in codon bias in the less-well-conserved regions. Our analysis suggests that although those codon changes initially fixed under amino acid replacements tend to follow the overall codon usage pattern, they also reduce the bias in codon usage. This decrease in codon bias leads one to predict that the rate of change of synonymous codons should be greater in those regions that are less well conserved at the amino acid level than in the better-conserved regions. Our analysis supports this prediction. Furthermore, we demonstrate a significantly elevated rate of change of synonymous codons among the adjacent codons 5 to amino acid replacement positions. This provides further support for the idea that there are contextual constraints on the choice of synonymous codons in eukaryotes.  相似文献   

12.
Li Y  Wang C  Cheng X  Wu T  Zhang C 《Bio Systems》2011,104(1):42-47
Three very virulent infectious bursal disease virus (vvIBDV) strains were isolated from a single farm and shown to be phylogenetically related to the vvIBDV isolate UK661. In this study, a comparative analysis of the synonymous codon usage in the hypervariable region of theVP2 (vVP2) gene of the vvIBDV strains was done on viruses serially passaged in chicken embryos. Sequencing demonstrated that codons change during the serial passage in the vVP2 gene of the viruses. Nine codon mutations resulted in amino acids changes. The amino acid changes were I256V, I296L 6in isolate XA1989, A222P, I242V, Q253H, I256V in isolate XA1998, and Q253H, I256V, I296L in isolate XA2004. Three of the nine amino acid changes occurred at residue 256. The codons of the amino acids A232, N233, I234, T269, T283 and H338 changed to the synonymous codons in XA1989 after the 16th passage, in XA1998 after the 24th passage and in XA2004 22nd passage viruses. These mutations change the key amino acid residues Q253H and I256V in the domains which are essential for its virulence, and the synonymous codons were observed compared to classical virulent IBDV. The results indicated that the codon changes during the serial passage comprised of synonymous codon usage in the vVP2 gene of IBDV, and this synonymous codon bias was correlated with pathotypes. The extent of synonymous codon usage bias in the IBDV-vVP2 gene maybe influence the gene expression level and secondary structure of protein as well as hydrophobicity, therefore the results provide useful perspectives for evolution and understanding of the pathogenesis of IBDV.  相似文献   

13.
Biased codon usage is common in eukaryotic and prokaryotic genes. Evidence from Escherichia, Saccharomyces, and Drosophila indicates that it favors translational efficiency and accuracy. However, to date no functional advantages have been identified in the codon–anticodon interactions involving the most frequently used (preferred) codons. Here we present evidence that forces not related to the individual codon–anticodon interaction may be involved in determining which synonymous codons are preferred or avoided. We show that the ``off-frame' trinucleotide motif preferences inferrable from Drosophila coding regions are often in the same direction as Drosophila's ``in-frame' codon preferences, i.e., its codon usage. The off-frame preferences were inferred from the nonrandomness of the location of confamilial synonymous codons along coding regions—a pattern often described as a context dependence of nucleotide choice at synonymous positions or as codon-pair bias. We relied on randomizations of the location of confamilial codons that do not alter, and cannot be influenced by, the encoded amino acid sequences, codon usage, or base composition of the genes examined. The statistically significant congruency of in-frame and off-frame trinucleotide preferences suggests that the same kind of reading-frame-independent force(s) may also influence synonymous codon choice. These forces may have produced biases in codon usage that then led to the evolution of the translational advantages of these motifs as preferred codons. Under this scenario, tRNA pool size differences between preferred and nonpreferred codons initially were evolved to track the default overrepresentation of codons with preferred motifs. The motif preference hypothesis can explain the structuring of codon preferences and the similarities in the codon usages of distantly related organisms. Received: 10 November 1998 / Accepted: 23 February 1999  相似文献   

14.
Among a sample of 39 Geodia cydonium (Demospongiae, Porifera) genes, with an average G + C content of 51.2%, extensive structural heterogeneity and considerable variations in synonymous codon usage were found. The G + C content of coding sequences and G + C content at silent codon positions (GC3S) varied from 42.4 to 59.2% and from 35.6 to 76.5%, respectively. Correspondence analysis of 39 genes revealed that putative highly expressed genes preferentially use a limited subset of codons, which were therefore defined as preferred codons in G. cydonium . A total of 22 preferred codons for 18 amino acids with synonyms in codons were identified and they all (with one exception) end with C or G. Among these codons there are also C- and G-ending codons which were previously identified as codons optimal for translation in a variety of eukaryotes, including metazoans and plants. The bias in synonymous codon usage in putative highly expressed G. cydonium genes is moderate, indicating that these genes are not shaped under strong natural selection. We postulate that the preference for C- and G-ending codons was already established in the ancestor of all Metazoa, including also sponges. This ancestor most probably also had a G + C rich genome. The selection toward C- and G-ending codons has been largely conserved throughout eukaryote evolution; exceptions are, for example, mammals for which strong mutational biases caused switches from that rule.  相似文献   

15.
Evolved codon usages often pose a technical challenge over the expressing of eukaryotic genes in microbial systems because of changed translational machinery. In the present study, we investigated the translational effects of intragenic differential codon usage on the expression of the new Drosophila gene, jingwei (jgw), a chimera derived from two unrelated parental genes: Ymp and Adh. We found that jgw possesses a strong intragenic differential usage of synonymous codons, i.e. the Adh-derived C-domain has a significantly higher codon bias than that of the Ymp-derived N-domain (P=0.0023 by t-test). Additional evolutionary analysis revealed the heterogeneous distribution of rare codons, implicating its role in gene regulation and protein translation. The in vitro expression of jgw further demonstrated that the heterogeneous distribution of rare codons has played a role in regulating gene expression, particularly, affecting the quality of protein translation.  相似文献   

16.
《Genomics》2020,112(6):4657-4665
Given the high therapeutic value of the staphylococcal phage, the genome co-evolution of the phage and the host has gained great attention. Though the genome-wide AT richness in staphylococcal phages has been well-studied with nucleotide usage bias, here we proved that host factor, lifestyle and taxonomy are also important factors in understanding the phage nucleotide usages bias using information entropy formula. Such correlation is especially prominent when it comes to the synonymous codon usages of staphylococcal phages, despite the overall scattered codon usage pattern represented by principal component analysis. This strong relationship is explained by nucleotide skew which testified that the usage biases of nucleotide at different codon positions are acting on synonymous codons. Therefore, our study reveals a hidden relationship of genome evolution with host limitation and phagic phenotype, providing new insight into phage genome evolution at genetic level.  相似文献   

17.
Jian-hua Zhou 《Bio Systems》2010,101(1):20-595
The mechanism of utilization of alternative two AUGs in foot-and-mouth disease virus (FMDV) is still unknown to date. In this study, the characteristics of codon usage bias (CUB) of the region between the two AUGs (the region-La) and of the same-sized region behind the second AUG (the region-Lb) in 94 different FMDV RNA sequences were analyzed using relative synonymous codon usage (RSCU) values. The results indicated that many codons with negative CUB were preferentially used in the region-La. There were two conserved residues (Thr and Cys) on the 4th and 6th residue positions of the region-La. The conserved residues had a general tendency to choose synonymous codons with negative CUB. Although most positions in the region-La did not contain conserved residues, many positions tended to use codons with negative CUB in this region. Among these codons, the majority belonged to the amino acids containing synonymous codons with clearly positive and negative CUB, including Asp, Val, Ile, Leu, Thr, Ala, Ser, Asn and Arg. The presence of many codons with negative CUB in the region-La might impair the efficiency of the first AUG selection. The phylogenetic incongruence of the region-La and the region-Lb implied that intertypic recombination played an important role in the evolution of FMDV. Furthermore, due to the existence of more positions with positive CUB and more widespread phylogenetic incongruence in the region-Lb than the region-La, a probable relationship between the degree of CUB and the evolution of the two target regions was revealed.  相似文献   

18.
Protein structure is composed of regular secondary structural elements (α-helix and β-strand) and non-regular region. Unlike the helix and strand, the non-regular region consists of an amino acid defined as a disordered residue (DR). When compared with the effect of the helix and strand, the effect of the DR on enzyme structure and function is elusive. An Aspergillus niger GH10 xylanase (Xyn) was selected as a model molecule of (β/α)(8) because the general structure consists of ~10% enzymes. The Xyn has five N-terminal DRs and one C-terminal DR, respectively, which were deleted to construct three mutants, XynΔN, XynΔC, and XynΔNC. Each mutant was ~2-, 3-, or 4-fold more thermostable and 7-, 4-, or 4-fold more active than the Xyn. The N-terminal deletion decreased the xylanase temperature optimum for activity (T(opt)) 6 °C, but the C-terminal deletion increased its T(opt) 6 °C. The N- and C-terminal deletions had opposing effects on the enzyme T(opt) but had additive effects on its thermostability. The five N-terminal DR deletions had more effect on the enzyme kinetics but less effect on its thermo property than the one C-terminal DR deletion. CD data showed that the terminal DR deletions increased regular secondary structural contents, and hence, led to slow decreased Gibbs free energy changes (ΔG(0)) in the thermal denaturation process, which ultimately enhanced enzyme thermostabilities.  相似文献   

19.
In this study codon usage bias of all experimentally known genes of Lactococcus lactis has been analyzed. Since Lactococcus lactis is an AT rich organism, it is expected to occur A and/or T at the third position of codons and detailed analysis of overall codon usage data indicates that A and/or T ending codons are predominant in this organism. However, multivariate statistical analyses based both on codon count and on relative synonymous codon usage (RSCU) detect a large number of genes, which are supposed to be highly expressed are clustered at one end of the first major axis, while majority of the putatively lowly expressed genes are clustered at the other end of the first major axis. It was observed that in the highly expressed genes C and T ending codons are significantly higher than the lowly expressed genes and also it was observed that C ending codons are predominant in the duets of highly expressed genes, whereas the T endings codons are abundant in the quartets. Abundance of C and T ending codons in the highly expressed genes suggest that, besides, compositional biases, translational selection are also operating in shaping the codon usage variation among the genes in this organism as observed in other compositionally skewed organisms. The second major axis generated by correspondence analysis on simple codon counts differentiates the genes into two distinct groups according to their hydrophobicity values, but the same analysis computed with relative synonymous codon usage values could not discriminate the genes according to the hydropathy values. This suggests that amino acid composition exerts constraints on codon usage in this organism. On the other hand the second major axis produced by correspondence analysis on RSCU values differentiates the genes into two groups according to the synonymous codon usage for cysteine residues (rarest amino acids in this organism), which is nothing but a artifactual effect induced by the RSCU values. Other factors such as length of the genes and the positions of the genes in the leading and lagging strand of replication have practically no influence in the codon usage variation among the genes in this organism.  相似文献   

20.
Summary Based on the rates of synonymous substitution in 42 protein-codin gene pairs from rat and human, a correlation is shown to exist between the frequency of the nucleotides in all positions of the codon and the synonymous substitution rate. The correlation coefficients were positive for A and T and negative for C and G. This means that AT-rich genes accumulate more synonymous substitutions than GC-rich genes. Biased patterns of mutation could not account for this phenomenon. Thus, the variation in synonymous substitution rates and the resulting unequal codon usage must be the consequence of selection against A and T in synonymous positions. Most of the varition in rates of synonymous substitution can be explained by the nucleotide composition in synonymous positions. Codon-anticodon interactions, dinucleotide frequencies, and contextual factors influence neither the rates of synonymous substitution nor codon usage. Interestingly, the nucleotide in the second position of codons (always a nonsynonymous position) was found to affect the rate of synonymous substitution. This finding links the rate of nonsynonymous substitution with the synonymous rate. Consequently, highly conservative proteins are expected to be encoded by genes that evolve slowly in terms of synonymous substitutions, and are consequently highly biased in their codon usage.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号