首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 593 毫秒
1.
The number of completely sequenced archaeal genomes has been sufficient for a large-scale bioinformatic study.We have conducted analyses for each coding region from 36 archaeal genomes using the original CGS algorithm by calculating the total GC content(G+C),GC content in first,second and third codon positions as well as in fourfold and twofold degenerated sites from third codon positions,levels of arginine codon usage(Arg2:AGA/G;Arg4:CGX),levels of amino acid usage and the entropy of amino acid content distribution.In archaeal genomes with strong GC pressure,arginine is coded preferably by GC-rich Arg4 codons,whereas in most of archaeal genomes with G+C0.6,arginine is coded preferably by AT-rich Arg2 codons.In the genome of Haloquadratum walsbyi,which is closely related to GC-rich archaea,GC content has decreased mostly in third codon positions,while Arg4Arg2 bias still persists.Proteomes of archaeal species carry characteristic amino acid biases:levels of isoleucine and lysine are elevated,while levels of alanine,histidine,glutamine and cytosine are relatively decreased.Numerous genomic and proteomic biases observed can be explained by the hypothesis of previously existed strong mutational AT pressure in the common predecessor of all archaea.  相似文献   

2.
Palidwor GA  Perkins TJ  Xia X 《PloS one》2010,5(10):e13431

Background

In spite of extensive research on the effect of mutation and selection on codon usage, a general model of codon usage bias due to mutational bias has been lacking. Because most amino acids allow synonymous GC content changing substitutions in the third codon position, the overall GC bias of a genome or genomic region is highly correlated with GC3, a measure of third position GC content. For individual amino acids as well, G/C ending codons usage generally increases with increasing GC bias and decreases with increasing AT bias. Arginine and leucine, amino acids that allow GC-changing synonymous substitutions in the first and third codon positions, have codons which may be expected to show different usage patterns.

Principal Findings

In analyzing codon usage bias in hundreds of prokaryotic and plant genomes and in human genes, we find that two G-ending codons, AGG (arginine) and TTG (leucine), unlike all other G/C-ending codons, show overall usage that decreases with increasing GC bias, contrary to the usual expectation that G/C-ending codon usage should increase with increasing genomic GC bias. Moreover, the usage of some codons appears nonlinear, even nonmonotone, as a function of GC bias. To explain these observations, we propose a continuous-time Markov chain model of GC-biased synonymous substitution. This model correctly predicts the qualitative usage patterns of all codons, including nonlinear codon usage in isoleucine, arginine and leucine. The model accounts for 72%, 64% and 52% of the observed variability of codon usage in prokaryotes, plants and human respectively. When codons are grouped based on common GC content, 87%, 80% and 68% of the variation in usage is explained for prokaryotes, plants and human respectively.

Conclusions

The model clarifies the sometimes-counterintuitive effects that GC mutational bias can have on codon usage, quantifies the influence of GC mutational bias and provides a natural null model relative to which other influences on codon bias may be measured.  相似文献   

3.
Phytophthora is a genus entirely comprised of destructive plant pathogens. It belongs to the Stramenopila, a unique branch of eukaryotes, phylogenetically distinct from plants, animals, or fungi. Phytophthora genes show a strong preference for usage of codons ending with G or C (high GC3). The presence of high GC3 in genes can be utilized to differentiate coding regions from noncoding regions in the genome. We found that both selective pressure and mutation bias drive codon bias in Phytophthora. Indicative for selection pressure is the higher GC3 value of highly expressed genes in different Phytophthora species. Lineage specific GC increase of noncoding regions is reminiscent of whole-genome mutation bias, whereas the elevated Phytophthora GC3 is primarily a result of translation efficiency-driven selection. Heterogeneous retrotransposons exist in Phytophthora genomes and many of them vary in their GC content. Interestingly, the most widespread groups of retroelements in Phytophthora show high GC3 and a codon bias that is similar to host genes. Apparently, selection pressure has been exerted on the retroelement’s codon usage, and such mimicry of host codon bias might be beneficial for the propagation of retrotransposons. Reviewing Editor: Dr. Yves van de Peer  相似文献   

4.
Patterns of codon usage bias in three dicot and four monocot plant species   总被引:9,自引:0,他引:9  
Codon usage in nuclear genes of four monocot and three dicot species was analyzed to find general patterns in codon choice of plant species. Codon bias was correlated with GC content at the third codon position. GC contents were higher in monocot species than in dicot species at all codon positions. The high GC contents of monocot species might be the result of relatively strong mutational bias that occurred in the lineage of the Poaceae species. In both dicot and monocot species, the effective number of codons (ENCs) for most genes was similar to that for the expected ENCs based on the GC content at the third codon positions. G and C ending codons were detected as the "preferred" codons in monocot species, as in Drosophila. Also, many "preferred" codons are the same in dicot species. Pyrimidine (C and T) is used more frequently than purine (G and A) in four-fold degenerate codon groups.  相似文献   

5.
Analysis of synonymous codon usage pattern in the genome of a thermophilic cyanobacterium, Thermosynechococcus elongatus BP-1 using multivariate statistical analysis revealed a single major explanatory axis accounting for codon usage variation in the organism. This axis is correlated with the GC content at third base of synonymous codons (GC3s) in correspondence analysis taking T. elongatus genes. A negative correlation was observed between effective number of codons i.e. Nc and GC3s. Results suggested a mutational bias as the major factor in shaping codon usage in this cyanobacterium. In comparison to the lowly expressed genes, highly expressed genes of this organism possess significantly higher proportion of pyrimidine-ending codons suggesting that besides, mutational bias, translational selection also influenced codon usage variation in T. elongatus. Correspondence analysis of relative synonymous codon usage (RSCU) with A, T, G, C at third positions (A3s, T3s, G3s, C3s, respectively) also supported this fact and expression levels of genes and gene length also influenced codon usage. A role of translational accuracy was identified in dictating the codon usage variation of this genome. Results indicated that although mutational bias is the major factor in shaping codon usage in T. elongatus, factors like translational selection, translational accuracy and gene expression level also influenced codon usage variation.  相似文献   

6.
Alternative synonymous codons are often used at unequal frequencies. Classically, studies of such codon usage bias (CUB) attempted to separate the impact of neutral from selective forces by assuming that deviations from a predicted neutral equilibrium capture selection. However, GC-biased gene conversion (gBGC) can also cause deviation from a neutral null. Alternatively, selection has been inferred from CUB in highly expressed genes, but the accuracy of this approach has not been extensively tested, and gBGC can interfere with such extrapolations (e.g., if expression and gene conversion rates covary). It is therefore critical to examine deviations from a mutational null in a species with no gBGC. To achieve this goal, we implement such an analysis in the highly AT rich genome of Dictyostelium discoideum, where we find no evidence of gBGC. We infer neutral CUB under mutational equilibrium to quantify “adaptive codon preference,” a nontautologous genome wide quantitative measure of the relative selection strength driving CUB. We observe signatures of purifying selection consistent with selection favoring adaptive codon preference. Preferred codons are not GC rich, underscoring the independence from gBGC. Expression-associated “preference” largely matches adaptive codon preference but does not wholly capture the influence of selection shaping patterns across all genes, suggesting selective constraints associated specifically with high expression. We observe patterns consistent with effects on mRNA translation and stability shaping adaptive codon preference. Thus, our approach to quantifying adaptive codon preference provides a framework for inferring the sources of selection that shape CUB across different contexts within the genome.  相似文献   

7.
8.
以普通野生稻(Oryza rufipogon Griff.)线粒体基因组为对象,分析其蛋白质编码基因的密码子使用特征及与亚洲栽培稻(O. sativa L.)的差异,探讨其密码子偏性形成的影响因素和进化过程。结果显示:普通野生稻线粒体基因组编码序列第1、第2和第3位碱基的GC含量依次为49.18%、42.67%和40.86%;有效密码子数(Nc)分布于45.32~61.00之间,其密码子偏性较弱; Nc值仅与GC_3呈显著相关,密码子第3位的碱基组成对密码子偏性影响较大;第1向量轴上显示9.91%的差异,其与GC3s、Nc、密码子偏好指数(CBI)和最优密码子使用频率(Fop)的相关性均达到显著水平;而GC_3和GC12的相关性未达到显著水平。因此,普通野生稻线粒体基因组密码子的使用偏性主要受自然选择压力影响而形成。本研究确定了21个普通野生稻线粒体基因组的最优密码子,大多以A或T结尾,与叶绿体密码子具有趋同进化,但是与核基因组具有不同的偏好性。同义密码子相对使用度(RSCU)、PR2偏倚分析和中性绘图分析显示,普通野生稻线粒体基因功能和其密码子使用密切相关,且线粒体密码子使用在普通野生稻、粳稻(O. sativa L. subsp. japonica Kato)和籼稻(O. sativa L. subsp.indica Kato)内具有同质性。  相似文献   

9.
A detailed comparison was made of codon usage of chloroplast genes with their host (nuclear) genes in the four angiosperm speciesOryza sativa, Zea mays, Triticum aestivum andArabidopsis thaliana. The average GC content of the entire genes, and at the three codon positions individually, was higher in nuclear than in chloroplast genes, suggesting different genomic organization and mutation pressures in nuclear and chloroplast genes. The results of Nc-plots and neutrality plots suggested that nucleotide compositional constraint had a large contribution to codon usage bias of nuclear genes inO. sativa, Z. mays, andT. aestivum, whereas natural selection was likely to be playing a large role in codon usage bias in chloroplast genomes. Correspondence analysis and chi-test showed that regardless of the genomic environment (species) of the host, the codon usage pattern of chloroplast genes differed from nuclear genes of their host species by their AU-richness. All the chloroplast genomes have predominantly A- and/or U-ending codons, whereas nuclear genomes have G-, C- or U-ending codons as their optimal codons. These findings suggest that the chloroplast genome might display particular characteristics of codon usage that are different from its host nuclear genome. However, one feature common to both chloroplast and nuclear genomes in this study was that pyrimidines were found more frequently than purines at the synonymous codon position of optimal codons.  相似文献   

10.
To study the possible codon usage and base composition variation in the bacteriophages, fourteen mycobacteriophages were used as a model system here and both the parameters in all these phages and their plating bacteria, M. smegmatis had been determined and compared. As all the organisms are GC-rich, the GC contents at third codon positions were found in fact higher than the second codon positions as well as the first + second codon positions in all the organisms indicating that directional mutational pressure is strongly operative at the synonymous third codon positions. Nc plot indicates that codon usage variation in all these organisms are governed by the forces other than compositional constraints. Correspondence analysis suggests that: (i) there are codon usage variation among the genes and genomes of the fourteen mycobacteriophages and M. smegmatis, i.e., codon usage patterns in the mycobacteriophages is phage-specific but not the M. smegmatis-specific; (ii) synonymous codon usage patterns of Barnyard, Che8, Che9d, and Omega are more similar than the rest mycobacteriophages and M. smegmatis; (iii) codon usage bias in the mycobacteriophages are mainly determined by mutational pressure; and (iv) the genes of comparatively GC rich genomes are more biased than the GC poor genomes. Translational selection in determining the codon usage variation in highly expressed genes can be invoked from the predominant occurrences of C ending codons in the highly expressed genes. Cluster analysis based on codon usage data also shows that there are two distinct branches for the fourteen mycobacteriophages and there is codon usage variation even among the phages of each branch.  相似文献   

11.
The phenomenon of codon usage bias is known to exist in many genomes and it is mainly determined by mutation and selection. To understand the patterns of codon usage in nemertean mitochondrial genomes, we use bioinformatic approaches to analyze the protein-coding sequences of eight nemertean species. Neutrality analysis did not find a significant correlation between GC12 and GC3. ENc-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENc values are below it. ENc-plot suggested that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally and we propose that codons containing A or U at third position are used preferentially in nemertean species, regardless of whether corresponding tRNAs are encoded in the mitochondrial DNA. Context-dependent analysis indicated that the nucleotide at the second codon position slightly affects synonymous codon choices. These results suggested that mutational and selection forces are probably acting to codon usage bias in nemertean mitochondrial genomes.  相似文献   

12.
13.
Codon usage in Clonorchis sinensis was analyzed using 12,515 codons from 38 coding sequences. Total GC content was 49.83%, and GC1, GC2 and GC3 contents were 56.32%, 43.15% and 50.00%, respectively. The effective number of codons converged at 51-53 codons. When plotted against total GC content or GC3, codon usage was distributed in relation to GC3 biases. Relative synonymous codon usage for each codon revealed a single major trend, which was highly correlated with GC content at the third position when codons began with A or U at the first two positions. In codons beginning with G or C base at the first two positions, the G or C base rarely occurred at the third position. These results suggest that codon usage is shaped by a bias towards G or C at the third base, and that this is affected by the first and second bases.  相似文献   

14.
Rao Y  Wu G  Wang Z  Chai X  Nie Q  Zhang X 《DNA research》2011,18(6):499-512
Synonymous codons are used with different frequencies both among species and among genes within the same genome and are controlled by neutral processes (such as mutation and drift) as well as by selection. Up to now, a systematic examination of the codon usage for the chicken genome has not been performed. Here, we carried out a whole genome analysis of the chicken genome by the use of the relative synonymous codon usage (RSCU) method and identified 11 putative optimal codons, all of them ending with uracil (U), which is significantly departing from the pattern observed in other eukaryotes. Optimal codons in the chicken genome are most likely the ones corresponding to highly expressed transfer RNA (tRNAs) or tRNA gene copy numbers in the cell. Codon bias, measured as the frequency of optimal codons (Fop), is negatively correlated with the G + C content, recombination rate, but positively correlated with gene expression, protein length, gene length and intron length. The positive correlation between codon bias and protein, gene and intron length is quite different from other multi-cellular organism, as this trend has been only found in unicellular organisms. Our data displayed that regional G + C content explains a large proportion of the variance of codon bias in chicken. Stepwise selection model analyses indicate that G + C content of coding sequence is the most important factor for codon bias. It appears that variation in the G + C content of CDSs accounts for over 60% of the variation of codon bias. This study suggests that both mutation bias and selection contribute to codon bias. However, mutation bias is the driving force of the codon usage in the Gallus gallus genome. Our data also provide evidence that the negative correlation between codon bias and recombination rates in G. gallus is determined mostly by recombination-dependent mutational patterns.  相似文献   

15.

Background

While effective population size (Ne) and life history traits such as generation time are known to impact substitution rates, their potential effects on base composition evolution are less well understood. GC content increases with decreasing body mass in mammals, consistent with recombination-associated GC biased gene conversion (gBGC) more strongly impacting these lineages. However, shifts in chromosomal architecture and recombination landscapes between species may complicate the interpretation of these results. In birds, interchromosomal rearrangements are rare and the recombination landscape is conserved, suggesting that this group is well suited to assess the impact of life history on base composition.

Results

Employing data from 45 newly and 3 previously sequenced avian genomes covering a broad range of taxa, we found that lineages with large populations and short generations exhibit higher GC content. The effect extends to both coding and non-coding sites, indicating that it is not due to selection on codon usage. Consistent with recombination driving base composition, GC content and heterogeneity were positively correlated with the rate of recombination. Moreover, we observed ongoing increases in GC in the majority of lineages.

Conclusions

Our results provide evidence that gBGC may drive patterns of nucleotide composition in avian genomes and are consistent with more effective gBGC in large populations and a greater number of meioses per unit time; that is, a shorter generation time. Thus, in accord with theoretical predictions, base composition evolution is substantially modulated by species life history.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0549-1) contains supplementary material, which is available to authorized users.  相似文献   

16.
Analysis of synonymous codon usage bias in Chlamydia   总被引:9,自引:0,他引:9  
Chlamydiae are obligate intracellular bacterial pathogens that cause ocular and sexuallytransmitted diseases,and are associated with cardiovascular diseases.The analysis of codon usage mayimprove our understanding of the evolution and pathogenesis of Chlamydia and allow reengineering of targetgenes to improve their expression for gene therapy.Here,we analyzed the codon usage of C.muridarum,C.trachomatis(here indicating biovar trachoma and LGV),C.pneumoniae,and C.psittaci using the codonusage database and the CUSP(Create a codon usage table)program of EMBOSS(The European MolecularBiology Open Software Suite).The results show that the four genomes have similar codon usage patterns,with a strong bias towards the codons with A and T at the third codon position.Compared with Homosapiens,the four chlamydial species show discordant seven or eight preferred codons.The ENC(effectivenumber of codons used in a gene)-plot reveals that the genetic heterogeneity in Chlamydia is constrained bythe G+C content,while translational selection and gene length exert relatively weaker influences.Moreover,mutational pressure appears to be the major determinant of the codon usage variation among the chlamydialgenes.In addition,we compared the codon preferences of C.trachomatis with those of E.coli,yeast,adenovirus and Homo sapiens.There are 23 codons showing distinct usage differences between C.trachomatisand E.coli,24 between C.trachomatis and adenovirus,21 between C.trachomatis and Homo sapiens,butonly six codons between C.trachomatis and yeast.Therefore,the yeast system may be more suitable for theexpression of chlamydial genes.Finally,we compared the codon preferences of C.trachomatis with those ofsix eukaryotes,eight prokaryotes and 23 viruses.There is a strong positive correlation between the differ-ences in coding GC content and the variations in codon bias(r=0.905,P<0,001).We conclude that thevariation of codon bias between C.trachomatis and other organisms is much less influenced by phylogeneticlineage and primarily determined by the extent of disparities in GC content.  相似文献   

17.
Evolution of codon usage and base contents in kinetoplastid protozoans   总被引:2,自引:0,他引:2  
In this study we analyze and compare the trends in codon usage in five representative species of kinetoplastid protozoans (Crithidia fasciculata, Leishmania donovani, L. major, Trypanosoma cruzi and T. brucei), with the purpose of investigating the processes underlying these trends. A principal component analysis shows that the G+C content at the third codon position represents the main source of codon-usage variation, both within species (among genes) and among species. The non- Trypanosoma species exhibit narrow distributions in codon usage, while both Trypanosoma species present large within-species heterogeneity. The three non-Trypanosoma species have very similar codon-usage preferences. These codon preferences are also shared by the highly expressed genes of T. cruzi and to a lesser degree by those of T. brucei. This leads to the conclusion that the codon preferences shared by these species are the ancestral ones in the kinetoplastids. On the other hand, the study of noncoding sequences shows that Trypanosoma species exhibit mutational biases toward A + T richness, while the non- Trypanosoma species present mutational pressure in the opposite direction. These data taken together allow us to infer the origin of the different codon-usage distributions observed in the five species studied. In C. fasciculata and Leishmania, both mutational biases and (translational) selection pull toward G + C richness, resulting in a narrow distribution. In Trypanosoma species the mutational pressure toward A + T richness produced a shift in their genomes that differentially affected coding and noncoding sequences. The effect of these pressures on the third codon position of genes seems to have been inversely proportional to the level of gene expression.   相似文献   

18.
Among a sample of 39 Geodia cydonium (Demospongiae, Porifera) genes, with an average G + C content of 51.2%, extensive structural heterogeneity and considerable variations in synonymous codon usage were found. The G + C content of coding sequences and G + C content at silent codon positions (GC3S) varied from 42.4 to 59.2% and from 35.6 to 76.5%, respectively. Correspondence analysis of 39 genes revealed that putative highly expressed genes preferentially use a limited subset of codons, which were therefore defined as preferred codons in G. cydonium . A total of 22 preferred codons for 18 amino acids with synonyms in codons were identified and they all (with one exception) end with C or G. Among these codons there are also C- and G-ending codons which were previously identified as codons optimal for translation in a variety of eukaryotes, including metazoans and plants. The bias in synonymous codon usage in putative highly expressed G. cydonium genes is moderate, indicating that these genes are not shaped under strong natural selection. We postulate that the preference for C- and G-ending codons was already established in the ancestor of all Metazoa, including also sponges. This ancestor most probably also had a G + C rich genome. The selection toward C- and G-ending codons has been largely conserved throughout eukaryote evolution; exceptions are, for example, mammals for which strong mutational biases caused switches from that rule.  相似文献   

19.
The usage of alternative synonymous codons in the apicomplexan Cryptosporidium parvum has been investigated. A data set of 54 genes was analysed. Overall, A- and U-ending codons predominate, as expected in an A+T-rich genome. Two trends of codon usage variation among genes were identified using correspondence analysis. The primary trend is in the extent of usage of a subset of presumably translationally optimal codons, that are used at significantly higher frequencies in genes expected to be expressed at high levels. Fifteen of the 18 codons identified as optimal are more G+C-rich than the otherwise common codons, so that codon selection associated with translation opposes the general mutation bias. Among 40 genes with lower frequencies of these optimal codons, a secondary trend in G+C content was identified. In these genes, G+C content at synonymously variable third positions of codons is correlated with that in 5' and 3' flanking sequences, indicative of regional variation in G+C content, perhaps reflecting regional variation in mutational biases.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号