首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 109 毫秒
1.
ABSTRACT: BACKGROUND: The evolution and genomic stop codon frequencies have not been rigorously studied with the exception of coding of non-canonical amino acids. Here we study the rate of evolution and frequency distribution of stop codons in bacterial genomes. RESULTS: We show that in bacteria stop codons evolve slower than synonymous sites, suggesting the action of weak negative selection. However, the frequency of stop codons relative to genomic nucleotide content indicated that this selection regime is not straightforward. The frequency of TAA and TGA stop codons is GC-content dependent, with TAA decreasing and TGA increasing with GC-content, while TAG frequency is independent of GC-content. Applying a formal, analytical model to these data we found that the relationship between stop codon frequencies and nucleotide content cannot be explained by mutational biases or selection on nucleotide content. However, with weak nucleotide content-dependent selection on TAG, -0.5 < Nes < 1.5, the model fits all of the data and recapitulates the relationship between TAG and nucleotide content. For biologically plausible rates of mutations we show that, in bacteria, TAG stop codon is universally associated with lower fitness, with TAA being the optimal for G-content < 16% while for G-content > 16% TGA has a higher fitness than TAG. CONCLUSIONS: Our data indicate that TAG codon is universally suboptimal in the bacterial lineage, such that TAA is likely to be the preferred stop codon for low GC content while the TGA is the preferred stop codon for high GC content. The optimization of stop codon usage may therefore be useful in genome engineering or gene expression optimization applications.ReviewersThis article was reviewed by Michail Gelfand, Arcady Mushegian and Shamil Sunyaev. For the full reviews, please go to the Reviewers' Comments section.  相似文献   

2.
In bacteria, synonymous codon usage can be considerably affected by base composition at neighboring sites. Such context-dependent biases may be caused by either selection against specific nucleotide motifs or context-dependent mutation biases. Here we consider the evolutionary conservation of context-dependent codon bias across 11 completely sequenced bacterial genomes. In particular, we focus on two contextual biases previously identified in Escherichia coli; the avoidance of out-of-frame stop codons and AGG motifs. By identifying homologues of E. coli genes, we also investigate the effect of gene expression level in Haemophilus influenzae and Mycoplasma genitalium. We find that while context-dependent codon biases are widespread in bacteria, few are conserved across all species considered. Avoidance of out-of-frame stop codons does not apply to all stop codons or amino acids in E. coli, does not hold for different species, does not increase with gene expression level, and is not relaxed in Mycoplasma spp., in which the canonical stop codon, TGA, is recognized as tryptophan. Avoidance of AGG motifs shows some evolutionary conservation and increases with gene expression level in E. coli, suggestive of the action of selection, but the cause of the bias differs between species. These results demonstrate that strong context-dependent forces, both selective and mutational, operate on synonymous codon usage but that these differ considerably between genomes. Received: 6 May 1999 / Accepted: 29 October 1999  相似文献   

3.
Role of premature stop codons in bacterial evolution   总被引:1,自引:0,他引:1  
When the stop codons TGA, TAA, and TAG are found in the second and third reading frames of a protein-encoding gene, they are considered premature stop codons (PSC). Deinococcus radiodurans disproportionately favored TGA more than the other two triplets as a PSC. The TGA triplet was also found more often in noncoding regions and as a stop codon, though the bias was less pronounced. We investigated this phenomenon in 72 bacterial species with widely differing chromosomal GC contents. Although TGA and TAG were compositionally similar, we found a great variation in use of TGA but a very limited range of use of TAG. The frequency of use of TGA in the gene sequences generally increased with the GC content of the chromosome, while the frequency of use of TAG, like that of TAA, was inversely proportional to the GC content of the chromosome. The patterns of use of TAA, TGA and TAG as real stop codons were less biased and less influenced by the GC content of the chromosome. Bacteria with higher chromosomal GC contents often contained fewer PSC trimers in their genes. Phylogenetically related bacteria often exhibited similar PSC ratios. In addition, metabolically versatile bacteria have significantly fewer PSC trimers in their genes. The bias toward TGA but against TAG as a PSC could not be explained either by the preferential usage of specific codons or by the GC contents of individual chromosomes. We proposed that the quantity and the quality of the PSC in the genome might be important in bacterial evolution.  相似文献   

4.
It is shown that synonymous codon usage is less biased in favor of those codons preferred by highly expressed genes at the end ofEscherichia coli genes than in the middle. This appears to be due to the close proximity of manyE. coli genes. It is shown that a substantial number of genes overlap either the Shine-Dalgarno sequence or the coding sequence of the next gene on the chromosome and that the codons that overlap have lower synonymous codon bias than those which do not. It is also shown that there is an increase in the frequency of A-ending codons, and a decrease in the frequency of G-ending codons at the end ofE. coli genes that lie close to another gene. It is suggested that these trends in composition could be associated with selection against the formation of mRNA secondary structure near the start of the next gene on the chromosome. Stop codon use is also affected by the close proximity of genes; many genes are forced to use TGA and TAG stop codons because they terminate either within the Shine-Dalgarno or coding sequence of the next gene on the chromosome. The implications these results have for the evolution of synonymous codon use are discussed.  相似文献   

5.
The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GC→AT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAA→TGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA <-> TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAA→TGA rates in mammals are thus consistent with gBGC’s predicted ability to “drive” deleterious mutations and supports the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.

Is sequence conservation a sign of purifying selection and hence functional importance? This analysis of why mammals use and conserve the most error-prone stop codon suggests not, consistent with GC-biased gene conversion’s predicted ability to “drive” deleterious mutations and supporting the hypothesis that sequence conservation need not be indicative of purifying selection.  相似文献   

6.
alpha-L-Iduronidase is a glycosyl hydrolase involved in the sequential degradation of the glycosaminoglycans heparan sulphate and dermatan sulphate. A deficiency in alpha-L-iduronidase results in the lysosomal accumulation and urinary secretion of partially degraded glycosaminoglycans and is the cause of the lysosomal storage disorder mucopolysaccharidosis type I (MPS I; Hurler and Scheie syndromes; McKusick 25280). The premature stop codons Q70X and W402X are two of the most common alpha-l-iduronidase gene (IDUA) mutations accounting for up to 70% of MPS I disease alleles in some populations. Here, we have reported a new mutation, making a total of 15 different mutations that can cause premature IDUA stop codons and have investigated the biochemistry of these mutations. Natural stop codon read-through was dependent on the fidelity of the codon when evaluated at Q70X and W402X in CHO-K1 cells, but the three possible stop codons TAA, TAG and TGA, had different effects on mRNA stability and this effect was context dependent. In CHO-K1 cells expressing the Q70X and W402X mutations, the level of gentamicin-enhanced stop codon read-through was slightly less than the increment in activity caused by a lower fidelity stop codon. In this system, gentamicin had more effect on read-through for the TAA and TGA stop codons when compared to the TAG stop codon. In an MPS I patient study, premature TGA stop codons were associated with a slightly attenuated clinical phenotype, when compared to classical Hurler syndrome (e.g. W402X/W402X and Q70X/Q70X genotypes with TAG stop codons). Natural read-through of premature stop codons is a potential explanation for variable clinical phenotype in MPS I patients. Enhanced stop codon read-through is a potential treatment strategy for a large sub-group of MPS I patients.  相似文献   

7.
钟智  李宏 《生物物理学报》2008,24(5):379-392
以细菌和古菌基因组5′ UTR序列作为研究对象,分析在5′ UTR 的3个不同阅读框架中三联体AUG的分布,发现无论是细菌还是古菌基因组都在阅读框1中有非常明显的AUG缺失(depletion)。AUG的缺失表明在起始密码子上游的AUG很可能会对基因的翻译起始产生影响。分析得知:绝大部分的AUG都是以uORF(upstream open reading frame)的形式出现的,uAUG(upstream AUG)的数量很少,特别是在阅读框1中,而且在细菌基因组的阅读框1中uAUG较多地出现在了含有SD序列的基因上游。比较发现,uAUG引导的序列在同义密码子使用上的偏好性较真正的编码序列差,这可能表明细菌和古菌在同义密码子使用上的偏好性也是决定基因准确地翻译起始的重要因素之一。  相似文献   

8.
Evolution and differentiation of MSHR gene in different species   总被引:1,自引:0,他引:1  
Coat color offers some prospects for evolutionary studies due to its large amount of presumably adaptive coat color variation and conserved genetic mechanisms of generating different coat colors in different species. Melanocyte-stimulating hormone receptor (MSHR) gene is responsible for intraspecific and interspecific color variation in mammals and birds. A total number of 206 MSHR gene sequences belonging to 84 species, 58 genera, and 20 families were analyzed to investigate its evolution and differentiation in different species. Most of the species have 954 bp and stop codon TGA. Species in Callithrix and Callimico have a stop codon mutation from TGA to TGG and elongate 81 bp with TAG as stop codon. Species in Phasianidae, Fringillidae, and Lemuridae also use TAG as stop codon. The Sus scrofa had an insertion of AACCAGACC encoding Asn-Gln-Thr from 85 to 93 bp. In Bovidae, a brown strain of cow with 966 bp due to the 12-bp duplication of GGCATTGCCCGG from 670 to 681 bp encoding for Gly-Ile-Ala-Arg was found. Teiidae has the smallest number of total mutations (6), silent mutations (3), nonsynonymous mutations (3), average number of nucleotide differences (1.519), synonymous nucleotide diversity (pi(s) = 0.0030), and nonsynonymous nucleotide diversity (pi(a) = 0.0029), and Hominidae, Lemuridae, Canidae, and Teiidae have higher ratio of pi(a)/pi(s) (0.537-0.973). The reconstructed phylogenetic tree of MSHR gene of families is basically consistent with the taxonomy of National Center for Biotechnology Information.  相似文献   

9.
It is well known that stop codons play a critical role in the process of protein synthesis. However, little effort has been made to investigate whether stop codon usage exhibits biases, such as widely seen for synonymous codon usage. Here we systematically investigate stop codon usage bias in various eukaryotes as well as its relationships with its context, GC3 content, gene expression level, and secondary structure. The results show that there is a strong bias for stop codon usage in different eukaryotes, i.e., UAA is overrepresented in the lower eukaryotes, UGA is overrepresented in the higher eukaryotes, and UAG is least used in all eukaryotes. Different conserved patterns for each stop codon in different eukaryotic classes are found based on information content and logo analysis. GC3 contents increase with increasing complexity of organisms. Secondary structure prediction revealed that UAA is generally associated with loop structures, whereas UGA is more uniformly present in loop and stem structures, i.e., UGA is less biased toward having a particular structure. The stop codon usage bias, however, shows no significant relationship with GC3 content and gene expression level in individual eukaryotes. The results indicate that genomic complexity and GC3 content might contribute to stop codon usage bias in different eukaryotes. Our results indicate that stop codons, like synonymous codons, exhibit biases in usage. Additional work will be needed to understand the causes of these biases and their relationship to the mechanism of protein termination. [Reviewing Editor: Dr. Manyuan Long]  相似文献   

10.
Bacterial release factor RF2 promotes termination of protein synthesis, specifically recognizing stop codons UAA or UGA. The crystal structure of Escherichia coli RF2 has been determined to a resolution of 1.8 A. RF2 is structurally distinct from its eukaryotic counterpart eRF1. The tripeptide SPF motif, thought to confer RF2 stop codon specificity, and the universally conserved GGQ motif, proposed to be involved with the peptidyl transferase center, are exposed in loops only 23 A apart, and the structure suggests that stop signal recognition is more complex than generally believed.  相似文献   

11.
Suzuki H  Saito R  Tomita M 《FEBS letters》2005,579(28):6499-6504
Multivariate analyses are often used to identify major trends of variation in synonymous codon usage among genes. These analyses need to be performed on properly normalized codon usage data to avoid biases masking this synonymous variation, i.e., gene length, amino acid usage, and codon degeneracy; however, previous studies have failed to do so. In this paper, we demonstrate that the use of alternative normalized data (called 'relative adaptiveness' in the literature) can avoid all these biases and furthermore, can identify more trends of variation among genes, including GC-ending codon usage, GT-ending codon usage, and gene expression level.  相似文献   

12.
During the evolution of living organisms, a natural selection event occurs toward the optimization of their genomes regarding the usage of codons. During this process which is known as codon bias, a set of preferred codons is naturally defined in the genome of a given organism, since there are 61 possible codons (plus 3 stop codons) to 20 amino acids. Such event leads to optimization of metabolic cellular processes such as translational efficiency, RNA stability and energy saving. Although we know why, we do not know how exactly a set of preferred codons for each amino acid is defined for a given genome considering that the usage frequency of each synonymous codons is peculiar to each organism. In order to help answering this question, we analyzed the usage frequency of codons which are similar to stop codons, since a minor mutation on these codons may lead to a stop codon into the open reading frame compromising the protein expression as a result. We found a reduced use of those codons in Xanthomomas axonopodis pv. citri which presents an optimized genome regarding codon usage. On the other hand, such codons are more often used in Xylella fastidiosa, which does not seem to have established codon preferences as previously shown. Our results support that a set of preferred codons is not randomly selected and propose new ideas to the field warranting further experiments in this regard.  相似文献   

13.
Buchnera, the bacterial endosymbionts of aphids, undergo severe population bottlenecks during maternal transmission through their hosts. Previous studies suggest an increased effect of drift within these strictly asexual, small populations, resulting in an increased fixation of slightly deleterious mutations. This study further explores sequence evolution in Buchnera using three approaches. First, patterns of codon usage were compared across several homologous Escherichia coli and Buchnera loci, in order to test the prediction that selection for the use of optimal codons is less effective in small populations. A chi 2-based measure of codon bias was developed to adjust for the overall A + T richness of silent positions in the endosymbionts. In contrast to E. coli homologues, adaptive codon bias across Buchnera loci is markedly low, and patterns of codon usage lack a strong relationship with gene expression level. These data suggest that codon usage in Buchnera has been shaped largely by mutational pressure and drift rather than by selection for translational efficiency. One exception to the overall lack of bias is groEL, which is known to be constitutively overexpressed in Buchnera and other endosymbionts. Second, relative-rate tests show elevated rates of sequence evolution of numerous protein-coding loci across Buchnera, compared to E. coli. Finally, consistently higher ratios of nonsynonymous to synonymous substitutions in Buchnera loci relative to the enteric bacteria strongly suggest the accumulation of nonsynonymous substitutions in endosymbiont lineages. Combined, these results suggest a decreased effectiveness of purifying selection in purging endosymbiont populations of slightly deleterious mutations, particularly those affecting codon usage and amino acid identity.  相似文献   

14.
In this study, we analysed synonymous codon usage in Shigella flexneri 2a strain 301 (Sf301) and performed a comparative analysis of synonymous codon usage patterns in Sf301 and other strains of Shigella and Escherichia coli. Although there was a significant variety in codon usage bias among different Sf301 genes, there was a slight but observable codon usage bias that could primarily be attributable to mutational pressure and translational selection. In addition, the relative abundance of dinucleotides in Sf301 was observed to be independent of the overall base composition but was still caused by differential mutational pressure; this also shaped codon usage. By comparing the relative synonymous codon usage values across different Shigella and E. coli strains, we suggested that the synonymous codon usage pattern in the Shigella genomes was strain specific. This study represents a comprehensive analysis of Shigella codon usage patterns and provides a basic understanding of the mechanisms underlying codon usage bias.  相似文献   

15.
Many organisms exhibit biased codon usage in their genome, including the fungal model organism Neurospora crassa. The preferential use of subset of synonymous codons (optimal codons) at the macroevolutionary level is believed to result from a history of selection to promote translational efficiency. At present, few data are available about selection on optimal codons at the microevolutionary scale, that is, at the population level. Herein, we conducted a large-scale assessment of codon mutations at biallelic sites, spanning more than 5,100 genes, in 2 distinct populations of N. crassa: the Caribbean and Louisiana populations. Based on analysis of the frequency spectra of synonymous codon mutations at biallelic sites, we found that derived (nonancestral) optimal codon mutations segregate at a higher frequency than derived nonoptimal codon mutations in each population; this is consistent with natural selection favoring optimal codons. We also report that optimal codon variants were less frequent in longer genes and that the fixation of optimal codons was reduced in rapidly evolving long genes/proteins, trends suggestive of genetic hitchhiking (Hill-Robertson) altering codon usage variation. Notably, nonsynonymous codon mutations segregated at a lower frequency than synonymous nonoptimal codon mutations (which impair translational efficiency) in each N. crassa population, suggesting that changes in protein composition are more detrimental to fitness than mutations altering translation. Overall, the present data demonstrate that selection, and partly genetic interference, shapes codon variation across the genome in N. crassa populations.  相似文献   

16.
The two codon-specific eubacterial release factors (RF1: UAA/UAG and RF2: UAA/UGA) have specific tripeptide motifs (PXT/SPF) within an exposed recognition loop shown in recent structures to interact with stop codons during protein synthesis termination. The motifs have been inferred to be critical for codon specificity, but this study shows that they are insufficient to determine specificity alone. Swapping the motifs or the entire loop between factors resulted in a loss of codon recognition rather than a switch of codon specificity. From a study of chimeric eubacterial RF1/RF2 recognition loops and an atypical shorter variant in Caenorhabditis elegans mitochondrial RF1 that lacks the classical tripeptide motif PXT, key determinants throughout the whole loop have been defined. It reveals that more than one configuration of the recognition loop based on specific sequence and size can achieve the same desired codon specificity. This study has provided unexpected insight into why a combination of the two factors is necessary in eubacteria to exclude recognition of UGG as stop.  相似文献   

17.
Translation termination is catalyzed by release factors that recognize stop codons. However, previous works have shown that in some bacteria, the termination process also involves bases around stop codons. Recently, Ito et al. analyzed release factors and identified the amino acids therein that recognize stop codons. However, the amino acids that recognize bases around stop codons remain unclear. To identify the candidate amino acids that recognize the bases around stop codons, we aligned the protein sequences of the release factors of various bacteria and searched for amino acids that were conserved specifically in the sequence of bacteria that seemed to regulate translation termination by bases around stop codons. As a result, species having several highly conserved residues in RF1 and RF2 showed positive correlations between their codon usage bias and conservation of the bases around the stop codons. In addition, some of the residues were located very close to the SPF motif, which deciphers stop codons. These results suggest that these conserved amino acids enable the release factors to recognize the bases around the stop codons. Present address (Y. Ozawa): Tokyo Research Laboratory, IBM Japan, Ltd., 1623-14 Shimotsuruma, Yamato-shi, Kanagawa 242-8502, Japan  相似文献   

18.
Unequal use of synonymous codons has been found in several prokaryotic and eukaryotic genomes. This bias has been associated with translational efficiency. The prevalence of this bias across lineages is currently unknown. Here, a new method (GCB) to measure codon usage bias is presented. It uses an iterative approach for the determination of codon scores and allows the computation of an index of codon bias suitable for interspecies comparison. A server to calculate GCB-values of individual genes as well as a list of compiled results are available at . The method was applied to complete bacterial genomes. The relation of codon usage bias with amino acid composition and the choice of stop codons were determined and discussed.  相似文献   

19.
The 'effective number of codons' used in a gene   总被引:64,自引:0,他引:64  
F Wright 《Gene》1990,87(1):23-29
A simple measure is presented that quantifies how far the codon usage of a gene departs from equal usage of synonymous codons. This measure of synonymous codon usage bias, the 'effective number of codons used in a gene', Nc, can be easily calculated from codon usage data alone, and is independent of gene length and amino acid (aa) composition. Nc can take values from 20, in the case of extreme bias where one codon is exclusively used for each aa, to 61 when the use of alternative synonymous codons is equally likely. Nc thus provides an intuitively meaningful measure of the extent of codon preference in a gene. Codon usage patterns across genes can be investigated by the Nc-plot: a plot of Nc vs. G + C content at synonymous sites. Nc-plots are produced for Homo sapiens, Saccharomyces cerevisiae, Escherichia coli, Bacillus subtilis, Dictyostelium discoideum, and Drosophila melanogaster. A FORTRAN77 program written to calculate Nc is available on request.  相似文献   

20.
The Selective Advantage of Synonymous Codon Usage Bias in Salmonella   总被引:1,自引:0,他引:1  
The genetic code in mRNA is redundant, with 61 sense codons translated into 20 different amino acids. Individual amino acids are encoded by up to six different codons but within codon families some are used more frequently than others. This phenomenon is referred to as synonymous codon usage bias. The genomes of free-living unicellular organisms such as bacteria have an extreme codon usage bias and the degree of bias differs between genes within the same genome. The strong positive correlation between codon usage bias and gene expression levels in many microorganisms is attributed to selection for translational efficiency. However, this putative selective advantage has never been measured in bacteria and theoretical estimates vary widely. By systematically exchanging optimal codons for synonymous codons in the tuf genes we quantified the selective advantage of biased codon usage in highly expressed genes to be in the range 0.2–4.2 x 10−4 per codon per generation. These data quantify for the first time the potential for selection on synonymous codon choice to drive genome-wide sequence evolution in bacteria, and in particular to optimize the sequences of highly expressed genes. This quantification may have predictive applications in the design of synthetic genes and for heterologous gene expression in biotechnology.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号