首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 39 毫秒
1.
To study the evolution of mutation biased synonymous codon usage, we examined nucleotide co-occurrence patterns in the Deinococcus radiodurans, D. geothermalis, and Thermus thermophilus genomes for nucleotide replacement dependent on the surrounding nucleotide context. Nucleotides on the third codon site were found to be strongly correlated with nucleotide sites at most six nucleotides away in all three species, where abundance patterns were dependent on whether two nucleotides share the same purine(R)/pyrimidine(Y) status. In the class Deinococci adjacent third site nucleotides were strongly correlated, where NNR|NNR and NNY|NNY codon pairs were overabundant while NNR|NNY and NNY|NNR codon pairs were underabundant. By far the largest deviations in all three species occur for NN(YR)|(YR)NN codon pairs. In the Thermus species, the NNY|YNN and NNR|RNN codon pairs were overabundant versus the underabundant NNY|RNN and NNR|YNN codon pairs, whereas in the Deinococcus species the opposite over-/underabundance relationship held for adjacent (GC) bases. We also observed a weaker overabundance of NNR|NRN and NNY|NYN codon pairs versus the underabundant NNR|NYN and NNY|NRN codon pairs. The perfect purine/pyrimidine symmetry of each of these cases, plus the lack of significant deviations for nucleotide pairs on other length scales up to 20 codons apart demonstrates that a pervasive pattern of nucleotide replacement dependent on local nucleotide context, and not codon bias, has occurred in these species. This nucleotide replacement has led to modified synonymous codon usage within the class Deinococci that affects which codons are positioned at particular codon sites dependent on the local nucleotide context.  相似文献   

2.
Biased codon usage is common in eukaryotic and prokaryotic genes. Evidence from Escherichia, Saccharomyces, and Drosophila indicates that it favors translational efficiency and accuracy. However, to date no functional advantages have been identified in the codon–anticodon interactions involving the most frequently used (preferred) codons. Here we present evidence that forces not related to the individual codon–anticodon interaction may be involved in determining which synonymous codons are preferred or avoided. We show that the ``off-frame' trinucleotide motif preferences inferrable from Drosophila coding regions are often in the same direction as Drosophila's ``in-frame' codon preferences, i.e., its codon usage. The off-frame preferences were inferred from the nonrandomness of the location of confamilial synonymous codons along coding regions—a pattern often described as a context dependence of nucleotide choice at synonymous positions or as codon-pair bias. We relied on randomizations of the location of confamilial codons that do not alter, and cannot be influenced by, the encoded amino acid sequences, codon usage, or base composition of the genes examined. The statistically significant congruency of in-frame and off-frame trinucleotide preferences suggests that the same kind of reading-frame-independent force(s) may also influence synonymous codon choice. These forces may have produced biases in codon usage that then led to the evolution of the translational advantages of these motifs as preferred codons. Under this scenario, tRNA pool size differences between preferred and nonpreferred codons initially were evolved to track the default overrepresentation of codons with preferred motifs. The motif preference hypothesis can explain the structuring of codon preferences and the similarities in the codon usages of distantly related organisms. Received: 10 November 1998 / Accepted: 23 February 1999  相似文献   

3.
To understand the synonymous codon usage pattern in mitochondrial genome of Antheraea assamensis, we analyzed the 13 mitochondrial protein‐coding genes of this species using a bioinformatic approach as no work was reported yet. The nucleotide composition analysis suggested that the percentages of A, T, G,and C were 33.73, 46.39, 9.7 and 10.17, respectively and the overall GC content was 19.86, that is, lower than 50% and the genes were AT rich. The mean effective number of codons of mitochondrial protein‐coding genes was 36.30 and it indicated low codon usage bias (CUB). Relative synonymous codon usage analysis suggested overrepresented and underrepresented codons in each gene and the pattern of codon usage was different among genes. Neutrality plot analysis revealed a narrow range of distribution for GC content at the third codon position and some points were diagonally distributed, suggesting both mutation pressure and natural selection influenced the CUB.  相似文献   

4.
On the rate of DNA sequence evolution inDrosophila   总被引:30,自引:0,他引:30  
Summary Analysis of the rate of nucleotide substitution at silent sites inDrosophila genes reveals three main points. First, the silent rate varies (by a factor of two) among nuclear genes; it is inversely related to the degree of codon usage bias, and so selection among synonymous codons appears to constrain the rate of silent substitution in some genes. Second, mitochondrial genes may have evolved only as fast as nuclear genes with weak codon usage bias (and two times faster than nuclear genes with high codon usage bias); this is quite different from the situation in mammals where mitochondrial genes evolve approximately 5–10 times faster than nuclear genes. Third, the absolute rate of substitution at silent sites in nuclear genes inDrosophila is about three times hihger than the average silent rate in mammals.  相似文献   

5.
Selection on Codon Usage for Error Minimization at the Protein Level   总被引:1,自引:0,他引:1  
Given the structure of the genetic code, synonymous codons differ in their capacity to minimize the effects of errors due to mutation or mistranslation. I suggest that this may lead, in protein-coding genes, to a preference for codons that minimize the impact of errors at the protein level. I develop a theoretical measure of error minimization for each codon, based on amino acid similarity. This measure is used to calculate the degree of error minimization for 82 genes of Drosophila melanogaster and 432 rodent genes and to study its relationship with CG content, the degree of codon usage bias, and the rate of nucleotide substitution. I show that (i) Drosophila and rodent genes tend to prefer codons that minimize errors; (ii) this cannot be merely the effect of mutation bias; (iii) the degree of error minimization is correlated with the degree of codon usage bias; (iv) the amino acids that contribute more to codon usage bias are the ones for which synonymous codons differ more in the capacity to minimize errors; and (v) the degree of error minimization is correlated with the rate of nonsynonymous substitution. These results suggest that natural selection for error minimization at the protein level plays a role in the evolution of coding sequences in Drosophila and rodents.Reviewing Editor: Dr. Massimo Di Giulio  相似文献   

6.
为分析栽培大豆和野生大豆线粒体基因组的密码子使用特征差异,该文以其线粒体基因组编码序列为研究对象,比较其密码子偏性形成的影响因素和演化过程。结果表明:(1)栽培大豆和野生大豆线粒体基因组编码区的GC含量分别为44.56%和44.58%,说明栽培大豆和野生大豆线粒体编码基因均富含A/T碱基。(2)栽培大豆和野生大豆线粒体基因组密码子第1位、第2位GC含量平均值与第3位GC含量的相关性均呈极显著水平,说明突变在其密码子偏性形成中的作用不可忽略; PR2-plot分析显示,在同义密码子第3位碱基的使用频率上,嘌呤低于嘧啶; Nc-plot分析中Nc比值位于-0.1~0.2区间的基因数占总基因数的95%以上;突变和选择等多重因素共同作用影响了大豆线粒体基因组编码序列密码子使用偏性的形成。(3)有20、21个密码子分别被确定为栽培大豆和野生大豆线粒体基因组编码序列的最优密码子,其中除丝氨酸TCC密码子外均以A或T结尾。综上结果认为,栽培大豆线粒体密码子偏性的形成受选择的影响要高于野生大豆,这可能是栽培大豆由野生大豆经长期人工栽培驯化的结果。  相似文献   

7.
Complimentary DNA sequence data of Φ × 174, fd, f1, G4, Ml3, MS2, λ and T7 phages ofEscherichia coli are analysed at mono-, di-, tri- and tetranucleotide levels. Our analysis shows that, (i) mononucleotides have certain preferences to occur at specific positions X1, X2, X3 of codon, (ii) These nucleotides interact nonlinearly to form dinucleotide and this dinucleotide also interacts nonlinearely with a third nucleotide to form codon, (iii) However, nonlinear interactions are negligible at tetranucleotide level suggesting that, coding regions of complimentary DNA are Markov chains of order two. Trinucleotide potential values in three frames have suggested that, at least thirteen different trinucleotides can be used as a marker to locate coding regions in DNA of prokaryotes. (iv) Parallel paired codons are expressed in such a way that one of the codons in the pair expresses with high frequency while the other with low frequency. On the other hand the complimentary codon pairs express with small frequency difference, (v) In the synonymous codon groups, codon ending with T are found to express with more frequency  相似文献   

8.
A novel bias in codon third-letter usage was found in Escherichia coli genes with low fractions of "optimal codons", by comparing intact sequences with control random sequences. Third-letter usage has been found to be biased according to preference in codon usage and to doublet preference from the following first letter. The present study examines third-letter usage in the context of the nucleotide sequence when these preferences are considered. In order to exclude any influence by these factors, the random sequences were generated such that the amino acid sequence, codon usage, and the doublet frequency in each gene were all preserved. Comparison of intact sequences with these randomly generated sequences reveals that third letters of codons show a strong preference for the purine/pyrimidine pattern of the next codons: purine (R) is preferred to pyrimidine (Y) at the third site when followed by an R-Y-R codon, and pyrimidine is preferred when followed by an R-R-Y, an R-Y-Y or a Y-R-Y codon. This bias is probably related to interactions of tRNA molecules in the ribosome.  相似文献   

9.
Enterogenic Escherichia coli (ETEC) F18 strains are the main pathogenic bacteria causing severe diarrhea in humans and domestic animals. However, the information about synonymous codon usage pattern of ETEC F18 genome remains unclear. We conducted a genome-wide analysis of synonymous codon usage patterns in the ETEC F18 strain SRA: SAMN02471895. After filtering of the complete genome sequence, 4327 coding sequences were analyzed using multivariate statistical methods to calculate synonymous codon usage patterns and to evaluate the influence of various factors in shaping the codon usage. The mean GC content was 51.38%, with a slight preference for G/C-ending codons. Twenty-two codons were determined as ‘‘optimal codons”. ENC plots showed some of the genes were on or close to the expected curve, while only points with low-ENC values were below the curve. PR2 analysis showed that GC and AT were not used proportionally, suggesting major roles for mutational pressure and natural selection in shaping usage. Neutrality plots showed a significant correlation between GC12 and GC3, suggesting that mutational pressure is responsible for nucleotide composition in shaping the strength of codon usage. Translational selection was the main factor shaping the codon usage pattern of ETEC F18 genome, while other factors such as protein length, GRAVY and ARO values also influenced codon usage to some extent. We analyzed the codon usage pattern systematically and identified the factors shaping codon usage bias in the ETEC F18 genome. Such information further elucidates the mechanisms of synonymous codon usage bias and provides the basis of molecular genetic engineering and evolutionary studies.  相似文献   

10.

Background  

The exchange of nucleotides at synonymous sites in a gene encoding a protein is believed to have little impact on the fitness of a host organism. This should be especially true for synonymous transitions, where a pyrimidine nucleotide is replaced by another pyrimidine, or a purine is replaced by another purine. This suggests that transition redundant exchange (TREx) processes at the third position of conserved two-fold codon systems might offer the best approximation for a neutral molecular clock, serving to examine, within coding regions, theories that require neutrality, determine whether transition rate constants differ within genes in a single lineage, and correlate dates of events recorded in genomes with dates in the geological and paleontological records. To date, TREx analysis of the yeast genome has recognized correlated duplications that established a new metabolic strategies in fungi, and supported analyses of functional change in aromatases in pigs. TREx dating has limitations, however. Multiple transitions at synonymous sites may cause equilibration and loss of information. Further, to be useful to correlate events in the genomic record, different genes within a genome must suffer transitions at similar rates.  相似文献   

11.
It is shown that synonymous codon usage is less biased in favor of those codons preferred by highly expressed genes at the end ofEscherichia coli genes than in the middle. This appears to be due to the close proximity of manyE. coli genes. It is shown that a substantial number of genes overlap either the Shine-Dalgarno sequence or the coding sequence of the next gene on the chromosome and that the codons that overlap have lower synonymous codon bias than those which do not. It is also shown that there is an increase in the frequency of A-ending codons, and a decrease in the frequency of G-ending codons at the end ofE. coli genes that lie close to another gene. It is suggested that these trends in composition could be associated with selection against the formation of mRNA secondary structure near the start of the next gene on the chromosome. Stop codon use is also affected by the close proximity of genes; many genes are forced to use TGA and TAG stop codons because they terminate either within the Shine-Dalgarno or coding sequence of the next gene on the chromosome. The implications these results have for the evolution of synonymous codon use are discussed.  相似文献   

12.
王艳  赵懿琛  赵德刚 《广西植物》2021,41(2):274-282
为了解杜仲基因密码子使用模式,该文以杜仲基因组密码子为研究对象,运用CodonW软件对杜仲的320个蛋白编码基因进行同义密码子相对使用频率(RSCU)分析、ENC-GC3s关联分析编码基因的密码子ENC值、PR2-plot偏倚分析编码基因的密码子碱基使用频率,并运用CUSP软件与Codon Usage Database软件对杜仲基因密码子的GC含量、使用频率与代表性物种烟草、拟南芥、大肠杆菌和酿酒酵母的密码子GC含量和使用频率进行比较。结果表明:杜仲基因密码子的RSCU>1的密码子有30个,其中18个以G/C结尾、12个以A/U结尾,说明杜仲基因密码子偏好以G/C结尾,且偏好性较强;有效密码子数(ENC)范围为30~60,该范围内的密码子距离标准曲线较远,其ENC值小,偏好性较强;PR2-plot偏倚分析碱基使用频率显示,G>C、U>A;杜仲与代表性物种的GC含量分析显示,杜仲的GC12、GC3以及平均GC含量均高于代表性物种;杜仲与代表性物种的密码子使用频率分析显示,杜仲与烟草、酿酒酵母的密码子偏好较为接近,杜仲与拟南芥、大肠杆菌的密码子偏好差距较大。杜仲是我国特有的珍贵中药材,对其进行密码子使用模式分析,并研究其密码子偏好规律,为杜仲植物基因工程中外源基因的改良及表达提供了理论基础。  相似文献   

13.
 Codon bias and base composition in major histocompatibility complex (MHC) sequences have been studied for both class I and II loci in Homo sapiens and Pan troglodytes. There is low to moderate codon bias for the MHC of humans and chimpanzees. In the class I loci, the same level of moderate codon bias is seen for HLA-B, HLA-C, Patr-A, Patr-B, and Patr-C, while at HLA-A the level of codon bias is lower. There is a correlation between codon usage bias and G+C content in the A and B loci in humans and chimps, but not at the C locus. To examine the effect of diversifying selection on codon bias, we subdivided class I alleles into antigen recognition site (ARS) and non-ARS codons. ARS codons had lower bias than non-ARS codons. This may indicate that the constraint of codon bias on nucleotide substitution may be selected against in ARS codons. At the class II loci, there are distinct differences between alpha and beta chain genes with respect to codon usage, with the beta chain genes being much more biased. Species-specific differences in base composition were seen in exon 2 at the DRB1 locus, with lower GC content in chimpanzees. Considering the complex evolutionary history of MHC genes, the study of codon usage patterns provides us with a better understanding of both the evolutionary history of these genes and the evolution of synonymous codon usage in genes under natural selection. Received: 2 April 1998 / Revised: 2 September 1998  相似文献   

14.
Summary We examined the codon usages in wellconserved and less-well-conserved regions of vertebrate protein genes and found them to be similar. Despite this similarity, there is a statistically significant decrease in codon bias in the less-well-conserved regions. Our analysis suggests that although those codon changes initially fixed under amino acid replacements tend to follow the overall codon usage pattern, they also reduce the bias in codon usage. This decrease in codon bias leads one to predict that the rate of change of synonymous codons should be greater in those regions that are less well conserved at the amino acid level than in the better-conserved regions. Our analysis supports this prediction. Furthermore, we demonstrate a significantly elevated rate of change of synonymous codons among the adjacent codons 5 to amino acid replacement positions. This provides further support for the idea that there are contextual constraints on the choice of synonymous codons in eukaryotes.  相似文献   

15.
Sorimachi K  Okayasu T 《Amino acids》2008,34(4):661-668
When nucleotide (G, C, T and A) contents were plotted against each nucleotide, their relationships were clearly expressed by a linear formula, y = αx + β in the coding and non-coding regions. This linear relationship was obtained from the complete single-stranded DNA. Similarly, nucleotide contents at all three codon positions were expressed by linear regression lines based on the content of each nucleotide. In addition, 64 codon usages were also expressed by linear formulas against nucleotide content. Thus, the nucleotide content not only in coding sequence but also in non-coding sequence can be expressed by a linear formula, y = αx + β, in 145 organisms (112 bacteria, 15 archaea and 18 eukaryotes). Based on these results, the ratio of C/T, G/T, C/A or G/A one can essentially estimate all four nucleotide contents in the complete single-stranded DNA, and the determination of any ratio of two kinds of nucleotides can essentially estimate four nucleotide contents, nucleotide contents at the three different codon positions and codon distributions at 64 codons in the coding region. The maximum and minimum values of G content were ∼0.35 and ∼0.15, respectively, among various organisms examined. Codon evolution occurs according to linear formulas between these two values. Electronic supplementary material The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

16.
Selection pressures on proteins are usually measured by comparing homologous nucleotide sequences (Zuckerkandl and Pauling 1965). Recently we introduced a novel method, termed volatility, to estimate selection pressures on proteins on the basis of their synonymous codon usage (Plotkin and Dushoff 2003; Plotkin et al. 2004). Here we provide a theoretical foundation for this approach. Under the Fisher-Wright model, we derive the expected frequencies of synonymous codons as a function of the strength of selection on amino acids, the mutation rate, and the effective population size. We analyze the conditions under which we can expect to draw inferences from biased codon usage, and we estimate the time scales required to establish and maintain such a signal. We find that synonymous codon usage can reliably distinguish between negative selection and neutrality only for organisms, such as some microbes, that experience large effective population sizes or periods of elevated mutation rates. The power of volatility to detect positive selection is also modest—requiring approximately 100 selected sites—but it depends less strongly on population size. We show that phenomena such as transient hyper-mutators can improve the power of volatility to detect selection, even when the neutral site heterozygosity is low. We also discuss several confounding factors, neglected by the Fisher-Wright model, that may limit the applicability of volatility in practice. Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users. [Reviewing Editor: Dr. Lauren Meyers]  相似文献   

17.
Evidence from a variety of sources indicates that selection has influenced synonymous codon usage in Drosophila. It has generally been difficult, however, to distinguish selection that acted in the distant past from ongoing selection. However, under a neutral model, polymorphisms usually reflect more recent mutations than fixed differences between species and may, therefore, be useful for inferring recent selection. If the ancestral state is preferred, selection should shift the frequency distribution of derived states/site toward lower values; if the ancestral is unpreferred, selection should increase the number of derived states/site. Polymorphisms were classified as ancestrally preferred or unpreferred for several genes of D. simulans and D. melanogaster. A computer simulation of coalescence was employed to derive the expected frequency distributions of derived states/site under various modifications of the Wright–Fisher neutral model, and distributions of test statistics (t and Mann–Whitney U) were derived by appropriate sampling. One-tailed tests were applied to transformed frequency data to assess whether the two frequency distributions deviated from neutral expectations in the direction predicted by selection on codon usage. Several genes from D. simulans appear to be subject to recent selection on synonymous codons, including one gene with low codon bias, esterase-6. Selection may also be acting in D. melanogaster. Received: 15 April 1998 / Accepted: 13 May 1999  相似文献   

18.
In this study, major factors shaping codon and amino acid usage variation Lactobacillus sakei 23K were investigated. It included 13 other Lactobacillus species for a comparative analysis. The correspondence analysis (COA) showed that in 13 species the major trend of synonymous codon usage was highly correlated with gene expression level as assessed by the “Codon Adaptation Index” (CAI) values. In addition, Nc (effective number of codons) plot, SCUO (synonymous codon usage order) plot and correlation analyses showed that the base composition and mutational bias have dominant role in the codon usage variation. However, the translational selection for genes at higher expression level, where more frequent synonymous codons correspond to more abundant cognate transfer RNAs (tRNAs), was not found to be similar in all species. The study also showed that the amino acid usage in these species was significantly (P < 0.01) influenced by hydrophobicity and aromaticity of proteins. Furthermore, 24 codons that were found to be optimally used by L. sakei and its comparative study with 13 Lactobacillus species might provide some useful information in their further study of molecular evolution and genetic engineering.  相似文献   

19.
Wan J  Bai X  Liu W  Xu J  Xu M  Gao H 《Molecular biology reports》2009,36(6):1299-1303
Prion diseases are fatal neurodegenerative disorders of humans and certain other mammals. Prion protein gene (Prnp) is associated with susceptibility and species barrier to prion diseases. No natural and experimental prion diseases have been documented to date in Arctic fox. In the present study, coding region of Prnp from 135 Arctic foxes were cloned and screened for polymorphisms. Our results indicated that the Arctic fox Prnp open reading frame (ORF) contains 771 nucleotides encoding 257 amino acids. Four single nucleotide polymorphisms (SNPs) (G312C, A337G, C541T, and A723G) were identified. SNPs G312C and A723G produced silent mutations, but SNPs A337G and C541T resulted in a M–V change at codon 113 and R–C at codon 181, respectively. The Arctic fox Prnp amino acid sequence was similar to that of the dog (XM 542906). In short, this study provides preliminary information about genotypes of Prnp in Arctic fox.  相似文献   

20.
Mycoplasma bovis is a major pathogen causing arthritis, respiratory disease and mastitis in cattle. A better understanding of its genetic features and evolution might represent evidences of surviving host environments. In this study, multiple factors influencing synonymous codon usage patterns in M. bovis (three strains’ genomes) were analyzed. The overall nucleotide content of genes in the M. bovis genome is AT-rich. Although the G and C contents at the third codon position of genes in the leading strand differ from those in the lagging strand (p<0.05), the 59 synonymous codon usage patterns of genes in the leading strand are highly similar to those in the lagging strand. The over-represented codons and the under-represented codons were identified. A comparison of the synonymous codon usage pattern of M. bovis and cattle (susceptible host) indicated the independent formation of synonymous codon usage of M. bovis. Principal component analysis revealed that (i) strand-specific mutational bias fails to affect the synonymous codon usage pattern in the leading and lagging strands, (ii) mutation pressure from nucleotide content plays a role in shaping the overall codon usage, and (iii) the major trend of synonymous codon usage has a significant correlation with the gene expression level that is estimated by the codon adaptation index. The plot of the effective number of codons against the G+C content at the third codon position also reveals that mutation pressure undoubtedly contributes to the synonymous codon usage pattern of M. bovis. Additionally, the formation of the overall codon usage is determined by certain evolutionary selections for gene function classification (30S protein, 50S protein, transposase, membrane protein, and lipoprotein) and translation elongation region of genes in M. bovis. The information could be helpful in further investigations of evolutionary mechanisms of the Mycoplasma family and heterologous expression of its functionally important proteins.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号