首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
A computer program (PCBI) was developed to quickly calculate codon bias index (CBI). PCBI can analyze a gene containing introns. The 22 preferred codons defined fromSaccharomyces cerevisiae were used in PCBI as the standard to measure the CBI values. However, users can modify the preferred codons to suit each organism. The data PCBI provides include DNA sequence of open reading frame without introns, amino acid sequence of gene product, a table of amino acid composition, a table of codon usage and (G+C) content, parameters for calculating CBI, and the value of CBI. PCBI runs on DOS or Windows environment, but results can be saved in ASCII text format.  相似文献   

3.
New and simple numerical criteria based on a codon adaptation index are applied to the complete genomic sequences of 80 Eubacteria and 16 Archaea, to infer weak and strong genome tendencies toward content bias, translational bias, and strand bias. These criteria can be applied to all microbial genomes, even those for which little biological information is known, and a codon bias signature, that is the collection of strong biases displayed by a genome, can be automatically derived. A codon bias space, where genomes are identified by their preferred codons, is proposed as a novel formal framework to interpret genomic relationships. Principal component analysis confirms that although GC content has a dominant effect on codon bias space, thermophilic and mesophilic species can be identified and separated by codon preferences. Two more examples concerning lifestyle are studied with linear discriminant analysis: suitable separating functions characterized by sets of preferred codons are provided to discriminate: translationally biased (hyper)thermophiles from mesophiles, and organisms with different respiratory characteristics, aerobic, anaerobic, facultative aerobic and facultative anaerobic. These results suggest that codon bias space might reflect the geometry of a prokaryotic "physiology space." Evolutionary perspectives are noted, numerical criteria and distances among organisms are validated on known cases, and various results and predictions are discussed both on methodological and biological grounds.  相似文献   

4.
 The mutational response index (MRI) and measurements of codon bias were determined in three characterised genes, and two open reading frames of unknown function, from the Frankia nif operon, which encodes genes for nitrogen fixation. The merits of the different systems of measuring codon usage are discussed in the light of the results, as are the applicability of these techniques to the assessment of the translational function of putative open reading frames. Received: 8 August 1997 / Accepted: 19 September 1997  相似文献   

5.
It has recently been demonstrated that human natural codon usage bias is optimized towards a higher buffering capacity to mutations (measured as the tendency of single point mutations in a DNA sequence to yield the same or similar amino acids) compared to random sequences. In this work, we investigate this phenomenon further by analyzing the natural DNA of four different species (human, mouse, zebrafish and fruit fly) to determine whether such a tolerance to mutations is correlated with the life span and age of sexual maturation for the corresponding organisms. We also propose a new measure to quantify the buffering capacity of a DNA sequence to mutations that takes into account the observed mutation rates within every genome and the effect of the corresponding mutation.Our results suggest there is a propensity for tolerance to mutations that is positively correlated with the life expectancy of the considered organisms. Moreover, random sequences that are constrained to produce the same protein as the naturally occurring sequences are found to be more buffered than completely random sequences while being less buffered than the natural sequences. These results suggest that optimization toward protective mechanisms tolerant to mutations is correlated with both life expectancy and age to sexual maturity at both the levels of codon usage bias and the bias of the natural sequence of codons itself.  相似文献   

6.
A simple, effective measure of synonymous codon usage bias, the Codon Adaptation Index, is detailed. The index uses a reference set of highly expressed genes from a species to assess the relative merits of each codon, and a score for a gene is calculated from the frequency of use of all codons in that gene. The index assesses the extent to which selection has been effective in moulding the pattern of codon usage. In that respect it is useful for predicting the level of expression of a gene, for assessing the adaptation of viral genes to their hosts, and for making comparisons of codon usage in different organisms. The index may also give an approximate indication of the likely success of heterologous gene expression.  相似文献   

7.
The codon bias in Escherichia coli for all two-fold degenerate amino acids was studied as dependent on the context from the six bases in the nearest surrounding codons. By comparing the results in genes at different expression levels, effects that are due to differences in mutation rates can be distinguished from those that are due to selection. Selective effects on the codon bias is found mostly from the first neighbouring base in the 3'direction, while neighbouring bases further away influence mostly the mutational bias. In some cases it is also possible to identify specific molecular processes, repair or avoidance of frame shift, that lead to the context dependence of the bias.  相似文献   

8.
Codon optimizer: a freeware tool for codon optimization   总被引:2,自引:0,他引:2  
Selection plays a major role in the determination of codon usage in all organisms studied so far. In highly expressed genes, a narrow set of codons is used and these codons correspond to the more abundant tRNA species. This minimizes the risk of tRNA depletion during translation. In fact, the codons in a gene may be true bottlenecks, especially in cases where foreign genes are expressed in a host in which the usage of codons in highly expressed genes does not resemble the usage of codons in the species from which the foreign gene originates. In such cases, it has been shown that substitution of rare codons in the introduced gene may increase the yield dramatically. In addition, replacement of rare codons might decrease the chance of misincorporation and protect the protein from premature turnover. Here, a piece of software is announced that calculates a codon-optimized sequence of any gene based on knowledge of highly expressed genes of a host. In addition, it calculates the codon adaptation index of the gene and identifies internal type II restriction sites of the optimized sequence. The program runs under Windows and is available as freeware for use in academia.  相似文献   

9.
Multiple synonymous codons code for the same amino acid, resulting in the degeneracy of the genetic code and in the preferred used of some codons called codon bias usage (CBU). We performed a large-scale analysis of codon usage bias analysing the distribution of the codon adaptation index (CAI) and the codon relative adaptiveness index (RA) in 4868 bacterial genomes. We found that CAI values differ significantly between protein functional domains and part of the protein outside domains and show how CAI, GC content and preferred usage of polymerase III alpha subunits are related. Additionally, we give evidence of the association between CAI and bacterial phenotypes.  相似文献   

10.
Stenström CM  Jin H  Major LL  Tate WP  Isaksson LA 《Gene》2001,263(1-2):273-284
The codon that follows the AUG initiation triplet (+2 codon) affects gene expression in Escherichia coli. We have extended this analysis using two model genes lacking any apparent Shine-Dalgarno sequence. Depending on the identity of the +2 codon a difference in gene expression up to 20-fold could be obtained. The effects did not correlate with the levels of intracellular pools of cognate tRNA for the +2 codon, with putative secondary mRNA structures, or with mRNA stability. However, most +2 iso-codons that were decoded by the same species of tRNA gave pairwise similar effects, suggesting that the effect on gene expression was associated with the decoding tRNA. High adenine content of the +2 codon was associated with high gene expression. Of the fourteen +2 codons that mediated the highest efficiency, all except two had an adenine as the first base of the codon. Analysis of the 3540 E. coli genes from the TransTerm database revealed that codons associated with high gene expression in the two expression systems are over-represented at the +2 position in natural genes. Codons that are associated with low gene expression are under-represented. The data suggest that evolution has favored codons at the +2 position that give high translation initiation.  相似文献   

11.
Lavner Y  Kotlar D 《Gene》2005,345(1):127-138
We study the interrelations between tRNA gene copy numbers, gene expression levels and measures of codon bias in the human genome. First, we show that isoaccepting tRNA gene copy numbers correlate positively with expression-weighted frequencies of amino acids and codons. Using expression data of more than 14,000 human genes, we show a weak positive correlation between gene expression level and frequency of optimal codons (codons with highest tRNA gene copy number). Interestingly, contrary to non-mammalian eukaryotes, codon bias tends to be high in both highly expressed genes and lowly expressed genes. We suggest that selection may act on codon bias, not only to increase elongation rate by favoring optimal codons in highly expressed genes, but also to reduce elongation rate by favoring non-optimal codons in lowly expressed genes. We also show that the frequency of optimal codons is in positive correlation with estimates of protein biosynthetic cost, and suggest another possible action of selection on codon bias: preference of optimal codons as production cost rises, to reduce the rate of amino acid misincorporation. In the analyses of this work, we introduce a new measure of frequency of optimal codons (FOP'), which is unaffected by amino acid composition and is corrected for background nucleotide content; we also introduce a new method for computing expected codon frequencies, based on the dinucleotide composition of the introns and the non-coding regions surrounding a gene.  相似文献   

12.
Objectives: The paper was aimed to establish the influence of some general and local factors on adaptation process to removable prostheses (RPs). The adaptation process is a complex issue, which is often associated with painful reactions. Those complaints force patients to visit a dentist who makes alterations to reduce the patient's discomfort. Material and methods: The study involved analysis of 300 dental records of patients who visited our Department for RPs. The authors analysed the influence of gender, age, condition of general health, maintenance of the prosthetic base tissues and the kind of prostheses on the process of adaptation. It was measured by means of a number of follow‐up visits of the patients to our polyclinic. The findings were analysed statistically by means of chi‐squared test. The level of significance was assumed to be p < 0.05. Results and conclusions: Adaptation to RPs without any correction was revealed by about one‐fifth of patients. Men adapted to RPs better than women. The biggest problems with adaptation to RPs were observed in patients using a complete and partial prosthesis simultaneously. The number of follow‐up visits by patients who were treated with RPs for the first time or had been treated before was almost the same. Adaptation of RPs on an atrophic muco‐osseous ridge was associated with more multiple visits than in the case of a well‐preserved ridge. Healthy patients adapt to RPs better than patients with systemic disorders. Taking into account the limitations of the study, the number of follow‐up visits may be used as a helpful indicator of the adaptation process.  相似文献   

13.
PICDI is a very simple program designed to calculate the Intrinsic Codon Deviation Index (ICDI). The program is available in Macintosh as well a PC format. Requirements for correct input of the sequences have been kept to a minimum and the analysis of sequences up to 2000 codons is very quick. The ICDI is very useful for estimation of codon bias of genes from species in which optimal codons are not known. The availability of a computer program for its calculation will increase its usefulness in the fields of Molecular Biology and Biotechnology.  相似文献   

14.
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~ 60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~ 40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~ 65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences.  相似文献   

15.
Highly expressed genes in many bacteria and small eukaryotes often have a strong compositional bias, in terms of codon usage. Two widely used numerical indices, the codon adaptation index (CAI) and the codon usage, use this bias to predict the expression level of genes. When these indices were first introduced, they were based on fairly simple assumptions about which genes are most highly expressed: the CAI was originally based on the codon composition of a set of only 24 highly expressed genes, and the codon usage on assumptions about which functional classes of genes are highly expressed in fast-growing bacteria. Given the recent advent of genome-wide expression data, we should be able to improve on these assumptions. Here, we measure, in yeast, the degree to which consideration of the current genome-wide expression data sets improves the performance of both numerical indices. Indeed, we find that by changing the parameterization of each model its correlation with actual expression levels can be somewhat improved, although both indices are fairly insensitive to the exact way they are parameterized. This insensitivity indicates a consistent codon bias amongst highly expressed genes. We also attempt direct linear regression of codon composition against genome-wide expression levels (and protein abundance data). This has some similarity with the CAI formalism and yields an alternative model for the prediction of expression levels based on the coding sequences of genes. More information is available at http://bioinfo.mbb.yale.edu/expression/codons.  相似文献   

16.
Palidwor GA  Perkins TJ  Xia X 《PloS one》2010,5(10):e13431

Background

In spite of extensive research on the effect of mutation and selection on codon usage, a general model of codon usage bias due to mutational bias has been lacking. Because most amino acids allow synonymous GC content changing substitutions in the third codon position, the overall GC bias of a genome or genomic region is highly correlated with GC3, a measure of third position GC content. For individual amino acids as well, G/C ending codons usage generally increases with increasing GC bias and decreases with increasing AT bias. Arginine and leucine, amino acids that allow GC-changing synonymous substitutions in the first and third codon positions, have codons which may be expected to show different usage patterns.

Principal Findings

In analyzing codon usage bias in hundreds of prokaryotic and plant genomes and in human genes, we find that two G-ending codons, AGG (arginine) and TTG (leucine), unlike all other G/C-ending codons, show overall usage that decreases with increasing GC bias, contrary to the usual expectation that G/C-ending codon usage should increase with increasing genomic GC bias. Moreover, the usage of some codons appears nonlinear, even nonmonotone, as a function of GC bias. To explain these observations, we propose a continuous-time Markov chain model of GC-biased synonymous substitution. This model correctly predicts the qualitative usage patterns of all codons, including nonlinear codon usage in isoleucine, arginine and leucine. The model accounts for 72%, 64% and 52% of the observed variability of codon usage in prokaryotes, plants and human respectively. When codons are grouped based on common GC content, 87%, 80% and 68% of the variation in usage is explained for prokaryotes, plants and human respectively.

Conclusions

The model clarifies the sometimes-counterintuitive effects that GC mutational bias can have on codon usage, quantifies the influence of GC mutational bias and provides a natural null model relative to which other influences on codon bias may be measured.  相似文献   

17.
18.
To better conceptualize the mechanism underlying the evolution of synonymous codons, we have analysed intragenic codon usage in chosen "regions" of some mouse and human genes. We divided a given gene into two regions: one consisting of a trinucleotide repeat (TNR) and the other consisting of the "rest of the coding region" (RCR). Usually, a TNR is composed of a repetitive single codon, which may reflect its frequency in a gene. In contrast, a non-random frequency of a codon in the RCR versus TNR (or vice versa) of a gene should indicate a bias for that codon within the TNR. We examined this scenario by comparing codon frequency between the RCR and the cognate TNR(s) for a set of human and mouse genes. A TNR length of six amino acids or more was used to identify genes from the Genbank database. Twenty nine human and twenty one mouse genes containing TNRs coding for nine different amino acid runs were identified. The ratio of codon frequency in a TNR versus the corresponding RCR was expressed as "fold change" which was also regarded as a measure of codon bias (defined as preferential use either in TNR or in RCR). Chi-square values were then determined from the distribution of codon frequency in a TNR vs. the cognate RCR. At p<0.001, 22% and 27%, respectively, of human and mouse TNRs showed codon bias. Greater than 40% of the TNRs (29 out of 69 in human, and 18 of 42 in mouse) showed codon bias at p<0.05. In addition, we identify eight single-codon TNRs in mouse and ten in human genes. Thus, our results show intragenic codon bias in both mouse and human genes expressed in diverse tissue types. Since our results are independent of the Codon Adaptation Index (CAI) and starvation CAI, and since the tRNA repertoire in a cell or in a tissue is constant, our data suggest that other constraints besides tRNA abundance played a role in creating intragenic codon bias in these genes.  相似文献   

19.
Summary This paper is concerned with the divergence of synonymous codon usage and its bias in three homologous genes within vertebrate species. Genetic distances among species are described in terms of synonymous codon usage divergence and the correlation is found between the genetic distances and taxonomic distances among species under study. A codon usage clock is reported in alphaglobin and beta-globin. A method is developed to define the synonymous codon preference bias and it is observed that the bias changes considerably among species.  相似文献   

20.
Analysis of synonymous codon usage bias in Chlamydia   总被引:9,自引:0,他引:9  
Chlamydiae are obligate intracellular bacterial pathogens that cause ocular and sexuallytransmitted diseases,and are associated with cardiovascular diseases.The analysis of codon usage mayimprove our understanding of the evolution and pathogenesis of Chlamydia and allow reengineering of targetgenes to improve their expression for gene therapy.Here,we analyzed the codon usage of C.muridarum,C.trachomatis(here indicating biovar trachoma and LGV),C.pneumoniae,and C.psittaci using the codonusage database and the CUSP(Create a codon usage table)program of EMBOSS(The European MolecularBiology Open Software Suite).The results show that the four genomes have similar codon usage patterns,with a strong bias towards the codons with A and T at the third codon position.Compared with Homosapiens,the four chlamydial species show discordant seven or eight preferred codons.The ENC(effectivenumber of codons used in a gene)-plot reveals that the genetic heterogeneity in Chlamydia is constrained bythe G+C content,while translational selection and gene length exert relatively weaker influences.Moreover,mutational pressure appears to be the major determinant of the codon usage variation among the chlamydialgenes.In addition,we compared the codon preferences of C.trachomatis with those of E.coli,yeast,adenovirus and Homo sapiens.There are 23 codons showing distinct usage differences between C.trachomatisand E.coli,24 between C.trachomatis and adenovirus,21 between C.trachomatis and Homo sapiens,butonly six codons between C.trachomatis and yeast.Therefore,the yeast system may be more suitable for theexpression of chlamydial genes.Finally,we compared the codon preferences of C.trachomatis with those ofsix eukaryotes,eight prokaryotes and 23 viruses.There is a strong positive correlation between the differ-ences in coding GC content and the variations in codon bias(r=0.905,P<0,001).We conclude that thevariation of codon bias between C.trachomatis and other organisms is much less influenced by phylogeneticlineage and primarily determined by the extent of disparities in GC content.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号