首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The relationship between the codon usage bias and the sequence context surrounding the AUG translation initiation codon was examined in 1100 Drosophila melanogaster mRNA sequences. The codon usage bias measured by the "codon adaptation index" (CAI), and the effectiveness of the AUG context for translation initiation assessed by the "AUG context adaptation index" (AUGCAI), showed a significant positive relationship (correlation coefficient: r = 0.34, p <0.0001), indicating that these two factors are evolutionally under a similar natural selection constraint at the translational level. The importance of each position of the AUG context in relation to codon usage bias was examined, and the preference for the nucleotide at the -13, -12, -11, -10, -7, -6, -5, -4, -3, -2, and -1 positions showed a significant positive correlation to the codon usage bias, suggesting the action of natural selection on these very specific positions of the Drosophila genome. The relationship between AUGCAI value and gene length was also examined, and a significant negative relationship was found (r = -0.15, p <0.0001), suggesting a general tendency of higher expressivity of shorter genes, and of lower expressivity of longer genes in D. melanogaster.  相似文献   

2.
In the present study, we examined GC nucleotide composition, relative synonymous codon usage (RSCU), effective number of codons (ENC), codon adaptation index (CAI) and gene length for 308 prokaryotic mechanosensitive ion channel (MSC) genes from six evolutionary groups: Euryarchaeota, Actinobacteria, Alphaproteobacteria, Betaproteobacteria, Firmicutes, and Gammaproteobacteria. Results showed that: (1) a wide variation of overrepresentation of nucleotides exists in the MSC genes; (2) codon usage bias varies considerably among the MSC genes; (3) both nucleotide constraint and gene length play an important role in shaping codon usage of the bacterial MSC genes; and (4) synonymous codon usage of prokaryotic MSC genes is phylogenetically conserved. Knowledge of codon usage in prokaryotic MSC genes may benefit from the study of the MSC genes in eukaryotes in which few MSC genes have been identified and functionally analysed.  相似文献   

3.
It is important and meaningful to understand the codon usage pattern and the factors that shape codon usage of maize. In this study, trends in synonymous codon usage in maize have been firstly examined through the multivariate statistical analysis on 7402 cDNA sequences. The results showed that the genes positions on the primary axis were strongly negatively correlated with GC3s, GC content of individual gene and gene expression level assessed by the codon adaptation index (CAI) values, which indicated that nucleotide composition and gene expression level were the main factors in shaping the codon usage of maize, and the variation in codon usage among genes may be due to mutational bias at the DNA level and natural selection acting at the level of mRNA translation. At the same time, CDS length and the hydrophobicity of each protein were, respectively, significantly correlated with the genes locations on the primary axis, GC3s and CAI values. We infer that genes length and the hydrophobicity of the encoded protein may play minor role in shaping codon usage bias. Additional 28 codons ending with a G or C base have been defined as “optimal codons”, which may provide useful information for maize gene-transformation and gene prediction.  相似文献   

4.
5.
The compositional non-randomness was studied in genes of Saccharomyces cerevisiae and Schizosaccharomyces pombe. In both species, codon usage is well correlated with expressivity (measured as the codon adaptation index). Both species generally display higher nucleotide non-randomness in the group of highly expressed genes than in the lowly expressed genes. The highly expressed genes in both species are furthermore characterized by marked peaks in non-randomness at N=3 upstream of start codons, N=2 downstream of start codons and at N=1 and N=7 downstream of stop codons, indicating that these nucleotides may be key elements in translational regulation. Intragenic variation in codon usage was also observed to be linked to expressivity. It is suggested that the firm link between expressivity and codon usage calls for codon optimization. Based on bioinformatic calculations, examples of proteins are given for which codon optimizations might be relevant.  相似文献   

6.
Mitogen activated protein kinase (MAPK) genes provide resistance to various biotic and abiotic stresses. Codon usage profiling of the genes reveals the characteristic features of the genes like nucleotide composition, gene expressivity, optimal codons etc. The present study is a comparative analysis of codon usage patterns for different MAPK genes in three organisms, viz. Arabidopsis thaliana, Glycine max (soybean) and Oryza sativa (rice). The study has revealed a high AT content in MAPK genes of Arabidopsis and soybean whereas in rice a balanced AT-GC content at the third synonymous position of codon. The genes show a low bias in codon usage profile as reflected in the higher values (50.83 to 56.55) of effective number of codons (Nc). The prediction of gene expression profile in the MAPK genes revealed that these genes might be under the selective pressure of translational optimization as reflected in the low codon adaptation index (CAI) values ranging from 0.147 to 0.208.  相似文献   

7.
The variation in base composition at the three codon sites in relation to gene expressivity, the latter estimated by the Codon Adaptation Index, has been studied in a sample of 1371 Escherichia coli genes. Correlation and regression analyses show that increasing expression levels are accompanied by higher frequencies of base G at first, of base A at second and of base C at third codon positions. However, correlation between expressivity and base compositional biases at each codon site was only significant and positive at first codon position. The preference for G-starting codons as gene expression level increases is discussed in terms of translational optimization.  相似文献   

8.
葡萄基因组密码子使用偏好模式研究   总被引:2,自引:0,他引:2  
根据完整基因组序列,运用多元统计分析和对应分析的方法,探讨了葡萄全基因组序列密码子的使用模式和影响密码子使用的各种可能因素。结果显示:葡萄密码子偏好性主要受到碱基差异(r=0.925)和自然选择(r=0.193)共同作用的影响,突变压力占了主导因素,自然选择的作用较小。同时基因长度和蛋白质疏水性也对密码子的偏好性有所影响。确定了葡萄的20个最优密码子。  相似文献   

9.
10.
An ab initio model for gene prediction in prokaryotic genomes is proposed based on physicochemical characteristics of codons calculated from molecular dynamics (MD) simulations. The model requires a specification of three calculated quantities for each codon: the double-helical trinucleotide base pairing energy, the base pair stacking energy, and an index of the propensity of a codon for protein-nucleic acid interactions. The base pairing and stacking energies for each codon are obtained from recently reported MD simulations on all unique tetranucleotide steps, and the third parameter is assigned based on the conjugate rule previously proposed to account for the wobble hypothesis with respect to degeneracies in the genetic code. The third interaction propensity parameter values correlate well with ab initio MD calculated solvation energies and flexibility of codon sequences as well as codon usage in genes and amino acid composition frequencies in ∼175,000 protein sequences in the Swissprot database. Assignment of these three parameters for each codon enables the calculation of the magnitude and orientation of a cumulative three-dimensional vector for a DNA sequence of any length in each of the six genomic reading frames. Analysis of 372 genomes comprising ∼350,000 genes shows that the orientations of the gene and nongene vectors are well differentiated and make a clear distinction feasible between genic and nongenic sequences at a level equivalent to or better than currently available knowledge-based models trained on the basis of empirical data, presenting a strong support for the possibility of a unique and useful physicochemical characterization of DNA sequences from codons to genomes.  相似文献   

11.
Positive correlation between gene expression and synonymous codon usage bias is well documented in the literature. However, in the present study of Vibrio cholerae genome, we have identified a group of genes having unusually high codon usage bias despite being low potential expressivity. Our results suggest that codon usage in lowly expressed genes might also be selected on to preferably use non-optimal codons to maintain a low cellular concentration of the proteins that they encode. This would predict that lowly expressed genes are also biased in codon usage, but in a way that is opposite to the bias of highly expressed genes.  相似文献   

12.
Liu Q  Feng Y  Xue Q 《Mitochondrion》2004,4(4):313-320
In this paper, the main factors shaping codon usage in the mitochondrion genome of rice were reported. Correspondence analysis, a commonly used multivariate statistical approach, was carried out to analyze synonymous codon usage bias. The results showed that the main trend was strongly correlated with the gene expression level assessed by the 'Codon Adaptation Index' value, a result that was confirmed by the distribution of genes along the first axis. From the results that there were two significant correlations between axis 1 coordinates and the GC, GC3s content at silent sites of each sequence, and clearly significant correlations between the 'Effective Number of Codons' values and GC, GC3s content, we inferred that codon usage bias was affected by gene nucleotide composition also. In addition, the hydrophobicity of each protein also played some roles in shaping codon usage in this organelle, which could be confirmed by the significant correlation between the positions of genes placed on the first axis and the hydrophobicity value of each protein. In summary, natural selection played a crucial role, nucleotide mutational bias and amino acid composition only in a minor way, in shaping codon usage in the mitochondrion genome of rice. Notably, 21 codons defined firstly as 'optimal codons' might provide some more useful information for gene engineering and/or evolution studying.  相似文献   

13.
Synonymous codon usage is a commonly used means for estimating gene expression levels of Escherichia coli genes and has also been used for predicting highly expressed genes for a number of prokaryotic genomes. By comparison of expression level-dependent features in codon usage with protein abundance data from two proteome studies of exponentially growing E. coli and Bacillus subtilis cells, we try to evaluate whether the implicit assumption of this approach can be confirmed with experimental data. Log-odds ratio scores are used to model differences in codon usage between highly expressed genes and genomic average. Using these, the strength and significance of expression level-dependent features in codon usage were determined for the genes of the Escherichia coli, Bacillus subtilis and Haemophilus influenzae genomes. The comparison of codon usage features with protein abundance data confirmed a relationship between these to be present, although exceptions to this, possibly related to functional context, were found. For species with expression level-dependent features in their codon usage, the applied methodology could be used to improve in silico simulations of the outcome of two-dimensional gel electrophoretic experiments.  相似文献   

14.
Predicted highly expressed genes of diverse prokaryotic genomes   总被引:13,自引:0,他引:13       下载免费PDF全文
  相似文献   

15.
Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome—between 5% and 33%, depending on genome size—while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl–tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.  相似文献   

16.
17.
Gu W  Zhou T  Ma J  Sun X  Lu Z 《Bio Systems》2004,73(2):89-97
The role of silent position in the codon on the protein structure is an interesting and yet unclear problem. In this paper, 563 Homo sapiens genes and 417 Escherichia coli genes coding for proteins with four different folding types have been analyzed using variance analysis, a multivariate analysis method newly used in codon usage analysis, to find the correlation between amino acid composition, synonymous codon, and protein structure in different organisms. It has been found that in E. coli, both amino acid compositions in differently folded proteins and synonymous codon usage in different gene classes coding for differently folded proteins are significantly different. It was also found that only amino acid composition is different in different protein classes in H. sapiens. There is no universal correlation between synonymous codon usage and protein structure in these two different organisms. Further analysis has shown that GC content on the second codon position can distinguish coding genes for different folded proteins in both organisms.  相似文献   

18.
Genes involved in the symbiotic interactions between the nitrogen-fixing endosymbiont Bradyrhizobium japonicum, and its leguminous host are mostly clustered in a symbiotic island (SI), acquired by the bacterium through a process of horizontal transfer. A comparative analysis of the codon and amino acid usage in core and SI genes/proteins of B. japonicum has been carried out in the present study. The mutational bias, translational selection, and gene length are found to be the major sources of variation in synonymous codon usage in the core genome as well as in SI, the strength of translational selection being higher in core genes than in SI. In core proteins, hydrophobicity is the main source of variation in amino acid usage, expressivity and aromaticity being the second and third important sources. But in SI proteins, aromaticity is the chief source of variation, followed by expressivity and hydrophobicity. In SI proteins, both the mean molecular weight and mean aromaticity of individual proteins exhibit significant positive correlation with gene expressivity, which violate the cost-minimization hypothesis. Investigation of nucleotide substitution patterns in B. japonicum and Mesorhizobium loti orthologous genes reveals that both synonymous and non-synonymous sites of highly expressed genes are more conserved than their lowly expressed counterparts and this conservation is more pronounced in the genes present in core genome than in SI.  相似文献   

19.
20.
Chanda I  Pan A  Saha SK  Dutta C 《FEBS letters》2007,581(30):5751-5758
Comparative analyses of codon/amino acid usage in Leishmania major, Trypanosoma brucei and Trypanosoma cruzi reveal that gene expressivity and GC-bias play key roles in shaping the gene composition of all three parasites, and protein composition of L. major only. In T. brucei and T. cruzi, the major contributors to the variation in protein composition are hydropathy and/or aromaticity. Principle of Cost Minimization is followed by T. brucei, disregarded by T. cruzi and opposed by L. major. Slowly evolving highly expressed gene-products of L. major bear signatures of relatively AT-rich ancestor, while faster evolution under GC-bias has characterized the lowly expressed genes of the species by higher GC12-content.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号