Codon evolution is governed by linear formulas |
| |
Authors: | K Sorimachi, T Okayasu |
| |
Institution: | (1) Educational Support Center, Dokkyo Medical University, Mibu, Tochigi 321-0293, Japan; (2) Center of Medical Informatics, Dokkyo Medical University, Mibu, Tochigi 321-0293, Japan |
| |
Abstract: | When the contents of the four nucleotides (G, C, T and A) were plotted against the content of each nucleotide, their relationships were clearly expressed
by a linear formula, y = αx + β, in both the coding and non-coding regions. This linear relationship was obtained from the complete single-stranded DNA. Similarly,
nucleotide contents at all three codon positions were expressed by linear regression lines based on the content of each nucleotide.
In addition, 64 codon usages were also expressed by linear formulas against nucleotide content. Thus, the nucleotide content
not only in coding sequences but also in non-coding sequences can be expressed by a linear formula, y = αx + β, in 145 organisms (112 bacteria, 15 archaea and 18 eukaryotes). Based on these results, any one of the ratios C/T, G/T, C/A or G/A
is essentially sufficient to estimate all four nucleotide contents in the complete single-stranded DNA. Moreover, determining any
ratio of two nucleotides is essentially sufficient to estimate the four nucleotide contents, the nucleotide contents at the three
codon positions and the distribution of the 64 codons in the coding region. The maximum and minimum values of G content were
∼0.35 and ∼0.15, respectively, among various organisms examined. Codon evolution occurs according to linear formulas between
these two values.
Electronic supplementary material: The online version of this article (doi:) contains supplementary material, which is available to authorized users. |
| |
Keywords: | Codon evolution; Nucleotide content; Linear empirical formula; Codon usage; Amino acid composition; Genome |
This article is indexed in PubMed, SpringerLink and other databases. |
|
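The abstract's central claim can be illustrated with a minimal sketch, which is not the authors' code. It assumes each nucleotide content is modelled as a line y = αx + β in the G content x. A measured ratio such as C/T then gives r = (α_C x + β_C) / (α_T x + β_T), which rearranges to x = (r β_T − β_C) / (α_C − r α_T); substituting x back into the lines gives the remaining contents. The function names `nucleotide_content`, `fit_line` and `g_from_ratio` are hypothetical, and any coefficients passed to them are the caller's own assumptions, not values reported in the paper.

```python
from collections import Counter


def nucleotide_content(seq):
    """Return the fractions of G, C, T and A in one single-stranded sequence."""
    counts = Counter(seq.upper())
    total = sum(counts[base] for base in "GCTA")
    if total == 0:
        raise ValueError("sequence contains no G, C, T or A")
    return {base: counts[base] / total for base in "GCTA"}


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = alpha * x + beta; returns (alpha, beta)."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    alpha = sxy / sxx
    beta = mean_y - alpha * mean_x
    return alpha, beta


def g_from_ratio(ratio, numerator_line, denominator_line):
    """Solve ratio = (a_n*g + b_n) / (a_d*g + b_d) for the G content g.

    Each line is an (alpha, beta) pair describing one nucleotide's content
    as a linear function of G content (an assumption of this sketch).
    """
    a_n, b_n = numerator_line
    a_d, b_d = denominator_line
    denom = a_n - ratio * a_d
    if denom == 0:
        raise ValueError("ratio does not determine G content for these lines")
    return (ratio * b_d - b_n) / denom
```

In use, one would compute `nucleotide_content` for each genome, call `fit_line` with the G contents as `xs` and another nucleotide's contents as `ys`, and then pass the fitted (α, β) pairs for the ratio's numerator and denominator to `g_from_ratio`.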