Genome inhomogeneity is determined mainly by WW and SS dinucleotides |
| |
Authors: | Kozhukhin Costya G; Pevzner Pavel A |
| |
Institution: | Institute of Control Sciences, Moscow 117342;
Laboratory of Mathematical Methods, Institute of Genetics of Microorganisms, Moscow 113545, USSR |
| |
Abstract: | According to the hypothesis of the modular structure of DNA, genomes consist of modules of various nature which may differ in statistical characteristics. Statistical analysis helps in revealing the differences in statistical characteristics and predicting the modular structure. In this connection the question about the contribution of each word of length l (l-tuple) to the inhomogeneity of genetic text arises. The notion of stationary (i.e. relatively evenly distributed over a genome) versus non-stationary l-tuples has been introduced previously. In this paper, the dinucleotide distributions for all long sequences from GenBank were analyzed and it was shown that non-stationary dinucleotides are closely associated with polyW and polyS tracts (W denotes the weak nucleotides A or T, while S stands for the strong nucleotides G or C). Thus, genome inhomogeneity is shown to be determined mainly by AA, TT, GG, CC, AT, TA, GC and CG dinucleotides. It has been demonstrated that neither codon usage nor the isochore model can account for this phenomenon. |
| |
Keywords: | |
This article is indexed in Oxford and other databases. |
|
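The abstract describes stationary l-tuples as those spread relatively evenly over a genome, but it does not state the statistic used to decide this. The sketch below is therefore only an illustration under stated assumptions, not the authors' method: it counts a dinucleotide in consecutive non-overlapping windows and scores unevenness with a variance-to-mean ratio. The function names, the window size, and the choice of score are all hypothetical.

```python
# Illustrative sketch only. The windowing scheme and the variance-to-mean
# score are assumptions; the abstract does not specify the actual test.

WEAK = set("AT")    # W: weak nucleotides A or T
STRONG = set("GC")  # S: strong nucleotides G or C


def dinucleotide_class(dinuc):
    """Classify a two-letter DNA word as 'WW', 'SS', or 'mixed'."""
    first, second = dinuc
    if first in WEAK and second in WEAK:
        return "WW"
    if first in STRONG and second in STRONG:
        return "SS"
    return "mixed"


def window_counts(seq, word, window):
    """Count overlapping occurrences of `word` in each consecutive,
    non-overlapping window of length `window` (a trailing partial
    window is ignored)."""
    counts = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        hits = sum(
            1
            for i in range(len(chunk) - len(word) + 1)
            if chunk[i:i + len(word)] == word
        )
        counts.append(hits)
    return counts


def dispersion_index(counts):
    """Variance-to-mean ratio of the window counts. Larger values mean
    the word is less evenly distributed (hypothetical unevenness score)."""
    n = len(counts)
    mean = sum(counts) / n
    if mean == 0:
        return 0.0
    variance = sum((c - mean) ** 2 for c in counts) / n
    return variance / mean


if __name__ == "__main__":
    # Toy sequence: a polyA tract followed by a mixed repeat.
    toy = "A" * 20 + "ACGT" * 5
    for word in ("AA", "CG", "AC"):
        counts = window_counts(toy, word, 20)
        print(word, dinucleotide_class(word), counts, dispersion_index(counts))
```

On the toy sequence, the count of AA is concentrated entirely in the polyA window, which is the kind of tract-driven unevenness the abstract attributes to WW and SS dinucleotides. Real genomic data would need much larger windows and a proper significance test.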