The Distribution Patterns of Bases of Protein-coding Genes,Non-coding ORFs,and Intergenic Sequences in Pseudomonas aeruginosa PA01 Genome and its Implications |
| |
Authors: | Feng-Biao Guo |
| |
Affiliation: | School of Life Science and Technology University of Electronic Science and Technology of China , Chengdu , 610054 , China |
| |
Abstract: | Abstract The distribution patterns of bases of DNA fragments in different regions in P. aeruginosa genome are analyzed in this paper. It's shown that 5565 protein-coding genes, 17315 non- coding ORFs, and 1104 intergenic sequences are located into seven clusters based on their base frequencies. Almost all the protein-coding genes are contained in one of the seven clusters. The significant difference of base frequencies among three codon positions in high GC genome, which arouse the division between the distribution patterns of bases of six reading frames of protein-coding genes, is responsible for the appearance of the clustering phenomenon. In the light of the clustering phenomenon, the author supposes that the anitisense strand ORFs, particularly those corresponding to Frame 2′ and Frame 3′, may not code for proteins in P. aeruginosa genome. |
| |
Keywords: | Pseudomonas aeruginosa PA01 genome Base frequencies Distribution pattern Clustering phenomenon Antisense strand ORF |
|
|