The relationship between base composition and codon usage in bacterial genes and its use for the simple and reliable identification of protein-coding sequences |
| |
Authors: | M.J. Bibb P.R. Findlay M.W. Johnson |
| |
Affiliation: | John Innes Institute, Colney Lane, Norwich, NR4 7UH, U.K. Tel. 0603 52571 |
| |
Abstract: | Bacterial genes that code for proteins appear to possess a codon usage characteristic of their overall base composition. This results in different but predictable non-random distributions of nucleotides within codons, permitting the recognition of protein-coding sequences in a wide range of bacterial species. The nature of this distribution depends on the base composition of the coding sequence. The position-specific differences are especially conspicuous in genes of extreme G + C content, allowing the particularly reliable prediction of the reading frame and coding strand of experimentally determined DNA sequences. This fmding has been exploited to identify the coding sequence of the viomycin phosphotransferase (vph) gene of Streptomyces vinaceus. An easily applied computer program (“Frame”) has been written to carry out and display such analyses. |
| |
Keywords: | Computer analysis of DNA sequences program “Frame” Streptomyces viomycin phosphotransferase gene reading frame triplets |
本文献已被 ScienceDirect 等数据库收录! |