Detection of Short Protein Coding Regions within the Cyanobacterium Genome: Application of the Hidden Markov Model
Authors: Yada, Tetsushi (1); Hirosawa, Makoto (2)
Institution: (1) Japan Science and Technology Corporation (JST), 5-3 Yonbancho, Chiyoda-ku, Tokyo 102, Japan
(2) Kazusa DNA Research Institute (KDRI), 1532-3 Yana-uchino, Kisarazu, Chiba 292, Japan
Abstract: The gene-finding programs developed so far have paid little attention to the detection of short protein coding regions (CDSs). However, the detection of short CDSs is important for the study of photosynthesis. We used GeneHacker, a gene-finding program based on the hidden Markov model (HMM), to detect short CDSs (from 90 to 300 bases) in a 1.0-megabase contiguous sequence of the cyanobacterium Synechocystis sp. strain PCC6803, which carries a complete set of genes for oxygenic photosynthesis. GeneHacker differs from other HMM-based gene-finding programs in that it also uses di-codon statistics. GeneHacker successfully detected seven of the eight short CDSs annotated in this sequence and was clearly superior to GeneMark in this length range. GeneHacker also detected 94 potentially new CDSs, 9 of which have counterparts in the genetic databases. Four of these nine CDSs were shorter than 150 bases and were photosynthesis-related genes. The results show the effectiveness of GeneHacker in detecting very short CDSs that correspond to genes.
Keywords: cyanobacterium; gene finding; hidden Markov model; short protein coding region; oxygenic photosynthesis genes
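
The abstract names two ingredients without spelling them out: di-codon statistics and a 90 to 300 base length window for short CDSs. The sketch below illustrates what in-frame di-codon counting and a simple log-odds score could look like. It is not GeneHacker's model: the function names, the uniform background, the pseudocount, and the multiple-of-three length check are all assumptions made for illustration, and no HMM is involved.

```python
from collections import Counter
from math import log

N_HEXAMERS = 4 ** 6  # number of possible di-codons (6-base words)


def dicodon_counts(coding_seqs):
    """Count in-frame di-codons (adjacent codon pairs) in known coding sequences."""
    counts = Counter()
    for seq in coding_seqs:
        # Step by 3 so every 6-base window starts on a codon boundary.
        for i in range(0, len(seq) - 5, 3):
            counts[seq[i:i + 6]] += 1
    return counts


def dicodon_log_odds(seq, counts, pseudocount=1.0):
    """Sum log-odds of each in-frame di-codon against a uniform background.

    Assumption: the background is uniform over all 4**6 hexamers; a real
    gene finder would estimate it from non-coding sequence instead.
    """
    total = sum(counts.values())
    score = 0.0
    for i in range(0, len(seq) - 5, 3):
        p = (counts[seq[i:i + 6]] + pseudocount) / (total + pseudocount * N_HEXAMERS)
        score += log(p * N_HEXAMERS)
    return score


def in_short_cds_range(seq, lo=90, hi=300):
    """True if the length is within the 90-300 base window and a whole number of codons."""
    return lo <= len(seq) <= hi and len(seq) % 3 == 0
```

A candidate open reading frame would first be filtered with `in_short_cds_range` and then ranked by `dicodon_log_odds`, with counts taken from annotated genes of the same genome. Short candidates contain few di-codons, so their scores are noisy, which is one reason short CDSs are hard to detect.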
This article is indexed in Oxford and other databases.