首页 | 本学科首页   官方微博 | 高级检索  
   检索      


First and second moment of counts of words in random texts generated by Markov chains
Authors:Kleffe  J; Borodovsky  M
Institution:Institute of Molecular Biology and Biochemistry, Department of Molecular Biology and Informatics, Free University of Berlin Arnimallee 22, D-1000, Berlin 33, Germany
1School of Biology, Georgia Institute of Technology Atlanta, GA 30332, USA and Institute of Molecular Genetics 123182 Moscow
Abstract:An exact expression for the variance of random frequency thata given word has in text generated by a Markov chain is presented.The result is applied to periodic Markov chains, which describethe protein-coding DNA sequences better than simple Markov chains.A new solution to the problem of word overlap is proposed. Itwas found that the expected frequency and overlapping propertiesdetermine most of the variance. The expectation and varianceof counts for triplets are compared with experimental countsin Escherichia coli coding sequences.
Keywords:
本文献已被 Oxford 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号