首页 | 本学科首页   官方微博 | 高级检索  
     


An expectation maximization algorithm for training hidden substitution models
Authors:Holmes I  Rubin G M
Affiliation:Howard Hughes Medical Institute, University of California, Berkeley, CA 94720, USA. ihh@fruitfly.org
Abstract:We derive an expectation maximization algorithm for maximum-likelihood training of substitution rate matrices from multiple sequence alignments. The algorithm can be used to train hidden substitution models, where the structural context of a residue is treated as a hidden variable that can evolve over time. We used the algorithm to train hidden substitution matrices on protein alignments in the Pfam database. Measuring the accuracy of multiple alignment algorithms with reference to BAliBASE (a database of structural reference alignments) our substitution matrices consistently outperform the PAM series, with the improvement steadily increasing as up to four hidden site classes are added. We discuss several applications of this algorithm in bioinformatics.
Keywords:molecular evolution   bioinformatics   amino acid substitution rates   Markov models
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号