首页 | 本学科首页   官方微博 | 高级检索  
     


Basing population genetic inferences and models of molecular evolution upon desired stationary distributions of DNA or protein sequences
Authors:Choi Sang Chul  Redelings Benjamin D  Thorne Jeffrey L
Affiliation:Bioinformatics Research Center, North Carolina State University, Box 7566, Raleigh, NC 27695-7566, USA.
Abstract:Models of molecular evolution tend to be overly simplistic caricatures of biology that are prone to assigning high probabilities to biologically implausible DNA or protein sequences. Here, we explore how to construct time-reversible evolutionary models that yield stationary distributions of sequences that match given target distributions. By adopting comparatively realistic target distributions,evolutionary models can be improved. Instead of focusing on estimating parameters, we concentrate on the population genetic implications of these models. Specifically, we obtain estimates of the product of effective population size and relative fitness difference of alleles. The approach is illustrated with two applications to protein-coding DNA. In the first, a codon-based evolutionary model yields a stationary distribution of sequences, which, when the sequences are translated,matches a variable-length Markov model trained on human proteins. In the second, we introduce an insertion-deletion model that describes selectively neutral evolutionary changes to DNA. We then show how to modify the neutral model so that its stationary distribution at the amino acid level can match a profile hidden Markov model, such as the one associated with the Pfam database.
Keywords:variable-length Markov model   profile hidden Markov model   insertion–deletion model   scaled selection coefficient   fitness   Pfam
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号