Correcting the Bias of Empirical Frequency Parameter Estimators in Codon Models
Authors: Sergei Kosakovsky Pond, Wayne Delport, Spencer V. Muse, Konrad Scheffler
Affiliation: 1. Department of Medicine, University of California San Diego, San Diego, California, United States of America; 2. Department of Pathology, University of California San Diego, San Diego, California, United States of America; 3. Department of Statistics, North Carolina State University, Raleigh, North Carolina, United States of America; 4. Computer Science Division, Department of Mathematical Sciences, Stellenbosch University, Stellenbosch, South Africa; Aarhus University, Denmark
Abstract: Markov models of codon substitution are powerful inferential tools for studying biological processes such as natural selection and preferences in amino acid substitution. The equilibrium character distributions of these models are almost always estimated using nucleotide frequencies observed in a sequence alignment, primarily as a matter of historical convention. In this note, we demonstrate that a popular class of such estimators is biased, and that this bias has an adverse effect on goodness of fit and estimates of substitution rates. We propose a “corrected” empirical estimator that begins with observed nucleotide counts, but accounts for the nucleotide composition of stop codons. We show via simulation that the corrected estimates outperform the de facto standard estimates not just by providing better estimates of the frequencies themselves, but also by leading to improved estimation of other parameters in the evolutionary models. On a curated collection of sequence alignments, our estimators show a significant improvement in goodness of fit compared to the de facto standard approach. Maximum likelihood estimation of the frequency parameters appears to be warranted in many cases, albeit at a greater computational cost. Our results demonstrate that there is little justification, either statistical or computational, for continued use of the de facto standard estimators.
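The source of the bias can be illustrated with a small sketch. It assumes, as an illustration only, that the popular estimator multiplies position-specific nucleotide frequencies and renormalizes over sense codons, and that the universal genetic code (stop codons TAA, TAG, TGA) applies. The function names are hypothetical, and the corrected estimator proposed by the authors is not reproduced here.

```python
from itertools import product

NUCS = "ACGT"
STOPS = {"TAA", "TAG", "TGA"}  # assumption: universal genetic code


def product_codon_freqs(pos_freqs):
    """Codon frequencies as a product of per-position nucleotide
    frequencies, renormalized after dropping stop codons."""
    raw = {}
    for codon in map("".join, product(NUCS, repeat=3)):
        if codon in STOPS:
            continue
        raw[codon] = (pos_freqs[0][codon[0]]
                      * pos_freqs[1][codon[1]]
                      * pos_freqs[2][codon[2]])
    total = sum(raw.values())
    return {codon: value / total for codon, value in raw.items()}


def implied_position_freqs(codon_freqs):
    """Per-position nucleotide frequencies implied by a codon distribution."""
    implied = [dict.fromkeys(NUCS, 0.0) for _ in range(3)]
    for codon, freq in codon_freqs.items():
        for position, nuc in enumerate(codon):
            implied[position][nuc] += freq
    return implied


# Feed in perfectly uniform nucleotide frequencies at every position.
uniform = [dict.fromkeys(NUCS, 0.25) for _ in range(3)]
codon_freqs = product_codon_freqs(uniform)
implied = implied_position_freqs(codon_freqs)

for position, freqs in enumerate(implied, start=1):
    print(position, {nuc: round(f, 4) for nuc, f in freqs.items()})
```

Even with uniform input, the implied frequency of T at the first position comes out as 13/61 rather than 0.25, because three of the sixteen T-initial codons are stop codons. The equilibrium distribution therefore does not reproduce the nucleotide composition it was built from, which is the kind of mismatch that a correction accounting for stop-codon composition is meant to remove.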