Accounting for missing data in the estimation of contemporary genetic effective population size (Ne)
Authors: D. Peel, R. S. Waples, G. M. Macbeth, C. Do, J. R. Ovenden
Institution: 1. CSIRO Mathematics, Informatics and Statistics, Castray Esplanade, Hobart, TAS 7001, Australia; 2. Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, WA 98112, USA; 3. Conservation Biology Division, Northwest Fisheries Science Center, Seattle, WA, USA
Abstract: Theoretical models are often applied to population genetic data sets without fully considering the effect of missing data. Researchers can deal with missing data by removing individuals that have failed to yield genotypes and/or by removing loci that have failed to yield allelic determinations, but despite their best efforts, most data sets still contain some missing data. As a consequence, realized sample size differs among loci, and this poses a problem for unbiased methods that must explicitly account for random sampling error. One commonly used solution for the calculation of contemporary effective population size (Ne) is to calculate the effective sample size as an unweighted mean or harmonic mean across loci. This is not ideal because it fails to account for the fact that loci with different numbers of alleles have different information content. Here we consider this problem for genetic estimators of contemporary effective population size (Ne). To evaluate bias and precision of several statistical approaches for dealing with missing data, we simulated populations with known Ne and various degrees of missing data. Across all scenarios, one method of correcting for missing data (fixed‐inverse variance‐weighted harmonic mean) consistently performed the best for both single‐sample and two‐sample (temporal) methods of estimating Ne and outperformed some methods currently in widespread use. The approach adopted here may be a starting point to adjust other population genetics methods that include per‐locus sample size components.
Keywords: effective population size; linkage disequilibrium; missing data; Ne; temporal method
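
Illustration: the abstract contrasts an unweighted mean or harmonic mean of per-locus sample sizes with a weighted harmonic mean. The Python sketch below shows only how the three summaries differ numerically. The per-locus sample sizes and allele counts are invented, and the weights (number of alleles minus one) are a hypothetical stand-in for information content; the abstract does not give the inverse-variance weights actually used, so this should not be read as the authors' method.

```python
from statistics import harmonic_mean, mean


def weighted_harmonic_mean(values, weights):
    """Weighted harmonic mean: sum(w) / sum(w / x)."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    if any(x <= 0 for x in values) or any(w <= 0 for w in weights):
        raise ValueError("values and weights must be positive")
    return sum(weights) / sum(w / x for w, x in zip(weights, values))


# Hypothetical data: individuals successfully genotyped at each locus
# (realized sample size differs among loci because of missing data).
sample_sizes = [50, 48, 40, 25]

# Hypothetical number of alleles observed at each locus.
n_alleles = [2, 4, 8, 12]

# ASSUMPTION: weight each locus by (alleles - 1) as a crude proxy for
# information content. These are NOT the paper's inverse-variance weights.
weights = [k - 1 for k in n_alleles]

s_arithmetic = mean(sample_sizes)
s_harmonic = harmonic_mean(sample_sizes)
s_weighted = weighted_harmonic_mean(sample_sizes, weights)

print(f"unweighted mean:        {s_arithmetic:.2f}")
print(f"harmonic mean:          {s_harmonic:.2f}")
print(f"weighted harmonic mean: {s_weighted:.2f}")
```

In this made-up example the locus with the most alleles also has the most missing data, so weighting pulls the effective sample size below both unweighted summaries. With equal weights the weighted harmonic mean reduces to the ordinary harmonic mean.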