首页 | 本学科首页   官方微博 | 高级检索  
     


Purging putative siblings from population genetic data sets: a cautionary view
Authors:Robin S. Waples  Eric C. Anderson
Affiliation:1. NOAA Fisheries, Northwest Fisheries Science Center, Seattle, WA, USA;2. NOAA Fisheries, Southwest Fisheries Science Center, Santa Cruz, CA, USA
Abstract:Interest has surged recently in removing siblings from population genetic data sets before conducting downstream analyses. However, even if the pedigree is inferred correctly, this has the potential to do more harm than good. We used computer simulations and empirical samples of coho salmon to evaluate strategies for adjusting samples to account for family structure. We compared performance in full samples and sibling‐reduced samples of estimators of allele frequency (urn:x-wiley:09621083:media:mec14022:mec14022-math-0003), population differentiation (urn:x-wiley:09621083:media:mec14022:mec14022-math-0004) and effective population size (urn:x-wiley:09621083:media:mec14022:mec14022-math-0005). Results: (i) unless simulated samples included large family groups together with a component of unrelated individuals, removing siblings generally reduced precision of urn:x-wiley:09621083:media:mec14022:mec14022-math-0006 and urn:x-wiley:09621083:media:mec14022:mec14022-math-0007; (ii) urn:x-wiley:09621083:media:mec14022:mec14022-math-0008 based on the linkage disequilibrium method was largely unbiased using full random samples but became increasingly upwardly biased under aggressive purging of siblings. Under nonrandom sampling (some families over‐represented), urn:x-wiley:09621083:media:mec14022:mec14022-math-0009 using full samples was downwardly biased; removing just the right ‘Goldilocks’ fraction of siblings could produce an unbiased estimate, but this sweet spot varied widely among scenarios; (iii) weighting individuals based on the inferred pedigree (to produce a best linear unbiased estimator, BLUE) maximized precision of urn:x-wiley:09621083:media:mec14022:mec14022-math-0010 when the inferred pedigree was correct but performed poorly when the pedigree was wrong; (iv) a variant of sibling removal that leaves intact small sibling groups appears to be more robust to errors in inferences about family structure. Our results illustrate the complex challenges posed by presence of family structure, suggest that no single optimal solution exists and argue for caution in adjusting population genetic data sets for the presence of putative siblings without fully understanding the consequences.
Keywords:allele frequency  effective population size  family structure  genetic differentiation  precision
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号