首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Sampling Issues Affecting Accuracy of Likelihood-based Classification Using Genetical Data
Authors:B Guinand  KT Scribner  A Topchy  KS Page  W Punch  MK Burnham-Curtis
Institution:1. Department of Fisheries and Wildlife, Michigan State University, East Lansing, MI, 48824, U.S.A.
2. Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, 48824, U.S.A.
3. U.S.G.S/BRD, Great Lakes Science Center, 1451 Green Rd., Ann Arbor, MI, 48105, U.S.A.
Abstract:We demonstrate the effectiveness of a genetic algorithm for discovering multi-locus combinations that provide accurate individual assignment decisions and estimates of mixture composition based on likelihood classification. Using simulated data representing different levels of inter-population differentiation (Fst~ 0.01 and 0.10), genetic diversities (four or eight alleles per locus), and population sizes (20, 40, 100 individuals in baseline populations), we show that subsets of loci can be identified that provide comparable levels of accuracy in classification decisions relative to entire multi-locus data sets, where 5, 10, or 20 loci were considered. Microsatellite data sets from hatchery strains of lake trout, Salvelinus namaycush, representing a comparable range of inter-population levels of differentiation in allele frequencies confirmed simulation results. For both simulated and empirical data sets, assignment accuracy was achieved using fewer loci (e.g., three or four loci out of eight for empirical lake trout studies). Simulation results were used to investigate properties of the ‘leave-one-out’ (L1O) method for estimating assignment error rates. Accuracy of population assignments based on L1O methods should be viewed with caution under certain conditions, particularly when baseline population sample sizes are low (<50).
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号