A simulation study of sample size for DNA barcoding
Authors: Arong Luo, Haiqiang Lan, Cheng Ling, Aibing Zhang, Lei Shi, Simon Y W Ho, Chaodong Zhu
Institution: 1. Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China; 2. School of Statistics and Mathematics, Yunnan University of Finance and Economics, Kunming, China; 3. Department of Computer Science and Technology, College of Information Science and Technology, Beijing University of Chemical Technology, Beijing, China; 4. College of Life Sciences, Capital Normal University, Beijing, China; 5. School of Biological Sciences, University of Sydney, Sydney, New South Wales, Australia; 6. College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
Abstract: For some groups of organisms, DNA barcoding can provide a useful tool in taxonomy, evolutionary biology, and biodiversity assessment. However, the efficacy of DNA barcoding depends on the degree of sampling per species, because a large enough sample size is needed to estimate genetic polymorphism reliably and to delimit species. We used a simulation approach to examine the effects of sample size on four estimators of genetic polymorphism related to DNA barcoding: mismatch distribution, nucleotide diversity, the number of haplotypes, and maximum pairwise distance. Our results showed that mismatch distributions derived from subsamples of ≥20 individuals usually bore a close resemblance to that of the full dataset. Estimates of nucleotide diversity from subsamples of ≥20 individuals tended to follow a bell-shaped distribution around the value for the full dataset, whereas estimates from smaller subsamples did not. As expected, greater sampling generally led to an increase in the number of haplotypes. We also found that subsamples of ≥20 individuals allowed a good estimate of the maximum pairwise distance of the full dataset, while smaller ones were associated with a high probability of underestimation. Overall, our study confirms the expectation that larger samples are beneficial for the efficacy of DNA barcoding and suggests that a minimum sample size of 20 individuals is needed in practice for each population.
Keywords: coalescence, haplotype, maximum pairwise distance, mismatch distribution, nucleotide diversity
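
The four estimators named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' simulation code: the function names `hamming`, `barcode_estimators`, and `subsample_estimators` are hypothetical, distances are assumed to be uncorrected proportions of differing sites (the study may have used a different distance model), and the input is assumed to be a list of aligned, equal-length sequences without gaps.

```python
import random
from collections import Counter
from itertools import combinations


def hamming(a, b):
    """Count the sites at which two aligned sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def barcode_estimators(seqs):
    """Compute four polymorphism estimators for aligned sequences.

    Assumption: distances are uncorrected (raw proportion of differing sites).
    """
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    # Number of differing sites for every unordered pair of individuals.
    diffs = [hamming(a, b) for a, b in combinations(seqs, 2)]
    return {
        # How many pairs differ at 0, 1, 2, ... sites.
        "mismatch_distribution": dict(sorted(Counter(diffs).items())),
        # Mean pairwise differences per site.
        "nucleotide_diversity": sum(diffs) / len(diffs) / length,
        # Number of distinct sequences.
        "n_haplotypes": len(set(seqs)),
        # Largest pairwise distance per site.
        "max_pairwise_distance": max(diffs) / length,
    }


def subsample_estimators(seqs, n, replicates=100, seed=0):
    """Draw random subsamples of n individuals (without replacement)
    and compute the estimators for each replicate."""
    rng = random.Random(seed)
    return [barcode_estimators(rng.sample(seqs, n)) for _ in range(replicates)]
```

Comparing the output of `subsample_estimators(seqs, n)` for several values of `n` against `barcode_estimators(seqs)` on the full dataset mirrors the kind of comparison the abstract describes. Because a subsample contains only pairs that also exist in the full dataset, its maximum pairwise distance can never exceed the full-dataset value, which is why small subsamples tend to underestimate it.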