首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Extracting abundance information from DNA-based data
Authors:Mingjie Luo  Yinqiu Ji  David Warton  Douglas W Yu
Institution:1. State Key Laboratory of Genetic Resources and Evolution and Yunnan Key Laboratory of Biodiversity and Ecological Security of Gaoligong Mountain, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China

Kunming College of Life Sciences, University of Chinese Academy of Sciences, Kunming, Yunnan, China;2. State Key Laboratory of Genetic Resources and Evolution and Yunnan Key Laboratory of Biodiversity and Ecological Security of Gaoligong Mountain, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China;3. School of Mathematics and Statistics, UNSW Sydney, Sydney, New South Wales, Australia

Evolution and Ecology Research Centre, UNSW Sydney, Sydney, New South Wales, Australia

Abstract:The accurate extraction of species-abundance information from DNA-based data (metabarcoding, metagenomics) could contribute usefully to diet analysis and food-web reconstruction, the inference of species interactions, the modelling of population dynamics and species distributions, the biomonitoring of environmental state and change, and the inference of false positives and negatives. However, multiple sources of bias and noise in sampling and processing combine to inject error into DNA-based data sets. To understand how to extract abundance information, it is useful to distinguish two concepts. (i) Within-sample across-species quantification describes relative species abundances in one sample. (ii) Across-sample within-species quantification describes how the abundance of each individual species varies from sample to sample, such as over a time series, an environmental gradient or different experimental treatments. First, we review the literature on methods to recover across-species abundance information (by removing what we call “species pipeline biases”) and within-species abundance information (by removing what we call “pipeline noise”). We argue that many ecological questions can be answered with just within-species quantification, and we therefore demonstrate how to use a “DNA spike-in” to correct for pipeline noise and recover within-species abundance information. We also introduce a model-based estimator that can be used on data sets without a physical spike-in to approximate and correct for pipeline noise.
Keywords:Arthropoda  biomonitoring  community composition  DNA barcoding  environmental DNA  Insecta  internal standard  polymerase chain reaction  taxonomic bias
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号