首页 | 本学科首页   官方微博 | 高级检索  
   检索      

论二次抽样以矫正多样性统计偏差的经典、发展与应用
引用本文:黄浩,文蝶,黄汉晖.论二次抽样以矫正多样性统计偏差的经典、发展与应用[J].古生物学报,2023,62(3):424-435.
作者姓名:黄浩  文蝶  黄汉晖
作者单位:中国科学院地质与地球物理研究所, 北京 100029;南京大学, 地球科学与工程学院, 南京 210023
基金项目:国家自然科学基金面上项目(41872036)资助
摘    要:稀疏标准化是定量古生物工作中矫正多样性统计偏差的常用方法。相比基于样本大小的传统稀疏化,基于采样充分度的改进能更忠实反映多样性信息。然而一些案例对于稀疏化的适用性不够重视,尤其是改进的方法鲜有国内文献介绍。本文阐述了稀疏化的原理,强调了应用的注意事项和改进方法的优势。稀疏化的原理是从大小不同的样本中二次抽样出彼此“公平”的子样本,以比较其分类单元丰富度。传统方法据样本大小衡量公平,改进的方法据采样充分度评估公平,要求子样本在群落中代表的个体频率总和相等。两种思路均可通过计算机模拟多次重复二次抽样或公式推导来计算,已有PAST和iNext等软件可以实现。采样是否充分代表了古生物群落是有效应用该方法的首要前提。

关 键 词:稀疏标准化  多样性  丰富度  样本大小  采样充分度  二次抽样
收稿时间:2023/3/3 0:00:00
修稿时间:2023/4/4 0:00:00

Classic formula, updated algorithm and application of rarefaction: bias correction in fossil diversity through subsampling
HUANG Hao,WEN Die,HUANG Han-hui.Classic formula, updated algorithm and application of rarefaction: bias correction in fossil diversity through subsampling[J].Acta Palaeontologica Sinica,2023,62(3):424-435.
Authors:HUANG Hao  WEN Die  HUANG Han-hui
Institution:Institute of Geology and Geophysics, Chinese Academy of Sciences, Beijing 100029 , China;School of Earth Sciences and Engineering, Nanjing University, Nanjing 210023 , China
Abstract:Taxonomic diversity of paleocommunities is a key metric for tracing the evolution of life and underlying geological events. However, the taxonomic richness of fossil collections or compiled data is easily biased by differences in sampling size. Rarefaction is a routine statistical method to mitigate such biases by reducing larger collections to a consistent sample size with the smaller ones. Traditional individual-based rarefaction has been increasingly superseded in the literature by coverage-based rarefaction (or SQS, shareholder quorum subsampling as named by some paleontologists). However, some case studies still show certain misunderstanding of this longstanding method, and coverage-based rarefaction has rarely been clarified in the Chinese literature. In order to better apply this method, this paper introduces the principle, details of calculation and suggestions for application of the rarefaction techniques. The core idea of rarefaction is to randomly resample from the original samples until the subsamples reach a consistent sample level, then the mathematical expectation of the taxonomic richness of these subsamples is calculated for comparison. Traditional rarefaction method evaluates such consistency by the same sample size, such as the number of specimens or fossil occurrences in literature. One major drawback of this traditional method is that the information of larger samples is often severely compressed. To address this problem, an updated method, i.e., coverage-based rarefaction, requires resampling until the equal sample coverage is achieved. The degree of coverage is measured by the sum of the individual frequencies in the community covered by the taxa in the subsamples. It has been well demonstrated that the updated method could more faithfully reflect the true ratio of taxonomic richness among communities. Both the traditional and updated rarefaction methods can be implemented by algorithmic simulation or analytical derivation, and software such as PAST or iNext is convenient for implementation. The primary requirement for applying rarefaction is that the samples at hand are as representative of the paleocommunity as possible. We also suggest several potential directions to further develop the rarefaction techniques in the field of quantitative paleontology.
Keywords:
点击此处可从《古生物学报》浏览原始摘要信息
点击此处可从《古生物学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号