首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Without quality presence–absence data,discrimination metrics such as TSS can be misleading measures of model performance
Authors:Boris Leroy  Robin Delsol  Bernard Hugueny  Christine N Meynard  Chéïma Barhoumi  Morgane Barbet‐Massin  Céline Bellard
Institution:1. Unité Biologie des Organismes et Ecosystèmes Aquatiques (BOREA UMR 7208), Muséum National d'Histoire Naturelle, Sorbonne Universités, Université de Caen Normandie, Université des Antilles, CNRS, IRD, Paris, France;2. Ecologie, Systématique & Evolution, UMR CNRS 8079, Univ. Paris‐Sud, Orsay Cedex, France;3. Laboratoire évolution & Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi‐Pyrénées, CNRS, IRD, UPS, Toulouse Cedex 9, France;4. CBGP, INRA, CIRAD, IRD, Montpellier SupAgro, Univ Montpellier, Montpellier, France;5. Institut des Sciences de l'Evolution de Montpellier, UMR CNRS 5554, Univ. De Montpellier, Montpellier Cedex, France;6. Department of Genetics, Evolution and Environment, Center for Biodiversity and Environment Research, University College of London, London, UK
Abstract:The discriminating capacity (i.e. ability to correctly classify presences and absences) of species distribution models (SDMs) is commonly evaluated with metrics such as the area under the receiving operating characteristic curve (AUC), the Kappa statistic and the true skill statistic (TSS). AUC and Kappa have been repeatedly criticized, but TSS has fared relatively well since its introduction, mainly because it has been considered as independent of prevalence. In addition, discrimination metrics have been contested because they should be calculated on presence–absence data, but are often used on presence‐only or presence‐background data. Here, we investigate TSS and an alternative set of metrics—similarity indices, also known as F‐measures. We first show that even in ideal conditions (i.e. perfectly random presence–absence sampling), TSS can be misleading because of its dependence on prevalence, whereas similarity/F‐measures provide adequate estimations of model discrimination capacity. Second, we show that in real‐world situations where sample prevalence is different from true species prevalence (i.e. biased sampling or presence‐pseudoabsence), no discrimination capacity metric provides adequate estimation of model discrimination capacity, including metrics specifically designed for modelling with presence‐pseudoabsence data. Our conclusions are twofold. First, they unequivocally impel SDM users to understand the potential shortcomings of discrimination metrics when quality presence–absence data are lacking, and we recommend obtaining such data. Second, in the specific case of virtual species, which are increasingly used to develop and test SDM methodologies, we strongly recommend the use of similarity/F‐measures, which were not biased by prevalence, contrary to TSS.
Keywords:AUC  ecological niche models  model evaluation  prevalence  species distribution models
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号