首页 | 本学科首页   官方微博 | 高级检索  
     


An applicability index for reliable and applicable decision trees in water quality modelling
Affiliation:1. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China;2. State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China;3. Collaborative Innovation Center of Geospatial Technology, Wuhan 430079, China;4. College of Urban and Environment Sciences, Hubei Normal University, Huangshi 435002, China;5. Dalian Academy of Reconnaissance and Mapping Co. Ltd, Dalian 116061, China;1. Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa 904-0495, Japan;2. The University of Hong Kong, School of Biological Sciences, Kadoorie Biological Sciences Building, Pok Fu Lam Road, Hong Kong, SAR, China;1. Department of Environmental Geochemical Cycle Research, Japan Agency for Marine-Earth Science and Technology (JAMSTEC), Kanazawa-ku, Yokohama 236-0001, Japan;2. Faculty of Agriculture and Agricultural Science Programme, Kochi University, Nankoku-shi, Kochi 783-8502, Japan;3. United Graduate School of Agricultural Sciences, Ehime University, Matsuyama 790-8566, Japan;4. Research Institute for Humanity and Nature, Motoyama, Kamigamo, Kita-ku, Kyoto 603-8047, Japan;5. Graduate School of Human and Environmental Studies, Kyoto University, Yoshida Nihonmatsu-cho, Sakyo-ku, Kyoto 606-8501, Japan
Abstract:Data-driven environmental models are mainly assessed on the basis of their model fit and only limited attention is given to their applicability for end-users. In this paper, we present the applicability index (API) that scores decision trees in terms of their interpretability and applicability for end-users. The API integrates two criteria, viz. the simplicity of the model and its ability to predict the classes of the response variable. We developed 10,000 decision trees with different parameterizations and assessed the use of API for model selection with two different datasets. The API reduced the number of decision trees that were retained only based on statistical criteria from 2,806 to 173 and from 1,117 to 784, respectively. The models that were retained were more easily interpretable, equally statistically reliable but less complex. Conventional statistical criteria such as Cohen’s kappa and the number of correctly classified instances were only moderately correlated with the API (r = 0.26 and r = 0.49, respectively). This indicates that the API is a useful complement to the existing statistical criteria available for model selection. The API was tested for two datasets consisting of water quality data in lowland rivers in Belgium and the Netherlands, hence its validity needs to be tested for other types of data and modelling domains.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号