首页 | 本学科首页   官方微博 | 高级检索  
   检索      

基于不同特征挖掘方法结合广义提升回归模型估测安徽省土壤pH
引用本文:王世航,卢宏亮,赵明松,周玲美.基于不同特征挖掘方法结合广义提升回归模型估测安徽省土壤pH[J].应用生态学报,2020,31(10):3509-3517.
作者姓名:王世航  卢宏亮  赵明松  周玲美
作者单位:1.安徽理工大学空间信息与测绘工程学院, 安徽淮南 232001;2.中国科学院南京土壤研究所, 土壤与农业可持续发展国家重点实验室, 南京 210008
基金项目:国家自然科学基金项目(31700369,41501226)资助
摘    要:为探讨不同特征挖掘方法与广义提升回归模型相结合在数字土壤制图中的应用,本研究首先使用递归特征消除和过滤式两种特征筛选方法对环境协变量进行筛选,再分别使用原始环境协变量、筛选后的最优变量组合作为自变量,建立基于广义提升回归模型和随机森林模型的安徽省土壤pH预测模型并进行制图。结果表明: 引入两种特征挖掘方法均可有效提高广义提升回归模型和随机森林模型预测土壤pH的精度,并且可以起到降维的作用;相较于随机森林模型,广义提升回归模型的验证集预测精度略低,在训练集中,广义提升回归模型的精度却远高于随机森林模型,模型解释度高,整体效果较好;随机森林模型的主要参数ntree和mtry对于模型的影响程度较低,而不同参数对于广义提升回归模型的预测精度影响较大,不同参数组合模型精度不同,建模前需要进行调参。空间制图结果表明,安徽省土壤pH呈“南酸北碱”趋势。

关 键 词:土壤pH  特征挖掘  广义提升回归模型  随机森林  机器学习  安徽省  
收稿时间:2020-05-06

Assessing soil pH in Anhui Province based on different features mining methods combined with generalized boosted regression models
WANG Shi-hang,LU Hong-liang,ZHAO Ming-song,ZHOU Ling-mei.Assessing soil pH in Anhui Province based on different features mining methods combined with generalized boosted regression models[J].Chinese Journal of Applied Ecology,2020,31(10):3509-3517.
Authors:WANG Shi-hang  LU Hong-liang  ZHAO Ming-song  ZHOU Ling-mei
Institution:1.School of Geomatics, Anhui University of Science and Technology, Huainan 232001, Anhui, China;2.State Key Laboratory of Soil and Sustainable Agriculture, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008, China
Abstract:We explored the application of different feature mining methods combined with genera-lized boosted regression models in digital soil mapping. Environmental covariates were selected by two feature selection methods i.e., recursive feature elimination and selection by filtering. Using the original environmental covariates and the selected optimal variable combination as independent varia-bles, soil pH prediction model of Anhui Province was established and mapped based on the genera-lized boosted regression model and random forest model. The results showed that both kinds of feature mining methods could effectively improve the accuracy of soil pH prediction by generalized boosted regression models and random forest model, and could reduce dimensionality. Compared with the random forest model, the prediction accuracy of the validation set of the generalized boosted regression model was slightly lower. In the training set, the accuracy of the generalized boosted regression models was much higher than that of the random forest model, with higher interpretation and better overall effect. The main parameters of the random forest model, ntree and mtry, had limi-ted effect on the model. Different parameters and their combination could affect the prediction accuracy of the generalized boosted regression models, and thus should be tuned before modeling. The results of spatial mapping showed that soil pH in Anhui Province showed a pattern of “south acid and north alkali”.
Keywords:soil pH  feature mining  generalized boosted regression models  random forest  machine learning  Anhui Province  
本文献已被 CNKI 等数据库收录!
点击此处可从《应用生态学报》浏览原始摘要信息
点击此处可从《应用生态学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号