Comparison of boosted regression tree and random forest models for mapping topsoil organic carbon concentration in an alpine ecosystem |
| |
Affiliation: | 1. Department of Soil Science, College of Agriculture, Shahid Bahonar University of Kerman, Kerman, Iran;2. Department of Soil Science, College of Agriculture, Isfahan University of Technology, 84156-83111 4 Isfahan, Iran;3. Department of Geology and Soil Science, Ghent University, B9000 Ghent, Belgium;1. Faculty of Agriculture and Natural Resources, Ardakan University, Ardakan, Iran;2. Faculty of Agriculture, University of Kurdistan, Sanandaj, Iran;3. Department of Geography, Brigham Young University, Provo, UT, USA;1. State Key Laboratory of Soil and Sustainable Agriculture, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008, China;2. Alterra, Wageningen University and Research Centre, PO Box 32, 6700 AA Wageningen, The Netherlands |
| |
Abstract: | Soil organic carbon (SOC) plays an important role in soil fertility and carbon sequestration, and a better understanding of the spatial patterns of SOC is essential for soil resource management. In this study, we used boosted regression tree (BRT) and random forest (RF) models to map the distribution of topsoil organic carbon content at the northeastern edge of the Tibetan Plateau in China. A set of 105 soil samples and 12 environmental variables (including topography, climate and vegetation) were analyzed. The performance of the models was evaluated using a 10-fold cross-validation procedure. Maps of the mean values and standard deviations of SOC were generated to illustrate model variability and uncertainty. The results indicate that the BRT and RF models exhibited very similar performance and yielded similar predicted distributions of SOC. The two models explained approximately 70% of the total SOC variability. The BRT and RF models robustly predicted the SOC at low observed SOC values, whereas they underestimated high observed SOC values. This underestimation may have been caused by biased distributions of soil samples in the SOC space. Vegetation-related variables were assigned the highest importance in both models, followed by climate and topography. Both models produced spatial distribution maps of SOC that were closely related to vegetation cover. The SOC content predicted by the BRT model was clearly higher than that of the RF model in areas with greater vegetation cover because the contributions of vegetation-related variables in the two models (65% and 43%, respectively) differed significantly. The predicted SOC content increased from the northwestern to the southeastern part of the study area, average values produced by the BRT and RF models were 27.3 g kg−1 and 26.6 g kg−1, respectively. We conclude that the BRT and RF methods should be calibrated and compared to obtain the best prediction of SOC spatial distribution in similar regions. In addition, vegetation variables, including those obtained from remote sensing imagery, should be taken as the main environmental indicators and explicitly included when generating SOC maps in Alpine environments. |
| |
Keywords: | Digital soil mapping Soil organic carbon Boosted regression tree Random forest Environmental variables Tibetan Plateau |
本文献已被 ScienceDirect 等数据库收录! |
|