首页 | 本学科首页   官方微博 | 高级检索  
     


Statistical techniques for modeling of Corylus, Alnus, and Betula pollen concentration in the air
Authors:Jakub Nowosad  Alfred Stach  Idalia Kasprzyk  Kazimiera Chłopek  Katarzyna Dąbrowska-Zapart  Łukasz Grewling  Małgorzata Latałowa  Anna Pędziszewska  Barbara Majkowska-Wojciechowska  Dorota Myszkowska  Krystyna Piotrowska-Weryszko  Elżbieta Weryszko-Chmielewska  Małgorzata Puc  Piotr Rapiejko  Tomasz Stosik
Affiliation:1.Space Informatics Lab,University of Cincinnati,Cincinnati,USA;2.Institute of Geoecology and Geoinformation,Adam Mickiewicz University,Poznań,Poland;3.Department of Ecology and Environmental Biology,University of Rzeszów,Rzeszów,Poland;4.Faculty of Earth Sciences,University of Silesia,Sosnowiec,Poland;5.Laboratory of Aeropalynology, Faculty of Biology,Adam Mickiewicz University,Poznań,Poland;6.Department of Plant Ecology,University of Gdańsk,Gdańsk,Poland;7.Department of Immunology, Rheumatology and Allergy, Faculty of Medicine,Medical University,?ód?,Poland;8.Department of Clinical and Environmental Allergology,Jagiellonian University Medical College,Kraków,Poland;9.Department of Botany,University of Life Sciences in Lublin,Lublin,Poland;10.Department of Botany and Nature Conservation, Faculty of Biology,University of Szczecin,Szczecin,Poland;11.Allergen Research Center,Warsaw,Poland;12.Department of Botany and Ecology,University of Science and Technology,Bydgoszcz,Poland
Abstract:Prediction of allergic pollen concentration is one of the most important goals of aerobiology. Past studies have used a broad range of modeling techniques; however, the results cannot be directly compared owing to the use of different datasets, validation methods, and evaluation metrics. The main aim of this study was to compare nine statistical modeling techniques using the same dataset. An additional goal was to assess the importance of predictors for the best model. Aerobiological data for Corylus, Alnus, and Betula pollen counts were obtained from nine cities in Poland and covered between five and 16 years of measurements. Meteorological data from the AGRI4CAST project were used as a predictor variables. The results of 243 final models (3 taxa (times)  9 cities (times) 9 techniques) were validated using a repeated k-fold cross-validation and compared using relative and absolute performance statistics. Afterward, the variable importance of predictors in the best models was calculated and compared. Simple models performed poorly. On the other hand, regression trees and rule-based models proved to be the most accurate for all of the taxa. Cumulative growing degree days proved to be the single most important predictor variable in the random forest models of Corylus, Alnus, and Betula. Finally, the study suggested potential improvements in aerobiological modeling, such as the application of robust cross-validation techniques and the use of gridded variables.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号