Statistical techniques for modeling of Corylus, Alnus, and Betula pollen concentration in the air

Jakub Nowosad, Alfred Stach, Idalia Kasprzyk, Kazimiera Chłopek, Katarzyna Dąbrowska-Zapart, Łukasz Grewling, Małgorzata Latałowa, Anna Pędziszewska, Barbara Majkowska-Wojciechowska, Dorota Myszkowska, Krystyna Piotrowska-Weryszko, Elżbieta Weryszko-Chmielewska, Małgorzata Puc, Piotr Rapiejko, Tomasz Stosik


Prediction of allergenic pollen concentration is one of the most important goals of aerobiology. Past studies have used a broad range of modeling techniques; however, the results cannot be directly compared owing to the use of different datasets, validation methods, and evaluation metrics. The main aim of this study was to compare nine statistical modeling techniques using the same dataset. An additional goal was to assess the importance of predictors for the best model. Aerobiological data for Corylus, Alnus, and Betula pollen counts were obtained from nine cities in Poland and covered between 5 and 16 years of measurements. Meteorological data from the AGRI4CAST project were used as predictor variables. The results of 243 final models (3 taxa × 9 cities × 9 techniques) were validated using repeated k-fold cross-validation and compared using relative and absolute performance statistics. Afterward, the variable importance of predictors in the best models was calculated and compared. Simple models performed poorly. On the other hand, regression trees and rule-based models proved to be the most accurate for all of the taxa. Cumulative growing degree days proved to be the single most important predictor variable in the random forest models of Corylus, Alnus, and Betula. Finally, the study suggested potential improvements in aerobiological modeling, such as the application of robust cross-validation techniques and the use of gridded variables.
Author
Jakub Nowosad - [University of Cincinnati]
Alfred Stach - [Uniwersytet im. Adama Mickiewicza w Poznaniu]
Idalia Kasprzyk - [University of Rzeszów]
Kazimiera Chłopek - [University of Silesia in Katowice]
Katarzyna Dąbrowska-Zapart - [University of Silesia in Katowice]
Łukasz Grewling - [Uniwersytet im. Adama Mickiewicza w Poznaniu]
Małgorzata Latałowa (FB/DPE) - Department of Plant Ecolog
Anna Pędziszewska (FB/DPE) - Department of Plant Ecolog
Barbara Majkowska-Wojciechowska - [Medical University of Lodz]
Dorota Myszkowska - [Uniwersytet Jagielloński Collegium Medicum]
et al.
Journal series: Aerobiologia, ISSN 0393-5965, (A 25 pts)
Issue year: 2018
Publication size in sheets: 0.60
Keywords in English: Allergenic pollen, pollen concentration in the air, Betulaceae, regression models, predictive modeling, machine learning
ASJC Classification: 1110 Plant Science; 2403 Immunology; 2723 Immunology and Allergy
Language: en (English)
Score (nominal): 25
Score source: journalList
Score: Ministerial score = 25.0, 03-07-2020, ArticleFromJournal
Publication indicators: Scopus Citations = 5.000; WoS Citations = 3.000; Scopus SNIP (Source Normalised Impact per Paper): 2018 = 0.998; WoS Impact Factor: 2018 = 1.931 (2) - 2018 = 2.155 (5)
Citation count*: 4 (2020-07-10)

* The presented citation count is obtained through Internet information analysis and is close to the number calculated by the Publish or Perish system.