Implementing gene expression programming in the parallel environment for big datasets’ classification
Joanna Jędrzejowicz, Piotr Jędrzejowicz, Izabela Wierzbowska
Abstract: The paper investigates a Gene Expression Programming (GEP)-based ensemble classifier constructed using the stacked generalization concept. The classifier has been implemented with a view to enabling parallel processing using Spark and SWIM, an open-source genetic programming library. The classifier has been validated in computational experiments carried out on benchmark data sets. How some settings influence the results has also been investigated. The paper extends a previous paper by the authors.
|Journal series||Vietnam Journal of Computer Science, ISSN 2196-8888, e-ISSN 2196-8896, (0 points)|
|Publication size in sheets||0.6|
|Keywords in English||gene expression, classification, big data|
|License||Journal (articles only); published final; ; with publication|
|Score||= 5.0, 24-07-2019, ArticleFromJournal|