Implementing gene expression programming in the parallel environment for big datasets’ classification
Joanna Jędrzejowicz , Piotr Jędrzejowicz , Izabela Wierzbowska
Abstract
The paper investigates a Gene Expression Programming (GEP)-based ensemble classifier constructed using the stacked generalization concept. The classifier has been implemented with a view to enabling parallel processing with the use of Spark and SWIM, an open-source genetic programming library. The classifier has been validated in computational experiments carried out on benchmark data sets. The paper also investigates how the results are influenced by some settings. It is an extension of a previous paper by the authors.
Author | |
Journal series | Vietnam Journal of Computer Science, ISSN 2196-8888, e-ISSN 2196-8896, (0 points) |
Issue year | 2019 |
Vol | 6 |
No | 2 |
Pages | 163-175 |
Publication size in sheets | 0.6 |
Keywords in English | gene expression, classification, big data |
DOI | DOI:10.1142/S2196888819500118 |
URL | https://www.worldscientific.com/doi/pdf/10.1142/S2196888819500118 |
Language | en (English) |
License | Journal (articles only); published final; with publication |
Score (nominal) | 5 |
Score source | journalList |
Score | = 5.0, 17-11-2019, ArticleFromJournal |
Citation count* | 1 (2019-12-07) |
* presented citation count is obtained through Internet information analysis and it is close to the number calculated by the Publish or Perish system.