Implementing gene expression programming in the parallel environment for big datasets’ classification

Joanna Jędrzejowicz, Piotr Jędrzejowicz, Izabela Wierzbowska

Abstract

The paper investigates a Gene Expression Programming (GEP)-based ensemble classifier constructed using the stacked generalization concept. The classifier has been implemented with a view to enabling parallel processing with the use of Spark and SWIM, an open source genetic programming library. The classifier has been validated in computational experiments carried out on benchmark data sets. The influence of selected settings on the results has also been investigated. The paper extends an earlier paper by the authors.
Author: Joanna Jędrzejowicz (FMPI / II, Institute of Informatics), Piotr Jędrzejowicz, Izabela Wierzbowska
Journal series: Vietnam Journal of Computer Science, ISSN 2196-8888, e-ISSN 2196-8896, (0 points)
Issue year: 2019
Vol: 6
No: 2
Pages: 163-175
Publication size in sheets: 0.6
Keywords in English: gene expression, classification, big data
DOI: 10.1142/S2196888819500118
URL: https://www.worldscientific.com/doi/pdf/10.1142/S2196888819500118
Language: en (English)
License: Journal (articles only); published final; Attribution (CC-BY); with publication
Score (nominal): 5
Score: Ministerial score = 5.0, 24-07-2019, ArticleFromJournal