162
Views
8
CrossRef citations to date
0
Altmetric
Original Articles

A comparison of fitness-case sampling methods for genetic programming

, , ORCID Icon, &
Pages 1203-1224 | Received 06 May 2015, Accepted 30 Apr 2017, Published online: 27 May 2017
 

Abstract

Genetic programming (GP) is an evolutionary computation paradigm for automatic program induction. GP has produced impressive results but it still needs to overcome some practical limitations, particularly its high computational cost, overfitting and excessive code growth. Recently, many researchers have proposed fitness-case sampling methods to overcome some of these problems, with mixed results in several limited tests. This paper presents an extensive comparative study of four fitness-case sampling methods, namely: Interleaved Sampling, Random Interleaved Sampling, Lexicase Selection and Keep-Worst Interleaved Sampling. The algorithms are compared on 11 symbolic regression problems and 11 supervised classification problems, using 10 synthetic benchmarks and 12 real-world data-sets. They are evaluated based on test performance, overfitting and average program size, comparing them with a standard GP search. Comparisons are carried out using non-parametric multigroup tests and post hoc pairwise statistical tests. The experimental results suggest that fitness-case sampling methods are particularly useful for difficult real-world symbolic regression problems, improving performance, reducing overfitting and limiting code growth. On the other hand, it seems that fitness-case sampling cannot improve upon GP performance when considering supervised binary classification.

Notes

No potential conflict of interest was reported by the authors.

1 Fitness-case sampling refers to the selection of a subset of fitness-cases, which is mostly the case for the methods presented in this study. Though, there is the possibility to consider the entire training set.

Additional information

Funding

First, second and fifth authors were supported by CONACYT (México) scholarships, respectively, [grant number 226981], [grant number 232288], [grant number 573397]. Funding for this work was provided by CONACYT Basic Science Research Project number [178323], DGEST (México) Research Project [5414.14-P], FP7-PEOPLE-2013-IRSES project ACOBSEC financed by the European Commission with contract number [612689] and CONACYT Project [FC-2015-2/944] “Aprendizaje evolutivo a gran escala".

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 373.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.