

Journal of Experimental & Theoretical Artificial Intelligence Volume 29, 2017 - Issue 6

Original Articles

A comparison of fitness-case sampling methods for genetic programming

Yuliana Martínez
Tree-Lab, Posgrado en Ciencias de la Ingeniería, Departamento de Ingeniería Eléctrica y Electrónica, Instituto Tecnológico de Tijuana, Calz. del Tecnológico S/N, Tomás Aquino, Tijuana, Mexico

Enrique Naredo
Tree-Lab, Posgrado en Ciencias de la Ingeniería, Departamento de Ingeniería Eléctrica y Electrónica, Instituto Tecnológico de Tijuana, Calz. del Tecnológico S/N, Tomás Aquino, Tijuana, Mexico
Laboratorio Nacional de GeoInteligencia, Centro de Investigación en Geografía y Geomática, Ing. Jorge L. Tamayo A.C. (Centro GEO), Aguascalientes, Aguascalientes, Mexico

Leonardo Trujillo
Tree-Lab, Posgrado en Ciencias de la Ingeniería, Departamento de Ingeniería Eléctrica y Electrónica, Instituto Tecnológico de Tijuana, Calz. del Tecnológico S/N, Tomás Aquino, Tijuana, Mexico
Correspondence: [email protected]
ORCID: http://orcid.org/0000-0003-1812-5736

Pierrick Legrand
IMB, UMR CNRS 5251, 351, cours de la Libération, Talence, France
Inria Bordeaux Sud-Ouest, Talence, France
University of Bordeaux, Bordeaux, France

Uriel López
Tree-Lab, Posgrado en Ciencias de la Ingeniería, Departamento de Ingeniería Eléctrica y Electrónica, Instituto Tecnológico de Tijuana, Calz. del Tecnológico S/N, Tomás Aquino, Tijuana, Mexico

Pages 1203-1224 | Received 06 May 2015, Accepted 30 Apr 2017, Published online: 27 May 2017

https://doi.org/10.1080/0952813X.2017.1328461


Abstract

Genetic programming (GP) is an evolutionary computation paradigm for automatic program induction. GP has produced impressive results but it still needs to overcome some practical limitations, particularly its high computational cost, overfitting and excessive code growth. Recently, many researchers have proposed fitness-case sampling methods to overcome some of these problems, with mixed results in several limited tests. This paper presents an extensive comparative study of four fitness-case sampling methods, namely: Interleaved Sampling, Random Interleaved Sampling, Lexicase Selection and Keep-Worst Interleaved Sampling. The algorithms are compared on 11 symbolic regression problems and 11 supervised classification problems, using 10 synthetic benchmarks and 12 real-world data-sets. They are evaluated based on test performance, overfitting and average program size, comparing them with a standard GP search. Comparisons are carried out using non-parametric multigroup tests and post hoc pairwise statistical tests. The experimental results suggest that fitness-case sampling methods are particularly useful for difficult real-world symbolic regression problems, improving performance, reducing overfitting and limiting code growth. On the other hand, it seems that fitness-case sampling cannot improve upon GP performance when considering supervised binary classification.
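Of the four methods named above, Lexicase Selection is the easiest to illustrate in a few lines. The sketch below follows the commonly described form of the algorithm and is not taken from this article: fitness cases are visited in random order, and at each case only the candidates with the lowest error on that case survive. The function name `lexicase_select` and the layout of the `errors` table are assumptions made for illustration.

```python
import random

def lexicase_select(population, errors, rng=random):
    """Select one parent by lexicase selection (illustrative sketch).

    errors[i][j] is assumed to hold the error of individual i on
    fitness case j; lower is better.
    """
    candidates = list(range(len(population)))
    cases = list(range(len(errors[0])))
    rng.shuffle(cases)  # each selection event uses a fresh random case order
    for case in cases:
        # Keep only the candidates that are best on the current case.
        best = min(errors[i][case] for i in candidates)
        candidates = [i for i in candidates if errors[i][case] == best]
        if len(candidates) == 1:
            break
    # Any remaining ties are broken at random.
    return population[rng.choice(candidates)]

# Hypothetical data: "c" is never best on any case, so it is never selected.
population = ["a", "b", "c"]
errors = [[0, 1], [1, 0], [1, 1]]
parent = lexicase_select(population, errors, random.Random(0))
```

Because the case order changes at every selection event, individuals that excel on different fitness cases can all be chosen as parents, which is the property usually cited as the motivation for this method.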

Keywords:

Genetic programming
fitness-case sampling
performance evaluation

Notes

No potential conflict of interest was reported by the authors.

1 Fitness-case sampling refers to selecting a subset of the fitness-cases, which is what most of the methods presented in this study do, although a method may also consider the entire training set.
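The idea in note 1 can be sketched as follows, assuming a symbolic regression setting where each fitness-case is an `(input, target)` pair. The helper names `sample_fitness_cases` and `mean_abs_error`, the uniform random choice of the subset, and the mean absolute error measure are illustrative assumptions, not the procedure used by any specific method in the article.

```python
import random

def sample_fitness_cases(cases, k, rng=random):
    """Return k fitness-cases drawn uniformly without replacement.

    Passing k == len(cases) uses the entire training set.
    """
    return rng.sample(cases, k)

def mean_abs_error(program, cases):
    """Mean absolute error of a candidate program on the given cases."""
    return sum(abs(program(x) - y) for x, y in cases) / len(cases)

# Hypothetical training set for the target y = 2x + 1.
training_set = [(x, 2 * x + 1) for x in range(10)]
candidate = lambda x: 2 * x  # a candidate that is off by 1 on every case

subset = sample_fitness_cases(training_set, 4, random.Random(0))
fitness = mean_abs_error(candidate, subset)  # cost scales with the subset size
```

Evaluating on a subset lowers the cost of each fitness evaluation in proportion to the number of cases dropped, at the price of a noisier fitness estimate.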

Additional information

Funding

The first, second and fifth authors were supported by CONACYT (México) scholarships [grant numbers 226981, 232288 and 573397, respectively]. Funding for this work was provided by CONACYT Basic Science Research Project number [178323], DGEST (México) Research Project [5414.14-P], FP7-PEOPLE-2013-IRSES project ACOBSEC financed by the European Commission with contract number [612689] and CONACYT Project [FC-2015-2/944] “Aprendizaje evolutivo a gran escala”.
