Title
A comparison of fitness-case sampling methods for genetic programming.
Abstract
Genetic programming (GP) is an evolutionary computation paradigm for automatic program induction. GP has produced impressive results but it still needs to overcome some practical limitations, particularly its high computational cost, overfitting and excessive code growth. Recently, many researchers have proposed fitness-case sampling methods to overcome some of these problems, with mixed results in several limited tests. This paper presents an extensive comparative study of four fitness-case sampling methods, namely: Interleaved Sampling, Random Interleaved Sampling, Lexicase Selection and Keep-Worst Interleaved Sampling. The algorithms are compared on 11 symbolic regression problems and 11 supervised classification problems, using 10 synthetic benchmarks and 12 real-world data-sets. They are evaluated based on test performance, overfitting and average program size, comparing them with a standard GP search. Comparisons are carried out using non-parametric multigroup tests and post hoc pairwise statistical tests. The experimental results suggest that fitness-case sampling methods are particularly useful for difficult real-world symbolic regression problems, improving performance, reducing overfitting and limiting code growth. On the other hand, it seems that fitness-case sampling cannot improve upon GP performance when considering supervised binary classification.
Year
DOI
Venue
2017
10.1080/0952813X.2017.1328461
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE
Keywords
Field
DocType
Genetic programming,fitness-case sampling,performance evaluation
Pairwise comparison,Computer science,Evolutionary computation,Genetic programming,Sampling (statistics),Artificial intelligence,Overfitting,Symbolic regression,Machine learning,Code growth
Journal
Volume
Issue
ISSN
29.0
6
0952-813X
Citations 
PageRank 
References 
2
0.39
10
Authors
5
Name
Order
Citations
PageRank
Yuliana Martínez1425.70
Enrique Naredo2495.55
Leonardo Trujillo344438.12
Pierrick Legrand49016.20
Uriel López531.46