Title
Benchmarks that matter for genetic programming
Abstract
There have been several papers published relating to the practice of benchmarking in machine learning and Genetic Programming (GP) in particular. In addition, GP has been accused of targeting over-simplified 'toy' problems that do not reflect the complexity of real-world applications that GP is ultimately intended. There are also theoretical results that relate the performance of an algorithm with a probability distribution over problem instances, and so the current debate concerning benchmarks spans from the theoretical to the empirical. The aim of this article is to consolidate an emerging theme arising from these papers and suggest that benchmarks should not be arbitrarily selected but should instead be drawn from an underlying probability distribution that reflects the problem instances which the algorithm is likely to be applied to in the real-world. These probability distributions are effectively dictated by the application domains themselves (essentially data-driven) and should thus re-engage the owners of the originating data. A consequence of properly-founded benchmarking leads to the suggestion of meta-learning as a methodology for automatically designing algorithms rather than manually designing algorithms. A secondary motive is to reduce the number of research papers that propose new algorithms but do not state in advance what their purpose is (i.e. in what context should they be applied). To put the current practice of GP benchmarking in a particular harsh light, one might ask what the performance of an algorithm on Koza's lawnmower problem (a favourite toy-problem of the GP community) has to say about its performance on a very real-world cancer data set: the two are completely unrelated.
Year
DOI
Venue
2014
10.1145/2598394.2609875
GECCO (Companion)
Keywords
Field
DocType
function optimization,evolutionary computation,genetic programming,program synthesis,no free lunch theorems,hyper-heuristics,meta-learning,machine learning
Mathematical optimization,Ask price,Computer science,Evolutionary computation,Genetic programming,Motif (music),Function optimization,Probability distribution,Artificial intelligence,Machine learning,Benchmarking
Conference
Citations 
PageRank 
References 
2
0.52
24
Authors
3
Name
Order
Citations
PageRank
John R. Woodward127417.48
Simon Martin220.52
Jerry Swan319619.47