Benchmarks that matter for genetic programming - Citegraph

Paper Info

Title
Benchmarks that matter for genetic programming

Abstract
There have been several papers published relating to the practice of benchmarking in machine learning and Genetic Programming (GP) in particular. In addition, GP has been accused of targeting over-simplified 'toy' problems that do not reflect the complexity of real-world applications that GP is ultimately intended. There are also theoretical results that relate the performance of an algorithm with a probability distribution over problem instances, and so the current debate concerning benchmarks spans from the theoretical to the empirical. The aim of this article is to consolidate an emerging theme arising from these papers and suggest that benchmarks should not be arbitrarily selected but should instead be drawn from an underlying probability distribution that reflects the problem instances which the algorithm is likely to be applied to in the real-world. These probability distributions are effectively dictated by the application domains themselves (essentially data-driven) and should thus re-engage the owners of the originating data. A consequence of properly-founded benchmarking leads to the suggestion of meta-learning as a methodology for automatically designing algorithms rather than manually designing algorithms. A secondary motive is to reduce the number of research papers that propose new algorithms but do not state in advance what their purpose is (i.e. in what context should they be applied). To put the current practice of GP benchmarking in a particular harsh light, one might ask what the performance of an algorithm on Koza's lawnmower problem (a favourite toy-problem of the GP community) has to say about its performance on a very real-world cancer data set: the two are completely unrelated.

Year	DOI	Venue
2014	10.1145/2598394.2609875	GECCO (Companion)
Keywords	Field	DocType
function optimization,evolutionary computation,genetic programming,program synthesis,no free lunch theorems,hyper-heuristics,meta-learning,machine learning	Mathematical optimization,Ask price,Computer science,Evolutionary computation,Genetic programming,Motif (music),Function optimization,Probability distribution,Artificial intelligence,Machine learning,Benchmarking	Conference
Citations	PageRank	References
2	0.52	24
Authors
3

Authors (3 rows)

Cited by (2 rows)

References (24 rows)

Name	Order	Citations	PageRank
John R. Woodward	1	274	17.48
Simon Martin	2	2	0.52
Jerry Swan	3	196	19.47

1