Title
Ensemble Bayesian Model Averaging in Genetic Programming
Abstract
This paper considers the general problem of function estimation via Genetic Programming (GP). Data analysts typically select a model from a population of models, and then proceed as if the selected model had generated the data. This approach ignores the uncertainty in model selection, leading to over-confident inferences and lack of generalisation. We adopt a coherent method for accounting for this uncertainty through a weighted averaging of all models competing in a population of GP. It is a principled statistical method for postprocessing a population of programs into an ensemble, which is based on Bayesian Model Averaging (BMA). Under two different formulations of BMA, the predictive probability density function (PDF) of a response variable is a weighted average of PDFs centered around the individual predictions of component models that take the form of either standalone programs or ensembles of programs. The weights are equal to the posterior probabilities of the models generating the predictions, and reflect the models' skill on the training dataset. The method was applied to a number of synthetic symbolic regression problems, and results demonstrate that it generalises better than standard methods for model selection, as well as methods for ensemble construction in GP.
Year
DOI
Venue
2014
10.1109/CEC.2014.6900567
IEEE Congress on Evolutionary Computation
Keywords
Field
DocType
function estimation problem,posterior probabilities,model selection,bayes methods,inference mechanisms,response variable,genetic programming,regression analysis,lack-of-generalisation,pdf,ensemble bayesian model averaging,predictive probability density function,genetic algorithms,over-confident inferences,statistical method,synthetic symbolic regression problems,bma,sociology,training data,probability density function,data models,statistics,predictive models
Population,Bayesian inference,Computer science,Bayesian linear regression,Model selection,Posterior probability,Genetic programming,Artificial intelligence,Bayesian statistics,Symbolic regression,Machine learning
Conference
Citations 
PageRank 
References 
1
0.35
16
Authors
3
Name
Order
Citations
PageRank
Alexandros Agapitos121122.88
Michael O'Neill287669.58
Anthony Brabazon391898.60