Title
Prediction of expected performance for a genetic programming classifier.
Abstract
The estimation of problem difficulty is an open issue in genetic programming (GP). The goal of this work is to generate models that predict the expected performance of a GP-based classifier when it is applied to an unseen task. Classification problems are described using domain-specific features, some of which are proposed in this work, and these features are given as input to the predictive models. These models are referred to as predictors of expected performance. We extend this approach by using an ensemble of specialized predictors (SPEP), dividing classification problems into groups and choosing the corresponding SPEP. The proposed predictors are trained using 2D synthetic classification problems with balanced datasets. The models are then used to predict the performance of the GP classifier on unseen real-world datasets that are multidimensional and imbalanced. This work is the first to provide a performance prediction of a GP system on test data, while previous works focused on predicting training performance. Accurate predictive models are generated by posing a symbolic regression task and solving it with GP. These results are achieved by using highly descriptive features and including a dimensionality reduction stage that simplifies the learning and testing process. The proposed approach could be extended to other classification algorithms and used as the basis of an expert system for algorithm selection.
Year
DOI
Venue
2016
10.1007/s10710-016-9265-9
Genetic Programming and Evolvable Machines
Keywords
Field
DocType
Problem difficulty,Prediction of expected performance,Genetic programming,Supervised learning
Data mining,Dimensionality reduction,Computer science,Expert system,Genetic programming,Supervised learning,Artificial intelligence,Test data,Statistical classification,Classifier (linguistics),Symbolic regression,Machine learning
Journal
Volume
Issue
ISSN
17
4
1389-2576
Citations 
PageRank 
References 
1
0.35
48
Authors
4
Name
Order
Citations
PageRank
Yuliana Martínez1425.70
Leonardo Trujillo244438.12
Pierrick Legrand39016.20
Edgar Galván López4818.87