Title
Natural Encoding for Evolutionary Supervised Learning
Abstract
Some of the most influential factors in the quality of the solutions found by an evolutionary algorithm (EA) are a correct coding of the search space and an appropriate evaluation function of the potential solutions. EAs are often used to learn decision rules from datasets, which are encoded as individuals in the genetic population. In this paper, the coding of the search space for the obtaining of those decision rules is approached, i.e., the representation of the individuals of the genetic population and also the design of specific genetic operators. Our approach, called "natural coding," uses one gene per feature in the dataset (continuous or discrete). The examples from the datasets are also encoded into the search space, where the genetic population evolves, and therefore the evaluation process is improved substantially. Genetic operators for the natural coding are formally defined as algebraic expressions. Experiments with several datasets from the University of California at Irvine (UCI) machine learning repository show that as the genetic operators are better guided through the search space, the number of rules decreases considerably while maintaining the accuracy, similar to that of hybrid coding, which joins the well-known binary and real representations to encode discrete and continuous attributes, respectively. The computational cost associated with the natural coding is also reduced with regard to the hybrid representation. Our algorithm, HlDER*, has been statistically tested against C4.5 and C4.5 Rules, and performed well. The knowledge models obtained are simpler, with very few decision rules, and therefore easier to understand, which is an advantage in many domains. The experiments with high-dimensional datasets showed the same good behavior, maintaining the quality of the knowledge model with respect to prediction accuracy.
Year
DOI
Venue
2007
10.1109/TEVC.2006.883466
Evolutionary Computation, IEEE Transactions
Keywords
Field
DocType
genetic algorithms,knowledge representation,learning (artificial intelligence),search problems,HlDER* algorithm,algebraic expressions,continuous attributes,decision rule learning,discrete attributes,evolutionary algorithm,evolutionary supervised learning,genetic operators,genetic population,machine learning,natural coding,natural encoding,search space coding,Decision rules,evolutionary encoding,supervised learning
Decision rule,Knowledge representation and reasoning,Mathematical optimization,Evolutionary algorithm,Evaluation function,Coding (social sciences),Supervised learning,Artificial intelligence,Genetic representation,Genetic algorithm,Mathematics,Machine learning
Journal
Volume
Issue
ISSN
11
4
1089-778X
Citations 
PageRank 
References 
26
1.54
26
Authors
3
Name
Order
Citations
PageRank
Aguilar-Ruiz, J.S.1323.04
Raúl Giráldez210510.26
J. C. Riquelme323914.01