Title
The Effect of Missing Data on Learning Classifier System Learning Rate and Classification Performance
Abstract
Missing data pose a potential threat to learning and classification in that they may compromise the ability of a system to develop robust, generalized models of the environment in which they operate. This investigation reports on the effects of the various types of missing data, present in varying densities in a group of simulated datasets, on learning classifier system performance. It was found that missing data have an adverse effect on learning classifier system (LCS) learning and classification performance, the latter of which is not seen in See5, a robust decision tree inducer. Specific adverse effects include decreased learning rate, decreased accuracy of classification of novel data on testing, increased proportions of testing cases that cannot be classified, and increased variability in these metrics. In addition, the effects are correlated with the density of missing values in a dataset, as well as the type of missing data, whether it is random and ignorable, or systematically missing and therefore non-ignorable.
Year
DOI
Venue
2002
10.1007/978-3-540-40029-5_4
Lecture Notes in Artificial Intelligence
Keywords
Field
DocType
decision tree,learning classifier system,missing data,missing values,adverse effect
Margin (machine learning),Semi-supervised learning,Instance-based learning,Stability (learning theory),Pattern recognition,Computer science,Unsupervised learning,Artificial intelligence,Missing data,Machine learning,Learning classifier system,Quadratic classifier
Conference
Volume
ISSN
Citations 
2661
0302-9743
6
PageRank 
References 
Authors
0.53
13
2
Name
Order
Citations
PageRank
John H. Holmes1787.55
Warren B. Bilker2182.51