Title | ||
---|---|---|
The Effect of Missing Data on Learning Classifier System Learning Rate and Classification Performance |
Abstract | ||
---|---|---|
Missing data pose a potential threat to learning and classification in that they may compromise the ability of a system to develop robust, generalized models of the environment in which they operate. This investigation reports on the effects of the various types of missing data, present in varying densities in a group of simulated datasets, on learning classifier system performance. It was found that missing data have an adverse effect on learning classifier system (LCS) learning and classification performance, the latter of which is not seen in See5, a robust decision tree inducer. Specific adverse effects include decreased learning rate, decreased accuracy of classification of novel data on testing, increased proportions of testing cases that cannot be classified, and increased variability in these metrics. In addition, the effects are correlated with the density of missing values in a dataset, as well as the type of missing data, whether it is random and ignorable, or systematically missing and therefore non-ignorable. |
Year | DOI | Venue |
---|---|---|
2002 | 10.1007/978-3-540-40029-5_4 | Lecture Notes in Artificial Intelligence |
Keywords | Field | DocType |
decision tree,learning classifier system,missing data,missing values,adverse effect | Margin (machine learning),Semi-supervised learning,Instance-based learning,Stability (learning theory),Pattern recognition,Computer science,Unsupervised learning,Artificial intelligence,Missing data,Machine learning,Learning classifier system,Quadratic classifier | Conference |
Volume | ISSN | Citations |
2661 | 0302-9743 | 6 |
PageRank | References | Authors |
0.53 | 13 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
John H. Holmes | 1 | 78 | 7.55 |
Warren B. Bilker | 2 | 18 | 2.51 |