Title
Combining feature reduction and case selection in building CBR classifiers
Abstract
CBR systems that are built for the classification problems are called CBR classifiers. This paper presents a novel and fast approach to building efficient and competent CBR classifiers that combines both feature reduction (FR) and case selection (CS). It has three central contributions: 1) it develops a fast rough-set method based on relative attribute dependency among features to compute the approximate reduct, 2) it constructs and compares different case selection methods based on the similarity measure and the concepts of case coverage and case reachability, and 3) CBR classifiers built using a combination of the FR and CS processes can reduce the training burden as well as the need to acquire domain knowledge. The overall experimental results demonstrating on four real-life data sets show that the combined FR and CS method can preserve, and may also improve, the solution accuracy while at the same time substantially reducing the storage space. The case retrieval time is also greatly reduced because the use of CBR classifier contains a smaller amount of cases with fewer features. The developed FR and CS combination method is also compared with the kernel PCA and SVMs techniques. Their storage requirement, classification accuracy, and classification speed are presented and discussed.
Year
DOI
Venue
2006
10.1109/TKDE.2006.40
IEEE Trans. Knowl. Data Eng.
Keywords
Field
DocType
cs method,rough set theory,case selection,different case selection method,cbr system,case-based reasoning,case retrieval time,cs combination method,case reachability,pattern classification,rough sets.,feature reduction,k-nn principle,competent cbr classifier,cbr classifier,combining feature reduction,case coverage,case selection methods,cbr classifiers,building cbr classifiers,storage requirement,rough-set method,kernel pca,case based reasoning,data structures,rough set,case base reasoning,kernel,computer aided software engineering,knowledge based systems,feature extraction,principal component analysis,domain knowledge,rough sets
Data mining,Reduct,Similarity measure,Computer science,Kernel principal component analysis,Artificial intelligence,Classifier (linguistics),Case-based reasoning,Pattern recognition,Domain knowledge,Support vector machine,Rough set,Machine learning
Journal
Volume
Issue
ISSN
18
3
1041-4347
Citations 
PageRank 
References 
43
1.70
31
Authors
3
Name
Order
Citations
PageRank
Yan Li110111.46
Simon Chi Keung Shiu266735.42
Sankar K. Pal36410627.31