Abstract | ||
---|---|---|
CBR systems that are built for the classification problems are called CBR classifiers. This paper presents a novel and fast approach to building efficient and competent CBR classifiers that combines both feature reduction (FR) and case selection (CS). It has three central contributions: 1) it develops a fast rough-set method based on relative attribute dependency among features to compute the approximate reduct, 2) it constructs and compares different case selection methods based on the similarity measure and the concepts of case coverage and case reachability, and 3) CBR classifiers built using a combination of the FR and CS processes can reduce the training burden as well as the need to acquire domain knowledge. The overall experimental results demonstrating on four real-life data sets show that the combined FR and CS method can preserve, and may also improve, the solution accuracy while at the same time substantially reducing the storage space. The case retrieval time is also greatly reduced because the use of CBR classifier contains a smaller amount of cases with fewer features. The developed FR and CS combination method is also compared with the kernel PCA and SVMs techniques. Their storage requirement, classification accuracy, and classification speed are presented and discussed. |
Year | DOI | Venue |
---|---|---|
2006 | 10.1109/TKDE.2006.40 | IEEE Trans. Knowl. Data Eng. |
Keywords | Field | DocType |
cs method,rough set theory,case selection,different case selection method,cbr system,case-based reasoning,case retrieval time,cs combination method,case reachability,pattern classification,rough sets.,feature reduction,k-nn principle,competent cbr classifier,cbr classifier,combining feature reduction,case coverage,case selection methods,cbr classifiers,building cbr classifiers,storage requirement,rough-set method,kernel pca,case based reasoning,data structures,rough set,case base reasoning,kernel,computer aided software engineering,knowledge based systems,feature extraction,principal component analysis,domain knowledge,rough sets | Data mining,Reduct,Similarity measure,Computer science,Kernel principal component analysis,Artificial intelligence,Classifier (linguistics),Case-based reasoning,Pattern recognition,Domain knowledge,Support vector machine,Rough set,Machine learning | Journal |
Volume | Issue | ISSN |
18 | 3 | 1041-4347 |
Citations | PageRank | References |
43 | 1.70 | 31 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yan Li | 1 | 101 | 11.46 |
Simon Chi Keung Shiu | 2 | 667 | 35.42 |
Sankar K. Pal | 3 | 6410 | 627.31 |