Abstract | ||
---|---|---|
In this paper we propose a new algorithm, SPIDER3, for selective preprocessing of multi-class imbalanced data sets. While it borrows selected ideas (i.e., the combination of relabeling and local resampling) from its predecessor, SPIDER2, it introduces several important extensions. Unlike SPIDER2, it can handle multi-class problems directly. Moreover, it considers the relevance of specific decision classes to control the order in which they are processed. Finally, it uses information about relations between specific classes (modeled with misclassification costs) to better control the extent of changes introduced locally to the preprocessed data. We performed a computational experiment on artificial 3-class data sets to compare SPIDER3 with SPIDER2 applied to temporarily aggregated classes, and the results confirmed the advantages of the new algorithm. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1007/978-3-319-59162-9_25 | Proceedings of the 10th International Conference on Computer Recognition Systems CORES 2017 |
Field | DocType | Volume |
Data set,Pattern recognition,Computer science,Algorithm,Preprocessor,Artificial intelligence,Resampling | Conference | 578 |
ISSN | Citations | PageRank |
2194-5357 | 0 | 0.34 |
References | Authors |
7 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Szymon Wojciechowski | 1 | 2 | 1.06 |
Szymon Wilk | 2 | 461 | 40.94 |
Jerzy Stefanowski | 3 | 1653 | 139.25 |