Title
Random Space Division Sampling for Label-Noisy Classification or Imbalanced Classification
Abstract
This article presents a simple sampling method, which is very easy to be implemented, for classification by introducing the idea of random space division, called “random space division sampling” (RSDS). It can extract the boundary points as the sampled result by efficiently distinguishing the label noise points, inner points, and boundary points. This makes it the first general sampling method for classification that not only can reduce the data size but also enhance the classification accuracy of a classifier, especially in the label-noisy classification. The “general” means that it is not restricted to any specific classifiers or datasets (regardless of whether a dataset is linear or not). Furthermore, the RSDS can online accelerate most classifiers because of its lower time complexity than most classifiers. Moreover, the RSDS can be used as an undersampling method for imbalanced classification. The experimental results on benchmark datasets demonstrate its effectiveness and efficiency. The code of the RSDS and comparison algorithms is available at: <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/syxiaa/RSDS</uri> .
Year
DOI
Venue
2022
10.1109/TCYB.2021.3070005
IEEE Transactions on Cybernetics
Keywords
DocType
Volume
Algorithms
Journal
52
Issue
ISSN
Citations 
10
2168-2267
0
PageRank 
References 
Authors
0.34
26
6
Name
Order
Citations
PageRank
Shuyin Xia183.52
Yong Zheng200.34
Guoyin Wang32144202.16
Ping He400.68
Heng Li512138.18
Zizhong Chen692469.93