Title
Analysing the Footprint of Classifiers in Overlapped and Imbalanced Contexts.
Abstract
It is recognised that the imbalanced data problem is aggravated by other difficulty factors, such as class overlap. Over the years, several research works have focused on this problematic, although presenting two major hitches: the limitation of test domains and the lack of a formulation of the overlap degree, which makes results hard to generalise. This work studies the performance degradation of classifiers with distinct learning biases in overlap and imbalanced contexts, focusing on the characteristics of the test domains (shape, dimensionality and imbalance ratio) and on to what extent our proposed overlapping measure (degOver) is aligned with the performance results observed. Our results show that MLP and CART classifiers are the most robust to high levels of class overlap, even for complex domains, and that KNN and linear SVM are the most aligned with degOver. Furthermore, we found that the dimensionality of data also plays an important role in explaining performance results.
Year
Venue
Field
2018
IDA
Pattern recognition,Computer science,Cart,Curse of dimensionality,Footprint,Artificial intelligence,Machine learning,Linear svm
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
9
6
Name
Order
Citations
PageRank
Marta Mercier100.34
Miriam Seoane Santos2315.28
Pedro Abreu39219.58
Carlos Soares49518.18
Jastin Pompeu Soares5101.85
João Santos6204.80