Learning from imbalanced data in presence of noisy and borderline examples - Citegraph

Paper Info

Title
Learning from imbalanced data in presence of noisy and borderline examples

Abstract
In this paper we studied re-sampling methods for learning classifiers from imbalanced data. We carried out a series of experiments on artificial data sets to explore the impact of noisy and borderline examples from the minority class on the classifier performance. Results showed that if data was sufficiently disturbed by these factors, then the focused re-sampling methods - NCR and our SPIDER2 - strongly outperformed the oversampling methods. They were also better for real-life data, where PCA visualizations suggested possible existence of noisy examples and large overlapping ares between classes.

Year	DOI	Venue
2010	10.1007/978-3-642-13529-3_18	RSCTC
Keywords	Field	DocType
oversampling method,minority class,real-life data,pca visualization,noisy example,artificial data,large overlapping are,imbalanced data,classifier performance,borderline example,sampling methods	Data set,Oversampling,Pattern recognition,Computer science,Artificial intelligence,Classifier (linguistics),Machine learning	Conference
Volume	ISSN	ISBN
6086	0302-9743	3-642-13528-5
Citations	PageRank	References
59	1.60	6
Authors
3

Authors (3 rows)

Cited by (59 rows)

References (6 rows)

Name	Order	Citations	PageRank
Krystyna Napierała	1	71	3.24
Jerzy Stefanowski	2	1653	139.25
Szymon Wilk	3	461	40.94

1