High dimensional classifiers in the imbalanced case. - Citegraph

Paper Info

Title
High dimensional classifiers in the imbalanced case.

Abstract
A binary classification problem is imbalanced when the number of samples from the two groups differs. For the high dimensional case, where the number of variables is much larger than the number of samples, imbalance leads to a bias in the classification. The independence classifier is studied theoretically and based on the analysis two new classifiers are suggested that can handle any imbalance ratio. The analytical results are supplemented by a simulation study, where the suggested classifiers in some aspects outperform multiple undersampling. For correlated data the ROAD classifier is considered and a suggestion is given for how to modify the classifier to handle the bias from imbalanced group sizes.

Year	DOI	Venue
2016	10.1016/j.csda.2015.12.009	Computational Statistics & Data Analysis
Keywords	Field	DocType
High dimension,Imbalance,Classification	Binary classification,Pattern recognition,Undersampling,Artificial intelligence,Classifier (linguistics),Machine learning,Mathematics	Journal
Volume	Issue	ISSN
98	C	0167-9473
Citations	PageRank	References
1	0.35	5
Authors
3

Authors (3 rows)

Cited by (1 rows)

References (5 rows)

Name	Order	Citations	PageRank
Britta Anker Bak	1	1	0.35
Jens Ledet Jensen	2	65	5.43
BakBritta Anker	3	1	0.35

1