Title
A Cost-Sensitive Learning Strategy for Feature Extraction from Imbalanced Data.
Abstract
In this paper, novel cost-sensitive principal component analysis (CSPCA) and cost-sensitive non-negative matrix factorization (CSNMF) methods are proposed for handling the problem of feature extraction from imbalanced data. The presence of highly imbalanced data misleads existing feature extraction techniques to produce biased features, which results in poor classification performance especially for the minor class problem. To solve this problem, we propose a cost-sensitive learning strategy for feature extraction techniques that uses the imbalance ratio of classes to discount the majority samples. This strategy is adapted to the popular feature extraction methods such as PCA and NMF. The main advantage of the proposed methods is that they are able to lessen the inherent bias of the extracted features to the majority class in existing PCA and NMF algorithms. Experiments on twelve public datasets with different levels of imbalance ratios show that the proposed methods outperformed the state-of-the-art methods on multiple classifiers.
Year
DOI
Venue
2016
10.1007/978-3-319-46675-0_9
Lecture Notes in Computer Science
Field
DocType
Volume
Pattern recognition,Computer science,Matrix decomposition,Feature extraction,Non-negative matrix factorization,Artificial intelligence,Principal component analysis,Machine learning
Conference
9949
ISSN
Citations 
PageRank 
0302-9743
2
0.39
References 
Authors
8
3
Name
Order
Citations
PageRank
Ali Braytee1115.25
Wei Liu246837.36
paul j kennedy3254.74