Title
Overfitting in linear feature extraction for classification of high-dimensional image data.
Abstract
Overfitting has been widely studied in the context of classification and regression. In this paper, we study the overfitting in the context of dimensionality reduction. We show that the conventional wisdom of improving classification performance by maximising inter-class discrimination is not valid for high-dimensional datasets, and can lead to severe overfitting. In particular, we prove the theoretical existence of perfectly discriminative subspace projections, and show that for datasets with very high input dimensionality, inter-class discrimination should be reduced rather than maximised. This naturally leads to a simple dimensionality reduction technique, which we call Soft Discriminant Maps, which we use to show a direct relationship between the classification performance and the level of inter-class discrimination of feature extractors. Moreover, Soft Discriminant Maps consistently exhibit better classification performance than other comparable techniques.
Year
DOI
Venue
2016
10.1016/j.patcog.2015.11.015
Pattern Recognition
Keywords
Field
DocType
Dimensionality reduction,Feature extraction,Classification,High-dimensional datasets,Overfitting
Dimensionality reduction,Pattern recognition,Subspace topology,Computer science,Multiple discriminant analysis,Feature extraction,Curse of dimensionality,Artificial intelligence,Mutual information,Overfitting,Discriminative model,Machine learning
Journal
Volume
Issue
ISSN
53
C
0031-3203
Citations 
PageRank 
References 
5
0.39
29
Authors
2
Name
Order
Citations
PageRank
Raymond Liu1977.04
Duncan Fyfe Gillies29717.86