Title
Mutual Information Based Output Dimensionality Reduction
Abstract
When both the input and the output spaces are high-dimensional, even simple regression is prohibitively costly. Dimensionality reduction in the output space is important for efficient learning and prediction, since modern applications such as topic modelling and image classification have extremely large output spaces. In contrast to input dimensionality reduction, dimensionality reduction on the output side is complicated. We propose mutual information based output dimensionality reduction, which takes into account the relationship between the input and the output that is essential for regression and classification problems. To form the compressed label space, our method selects those labels that typically have the maximum mutual information with the input. Selecting the best subset is computationally hard, but we provide a polynomial-time algorithm with a provable approximation guarantee. We conduct experiments on seven multi-label classification datasets. Results show that our method performs better than existing methods on some datasets.
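The label-selection idea in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: it assumes a simple greedy scheme (suggested only by the "submodular function" and "Greedy algorithm" entries in this record), discrete inputs, and a plug-in estimate of mutual information that is reasonable only for small toy data. The names `entropy`, `mutual_information`, and `greedy_select` are hypothetical, and no approximation guarantee is claimed for this sketch.

```python
from collections import Counter
from math import log2


def entropy(rows):
    """Empirical Shannon entropy (in bits) of a list of hashable outcomes."""
    n = len(rows)
    return -sum((c / n) * log2(c / n) for c in Counter(rows).values())


def mutual_information(X, Y, subset):
    """Plug-in estimate of I(X; Y_S) = H(X) + H(Y_S) - H(X, Y_S).

    X is a list of discrete input rows, Y a list of label rows, and
    subset the list of label indices S. Assumption: everything is discrete.
    """
    xs = [tuple(x) for x in X]
    ys = [tuple(y[j] for j in subset) for y in Y]
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


def greedy_select(X, Y, k):
    """Greedily pick k label indices, each time adding the label that
    gives the largest estimated mutual information with the input."""
    selected = []
    remaining = set(range(len(Y[0])))
    for _ in range(k):
        # sorted() makes tie-breaking deterministic (lowest index wins).
        best = max(sorted(remaining),
                   key=lambda j: mutual_information(X, Y, selected + [j]))
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data: label 0 copies the input, label 1 is independent of it,
# label 2 is constant.
X = [(0,), (0,), (1,), (1,)]
Y = [[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]]
chosen = greedy_select(X, Y, 1)
```

On this toy data the label that copies the input carries all of the input's information, so it is the one selected first. Each greedy step re-estimates the mutual information for every remaining label, so the cost is polynomial in the number of labels and samples.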
Year
2014
DOI
10.1109/ICDM.2014.110
Venue
Data Mining
Keywords
approximation theory, computational complexity, learning (artificial intelligence), pattern classification, regression analysis, approximation guarantee, classification problems, compressed label space, dimensional input space, dimensional output space, learning, multilabel classification datasets, mutual information based output dimensionality reduction, polynomial time algorithm, prediction, regression problems, subset selection, dimension reduction, multi-label, mutual information, output dimension reduction, submodular function
Field
Data mining, Dimensionality reduction, Computer science, Input/output, Artificial intelligence, Time complexity, Contextual image classification, Compressed sensing, Pattern recognition, Submodular set function, Greedy algorithm, Mutual information, Machine learning
DocType
Conference
ISSN
1550-4786
Citations
0
PageRank
0.34
References
14
Authors
2
Name              Order   Citations   PageRank
Shishir Pandey    1       0           0.34
Rahul Vaze        2       463         45.64