Abstract | ||
---|---|---|
In multi-label learning, feature selection is a non-ignorable preprocessing step which can alleviate the negative effect of high-dimensionality. To address this problem, a number of effective information theory based feature selection algorithms for multi-label learning are proposed. However, these existing algorithms assume that the label space of multi-label training data is complete. In practice, the standpoint does not always hold true, due to the ambiguity among class labels or the cost effort to fully annotate instances. In this paper, we first define the new concepts of multi-label information entropy and multi-label mutual information. Then, feature redundancy, feature independence, and feature interaction are defined, respectively. In which, feature interaction is used to select more valuable features which may be ignored due to the incomplete label space. Moreover, a multi-label feature selection method with missing labels is proposed. Finally, extensive experiments conducted on eight publicly available data sets verify the effectiveness of the proposed algorithm via comparing it with state-of-the-art methods. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1007/s10489-019-01431-6 | Applied Intelligence |
Keywords | Field | DocType |
Feature selection, Neighborhood mutual information, Feature interaction, Missing labels, Multi-label learning | Information theory,Data set,Pattern recognition,Feature selection,Computer science,Preprocessor,Redundancy (engineering),Mutual information,Artificial intelligence,Entropy (information theory),Ambiguity,Machine learning | Journal |
Volume | Issue | ISSN |
49 | 8 | 0924-669X |
Citations | PageRank | References |
1 | 0.34 | 29 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Chenxi Wang | 1 | 30 | 2.29 |
Yaojin Lin | 2 | 470 | 23.01 |
Jinghua Liu | 3 | 29 | 2.94 |