Title
Multi-Label Bioinformatics Data Classification With Ensemble Embedded Feature Selection.
Abstract
In bioinformatics, the vast of multi-label type of datasets, including clinical text, gene, and protein data, need to be categorized. Specifically, due to the redundant or irrelevant features in bioinformatics data, the performance of multi-label classifiers will be limited, and therefore, selecting effective features from the feature space is necessary. However, most of the proposed methods, which aimed at dealing with multi-label feature selection problem in the past few years, only adopt a simple and direct strategy that transforms the multi-label feature selection problem into more single-label ones and ignore correlations among different labels. In this paper, a novel algorithm named ensemble embedded feature selection (EEFS) is proposed to handle multi-label bioinformatics data learning problem in a more effective and efficient way. The EEFS does not only explicitly find out the correlations among labels, but it can also adequately utilize the label correlations by multi-label classifiers and evaluation measures. Furthermore, it can reduce the accumulated errors of data itself by employing an ensemble method. The experimental results on five multi-label bioinformatics datasets show that our algorithm achieves significant superiority over the other state-of-the-art algorithms.
Year
DOI
Venue
2019
10.1109/ACCESS.2019.2931035
IEEE ACCESS
Keywords
DocType
Volume
Bioinformatics,multi-label learning,embedded feature selection
Journal
7
ISSN
Citations 
PageRank 
2169-3536
1
0.36
References 
Authors
0
4
Name
Order
Citations
PageRank
Yumeng Guo131.05
Fu-lai Chung224434.50
Guo-Zheng Li336842.62
Lei Zhang410.36