Title
A Hierarchical Ensemble of ECOC for cancer classification based on multi-class microarray data.
Abstract
The difficulty of the cancer classification using multi-class microarray datasets lies in that there are only a few samples in each class. To effectively solve such a problem, we propose a hierarchical ensemble strategy, named as Hierarchical Ensemble of Error Correcting Output Codes (HE-ECOC). In this strategy, different feature subsets extracted from a dataset are used as inputs for three data-dependent ECOC algorithms, so as to produce different ECOC coding matrices. The mutual diversity degrees among these coding matrices are then calculated based on two schemes, named as the maximizing local diversity (MLD) and the maximizing global diversity (MGD) schemes. Both schemes can choose diverse coding matrices generated by the same or different ECOC algorithm(s), and the average fusion scheme is used to fuse the outputs of base learners. In the experiments, it is found that both MLD and MGD based HE-ECOC strategies work stably, and outperform individual single ECOC algorithms. In contrast with some ensemble systems, HE-ECOC generates a more robust ensemble system, and achieves better performance in most case. In short, HE-ECOC is a promising solution for the multi-class problem. The matlab code is available upon request.
Year
DOI
Venue
2016
10.1016/j.ins.2016.02.028
Information Sciences
Keywords
Field
DocType
Error Correcting Output Codes (ECOC),Ensemble learning,Cancer classification,Feature selection,Multi-class microarray data
Cancer classification,Data mining,MATLAB,Feature selection,Computer science,Matrix (mathematics),Coding (social sciences),Ensemble systems,Artificial intelligence,Fuse (electrical),Ensemble learning,Pattern recognition,Machine learning
Journal
Volume
Issue
ISSN
349
C
0020-0255
Citations 
PageRank 
References 
10
0.51
31
Authors
3
Name
Order
Citations
PageRank
Kun-hong Liu121717.00
ZhiHao Zeng2110.87
Vincent To-Yee Ng3433.18