Title
Optimal Aggregation of Binary Classifiers for Multiclass Cancer Diagnosis Using Gene Expression Profiles
Abstract
Multiclass classification is one of the fundamental tasks in bioinformatics and typically arises in cancer diagnosis studies by gene expression profiling. There have been many studies of aggregating binary classifiers to construct a multiclass classifier based on one-versus-the-rest (1R), one-versus-one (11), or other coding strategies, as well as some comparison studies between them. However, the studies found that the best coding depends on each situation. Therefore, a new problem, which we call the “optimal coding problem,” has arisen: how can we determine which coding is the optimal one in each situation? To approach this optimal coding problem, we propose a novel framework for constructing a multiclass classifier, in which each binary classifier to be aggregated has a weight value to be optimally tuned based on the observed data. Although there is no a priori answer to the optimal coding problem, our weight tuning method can be a consistent answer to the problem. We apply this method to various classification problems including a synthesized data set and some cancer diagnosis data sets from gene expression profiling. The results demonstrate that, in most situations, our method can improve classification accuracy over simple voting heuristics and is better than or comparable to state-of-the-art multiclass predictors.
Year
DOI
Venue
2009
10.1109/TCBB.2007.70239
IEEE/ACM Trans. Comput. Biology Bioinform.
Keywords
Field
DocType
learning artificial intelligence,molecular biophysics,multiclass classification,cancer,medicine,gene expression,encoding,genetics,support vector machines,gene expression profiling,voting,bioinformatics,aggregation,error correction,genomics,supervised learning,classification algorithms
Binary classification,Computer science,Coding (social sciences),Heuristics,Artificial intelligence,Classifier (linguistics),Multiclass classification,Pattern recognition,Support vector machine,Supervised learning,Bioinformatics,Statistical classification,Machine learning
Journal
Volume
Issue
ISSN
6
2
1545-5963
Citations 
PageRank 
References 
11
0.62
13
Authors
4
Name
Order
Citations
PageRank
Naoto Yukinawa1131.04
Shigeyuki Oba229027.68
Kikuya Kato3403.49
Shin Ishii423934.39