Title
Cost-sensitive and hybrid-attribute measure multi-decision tree over imbalanced data sets.
Abstract
One of the most popular algorithms for classification is the decision tree. However, existing binary decision tree models do not handle well the minority class over imbalanced data sets. To address this difficulty, a Cost-sensitive and Hybrid attribute measure Multi-Decision Tree (CHMDT) approach is presented in this paper. It penalizes misclassification through a hybrid attribute measure, which is defined from the combination of the Gini index and information gain measure. It further builds a multi-decision tree consisting of multiple decision trees each with different root node information. The overall objective of the approach is to maximize the classification performance with the hybrid attribute measure while minimizing the total misclassification cost. Experiments are conducted over twelve KEEL imbalanced data sets to demonstrate the CHMDT approach. They show that the classification performance of the minority class is improved significantly without sacrifice of the overall classification accuracy of the majority class.
Year
DOI
Venue
2018
10.1016/j.ins.2017.09.013
Information Sciences
Keywords
Field
DocType
Multi-decision tree,Minority class,Imbalanced data set,Cost sensitivity,Hybrid attribute measure
Decision tree,Data mining,Data set,Information gain,Binary decision diagram,Artificial intelligence,Mathematics,Machine learning,Decision tree learning,Incremental decision tree
Journal
Volume
Issue
ISSN
422
C
0020-0255
Citations 
PageRank 
References 
14
0.57
22
Authors
6
Name
Order
Citations
PageRank
Fenglian Li1141.25
Xueying Zhang2142.26
Xiqian Zhang3140.57
Chunlei Du4191.73
Yue Xu553453.20
Yu-Chu Tian655059.35