Title | ||
---|---|---|
Cost-sensitive and hybrid-attribute measure multi-decision tree over imbalanced data sets. |
Abstract | ||
---|---|---|
One of the most popular algorithms for classification is the decision tree. However, existing binary decision tree models do not handle well the minority class over imbalanced data sets. To address this difficulty, a Cost-sensitive and Hybrid attribute measure Multi-Decision Tree (CHMDT) approach is presented in this paper. It penalizes misclassification through a hybrid attribute measure, which is defined from the combination of the Gini index and information gain measure. It further builds a multi-decision tree consisting of multiple decision trees each with different root node information. The overall objective of the approach is to maximize the classification performance with the hybrid attribute measure while minimizing the total misclassification cost. Experiments are conducted over twelve KEEL imbalanced data sets to demonstrate the CHMDT approach. They show that the classification performance of the minority class is improved significantly without sacrifice of the overall classification accuracy of the majority class. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1016/j.ins.2017.09.013 | Information Sciences |
Keywords | Field | DocType |
Multi-decision tree,Minority class,Imbalanced data set,Cost sensitivity,Hybrid attribute measure | Decision tree,Data mining,Data set,Information gain,Binary decision diagram,Artificial intelligence,Mathematics,Machine learning,Decision tree learning,Incremental decision tree | Journal |
Volume | Issue | ISSN |
422 | C | 0020-0255 |
Citations | PageRank | References |
14 | 0.57 | 22 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Fenglian Li | 1 | 14 | 1.25 |
Xueying Zhang | 2 | 14 | 2.26 |
Xiqian Zhang | 3 | 14 | 0.57 |
Chunlei Du | 4 | 19 | 1.73 |
Yue Xu | 5 | 534 | 53.20 |
Yu-Chu Tian | 6 | 550 | 59.35 |