Title
Online transfer learning with multiple decision trees.
Abstract
Online learning techniques have been widely used in many fields where instances come one by one. However, in early stage of a data stream, online learning models cannot exhibit good classification accuracy for it cannot collect sufficient instances to learn. For example, a well-known online learning algorithm named as very fast decision tree (VFDT) needs to wait for Hoeffding bound satisfied to split, which leads to poor classification accuracy at the beginning of data stream. Thus, VFDT may not be appropriate for some real applications which demand us a fast and accurate online detection. This situation will become more serious in the scenario of data stream classification with concept drift. This paper attempts to take transfer learning algorithm to make up this shortcoming of VFDT. To achieve this goal, a new decision tree method named as VFDT-D is first proposed to cache instances in its leaf nodes to handle numerical attributes and adapt to a framework of online transfer learning (OTL), and then a measure which considers tree path, classification accuracy and classification confidence is proposed to evaluate the local similarity between source and target domain classifiers. At last, a multiple-source online transfer learning algorithm named as DMOTL is proposed to take VFDT-D as base classifier and use the proposed measure of local similarity to select the optimal source domain classifier to help transfer learning. The extensive experiments on several synthetic and real-world datasets demonstrate the advantage of the proposed algorithm.
Year
DOI
Venue
2019
10.1007/s13042-019-00998-3
International Journal of Machine Learning and Cybernetics
Keywords
Field
DocType
Online learning, Transfer learning, Multiple sources, Local similarity, Incremental decision tree, Concept drift
Hoeffding's inequality,Decision tree,Data stream,Cache,Computer science,Transfer of learning,Concept drift,Artificial intelligence,Classifier (linguistics),Machine learning,Incremental decision tree
Journal
Volume
Issue
ISSN
10
10
1868-8071
Citations 
PageRank 
References 
1
0.38
0
Authors
5
Name
Order
Citations
PageRank
Yimin Wen111.06
Yixiu Qin210.38
Keke Qin310.38
Xiaoxia Lu410.38
Pingshan Liu543.81