Title
A new fuzzy decision tree classification method for mining high-speed data streams based on binary search trees
Abstract
Decision tree construction is a well-studied problem in data mining. Recently, there has been much interest in mining data streams. Domingos and Hulten have presented a one-pass algorithm for decision tree constructions. Their system using Hoeffding inequality to achieve a probabilistic bound on the accuracy of the tree constructed. Gama et al. have extended VFDT in two directions. Their system VFDTc can deal with continuous data and use more powerful classification techniques at tree leaves. Peng et al. present soft discretization method to solve continuous attributes in data mining. In this paper, we revisit these problems and implemented a system sVFDT for data stream mining. We make the following contributions: 1) we present a binary search trees (BST) approach for efficiently handling continuous attributes. Its processing time for values inserting is O(nlogn), while VFDT's processing time is O(n2). 2) We improve the method of getting the best split-test point of a given continuous attribute. Comparing to the method used in VFDTc, it decreases from O(nlogn) to O (n) in processing time. 3) Comparing to VFDTc, sVFDT's candidate split-test number decrease from O(n) to O(logn).4)Improve the soft discretization method to increase classification accuracy in data stream mining.
Year
DOI
Venue
2007
10.1007/978-3-540-73814-5_20
FAW
Keywords
Field
DocType
data mining,data stream mining,classification method,decision tree construction,continuous attribute,present soft discretization method,processing time,new fuzzy decision tree,continuous data,soft discretization method,high-speed data stream,mining data stream,binary search tree,decision tree
Decision tree,Data mining,Data stream mining,Tree traversal,Computer science,Optimal binary search tree,Decision tree learning,Binary search tree,Incremental decision tree,Interval tree
Conference
Volume
ISSN
ISBN
4613
0302-9743
3-540-73813-4
Citations 
PageRank 
References 
4
0.44
22
Authors
5
Name
Order
Citations
PageRank
Zhoujun Li1964115.99
Tao Wang2333.91
Ruoxue Wang340.44
Yuejin Yan4253.06
Huo-wang Chen523533.47