Title
An Incremental Fuzzy Decision Tree Classification Method for Mining Data Streams
Abstract
One of the most important algorithms for mining data streams is VFDT. It uses the Hoeffding inequality to achieve a probabilistic bound on the accuracy of the constructed tree. Gama et al. extended VFDT in two directions: their system, VFDTc, can handle continuous data and use more powerful classification techniques at the tree leaves. In this paper, we revisit this problem and implement a system, fVFDT, on top of VFDT and VFDTc. We make the following four contributions: 1) We present a threaded binary search tree (TBST) approach for efficiently handling continuous attributes. Its processing time for inserting values is O(n log n), while VFDT's processing time is O(n^2). When a new example arrives, VFDTc needs to update O(log n) attribute tree nodes, whereas fVFDT needs to update only the one necessary node. 2) We improve the method of finding the best split-test point of a given continuous attribute. Compared with the method used in VFDTc, the processing time improves from O(n log n) to O(n). 3) Compared with VFDTc, the number of candidate split tests in fVFDT decreases from O(n) to O(log n). 4) We adapt the soft discretization method for use in data stream mining; it overcomes the problem of noisy data and improves classification accuracy.
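The threaded binary search tree idea in contributions 1) and 2) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `ThreadedBST`, `insert`, `sorted_values`, and `best_split` are hypothetical, the tree is left unbalanced (so the O(n log n) insertion cost holds only on average), and information gain is assumed as the split criterion. Each insertion touches the class counts of a single node, and the `next` threads allow one linear pass over the sorted values when searching for a split point.

```python
from collections import Counter
from math import log2


class Node:
    """One distinct attribute value with its per-class counts (hypothetical layout)."""

    def __init__(self, value, prev=None, next=None):
        self.value = value
        self.counts = Counter()
        self.left = None
        self.right = None
        self.prev = prev  # thread to the in-order predecessor
        self.next = next  # thread to the in-order successor


def entropy(counts):
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum(c / total * log2(c / total) for c in counts.values() if c)


class ThreadedBST:
    def __init__(self):
        self.root = None
        self.head = None        # node holding the smallest value
        self.total = Counter()  # class counts over all inserted examples

    def insert(self, value, label):
        """Descend once; only the node for `value` has its counts updated."""
        self.total[label] += 1
        if self.root is None:
            self.root = self.head = Node(value)
            self.root.counts[label] += 1
            return
        node = self.root
        while value != node.value:
            if value < node.value:
                if node.left is None:
                    # New leaf sits between node's old predecessor and node.
                    new = Node(value, prev=node.prev, next=node)
                    if node.prev is None:
                        self.head = new
                    else:
                        node.prev.next = new
                    node.prev = new
                    node.left = new
                node = node.left
            else:
                if node.right is None:
                    # New leaf sits between node and node's old successor.
                    new = Node(value, prev=node, next=node.next)
                    if node.next is not None:
                        node.next.prev = new
                    node.next = new
                    node.right = new
                node = node.right
        node.counts[label] += 1

    def sorted_values(self):
        values, node = [], self.head
        while node is not None:
            values.append(node.value)
            node = node.next
        return values

    def best_split(self):
        """One pass along the threads; returns (threshold, gain) for `x <= threshold`."""
        n = sum(self.total.values())
        base = entropy(self.total)
        left = Counter()
        best_value, best_gain = None, 0.0
        node = self.head
        while node is not None and node.next is not None:
            left.update(node.counts)
            right = self.total - left
            n_left = sum(left.values())
            gain = base - n_left / n * entropy(left) - (n - n_left) / n * entropy(right)
            if gain > best_gain:
                best_value, best_gain = node.value, gain
            node = node.next
        return best_value, best_gain


tree = ThreadedBST()
for value, label in [(3, "b"), (1, "a"), (4, "b"), (2, "a")]:
    tree.insert(value, label)
print(tree.sorted_values())  # [1, 2, 3, 4]
print(tree.best_split())     # (2, 1.0)
```

Keeping counts only for the exact value at each node, rather than cumulative counts along the search path, is what limits an insertion to a single counter update in this sketch. The cumulative counts needed for a split are rebuilt during the threaded pass.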
Year
2007
DOI
10.1007/978-3-540-73499-4_8
Venue
MLDM
Keywords
threaded binary search tree, extended vfdt, mining data streams, continuous attribute, attribute tree node, noise data, incremental fuzzy decision tree, processing time, system vfdtc, continuous data, data streams mining, mining data stream, classification method, binary search tree, data stream mining
Field
Data mining, Computer science, Threaded binary tree, Red–black tree, Artificial intelligence, Interval tree, Tree traversal, Pattern recognition, Optimal binary search tree, Segment tree, Machine learning, Search tree, Incremental decision tree
DocType
Conference
Volume
4571
ISSN
0302-9743
Citations
7
PageRank
0.54
References
22
Authors
4
Name            Order   Citations   PageRank
Tao Wang        1       33          3.91
Zhoujun Li      2       964         115.99
Yuejin Yan      3       25          3.06
Huo-wang Chen   4       235         33.47