Title
MHFlexDT: A Multivariate Branch Fuzzy Decision Tree Data Stream Mining Strategy Based on Hybrid Partitioning Standard.
Abstract
Fast data streams cannot be processed by multi-pass scanning algorithms that rely on random access, and traditional data mining algorithms cannot examine every sample in a stream. Fuzzy decision trees combine the understandability of decision trees with the ability of fuzzy sets to represent fuzzy and uncertain information, so research on data stream mining algorithms based on fuzzy decision tree theory is valuable for improving mining accuracy. This paper presents a fuzzy decision tree data mining strategy based on a hybrid partitioning standard. It targets the low accuracy that arises when samples are divided into leaf nodes according to their membership and low-membership samples with missing values must be handled. The multivariate branch fuzzy decision tree data stream mining strategy based on the hybrid partitioning standard (MHFlexDT) constructs a multivariate branch fuzzy tree structure. The data fitting problem is addressed by adding temporary branches for uncertain data. At the same time, the decision tree depth is effectively limited by a McDiarmid bound threshold. Experimental results show that, compared with the fuzzy decision tree data mining strategy, MHFlexDT is more effective in large-scale data stream mining: it reduces system computation, controls decision tree depth, and maintains high accuracy in the presence of missing values, data over-fitting, and noisy data.
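The abstract does not state how the McDiarmid bound threshold is computed, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a split measure whose value changes by at most `c / n` when one of `n` samples is replaced; under that assumption McDiarmid's inequality gives a deviation of at most `c * sqrt(ln(1/delta) / (2n))` with probability at least `1 - delta`. The names `mcdiarmid_epsilon`, `should_split`, `c`, and `delta` are hypothetical and chosen for illustration.

```python
import math


def mcdiarmid_epsilon(c: float, n: int, delta: float) -> float:
    """Deviation bound from McDiarmid's inequality.

    Assumption (not from the paper): replacing any one of the n samples
    changes the split measure by at most c / n.
    """
    if c <= 0 or n <= 0 or not (0.0 < delta < 1.0):
        raise ValueError("require c > 0, n > 0 and 0 < delta < 1")
    return c * math.sqrt(math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain: float, second_gain: float,
                 c: float, n: int, delta: float) -> bool:
    """Split only when the best attribute beats the runner-up by more
    than the bound, i.e. when the observed lead is unlikely to be noise."""
    return (best_gain - second_gain) > mcdiarmid_epsilon(c, n, delta)
```

Because the threshold shrinks only as more samples arrive at a node, weakly supported splits are postponed, which is one plausible way such a bound can restrain tree depth.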
Year
2018
DOI
10.1007/978-3-319-92537-0_36
Venue
ADVANCES IN NEURAL NETWORKS - ISNN 2018
Keywords
Data streams mining, Fuzzy decision tree, Hybrid partitioning standard, Classification learning
Field
Decision tree, Data stream mining, Data stream, Computer science, Fuzzy logic, Fuzzy set, Uncertain data, Tree structure, Artificial intelligence, Missing data, Machine learning
DocType
Conference
Volume
10878
ISSN
0302-9743
Citations
0
PageRank
0.34
References
12
Authors
4
Name
Order
Citations
PageRank
Xin Song11515.82
Han Wang, 2, 5, 2.48
Huiyuan He, 3, 0, 0.34
Yakun Meng, 4, 0, 0.34