Title
MapReduce distributed highly random fuzzy forest for noisy big data.
Abstract
Nowadays the amounts of data available to us have the ever larger growth trend. On the other hand such data often contain noise. We call them noisy Big Data. There is an increasing need for learning methods that can handle such noisy Big Data for classification tasks. In this paper we propose a highly random fuzzy forest algorithm for learning an ensemble of fuzzy decision trees from a big data set contaminated with attribute noise. We also present the distributed version of the proposed learning algorithm implemented in the MapReduce framework. Experiment results have demonstrated that the proposed algorithm is faster and more accurate than the state-of-the-art approach particularly in the presence of attribute noise.
Year
Venue
Field
2017
ICNC-FSKD
Decision tree,Noise measurement,Computer science,Fuzzy logic,Fuzzy set,Artificial intelligence,Statistical classification,Big data,Fuzzy decision tree,Machine learning
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Faruk Mustafic100.34
Francisco Herrera2273911168.49
N. Xiong3564.19
Sergio Ramírez-Gallego4986.99