Title
Cost-sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data.
Abstract
Classification with big data has become one of the latest trends in learning from available information. Data growth in recent years has sharply raised interest in effectively acquiring knowledge to analyze and predict trends. The variety and veracity associated with big data introduce a degree of uncertainty that has to be handled in addition to the volume and velocity requirements. Such data usually also exhibits the problem known as classification with imbalanced datasets: a class distribution in which the most important concepts to be learned are represented by a negligible number of examples relative to the number of examples from the other classes. To deal adequately with imbalanced big data we propose the Chi-FRBCS-BigDataCS algorithm, a fuzzy rule based classification system that is able to deal with the uncertainty introduced in large volumes of data without disregarding learning in the underrepresented class. The method uses the MapReduce framework to distribute the computational operations of the fuzzy model, and it includes cost-sensitive learning techniques in its design to address the imbalance present in the data. The good performance of this approach is supported by an experimental analysis carried out over twenty-four imbalanced big data case studies. The results show that the proposal is able to handle these problems, obtaining competitive results in both the classification performance of the model and the time needed for the computation.
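The combination described in the abstract (per-partition rule learning merged in a reduce step, with class costs that protect the minority class) can be illustrated with a toy sketch. This is not the Chi-FRBCS-BigDataCS algorithm itself: the three triangular labels in `CENTERS`, the product t-norm, the simplified rule support, the in-memory `map_partition` / `reduce_rules` helpers, and the cost values are all assumptions made for illustration, and no Hadoop API is used.

```python
from collections import defaultdict
from functools import reduce

# Assumption: features are scaled to [0, 1] and each has three triangular labels.
CENTERS = (0.0, 0.5, 1.0)

def membership(x, center, width=0.5):
    """Triangular membership degree of x in the label centred at `center`."""
    return max(0.0, 1.0 - abs(x - center) / width)

def antecedent(features):
    """Per feature, pick the index of the label with the highest membership."""
    return tuple(
        max(range(len(CENTERS)), key=lambda i: membership(x, CENTERS[i]))
        for x in features
    )

def map_partition(partition, costs):
    """Map step: cost-weighted matching degree per (antecedent, class)."""
    acc = defaultdict(lambda: defaultdict(float))
    for features, label in partition:
        ant = antecedent(features)
        degree = 1.0
        for x, i in zip(features, ant):
            degree *= membership(x, CENTERS[i])  # product t-norm (assumed)
        acc[ant][label] += degree * costs[label]
    return acc

def reduce_rules(a, b):
    """Reduce step: merge the partial sums of two partitions into `a`."""
    for ant, per_class in b.items():
        for label, value in per_class.items():
            a[ant][label] += value
    return a

def build_rule_base(partitions, costs):
    """Run map over every partition, reduce, then pick one class per antecedent."""
    merged = reduce(reduce_rules, (map_partition(p, costs) for p in partitions))
    return {ant: max(per_class, key=per_class.get)
            for ant, per_class in merged.items()}
```

Calling `build_rule_base(partitions, {"maj": 1.0, "min": 4.0})` on a list of partitions, each a list of `(features, label)` pairs, returns a dictionary from antecedent tuples to class labels. Setting the minority cost near the imbalance ratio is one common heuristic; whether it matches the cost scheme used in the paper is not stated in the abstract.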
Year
2015
DOI
10.1016/j.fss.2014.01.015
Venue
Fuzzy Sets and Systems
Keywords
Fuzzy rule based classification systems, Big data, MapReduce, Hadoop, Imbalanced datasets, Cost-sensitive learning
Field
Data mining, Fuzzy model, Computer science, Artificial intelligence, Big data, Machine learning, Fuzzy rule, Computation
DocType
Journal
Volume
258
ISSN
0165-0114
Citations
32
PageRank
0.85
References
53
Authors
4
Name, Order, Citations, PageRank
Victoria Lopez, 1, 186, 5.50
S. del Río, 2, 243, 8.92
José Manuel Benítez, 3, 888, 56.02
Francisco Herrera, 4, 27391, 1168.49