Title
Big data classification using heterogeneous ensemble classifiers in Apache Spark based on MapReduce paradigm
Abstract
•Distributed Heterogeneous Ensemble is designed for big data classification.•Classifiers are pruned from the ensemble to increase the diversity.•A Spark version of DHBoost is presented based on MapReduce programming paradigm.•DHBoost outperforms the state-of-the-art ensemble classifiers in the Spark library.
Year
DOI
Venue
2021
10.1016/j.eswa.2021.115369
Expert Systems with Applications
Keywords
DocType
Volume
Ensemble classifier,Boosting,MapReduce,Big data,Apache Spark,Apache Hadoop
Journal
183
ISSN
Citations 
PageRank 
0957-4174
1
0.35
References 
Authors
0
3
Name
Order
Citations
PageRank
Hamid Reza Kadkhodaei110.35
Amir-Masoud Eftekhari-Moghadam210.35
Mehdi Dehghan33022324.48