Title
PaMPa-HD: A Parallel MapReduce-Based Frequent Pattern Miner for High-Dimensional Data.
Abstract
Frequent closed itemset mining is among the most complex exploratory techniques in data mining, and provides the ability to discover hidden correlations in transactional datasets. The explosion of Big Data is leading to new parallel and distributed approaches. Unfortunately, most of them are designed to cope with low-dimensional datasets, whereas no distributed high-dimensional frequent closed itemset mining algorithms exists. This work introduces PaMPa-HD, a parallel MapReduce-based frequent closed itemset mining algorithm for high-dimensional datasets, based on Carpenter. The experimental results, performed on both real and synthetic datasets, show the efficiency and scalability of PaMPa-HD.
Year
DOI
Venue
2015
10.1109/ICDMW.2015.18
ICDM Workshops
Field
DocType
Citations 
Data mining,Clustering high-dimensional data,Algorithm design,Computer science,Artificial intelligence,Data mining algorithm,Big data,Machine learning,Scalability
Conference
3
PageRank 
References 
Authors
0.36
11
6
Name
Order
Citations
PageRank
Daniele Apiletti110411.69
Elena Baralis21319186.33
Tania Cerquitelli329635.94
Paolo Garza442639.13
Pietro Michiardi51512111.53
Fabio Pulvirenti6122.92