Title
Maximally informative k-itemset mining from massively distributed data streams.
Abstract
We address the problem of mining maximally informative k-itemsets (miki) in data streams based on joint entropy. We propose PentroS, a highly scalable parallel miki mining algorithm. PentroS renders the mining process of large volumes of incoming data very efficient. It is designed to take into account the continuous aspect of data streams, particularly by reducing the computations of need for updating the miki results after arrival/departure of transactions to/from the sliding window. PentroS has been extensively evaluated using massive real-world data streams. Our experimental results confirm the effectiveness of our proposal which allows excellent throughput with high itemset length.
Year
DOI
Venue
2018
10.1145/3167132.3167187
SAC 2018: Symposium on Applied Computing Pau France April, 2018
Keywords
Field
DocType
Distributed Data Streams, Miki mining, Spark Streaming
Data mining,Data stream mining,Sliding window protocol,Computer science,Joint entropy,Throughput,Data mining algorithm,Computation,Scalability
Conference
ISBN
Citations 
PageRank 
978-1-4503-5191-1
0
0.34
References 
Authors
15
4
Name
Order
Citations
PageRank
Mehdi Zitouni100.34
Reza Akbarinia225425.77
Sadok Ben Yahia3657124.02
Florent Masseglia440843.08