Title
Finding the hottest item in data streams.
Abstract
We study a problem of finding the hottest item interval in a data stream, where the hotness of an item over an interval is determined by its average frequency. Finding the hottest item interval is particularly helpful in business promotions, such as monitoring the peak sales records, finding the hottest period in an online game, digging the highest click rate of an online music, etc. Existing work focus on finding the most frequent item over a fixed length interval. However, these solutions cannot return the hottest interval since the best length (i.e., maximizing the average frequency) is unknown in advance. To discover the hottest item interval, a straightforward solution is to calculate the average frequencies of items for every possible interval length, which is too costly for stream applications. To efficiently compute the hottest item interval, we propose an algorithm that employs the arrival timestamps of items and reduce the search space by three pruning strategies. Extensive experiments show that the proposed algorithms can efficiently discover the hottest item interval on both real and synthetic datasets.
Year
DOI
Venue
2018
10.1016/j.ins.2017.11.012
Information Sciences
Keywords
Field
DocType
Online algorithm,Hottest interval,Item stream
Data stream mining,Data stream,Timestamp,Artificial intelligence,Mathematics,Machine learning
Journal
Volume
ISSN
Citations 
430
0020-0255
0
PageRank 
References 
Authors
0.34
20
6
Name
Order
Citations
PageRank
Huaizhong Lin16712.34
Shanshan Wu210616.37
Leong Hou U334833.45
ngai meng kou4385.17
Yunjun Gao586289.71
Dongming Lu616332.29