Abstract | ||
---|---|---|
We study a problem of finding the hottest item interval in a data stream, where the hotness of an item over an interval is determined by its average frequency. Finding the hottest item interval is particularly helpful in business promotions, such as monitoring the peak sales records, finding the hottest period in an online game, digging the highest click rate of an online music, etc. Existing work focus on finding the most frequent item over a fixed length interval. However, these solutions cannot return the hottest interval since the best length (i.e., maximizing the average frequency) is unknown in advance. To discover the hottest item interval, a straightforward solution is to calculate the average frequencies of items for every possible interval length, which is too costly for stream applications. To efficiently compute the hottest item interval, we propose an algorithm that employs the arrival timestamps of items and reduce the search space by three pruning strategies. Extensive experiments show that the proposed algorithms can efficiently discover the hottest item interval on both real and synthetic datasets. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1016/j.ins.2017.11.012 | Information Sciences |
Keywords | Field | DocType |
Online algorithm,Hottest interval,Item stream | Data stream mining,Data stream,Timestamp,Artificial intelligence,Mathematics,Machine learning | Journal |
Volume | ISSN | Citations |
430 | 0020-0255 | 0 |
PageRank | References | Authors |
0.34 | 20 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Huaizhong Lin | 1 | 67 | 12.34 |
Shanshan Wu | 2 | 106 | 16.37 |
Leong Hou U | 3 | 348 | 33.45 |
ngai meng kou | 4 | 38 | 5.17 |
Yunjun Gao | 5 | 862 | 89.71 |
Dongming Lu | 6 | 163 | 32.29 |