Abstract | ||
---|---|---|
Parallel computing is an effective method to solve computationally large and data-intensive problems. The traditional data mining algorithm cannot mining association rules for large amounts of streaming data in a timely and effectively. In order to improve the speed and accuracy of association rules mining, distributed and parallel algorithms have become a research focus. This paper proposes a parallel FP-growth approach using Spark Streaming, called SSPFP, which can parallel mining frequent itemsets and association rules in real-time streaming data. In this paper, the proposed SSPFP algorithm is applied to mining the association rules between temperature and salinity in marine Argo data. The experimental results indicate that SSPFP algorithm is efficient for association rules mining. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1002/cta.2935 | INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS |
Keywords | DocType | Volume |
Argo, association rules, data mining, parallel computing, Spark Streaming | Journal | 49 |
Issue | ISSN | Citations |
4 | 0098-9886 | 0 |
PageRank | References | Authors |
0.34 | 0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Longtao Liu | 1 | 0 | 0.34 |
Jiabao Wen | 2 | 16 | 4.61 |
Zexun Zheng | 3 | 0 | 0.68 |
Hansong Su | 4 | 0 | 0.68 |