A parallel FP-growth algorithm on World Ocean Atlas data with multi-core CPU

Authors: Jiang, Yu; Zhao, Minghao; Hu, Chengquan; He, Lili; Bai, Hongtao*; Wang, Jin
Source: Journal of Supercomputing, 2019, 75(2): 732-745.
DOI: 10.1007/s11227-018-2297-6

Abstract

Given the complexity of ocean data, this paper applies a parallel association rule mining algorithm to explore the correlations and regularities among oxygen, temperature, phosphate, nitrate and silicate in the ocean. After the marine data are interpolated, the parallel FP-growth algorithm is used to mine them, and the resulting frequent itemsets and association rules are briefly analyzed. The relationship between parallel efficiency and the number of CPU cores is examined using datasets of different scales. The experimental results indicate that the speedup is best when each thread processes 200,000 to 300,000 records, yielding a performance improvement of more than 1.2 times.
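
As a rough illustration of the data-partitioning idea behind a parallel FP-growth run, the sketch below splits a transaction list into fixed-size chunks, counts item occurrences per chunk in worker threads, and merges the partial counts to find frequent single items (the first pass of FP-growth). It is not the authors' implementation: the function names, the item labels, and the default chunk size of 250,000 (chosen only because it lies in the range quoted in the abstract) are assumptions made for illustration. Python threads are used here only to show the partition-and-merge structure; they do not by themselves deliver multi-core speedup for this kind of work.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def count_chunk(transactions):
    """Count, for one chunk, how many transactions contain each item."""
    counts = Counter()
    for transaction in transactions:
        # set() ensures an item is counted once per transaction.
        counts.update(set(transaction))
    return counts


def parallel_item_counts(transactions, n_workers=4, chunk_size=250_000):
    """Split transactions into chunks, count each chunk in a worker, merge."""
    chunks = [
        transactions[i:i + chunk_size]
        for i in range(0, len(transactions), chunk_size)
    ]
    total = Counter()
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        for partial in pool.map(count_chunk, chunks):
            total.update(partial)  # adds the partial counts to the totals
    return total


def frequent_items(transactions, min_support, **kwargs):
    """Return items whose relative support is at least min_support."""
    counts = parallel_item_counts(transactions, **kwargs)
    threshold = min_support * len(transactions)
    return {item: c for item, c in counts.items() if c >= threshold}


if __name__ == "__main__":
    # Hypothetical discretized records; the labels are invented for this sketch.
    demo = [
        ["O2_high", "T_low"],
        ["O2_high", "NO3_high"],
        ["T_low"],
        ["O2_high", "T_low", "NO3_high"],
    ]
    print(frequent_items(demo, min_support=0.75, chunk_size=2))
```

In a full FP-growth run, the frequent items found this way would be used to order and prune each transaction before the FP-tree is built and mined; that stage is omitted here.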