A weighted N-list-based method for mining frequent weighted itemsets

Authors: Huong Bui; Bay Vo*; Ham Nguyen; Tu Anh Nguyen Hoang; Tzung Pei Hong
Source: Expert Systems with Applications, 2018, 96: 388-405.
DOI: 10.1016/j.eswa.2017.10.039

Abstract

Mining frequent itemsets (FIs) is an important problem in data mining, and many methods have been proposed to solve it. However, FI mining usually operates on binary databases and considers only whether items appear, regardless of their importance. In practical applications, items often differ in importance depending on their values or meanings, which has led to the emergence of weighted databases. In this paper, we propose a new method for mining frequent weighted itemsets (FWIs) from a weighted database using the weighted N-list (WN-list) structure, an extension of the N-list. We propose several theorems for quickly calculating the weighted supports of itemsets and build an algorithm on these theorems to mine FWIs efficiently. The experimental results show that the proposed method outperforms existing methods, especially on very large and sparse databases.
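
To make the notion of a frequent weighted itemset concrete, the following is a minimal brute-force sketch in Python. It is not the WN-list algorithm of the paper, whose details the abstract does not give. It assumes the definition of weighted support commonly used in the FWI literature: the weight of a transaction is the average weight of its items, and the weighted support of an itemset is the total weight of the transactions containing it divided by the total weight of all transactions. The function names, the sample database, and the threshold are all hypothetical.

```python
from itertools import combinations


def transaction_weight(transaction, weights):
    """Assumed definition: average weight of the items in the transaction."""
    return sum(weights[item] for item in transaction) / len(transaction)


def weighted_support(itemset, transactions, weights):
    """Weight of transactions containing `itemset` over the weight of all transactions."""
    tws = [transaction_weight(t, weights) for t in transactions]
    covered = sum(tw for t, tw in zip(transactions, tws) if itemset <= t)
    return covered / sum(tws)


def mine_fwis(transactions, weights, min_ws):
    """Naive level-wise enumeration; illustrative only, exponential in the worst case."""
    items = sorted({item for t in transactions for item in t})
    result = {}
    for k in range(1, len(items) + 1):
        found = False
        for combo in combinations(items, k):
            candidate = frozenset(combo)
            ws = weighted_support(candidate, transactions, weights)
            if ws >= min_ws:
                result[candidate] = ws
                found = True
        if not found:
            # Under this definition a superset is contained in no more
            # transactions than its subsets, so no larger itemset can qualify.
            break
    return result


# Hypothetical weighted database: three transactions and per-item weights.
transactions = [frozenset("AB"), frozenset("A"), frozenset("BC")]
weights = {"A": 1.0, "B": 0.5, "C": 0.5}

for itemset, ws in mine_fwis(transactions, weights, min_ws=0.3).items():
    print(sorted(itemset), round(ws, 3))
```

Every candidate here rescans the whole database, which is exactly the cost that a compact structure such as the WN-list is meant to avoid.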