Journal of Jilin University Science Edition ›› 2021, Vol. 59 ›› Issue (3): 635-642.

Previous Articles     Next Articles

THIMFUP Algorithm Based on the Most Frequent Item Extraction and Candidate Set Pruning

YANG Yong1,2, ZHANG Lei1, QU Fuheng1, LIU Junjie1, CHEN Qiang1   

  1. 1. School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China;
    2. Institute of Education, Changchun Normal University, Changchun 130032, China
  • Received:2020-09-15 Online:2021-05-26 Published:2021-05-23
  • Contact: 杨勇 E-mail:yy@cust.edu.cn

Abstract: Aiming at the problem that FBCM (FUP (fast update algorithm) based on matrix compression) algorithm frequently scanned the original frequent itemset library and generated a large number of candidate sets in the process of item set mining. We proposed a method to extract the most frequent items from the database to reduce the scanning times on the original frequent itemset library, and through a candidate set pruning idea, it reduced the generation of candidate sets in the whole running process of the algorithm, so as to improve the speed of frequent itemset mining. Experimental results show that the efficiency of the algorithm is 15% higher than that of FBCM algorithm, and the highest is 60% under the same experimental conditions.

Key words: association rule, incremental mining, candidate set pruning, most frequent item

CLC Number: 

  • TP301.6