J4

• 计算机科学 • 上一篇    下一篇

具有动态加权特性的关联规则算法

欧阳继红, 王仲佳, 刘大有   

  1. 吉林大学 计算机科学与技术学院, 符号计算与知识工程教育部重点实验室,
  • 收稿日期:2004-11-09 修回日期:1900-01-01 出版日期:2005-05-26 发布日期:2005-05-26
  • 通讯作者: 刘大有

An Improved Association Rule Algorithm with Dynamically Weighted Characteristic

OUYANG Ji-hong, WANG Zhong-jia, LIU Da-you   

  1. College of Computer Science and Technology, Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
  • Received:2004-11-09 Revised:1900-01-01 Online:2005-05-26 Published:2005-05-26
  • Contact: LIU Da-you

摘要: 基于FP_growth关联规则, 提出一种具有动态加权特性 的改进算法. 把事务数据库中的项目按其重要程度划分为5个等级; 运用层次分析(AHP)算法 构造判断矩阵, 计算特征向量; 将得到的向量作为权值, 与项目在事务数据库中出现的次数 综合考虑作为衡量重要程度的标准, 生成FP_tree; 最后得到频繁项目集和关联规则. 由于 权重的赋予过程可以由领域专家动态地改变, 这样不但能挖掘出更有意义的规则, 而且在算 法的运行初期就大量剔除了那些权重小的无用项目集, 从而大大提高了算法的运行效率.

关键词: 数据挖掘, 关联规则, FP_growth算法, 加权树, 层次分析方

Abstract: Based on the FP_growth association rules, an improved a lgorithm with the dynamically weighted characteristic was put forward. Accord ing to their important degree, the items within the transaction database were divided into 5 grades, AHP was used to construct the judgmental matrix, and the eigenvector was calculated. At the same time, in order to create the FP_tree and to measure its important degree, taking the vector we got as weight, a synthetic consideration was made between the weight and the number items ap peared in the transaction database. Finally the frequent item sets and associati on rules were found. As the process of getting weight can be dynamically changed by domain expert, not only can the more meaningful rules be mined, but also a lot of lower weighted item sets were got rid of during the early time of algorithm, so the efficiency of our algorithm was improved.

Key words: data mining, association rule, FP_growth, weighted tree, AHP

中图分类号: 

  • TP18