吉林大学学报(理学版) ›› 2022, Vol. 60 ›› Issue (4): 929-942.

• • 上一篇    下一篇

基于k多数值代表的混合矩阵对象数据聚类

赵健   

  1. 长治学院 计算机系, 山西 长治 046011
  • 收稿日期:2021-04-30 出版日期:2022-07-26 发布日期:2022-07-26
  • 通讯作者: 赵健 E-mail:mitang22965730@163.com

Mixed Matrix Object Data Clustering Based on k-Multiple Value Representation

ZHAO Jian   

  1. Department of Computer, Changzhi University, Changzhi 046011, Shanxi Province, China
  • Received:2021-04-30 Online:2022-07-26 Published:2022-07-26

摘要: 针对数据的稀疏性和高维问题, 提出一种基于k多数值代表的混合矩阵对象数据聚类方法, 有效反映聚类中心与聚类内矩阵对象的分布. 为充分挖掘数据之间的隐含信息, 该算法首先定义两个数值矩阵对象之间的相异度度量, 给出一种更新聚类中心的启发式方法; 然后进一步提出一种基于k多数值代表的混合矩阵对象数据聚类算法; 最后在真实数据集与合成数据集上进行仿真实验. 实验结果表明, 该算法能有效实现包含大量记录、 矩阵对象、 属性的数据聚类.

关键词: 多数值代表, 聚类, 稀疏性, 矩阵对象

Abstract: Aiming at the problems of data sparsity and high dimension, the author proposed a mixed matrix object data clustering method based on k-multiple  value  representation, which effectively  reflected the distribution of cluster centers and matrix objects in clusters. In order to fully mine the hidden information between data, firstly, the algorithm defined the dissimilarity measure between two numerical matrix objects, and gave a heuristic method to update the clustering center. Secondly,  a mixed matrix object data clustering algorithm based on k-multiple value representation was proposed. Finally, the simulation experiments were carried out on real data sets and composite data sets. The experimental results  show that the proposed algorithm can effectively realize data clustering including a large number of records, matrix objects and attributes.

Key words: multiple value representation, clustering, sparsity, matrix object

中图分类号: 

  • TP391.41