吉林大学学报(理学版) ›› 2021, Vol. 59 ›› Issue (5): 1205-1211.

• • 上一篇    下一篇

 一种增量式MinMax k-Means聚类算法

胡雅婷1, 陈营华1, 宝音巴特2, 曲福恒3, 李卓识1   

  1. 1. 吉林农业大学 信息技术学院, 长春 130118; 2. 吉林省科学技术工作者服务中心, 长春 130021;3. 长春理工大学 计算机科学技术学院, 长春 130022
  • 收稿日期:2020-08-12 出版日期:2021-09-26 发布日期:2021-09-26
  • 通讯作者: 曲福恒 E-mail:qufuheng@163.com

An Incremental MinMax k-Means Clustering Algorithm

HU Yating1, CHEN Yinghua1, BAOYIN Bate2, QU Fuheng3, LI Zhuoshi1   

  1. 1. School of Information and Technology, Jilin Agriculture University, Changchun 130118, China; 2. Jilin Science and Technology Works Service Center, Changchun 130021, China; 3. School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China
  • Received:2020-08-12 Online:2021-09-26 Published:2021-09-26

摘要: 针对MinMax k-means算法易产生空解、 收敛速度慢和计算效率低的问题, 提出一种增量式MinMax k-means聚类算法. 该算法从给定的初始聚类个数开始, 以固定步长递增式产生新的聚类中心, 采用基于数据均衡的快速分裂方法产生增量聚类中心, 从而避免了传统增量聚类中心选择中遍历数据、k-means聚类算法运行次数过多导致的大计算量问题. 与MinMax k-means及相关算法的对比实验结果表明, 该算法在计算效率和求解精度上均优于对比算法, 有效改善了MinMax k-means聚类对初始化中心敏感和易产生空解的问题.

关键词: k均值聚类, 增量式聚类, 初始化, 聚类中心

Abstract: Aiming at the problem that MinMax k-means algorithm was easy to generate empty solutions, slow convergence speed and low computational efficiency, we proposed an incremental MinMax k-means clustering algorithm. The algorithm started from a given initial clustering number, and generated new cluster centers by increasing a fixed step length. The fast dividing method based on data balance was used to generate incremental cluster centers, so as to avoid the large amount of calculation problem caused by traversing data and too many running times of k-means clustering algorithm in traditional incremental clustering center selection. Compared with MinMax k-means and related algorithms, the experimental results show that the algorithm is superior to the comparison algorithm in calculation efficiency and accuracy, and effectively improves the sensitivity of MinMax k-means clustering to initialization center and easy to generate empty solutions.

Key words: k-means clustering, incremental clustering, initialization, cluster center

中图分类号: 

  • TP391.4