基于min-max准则与区域划分的I-k-means-+聚类算法

Journal of Jilin University Science Edition ›› 2023, Vol. 61 ›› Issue (5): 1131-1138.

Previous Articles Next Articles

I-k-means-+ Clustering Algorithm Based on min-max Criterion and Region Division

QU Fuheng¹, SONG Jianfei¹, YANG Yong^1,2, HU Yating³, PAN Yuetao¹

1. College of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China; 2. College of Education, Changchun Normal University, Changchun 130032, China； 3. College of Information Technology, Jilin Agricultural University, Changchun 130118, China

Received:2022-12-05 Online:2023-09-26 Published:2023-09-26

Abstract

Abstract: Aiming at the problem of unstable clustering results and low solving accuracy of I-k-means-+ algorithm, we proposed I-k-means-+ clustering algorithm based on min-max criterion and region division. Firstly, the min-max criterion was proposed to calculate the distance from each data point to the nearest center, and the data point with the largest distance was preferentially selected as the new clustering center to avoid multiple initial centers gathering in the same cluster. Secondly, the data points in the split cluster were divided into different regions, and a data point was selected as the candidate center in each region to increase the diversity of the candidate center. Finally, for the clusters that failed to pair, the new split cluster was re-selected by gain to pair with the original deleted cluster again, so as to improve the pairing success rate and further reduce the objective function value. The experimental results show that compared with the I-k-means-+ algorithm, the proposed algorithm improves the accuracy of the solution by 6.47% on average while maintaining similar operational efficiency, and the clustering results are more stable. Compared with k-means and k-means++ algorithms, the proposed algorithm has higher solving accuracy.

Key words: cluster analysis, k-means algorithm, I-k-means-+ algorithm, min-max criterion, region division

CLC Number:

TP391

QU Fuheng, SONG Jianfei, YANG Yong, HU Yating, PAN Yuetao. I-k-means-+ Clustering Algorithm Based on min-max Criterion and Region Division[J].Journal of Jilin University Science Edition, 2023, 61(5): 1131-1138.

[1]	LI Changming, ZHANG Hongchen, WANG Chao, LI Xiaoguang, LU Yang, QIAN Chaoyue. An Efficient Yinyang k-Means Clustering Algorithm [J]. Journal of Jilin University Science Edition, 2021, 59(6): 1455-1460.
[2]	PU Xiaochuan, HUANG Junli, QI Ning, SONG Changsong. Application of K-Means Algorithm Based on Density Information Entropy in Customer Segmentation [J]. Journal of Jilin University Science Edition, 2021, 59(5): 1245-1251.
[3]	QI Xiangming, SUN Xujiao. Chinese Text Clustering Algorithm Based on Semantic Cluster [J]. Journal of Jilin University Science Edition, 2019, 57(5): 1193-1199.
[4]	JIANG Jianhua, WU Di, HAO Dehao, WANG Limin, ZHANG Yonggang, LI Keqin. Density Peaks Clustering Algorithm Based on CDbw and ABC Optimization#br# [J]. Journal of Jilin University Science Edition, 2018, 56(6): 1469-1475.
[5]	XIA Xuefei, HAN Xiao, LAN Tianshu, WANG Lihua, WU Jianan, ZHOU You. An Algorithm for Generation of Simulated MicroarrayData Based on Grey Value Interval [J]. Journal of Jilin University Science Edition, 2016, 54(06): 1401-1404.
[6]	JIANG Jianhua, YANG Yumian, BIAN Haiyan, KANG Jiarong, WANG Limin, LIU Ying. Application of ECommerce Sites Evaluation withImproved DBSCAN Clustering Algorithm [J]. Journal of Jilin University Science Edition, 2016, 54(02): 329-336.
[7]	QU Fu-Heng, HU Ya-Ting, MA Si-Liang, YU Li-Gong, SUN Shuang-Ci. A Convergence Theorem of Kernel Based FuzzycMeans Clustering Algorithm [J]. J4, 2011, 49(06): 1079-1086.
[8]	QU Fuheng, MA Siliang, HU Yating. A Kernel Based Fuzzy Clustering Algorithm [J]. J4, 2008, 46(06): 1137-1141.

I-k-means-+ Clustering Algorithm Based on min-max Criterion and Region Division

PDF (PC)

Like

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 8

Metrics

Comments

Recommended 0