Journal of Jilin University Science Edition ›› 2023, Vol. 61 ›› Issue (4): 899-908.
YAN Chen, YANG Youlong, LIU Yuanyuan
Abstract: Existing ensemble clustering algorithms usually use the K-means algorithm as the base clustering generator. Although this ensures diversity among the clustering members, it ignores the fact that poor base clusterings can seriously disturb the final clustering result. To address this problem, we propose a two-stage ensemble algorithm based on clustering quality. Because the K-means algorithm runs efficiently but produces relatively coarse clusterings, in the generation stage we first use K-means to generate the base clustering members and then use a group agreement measure to select members with both high quality and strong diversity, which form the candidate ensemble. In the ensemble stage, information entropy is further applied to construct the weighted-clustering co-association matrix. Finally, the final clustering result is obtained by using a consensus function. Comparative experiments with three evaluation indexes on ten real datasets show that the algorithm effectively improves the accuracy of the clustering results while maintaining good robustness.
Key words: ensemble clustering, clustering quality, group agreement, information entropy, consensus function
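The ensemble stage described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's weighting formula, so the entropy-based cluster weight used here (an exponential of the negated, normalized entropy of a cluster with respect to all base clusterings) is an assumption borrowed from common locally weighted ensemble schemes, not the authors' definition. The function names `cluster_entropy_weights` and `weighted_co_association` and the parameter `theta` are hypothetical.

```python
import math
from collections import defaultdict


def cluster_entropy_weights(partitions, theta=0.5):
    """Assumed weighting (not taken from the paper): a cluster that is split
    up by the other base clusterings has high entropy and gets a low weight.

    partitions: list of label lists, one per base clustering.
    Returns one dict per partition mapping cluster label -> weight in (0, 1].
    """
    n_parts = len(partitions)
    # Collect, for every base clustering, the set of point indices per cluster.
    members = []
    for labels in partitions:
        groups = defaultdict(set)
        for idx, lab in enumerate(labels):
            groups[lab].add(idx)
        members.append(groups)

    weights = []
    for groups in members:
        w = {}
        for lab, pts in groups.items():
            entropy = 0.0
            for other in members:
                for other_pts in other.values():
                    # Fraction of this cluster that falls into the other cluster.
                    p = len(pts & other_pts) / len(pts)
                    if p > 0:
                        entropy -= p * math.log2(p)
            w[lab] = math.exp(-entropy / (theta * n_parts))
        weights.append(w)
    return weights


def weighted_co_association(partitions, weights):
    """Average, over base clusterings, the weight of the cluster that two
    points share (0 if they are in different clusters)."""
    n = len(partitions[0])
    n_parts = len(partitions)
    co = [[0.0] * n for _ in range(n)]
    for labels, w in zip(partitions, weights):
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    co[i][j] += w[labels[i]] / n_parts
    return co
```

In this sketch, the base clusterings would come from repeated K-means runs after the selection step, and the resulting matrix would be passed to a consensus function. The abstract does not name that function; hierarchical clustering on the matrix is one common choice.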
YAN Chen, YANG Youlong, LIU Yuanyuan. Two Stage Ensemble Algorithm Based on Clustering Quality[J]. Journal of Jilin University Science Edition, 2023, 61(4): 899-908.
URL: http://xuebao.jlu.edu.cn/lxb/EN/
http://xuebao.jlu.edu.cn/lxb/EN/Y2023/V61/I4/899