Journal of Jilin University Science Edition ›› 2021, Vol. 59 ›› Issue (6): 1455-1460.

Previous Articles     Next Articles

An Efficient Yinyang k-Means Clustering Algorithm

LI Changming1, ZHANG Hongchen1, WANG Chao2, LI Xiaoguang2, LU Yang3, QIAN Chaoyue3   

  1. 1. Engineering Technology Research and Development Center, Changchun Guanghua University, Changchun 130033, China;
    2. School of Electrical Information, Changchun Guanghua University, Changchun 130033, China;  
    3. School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China
  • Received:2020-12-08 Online:2021-11-26 Published:2021-11-26

Abstract: Aiming at the problem that the traditional Yinyang algorithm did not use the data structure, resulting in low computational efficiency, we proposed an efficient Yinyang k-means clustering algorithm. The algorithm decomposed the original data layer by layer according to the data similarity, and established a full m-tree structure to store the data of each layer. The weighted data was established based on the data information stored in each leaf node of the tree structure, and the weighted Yinyang k-means algorithm was run to obtain the convergence center. In the original data, the convergence centers of the weighted data were taken as the initial condition to run the traditional Yinyang k-means algorithm to further optimize the objective function value. Comparative experiments with k-means, traditional Yinyang k-means and other two acceleration algorithms on five UCI data sets show that the proposed algorithm has a high acceleration ratio, and the solution accuracy is basically equivalent to Yinyang k-means clustering.

Key words: cluster analysis, Yinyang k-means algorithm, k-means , algorithm, data weighting

CLC Number: 

  • TP391