Journal of Jilin University (Information Science Edition) ›› 2026, Vol. 44 ›› Issue (3): 687-693.

Previous Articles     Next Articles

Mathematical Modeling of Similarity Clustering for Unstructured Data of Network Measurement Points

HU Junhua   

  1. Basic Medical College, Shaanxi University of Chinese Medicine, Xianyang 712046, China
  • Received:2024-06-05 Online:2026-06-02 Published:2026-06-02

Abstract: The unstructured data structure of network measurement points is not clear. In order to improve the similarity of clustering, a mathematical modeling method for clustering the similarity of unstructured data of network measurement points is studied. Using the method of unstructured data network partitioning, the unstructured data of network measurement points is transformed into semi-structured data, obtaining a semi- structured data meta path. The semi-structured data is decomposed into two non negative matrices using the non negative matrix decomposition method. The non negative matrices are multiplied and fitted, and the regularization coefficient is introduced in the process to establish a comprehensive similarity rectangle on the original path of the semi-structured data. This enables the highly similar network measurement point semi- structured data to establish a similar cluster indicator vector and construct a similarity clustering mathematical model. After the model iteration, the clustering results are more reasonable and consistent. The experimental results show that this method can effectively convert unstructured data from network measurement points into semi-structured data. The clustering density of unstructured data from network measurement points in similarity clustering is high, and the NMI(Normalized Mutual Information) value is distributed in a higher area. Its clustering performance for network measurement point unstructured data is good. 

Key words: network measurement points, unstructured data, similarity, mathematical modeling, non negative matrix factorization, similarity regularization term

CLC Number: 

  • TP311