吉林大学学报(信息科学版) ›› 2026, Vol. 44 ›› Issue (3): 687-693.

• • 上一篇    下一篇

网络测点非结构化数据相似性聚类数学建模

胡俊华    

  1. 陕西中医药大学基础医学院,陕西咸阳712046
  • 收稿日期:2024-06-05 出版日期:2026-06-02 发布日期:2026-06-02
  • 作者简介:胡俊华(1983— ), 男, 陕西彬州人, 陕西中医药大学讲师, 主要从事计算数学与模型研究, (Tel)86-18220002524 (E-mail)hujunhua88888@163. com。
  • 基金资助:
    陕西中医药大学研究生教育教学改革创新基金资助项目(JGCX016)

Mathematical Modeling of Similarity Clustering for Unstructured Data of Network Measurement Points

HU Junhua   

  1. Basic Medical College, Shaanxi University of Chinese Medicine, Xianyang 712046, China
  • Received:2024-06-05 Online:2026-06-02 Published:2026-06-02

摘要: 针对网络测点非结构化数据结构不明确的问题为提升聚类的相似度对网络测点非结构化数据相似性聚类数学建模方法进行了研究。 使用非结构化数据网络划分方式将网络测点非结构化数据转换成半结构化数据, 得到半结构化数据元路径并以其为基础运用非负矩阵分解方法将半结构化数据分解成2个非负矩阵; 对非负矩阵进行相乘与拟合处理同时引入正则项系数与半结构化数据在其原路径建立相似度矩形上的综合相似度使具有高度相似性的网络测点半结构化数据建立相似的簇指示向量构建相似性聚类数学模型经过该模型迭代使聚类结果更加合理和一致。实验结果表明,该方法可有效将网络测点非结构化数据转换成半结构化数据相似性聚类网络测点非结构化数据聚类的疏密度数值较高归一化互信息(NMI:Normalized Mutual Information)数值分布在较高区域, 其对网络测点非结构化数据相似性聚类性能较好。

关键词: 网络测点, 非结构化数据, 相似性, 数学建模, 非负矩阵分解, 相似性正则项

Abstract: The unstructured data structure of network measurement points is not clear. In order to improve the similarity of clustering, a mathematical modeling method for clustering the similarity of unstructured data of network measurement points is studied. Using the method of unstructured data network partitioning, the unstructured data of network measurement points is transformed into semi-structured data, obtaining a semi- structured data meta path. The semi-structured data is decomposed into two non negative matrices using the non negative matrix decomposition method. The non negative matrices are multiplied and fitted, and the regularization coefficient is introduced in the process to establish a comprehensive similarity rectangle on the original path of the semi-structured data. This enables the highly similar network measurement point semi- structured data to establish a similar cluster indicator vector and construct a similarity clustering mathematical model. After the model iteration, the clustering results are more reasonable and consistent. The experimental results show that this method can effectively convert unstructured data from network measurement points into semi-structured data. The clustering density of unstructured data from network measurement points in similarity clustering is high, and the NMI(Normalized Mutual Information) value is distributed in a higher area. Its clustering performance for network measurement point unstructured data is good. 

Key words: network measurement points, unstructured data, similarity, mathematical modeling, non negative matrix factorization, similarity regularization term

中图分类号: 

  • TP311