网络测点非结构化数据相似性聚类数学建模

吉林大学学报(信息科学版) ›› 2026, Vol. 44 ›› Issue (3): 687-693.

网络测点非结构化数据相似性聚类数学建模

胡俊华

陕西中医药大学基础医学院,陕西咸阳712046

收稿日期:2024-06-05 出版日期:2026-06-02 发布日期:2026-06-02
作者简介:胡俊华(1983— ), 男, 陕西彬州人, 陕西中医药大学讲师, 主要从事计算数学与模型研究, (Tel)86-18220002524 (E-mail)hujunhua88888@163. com。
基金资助:
陕西中医药大学研究生教育教学改革创新基金资助项目(JGCX016)

Mathematical Modeling of Similarity Clustering for Unstructured Data of Network Measurement Points

HU Junhua

Basic Medical College, Shaanxi University of Chinese Medicine, Xianyang 712046, China

Received:2024-06-05 Online:2026-06-02 Published:2026-06-02

摘要/Abstract

摘要： 针对网络测点非结构化数据结构不明确的问题, 为提升聚类的相似度, 对网络测点非结构化数据相似性聚类数学建模方法进行了研究。使用非结构化数据网络划分方式将网络测点非结构化数据转换成半结构化数据, 得到半结构化数据元路径, 并以其为基础, 运用非负矩阵分解方法将半结构化数据分解成2个非负矩阵; 对非负矩阵进行相乘与拟合处理, 同时引入正则项系数与半结构化数据在其原路径建立相似度矩形上的综合相似度, 使具有高度相似性的网络测点半结构化数据建立相似的簇指示向量; 构建相似性聚类数学模型, 经过该模型迭代使聚类结果更加合理和一致。实验结果表明,该方法可有效将网络测点非结构化数据转换成半结构化数据, 相似性聚类网络测点非结构化数据聚类的疏密度数值较高, 归一化互信息(NMI:Normalized Mutual Information)数值分布在较高区域, 其对网络测点非结构化数据相似性聚类性能较好。

关键词: 网络测点, 非结构化数据, 相似性, 数学建模, 非负矩阵分解, 相似性正则项

Abstract: The unstructured data structure of network measurement points is not clear. In order to improve the similarity of clustering, a mathematical modeling method for clustering the similarity of unstructured data of network measurement points is studied. Using the method of unstructured data network partitioning, the unstructured data of network measurement points is transformed into semi-structured data, obtaining a semi- structured data meta path. The semi-structured data is decomposed into two non negative matrices using the non negative matrix decomposition method. The non negative matrices are multiplied and fitted, and the regularization coefficient is introduced in the process to establish a comprehensive similarity rectangle on the original path of the semi-structured data. This enables the highly similar network measurement point semi- structured data to establish a similar cluster indicator vector and construct a similarity clustering mathematical model. After the model iteration, the clustering results are more reasonable and consistent. The experimental results show that this method can effectively convert unstructured data from network measurement points into semi-structured data. The clustering density of unstructured data from network measurement points in similarity clustering is high, and the NMI(Normalized Mutual Information) value is distributed in a higher area. Its clustering performance for network measurement point unstructured data is good.

Key words: network measurement points, unstructured data, similarity, mathematical modeling, non negative matrix factorization, similarity regularization term

中图分类号:

TP311

胡俊华 . 网络测点非结构化数据相似性聚类数学建模[J]. 吉林大学学报(信息科学版), 2026, 44(3): 687-693.

HU Junhua. Mathematical Modeling of Similarity Clustering for Unstructured Data of Network Measurement Points[J]. Journal of Jilin University (Information Science Edition), 2026, 44(3): 687-693.

[1]	任伟建, 张紫汉, 康朝海, 霍凤财, 孙勤江, 陈建玲. Mamba-SoftBBS: 改进DCP 的点云配准方法[J]. 吉林大学学报(信息科学版), 2026, 44(3): 663-669.
[2]	江明泽, 李伟, 董丹. 基于鲁棒子空间聚类算法的多来源数据集成处理方法[J]. 吉林大学学报(信息科学版), 2026, 44(3): 625-631.
[3]	越缙, 周飞. 基于双层注意力的多源全媒体交互信息相似性搜索算法 [J]. 吉林大学学报(信息科学版), 2026, 44(2): 453-459.
[4]	关峥. 融合K-均值和帧间相似性的影视视频关键帧提取算法 [J]. 吉林大学学报(信息科学版), 2025, 43(6): 1381-1387.
[5]	贾学萍, 刘永志. 直觉模糊集相似性及在边坡评价中的应用[J]. 吉林大学学报(信息科学版), 2025, 43(4): 863-869.
[6]	段锦, 郝水莲, 高美玲, 黄丹丹, 朱文博, 付为杰. 低照度彩色偏振图像增强算法[J]. 吉林大学学报(信息科学版), 2025, 43(3): 671-681.
[7]	韩云娜. 基于优先级的网络链路拥塞自动控制数学建模[J]. 吉林大学学报(信息科学版), 2025, 43(2): 296-302.
[8]	李宏, 齐涵, 刘庆强, 李富, 吴丽. 基于双度量约束的拉普拉斯特征映射[J]. 吉林大学学报(信息科学版), 2021, 39(4): 368-375.
[9]	徐世福, 蒋亚南. 机械故障稀疏特征相似性度量优化研究[J]. 吉林大学学报(信息科学版), 2020, 38(2): 154-159.
[10]	张雨烟, 陈万忠, 张涛, 李明阳. 基于非负矩阵分解的癫痫脑电自动检测[J]. 吉林大学学报(信息科学版), 2017, 35(5): 551-559.
[11]	朱波, 郑虹, 孙琳琳, 杨友星. 基于AST的程序代码相似性度量研究[J]. 吉林大学学报(信息科学版), 2015, 33(1): 99-104.
[12]	孟丽茹, 赵岩, 王世刚, 陈贺新. 基于2D视觉注意模型的全参考图像质量评价方法[J]. 吉林大学学报(信息科学版), 2014, 32(6): 563-568.
[13]	贺海涛, 郑山红, 侯丽鑫, 王国春, 王璐. 基于中文文本的疾病领域本体学习的研究[J]. 吉林大学学报(信息科学版), 2014, 32(1): 76-81.
[14]	周鹏宇, 杨欣, 周大可, 刘加. 基于非负矩阵分解的阴影检测方法[J]. 吉林大学学报(信息科学版), 2013, 31(6): 575-581.
[15]	孙兵\|刘雯\|田地\|宋桐\|富妍. 基于时间序列的数据挖掘在证券中的应用[J]. J4, 2010, 28(03): 270-.