吉林大学学报(理学版) ›› 2019, Vol. 57 ›› Issue (3): 583-590.

• 计算机科学 • 上一篇    下一篇

基于节点相似性和链接次数组合时间序列的链接预测

魏晓辉1,2, 许国威1, 王兴旺1, 徐海啸1,2   

  1. 1. 吉林大学  计算机科学与技术学院, 长春 130012; 2. 吉林大学 高性能计算中心, 长春 130012
  • 收稿日期:2017-03-08 出版日期:2019-05-26 发布日期:2019-05-20
  • 通讯作者: 徐海啸 E-mail:haixiao@jlu.edu.cn

Link Prediction Based on Time Series Combined\=from Node Similarities and Link Number

WEI Xiaohui1,2, XU Guowei1, WANG Xingwang1, XU Haixiao1,2#br#   

  1. 1. College of Computer Science and Technology, Jilin University, Changchun 130012, China;2. High Performance Computing Centre, Jilin University, Changchun 130012, China
  • Received:2017-03-08 Online:2019-05-26 Published:2019-05-20
  • Contact: XU Haixiao E-mail:haixiao@jlu.edu.cn

摘要: 针对现有方法利用网络信息相对割裂, 很难描述链接次数与相似性分数关系的问题, 提出一种动态网络中的链接预测方法, 用节点相似性分数和链接次数组合时间序列模型进行预测. 首先通过社区演化, 预测历史上各时间快照下节点间的相似性分数; 然后用二元时间序列模型将其与各时间快照下节点间的事实链接次数相结合, 判断下一个时间段内各节点对链接发生的可能性; 最后在WeiboNetTweet微博转发数据集上进行测试. 实验结果表明, 该方法至少提高了5%的预测准确度, 证明了社区演化与链接预测之间的内在联系, 验证了二元时间序列模型的有效性.

关键词: 链接预测, 社区演化, 时间序列, 节点相似性

Abstract: Aiming at the problem that the existing methods made use of the relative fragmentation of network information, and it was difficult to describe the relationship between link number and similarity score, we proposed a link prediction method in dynamic networks, which used time series combined from node similarities scores and link mumber to predict. Firstly, it predicted similarity scores of all time snapshots by community evolution. Secondly, nodes similarities were combined with real link number by binary time series model and the probability of link between all node pairs in next period was predicted. Finally, the test was carried out on a dataset forwarded by WeiboNetTweet microblog. The experimental results show that the method improves the prediction accuracy by at least 5%, proves the intrinsic relationship between community evolution and link prediction, and verifies the effectiveness of binary time series model.

Key words: link prediction, community evolution, time series, node similarity

中图分类号: 

  • TP311