吉林大学学报(工学版) ›› 2013, Vol. 43 ›› Issue (02): 444-450.

• 论文 • 上一篇    下一篇

基于频繁传播模式的影响群落发现方法

刘丽娜1, 沈继红1,2, 朱强华3, 丁兆云4   

  1. 1. 哈尔滨工程大学 自动化学院, 哈尔滨 150001;
    2. 哈尔滨工程大学 理学院, 哈尔滨 150001;
    3. 海军工程大学 电子工程学院, 武汉 430033;
    4. 国防科技大学 计算机学院, 长沙 410073
  • 收稿日期:2012-06-05 出版日期:2013-03-01 发布日期:2013-03-01
  • 作者简介:刘丽娜(1985-),女,博士研究生.研究方向:复杂系统建模,系统可靠性与优化,数据分析.E-mail:liulinazb@163.com
  • 基金资助:

    国家自然科学基金项目(61202127).

Discovering tribe-leaders based on frequent pattern of propagation

LIU Li-na1, SHEN Ji-hong1,2, ZHU Qiang-hua3, DING Zhao-yun4   

  1. 1. College of Automation, Harbin Engineering University, Harbin 150001, China;
    2. College of Science, Harbin Engineering University, Harbin 150001, China;
    3. College of Electronic Engineering, Naval University of Engineering, Wuhan 430033, China;
    4. College of Computer, National University of Defense Technology, Changsha 410073, China
  • Received:2012-06-05 Online:2013-03-01 Published:2013-03-01

摘要: 针对传统研究大多基于影响个体挖掘而忽略了影响群落的发现,本文考虑用户之间的频繁传播模式,提出了一种基于频繁传播模式的影响群落挖掘方法。针对群落内部传播模式的多样化,给出了一种信息传播树扩展方法,通过松弛信息传播树有向特性与图扩展方法,将信息传播树转换为连通无向无环图。结合支持度与影响强度,提出了一种新的频繁子图挖掘算法Tribe-FGM,减小模式增长的规模,提高频繁子图挖掘效率。实验采用新浪微博真实数据,在约90万条博文以及对应约64万左右用户的"地震"话题与约31万条博文以及对应约21万左右用户的"两会"话题的数据集上验证了算法的性能和有效性。

关键词: 计算机应用, 社会网络, 频繁模式, 影响力

Abstract: A novel scheme of mining tribe-leaders was proposed based on the frequent pattern of propagation. In this scheme, first, a method to expend the information tree is applied to overcome the problem of multi-pattern propagation, in which the information propagation tree is converted into a connected and undirected acyclic graph. Then, considering the support and influent strength, a new frequent sub-graph mining method called Tribe-FGM is proposed to improve the efficiency of the graph mining by reducing the scale of pattern growth. A real dataset from sina microblog was taken in the experiment. The dataset is about topic of "earthquake", which contains 0.9 million posts and 0.6 million users, and the topic of the "two sessions", which contains about 0.31 million posts and 0.21 users. Experiment results validate the proposed scheme.

Key words: computer application, social network, frequent pattern, influence

中图分类号: 

  • TP391
[1] Weimann G. The Influentials: People Who Influence People[M]. Albany: State University of New York Press, 1994.

[2] Kempe D, Kleinberg J, Tardos . Maximizing the spread of influence through a social network//Proc of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2003.

[3] Chen Wei, Wang Chi, Wang Ya-jun. Scalable influence maximization for prevalent viral marketing in large scale social networks//Proc of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2010.

[4] Naohiro M, Yukio O, Mitsuru I. Influence diffusion model in text-Based communication[J]. Transactions of the Japanese Society for Artificial Intelligence, 2002, 17(3): 259-267.

[5] Song Xiaodan, Chi Yun, Koji Hino, et al. Identifying opinion leaders in the blogosphere//Proc of the 16th ACM conference on Conference on Information and Knowledge Management. New York: ACM, 2007.

[6] Pal A, Counts S. Identifying topical authorities in microblogs//Proc of the 4th ACM International Conference on Web Search and Data Mining. New York: ACM, 2011.

[7] Yan X, Han J. gSpan: Graph-based substructure patterns mining//Proce of the 2nd International Conference on Data Mining. NJ: IEEE,2002.
[1] 刘富,宗宇轩,康冰,张益萌,林彩霞,赵宏伟. 基于优化纹理特征的手背静脉识别系统[J]. 吉林大学学报(工学版), 2018, 48(6): 1844-1850.
[2] 王利民,刘洋,孙铭会,李美慧. 基于Markov blanket的无约束型K阶贝叶斯集成分类模型[J]. 吉林大学学报(工学版), 2018, 48(6): 1851-1858.
[3] 金顺福,王宝帅,郝闪闪,贾晓光,霍占强. 基于备用虚拟机同步休眠的云数据中心节能策略及性能[J]. 吉林大学学报(工学版), 2018, 48(6): 1859-1866.
[4] 赵东,孙明玉,朱金龙,于繁华,刘光洁,陈慧灵. 结合粒子群和单纯形的改进飞蛾优化算法[J]. 吉林大学学报(工学版), 2018, 48(6): 1867-1872.
[5] 刘恩泽,吴文福. 基于机器视觉的农作物表面多特征决策融合病变判断算法[J]. 吉林大学学报(工学版), 2018, 48(6): 1873-1878.
[6] 欧阳丹彤, 范琪. 子句级别语境感知的开放信息抽取方法[J]. 吉林大学学报(工学版), 2018, 48(5): 1563-1570.
[7] 刘富, 兰旭腾, 侯涛, 康冰, 刘云, 林彩霞. 基于优化k-mer频率的宏基因组聚类方法[J]. 吉林大学学报(工学版), 2018, 48(5): 1593-1599.
[8] 桂春, 黄旺星. 基于改进的标签传播算法的网络聚类方法[J]. 吉林大学学报(工学版), 2018, 48(5): 1600-1605.
[9] 刘元宁, 刘帅, 朱晓冬, 陈一浩, 郑少阁, 沈椿壮. 基于高斯拉普拉斯算子与自适应优化伽柏滤波的虹膜识别[J]. 吉林大学学报(工学版), 2018, 48(5): 1606-1613.
[10] 车翔玖, 王利, 郭晓新. 基于多尺度特征融合的边界检测算法[J]. 吉林大学学报(工学版), 2018, 48(5): 1621-1628.
[11] 赵宏伟, 刘宇琦, 董立岩, 王玉, 刘陪. 智能交通混合动态路径优化算法[J]. 吉林大学学报(工学版), 2018, 48(4): 1214-1223.
[12] 黄辉, 冯西安, 魏燕, 许驰, 陈慧灵. 基于增强核极限学习机的专业选择智能系统[J]. 吉林大学学报(工学版), 2018, 48(4): 1224-1230.
[13] 傅文博, 张杰, 陈永乐. 物联网环境下抵抗路由欺骗攻击的网络拓扑发现算法[J]. 吉林大学学报(工学版), 2018, 48(4): 1231-1236.
[14] 曹洁, 苏哲, 李晓旭. 基于Corr-LDA模型的图像标注方法[J]. 吉林大学学报(工学版), 2018, 48(4): 1237-1243.
[15] 侯永宏, 王利伟, 邢家明. 基于HTTP的动态自适应流媒体传输算法[J]. 吉林大学学报(工学版), 2018, 48(4): 1244-1253.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!