吉林大学学报(信息科学版) ›› 2015, Vol. 33 ›› Issue (5): 564-.

• 论文 • 上一篇    下一篇

基于最小距离乘积K-eans 算法的改进

贺嘉楠1, 高云龙1, 王宏杰2, 朱琪1, 董立岩1   

  1. 1. 吉林大学计算机科学与技术学院, 长春130012; 2. 长春轨道客车股份有限公司转向架制造中心, 长春130062
  • 收稿日期:2015-06-21 出版日期:2015-09-30 发布日期:2015-12-30
  • 作者简介:贺嘉楠(1990—), 女, 陕西扶风人, 吉林大学硕士研究生, 主要从事数据挖掘研究, (Tel)86-3104443495(E-mail)hejn2013@163. com; 通讯作者: 董立岩(1966—), 男, 长春人, 吉林大学教授, 博士生导师, 主要从事数据挖掘研究,(Tel)86-8604315166(E-ail)dongly@ jlu. edu. cn。
  • 基金资助:

    国家青年自然科学基金资助项目(61300145)

Improved K-eans Algorithm Based on Min-istance Product

HE Jianan1, GAO Yunlong1, WANG Hongjie2, ZHU Qi1, DONG Liyan1   

  1. 1. College of Computer Science and Technology, Jilin University, Changchun 130012, China;
    2. Bogie Manufacturing Center, Changchun Railway Vehicles Limited Company, Changchun 130062, China
  • Received:2015-06-21 Online:2015-09-30 Published:2015-12-30

摘要:

针对传统K-eans 算法因初始聚类中心的随机性而导致聚类结果产生很大的波动性问题, 提出一种基于最小距离乘积聚类算法CAMDP(Clustering Algorithm based on Min-Distance Product), 利用数次抽样技术, 在得到的聚类中心集合上继续使用最小乘积法寻找最佳的初始聚类中心, 较大程度减少了K-eans聚类算法对初值选取的随机性。实验结果表明: 改进后的K-eans算法既考虑了网络结构的拓扑信息, 又考虑了节点的属性特征, 为社区划分提供了有力的决策支持。

关键词: 社区结构, 聚类, 社会关系, 聚类中心

Abstract:

Traditional K-eans algorithm of the initial clustering center is randomly generated, which can lead to roduce very big volatility clustering results. In order to solve this problem, We propose a algorithm named lustering algorithm based on min-istance Product. With the method of sampling, CAMDP(Clustering Algorithm ased on Min-istance Product) produces selected point which has minimum product of distances between itself nd all other initialized clustering centers, which improves the selecting of the initial value of the K-eans lgorithm, avoiding the random selected clustering centers. The results show that the topological feature is onsidered and the attributes of vertex are taken into account, which let the improved K-eans provide the strong upport to the division of community.

Key words: community structure, clustering, social relations, clustering centers

中图分类号: 

  • TP391