Application of K-Means Algorithm Based on Density Information Entropy in Customer Segmentation

Journal of Jilin University (Science Edition), 2021, Vol. 59, Issue 5: 1245-1251.

PU Xiaochuan^1,2, HUANG Junli^2,3, QI Ning^2,4, SONG Changsong^2

1. School of Information Engineering, Zunyi Normal University, Zunyi 563006, Guizhou Province, China;
2. Graduate School of Management of Technology, Pukyong National University, Busan 48513, South Korea;
3. School of Management, Zunyi Normal University, Zunyi 563006, Guizhou Province, China;
4. School of Economics and Management, Hexi University, Zhangye 734000, Gansu Province, China

Received: 2020-07-02 Online: 2021-09-26 Published: 2021-09-26
Corresponding author: PU Xiaochuan, E-mail: puxiaochuan78906@yeah.net

Abstract

Abstract: To better capture the value of enterprise customers, we propose an improved customer segmentation model, TFA, which uses customer development space T, purchase frequency F, and average purchase amount A as indicators, so that both a customer's value and development space are reflected. First, local density ρ and information entropy H are introduced into the K-means clustering algorithm to address the initial cluster center selection problem of traditional K-means. Second, a machine learning framework is built, and clustering experiments are carried out on selected artificial and real data sets to verify the effectiveness of the model. The experimental results show that the model classifies customers effectively and fully reflects customer value and development space, and that the improved clustering algorithm raises the efficiency of the algorithm.

Key words: customer classification, customer development space, K-means algorithm, initial clustering center, density information entropy
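The abstract names the ingredients (local density ρ, information entropy H, K-means initial centers) but does not give their formulas, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes ρ is the number of neighbours within a cutoff distance `d_c`, H is the Shannon entropy of the normalised densities and is used only to pick `d_c` from a candidate list, and initial centers are chosen greedily as dense points far from those already chosen. All function names, the candidate cutoffs, and the synthetic three-column data standing in for standardised T, F, A values are hypothetical.

```python
import numpy as np


def local_density(X, d_c):
    """Assumed definition: rho_i = number of other points within d_c of point i."""
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    rho = (dist < d_c).sum(axis=1) - 1  # exclude the point itself
    return rho, dist


def density_entropy(rho):
    """Shannon entropy H of the normalised densities (assumed role of H)."""
    total = rho.sum()
    if total == 0:
        return np.inf  # no neighbours at all: never prefer this cutoff
    p = rho / total
    p = p[p > 0]
    return float(-(p * np.log(p)).sum())


def choose_cutoff(X, candidates):
    """Assumed criterion: keep the candidate cutoff with the lowest entropy."""
    return min(candidates, key=lambda d: density_entropy(local_density(X, d)[0]))


def init_centers(X, k, d_c):
    """Greedy choice: densest point first, then points maximising rho * distance."""
    rho, dist = local_density(X, d_c)
    chosen = [int(np.argmax(rho))]
    while len(chosen) < k:
        min_dist = dist[:, chosen].min(axis=1)  # distance to nearest chosen center
        chosen.append(int(np.argmax(rho * min_dist)))
    return X[chosen].astype(float)


def kmeans(X, centers, max_iter=100):
    """Standard Lloyd iterations starting from the supplied centers."""
    centers = centers.copy()
    for n_iter in range(1, max_iter + 1):
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(len(centers))  # an empty cluster keeps its old center
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers, n_iter


# Synthetic stand-in for standardised (T, F, A) customer features: three groups.
rng = np.random.default_rng(0)
true_means = np.array([[0.0, 0.0, 0.0], [5.0, 5.0, 5.0], [10.0, 0.0, 5.0]])
X = np.vstack([m + 0.5 * rng.standard_normal((50, 3)) for m in true_means])

d_c = choose_cutoff(X, candidates=[0.5, 1.0, 2.0])
centers0 = init_centers(X, k=3, d_c=d_c)
labels, centers, n_iter = kmeans(X, centers0)
print("cutoff:", d_c, "iterations:", n_iter)
```

Seeding K-means with dense, mutually distant points is what makes the initial centers deterministic here; on real customer data the three indicators would normally be scaled to comparable ranges first, since Euclidean distance is sensitive to units.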

CLC number: TP391

PU Xiaochuan, HUANG Junli, QI Ning, SONG Changsong. Application of K-Means Algorithm Based on Density Information Entropy in Customer Segmentation[J]. Journal of Jilin University (Science Edition), 2021, 59(5): 1245-1251.
