Tool State Analysis Based on Improved Nonparametric K-means Algorithm

Journal of Jilin University (Information Science Edition), 2023, Vol. 41, Issue 5: 930-937.


WU Xiaoyong, HOU Qiufeng, LUO Yong

Product Development Department, Zhejiang Xianglong Machinery Company Limited, Ningbo 315311, Zhejiang, China

Received: 2023-07-16 Online: 2023-10-09 Published: 2023-10-11
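Corresponding author: LUO Yong (born 1978), male, from Zizhong, Sichuan, engineer at Zhejiang Xianglong Machinery Company Limited; research interest: automotive transmission systems. Tel: 86-13819851794; E-mail: lawren.luo@cn-sps.com
About the author: WU Xiaoyong (born 1986), male, from Zhangzhou, Fujian, engineer at Zhejiang Xianglong Machinery Company Limited; research interest: automotive transmission systems. Tel: 86-15058457890; E-mail: xiaoyong.wu@cn-sps.com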
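Supported by:
The 2021 Ningbo Second Batch of Major Science and Technology Research and "Open Competition" (揭榜挂帅) Fund Project (Science and Technology Innovation 2025 Major Special Project (2022Z018))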


Abstract

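The K-means algorithm requires the number of clusters to be set manually and selects its initial cluster centers at random, which can trap the result in a local optimum. To address this, an improved parameter-free K-means algorithm is proposed by combining K-means with the density-peak-based clustering algorithm CFSFDP (Clustering by Fast Search and Find of Density Peaks). First, the local density and dispersion of each sample point are calculated. Then a decision graph is built: the two parameters are combined into a vector, the distance from each point to its 5 surrounding points is calculated, and the points whose distance exceeds 2 times the standard deviation and whose density exceeds the average density are selected as the initial cluster centers. The number of such centers, k, is taken as the number of clusters, and k together with the initial cluster centers is passed to the K-means algorithm as its initial parameters to cluster the data. Finally, three different types of data sets are clustered: UCI (University of California, Irvine) data sets, artificially generated Gaussian data sets, and a real tool vibration data set. The results show that the proposed algorithm retains the global optimality of the traditional algorithm, which confirms its effectiveness. Because K-means is an unsupervised clustering method, it yields good tool state recognition results while reducing the workload and computational cost of manual data labeling and supervised training, which is of practical value for accurately extracting the operating state of the cutting tools of numerical control machine tools in real time.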
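The selection procedure outlined in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation, and it rests on several assumptions the abstract does not spell out: "dispersion" is read as the CFSFDP separation (distance to the nearest point of higher density), the cutoff distance is taken as the 2nd percentile of pairwise distances, both decision-graph axes are min-max scaled, "distance to the surrounding 5 points" is read as the mean distance to the 5 nearest neighbours in the decision graph, and the threshold is taken literally as 2 times the standard deviation of those distances. All function names are hypothetical.

```python
import numpy as np


def decision_graph(X, dc_percentile=2.0):
    """Return CFSFDP-style local density rho and separation delta per point."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    # Assumption: cutoff distance = a small percentile of all pairwise distances.
    dc = np.percentile(d[np.triu_indices(len(X), k=1)], dc_percentile)
    # Gaussian-kernel density; subtract 1 to drop each point's own contribution.
    rho = np.exp(-(d / dc) ** 2).sum(axis=1) - 1.0
    delta = np.empty(len(X))
    for i in range(len(X)):
        higher = rho > rho[i]
        # Densest point: use its largest distance to any other point.
        delta[i] = d[i, higher].min() if higher.any() else d[i].max()
    return rho, delta


def pick_initial_centers(X, n_neighbors=5, n_std=2.0):
    """Indices of points that stand out in the decision graph and are dense."""
    rho, delta = decision_graph(X)
    G = np.column_stack([rho, delta])
    # Assumption: min-max scaling so both axes contribute comparably.
    G = (G - G.min(axis=0)) / (G.max(axis=0) - G.min(axis=0))
    g = np.linalg.norm(G[:, None, :] - G[None, :, :], axis=-1)
    # Mean distance to the 5 nearest neighbours (column 0 is the point itself).
    knn = np.sort(g, axis=1)[:, 1:n_neighbors + 1].mean(axis=1)
    mask = (knn > n_std * knn.std()) & (rho > rho.mean())
    return np.flatnonzero(mask)


def kmeans(X, centers, n_iter=100):
    """Plain Lloyd iterations started from the given centers."""
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=-1)
        labels = dist.argmin(axis=1)
        # An empty cluster keeps its previous center.
        new = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                        else centers[j] for j in range(len(centers))])
        if np.allclose(new, centers):
            break
        centers = new
    return labels, centers


def parameter_free_kmeans(X):
    """Derive k and the initial centers from the data, then run K-means."""
    idx = pick_initial_centers(X)
    if idx.size == 0:
        raise ValueError("no point passed both thresholds")
    return kmeans(X, X[idx])


# Synthetic Gaussian data (illustrative only, not the paper's data sets).
rng = np.random.default_rng(0)
blob_means = np.array([[0.0, 0.0], [10.0, 0.0], [5.0, 9.0]])
X = np.vstack([rng.normal(m, 0.5, size=(60, 2)) for m in blob_means])
labels, centers = parameter_free_kmeans(X)
print(len(centers), "clusters found")
```

On well-separated blobs such as these, the number of selected centers would be expected to be close to the number of blobs, but the thresholds are reproduced literally from the abstract and have not been tuned, so the count may differ on other data.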

Key words: K-means clustering algorithm, nonparametric, numerical control machine, tool wear identification


CLC Number: TP312

WU Xiaoyong, HOU Qiufeng, LUO Yong. Tool State Analysis Based on Improved Nonparametric K-means Algorithm[J]. Journal of Jilin University (Information Science Edition), 2023, 41(5): 930-937.
