吉林大学学报(理学版) ›› 2022, Vol. 60 ›› Issue (1): 143-0149.

• • 上一篇    下一篇

基于信息熵度量的局部线性嵌入算法

刘均1, 宫子栋1, 吴力2   

  1. 1. 东北石油大学 电气信息工程学院, 黑龙江 大庆 163318; 2. 大庆油田有限责任公司 天然气分公司培训中心, 黑龙江 大庆 163453
  • 收稿日期:2020-12-01 出版日期:2022-01-26 发布日期:2022-01-26
  • 通讯作者: 宫子栋 E-mail: 17853462089@163.com

Local Linear Embedding Algorithm Based on Information Entropy Measurement 

LIU Jun1, GONG Zidong1, WU Li2   

  1. 1. School of Electrical Engineering & Information, Northeast Petroleum University, Daqing 163318, Heilongjiang Province, China;  2. Natural Gas Branch Training Center, Daqing Oil Field Co., Ltd, Daqing 163453, Heilongjiang Province, China
  • Received:2020-12-01 Online:2022-01-26 Published:2022-01-26

摘要: 针对局部线性嵌入算法使用欧氏距离计算非对齐样本相似性时, 受数据位置差影响较大, 导致度量精度较低, 影响算法特征提取精度的问题, 提出一种基于信息熵度量的局部线性嵌入算法. 首先利用信息熵统计样本特征间的混乱程度, 提高划分局部邻域的准确性; 然后建立局部重构模型, 挖掘出流形的本质结构; 最后利用局部结构构建低维重构模型, 以获得样本的显著特征. 通过在轴承数据集上的实验证明了该算法在特征提取方面的有效性.

关键词: 局部线性嵌入, 特征提取, 信息熵, 数据对齐

Abstract: Aiming at the problem that when the local linear embedding algorithm used Euclidean distance to calculate the similarity of unaligned samples, it was greatly affected by data position difference, resulting in low measurement accuracy and affecting the accuracy of the algorithm feature extraction, we proposed a local linear embedding algorithm based on information entropy measurement. Firstly, the degree of confusion among sample features was counted by information entropy to improve the accuracy of dividing local neighborhoods. Secondly, a local reconstruction model was established to dig out the essential structure of the manifold. Finally, the local structure was used to build a low-dimensional reconstruction model to obtain the salientfeatures of the samples. The experiment on the bearing data set proves the effectiveness of the algorithm in feature extraction.

Key words: local linear embedding, feature extraction, information entropy, data alignment

中图分类号: 

  • TP391