Journal of Jilin University (Information Science Edition) ›› 2024, Vol. 42 ›› Issue (2): 307-311.


Research on Detection Algorithm of Oil and Gas IoT Data Contamination

GUO Yaru 1a, LIU Miao 1b,2, NIE Zhongwen 3

  1a. College of Physics and Electronic Engineering, Northeast Petroleum University, Daqing 163318, China; 1b. Qinhuangdao Campus, Northeast Petroleum University, Qinhuangdao 066044, China; 2. School of Electronic Information Engineering, Wuxi University, Wuxi 210044, China; 3. Smart Energy Institute, Shanghai Gas Engineering Design and Research Company Limited, Shanghai 200120, China
  • Received: 2023-03-31 Online: 2024-04-10 Published: 2024-04-12
  • Corresponding author: LIU Miao (born 1980), female, from Urumqi, Ph.D., professor at Northeast Petroleum University and at Wuxi University, doctoral supervisor; research interests: IoT security and performance optimization. Tel: 86-18910287807; E-mail: lm_jlu@163.com
  • About the author: GUO Yaru (born 1998), female, from Heze, Shandong, master's student at Northeast Petroleum University; research interests: edge computing and deep learning. Tel: 86-13061599589; E-mail: guoyaru0@163.com
  • Funding:
    Natural Science Foundation of Heilongjiang Province (LH2022F004)

Abstract: The rapid growth in the number of devices connected to the Oil and Gas Internet of Things (OGIoT) leaves the edge nodes of Edge Computing (EC) systems short of computing power, and these nodes struggle to identify malicious attacks from other edge nodes, which can lead to service collapse. To address these problems, EMLDI (Efficient Machine Learning Method for Improved Data Contamination Detection of Oil and Gas IoT) is proposed. It targets the large fluctuations and inaccuracy in edge node computation results that arise when edge nodes lack robustness and the data are distorted or mildly altered. The network is trained on a data set expanded by adding Gaussian Noise (GN) to randomly selected batches of samples, which gives the network broader data-fitting and prediction capability and addresses the systemic collapse that occurs when severely corrupted data prevent edge nodes from computing correctly. Experimental results show that the algorithm identifies noise-contaminated and random-label-contaminated samples more effectively, and that it reaches its best performance within the specified number of training batches.

Key words: oil and gas IoT, Gaussian noise, data pollution, machine learning
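
The data-expansion step described in the abstract (adding Gaussian noise to a randomly selected batch of samples) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `augment_with_gaussian_noise`, the noise standard deviation `sigma`, the batch size, the `is_noisy` marker, and the synthetic data are all assumptions made for illustration, since the abstract does not state these details.

```python
import random


def augment_with_gaussian_noise(samples, labels, batch_size, sigma=0.1, seed=None):
    """Return the data set expanded with noisy copies of a random batch.

    Hypothetical helper: sigma and batch_size are illustrative values,
    not parameters taken from the paper.
    """
    rng = random.Random(seed)
    # Pick batch_size distinct sample indices at random.
    idx = rng.sample(range(len(samples)), batch_size)
    # Add zero-mean Gaussian noise to every feature of the selected samples.
    noisy = [[v + rng.gauss(0.0, sigma) for v in samples[i]] for i in idx]
    # Noisy copies keep the label of the sample they were derived from.
    aug_samples = samples + noisy
    aug_labels = labels + [labels[i] for i in idx]
    # Mark which rows of the expanded set are noisy copies.
    is_noisy = [False] * len(samples) + [True] * batch_size
    return aug_samples, aug_labels, is_noisy


# Synthetic stand-in for sensor readings: 100 samples, 8 features, 3 classes.
data_rng = random.Random(0)
x = [[data_rng.random() for _ in range(8)] for _ in range(100)]
y = [data_rng.randrange(3) for _ in range(100)]

aug_x, aug_y, is_noisy = augment_with_gaussian_noise(x, y, batch_size=32, seed=1)
print(len(aug_x), len(aug_y), sum(is_noisy))
```

The original samples are left unchanged and the noisy copies are appended, so a network trained on the expanded set sees both clean and perturbed versions of the same readings.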

CLC Number:

  • TP393