Algorithm for Target Coverage Problem Based on Deep Q Learning in Wireless Sensor Networks

Journal of Jilin University (Science Edition), 2023, Vol. 61, Issue 6: 1432-1440.

Corresponding author: HE Huaiqing, E-mail: hqhe@cauc.edu.cn

Algorithm for Target Coverage Problem Based on Deep Q Learning in Wireless Sensor Networks

GAO Sihua¹,², GU Han¹, HE Huaiqing¹, ZHOU Gang³

1. College of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, China;
2. College of Computer Science and Technology, Jilin University, Changchun 130012, China;
3. Department of Science and Technology Management, TravelSky Technology Limited, Beijing 101300, China

Received: 2022-12-20 Online: 2023-11-26 Published: 2023-11-26

Abstract

Abstract: To address the unclear mechanism of node activation strategies and the redundancy of feasible solution sets in solving the target coverage problem in wireless sensor networks, we propose a target coverage algorithm based on deep Q learning that learns the scheduling strategy of nodes in the network. First, the algorithm abstracts the construction of the feasible solution set as a Markov decision process, in which the agent selects the sensor node to activate as a discrete action according to the network environment. Second, the reward function evaluates the quality of the agent's chosen action in terms of the coverage capacity of the activated node and its residual energy. Simulation results show that the algorithm is effective in network environments of different scales, and that the resulting network lifetime exceeds that of three greedy algorithms, the maximum lifetime coverage algorithm, and the adaptive learning automaton algorithm.

Key words: target coverage problem, deep Q learning, wireless sensor networks, reinforcement learning
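
The Markov decision process outlined in the abstract can be illustrated with a small sketch. This is not the paper's implementation: the class `CoverSetEnv`, the weight `alpha`, the linear form of the reward, the zero reward for redundant sensors, and the greedy rollout (used here in place of a trained deep Q-network) are all assumptions made for illustration. The sketch only shows how activating one sensor per step can be scored by coverage capacity and residual energy while a cover set is being built.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class CoverSetEnv:
    """Toy MDP for building one cover set (all names are hypothetical)."""

    coverage: list[frozenset[int]]  # coverage[i] = targets sensor i can monitor
    energy: list[float]             # residual energy of each sensor
    n_targets: int
    initial_energy: float = 1.0
    alpha: float = 0.5              # assumed weight between the two reward terms
    active: set[int] = field(default_factory=set)
    covered: set[int] = field(default_factory=set)

    def valid_actions(self) -> list[int]:
        # A sensor may be activated if it is not yet active and has energy left.
        return [i for i, e in enumerate(self.energy)
                if i not in self.active and e > 0]

    def reward(self, action: int) -> float:
        # Assumed reward: weighted sum of newly covered targets and residual
        # energy; a sensor that adds no new coverage is treated as redundant.
        newly = self.coverage[action] - self.covered
        if not newly:
            return 0.0
        coverage_term = len(newly) / self.n_targets
        energy_term = self.energy[action] / self.initial_energy
        return self.alpha * coverage_term + (1 - self.alpha) * energy_term

    def step(self, action: int) -> tuple[float, bool]:
        """Activate one sensor and return (reward, done)."""
        r = self.reward(action)
        self.active.add(action)
        self.covered |= self.coverage[action]
        return r, len(self.covered) == self.n_targets


def greedy_cover_set(env: CoverSetEnv) -> list[int] | None:
    """Stand-in policy: pick the highest immediate reward at every step.

    A trained agent would instead pick the action with the highest Q-value.
    Returns the activated sensors, or None if full coverage is impossible.
    """
    while True:
        actions = env.valid_actions()
        if not actions:
            return None
        _, done = env.step(max(actions, key=env.reward))
        if done:
            return sorted(env.active)


if __name__ == "__main__":
    demo = CoverSetEnv(
        coverage=[frozenset({0, 1}), frozenset({1, 2}), frozenset({2})],
        energy=[1.0, 0.5, 1.0],
        n_targets=3,
    )
    print(greedy_cover_set(demo))  # [0, 2]
```

In this toy instance, sensors 1 and 2 both cover the remaining target after sensor 0 is activated, and sensor 2 is chosen because its residual energy is higher. How the two terms are actually combined in the paper is not stated in the abstract.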

CLC number:

TP391

GAO Sihua, GU Han, HE Huaiqing, ZHOU Gang. Algorithm for Target Coverage Problem Based on Deep Q Learning in Wireless Sensor Networks[J]. Journal of Jilin University (Science Edition), 2023, 61(6): 1432-1440.
