基于DBSDER-QL算法的应急物资分配策略

Journal of Jilin University Science Edition ›› 2025, Vol. 63 ›› Issue (4): 1105-1116.

Previous Articles Next Articles

Emergency Resource Allocation Strategy Based on DBSDER-QL Algorithm

YANG Hao¹, ZHANG Chijun^1,2, ZHANG Xinwei³

1. School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China;
2. International Business School, Guangdong University of Finance & Economics, Guangzhou 510320, China; 3. Student Affairs Office, Changchun University, Changchun 130022, China

Received:2025-02-24 Online:2025-07-26 Published:2025-07-26

Abstract

Abstract: Aiming at the problem of emergency resource allocation for natural disasters, we proposed a Q-learning algorithm based on dynamic Boltzmann Softmax (DBS) and dynamic exploration rate (DER) (DBSDER-QL). Firstly, the DBS strategy was used to dynamically adjust the weights of action values, promoting stable convergence of the algorithm and solving the problem of excessive of the maximum operator. Secondly, the DER strategy was used to improve convergency and stability of the algorithm, solving the problem of the fixed exploration rate Q-learning algorithm not fully converging to the optimal strategy in the later stage of training. Finally, the effectiveness of the DBS and DER strategies was verified by ablation experiments. Compared with
dynamic programming, the greedy algorithm, and traditional Q-learning algorithm, the experimental results show that DBSDER-QL algorithm is significantly better than traditional methods in terms of total cost and computational efficiency, showing higher applicability and effectiveness.

Key words: resource allocation, reinforcement learning, Q-learning algorithm, dynamic exploration rate, dynamic Boltzmann Softmax

CLC Number:

TP391

YANG Hao, ZHANG Chijun, ZHANG Xinwei. Emergency Resource Allocation Strategy Based on DBSDER-QL Algorithm[J].Journal of Jilin University Science Edition, 2025, 63(4): 1105-1116.

[1]	LIU Quan, LIU Xiaosong, WU Guangjun, LIU Yuhan. Multi-actor Deterministic Policy Gradient Algorithm Based on Progressive k-Means Clustering [J]. Journal of Jilin University Science Edition, 2025, 63(3): 885-0894.
[2]	BAI Tian, LV Luyao, LI Chu, HE Jialiang. Game Intelligent Guidance Algorithm Based on Deep Reinforcement Learning [J]. Journal of Jilin University Science Edition, 2025, 63(1): 91-0098.
[3]	HAO Jianing, YAO Yongwei, YE Yuxin. Optimization Strategy for Safety Reinforcement Learning Guided by Ontology [J]. Journal of Jilin University Science Edition, 2025, 63(1): 83-0090.
[4]	GAO Sihua, GU Han, HE Huaiqing, ZHOU Gang. Algorithm for Target Coverage Problem Based on Deep Q Learning in Wireless Sensor Networks [J]. Journal of Jilin University Science Edition, 2023, 61(6): 1432-1440.
[5]	LI Lina, LIU Shilong, MA Yubo, JIN Dezheng, LI Nianfeng. Component Implementation of Adaptive Elastic Resource Allocation Strategy Based on Storm [J]. Journal of Jilin University Science Edition, 2023, 61(2): 384-392.
[6]	LI Xiaofeng, REN Jie, LI Dong. Hierarchical Matching Algorithm of Visual Image for Mobile Robots Based on Deep Reinforcement Learning [J]. Journal of Jilin University Science Edition, 2023, 61(1): 127-135.
[7]	ZHAO Pengcheng, GAO Shang, YU Hongmei. Spatial Crowdsourcing Task Assignment Based on Multi-agent Deep Reinforcement Learning [J]. Journal of Jilin University Science Edition, 2022, 60(2): 321-331.
[8]	WANG Gang, YU Yinhui, YANG Ying. Interference Management and Resource Allocation Based on Cluster Allocation in Ultra-dense Network [J]. Journal of Jilin University Science Edition, 2021, 59(5): 1228-1236.
[9]	WEN Youdong, YANG Jun, TAN Fei. Resource Allocation Strategy of Multiuser System Based on Artificial Fish Swarm Algorithm#br# [J]. Journal of Jilin University Science Edition, 2019, 57(2): 380-386.
[10]	WANG Hongzhi, ZHU Meng, ZHOU Mingyue. Robust Power Allocation Algorithm in\=Underlay Cognitive Radio Networks [J]. Journal of Jilin University Science Edition, 2017, 55(03): 641-646.
[11]	GONG Faming, LI Shibao, LIU Jianhang. Dynamic Resource Allocation Algorithm Based on SDN [J]. Journal of Jilin University Science Edition, 2015, 53(06): 1236-1240.
[12]	SUN Ting-Ting, XU Yang, ZHOU Pu. Solving Heterogeneous Resource Allocation Problemby Multiagent Systems [J]. J4, 2012, 50(06): 1163-1168.
[13]	DING Zhaohui, WEI Xiaohui, MA Da, LUO Yuan, Wilfred Li, Peter Arzberger. A Virtual Job Model to Support Crossdomain Synchronized Resource Allocation [J]. J4, 2008, 46(02): 253-258.
[14]	DING Zhaohui, WEI Xiaohui, MA Da, LUO Yuan, Wilfred Li, Peter Arzberger. A Virtual Job Model to Support Crossdomain Synchronized Resource Allocation [J]. J4, 2008, 46(02): 253-258.
[15]	LIU Xue jie,, LIU Yan heng,, LI Lian deng, MEI Lin,,. The Application of One\\|point Random Sets Coverage Theory for Updateing Mobile Predicted Policy [J]. J4, 2006, 44(06): 167-170.

Emergency Resource Allocation Strategy Based on DBSDER-QL Algorithm

PDF (PC)

Like

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0