Multi-actor Deterministic Policy Gradient Algorithm Based on Progressive k-Means Clustering

Journal of Jilin University Science Edition, 2025, Vol. 63, Issue (3): 885-894.

LIU Quan¹,², LIU Xiaosong², WU Guangjun², LIU Yuhan³

1. School of Computer Science and Technology, Kashi University, Kashi 844000, Xinjiang Uygur Autonomous Region, China; 2. School of Computer Science and Technology, Soochow University, Suzhou 215008, Jiangsu Province, China; 3. Academy of Future Education, Xi'an Jiaotong-Liverpool University, Suzhou 215000, Jiangsu Province, China

Received: 2024-01-25 Online: 2025-05-26 Published: 2025-05-26

Abstract

To address the poor learning performance and high fluctuation of the deep deterministic policy gradient (DDPG) algorithm on some tasks with large state spaces, we propose a multi-actor deep deterministic policy gradient algorithm based on progressive k-means clustering (MDDPG-PK-Means). During training, when an action is selected for the state at each time step, the decision of the actor network is assisted by the discrimination result of the k-means clustering algorithm. The number of k-means cluster centers is gradually increased as the number of training steps grows. The MDDPG-PK-Means algorithm was evaluated on the MuJoCo simulation platform. The experimental results show that, compared with DDPG and other algorithms, MDDPG-PK-Means performs better on most continuous tasks.
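
The action-selection idea in the abstract can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the abstract does not specify the cluster-growth schedule, the actor architecture, or how clusters map to actors. The sketch assumes one actor per cluster center, a hypothetical schedule `num_clusters` that adds one center every `grow_every` steps, and a hypothetical `LinearActor` standing in for an actor network. Critic training and replay are omitted.

```python
import numpy as np


def num_clusters(step, k_start=1, k_max=4, grow_every=1000):
    """Hypothetical schedule: add one cluster center every `grow_every` steps."""
    return min(k_max, k_start + step // grow_every)


def kmeans(states, k, iters=20, seed=0):
    """Plain Lloyd's k-means on an (n, d) array; returns a (k, d) array of centers."""
    rng = np.random.default_rng(seed)
    # Initialise the centers from k distinct observed states.
    centers = states[rng.choice(len(states), size=k, replace=False)].copy()
    for _ in range(iters):
        # Distance from every state to every center, shape (n, k).
        dists = np.linalg.norm(states[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = states[labels == j]
            if len(members) > 0:  # leave a center unchanged if its cluster is empty
                centers[j] = members.mean(axis=0)
    return centers


def assign_cluster(state, centers):
    """Index of the cluster center nearest to `state`."""
    return int(np.linalg.norm(centers - state, axis=1).argmin())


class LinearActor:
    """Stand-in for an actor network (assumption): action = tanh(W @ state)."""

    def __init__(self, state_dim, action_dim, seed):
        rng = np.random.default_rng(seed)
        self.W = rng.normal(scale=0.1, size=(action_dim, state_dim))

    def act(self, state):
        return np.tanh(self.W @ state)


def select_action(state, centers, actors):
    """Let the actor associated with the state's cluster choose the action."""
    idx = assign_cluster(state, centers)
    return idx, actors[idx].act(state)
```

In use, one would periodically refit `kmeans` on visited states with `k = num_clusters(step)` and keep at least `k` actors available, so that `select_action` can always route a state to the actor for its cluster.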

Key words: deep reinforcement learning, deterministic policy gradient algorithm, k-means clustering, multi-actor

CLC Number: TP18

LIU Quan, LIU Xiaosong, WU Guangjun, LIU Yuhan. Multi-actor Deterministic Policy Gradient Algorithm Based on Progressive k-Means Clustering [J]. Journal of Jilin University Science Edition, 2025, 63(3): 885-894.

