基于渐近式k-means聚类的多行动者确定性策略梯度算法
刘全, 刘晓松, 吴光军, 刘禹含
Multi-actor Deterministic Policy Gradient Algorithm Based on Progressive k-Means Clustering
LIU Quan, LIU Xiaosong, WU Guangjun, LIU Yuhan
吉林大学学报(理学版) . 2025, (3): 885 -0894 .