Spatial Crowdsourcing Task Assignment Based on Multi-agent Deep Reinforcement Learning

Journal of Jilin University (Science Edition), 2022, Vol. 60, Issue (2): 321-331.

ZHAO Pengcheng, GAO Shang, YU Hongmei

College of Computer Science and Technology, Jilin University, Changchun 130012, China

Received: 2020-12-31 Online: 2022-03-26 Published: 2022-03-26
Corresponding author: YU Hongmei E-mail: hmyu@jlu.edu.cn

Abstract

Most existing task assignment methods in spatial crowdsourcing consider only unilateral benefits, short-term benefits, and a single scenario. To address this, we propose a spatial crowdsourcing task assignment algorithm based on multi-agent deep reinforcement learning. First, we define a new spatial crowdsourcing scenario in which workers can freely choose whether to cooperate with others. Second, we design a multi-agent deep reinforcement learning model based on the attention mechanism and the A2C (advantage actor-critic) method for task assignment in the new scenario. Finally, we carry out simulation experiments and compare the algorithm with other recent task assignment algorithms. The simulation results show that the proposed algorithm achieves the highest task completion rate and the highest worker profitability rate at the same time, demonstrating its effectiveness and robustness.

Key words: multi-agent deep reinforcement learning, spatial crowdsourcing, task assignment, attention mechanism
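
The abstract names two generic building blocks: an attention mechanism and the A2C (advantage actor-critic) method. The abstract does not describe the paper's network architecture, so the following is only a minimal sketch of those two textbook ideas, not the authors' model. The function names `attend` and `a2c_advantages`, the reading of the query as one worker and the keys/values as other workers, and the default discount factor are all illustrative assumptions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Scaled dot-product attention for a single query vector.

    Hypothetical reading: `query` encodes one worker (agent), while `keys`
    and `values` encode the other workers it might cooperate with.
    Returns the attention weights and the weighted sum of the values.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    context = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, context


def a2c_advantages(rewards, values, bootstrap_value, gamma=0.99):
    """Advantage A_t = R_t - V(s_t), where R_t is the discounted return.

    `values` holds the critic's estimates V(s_t); `bootstrap_value` is the
    critic's estimate for the state after the last reward (0.0 if terminal).
    """
    ret = bootstrap_value
    advantages = [0.0] * len(rewards)
    for t in reversed(range(len(rewards))):
        ret = rewards[t] + gamma * ret   # R_t = r_t + gamma * R_{t+1}
        advantages[t] = ret - values[t]  # how much better than the critic expected
    return advantages
```

In a typical A2C setup the actor is trained by weighting the log-probability of each chosen action by its advantage, and the critic is trained to regress toward the returns; how the paper combines this with attention across agents is not stated in the abstract.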

CLC number: TP391

ZHAO Pengcheng, GAO Shang, YU Hongmei. Spatial Crowdsourcing Task Assignment Based on Multi-agent Deep Reinforcement Learning[J]. Journal of Jilin University (Science Edition), 2022, 60(2): 321-331.
