基于RBFNN的强化学习在机器人导航中的应用

J4 ›› 2009, Vol. 27 ›› Issue (02): 185-.

基于RBFNN的强化学习在机器人导航中的应用

吴洪岩,刘淑华,张嵛

东北师范大学计算机学院,长春 130117

出版日期:2009-03-20 发布日期:2009-07-06
通讯作者: 吴洪岩(１９84— )，女，吉林松原人，东北师范大学硕士研究生，主要从事自主移动机器人导航研究 E-mail:wuhy836@nenu.edu.cn
作者简介:吴洪岩(１９84— )|女|吉林松原人| 东北师范大学硕士研究生|主要从事自主移动机器人导航研究|（Tel）86-13654316092 （E-mail）wuhy836@nenu.edu.cn；刘淑华(１９70— )|女|内蒙古赤峰人| 东北师范大学副教授|硕士生导师|主要从事多移动机器人、智能规划研究|（Tel）86-13174478738（E-mail）liush129@nenu.edu.cn
基金资助:
国家自然科学基金资助项目(60573067)

Application of Reinforcement Learning Based on Radial Basis Function Neural Networks in Robot Navigation

WU Hong-yan,LIU Shu-hua, ZHANG Yu

School of Computer Science, Northeast Normal University, Changchun 130117, China

Online:2009-03-20 Published:2009-07-06

摘要/Abstract

摘要：

在复杂连续环境下，强化学习系统的状态空间面临维数灾难问题，需要采取量化的方法，降低输入空间的复杂度。径向基神经网络(RBFNN:Radial Basis Function Neural Networks)具有较强的函数逼近能力及泛化能力，由此提出了基于径向基神经网络的Q学习方法,并将其应用于单机器人的自主导航。在基于径向基神经网络的强化学习系统中，用径向基神经网络逼近状态空间和Q函数，使学习系统具有良好的泛化能力。仿真结果表明,该导航方法具有较强的避碰能力，提高了机器人对环境的适应能力。

关键词: Q学习, RBF神经网络, 机器人自主导航

Abstract:

In a complex and continuous environment, Reinforcement Learning system will cause the dimensional disaster and generalization is often adopted to reduce the complexity of input space. Radial Basis Function Neural Networks (RBFNN:Radial Basis Function Neural Networks) has the function of strong approximation and generalization. Reinforcement Learning based on RBFNN is proposed,and it is used in the single-robot navigation. In the learning system, the state space and Q function are approximated by RBFNN. Simulation results show that the proposed method improves the ability of robots collision avoidance so that the robot has better environment adaptability.

Key words: Q learning, radial basis function neural network（RBFNN), robot autonomous navigation

中图分类号:

TP242

吴洪岩,刘淑华,张嵛. 基于RBFNN的强化学习在机器人导航中的应用[J]. J4, 2009, 27(02): 185-.

WU Hong-yan,LIU Shu-hua, ZHANG Yu. Application of Reinforcement Learning Based on Radial Basis Function Neural Networks in Robot Navigation[J]. J4, 2009, 27(02): 185-.

[1]	丁硕, 常晓恒, 巫庆辉. LMBP和RBF在ECS特性曲线拟合中对比研究[J]. J4, 2013, 31(2): 203-209.
[2]	李艳辉, 赵辉, 李珊珊. 一种新的Q学习算法在机械臂轨迹规划中的应用[J]. J4, 2013, 31(1): 90-94.
[3]	刘延喜,李忠范. RBF神经网络非对称损失改进及应用[J]. J4, 2010, 28(05): 488-.