不依赖观测器的不确定性系统输出反馈鲁棒控制

doi:10.13229/j.cnki.jdxbgxb.20220538

吉林大学学报(工学版) ›› 2024, Vol. 54 ›› Issue (3): 828-835.doi: 10.13229/j.cnki.jdxbgxb.20220538

• 通信与控制工程 • 上一篇

不依赖观测器的不确定性系统输出反馈鲁棒控制

赵军¹(),赵子亮²(),朱庆林²,郭斌²

^1.山东科技大学机械电子工程学院，山东青岛 266590
^2.山东科技大学交通学院，山东青岛 266590

收稿日期:2022-05-08 出版日期:2024-03-01 发布日期:2024-04-18
通讯作者: 赵子亮 E-mail:junzhao1993@163.com;zhaoziliang1@sdust.edu.cn
作者简介:赵军（1993-），男，教授，博士.研究方向：非线性系统最优/鲁棒控制.E-mail：junzhao1993@163.com
基金资助:
国家自然科学基金项目(62203279);山东省自然科学基金项目(ZR2022QF011)

Output⁃feedback robust control of uncertain systems without observer

Jun ZHAO¹(),Zi-liang ZHAO²(),Qing-lin ZHU²,Bin GUO²

^1.College of Mechanical and Electronic Engineering，Shandong University of Science and Technology，Qingdao 266590，China
^2.College of Transportation，Shandong University of Science and Technology，Qingdao 266590，China

Received:2022-05-08 Online:2024-03-01 Published:2024-04-18
Contact: Zi-liang ZHAO E-mail:junzhao1993@163.com;zhaoziliang1@sdust.edu.cn

摘要/Abstract

摘要：

针对非匹配不确定性系统的静态输出反馈鲁棒控制问题在线求解难的难题，提出了一种基于数据驱动学习的自适应学习算法。首先，将不确定性系统的鲁棒控制问题转化为具有性能指标函数的标称系统的最优控制问题。其次，为实现输出反馈最优控制，根据状态反馈控制项构造了输出反馈黎卡提方程。再次，为实现该输出反馈黎卡提方程的在线求解，使用克罗内克积和向量化操作重构输出反馈黎卡提方程，进而设计了基于输入/输出数据的自适应学习算法，摒弃了传统观测器的使用，实现可一步求解的输出反馈鲁棒控制。最后，为实现被估参数的快速收敛，进一步放松了所要求的持续激励条件。仿真结果验证了本文控制方法和学习算法的有效性。

关键词: 控制理论与控制工程, 数据驱动学习, 鲁棒控制, 最优控制, 持续激励条件

Abstract:

A novel data-driven learning method to achieve static output-feedback robust control of unmatched dynamic systems was proposed， which uses the techniques originally developed for optimal control. The robust control was first transformed into the optimal control of an augmented system， taking unmatched dynamics into consideration. Then， to design the output-feedback optimal control， an output-feedback algebraic Riccati equation was derived by tailoring its state-feedback control counterpart. Once more， an adaptive online learning method was designed to avoid using the observer， where two operations （i.e.， vectorization and Kronecker's product） were adopted to reconstruct the output-feedback algebraic Riccati equation. Finally， the required persistent excitation condition was further relaxed to realize the rapid convergence of the estimated parameters. Simulation results show the effectiveness of the proposed control method and learning algorithm.

Key words: control theory and control engineering, data-driven learning, robust control, optimal control, persistent excitation condition

中图分类号:

TP13

赵军,赵子亮,朱庆林,郭斌. 不依赖观测器的不确定性系统输出反馈鲁棒控制[J]. 吉林大学学报(工学版), 2024, 54(3): 828-835.

Jun ZHAO,Zi-liang ZHAO,Qing-lin ZHU,Bin GUO. Output⁃feedback robust control of uncertain systems without observer[J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(3): 828-835.

图/表 3

图1

图2

图3

参考文献 15

1	Chen Q, Xie S, Sun M, et al. Adaptive nonsingular fixed-time attitude stabilization of uncertain spacecraft[J]. IEEE Transactions on Aerospace and Electronic Systems, 2018, 54(6): 2937-2950.
2	张刘, 曾庆铭, 赵寰宇, 等. 基于Lyapunov理论的卫星大挠性太阳能帆板分布式自适应振动抑制控制方法[J]. 吉林大学学报: 工学版, 2023, 53(9): 2676-2685.
	Zhang Liu, Zeng Qing-Ming, Zhao Huan-Yu, et al. Distributed adaptive vibration suppression control method of large solar panels for satellites based on Lyapunov theory[J]. Journal of Jilin University (Engineering and Technology Edition), 2023, 53(9): 2676-2685.
3	Lin F. Robust Control Design: an Optimal Control Approach[M]. Chichester: John Wiley & Sons Ltd, 2007.
4	Gahinet P, Nemirovskii A, Laub A J, et al. The LMI control toolbox[C]∥Proceedings of 1994 33rd IEEE Conference on Decision and Control, Orlando, FL, USA, 1994: 2038-2041.
5	Lewis F L, Vrabie D. Reinforcement learning and adaptive dynamic programming for feedback control[J]. IEEE Circuits and Systems Magazine, 2009, 9(3): 32-50.
6	Liu D, Wei Q, Wang D, et al. Adaptive Dynamic Programming with Applications in Optimal Control[M]. Switzerland: Springer, 2017.
7	Na J, Herrmann G. Online adaptive approximate optimal tracking control with simplified dual approximation structure for continuous-time unknown nonlinear systems[J]. IEEE/CAA Journal of Automatica Sinica, 2014, 1(4): 412-422.
8	Lewis F L, Vamvoudakis K G, Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data[J]. IEEE Transactions on System Man Cybern B, 2011, 41(1): 14-25.
9	Gadewadikar J, Abu-khalaf M, Lewis F L, Necessary and sufficient conditions for H-infinity static output-feedback control[J]. Journal of Guidance Control and Dynamics, 2012, 29(4): 915-920.
10	Zhu L M, Modares H, Gan O P, et al. Adaptive suboptimal output-feedback control for linear systems using integral reinforcement learning[J]. IEEE Transactions on Control Systems Technology, 2014, 23(1): 264-273.
11	张化光, 张欣, 罗艳红, 等. 自适应动态规划综述[J]. 自动化学报, 2013, 39(4): 303-311.
	Zhang Hua-Guang, Zhang Xin, Luo Yan-Hong, et al. An overview of research on adaptive dynamic programming[J]. Acta Automatica Sinica, 2013, 39(4): 303-311.
12	吕永峰, 田建艳, 菅垄, 等. 非线性多输入系统的近似动态规划H_∞控制[J]. 控制理论与应用, 2021, 38(10): 1662-1670.
	Lv Yong-feng, Tian Jian-yan, Jin Long, et al. Approximate-dynamic-programming H_∞ controls for multi-input nonlinear system[J]. 2021, 38(10): 1662-1670.
13	Na J, Huang Y, Liu T, et al. Reinforced adaptive parameter estimation with prescribed transient convergence performance[J]. Systems & Control Letters, 2021, 149: No. 104880.
14	Stevens B L, Frank F L. Aircraft Control and Simulation, 2nd ed[M]. New York: Wiley, 2003.
15	Na J, Zhao J, Gao G, et al. Output-feedback robust control of uncertain systems via online data-driven learning[J]. IEEE Transactions on Neural Networks and Learning Systems, 2020, 32(6): 2650-2662.

相关文章 15

[1]	王德军,张凯然,徐鹏,顾添骠,于文雅. 基于车辆执行驱动能力的复杂路况速度规划及控制[J]. 吉林大学学报(工学版), 2023, 53(3): 643-652.
[2]	郭洪艳,于文雅,刘俊,戴启坤. 复杂场景智能车辆车道与速度一体化滚动优化决策[J]. 吉林大学学报(工学版), 2023, 53(3): 693-703.
[3]	齐国元,陈浩. 基于观测器的四旋翼控制-抗扰-避障一体化[J]. 吉林大学学报(工学版), 2023, 53(3): 810-822.
[4]	何德峰,周丹,罗捷. 跟随式车辆队列高效协同弦稳定预测控制[J]. 吉林大学学报(工学版), 2023, 53(3): 726-734.
[5]	徐卓君,王耀祥,黄兴,彭程. 多无人机地面移动目标搜寻和定位[J]. 吉林大学学报(工学版), 2023, 53(3): 832-840.
[6]	齐国元,李阔,王琨. 基于补偿函数观测器的四旋翼无人机姿态受限控制[J]. 吉林大学学报(工学版), 2023, 53(3): 853-862.
[7]	陈珑茏,冯天宇,吕宗阳,吴玉虎. 共轴倾转旋翼无人机有限时间滑模姿态控制[J]. 吉林大学学报(工学版), 2023, 53(3): 883-890.
[8]	马彦,郭则宣. 基于改进BPNN⁃MPF算法的锂离子电池SoE估计[J]. 吉林大学学报(工学版), 2023, 53(1): 263-272.
[9]	孙晓东,张瑶,陈龙. 电动汽车永磁同步轮毂电机无差拍预测电压补偿控制[J]. 吉林大学学报(工学版), 2022, 52(10): 2213-2224.
[10]	冯建鑫,王强,王雅雷,胥彪. 基于改进量子遗传算法的超声电机模糊PID控制[J]. 吉林大学学报(工学版), 2021, 51(6): 1990-1996.
[11]	王宏志,王婷婷,胡黄水,鲁晓帆. 基于Q学习优化BP神经网络的BLDCM转速PID控制[J]. 吉林大学学报(工学版), 2021, 51(6): 2280-2286.
[12]	马彦,黄健飞,赵海艳. 基于车间通信的车辆编队控制方法设计[J]. 吉林大学学报(工学版), 2020, 50(2): 711-718.
[13]	刘富,安毅,董博,李元春. 基于ADP的可重构机械臂能耗保代价分散最优控制[J]. 吉林大学学报(工学版), 2020, 50(1): 342-350.
[14]	夏超英, 杜智明. 丰田PRIUS混合动力汽车能量优化管理策略仿真分析[J]. 吉林大学学报(工学版), 2017, 47(2): 373-383.
[15]	邓丽飞, 石要武, 朱兰香, 于丁力. SI发动机闭环系统故障检测[J]. 吉林大学学报(工学版), 2017, 47(2): 577-582.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

不依赖观测器的不确定性系统输出反馈鲁棒控制

Output⁃feedback robust control of uncertain systems without observer

RICH HTML

PDF (PC)

摘要/Abstract

引用本文

使用本文

图/表 3

参考文献 15

相关文章 15

Metrics

本文评价

推荐阅读 0