Iterative Learning Control Combination with Adaptive Sliding Mode Technique for a Hypersonic Vehicle

Xiaodong Lu; Hao Zhang; Jianguo Guo; Jun Zhou

doi:10.1051/jnwpu/20193761120

Volume 37 / No 6 (December 2019)

JNWPU, 37 6 (2019) 1120-1128

Open Access

Issue		JNWPU Volume 37, Number 6, December 2019


Page(s)		1120 - 1128
DOI		https://doi.org/10.1051/jnwpu/20193761120
Published online		11 February 2020


高超声速飞行器滑模自适应迭代学习控制系统设计

Xiaodong Lu (卢晓东), Hao Zhang (张豪), Jianguo Guo (郭建国) and Jun Zhou (周军)

Institute of Precision Guidance and Control, Northwestern Polytechnical University, Xi'an 710072, China

Received: 23 October 2018

Abstract

To address the complicated nonlinearities, high uncertainties and strong coupling of a hypersonic vehicle, a new adaptive iterative learning control method is proposed. The proposed controller combines iterative learning control with sliding mode control. First, a nonlinear design model for attitude control is established from the attitude motion equations of the hypersonic vehicle. Then, for a class of nonlinear systems, a new iterative learning control scheme combined with sliding mode control is proposed and applied to the nonlinear design model. Finally, a Lyapunov-like function method is used to prove the boundedness of all signals of the closed-loop system and the convergence of the tracking errors to zero over iterations. Simulation results show the effectiveness of the proposed control scheme and, in comparison with traditional sliding mode control, its stronger robustness against uncertainties and disturbances.

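Abstract (translated from the Chinese)

To address the mismatched-uncertainty control problem caused by strong coupling and disturbances during the reentry of a hypersonic vehicle, a design method for a new adaptive iterative learning control system is proposed. The study combines an advanced control method with an iterative learning control strategy. First, a control-oriented attitude model of the hypersonic vehicle is given. Then, for a class of nonlinear systems, a new design method for an iterative learning control system combined with sliding mode control is proposed and applied to the control-oriented attitude model. Finally, a Lyapunov functional is used to prove the convergence of the closed-loop tracking error and the boundedness of the variables. Simulations show that the proposed method lets the vehicle track commands quickly and stably, and a comparison with traditional sliding mode control shows that the method is strongly robust to aerodynamic uncertainties and disturbances.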

Key words: iterative learning control / adaptive control / sliding mode control / hypersonic vehicle / nonlinear / attitude motion / controller design / simulation

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

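During reentry a hypersonic vehicle flies fast and over a wide airspace, its dynamic parameters vary greatly, and the coupling between channels is severe. The controller must therefore be designed specifically so that the vehicle retains satisfactory flying qualities and performance under complex flight conditions[1-2].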

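Iterative learning control (ILC) emerged in the 1980s. When a system performs a repetitive motion over a finite time interval, ILC continually adjusts the controller for the next iteration using information from the previous iteration, and ultimately achieves zero-error tracking over the finite interval[3]. Traditional ILC is subject to several restrictions: identical initial conditions, a global Lipschitz condition, and a system relative degree of zero, among others. Variable-structure ILC combines the advantages of variable-structure control and ILC and can obtain better robustness and better convergence of the iteration error[4]. References [4-6] all combine ILC with variable-structure control to handle complex plants, and references [7-10] use adaptive ILC to deal with issues such as initial values and uncertainties.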
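The iteration-to-iteration update described above can be illustrated with a minimal sketch. It uses a plain P-type learning law on a hypothetical scalar discrete-time plant, not the adaptive sliding mode scheme developed in this paper. The plant, the learning gain `gamma` and the function name `run_ilc` are assumptions made only for illustration.

```python
import numpy as np

def run_ilc(n_iter=30, n_steps=50, gamma=0.8):
    """P-type ILC on the toy plant x[t+1] = 0.3*x[t] + u[t], y[t] = x[t]."""
    t = np.arange(n_steps + 1)
    y_ref = np.sin(2 * np.pi * t / n_steps)    # reference on the finite interval
    u = np.zeros(n_steps)                      # control profile of iteration 0
    max_err = []
    for _ in range(n_iter):
        x = np.zeros(n_steps + 1)              # identical initial condition x[0] = 0
        for i in range(n_steps):
            x[i + 1] = 0.3 * x[i] + u[i]
        e = y_ref - x                          # tracking error of this iteration
        max_err.append(np.max(np.abs(e[1:])))
        # learning law: u_{k+1}(t) = u_k(t) + gamma * e_k(t+1)
        u = u + gamma * e[1:]
    return max_err

errors = run_ilc()
```

Each pass reuses the whole error profile of the previous pass, so for this toy plant and gain the largest tracking error in `errors` shrinks from one iteration to the next.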

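The back-stepping design strategy is an important topic in variable-structure control. For a high-order system, back-stepping solves for a virtual control at each step and obtains the actual control at the final step[11-12]. The explosion of differentiation terms inherent in back-stepping can be resolved with a dynamic surface control strategy[13].

Building on references [13-15] and on the alignment condition[14], this paper proposes a systematic design method for an adaptive iterative learning controller. The new controller consists of three parts: an iterative learning law for estimating the unknown nonlinear functions, an adaptive law for estimating the unknown time-invariant parameters, and a variable-structure controller.

The main contribution of this paper is a new iterative learning controller design method for the attitude model of a hypersonic vehicle. By combining an advanced control strategy with iterative learning control design, the method is strongly robust to perturbations of the aerodynamic parameters.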

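1 Attitude model of the reentry vehicle

This section studies the attitude control problem of a hypersonic vehicle during atmospheric reentry. The attitude motion model of the hypersonic vehicle is as follows

where α, β and γ are the angle of attack, sideslip angle and bank angle of the vehicle; ω_x, ω_y and ω_z are its angular rates; m, V and θ are its mass, velocity and flight-path angle; Y and Z are the lift and side force acting on the vehicle; M_x, M_y and M_z are the rolling, yawing and pitching moments acting on the vehicle; J_x, J_y and J_z are its moments of inertia; and q, S and l denote the dynamic pressure, reference area and reference length.

The aerodynamic forces are expressed as

The aerodynamic moments are expressed as

Near a given characteristic point, the aerodynamic coefficients are fitted as

Considering that the reentry vehicle executes its task on [0, T], the model is rewritten in the following form

where k is the iteration index, x_{1, k}=[α_k β_k γ_k]^T and x_{2, k}=[ω_{z, k} ω_{y, k} ω_{x, k}]^T are the attitude states of the vehicle, and u_k=[δ_{z, k} δ_{y, k} δ_{x, k}]^T is the control input. The individual terms are given by

where q is the dynamic pressure, S is the reference area of the vehicle, l is its reference length, C_Y0 is the basic lift coefficient, C_Z^β is the basic side-force coefficient, C_x^β is the basic rolling-moment coefficient, C_y0 is the basic pitching-moment coefficient, and C_z^β is the basic yawing-moment coefficient. C_*^δ_* are the force and moment coefficients due to the individual control surfaces. C_{xω_x}, C_{yω_y} and C_{zω_z} are the damping-moment coefficients attributed to the control surfaces. d_{1, k}=[d_{11, k}, d_{12, k}, d_{13, k}]^T and d_{2, k}=[d_{21, k}, d_{22, k}, d_{23, k}]^T contain the unmodeled dynamics, parameter perturbations and disturbance terms.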
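As a rough illustration of how q, S and l enter the model, the sketch below uses the common convention that a force equals q·S times a dimensionless coefficient and a moment equals q·S·l times a dimensionless coefficient. This convention, the function names and all numerical values are assumptions for illustration and are not taken from the paper's equations.

```python
def dynamic_pressure(rho, V):
    """q = 0.5 * rho * V**2 (Pa for rho in kg/m^3 and V in m/s)."""
    return 0.5 * rho * V ** 2

def aero_force(q, S, coeff):
    """Force from a dimensionless force coefficient (assumed convention)."""
    return q * S * coeff

def aero_moment(q, S, l, coeff):
    """Moment from a dimensionless moment coefficient (assumed convention)."""
    return q * S * l * coeff

# Hypothetical flight condition and coefficients, chosen only for illustration.
q = dynamic_pressure(rho=0.02, V=3000.0)
Y = aero_force(q, S=0.5, coeff=0.1)
M_z = aero_moment(q, S=0.5, l=4.0, coeff=-0.01)
```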

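To facilitate the controller design, the following assumptions are made.

Assumption 1 The disturbances d_{ij, k} are bounded, where i=1, 2 and j=1, 2, 3.

Assumption 2 The nonlinear function vector ψ_{i, k}=[ψ_{i1, k}, ψ_{i2, k}, ψ_{i3, k}]^T is smooth and continuous and satisfies the Lipschitz condition. This means that, at every iteration, (5) holds on the finite time interval [0, T]

Assumption 3 The reference command y_c is differentiable and its derivative is bounded.

Assumption 4 The system is bounded at the initial instant of the first iteration.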
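For orientation, a Lipschitz condition of the kind named in Assumption 2 is usually written in the generic form below. This is a textbook statement rather than the paper's equation (5); the constant L_i and the arguments x, x' are symbols introduced here for illustration.

```latex
\left\| \psi_{i,k}(x) - \psi_{i,k}(x') \right\| \le L_i \left\| x - x' \right\|,
\qquad t \in [0, T],\; i = 1, 2
```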

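2 Controller design

This section follows the back-stepping design procedure to design an iterative learning controller for system (1). A virtual control is obtained at each step, and the actual control is designed at the final step.

Let y_c denote the reference command, so that y_c=[α_c β_c γ_c]^T, and let x_1d=y_c. The errors can then be written as

where x_{2d, k} denotes the virtual control.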
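In back-stepping designs the tracking errors are conventionally the differences between each state and its desired value. A sketch of that convention in the notation of this section follows; the sign convention is an assumption, and the error symbols z_{1,k}, z_{2,k} are borrowed from their later use in σ_{1,k} and σ_{2,k}.

```latex
z_{1,k} = x_{1,k} - x_{1d}, \qquad z_{2,k} = x_{2,k} - x_{2d,k}
```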

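Step 1: design the virtual control x_{2d, k} for the state x_{1, k}.

Choose the sliding surface

where Φ_{1, k}=[Φ_{11, k}, Φ_{12, k}, Φ_{13, k}]^T.

Differentiating the sliding surface gives

Design the virtual control x_{2c, k} as

where σ_{1, k}=s_{1, k}+z_{1, k}.

The iterative learning law is designed as

where α₁=diag(α₁₁, α₁₂, α₁₃) is a positive definite diagonal matrix.

The function m_{1, k}=[m_{11, k}, m_{12, k}, m_{13, k}]^T is designed as

with matrix elements

where γ_m1=diag(γ_m11, γ_m12, γ_m13) is a positive definite diagonal matrix.

r_{1, k}=[r_{11, k}, r_{12, k}, r_{13, k}]^T is designed as

where γ_r1=diag(γ_r11, γ_r12, γ_r13) is a positive definite diagonal matrix.

The controller parameters k_{1, 1}=diag(k_{11, 1}, k_{12, 1}, k_{13, 1}) and k_{1, 2}=diag(k_{11, 2}, k_{12, 2}, k_{13, 2}) are positive definite diagonal matrices. The quantities in the iterative learning law satisfy the initial values […]; the coefficient matrix is Θ_{1, k}=diag[Θ_{11, k}, Θ_{12, k}, Θ_{13, k}]; […] are the entries of the diagonal matrix M₁=diag(M₁₁, M₁₂, M₁₃); and c_1j>0, j=1, 2, 3.

Let x_{2d, k} be the output of a low-pass filter driven by x_{2c, k}, satisfying

where μ=diag(μ₁, μ₂, μ₃) is the matrix of filter time constants.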
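The filtering step can be sketched numerically. The example assumes the first-order form commonly used in dynamic surface control, μ·ẋ_2d + x_2d = x_2c with x_2d(0) = x_2c(0), integrated by forward Euler; that form, the function name `filter_virtual_control` and the time constants are assumptions for illustration.

```python
import numpy as np

def filter_virtual_control(x2c, mu, dt):
    """Forward-Euler integration of mu * d(x2d)/dt + x2d = x2c, per channel.

    x2c : array of shape (n_steps, 3), raw virtual control samples
    mu  : array of shape (3,), filter time constants in seconds (assumed values)
    """
    x2d = np.empty_like(x2c, dtype=float)
    x2d[0] = x2c[0]                       # start the filter on the raw signal
    for i in range(1, len(x2c)):
        x2d[i] = x2d[i - 1] + dt * (x2c[i - 1] - x2d[i - 1]) / mu
    return x2d

dt = 0.001
mu = np.array([0.02, 0.02, 0.02])
x2c = np.ones((2000, 3))
x2c[0] = 0.0                              # unit step applied after the first sample
x2d = filter_virtual_control(x2c, mu, dt)
```

Because the filter output is produced by integration, its derivative is available algebraically as (x2c - x2d) / mu, which is how dynamic surface control avoids differentiating the virtual control analytically.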

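Step 2: design the actual control u_k for the state x_{2, k}.

Choose the sliding surface

where

The adaptive law is

Differentiating the sliding surface gives

Design the actual control u_k as

where σ_{2, k}=s_{2, k}+z_{2, k}.

The iterative learning law is designed as

where α₂=diag(α₂₁, α₂₂, α₂₃) is a positive definite diagonal matrix.

The matrix elements satisfy β_2l(m, n)=β_{2l, mn}≥0.

The function m_{2, k}=[m_{21, k}, m_{22, k}, m_{23, k}]^T is designed as

where γ_m2=diag(γ_m21, γ_m22, γ_m23) is a positive definite diagonal matrix.

r_{2, k}=[r_{21, k}, r_{22, k}, r_{23, k}]^T is designed as

where γ_r2=diag(γ_r21, γ_r22, γ_r23) is a positive definite diagonal matrix.

The controller parameters k_{2, 1}=diag(k_{21, 1}, k_{22, 1}, k_{23, 1}) and k_{2, 2}=diag(k_{21, 2}, k_{22, 2}, k_{23, 2}) are positive definite diagonal matrices. The quantities in the iterative learning law satisfy the initial values […], r_{2j, k}(0)>0; the matrix is Θ_{2, k}=diag[Θ_{21, k}, Θ_{22, k}, Θ_{23, k}]; […] are the entries of M₂=diag(M₂₁, M₂₂, M₂₃); and c_2j>0, j=1, 2, 3.

This completes the design of the actual control u_k.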

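3 Proof of stability and convergence

Lemma 1 System (1) satisfies the alignment condition, which means that the initial value of each iteration equals the final value of the previous iteration. Choose the Lyapunov function W_k as

where λ_i=diag(λ_i1, λ_i2, λ_i3) and ω_k=x_{2d, k}-x_{2c, k}.

Differentiating gives

Since

substituting the sliding-mode expressions gives

where […]. The disturbance term is […]. ‖·‖₂ denotes the 2-norm and ‖·‖₁ denotes the 1-norm.

Let

Choose […] and […].

With […], it follows that

where […]. Taking the learning error into account as well,

Next, the boundedness of V₀(t) on the finite time interval [0, T] is proved first; then V_k(t) is shown to decrease monotonically as k increases; finally it is proved that […].

Differentiating V_k gives

Substituting the iterative learning laws (10) and (17),

Since

Choose the parameters as

Choosing […] gives

Since

Integrating (30) gives

Since ψ_{ij, 0}² is a smooth continuous function on [0, T], Q₀= […], which shows that V₀(t), t∈[0, T], is bounded. Next, V_k is shown to be bounded.

From the iterative learning law,

Following the method used in (27) to (29), choosing the same parameters M_ij, and substituting into (33) gives

From (23),

Hence

Using the alignment condition W_k(0)=W_{k-1}(T),

Define […], …, n)

It then follows that

Hence V_k≤W_k(0)+Q

It follows that V_k(T) and W_k(T) are bounded. Since W_k(0)=W_{k-1}(T), V_k and each of its components are bounded on [0, T], which also shows that the filter parameter matrix exists.

Since V_k(T) is positive definite and bounded,

Finally, on the finite time interval [0, T],

This completes the proof of stability and convergence. With the controller designed in this paper, the state tracking errors converge asymptotically to zero.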
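The condition used in the proof, that each iteration starts from the final state of the previous one, can be pictured with a small simulation loop. The scalar plant, the input signal and the name `run_iteration` are hypothetical and serve only to show how the state is carried over between passes.

```python
import numpy as np

def run_iteration(x0, u, dt, a=-1.0):
    """Forward-Euler simulation of the toy plant dx/dt = a*x + u over one pass."""
    x = np.empty(len(u) + 1)
    x[0] = x0
    for i, u_i in enumerate(u):
        x[i + 1] = x[i] + dt * (a * x[i] + u_i)
    return x

dt = 0.01
u = np.sin(2 * np.pi * np.arange(0.0, 2.0, dt) / 2.0)  # input with period T = 2 s
x_start = 0.5                                          # initial state of iteration 0
starts, ends = [], []
for k in range(5):
    x = run_iteration(x_start, u, dt)
    starts.append(x[0])
    ends.append(x[-1])
    x_start = x[-1]        # carry-over: x_{k+1}(0) = x_k(T)
```

With this carry-over no reset to a fixed initial state is needed between passes, which suits a vehicle that runs continuously.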

4 Simulation

Let I₃ denote the 3×3 identity matrix. The control parameters are λ₁=0.6I₃, η₁=0.5I₃, γ_m1=10^-3I₃, γ_r1=γ_r2=10^-2I₃, M₁=M₂=1.2I₃, α₁=α₂=0.7I₃, and k₁₁=k₁₂=2.1I₃.
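For readers reproducing the setup, the listed gains can be collected as NumPy matrices. The dictionary keys and the helper `is_positive_definite_diagonal` are naming choices made here, not identifiers from the paper; the numerical values are the ones stated above.

```python
import numpy as np

I3 = np.eye(3)
params = {
    "lambda1": 0.6 * I3,
    "eta1": 0.5 * I3,
    "gamma_m1": 1e-3 * I3,
    "gamma_r1": 1e-2 * I3,
    "gamma_r2": 1e-2 * I3,
    "M1": 1.2 * I3,
    "M2": 1.2 * I3,
    "alpha1": 0.7 * I3,
    "alpha2": 0.7 * I3,
    "k11": 2.1 * I3,
    "k12": 2.1 * I3,
}

def is_positive_definite_diagonal(matrix):
    """True if the matrix is diagonal with strictly positive diagonal entries."""
    off_diagonal = matrix - np.diag(np.diag(matrix))
    return bool(np.all(off_diagonal == 0) and np.all(np.diag(matrix) > 0))

all_valid = all(is_positive_definite_diagonal(m) for m in params.values())
```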

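In the simulation, the time-varying equivalent disturbance forces and equivalent disturbance moments are chosen as

The disturbance terms d_{1, k} and d_{2, k} are added to the system after t=6 s. The command used in the simulation, x_1d=[8, 0, 10]°, is generated by passing a square wave through an inertial (first-order lag) element.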
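A command of this kind can be sketched as follows. The square-wave period of 8 s, the lag time constant of 0.3 s, the 0-to-amplitude wave shape and the function name `square_wave_command` are all assumptions, since the text gives only the amplitudes [8, 0, 10]°.

```python
import numpy as np

def square_wave_command(t, amplitude_deg, period=8.0, tau=0.3):
    """Square wave (0 / amplitude) passed through 1/(tau*s + 1), forward Euler."""
    dt = t[1] - t[0]
    raw = np.where((t % period) < period / 2, 1.0, 0.0)[:, None] * amplitude_deg
    cmd = np.zeros_like(raw)
    for i in range(1, len(t)):
        cmd[i] = cmd[i - 1] + dt * (raw[i - 1] - cmd[i - 1]) / tau
    return cmd

t = np.arange(0.0, 8.0, 0.001)
x1d = square_wave_command(t, np.array([8.0, 0.0, 10.0]))  # columns: alpha, beta, gamma
```

The lag element smooths the edges of the square wave, so the command and its derivative stay bounded, consistent with Assumption 3.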

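Case 1 A square-wave command is applied so that the vehicle runs continuously, and the equivalent disturbance forces and moments are added at the specified time t=6 s, to verify the effectiveness of the controller and its robustness to disturbances.

The simulation results are shown in Fig. 1 and Fig. 2. Fig. 1a) shows that the overshoot of the tracking response decreases rapidly as the number of iterations increases and has already reached a small value after 10 iterations. Because learning draws on the sliding mode, the tracking error and the results of the previous iteration, iterations 1 to 4 already reduce the overshoot considerably.

Fig. 1b) shows how the steady-state error at t=3.8 s varies with the number of iterations, that is, the steady-state error before the disturbance forces and moments are added. The curves show that learning is clearly effective over iterations 1 to 8: the angle-of-attack error falls quickly, while the sideslip-angle and bank-angle errors are small.

Fig. 1c) shows the steady-state error at t=7.8 s, when the equivalent disturbance forces and moments have already been added. The curves show that learning is clearly effective over iterations 1 to 8, and after 10 iterations the attitude-angle errors settle at small values.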
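The two quantities plotted against the iteration number, overshoot and steady-state error at a fixed time, can be computed from a sampled response as sketched below. The function names and the synthetic underdamped response are illustrative assumptions, not the paper's data.

```python
import numpy as np

def overshoot_percent(y, y_cmd):
    """Peak excursion beyond a constant positive command, in percent."""
    return max(0.0, (np.max(y) - y_cmd) / abs(y_cmd) * 100.0)

def steady_state_error(t, y, y_cmd, t_eval):
    """Command minus response at the sample closest to t_eval."""
    idx = np.argmin(np.abs(t - t_eval))
    return y_cmd - y[idx]

t = np.linspace(0.0, 4.0, 4001)
y = 8.0 * (1.0 - np.exp(-2.0 * t) * np.cos(6.0 * t))  # synthetic response to an 8 deg step
ov = overshoot_percent(y, 8.0)
sse = steady_state_error(t, y, 8.0, t_eval=3.8)
```

Evaluating these two functions on the response of every iteration would give curves of the kind described for Fig. 1.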

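Fig. 2 plots the vehicle response as the number of iterations varies. The tracking response in the first iteration is not ideal, but as the number of iterations increases the attitude angles come ever closer to the commanded values. At the same time, because the controller is able to learn, the influence of the disturbance forces and moments on the system decreases rapidly as the number of iterations increases.

This shows that after a certain number of iterations the system can track the command with a small error, and that the controller is highly robust to disturbances.

Case 2 Comparison with a traditional sliding mode controller[15].

The simulation results are shown in Fig. 3, using the system response after 10 iterations and the same disturbances as in Case 1. The curves show that the overshoot, steady-state error and disturbance rejection of the system response are all clearly better than those of traditional sliding mode control. The proposed method therefore tracks commands well and quickly, and also outperforms traditional sliding mode control in disturbance rejection.

Case 3 The aerodynamic parameters are perturbed by ±20% to examine the performance of the closed-loop system.

The simulation results are shown in Fig. 4, where the nominal curve is the system response without aerodynamic parameter perturbation. The system response is only slightly affected by the perturbation. This is because the learning law estimates the nonlinear functions from the actual motion of the system; once the aerodynamic perturbation is introduced, the learning law estimates the perturbed nonlinear functions and feeds the estimates back to the controller, and when the estimates are accurate enough the closed-loop system is only slightly affected by the perturbation. This shows that the controller designed with the proposed method can adapt to uncertain variations in the aerodynamic parameters with little loss of performance in other respects, and is therefore strongly robust.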
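A ±20% perturbation study of this kind can be set up by scaling every nominal coefficient by a common factor, as in the sketch below. The coefficient values and the helper `perturb_coefficients` are hypothetical; the paper does not state whether all coefficients are perturbed together, so the uniform scaling is an assumption.

```python
def perturb_coefficients(nominal, fraction):
    """Scale every aerodynamic coefficient by (1 + fraction)."""
    return {name: value * (1.0 + fraction) for name, value in nominal.items()}

# Hypothetical nominal coefficients, for illustration only.
nominal = {"C_Y0": 0.10, "C_Z_beta": -0.05, "C_x_beta": -0.02}
cases = {
    label: perturb_coefficients(nominal, fraction)
    for label, fraction in [("-20%", -0.2), ("nominal", 0.0), ("+20%", 0.2)]
}
```

Each entry of `cases` would then drive one closed-loop simulation, giving the three responses compared in Fig. 4.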

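Fig. 1 Errors versus number of iterations

Fig. 2 States versus time

Fig. 3 Comparison of states versus time

Fig. 4 States versus time under aerodynamic parameter perturbation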

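5 Conclusions

An adaptive iterative learning control method is proposed for the attitude model of a reentry vehicle; it can handle the uncertainties and dynamic disturbances present in the system. The method combines adaptive sliding mode control, back-stepping and iterative learning control to achieve accurate command tracking, and a Lyapunov functional is constructed to prove system stability and error convergence.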

References

[1] Guo Jianguo, Liu Yuchao, Zheng Qiang, et al. A Novel Sliding Mode Attitude Control for Hypersonic Vehicle with Mismatched Uncertainties[J]. Journal of Northwestern Polytechnical University, 2017, 35(6): 941–947 (in Chinese)
[2] Huang Lin, Duan Zhisheng, Yang Jianying. Challenges of Control Science in Near Space Hypersonic Aircrafts[J]. Control Theory and Application, 2011, 28(10): 1496–1506 (in Chinese)
[3] Bristow D A, Tharayil M, Alleyne A G. A Survey of Iterative Learning Control[J]. IEEE Control Systems, 2006, 26(3): 96–114
[4] Zhu Q, Xu J X, Huang D, et al. Iterative Learning Control Design for Linear Discrete-Time Systems with Multiple High-Order Internal Models[J]. Automatica, 2015, 62: 65–76
[5] Tayebi A. Adaptive Iterative Learning Control for Robot Manipulators[J]. Automatica, 2004, 40(7): 1195–1203
[6] Yan Qiuzhen, Sun Mingxuan, Li He. Iterative Learning Control for Nonlinear Uncertain Systems with Arbitrary Initial State[J]. Acta Automatica Sinica, 2016, 42(4): 545–555 (in Chinese)
[7] Chi R, Hou Z, Xu J. Adaptive ILC for a Class of Discrete-Time Systems with Iteration-Varying Trajectory and Random Initial Condition[J]. Automatica, 2008, 44(8): 2207–2213
[8] Zhang C L, Li J M. Adaptive Iterative Learning Control of Non-Uniform Trajectory Tracking for Strict Feedback Nonlinear Time-Varying Systems with Unknown Control Direction[J]. Applied Mathematical Modelling, 2015, 39(10/11): 2942–2950
[9] Wei J, Zhang Y, Sun M, et al. Adaptive Iterative Learning Control of a Class of Nonlinear Time-Delay Systems with Unknown Backlash-Like Hysteresis Input and Control Direction[J]. ISA Transactions, 2017, 70: 79–92
[10] Yu Qiongxia, Hou Zhongsheng. Adaptive Iterative Learning Control for a Class of Non-Parametric Uncertain Systems[J]. Journal of Systems Science and Complexity, 2016, 36(1): 1–13 (in Chinese)
[11] Shao X L, Wang H L. Back-Stepping Active Disturbance Rejection Control Design for Integrated Missile Guidance and Control System via Reduced-Order ESO[J]. ISA Transactions, 2015, 57: 10–22
[12] Guo Jianguo, Zhang Tianbao, Zhou Jun, et al. Compound Control System Design for RLV Based on Normalization Method[J]. Journal of Astronautics, 2016, 37(6): 639–645 (in Chinese)
[13] Yip P P, Hedrick J K. Adaptive Dynamic Surface Control: A Simplified Algorithm for Adaptive Backstepping Control of Nonlinear Systems[J]. International Journal of Control, 1998, 71(5): 959–979
[14] Benslimane H, Boulkroune A, Chekireb H. Iterative Learning Control for Strict-Feedback Nonlinear Systems with both Structured and Unstructured Uncertainties[J]. Arabian Journal for Science and Engineering, 2016, 41(9): 3683–3694
[15] Liu Haidong, Bao Weimin, Li Huifeng, et al. Attitude Control Method within Finite Time Globally for Hypersonic Vehicles[J]. Journal of Beijing University of Aeronautics and Astronautics, 2016, 42(9): 1864–1873 (in Chinese)

