Search in:

International Journal of Control Latest Articles

Submit an article Journal homepage

Open access

385

Views

CrossRef citations to date

Altmetric

Research Article

Robustness improvement of optimal control in terms of RBFNN with empirical model reduction and transfer learning

Anni Zhaoa Department of Mechanical Engineering, School of Engineering, University of California, Merced, CA, USAView further author information

Arash Toudeshkia Department of Mechanical Engineering, School of Engineering, University of California, Merced, CA, USAView further author information

Reza Ehsania Department of Mechanical Engineering, School of Engineering, University of California, Merced, CA, USAView further author information

Joshua H. Viersb Department of Civil & Environment Engineering, School of Engineering, University of California, Merced, CA, USAView further author information

Jian-Qiao Suna Department of Mechanical Engineering, School of Engineering, University of California, Merced, CA, USACorrespondence[email protected]
View further author information

Received 06 Mar 2023, Accepted 05 Mar 2024, Published online: 21 Mar 2024

Cite this article
https://doi.org/10.1080/00207179.2024.2328687
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Figures & data

Figure 1. Flowchart of the RBFNN optimal control algorithm with model reduction and transfer learning for linear systems.

Figure 2. Quanser-Servo2 Inverted Pendulum system hardware setup (Quanser, Citation2022).

Table 1. Parameters of the rotary pendulum system.

Display Table

Table 2. Summary of reduced order model matrices from model-based gramians and empirical gramians.

Display Table

Table 3. Hankel singular values of the rotary pendulum.

Display Table

Figure 3. Top: Relative output errors of the reduced order model by the balanced truncation and empirical balanced truncation. Below: The input signal.

Figure 4. Tracking the square wave of the rotary army in simulations. In the legend, ‘LQR’ denotes the LQR control designed with the original model; ‘RBFNN’ denotes the RBFNN control designed with the model-based BT; ‘Empirical RBFNN’ denotes the RBFNN control designed with the empirical BT.

Figure 5. Comparisons of tracking response $θ (t)$ of the rotary army in simulations. In the legend, ‘RBFNN_Empirical’ denotes the RBFNN control designed with the empirical BT; ‘LQR_Empirical BT’ denotes the LQR control designed with the empirical BT; ‘LQR_Model-based BT’ denotes the LQR control designed with the model-based BT.

Figure 6. Comparisons of tracking response $α (t)$ of the rotary army in simulations. Legends are the same as in Figure .

Figure 7. The closed-loop tracking responses $θ (t)$ of the rotary arm of Quanser-Servo2. Legends are the same as in Figure .

Figure 8. The closed-loop responses $θ (t)$ of the rotary arm under various controls for balancing the inverted pendulum of Quanser-Servo2. Top: Responses before retraining. Bottom: Responses after retraining. Legends are the same as in Figure .

Figure 9. The closed-loop tracking response $θ (t)$ of the rotary arm of Quanser-Servo2. Legends are the same as in Figure .

Figure 10. The closed-loop response $α (t)$ of the pendulum in the rotary arm tracking control of Quanser-Servo2. Legends are the same as in Figure .

Table 4. Summary of control performance for LQR, RBFNN, empirical RBFNN, retrained RBFNN and retrained empirical RBFNN.

Display Table

Figure 11. Robustness comparisons of all the controls under consideration. Top: The closed-loop angle response $θ (t)$ of the rotary arm in balancing control of Quanser-Servo2. Bottom: Disturbance $d (t)$ . Legends are the same as in Figure .

Figure 12. Disturbances to the second order nonlinear system.

Figure 13. Performance comparison of RBFNN, Poly-NN control and LQR controls for the nonlinear system in Equation (Equation61(61) $\begin{aligned} \begin{aligned} {\dot{x}}_{1} & = x_{1} + x_{2} - x_{1} (x_{1}^{2} + x_{2}^{2}) \\ {\dot{x}}_{2} & = - x_{1} + x_{2} - x_{2} (x_{1}^{2} + x_{2}^{2}) + u \end{aligned} \end{aligned}$ (61) ). Top: The control $u (t)$ . Middle: The response $x_{1} (t)$ . Bottom: The response $x_{2} (t)$ .

Figure 13. Performance comparison of RBFNN, Poly-NN control and LQR controls for the nonlinear system in Equation (Equation61(61) x˙1=x1+x2−x1(x12+x22)x˙2=−x1+x2−x2(x12+x22)+u(61) ). Top: The control u(t). Middle: The response x1(t). Bottom: The response x2(t).

Figure 14. Comparison of spatial distribution of RBFNN and Poly-NN optimal controls u as a function of the state $x$ . Left: The control $u (x)$ plotted in the training region $X_{s 1} \in [- 1, 1] \times [- 1, 1]$ . Right: The control $u (x)$ plotted beyond the training region into the larger region $X_{s 2} \in [- 2, 2] \times [- 2, 2]$ .

Figure 15. Robustness of the RBFNN and LQR controls with respect to the model uncertainty β. The vertical dash lines mark the critical value of β, beyond which the closed-loop system becomes unstable.

Figure A1. Comparison of RBFNNs and LQR control performances for the linear 2D system. Top: Control $u (t)$ . Middle: Response $x_{1} (t)$ . Bottom: Response $x_{2} (t)$ . The initial condition of the system is $x (0) = [1, 1]^{T}$ .

Quanser (2022). QUBE-Servo2. https://www.quanser.com/products/qube-servo-2/

Google Scholar

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Robustness improvement of optimal control in terms of RBFNN with empirical model reduction and transfer learning

Table 1. Parameters of the rotary pendulum system.

Table 2. Summary of reduced order model matrices from model-based gramians and empirical gramians.

Table 3. Hankel singular values of the rotary pendulum.

Table 4. Summary of control performance for LQR, RBFNN, empirical RBFNN, retrained RBFNN and retrained empirical RBFNN.

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Robustness improvement of optimal control in terms of RBFNN with empirical model reduction and transfer learning

Figures & data

Table 1. Parameters of the rotary pendulum system.

Table 2. Summary of reduced order model matrices from model-based gramians and empirical gramians.

Table 3. Hankel singular values of the rotary pendulum.

Table 4. Summary of control performance for LQR, RBFNN, empirical RBFNN, retrained RBFNN and retrained empirical RBFNN.

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date