Full article: Stabilization of the cart-inverted-pendulum system using trivial state-feedback to output-feedback control conversion

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

A new linear observer-free output-feedback controller with five adjustable parameters is proposed to stabilize the cart-inverted-pendulum system (CIP) at the unstable equilibrium point. The controller architecture is deduced from a trivial conversion of the linear state-feedback controller that is obtained using a two-step method. First, based on a set of cart change variables, a slightly modified state-feedback controller is developed. Then, the output-feedback controller is obtained through the judicious combination of the cart step reference input internal model and a convenient open-loop state estimator with the above modified state-feedback controller. The local stability of the output-based control system is conducted using the signature formulas method to get simplified conditions. A partial single parameter tuning method and optimal global single parameter tuning method are proposed for adjusting the controller gains to maximize a new efficiency-based objective function. Numerical simulations are first conducted to reveal the simplicity of output-feedback controller design using the partial tuning method, where the state-feedback gains are assumed to be known. Then, an optimal output-feedback controller is designed using the global tuning method. The proposed output-feedback controller is equivalent in terms of performance efficiency to the best five-parameter output-feedback two PID controller.

KEYWORDS:

1. Introduction

Being an under-actuated mechanical system and inherently open-loop unstable with non-minimum phase and fourth-order highly nonlinear dynamics, the cart-inverted-pendulum (CIP) system provides many challenging control problems to standard and modern control techniques [Citation1], especially in the absence of velocity measurements and in the presence of system uncertainties, measurement noises, and external disturbances. In the context of the CIP system stabilization, driving the cart from an initial position to a final destination while keeping the pendulum erected in the upright position during such movement is a well-studied problem, and many linear [Citation2–6] and nonlinear [Citation10–14,Citation20,Citation21] controllers, including state-feedback controllers (SFC), observer-based output-feedback controllers (OBC) and observer-free output-feedback controllers (OFC), have been proposed to solve it. However, achieving this task efficiently with reduced smooth control effort and simple parameter-tuning output-feedback control schemes is a subject that still needs more investigation.

Based on the linearized system dynamics using the standard pendulum small-angle approximations: $\sin x_{1} \approx x_{1}$ and $\cos x_{1} \approx 1$ , the fact that the obtained linear model is controllable, and the assumption of accessibility of the state vector, many linear SFC, including Two Proportional and Derivative (TPD) [Citation2–4] and Two Proportional–Integral–Derivative (TPID) [Citation4–6] controllers, have been proposed to stabilize the equilibrium. The under-actuating property of the CIP system and the interaction that exists between the pendulum and cart dynamics make it useful, if not necessary, to combine at least two structures in the same controller to solve the stabilization problem [Citation5,Citation6]. Such a combination leads to an increase in the control parameters and consequently complicates the tuning parameter problem. The exigent control system requirements have led to the application of different controller tuning methods, such as the pole placement design [Citation4], the Linear Quadratic Regulator (LQR) [Citation4], and the optimization methods [Citation7,Citation8]. The comparison of the full-state-feedback controllers has been conducted in [Citation9] and the main issues when using the above tuning methods are respectively: how to choose the optimal pole locations in the s-plane, the LQR criterion matrix gains, and the form of the objective function to be optimized?

Considering that the CIP system velocities are not (accurately) measured and the system, which is also subject to uncertainties, disturbances, and measurement noises, is observable with the set of position outputs, several (nonlinear) OBC and OFC have been proposed to stabilize the CIP system while addressing these difficulties to some extent. In [Citation10], an extended high-gain OBC with dynamic inversion and multi-time-scale structure was proposed to deal with uncertainties and parameter tuning difficulties. The stability analysis for the multi-time-scale structure was carried out using singular perturbation methods. The conducted numerical simulations showed that the above proposed nonlinear OBC recovers the performance of its associated nonlinear SFC when using some small enough time-scale control parameters and demonstrated a large region of attraction of the equilibrium. Practical experiments have also been conducted but with initial pendulum angles in the neighbourhood of the upright configuration to prevent the exceeding of the cart bounded tracks and the physical limitations of the motor torque. In [Citation11], a nonlinear OBC was proposed to deal with the pendulum input disturbance rejection and the CIP state velocity variable estimations. To achieve the nonlinear controller design, a simplified CIP model was considered, in which the pendulum and cart second-order dynamics are only coupled with the input control. The velocity estimation was done by the extended Kalman filter. Local asymptotic stability of the closed-loop CIP system was verified through the application of the eigenvalue analysis on the linearized CIP system. In [Citation12], the author introduced a nonlinear state observer in a static linear SFC loop to design an OBC. As such introduction may adversely affect the stability robustness of the system, i.e. the gain and phase margins, LTR (Loop Transfer Recovery) technique has been used to redesign the observer in such a way as to shape the loop gain properties to approximate, to some extent, those of LQR. The feasibility of the observer-based method was demonstrated in [Citation11] through practical experimentation and in [Citation12] through numerical simulations. Other OBC applied to the CIP system can be found in [Citation13,Citation14].

With the introduction of observers that have their tuning parameters, the (multi-structure) controller design problem becomes again harder. Of course, a reduced-order observer can be used to estimate the unavailable velocities and the set of tuning parameters will be reduced; but if the CIP system position measurements are contaminated by noise, sensitivity to measurement noise becomes an issue that cannot be ignored. To stabilize the CIP system, OFC that processes directly the accessible CIP system positions, i.e. the pendulum angle and the cart position, without the (explicit) estimation of the state variable, is a promising approach. The progress in solving (linear and nonlinear) output-feedback regulation problems, has made it possible to design such an OFC for a wide class of systems to ensure, in addition to closed-loop stability, asymptotic tracking, and disturbance rejection for a class of reference inputs and disturbances. Some of the seminal works on the subject are given in [Citation15] and [Citation16]. The book [Citation17] that addresses this general topic gives a rigorous formulation of the problem with its solvability conditions in terms of the existence of a solution to a set of algebraic equations when the system dynamics are linear and to a mixed algebraic partial differential equation when the system dynamics are nonlinear. These equations, known as the regulator equations, are in practice simple to solve for the linear dynamic situation but very difficult (if not impossible) to solve for the nonlinear dynamical situation. Methods for finding an approximate solution to the last case, such as the Taylor series expansion, were also investigated and applied to the linear and rotary CIP systems in [Citation18] and [Citation19,Citation20], respectively. Notice that the above theory can help in specifying the architecture of either the SFC or OFC. Notice also that the controller tuning parameter issue is common to all design methods; but with an OFC, a reduced set of control parameters renders this approach more attractive from the implementation point of view.

Regarding the facts that (i) there exists a linear SFC with an appropriate fourth gain vector (i.e. a TPD) that can successfully stabilize the system to its unstable equilibrium position for the considered set-point stabilization problem (due to linearized CIP system controllability, see Assumption 2.4 in section 2), and that (ii) an appropriate OFC (with fewer parameters in comparison to an OBC) can handle efficiently the classical SFC drawbacks, we propose in this paper the conversion of the above linear static SFC into a new linear dynamic OFC, having only five tuning parameters. The controller is derived using a two-step method. First, a slightly modified SFC is derived with the introduction of a set of cart change variables [Citation3]. Then, the proposed OFC is obtained through the introduction of a convenient parameter tuning-free open-loop state estimator and the cart step reference input internal model in the modified SFC loop. The classical drawbacks of the linear SFC, namely its initial-time high control effort demand, sensitivity to sensor noises, output transient response peaking phenomenon, and implementation difficulties due to the absence of (accurate) velocity measurements, can now be handled efficiently with the proposed control scheme conversion if an appropriate parameter tuning method is used. On the other hand, concerning the motivations for using the output-feedback linear controller instead of commonly used nonlinear OBC [Citation10–14] and sliding mode controllers [Citation21,Citation22], they come from: i) the stabilization task itself that can be also well conducted using robust linear controllers (see Theorem 4.4 in [Citation24] and the simulation results of section 4) when the conditions on the size of the uncertainties and the size of the required domain of attraction are not critical; ii) from the desirable smooth input control (i.e. absence of chattering effect) that helps in extending the life of the CIP system; iii) from the possibility to convert easily an already defined SFC to an OFC with the proposed method; and finally, iv) from the simplicity to conduct efficient OFC parameter tuning with the proposed partial or global single parameter tuning methods (see sections 3.3 and 4). In contrast to the existing linear control methods to stabilize the CIP system, this work has the following main distinguishing features:

From the theoretical point of view, i) our method proposes a simple new linear OFC derived from a comprehensive conversion of linear SFC in the hope to stabilize the CIP system; ii) The existence of the proposed OFC is confirmed by the explicit conditions of Theorem 3.2; Thus, there is no need to solve a set of (algebraic) equations as in the linear output-feedback regulation approach to check the solvability of the problem and/or to derive the controller architecture; Finally, iii) concerning the controlled CIP system stability, and assuming that linearized CIP system around the considered upper unstable equilibrium controllable (see Assumption 2.4 in section 2), we have only conducted a local stability analysis based on the Lyapunov indirect method by applying a new emergence method, i.e. the signature formulas method [Citation23], on the obtained closed-loop linearized CIP system. The above-adopted method is intentionally used to get easily exploitable conditions, as it is stated in Theorem 3.2.
From the practical point of view, i) the proposed OFC has only five parameters (i.e. the conversion task increases by only one parameter the four SFC gains) that can be reduced to a single independent parameter when the SFC gains are assumed to be known or when the closed-loop coincident real pole configuration is adopted; ii) The proposed global (single parameter) tuning method, which is associated with the closed-loop coincident real pole configuration, is adopted as an alternative to the proposed partial (single parameter) tuning method, which is associated with the situation of known SFC gains, if the above tuning method shows limited performance (low speed efficiency, for example); The adopted single parameter tuning method renders our design, from the difficulty point of view, no more demanding than the design of a proportional controller; In addition to that, the optimality of the obtained solution for a given criterion can easily be checked; Finally, iii) the proposed definitions for the speed and peak efficiencies allow performing not only concrete control method comparison but also help in building simple and valuable efficiency-based objective function for controller parameter tuning.

The paper is organized as follows. The problem under consideration is stated in section 2. The design of the OFC is presented in section 3. Section 4 provides simulation results, and finally, section 5 summarizes the paper's conclusions.

2. Problem formulation

Consider the underactuated CIP system depicted in Figure . This system is described by the model [Citation5]: (1) $\underset{\dot{x}}{\underset{⏟}{[\begin{matrix} {\dot{x}}_{1} \\ {\dot{x}}_{2} \\ {\dot{x}}_{3} \\ {\dot{x}}_{4} \end{matrix}]}} = \underset{f}{\underset{⏟}{[\begin{matrix} x_{2} \\ + f_{1} x_{1} \\ x_{4} \\ - f_{2} x_{1} \end{matrix}]}} + \underset{h}{\underset{⏟}{[\begin{matrix} 0 \\ - h_{1} \\ 0 \\ + h_{2} \end{matrix}]}} u$ (1) with the cart control input force $u (t)$ and the nonlinear functions $f_{1}$ , $f_{2}$ , $h_{1}$ , and $h_{2}$ defined hereafter (2) $\begin{array}{l} f_{1} = \frac{(M + m) g L^{- 1} - m x_{2}^{2} \cos x_{1}}{M + m - m \cos^{2} x_{1}} \frac{\sin x_{1}}{x_{1}}; \\ h_{1} = \frac{L^{- 1} \cos x_{1}}{M + m - m \cos^{2} x_{1}} \\ f_{2} = \frac{m g \cos x_{1} - m L x_{2}^{2}}{M + m - m \cos^{2} x_{1}} \frac{\sin x_{1}}{x_{1}}; \\ h_{2} = \frac{1}{M + m - m \cos^{2} x_{1}} \end{array}$ (2) where $g$ is the gravity acceleration, $x_{1}$ is the angular position of the pendulum with the origin at the upright position, $x_{2}$ is the pendulum angular velocity, $x_{3}$ and $x_{4}$ are the cart position and velocity. The inverted pendulum is characterized by a mass-less pole of length $L$ and a ball of mass $m$ . The cart has a mass $M$ and moves under the action of $u (t)$ left or right on a one-dimensional horizontal bounded track.

Figure 1. The inverted pendulum system.

The following assumptions are considered:

Assumption 2.1:

The nonlinear functions $f_{1}$ , $f_{2}$ , $h_{1}$ , and $h_{2}$ satisfy the bounded conditions: (3) $\begin{aligned} \begin{array}{l} 0 < f_{1} < a_{1}; 0 < f_{2} < a_{2} \\ 0 < h_{1} < b_{1}; 0 < h_{2} < b_{2} \end{array} \end{aligned}$ (3) (4) $\begin{aligned} \begin{array}{l} x_{2}^{2} + x_{4}^{2} + (a_{1} - f_{1})^{2} + (a_{2} - f_{2})^{2} < γ_{f}^{2} \\ (b_{1} - h_{1})^{2} + (b_{2} - h_{2})^{2} < γ_{h}^{2} \end{array} \end{aligned}$ (4) where $a_{1}$ , $a_{2}$ , $b_{1}$ , $b_{2}$ , $γ_{f}$ , and $γ_{h}$ are known positive numbers.

In system equations (1), we consider a space domain $D \subset R^{4}$ in which the $x_{1}$ , $x_{2}$ and $x_{4}$ variable’ domains are implicitly defined using the bounded conditions (3) and (4) while the $x_{3}$ variable’ domain is explicitly defined using $| x_{3} | < w_{3}$ , where $2 w_{3}$ is the track length. From (3), it is observed that $h_{1} > 0$ implies that $| x_{1} | < π / 2$ while $f_{2} > 0$ implies $f_{1} > 0$ and $| x_{2} | < \sqrt{g L^{- 1} \cos x_{1}}$ . With an appropriate choice of $a_{1}$ , $a_{2}$ , $b_{1}$ , $b_{2}$ , $γ_{f}$ , and $γ_{h}$ , the bounded conditions in (3) and (4) consider the pendulum in the neighbourhood of its top unstable equilibrium position and specify the considered range of nonlinear model uncertainties.

Assumption 2.2:

The CIP physical parameters $M$ , $m$ and $L$ are known from direct measurements.

In Assumption 2, we have considered that the CIP physical parameters are known to allow the problem of conversion to be solved easily without considering parameter uncertainties. Even if this assumption appears unreasonable to some extent, it is used in some recent papers, including [Citation2] and [Citation6] that are used in section 4. In addition to that experimental results conducted in [Citation2] shows to support such a hypothesis to some extent for an SFC. On the other hand, Theorem 2.1 stated below shows some potential to take the CIP parameter uncertainties into account through the bounded conditions (4) but till now it is not clear to us how to conduct the trivial conversion in such a situation, so this issue is not yet undertaken in the present paper.

Assumption 2.3:

Small-angle approximations, i.e. $\sin x_{1} \approx x_{1}$ and $\cos x_{1} \approx 1$ , and the small-angle velocity approximation, i.e. $x_{2}^{2} << g L^{- 1}$ are considered.

Assumptions 2 and 3 are used to derive the following approximate CIP linear model: (5) $\begin{aligned} \underset{\dot{x}}{\underset{⏟}{[\begin{matrix} {\dot{x}}_{1} \\ {\dot{x}}_{2} \\ {\dot{x}}_{3} \\ {\dot{x}}_{4} \end{matrix}]}} & = \underset{A}{\underset{⏟}{[\begin{matrix} 0 & 1 & 0 & 0 \\ + a_{1} & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ - a_{2} & 0 & 0 & 0 \end{matrix}]}} \\ \times \underset{x}{\underset{⏟}{[\begin{matrix} x_{1} \\ x_{2} \\ x_{3} \\ x_{4} \end{matrix}]}} + \underset{B}{\underset{⏟}{[\begin{matrix} 0 \\ - b_{1} \\ 0 \\ + b_{2} \end{matrix}]}} u \end{aligned}$ (5) with the upper bounds of the nonlinear functions in (3), i.e. $a_{1}$ , $a_{2}$ , $b_{1}$ , and $b_{2}$ , defined hereafter: (6) $\begin{aligned} a_{1} & = (1 + \frac{m}{M}) g L^{- 1}; a_{2} = \frac{m}{M} g; \\ b_{1} & = \frac{1}{L M}; b_{2} = \frac{1}{M} \end{aligned}$ (6)

Assumption 2.4:

Suppose that $A - B K_{0}$ is asymptotically stable for a gain vector $K_{0}$ .

Assumption 2.5:

Suppose that $γ_{h} < 1$ and $α_{f} < λ_{min} (Q) / λ_{max} (P)$ where $P = P^{T} > 0$ is the solution of the Lyapunov matrix equation $(A - B K_{0})^{T} P + P (A - B K_{0}) = - 2 Q$ for some $Q = Q^{T} > 0$ .

The existence of a class of linear state-feedback controllers that can stabilize the nonlinear model (1) in a portion of the domain $D$ is crucial. Otherwise, there is no meaning that can be attributed to the conversion process from a state-feedback to an output-feedback controller. Assumption 2.4 considers the linearized model (5) controllable and it is effectively the case since $det ([\begin{matrix} B & A B & A^{2} B & A^{3} B \end{matrix}]) \neq 0$ . Therefore, there exist linear state-feedback control laws with appropriate gain vectors that can successfully stabilize the system (1) to its unstable equilibrium position. Assumption 2.5 states supplementary conditions concerning the approximate region of attraction where the considered controller can effectively achieve the stabilization task. The justification for using linear state-feedback controllers to stabilize the nonlinear system (1) is stated in the following theorem, which is a simple adaptation of Theorem 4.4 in [Citation24].

Theorem 2.1:

Under Assumptions 2.4 and 2.5, and the constraint $γ > 0$ , the linear state-feedback $u = - (K_{0} + γ B^{T} P) x$ stabilizes the nonlinear system (1) for arbitrary nonlinear functions $f_{1}$ , $f_{2}$ , $h_{1}$ , and $h_{2}$ that satisfy the norm bounds (4).

Proof:

The proof of theorem 2.1 is quite similar to that presented for Theorem 4.4 in [Citation24], and it is based on the Lyapunov method.

Assumption 2.6:

Absence of (accurate) velocity measurements.

Assumption 2.7:

The pendulum angle is contaminated by an additive high-frequency band noise $e_{1}$ while the cart position is measured without any errors (or at least the errors are so small so that they can be neglected).

The nonlinear model (1) is completed by the measured outputs: (7) $\begin{array}{l} x_{1 m} = x_{1} + e_{1} \\ x_{3 m} = x_{3} \end{array}$ (7) For the CIP system stabilization, the goal is to drive the cart as quickly as possible from an initial position $x_{3} (0)$ to a constant final destination $x_{3 d}$ without significant overshoot, undershoot, and control input effort while keeping the pendulum erected in the upright position, during such movement. To achieve this task, Theorem 2.1 states that under full-state availability, one possible solution is to use a linear SFC with an appropriate gain vector $N = (N_{1}, N_{2}, N_{3}, N_{4})$ . In this case, the control law of the SFC, which is mathematically equivalent to a TPD, takes the form: (8) $\begin{aligned} l u (t) & = - N x (t) + N_{3} x_{3 d} \\ = - N_{1} x_{1} (t) - N_{2} x_{2} (t) - N_{3} [x_{3} (t) - x_{3 d}] \\ - N_{4} x_{4} (t) \end{aligned}$ (8) where $x = (x_{1}, x_{2}, x_{3}, x_{4})^{T}$ is the state variable vector.

The SFC gains of (8) can be adjusted to ensure several interesting steady and transient response behaviour for the CIP control system around the unstable equilibrium point [Citation2–6]. However, there are several practical issues to deal with to make it more efficient. The first one is the high control input demand that appears at the beginning of the stabilization, i.e. $u (0) = - N_{3} [x_{3} (0) - x_{3 d}]$ , which is proportional to the distance between the cart's initial position and its destination. The second one emerges with the absence of (accurate) velocity measurements (Assumption 2.6). The third one results from the high sensitivity of the state-feedback control input to the pendulum angle sensor noise (see Equations (7) and (8) and Assumption 2.7). The last one is related to the tuning parameter problem that must be solved to meet the stabilization requirements. Given the importance of SFC in the CIP stabilization (see Theorem 2.1) and the importance of the observer-free OFC in overcoming their drawbacks, it appears interesting to investigate the feedback design as an output regulation problem with a class of controllers that has some relation with the SFC. To apply such an idea, we may thus formulate the feedback controller design in two successive steps as follows.

In the first step, we consider the following class of (error) output-feedback controllers: (9) $\begin{array}{l} u = [\begin{matrix} 1 & 0 \end{matrix}] z \\ \dot{z} = H_{1} z + H_{2} {[\begin{matrix} 0 - x_{1 m} & x_{3 d} - x_{3 m} \end{matrix}]}^{T} \\ z (0) = {[\begin{matrix} 0 & 0 \end{matrix}]}^{T} \end{array}$ (9) where $z$ is a state vector and $H_{1}$ with $H_{2}$ are two constant matrices. Under the above-mentioned assumptions, design a control law of the form (9), or equivalently find $z$ , $H_{1}$ and $H_{2}$ , such that the following properties are satisfied:

Property 2.1:

The control law (9) must be interpreted as a conversion of the state-feedback control law (8).

Property 2.2:

The equilibrium point of the closed-loop nonlinear CIP system must be asymptotically stable, or equivalently, the closed-loop linearized CIP system is stable.

Remark 2.1:

Property 1 allows converting any SFC to an OFC of the form (9). With such a conversion, the first three above-mentioned SFC drawbacks can be efficiently addressed. Indeed, the controller in (9) receives only the errors in the pendulum angle and cart positions that are filtered from the noise before reaching its output. In addition, the force applied at the beginning of stabilization is now null due to the zero initial conditions used in the controller state variables.

In the second step, the goal is to set the parameters $φ$ of the proposed OFC to ensure a set of possible transient response requirements like fast cart response (reduced cart settling time), good stability (reduced cart overshoot, cart undershoot, and pendulum overshoot), and minimum control effort (reduced maximum control input effort). To solve such a problem, two issues must be addressed. The first one concerns the criterion choice while the second one concerns the adopted method to optimize it. Although there are several commonly used objective functions, we have found it useful from the comparison and the optimization point of view to build our own objective function $J (φ)$ as the minimum between two new sound performance indices. These indices are the speed efficiency $S_{E} (φ)$ and the average peak efficiency $S_{E} (φ)$ , and are defined for a given standard (reference) state-feedback method (SSF) and a given control method (CM), as follows: (10) $\begin{aligned} S_{E} (φ) & = 100 \frac{P_{4, S S F}}{P_{4, C M} + P_{4, S S F}} \\ G_{E} (φ) & = \frac{100}{3} \sum_{i = 1}^{3} \frac{P_{i, S S F}}{P_{i, C M} + P_{i, S S F}} \end{aligned}$ (10) where $P_{1, C M}$ is the maximum absolute value $P_{x 1}$ of the pendulum angle response, $P_{2, C M}$ is the maximum absolute value between the overshoot $V_{x 3}$ and undershoot $D_{x 3}$ of the cart position response, $P_{3, C M}$ is the maximum absolute value $P_{u}$ of the control input signal, and $P_{4, C M}$ is the cart settling time $t_{c s}$ at 5%. The index $P_{i, S S F}$ has the same interpretation as the index $P_{i, C M}$ , with the control method CM replaced by SSF.

The adopted definitions of speed and peak efficiencies allow not only concrete control method comparison but also help in building simple and efficient criteria for parameter tuning. Indeed, using such definitions, one can directly interpret the control method performance and indicate clearly if a method outperforms another one or not. For example, method CM outperforms completely the reference method SSF if both efficiencies are greater than 50%. The method CM is bad compared to SSF if both efficiencies are lower than 50%. In the remaining two cases, it is possible to identify the advantage the drawback of the CM over the SSF in terms of speed and peak efficiencies. With such an interpretation, we may formulate the CM setting problem as a maximin optimization model as follows: (11) $\begin{aligned} φ_{o p t} & = \underset{φ}{\arg} max_{φ} J (φ) \\ J (φ) & = min (S_{E} (φ), G_{E} (φ)) \end{aligned}$ (11) It should be noted that (11) is a continuous nonlinear optimization model with a highly nonlinear (possibly discontinuous) objective function $J (φ)$ and it is difficult to know a priori whether such a function is unimodal or multimodal before starting the optimization. To avoid erroneous solutions, problem (11) has to be solved to global optimality in the considered parameter space domain. However, the direct search over all the stabilizing controllers is cumbersome, and, given the complexity of the objective function, there is no guarantee to obtain the optimal solution with standard global search techniques, especially when the number of tuning parameters is large. To reduce the challenge of this problem, we look for reducing the space search to develop a simple tuning method that gives suboptimal guarantee solutions but can be successfully applied to parameter tuning of other types of CIP system controllers, especially the TPD, the TPID, and the proposed OFC.

3. Controller design

To solve the above problems, we proceed constructively and simply. In the first step, we propose a modified version of the given SFC based on an appropriate cart change variable, which is initially proposed in [Citation3]. Then, we convert the obtained modified SFC to an OFC and determine the conditions under which the final obtained linearized closed-loop system is stabilized. The conversion needs the development of a standard controller (state-feedback, state estimator, and internal model) as an intermediate step. In the second step, we propose two efficient methods to solve the parameter tuning problem.

3.1. Modified state-feedback controller

To make the analysis and processing of the approximate linear model (5) more general, let us reveal the link that exists between the parameters $a_{1}$ and $a_{2}$ as follows: (12) $\begin{aligned} a_{2} & = α_{1} a_{1} - β_{1}; α_{1} = \frac{b_{2}}{b_{1}} = L; \\ β_{1} & = \frac{a_{1} b_{2} - a_{2} b_{1}}{b_{1}} = g \end{aligned}$ (12) where $α_{1}$ and $β_{1}$ are two well-known positive parameters. Now, let us define the following change of variables [Citation3] (13) $\begin{aligned} y_{3} & = x_{3} + α_{1} x_{1} \\ y_{4} & = x_{4} + α_{1} x_{2} \end{aligned}$ (13)

Combining (5), (12), and (13), we obtain the following simplified linear model: (14) $\begin{array}{l} {\dot{x}}_{1} = x_{2} \\ {\dot{x}}_{2} = + a_{1} x_{1} - b_{1} u \\ {\dot{y}}_{3} = y_{4} \\ {\dot{y}}_{4} = β_{1} x_{1} \end{array}$ (14)

The above model has only three parameters that are summarized for convenience as follows: (15) $\begin{aligned} a_{1} & = (1 + \frac{m}{M}) g L^{- 1}; b_{1} = \frac{1}{L M}; β_{1} = g \end{aligned}$ (15)

Applying the Laplace transform to (14) yields a modified open-loop CIP system composed of two sub-systems $H_{1} (s)$ and $H_{3} (s)$ in cascade. These sub-systems are unstable and are characterized by the transfer functions: (16) $\begin{aligned} H_{1} (s) & = \frac{X_{1} (s)}{U (s)} = \frac{- b_{1}}{s^{2} - a_{1}} \end{aligned}$ (16) (17) $\begin{aligned} H_{3} (s) & = \frac{Y_{3} (s)}{X_{1} (s)} = \frac{β_{1}}{s^{2}} \end{aligned}$ (17)

At this stage we modify slightly (8) to get the following new state-feedback control law: (18) $\begin{aligned} u (t) & = - N_{1} x_{1} (t) - N_{2} x_{2} (t) - N_{3} [y_{3} (t) - x_{3 d}] \\ - N_{4} y_{4} (t) \end{aligned}$ (18)

Combining the Laplace transform of (18) with (16) and (17) yields the following closed-loop transfer functions: (19) $\begin{aligned} F_{1 M S F} (s) & = \frac{X_{1} (s)}{X_{3 d} (s)} \\ = \frac{- b_{1} N_{3} s^{2}}{\begin{matrix} s^{4} - b_{1} N_{2} s^{3} - (b_{1} N_{1} + a_{1}) s^{2} \\ - β_{1} b_{1} N_{4} s - β_{1} b_{1} N_{3} \end{matrix}} \\ F_{3 M S F} (s) & = \frac{Y_{3} (s)}{X_{3 d} (s)} \\ = \frac{- β_{1} b_{1} N_{3}}{\begin{matrix} s^{4} - b_{1} N_{2} s^{3} - (b_{1} N_{1} + a_{1}) s^{2} \\ - β_{1} b_{1} N_{4} s - β_{1} b_{1} N_{3} \end{matrix}} \end{aligned}$ (19) The obtained closed-loop linearized CIP system is of the fourth order and has the basic configuration shown in Figure , where the real cart position $x_{3}$ is substituted by the modified position $y_{3}$ . Notice that according to the necessary condition of stability of the system (19), the SFC gains $N = (N_{1}, N_{2}, N_{3}, N_{4})$ must be real and negative.

Figure 2. Configuration of the modified state-feedback control.

Remark 3.1

Regarding the transformation (13), the modified control law (18) can be viewed as a conventional state-feedback law (8), in which the gain vector $N = (N_{1}, N_{2}, N_{3}, N_{4})$ is substituted with $N_{n e w} = (N_{1} + α_{1} N_{3}, N_{2} + α_{1} N_{4}, N_{3}, N_{4})$ . This means that the first two gains are increased by the amounts $α_{1} N_{3}$ and $α_{1} N_{4}$ , respectively. The well-known zeros of the cart transfer function (see Eq. B.1 in appendix B) associated with the original cart position variable $x_{3}$ are cancelled when using the new cart position variable $y_{3}$ , as indicated in (19). As shown in the next section, the control law modification contributes to making the conversion from the SFC to an OFC a feasible task.

3.2. Proposed output-feedback controller

Now, we assume that the pendulum and the cart velocities are not measurable and that the pendulum angle is measured with noise as indicated by (7). An OFC can be obtained by combining the modified SFC (18), with an internal model $T_{3} (s)$ and a state estimator $R$ , as shown in Figure . The parts of the proposed servo system are described in the following.

Figure 3. CIP servo system with internal model and state estimator.

To improve tracking for the step reference input $x_{3 d}$ an internal model $T_{3} (s)$ , in the form of a simple integrator, is employed in the control system. This integrator is associated with a positive gain $K$ . The new reference input $r$ resulting from such internal model incorporation satisfies: (20) $\dot{r} = K (x_{3 d} - x_{3 m})$ (20) In view of the approximate linear model (14), a reasonable proposed structure for the state estimator $R$ is given by the following open-loop state estimator: (21) $\begin{array}{l} {\dot{\hat{x}}}_{1} = {\hat{x}}_{2} \\ {\dot{\hat{x}}}_{2} = a_{1} x_{1 m} - b_{1} u \\ {\dot{\hat{y}}}_{3} = {\hat{y}}_{4} \\ {\dot{\hat{y}}}_{4} = β_{1} x_{1 m} \end{array}$ (21) where $({\hat{x}}_{1}, {\hat{x}}_{2}, {\hat{y}}_{3}, {\hat{y}}_{4})^{T}$ is the state estimate of $(x_{1}, x_{2}, y_{3}, y_{4})^{T}$ . The state estimator (21) introduces in (14) one modification, which consists in substituting on the right-hand side of (14) the unmeasured state $x_{1}$ by the noisy measured signal $x_{1 m}$ .

Now, let us introduce the state vector $z = {[\begin{matrix} z_{1} & z_{2} \end{matrix}]}^{T}$ with: (22) $\begin{aligned} \begin{array}{l} z_{1} = u \\ z_{2} = N_{1} {\hat{x}}_{2} (t) + N_{3} {\hat{y}}_{4} \end{array} \end{aligned}$ (22)

According to Figure , the control signal is given by: (23) $\begin{aligned} u & = z_{1} = - N_{1} {\hat{x}}_{1} (t) - N_{2} {\hat{x}}_{2} (t) \\ - N_{3} {\hat{y}}_{3} (t) - N_{4} {\hat{y}}_{4} (t) + N_{3} r \end{aligned}$ (23)

Taking the time-derivative of (23) and combining it with (20), (21) and (22) yields the control law (9) with: (24) $H_{1} = [\begin{matrix} - K_{1} & - 1 \\ + K_{2} & 0 \end{matrix}], H_{2} = [\begin{matrix} - K_{3} & + K_{5} \\ + K_{4} & 0 \end{matrix}]$ (24) where the parameters of the above matrices are defined by the following simple canonical conversion: (25) $\begin{array}{l} K_{1} = - b_{1} N_{2}; K_{2} = - b_{1} N_{1} \\ K_{3} = a_{1} N_{2} + β_{1} N_{4}; K_{4} = a_{1} N_{1} + β_{1} N_{3} \\ K_{5} = K N_{3} \end{array}$ (25) Notice that $K_{1}$ and $K_{2}$ are real and positive while $K_{3}$ , $K_{4}$ and $K_{5}$ are real and negative. The obtained CIP closed-loop system has the basic configuration shown in Figure , where the proposed OFC is defined by (9), (24), and (25).

Figure 4. Configuration of the proposed output-feedback-based CIP servo system.

With the transformation (25), Property 1 is satisfied and it remains to fulfil the local stability in Property 2. To this end, let us evaluate the closed-loop linearized CIP system transfer functions. In the noiseless situation, combining (16), (17) and (24) with the Laplace transform of (9) gives the following closed-loop transfer functions: (26) $\begin{aligned} \begin{aligned} F_{1 O F} (s) & = \frac{X_{1} (s)}{X_{3 d} (s)} = \frac{- b_{1} K_{5} s^{2}}{\begin{matrix} s^{5} + K_{1} s^{4} + (K_{2} - a_{1}) s^{3} \\ - (b_{1} K_{3} + a_{1} K_{1} - α_{1} b_{1} K_{5}) s^{2} \\ - (b_{1} K_{4} + a_{1} K_{2}) s - β_{1} b_{1} K_{5} \end{matrix}} \\ F_{3 O F} (s) & = \frac{Y_{3} (s)}{X_{3 d} (s)} = \frac{- β_{1} b_{1} K_{5}}{\begin{matrix} s^{5} + K_{1} s^{4} + (K_{2} - a_{1}) s^{3} \\ - (b_{1} K_{3} + a_{1} K_{1} - α_{1} b_{1} K_{5}) s^{2} \\ - (b_{1} K_{4} + a_{1} K_{2}) s - β_{1} b_{1} K_{5} \end{matrix}} \end{aligned} \end{aligned}$ (26) Assuming that the gains $N_{i}$ , $1 \leq i \leq 4$ , of the state-feedback controller are known, it is possible to evaluate directly four of the five parameters of the proposed OFC via the transformation (25). In this situation, there is only a single tuning parameter $K$ , or equivalently $K_{5}$ , that is adjusted to ensure the stability of the systems $F_{1 O F} (s)$ and $F_{3 O F} (s)$ , i.e Property 2. The stability conditions are given in the following theorem.

Theorem 3.2:

Suppose that the state-feedback controller gains $N_{i}$ , $1 \leq i \leq 4$ , are negative. Under the assumption $Δ = (K_{2} - a_{1})^{2} + 4 (b_{1} K_{4} + a_{1} K_{2}) > 0$ , the closed-loop systems defined by (26) are stable if the next condition holds (27) $\begin{array}{l} K_{min} < K < K_{max} \\ K_{min} = N_{3}^{- 1} min_{ω^{2} \in {0, ω_{2}^{2}}} R (ω^{2}) \\ K_{max} = N_{3}^{- 1} R (ω_{1}^{2}) \end{array}$ (27) with (28) $\begin{aligned} R (ω^{2}) & = \frac{K_{1} ω^{2} + a_{1} K_{1} + b_{1} K_{3}}{α_{1} b_{1} ω^{2} + β_{1} b_{1}} ω^{2} \\ ω_{1}^{2} & = 0.5 (K_{2} - a_{1}) \\ - 0.5 \sqrt{{(K_{2} - a_{1})}^{2} + 4 (b_{1} K_{4} + a_{1} K_{2})} \\ ω_{2}^{2} & = 0.5 (K_{2} - a_{1}) \\ + 0.5 \sqrt{{(K_{2} - a_{1})}^{2} + 4 (b_{1} K_{4} + a_{1} K_{2})} \end{aligned}$ (28)

Proof:

This theorem results directly from the application of the signature formulas to (26). The complete proof is given in Appendix A.

Remark 3.3:

For the stability analysis of the fifth-order closed-loop systems (26), in contrast to the signature formulas, the classical Routh criterion leads to an intricate set of highly nonlinear conditions that are difficult to exploit.

To perform well, the control system should at least be able to reject the effect of the measurement noise. Such effect on the control input can be described by using the Laplace transform of (9), which yields: (29) $\begin{aligned} U (s) & = C_{1 O F} (s) (0 - X_{1 m} (s)) \\ + C_{3 O F} (s) (X_{3 d} (s) - X_{3 m} (s)) \end{aligned}$ (29) (30) $\begin{aligned} \begin{aligned} C_{1 O F} (s) & = \frac{K_{3} s + K_{4}}{s^{2} + K_{1} s + K_{2}} \\ C_{3 O F} (s) & = \frac{K_{5} s}{s^{2} + K_{1} s + K_{2}} \end{aligned} \end{aligned}$ (30) The measurement noise $e_{1} (t)$ is injected into the control input through the filter $C_{1 O F} (s)$ . This filter exhibits a roll-off of high-frequency measurement noise with a slope of −20 dB per decade. The roll-off begins at the peak frequency: (31) $f_{P} = \frac{1}{2 π} \sqrt{\begin{matrix} - {(\frac{K_{4}}{K_{3}})}^{2} \\ + \sqrt{{({(\frac{K_{4}}{K_{3}})}^{2} + K_{2})}^{2} - K_{1}^{2} {(\frac{K_{4}}{K_{3}})}^{2}} \end{matrix}}$ (31) Usually, the measurement noise $e_{1} (t)$ is of a high frequency; and if its bandwidth is located well above $f_{P}$ , the filter will be effective in reducing this noise.

Remark 3.4:

In the development of the proposed OFC, we have considered that the CIP physical parameters are known (Assumption 2.2) to allow the problem of conversion to be solved easily without considering parameter uncertainties. When the parameter uncertainties are of major concern, the proposed method is still able to define a control scheme with five parameters (see Figure ) that can be used to tackle directly the stabilization problem. In such a situation, the major concern may be how to tune the control scheme parameters without using the exact knowledge of the physical CIP parameters. Therefore, robust or adaptive controllers may be envisaged to solve such an issue.

Remark 3.5:

In the development of the proposed OFC, we have considered the open-loop state estimator (21) instead of a closed-loop state estimator. This rise some questions about its usefulness. Indeed, the cart integral controller and this estimator are only a tool to develop the OFC architecture without the addition of excessive tuning parameters. The estimator is not intended to achieve some perfect state estimations like the high-gain observers. However, regarding the fact that all the adjusted parts (internal model and state-feedback controller) and non-adjustable parts (state estimator and CIP system) in Figure contribute together to give the control CIP system a final overall performance, the good parameter tuning of the adjusted parts will try to compensate to some extent the deficiencies of the proposed open-loop state estimator. This is addressed in the next sections.

3.3. Partial and global parameter tuning methods

To specify the gains of the proposed OFC there are two main approaches. The first one relies on the use of the transformation (25) and assumes that the gains $N$ of the SFC are given. In this case, we have a single parameter to tune $φ = K$ to maximize the objective function $J (φ)$ that appears in the optimization model (11). This method is referred to as the partial parameter tuning method.

The alternative global parameter tuning method aims in adjusting simultaneously all the five output-feedback controller parameters. In such a situation, we have $φ = (K_{1}, K_{2}, K_{3}, K_{4}, K_{5})$ or $φ = (N_{1}, N_{2}, N_{3}, N_{4}, K)$ according to (25). To reduce the complexity of the above tuning problem while guaranteeing the optimality of the obtained solution, i.e avoid the repeatability problem associated with the use of optimization search methods [Citation7], we shall only consider a coincident real negative pole configuration for all the obtained linearized control CIP systems. The coincident pole is denoted by $p$ . With such a constraint, the controller gains can be analytically expressed with one single parameter $φ = p$ , which renders the tuning parameter an easy task. Applying this pole placement design directly on the closed-loop transfer functions (26) yields the following analytical gains for the proposed OFC: (32) $\begin{aligned} K_{1} & = - 5 p \\ K_{2} & = 10 p^{2} + a_{1} \\ K_{3} & = α_{1} {β_{1}}^{- 1} b_{1}^{- 1} p^{5} + 10 b_{1}^{- 1} p^{3} + 5 b_{1}^{- 1} a_{1} p \\ K_{4} & = - 5 b_{1}^{- 1} p^{4} - 10 b_{1}^{- 1} a_{1} p^{2} - b_{1}^{- 1} a_{1}^{2} \\ K_{5} & = {β_{1}}^{- 1} b_{1}^{- 1} p^{5} \end{aligned}$ (32) Since the gains (32) of the OFC depend only on the chosen real negative pole $p$ , solving the optimization problem (11) that tries to maximize $J (φ) = J (p)$ with standard optimization technique becomes an easy task. Also, from the optimality point of view, it is clear that the global parameter tuning method (32) outperforms is in most cases the partial parameter tuning method.

4. Numerical simulations

Consider the nonlinear CIP system (1) and (2) with a set of physical parameters $M = 2.4 kg$ , $m = 0.23 kg$ , $L = 0.36 m$ , $g = 9.81 m/ s^{2}$ , and a cart track length limited between $\pm 0.5 m$ [Citation5]. Numerical simulations are conducted in two separate sections to show the advantage of the proposed OFC with the partial and global tuning methods. In both sections, we adopt the SFC of [Citation2] as a reference control method and symbolize it by SSF. The first main reason for this reference choice is linked to its best performance, as is claimed by its authors, especially in terms of speed efficiency. The second main reason is related to the fact that the gains of such a method are described analytically and there is no free parameter to tune, as is indicated hereafter for the considered CIP model: (33) $\begin{array}{l} N_{1 S S F} = - 3 M g; N_{2 S S F} = - 3 M g z^{- 1} \\ N_{3 S S F} = - 0.2 M z^{2}; N_{4 S S F} = - 1.2 M z \end{array}$ (33) where $z = \sqrt{g L^{- 1}}$ .

In the first section, we are interested in using the partial parameter tuning method to design the proposed OFC. To this end, we shall study the impact of the proposed conversion (25) on the performances of the TPD and LQR state-feedback methods, which are presented in [Citation5] for the same CIP system, and on the performances of the SSF method. Notice that the choice of the gains for the TPD and LQR methods are done in [Citation5] using a trial and error method while the analytical gains (33) of the SSF method are derived from the optimization of a convenient criterion [Citation2]. The parameters of the above-cited state-feedback controllers are listed in Table . The goal of this section is to highlight the potential applicability of our conversion method (25) with a rudimentary partial parameter tuning method and the need to modify the initial SFC if the controlled CIP system shows limited performance.

Table 1. Gains of the state-feedback controllers.

Display Table

In the second section, we are interested in comparing the OFC, SFC, and TPID that are designed using the global tuning method within the context of coincident real pole configuration. This approach allows us to obtain rapidly a guaranteed optimality result for the tuned controller and avoid the repeatability problem that may occur when using advanced optimizing tuning methods [Citation7] in conjunction with non constrained pole configuration. The structure of the TPID is chosen to allow output-feedback processing and to have exactly five tuning parameters as in the proposed OFC. The set of obtained TPID controllers that have these properties [Citation6] are the PID-P and PI-PD controllers. Since the PI-PD with the coincident pole configuration shows very limited performance, we thus retain only the PID-P controller and refer to it hereafter as the PID controller. The derivation of the analytical expressions for the gains of the SFC and the considered PID controllers are given in Appendix B.

4.1. Partial tuning method design

To compare the performance of the SSF, LQR, and TPD controllers, the reference cart position $x_{3 d}$ is set to $0 m$ , and all the initial state values are set to zeros except the initial cart position which is set to $x_{3} (0) = - 0.1 m$ . The simulation results for the pendulum angle, cart position, and control input, without considering noise, are shown in Figures and , and Table . Regarding the fact that we are concerned with a stabilization problem with null references, the cart position, the pendulum angle, and the control effort in Figures and become effectively error signals and the used performance indices ( $P_{x 1}$ , $V_{x 3}$ , $D_{x 3}$ , $P_{u}$ ) can therefore naturally be interpreted as error indices. Now, it is observed that the TPD method exhibits the best $G_{E}$ of about $88 %$ , due to its relatively low peak values, at the price of a bad $S_{E}$ of $24 %$ . The LQR method exhibits a slightly better $G_{E}$ than the SSF method and has a nice and smooth cart response at the price of an increase in the control input effort and a slight degradation in $S_{E}$ .

Figure 5. Pendulum angle and cart position responses using SSF, LQR, and TPD methods.

Figure 6. Control input responses using SSF, LQR, and TPD methods.

Table 2. Performance comparison of the control methods.

Display Table

From the above-obtained results, it is clear that the state-feedback controllers are prone to high control effort demand that appears at the beginning of the pendulum regulation (Figure ) and to cart transient response (undershoot) peaking phenomenon (Figure ); and in the absence of velocity measurements, they cannot be applied directly to stabilize the CIP system. To overcome these difficulties the proposed OFC, defined by (9) and (25), is applied to all considered state-feedback controllers to get the output-feedback controllers SOF (standard output-feedback), MPD (modified TPD), and MLQ (modified LQR) associated to the state-feedback controllers SSF, TPD, and LQR, respectively. To apply our conversion technique, we need to tune the single parameter $K$ in such a way as to ensure closed-loop system stability and acceptable performance. To this end, a numerical investigation of stability conditions (27–28) for the above methods is performed and the obtained results are summarized in Table . A common zeros lower bounds $K_{min} = 0$ and distinct uppers bounds are noticed.

Table 3. Lower and upper bounds of $K$ for SOF, MPD, and MLQ methods.

Display Table

The effect of varying the parameter $K$ between $0.01$ and $1$ on $S_{E}$ and $G_{E}$ is shown in Figure . A clear decreasing trend for the average efficiency index $G_{E}$ with the increase of $K$ is observed for all considered CMs. The SOF and MLQ methods exhibit a similar $G_{E}$ trend. They perform better than the MPD method in the range $0.12 < K < 0.6$ . Concerning the efficiency index $S_{E}$ , an interesting increasing trend is observed for the low-value side of $K$ . The value of $K$ at which this increasing trend stop is a CM dependent. The MPD method presents the shortest interval and the SOF present the largest one. The SOF and MLQ methods exhibit similar $S_{E}$ trend in the interval $0.01 \leq K \leq 0.4$ .

Figure 7. Effect of $K$ on the speed and average peak efficiencies for SOF, MLQ, and MPD methods.

As a general remark, it is noted that the introduction of our partial tuning-based conversion technique to the SSF and LQR methods leads to a substantial improvement of $G_{E}$ at the price of an eventual degradation of $S_{E}$ . Among the studied CMs, the MPD method appears to be the worst one in term of $S_{E}$ and cannot achieve more than $18 %$ (Figure ). The SOF and MLQ methods show interesting $S_{E}$ performances; and if the parameter $K$ is appropriately adjusted, these methods can achieve the highest $S_{E}$ with a value of $37 %$ at the price of a relatively slightly low value of $G_{E}$ of $76 %$ (Figure ). Compared with the state-feedback controllers, the proposed output-feedback controllers have always better $G_{E}$ performance and relatively low $S_{E}$ performance. This means that the state-feedback $S_{E}$ performance can only be recovered to some extent with the proposed partial tuning-based output-feedback conversion technique when it is applied to the considered state-feedback controllers. The improvement of the speed of response of the CIP can be obtained using the global parameter tuning as discussed below in the next section.

The simulation results with $K = 0.4$ for the pendulum angle, cart position, and control input, without considering noise, for the studied partial tuning-based output-feedback controllers are shown in Figures and and Table . It is observed that the SOF and MLQ methods exhibit similar $S_{E}$ in the vicinity of $34.5 %$ , which is relatively large in comparison to $11.63 %$ of the MPD method. The introduction of the proposed OFC on the original SFC leads on the first hand to the decrease of the pendulum angle overshoot, the cart undershoot, and the control input overshoot, and on the other hand to the increase of the cart position overshoot and the cart settling time.

Figure 8. Pendulum angle and cart position responses using SOF, MLQ, and MPD methods.

Figure 9. Control input responses using SOF, MLQ, and MPD methods.

4.2. Global tuning method design

Table shows the effect of the pole $p$ on the performance of the SFC, OFC, and PID controllers. It is observed that the increase of $p$ leads for the above controllers to a decrease in the pendulum angle overshoot, the cart position overshoot, the cart position undershoot, and in the control input effort in addition to an increase in the cart settling time. The OFC and PID controller show quite similar performance. The SFC outperforms the other controllers in the cart position overshoot and the cart settling time but shows less interesting behaviours in the remaining transient control CIP system response characteristics. The obtained optimal gains for the above three controllers are listed in Table , where it is observed that the gains associated with the proposed OFC controllers appear to be somewhat higher than the other controller gains. Figure presents the effect of varying $p$ on the peak frequency $f_{P}$ of the filter $C_{1 O F} (s)$ and on the efficiency performance index $J (p) = min (S_{E}, G_{E})$ of the SFC and OFC controllers. The results associated with the PID method are not shown in Figure since there are very close to those of the OFC method. For a given pole $p > - 3.73$ , the OCF can recover at least 92% of the SFC efficiency while for a given pole $p < - 3.73$ , the OCF becomes more efficient than the OSC method. The best efficiency record for the SFC is about 55% at $p = - 3.55$ and the best efficiency record for the OFC is about 57.4% at $p = - 4.59$ . The OFC needs a somewhat higher bandwidth to recover the performance of the SFC as is indicated by the OFC peak frequency curve, which tends to increase with the decrease of the real negative pole $p$ . The OFC peak frequency is $1.8 Hz$ at $p = - 3.55$ and $2.3 Hz$ at $p = - 4.59$ .

Figure 10. Effect of $p$ on the peak frequency and on efficiency indices for SFC and OFC methods.

Table 4. Performance comparison of the SFC, OFC, and PID methods.

Display Table

Table 5. Optimal gains for the SFC, OFC, and PID methods.

Display Table

To check the ability of the SSF controller and the optimal SFC, OFC, and PID controllers to reject noise, a uniform random noise $e_{1} (t)$ with the values restricted to the interval $| e_{1} (t) | < A$ , with $A = 10^{- 3}$ , and sampling time $0.01 sec$ is added to the pendulum angle state over a time interval of $120 \sec$ . Table summarizes the obtained maximum absolute value for the CIP input and outputs over the last $100 \sec$ for each considered CM. It is observed that the SSF and optimal SFC methods tend to perform similarly. It is also observed that the optimal OFC outperforms the optimal PID in the performance of the control input while the optimal PID outperforms to some extent the optimal OFC in the performance of the CIP outputs. All the considered controllers give acceptable stabilization performance in the considered noisy situation. Figures illustrate graphically the impact of noise on the performance of the control system input and outputs for the above-considered controllers. The carried simulation results show the effectiveness of the proposed OFC in comparison to the SSF and its competitiveness with the best five-parameter PID controllers.

Figure 11. Pendulum angle responses in a noisy situation.

Figure 12. Cart position responses in a noisy situation.

Figure 13. Control input responses in a noisy situation.

Table 6. Performance comparison in noisy situation.

Display Table

5. Conclusion

Conventional CIP state-feedback controllers are prone to several difficulties. To deal with the high control effort demand that appears at the beginning of the pendulum regulation, the cart transient response peaking phenomenon, and implementation difficulties due to the absence of (accurate) velocity measurements, the given static state-feedback controller was converted to a second-order output-feedback controller by introducing a comprehensive transformation. Local system stability analysis is conducted using the signature formulas method to get simple conditions. The control scheme parameters were tuned using partial and global parameter tuning methods and compared with several well-known state and output-feedback controllers. Future work will be devoted to conducting a more rigorous stability analysis for the controlled nonlinear system; improving the performance of the developed output-feedback controller by optimizing its gains in the context of more efficient pole configurations; and relaxing certain assumptions used in the development of the proposed method. For taking the physical CIP parameter uncertainties into account, it may be interesting to exploit directly the obtained controller structure to build robust and/or adaptive controllers. It may be also useful to examine if the above-cited structure allows defining a more convenient sliding surface to reduce the impact of chattering when using a standard sliding mode controller to stabilize the CIP system. Finally, how to adapt the proposed method directly to the nonlinear CIP or other robotic systems without using a linearization step is also an interesting axis of research.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Ministry of Higher Education and Scientific Research of Algeria (PRFU A01L08UN210120220001), by PHC Tassili project (project code: 43790QB) and by Slovenian Research Agency (ARRS) with the Research Program P2-0219.

References

Boubaker O. The inverted pendulum benchmark in nonlinear control theory: a survey. Int J Adv Rob Syst. 2013;10:1–9.
Web of Science ®Google Scholar
Chatterjee S, Das SK. An analytical formula for optimal tuning of the state feedback controller gains for the cart-inverted pendulum system. IFAC-PapersOnLine. 2018;51(1):668–672.
Google Scholar
Messikh L, Guechi EH, Benloucif ML. Critically damped stabilization of inverted-pendulum systems using continuous-time cascade linear model predictive control. J Franklin Inst. 2017;354(16):7241–7265.
Web of Science ®Google Scholar
Shehu M, Ahmad MR, Shehu A, et al. LQR, double-PID, and pole placement stabilization and tracking control of single link inverted pendulum. In 2015 IEEE International Conference on Control System, Computing and Engineering (ICCSCE) (pp. 218-223). IEEE; 2015, November.
Google Scholar
Prasad LB, Tyagi B, Gupta HO. Optimal control of nonlinear inverted pendulum system using PID controller and LQR: performance analysis without and with disturbance input. Int J Autom Comput. 2014;11(6):661–670.
Google Scholar
Kuczmann M. Comprehensive survey of PID controller design for the inverted pendulum. Acta Technica Jaurinensis. 2019;12:55–81.
Google Scholar
Önen Ü, Cakan A, Ilhan I. Performance comparison of optimization algorithms in LQR controller design for a nonlinear system. Turkish J Electr Eng Comput Sci. 2019;27(3):1938–1953.
Web of Science ®Google Scholar
Saleem O, Rizwan M, Mahmood-ul-Hasan K. Self-tuning state-feedback control of a rotary pendulum system using adjustable degree-of-stability design. Automatika. 2021;62(1):84–97.
Web of Science ®Google Scholar
Krishnan TR. On stabilization of cart-inverted pendulum system: An experimental study. Master thesis, Department of electrical engineering, National Institute of technology, India; 2012.
Google Scholar
Lee J, Mukherjee R, Khalil HK. Output feedback stabilization of inverted pendulum on a cart in the presence of uncertainties. Automatica (Oxf). 2015;54:146–157.
Web of Science ®Google Scholar
Poloni T, Kolmanovsky I, Rohal’-Ilkiv B. Simple input disturbance observer-based control: case studies. J Dyn Syst Meas Contr. 2018;140:146–157.
Web of Science ®Google Scholar
Katariya AS. (2010). Optimal state-feedback and output-feedback controllers for the wheeled inverted pendulum system. Thesis, School of electrical and computer engineering, Georgia institute of technology.
Google Scholar
Ovalle L, Ríos H, Llama M. Robust output-feedback control for the cart-pole system: a coupled super-twisting sliding-mode approach. IET Control Theory Applic. 2019;13(2):269–278.
Web of Science ®Google Scholar
Aguilar-Ibáñez C, Suarez-Castanon MS, Cruz-Cortés N. Output feedback stabilization of the inverted pendulum system: a Lyapunov approach. Nonlinear Dyn. 2012;70:767–777.
Web of Science ®Google Scholar
Isidori A, Byrnes CI. Output regulation of nonlinear systems. IEEE Trans Autom Control. 1990;35:131–140.
Web of Science ®Google Scholar
Huang J, Chen Z. A general framework for tackling the output regulation problem. IEEE Trans Autom Control. 2004;49:2203–2218.
Web of Science ®Google Scholar
Huang J. Nonlinear Output Regulation: Theory and Applications. Philadelphia: SIAM; 2004.
Google Scholar
Huang J. Asymptotic tracking of a nonminimum phase nonlinear system with nonhyperbolic zero dynamics. IEEE Trans Autom Control. 2000;45:542–546.
Web of Science ®Google Scholar
Tzyh-Jong T, Sanposh P, Daizhan C, et al. Output regulation for nonlinear systems: some recent theoretical and experimental results. control systems technology. IEEE Trans. 2005;13:605–610.
Google Scholar
Postelnik L, Liu G, Stol K, et al. Approximate Output Regulation for a Spherical Inverted Pendulum, 2011 American Control Conference on O'Farrell Street, San Francisco, CA, USA June 29 - July 01; 2011.
Google Scholar
Adhikary N, Mahanta C. Integral backstepping sliding mode control for underactuated system: swing-up and stabilization of the cart-pendulum system. ISA Trans 213. 2013;52(6):870–880.
PubMed Web of Science ®Google Scholar
Coban R. Backstepping integral sliding mode control of an electromechanical system. Automatika. 2017;58(3):266–272.
Web of Science ®Google Scholar
Díaz-Rodríguez ID, Han S, Bhattacharyya SP. Analytical design of PID controllers. Cham, Switzerland: Springer International Publishing; 2019.
Google Scholar
Zak SH. System and control. Oxford: Oxford university press; 2003.
Google Scholar

Appendices

Appendix A: Proof of Theorem 3.2

The proof of Theorem 3.2 is conducted using the signature formulas method that is described in [23, p.40-53]. To this end, let us consider the characteristic polynomial of (26), which is without zeros on the imaginary axis and has a single tuning parameter. Then, write it in the following form:

(A1)

\begin{aligned} δ (s, K_{5}) & = δ_{e v e n} (s^{2}) + s δ_{o d d} (s^{2}) \\ δ_{e v e n} (s^{2}) & = K_{1} s^{4} - (b_{1} K_{3} + a_{1} K_{1} - α_{1} b_{1} K_{5}) s^{2} - β_{1} b_{1} K_{5} \\ δ_{o d d} (s^{2}) & = s^{4} + (K_{2} - a_{1}) s^{2} - (b_{1} K_{4} + a_{1} K_{2}) \end{aligned}

(A1) so that

(A2)

\begin{aligned} δ (j ω) & = δ_{r} (ω) + j δ_{i} (ω) \\ δ_{r} (ω) & = δ_{e v e n} (- ω^{2}) = K_{1} ω^{4} + (b_{1} K_{3} + a_{1} K_{1} \\ - α_{1} b_{1} K_{5}) ω^{2} - β_{1} b_{1} K_{5} \\ δ_{i} (ω) & = ω δ_{o d d} (- ω^{2}) = ω [ω^{4} - (K_{2} - a_{1}) ω^{2} \\ - (b_{1} K_{4} + a_{1} K_{2})] \end{aligned}

(A2)

It is clear from (A1) that only the even part of $δ (s, K_{5})$ depends on $K_{5}$ . Under the following condition: (A3) $Δ = (K_{2} - a_{1})^{2} + 4 (b_{1} K_{4} + a_{1} K_{2}) > 0$ (A3)

The real nonnegative zeros of $δ_{i} (ω)$ with odd multiplicities, i.e. $ω_{1}$ and $ω_{2}$ , with $ω_{0} = 0$ are given by: (A4) $\begin{aligned} ω_{0} & = 0 \\ ω_{1}^{2} & = \frac{K_{2} - a_{1} - \sqrt{{(K_{2} - a_{1})}^{2} + 4 (b_{1} K_{4} + a_{1} K_{2})}}{2} \\ ω_{2}^{2} & = \frac{K_{2} - a_{1} + \sqrt{{(K_{2} - a_{1})}^{2} + 4 (b_{1} K_{4} + a_{1} K_{2})}}{2} \end{aligned}$ (A4)

According to Lemma 2.1 and Theorem 2.1 of [Citation23], the Hurwitz signature of the five-order polynomial $δ (s, K_{5})$ is given by: (A5) $\begin{aligned} σ (δ) & = l - r = sgn [δ_{i} (0^{+})]^{} (sgn [δ_{r} (0)] - 2 sgn [δ_{r} (ω_{1})] \\ + 2 sgn [δ_{r} (ω_{2})]) \\ δ_{i} (0^{+}) & = 0^{+} \\ δ_{r} (0) & = - β_{1} b_{1} K_{5} \\ δ_{r} (ω_{1}) & = K_{1} ω_{1}^{4} + (b_{1} K_{3} + a_{1} K_{1} - α_{1} b_{1} K_{5}) ω_{1}^{2} - β_{1} b_{1} K_{5} \\ δ_{r} (ω_{2}) & = K_{1} ω_{2}^{4} + (b_{1} K_{3} + a_{1} K_{1} - α_{1} b_{1} K_{5}) ω_{2}^{2} - β_{1} b_{1} K_{5} \end{aligned}$ (A5) where $l$ is the number of stable poles, $r$ is the number of unstable poles, and $sgn [\cdot]$ is the signum function that is defined as follows: (A6) $sgn [x] = {\begin{cases} - 1 & if x < 0 \\ 0 & if x = 0 \\ + 1 & if x > 0 \end{cases}$ (A6)

From (A5) it is obvious that for the five-order polynomial $δ (s, K_{5})$ to be Hurwitz, it is necessary and sufficient to have: (A7) $\begin{aligned} δ_{r} (0) & = - β_{1} b_{1} K_{5} > 0 \\ δ_{r} (ω_{1}) & = K_{1} ω_{1}^{4} + (b_{1} K_{3} + a_{1} K_{1} - α_{1} b_{1} K_{5}) ω_{1}^{2} \\ - β_{1} b_{1} K_{5} < 0 \\ δ_{r} (ω_{2}) & = K_{1} ω_{2}^{4} + (b_{1} K_{3} + a_{1} K_{1} - α_{1} b_{1} K_{5}) ω_{2}^{2} \\ - β_{1} b_{1} K_{5} > 0 \end{aligned}$ (A7)

After some obvious independent algebraic manipulations, the inequalities of (A7) can be further reduced to: (A8) $\begin{array}{l} K_{5} < 0 \\ K_{5} > \frac{K_{1} ω_{1}^{4} + (b_{1} K_{3} + a_{1} K_{1}) ω_{1}^{2}}{α_{1} b_{1} ω_{1}^{2} + β_{1} b_{1}} \\ K_{5} < \frac{K_{1} ω_{2}^{4} + (b_{1} K_{3} + a_{1} K_{1}) ω_{2}^{2}}{α_{1} b_{1} ω_{2}^{2} + β_{1} b_{1}} \end{array}$ (A8)

From (A8) and $K_{5} = K N_{3}$ in eq (25), we get finally: (A9) $\begin{array}{l} K_{min} < K < K_{max} \\ K_{min} = N_{3}^{- 1} min_{ω^{2} \in {0, ω_{2}^{2}}^{}} R (ω^{2}) \\ K_{max} = N_{3}^{- 1} R (ω_{1}^{2}) \end{array}$ (A9) with (A10) $R (ω^{2}) = \frac{K_{1} ω^{2} + a_{1} K_{1} + b_{1} K_{3}}{α_{1} b_{1} ω^{2} + β_{1} b_{1}} ω^{2}$ (A10)

Appendix B: SFC and PID pole coincident-based gains

Let us first begin with the tuning of the SFC controller (8). Applying the Laplace transform to (5) and (8) yields the following closed-loop transfer functions: (B1) $\begin{aligned} \begin{aligned} F_{1 S F} (s) & = \frac{X_{1} (s)}{X_{3 d} (s)} = \frac{- b_{1} N_{3} s^{2}}{\begin{matrix} s^{4} - (b_{1} N_{2} - b_{2} N_{4}) s^{3} \\ - (b_{1} N_{1} - b_{2} N_{3} + a_{1}) s^{2} - b_{1} N_{4} β_{1} s \\ - b_{1} N_{3} β_{1} \end{matrix}} \\ F_{3 S F} (s) & = \frac{X_{3} (s)}{X_{3 d} (s)} = \frac{b_{1} N_{3} α_{1} (s^{2} - α_{1}^{- 1} β_{1})}{\begin{matrix} s^{4} - (b_{1} N_{2} - b_{2} N_{4}) s^{3} \\ - (b_{1} N_{1} - b_{2} N_{3} + a_{1}) s^{2} - b_{1} N_{4} β_{1} s \\ - b_{1} N_{3} β_{1} \end{matrix}} \end{aligned} \end{aligned}$ (B1)

For these sub-systems, the transfer function $F_{1 S F} (s)$ has a fixed double zero at the origin and the transfer function $F_{3 S F} (s)$ has two fixed zeros at $\pm \sqrt{α_{1}^{- 1} β_{1}}$ . Thus, whatever the method used to specify the state-feedback controller gains, it is always interpreted as a kind of pole placement. In addition, the presence of a positive zero makes the CIP system a nonminimum phase. Now, taking into account the usefulness of driving the cart system with little or no oscillation (under and overshoot) to $x_{3 d}$ , we propose to specify the gains $(N_{1}, N_{2}, N_{3}, N_{4})$ to obtain a coincident real negative pole $p$ structure for the targeted closed-loop system $F_{3 S F} (s)$ . In this situation, we get: (B2) $\begin{aligned} \begin{aligned} N_{1} & = - α_{1} β_{1}^{- 1} b_{1}^{- 1} p^{4} - 6 b_{1}^{- 1} p^{2} - b_{1}^{- 1} a_{1} \\ N_{2} & = + 4 α_{1} β_{1}^{- 1} b_{1}^{- 1} p^{3} + 4 b_{1}^{- 1} p \\ N_{3} & = - β^{- 1} b_{1}^{- 1} p^{4} \\ N_{4} & = + 4 β^{- 1} b_{1}^{- 1} p^{3} \end{aligned} \end{aligned}$ (B2)

The structure of the output-feedback TPID controller can be defined by [Citation6]: (B3) $\begin{aligned} U (s) & = C_{1 P I D} (s) (0 - X_{1 m} (s)) + C_{3 P I D} (s) \\ \times (X_{3 d} (s) - X_{3 m} (s)) \end{aligned}$ (B3) (B4) $\begin{aligned} \begin{aligned} C_{1 P I D} (s) & = \frac{K_{13} s^{2} + K_{11} s + K_{12}}{s (1 + T_{1} s)} \\ C_{3 P I D} (s) & = \frac{K_{33} s^{2} + K_{31} s + K_{32}}{s (1 + T_{3} s)} \end{aligned} \end{aligned}$ (B4) where $K_{1 P I D} = (K_{11}, K_{12}, K_{13}, T_{1})$ are the gains associated with the pendulum sub-system and $K_{3 P I D} = (K_{31}, K_{32}, K_{33}, T_{3})$ are the gains associated with the cart sub-system. There are 16 possible combinations of the TPID controllers that can be classified into unstable, redundant, underdetermined, and feasible setups [Citation6]. The only feasible TPID controllers that have five parameters (The same number of parameters of the proposed OFC) are the PID-P and the PI-PD [Citation6]. The filters associated with the considered PID-P (called PID in section 4) controllers are given by: (B5) $\begin{aligned} \begin{aligned} C_{1 P I D} (s) & = \frac{K_{13} s^{2} + K_{11} s + K_{12}}{s (1 + T_{1} s)} \\ C_{3 P} (s) & = K_{31} \end{aligned} \end{aligned}$ (B5)

Combining (B5), (40), and the Laplace transform of (11) gives the following five-order transfer functions: (B6) $\begin{aligned} \begin{aligned} F_{1 P I D - P} (s) & = \frac{X_{1} (s)}{X_{3 d} (s)} = \frac{A_{1 P I D - P} (s)}{D_{P I D - P} (s)} \\ F_{3 P I D - P} (s) & = \frac{X_{3} (s)}{X_{3 d} (s)} == \frac{A_{3 P I D - P} (s)}{D_{P I D - P} (s)} \end{aligned} \end{aligned}$ (B6) with (B7) $\begin{aligned} \begin{aligned} A_{1 P I D - P} (s) & = - K_{31} b_{1} (s + T_{1}^{- 1}) s^{2} \\ A_{3 P I D - P} (s) & = K_{31} (b_{2} s^{3} + T_{1}^{- 1} b_{2} s^{2} - b_{1} β_{1} s - T_{1}^{- 1} b_{1} β_{1}) \\ D_{P I D - P} (s) & = s^{5} + T_{1}^{- 1} s^{4} - (a_{1} - b_{2} K_{31} + T_{3}^{- 1} b_{1} K_{13}) s^{3} \\ - T_{1}^{- 1} (a_{1} + b_{1} K_{11} - b_{2} K_{31}) s^{2} \\ - (T_{1}^{- 1} b_{1} K_{12} + b_{1} K_{31} β_{1}) s - T_{1}^{- 1} b_{1} K_{31} β_{1} \end{aligned} \end{aligned}$ (B7)

The gains associated with the controllers (B5) taking the coincident real pole configuration into account for the closed-loop linearized CIP system is deduced from the denominator in (B6) as follows: (B8) $\begin{aligned} \begin{aligned} K_{11} & = - (1 / 5) α_{1} {β_{1}}^{- 1} b_{1}^{- 1} p^{4} - 2 b_{1}^{- 1} p^{2} - a_{1} b_{1}^{- 1} \\ K_{12} & = (24 / 25) b_{1}^{- 1} p^{3} \\ K_{13} & = \frac{b_{2} p^{4} + 50 b_{1} β_{1} p^{2} + 5 a_{1} b_{1} β_{1}}{25 b_{1}^{2} β_{1} p} \\ K_{31} & = - (1 / 5) {β_{1}}^{- 1} b_{1}^{- 1} p^{4} \\ T_{1} & = - (1 / 5) p^{- 1} \end{aligned} \end{aligned}$ (B8)

Stabilization of the cart-inverted-pendulum system using trivial state-feedback to output-feedback control conversion

Abstract

1. Introduction

2. Problem formulation