Mathematical and Computer Modelling of Dynamical Systems
Methods, Tools and Applications in Engineering and Related Sciences
Volume 26, 2020 - Issue 2
Original Articles

On the combination of kernel principal component analysis and neural networks for process indirect control

Pages 144-168 | Received 04 Mar 2019, Accepted 28 Dec 2019, Published online: 07 Jan 2020

ABSTRACT

A new adaptive kernel principal component analysis (KPCA) scheme for non-linear discrete system control is proposed. The proposed approach can be viewed as a new data pre-processing technique: the input vector of the neural network controller is pre-processed by the KPCA method, and the resulting reduced neural network controller is then applied in indirect adaptive control. The influence of this input pre-processing on the accuracy of the neural network controller is discussed through numerical examples, covering a single-input single-output non-linear discrete system with time-varying parameters and a multi-input multi-output system. It is concluded that the KPCA method yields a significant reduction in both the control error and the identification error. The lowest mean squared error and mean absolute error show that the KPCA neural network with the sigmoid kernel function performs best.

1. Introduction

We are concerned with adaptive control of non-linear discrete systems using neural networks. The indirect adaptive control structure is based on two neural network blocks, one identifying the dynamic behaviour of the system and one acting as the system controller [Citation1–Citation6].

However, the size of the neural network model or of the neural network controller can speed up or slow down the training phase. The problem of reducing the dimension of large neural networks has been widely addressed by different techniques [Citation7–Citation37].

The first step in a reduction method is feature selection (new features are selected from the original inputs) or feature extraction (new features are transformed from the original inputs). In modelling, all available indicators can be used, but correlated or irrelevant features can deteriorate the generalization performance of any model [Citation7–Citation18].

Many linear dimensionality reduction techniques have been proposed. For instance, Kohonen self-organizing feature maps provide a way of representing multidimensional data in much lower-dimensional spaces [Citation19]; curvilinear component analysis [Citation20] and curvilinear distance analysis [Citation21] have been proposed to reduce the original dimension of face images and of data for classification in medical imaging [Citation22]; and principal component analysis (PCA) has been widely used for reducing high dimensions in many applications [Citation16–Citation18,Citation23–Citation25].

PCA is a well-known method for feature extraction [Citation23,Citation24]. By calculating the eigenvectors of the covariance matrix of the original inputs, PCA linearly transforms a high-dimensional input vector into a new low-dimensional one whose components are uncorrelated. The basis function orders of PCA, as a typical approach, are the lowest in the sense of model dimension reduction [Citation16–Citation18,Citation23–Citation25].

PCA also appears in many other applications. For instance, in the study by Zhang et al. [Citation15], a hybrid modelling strategy consists of a decoupled non-linear radial basis function neural network model based on PCA and a linear autoregressive exogenous model. PCA reduces the cross-validation time required to identify optimal model hyper-parameters [Citation25]. In the study by Seerapu and Srinivas [Citation26], it was combined with linear discriminant analysis to improve the reduction. In the study by Peleato et al. [Citation27], the use of fluorescence data coupled with neural networks based on PCA for improved predictability of drinking water disinfection by-products was investigated. In the study by Qinshu et al. [Citation14], PCA for feature selection, together with a grid search and k-fold cross-validation approach for parameter optimization in the support vector machine, was developed. Finally, other linear dimensionality reduction techniques, such as multidimensional scaling and probabilistic PCA, have been applied to user authentication using keystroke dynamics [Citation28] and in other methods [Citation29].

However, PCA is a linear time/space separation method and cannot be directly applied to non-linear systems [Citation30]. Non-linear PCA has therefore been developed using different algorithms. Kernel principal component analysis (KPCA) is a non-linear PCA developed using the kernel method, which was originally introduced for the Support Vector Machine (SVM) and later generalized to many algorithms expressed in terms of dot products, such as PCA. Specifically, KPCA first maps the original inputs into a high-dimensional feature space using the kernel method and then performs PCA in that feature space. Linear PCA in the high-dimensional feature space corresponds to a non-linear PCA in the original input space. More recently, another linear transformation method called independent component analysis (ICA) has been developed. Instead of producing uncorrelated components, ICA attempts to achieve statistically independent components in the transformed vectors. ICA was originally developed for blind source separation and was later generalized to feature extraction [Citation7].

KPCA is used as an effective method for tackling the problem of non-linear data [Citation31]. In the study by Chakour et al. [Citation32], an adaptive KPCA algorithm is proposed for dynamic process monitoring, combining two existing algorithms: recursive weighted PCA and moving-window KPCA. Fault detection of non-linear systems using the KPCA method, extracting a reduced number of measurements from the training data, has also been studied [Citation33]. In the study by Xiao and He [Citation34], a neural-network-based fault diagnosis approach for analog circuits is developed, using maximal-class-separability-based KPCA as a preprocessor to reduce the dimensionality of candidate features, so as to obtain optimal features with maximal class separability as inputs to the neural networks. In the study by Reddy and Ravi [Citation36], a differential evolution (DE)-trained kernel principal component wavelet neural network (KPCWNN) and a DE-trained kernel binary quantile regression are proposed for classification; in the DE-KPCWNN technique, KPCA is applied to the input data to obtain kernel principal components, on which the WNN is employed.

In the study by Klevecka and Lelis [Citation37], a functional algorithm for preprocessing neural network input data, taking into account the specific aspects of teletraffic and the properties of neural networks, is created. Its practical application to forecasting telecommunication data sequences shows that data preprocessing decreases the learning time and increases the plausibility and accuracy of the forecasts.

In this paper, a neural-network-based indirect adaptive control scheme is used. The neural network relies on an adaptive learning rate and a reduced derivative of the activation function. Moreover, the weights of the neural network model and of the neural network controller are updated based on the identification error and the control error, respectively, and are used to generate the appropriate control.

On the one hand, in various studies [Citation1,Citation2,Citation5,Citation6,Citation15,Citation38,Citation39], the authors developed algorithms for adaptive indirect control without any preprocessing and did not take into account the high dimension of the neural network.

On the other hand, in the study by Errachdi and Benrejeb [Citation4], the authors developed an algorithm to accelerate the training phase in adaptive indirect control based on a neural network controller, using a variable learning rate and a Taylor expansion of the derivative of the activation function, but they did not address high dimensionality. That is why, in this paper, we propose a new algorithm that reduces the input vector of the neural controller in the control system based on KPCA. The proposed data preprocessing scheme decreases the learning time and increases the accuracy of the system control.

The present paper is organized as follows. After this introduction, Section 2 reviews the proposed KPCA method for system control and develops the proposed neural network controller based on the KPCA method. Section 3 details the proposed algorithm. In Section 4, examples of non-linear systems are presented to illustrate the efficiency of the proposed method. Section 5 concludes the paper.

2. The proposed KPCA neural network controller approach

On the basis of the input and output relations of a system, a discrete non-linear system can be expressed by a NARMA (Non-linear Autoregressive Moving Average) model [Citation4,Citation35] given by

$y(k+1) = f\big(y(k), \ldots, y(k-n_y), u(k), \ldots, u(k-n_u)\big)$ (1)

where $f(\cdot)$ is the non-linear mapping specified by the model, $y(k)$ and $u(k)$ are the output and the input of the system, respectively, $k$ is the discrete time, and $n_y$ and $n_u$ are the numbers of past output and input samples, respectively, required for prediction.

The aim of this paper is to find a control law $u(k)$ for the non-linear system given by Equation (1), based on the KPCA approach, such that the system output $y(k)$ tracks, where possible, the desired value $r(k)$.

The indirect control architecture is shown in Figure 1; the weights of the neural network model and of the neural network controller are trained by different errors, where $e(k)$ is the identification error, $\hat{e}_c(k)$ is the estimated tracking error and $e_c(k)$ is the tracking error [Citation4].

Figure 1. The architecture of indirect neural control.


The architecture shown in Figure 1 comprises two neural blocks. The weights of the neural model are adjusted by the identification error $e(k)$, whereas the weights of the neural controller are trained by the tracking error $e_c(k)$ [Citation4].

A multi-layer perceptron is used for both the neural model and the neural controller. Each block consists of three layers, and the sigmoid activation function $s(\cdot)$ is used for all neurons [Citation4].

2.1. The neural network model

The principle of the neural network model is given in Figure 2.

Figure 2. The principle of neural network model.


The output of the $j$th node of the hidden layer is described as follows:

$h_j = \sum_{i=1}^{n_1} w_{ji}\, x_i, \quad j = 1, 2, \ldots, n_2$ (2)

where n1 is the number of nodes of the input layer, n2 is the number of nodes of the hidden layer and wji is the hidden weight.

The input vector of the neural network model is

$x = [u(k), u(k-1), u(k-2), \ldots]^T$ (3)

where u(k) is the neural network controller output.

The output of the neural network model is given by the following equation:

$y_r(k+1) = \lambda\, s\Big(\sum_{j=1}^{n_2} w_{1j}\, s(h_j)\Big)$ (4)

where λ is a scaling coefficient and w1j is the output weight.

The compact form of the output is given by the following equation:

$y_r(k+1) = \lambda\, s(h_1) = \lambda\, s\big[w_1^T S(Wx)\big]$ (5)

with $x = [x_i]^T$, $i = 1, \ldots, n_1$; $W = [w_{ji}]$, $i = 1, \ldots, n_1$, $j = 1, \ldots, n_2$; $S(Wx) = [s(h_j)]^T$, $j = 1, \ldots, n_2$; $w_1 = [w_{1j}]^T$, $j = 1, \ldots, n_2$.

The identification error $e(k)$ is given by

$e(k) = y(k) - y_r(k)$ (6)

The cost function is given by the following equation:

$E = \frac{1}{2}\, e(k)^2$ (7)

The output weights are updated by the following equation:

$w_{1j}(k+1) = w_{1j}(k) + \Delta w_{1j}(k)$ (8)

where $\Delta w_{1j}$, $j = 1, \ldots, n_2$, is obtained by minimizing the cost function:

$\Delta w_{1j} = -\eta(k)\, \dfrac{\partial E(k)}{\partial w_{1j}} = -\eta(k)\, \dfrac{\partial E(k)}{\partial e(k)}\, \dfrac{\partial e(k)}{\partial h_1}\, \dfrac{\partial h_1}{\partial w_{1j}} = \lambda\, \eta(k)\, e(k)\, s'(h_1)\, S(Wx)$ (9)

$\eta(k)$ is the variable learning rate for the weights of the neural network model, $0 \leq \eta(k) \leq 1$, given by

$\eta(k) = \dfrac{1}{\lambda^2\, s'^2(h_1)\big[S^T(Wx)\, S(Wx) + w_{1j}^T\, S'(Wx)\, S'(Wx)\, w_{1j}\; x^T x\big]}$ (10)

$s'(h_1)$ is the derivative of $s(h_1)$, defined as follows:

$s'(h_1) = s(h_1)\big(1 - s(h_1)\big) = \dfrac{e^{h_1}}{(1 + e^{h_1})^2} \approx \dfrac{1}{4} + \dfrac{1}{2}\, h_1 + O(h_1^3)$ (11)

The hidden weights are updated by the following equation:

$w_{ji}(k+1) = w_{ji}(k) + \Delta w_{ji}(k)$ (12)

where $\Delta w_{ji}$ is given by the following equation:

$\Delta w_{ji} = -\eta(k)\, \dfrac{\partial E(k)}{\partial w_{ji}} = -\eta(k)\, \dfrac{\partial E(k)}{\partial e(k)}\, \dfrac{\partial e(k)}{\partial h_1}\, \dfrac{\partial h_1}{\partial h_j}\, \dfrac{\partial h_j}{\partial w_{ji}} = \lambda\, \eta(k)\, s'(h_1)\, S'(Wx)\, w_{1j}\, x^T\, e(k)$ (13)

with $S'(Wx) = \mathrm{diag}\,[s'(h_j)]$, $j = 1, \ldots, n_2$.

For the stability of the neural network model, a Lyapunov function is detailed. Indeed, let us define the discrete Lyapunov function

$V(k) = E(k) = \frac{1}{2}\, e(k)^2$ (14)

where $e(k)$ is the identification error given by Equation (6). The change in the Lyapunov function is obtained by

$\Delta V(k) = V(k+1) - V(k) = \frac{1}{2}\big(e(k+1)^2 - e(k)^2\big)$ (15)

The identification error difference can be represented by

$\Delta e(k) = e(k+1) - e(k) \approx -\eta(k)\, \Big(\dfrac{\partial y_r(k)}{\partial w_i(k)}\Big)^T \dfrac{\partial y_r(k)}{\partial w_i(k)}\, e(k)$ (16)

where $w_i(k)$ denotes the synaptic weights of the neural network identifier ($w_{1j}(k)$ and $w_{ji}(k)$). Using Equation (16), the identification error becomes

$e(k+1) = e(k) - \eta(k)\, \xi(k)\, e(k)$ (17)

with

$\xi(k) = \lambda^2\, s'^2(h_1)\big[S^T(Wx)\, S(Wx) + w_{1j}^T\, S'(Wx)\, S'(Wx)\, w_{1j}\; x^T x\big]$ (18)

From Equations (17) and (18), the convergence of the identification error $e(k)$, i.e. $\lim_{k\to+\infty} e(k) = 0$, is guaranteed if $0 < \eta(k) < 2\,\xi^{-1}(k)$, which ensures $\Delta V(k) < 0$ with $V(k) > 0$ from Equation (14).

A suitable online algorithm is obtained by choosing the variable learning rate $\eta(k) = \xi^{-1}(k)$.
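To make the model update concrete, the following Python sketch implements Equations (2)-(13) with the variable learning rate $\eta(k) = \xi^{-1}(k)$. It is a minimal illustration assuming the equations as reconstructed above; the class name, dimensions and weight initialization are ours, not the authors'.

    import numpy as np

    def sigmoid(h):
        return 1.0 / (1.0 + np.exp(-h))

    class NNModel:
        """One-hidden-layer model of Section 2.1: y_r = lam * s(w1^T S(W x))."""

        def __init__(self, n1, n2, lam=1.0, seed=0):
            rng = np.random.default_rng(seed)
            self.W = 0.1 * rng.standard_normal((n2, n1))   # hidden weights w_ji
            self.w1 = 0.1 * rng.standard_normal(n2)        # output weights w_1j
            self.lam = lam

        def forward(self, x):
            Sh = sigmoid(self.W @ x)                # S(Wx), Eq. (2)
            h1 = self.w1 @ Sh                       # hidden-to-output activation
            return self.lam * sigmoid(h1), Sh, h1   # y_r, Eqs. (4)-(5)

        def update(self, x, y):
            """One online step; returns the identification error e(k), Eq. (6)."""
            yr, Sh, h1 = self.forward(x)
            e = y - yr
            ds1 = sigmoid(h1) * (1.0 - sigmoid(h1))   # exact s'(h1) of Eq. (11)
            w1dS = self.w1 * Sh * (1.0 - Sh)          # S'(Wx) w1, diagonal form
            # xi(k) as in Eq. (18); eta(k) = 1/xi(k) is the variable rate, Eq. (10)
            xi = self.lam**2 * ds1**2 * (Sh @ Sh + (w1dS @ w1dS) * (x @ x))
            eta = 1.0 / max(xi, 1e-8)                 # guard against division by zero
            self.w1 += self.lam * eta * e * ds1 * Sh                 # Eq. (9)
            self.W += self.lam * eta * e * ds1 * np.outer(w1dS, x)   # Eq. (13)
            return e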

2.2. The KPCA neural network controller

The PCA technique is a lower-dimensional projection method that can be used in multivariate data mining [Citation25,Citation30–Citation32,Citation40]. The main idea behind PCA is to represent multidimensional data with a smaller number of variables while retaining the main features of the data. It is inevitable that some features are lost by reducing the dimensionality; PCA seeks the projection into a lower-dimensional space that retains as much of the variability of the data as possible [Citation4,Citation25,Citation30–Citation32,Citation40].

However, PCA is a linear technique and cannot capture the non-linear structure of a data set. For this reason, a non-linear generalization using the kernel method has been proposed, which computes the principal components of the data set mapped non-linearly into some high-dimensional feature space $\zeta$. Because the sample data are only implicitly mapped from the input space to the higher-dimensional feature space $\zeta$, KPCA is implemented efficiently by virtue of the kernel trick and can be solved as an eigenvalue problem of its kernel matrix.

In this section, we propose to reduce the input vector of the neural network controller of the adaptive indirect control structure. The new architecture of the adaptive indirect KPCA neural network control is given in Figure 3.

Figure 3. The new architecture of indirect neural control.


We recall that the input vector of the neural network controller is

$z = [r(k), r(k-1), r(k-2), \ldots]^T$ (19)

where r(k) is the desired value.

For the input data $\{z_k\}_{k=1}^{l}$, let $\phi$ denote the non-linear map into $\zeta$. The covariance matrix $C$ of the mapped data is defined as

$C = \dfrac{1}{l} \sum_{j=1}^{l} \phi(z_j)\, \phi(z_j)^T$ (20)

Its eigenvalues $\lambda_k$ and eigenvectors $p_k$ satisfy

$C p_k = \lambda_k\, p_k, \quad k = 1, \ldots, l$ (21)

From Equation (20), Equation (21) may be written as

$\dfrac{1}{l} \sum_{j=1}^{l} \phi(z_j)\big(\phi(z_j)^T p_k\big) = \lambda_k\, p_k$ (22)

$p_k$ can be rewritten as

$p_k = \sum_{j=1}^{l} \alpha_j\, \phi(z_j)$ (23)

with $\alpha_j$, $j = 1, \ldots, l$, the expansion coefficients. Equation (22) can then be rewritten as

$\dfrac{1}{l} \sum_{j=1}^{l} \phi(z_j)\Big(\phi(z_j)^T \sum_{i=1}^{l} \alpha_i\, \phi(z_i)\Big) = \lambda_k \sum_{i=1}^{l} \alpha_i\, \phi(z_i)$ (24)

The kernel function $k_r(z_i, z_j)$ is defined as

$k_r(z_i, z_j) = \phi(z_i)^T \phi(z_j)$ (25)

Multiplying Equation (24) on the left by $\phi(z_d)^T$ gives

$\dfrac{1}{l} \sum_{j=1}^{l} \phi(z_d)^T \phi(z_j)\Big(\phi(z_j)^T \sum_{i=1}^{l} \alpha_i\, \phi(z_i)\Big) = \lambda_k \sum_{i=1}^{l} \alpha_i\, \phi(z_d)^T \phi(z_i)$ (26)

Using the kernel function (25), Equation (26) becomes

$\dfrac{1}{l} \sum_{i=1}^{l} k_r(z_d, z_i) \sum_{j=1}^{l} \alpha_j\, k_r(z_i, z_j) = \lambda_k \sum_{i=1}^{l} \alpha_i\, k_r(z_d, z_i)$ (27)

with $k_r(z_d, z_i) = \phi(z_d)^T \phi(z_i)$.

The resulting kernel principal components can be calculated using

$x_r(k) = \phi(z)^T p_k = \sum_{i=1}^{l} \alpha_i\, k_r(z, z_i)$ (28)

The reduced space of the signal given by Equation (28) constitutes the input vector of the neural network controller.

This dimensionality reduction is employed to reduce the dimension of the feature vectors before they are fed as input:

$x_1 = [x_r(k), x_r(k-1), x_r(k-2), \ldots]^T$ (29)

The primary purpose of data pre-processing is to modify the input variables so that they better match the predicted output. The main purpose of neural network data transformation is to modify the distribution of the network input parameters without losing much information.

Using the reduced input vector $x_1$, the output of the $j$th node of the hidden layer is described as follows:

$h_{cj} = \sum_{i=1}^{n_3} v_{ji}\, x_{1i}, \quad j = 1, \ldots, n_4$ (30)

where $n_3$ is the number of nodes of the input layer and $v_{ji}$ is the hidden weight.

Similarly, the output of the neural controller is given by the following equation:

$u(k) = \lambda_c\, s\Big(\sum_{j=1}^{n_4} v_{1j}\, s(h_{cj})\Big) = \lambda_c\, s\Big(\sum_{j=1}^{n_4} v_{1j}\, s\Big(\sum_{i=1}^{n_3} v_{ji}\, x_{1i}\Big)\Big)$ (31)

where n4 is the number of nodes of the hidden layer, λc is a scaling coefficient and v1j is the output weight.

The compact form of the control input to the system is given by the following equation:

$u(k) = \lambda_c\, s(h_{c1}) = \lambda_c\, s\big[v_1^T S(Vx_1)\big]$ (32)

with $x_1 = [x_{1i}]^T$, $i = 1, \ldots, n_3$; $V = [v_{ji}]$, $i = 1, \ldots, n_3$, $j = 1, \ldots, n_4$; $S(Vx_1) = [s(h_{cj})]^T$, $j = 1, \ldots, n_4$; $v_1 = [v_{1j}]^T$, $j = 1, \ldots, n_4$.

The tracking error $e_c(k)$ is given by the following equation:

$e_c(k) = y(k) - r(k)$ (33)

where r(k) is the desired output.

The weights of the neural controller are updated by minimizing the cost function defined as follows:

$E_c = \frac{1}{2}\, e_c(k)^2$ (34)

The output weights are updated by

$v_{1j}(k+1) = v_{1j}(k) + \Delta v_{1j}(k)$ (35)

with $\Delta v_{1j}$, $j = 1, \ldots, n_4$, the incremental change of the output weights:

$\Delta v_{1j} = -\eta_c(k)\, \dfrac{\partial E_c(k)}{\partial v_{1j}} = -\eta_c(k)\, \dfrac{\partial E_c}{\partial e_c(k)}\, \dfrac{\partial e_c(k)}{\partial y(k)}\, \dfrac{\partial y_r(k)}{\partial h_1}\, \dfrac{\partial h_1}{\partial s(h_j)}\, \dfrac{\partial s(h_j)}{\partial h_j}\, \dfrac{\partial h_j}{\partial u(k)}\, \dfrac{\partial u(k)}{\partial h_{c1}}\, \dfrac{\partial h_{c1}}{\partial v_{1j}} = -\eta_c(k)\, \lambda_c\, e_c(k)\, s'(h_1)\, w_{1j}\, S'(Wx)\, w_{ji}\, s'(h_{c1})\, S(Vx_1)$ (36)

where $\eta_c(k)$ is the variable learning rate for the weights of the neural network controller, $0 \leq \eta_c(k) \leq 1$, given by

$\eta_c(k) = \dfrac{1}{\lambda_c^2\, s'^2(h_{c1})\, s'(h_1)\, w_{1j}\, w_{ji}\, S'(Wx)\big[S^T(Vx_1)\, S(Vx_1) + v_{1j}^T\, S'(Vx_1)\, S'(Vx_1)\, v_{1j}\; x_1^T x_1\big]}$ (37)

Concerning the hidden weights, they are updated by

$v_{ji}(k+1) = v_{ji}(k) + \Delta v_{ji}(k)$ (38)

where $\Delta v_{ji}$ is given by

$\Delta v_{ji} = -\eta_c(k)\, \dfrac{\partial E_c(k)}{\partial v_{ji}} = -\eta_c(k)\, \dfrac{\partial E_c}{\partial e_c}\, \dfrac{\partial e_c}{\partial y}\, \dfrac{\partial y_r}{\partial h_1}\, \dfrac{\partial h_1}{\partial s(h_j)}\, \dfrac{\partial s(h_j)}{\partial h_j}\, \dfrac{\partial h_j}{\partial u}\, \dfrac{\partial u}{\partial h_{c1}}\, \dfrac{\partial h_{c1}}{\partial h_{cj}}\, \dfrac{\partial h_{cj}}{\partial v_{ji}} = -\eta_c(k)\, \lambda_c\, e_c(k)\, s'(h_1)\, w_{1j}\, S'(Wx)\, w_{ji}\, s'(h_{c1})\, v_{1j}\, S'(Vx_1)\, x_1^T$ (39)

with $S'(Vx_1) = \mathrm{diag}\,[s'(h_{cj})]$, $j = 1, \ldots, n_4$.

Let $\Psi = [\phi(z_1), \ldots, \phi(z_l)]$, $1_l = (1/l)_{l \times l}$ and $\tilde{\Gamma} = \Psi^T \Psi$. The centred kernel matrix $\Gamma$ is defined as

$\Gamma = \tilde{\Gamma} - 1_l \tilde{\Gamma} - \tilde{\Gamma} 1_l + 1_l \tilde{\Gamma} 1_l$ (40)

with $\tilde{\Gamma}_{ij} = \phi(z_i)^T \phi(z_j) = k_r(z_i, z_j)$. In this paper, different kernel functions are used; they are defined in Table 1.

Table 1. The usual kernel functions.
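The entries of Table 1 are not reproduced here. As a hedged illustration, the following Python definitions list kernels commonly used with KPCA, including the sigmoid kernel retained later in the paper; the parameter values are placeholders of our choosing, not the paper's.

    import numpy as np

    # Common KPCA kernels k_r(z_i, z_j); d, c, sig, a, b are illustrative parameters.
    kernels = {
        "linear":     lambda zi, zj: float(zi @ zj),
        "polynomial": lambda zi, zj, d=2, c=1.0: float((zi @ zj + c) ** d),
        "rbf":        lambda zi, zj, sig=1.0: float(np.exp(-np.sum((zi - zj) ** 2) / (2.0 * sig**2))),
        "sigmoid":    lambda zi, zj, a=1.0, b=0.0: float(np.tanh(a * (zi @ zj) + b)),
    }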

The retained principal components are the first $s$ eigenvectors, associated with the highest eigenvalues; they are often sufficient to describe the structure of the data. The number $s$ satisfies the Inertia Percentage Criterion (IPC) [Citation25] given by

$s = \arg(\mathrm{IPC} \geq 99)$ (41)

with

$\mathrm{IPC} = 100\, \dfrac{\sum_{i=1}^{s} \lambda_i}{\sum_{i=1}^{l} \lambda_i}$ (42)
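A compact sketch of the KPCA computation of Equations (20)-(28) and (40)-(42) is given below: build the kernel matrix, centre it as in Equation (40), keep the $s$ leading components according to the IPC, and project a new input by Equation (28). The unit-norm scaling of the coefficients $\alpha$ and the omission of centring for new points are simplifications of ours.

    import numpy as np

    def kpca_fit(Z, kernel, ipc_threshold=99.0):
        """Z: (l, m) array of training inputs; returns scaled coefficients alpha (l, s)."""
        l = Z.shape[0]
        G = np.array([[kernel(zi, zj) for zj in Z] for zi in Z])  # Gamma~, Eq. (25)
        one_l = np.full((l, l), 1.0 / l)
        Gc = G - one_l @ G - G @ one_l + one_l @ G @ one_l        # centring, Eq. (40)
        lam, A = np.linalg.eigh(Gc)
        lam, A = lam[::-1], A[:, ::-1]                    # decreasing eigenvalue order
        lam = np.clip(lam, 0.0, None)
        ipc = 100.0 * np.cumsum(lam) / np.sum(lam)        # Eq. (42)
        s = int(np.searchsorted(ipc, ipc_threshold)) + 1  # Eq. (41)
        # scale alpha_k so that the feature-space eigenvector p_k has unit norm
        return A[:, :s] / np.sqrt(np.maximum(lam[:s], 1e-12))

    def kpca_project(z, Z, alpha, kernel):
        """Components x_r for a new input z, Eq. (28)."""
        kvec = np.array([kernel(z, zi) for zi in Z])
        return kvec @ alpha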

We have thus developed a neural network controller based on a reduced input vector and a variable learning rate; this approach increases the training speed.

For the stability of the neural network controller, a Lyapunov function is again detailed. Indeed, let us define the discrete Lyapunov function

$V_c(k) = E_c(k) = \frac{1}{2}\, e_c(k)^2$ (43)

where $e_c(k)$ is the control error. The change in the Lyapunov function is obtained by

$\Delta V_c(k) = V_c(k+1) - V_c(k) = \frac{1}{2}\big(e_c(k+1)^2 - e_c(k)^2\big)$ (44)

The control error difference can be represented by

$\Delta e_c(k) = e_c(k+1) - e_c(k) \approx -\eta_c(k)\, \Big(\dfrac{\partial e_c(k)}{\partial v_c(k)}\Big)^T \dfrac{\partial y(k)}{\partial u_c(k)}\, \dfrac{\partial u_c(k)}{\partial v_c(k)}\, e_c(k)$ (45)

where $v_c(k)$ denotes the synaptic weights of the neural network controller ($v_{1j}(k)$ and $v_{ji}(k)$). Using Equation (45), the control error becomes

$e_c(k+1) = e_c(k) - \eta_c(k)\, \xi_c(k)\, e_c(k)$ (46)

with

$\xi_c(k) = \lambda_c^2\, s'^2(h_{c1})\, s'(h_1)\, w_{1j}\, w_{ji}\, S'(Wx)\big[S^T(Vx_1)\, S(Vx_1) + v_{1j}^T\, S'(Vx_1)\, S'(Vx_1)\, v_{1j}\; x_1^T x_1\big]$ (47)

From Equations (46) and (47), the convergence of the control error $e_c(k)$, i.e. $\lim_{k\to+\infty} e_c(k) = 0$, is guaranteed if $0 < \eta_c(k) < 2\,\xi_c^{-1}(k)$, which ensures $\Delta V_c(k) < 0$ with $V_c(k) > 0$ from Equation (43).

A suitable online algorithm for real-time applications is obtained by choosing the variable learning rate $\eta_c(k) = \xi_c^{-1}(k)$.

3. The proposed algorithm

In this section, a summary of the proposed online KPCA neural network controller algorithm is presented.

Offline phase

  1. Initialize the neural network parameters ($v_{1j}$, $v_{ji}$, $w_{1j}$, $w_{ji}$) using $M$ observations ($M \leq N$),

  2. Determine the matrix $C$: centre the data and perform the eigenvalue decomposition,

  3. Determine the orthogonal eigenvectors and the eigenvalues of the covariance matrix,

  4. Order the eigenvectors in decreasing order of the corresponding eigenvalues,

  5. Choose $x_r(k)$ satisfying Equation (28), using the $s$ retained principal components given by Equations (41) and (42).

Online phase

  1. At time instant $(k+1)$, a new data pair $(u(k+1), y(k+1))$ is available. Using the obtained input vector $x_1$, if the condition $|e(k+1)| < \varepsilon_1$, where $\varepsilon_1 > 0$ is a given small constant, is satisfied, then the neural network model given by Equation (5) approximates the behaviour of the system sufficiently well.

  2. If the condition $|e_c(k+1)| < \varepsilon_2$, where $\varepsilon_2 > 0$ is a given small constant, is satisfied, then the reduced neural network controller provides a satisfactory control law $u(k)$.

  3. If $|e(k+1)| < \varepsilon_1$ is not satisfied, the synaptic weights of the neural network model are updated using Equations (8) and (12),

  4. If $|e_c(k+1)| < \varepsilon_2$ is not satisfied, the synaptic weights of the neural network controller are updated using Equations (35) and (38),

  5. End.
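The following Python skeleton shows how the two phases fit together at run time. The model, controller, plant and KPCA projection are hypothetical stubs (placeholders of ours), so only the control flow of steps 1-5 is illustrated.

    import numpy as np

    # Hypothetical stubs standing in for the blocks of Sections 2.1 and 2.2
    model_step = lambda x1, y: 0.0     # one model update; returns e(k+1)
    ctrl_output = lambda x1: 0.0       # control law u(k), Eq. (32)
    plant_step = lambda u, k: 0.0      # system response y(k+1)
    project = lambda z: z              # offline-fitted KPCA reduction, Eq. (28)
    update_model = lambda: None        # Eqs. (8) and (12)
    update_controller = lambda: None   # Eqs. (35) and (38)

    eps1 = eps2 = 1e-2
    N = 100
    r = 0.45 * np.ones(N)              # desired value (placeholder)
    y = 0.0
    for k in range(2, N - 1):
        z = np.array([r[k], r[k - 1], r[k - 2]])  # raw controller input, Eq. (19)
        x1 = project(z)                           # reduced input x1, Eq. (29)
        u = ctrl_output(x1)
        y = plant_step(u, k)                      # new data (u(k+1), y(k+1))
        e = model_step(x1, y)                     # identification error
        if abs(e) >= eps1:                        # step 3: model update needed
            update_model()
        if abs(y - r[k]) >= eps2:                 # step 4: controller update needed
            update_controller()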

4. Simulation results

In this section, two non-linear discrete systems are used: the first is a single-input single-output non-linear time-varying system and the second is a multi-input multi-output (MIMO) system.

4.1. Example of time-varying system

The time-varying non-linear system is described by the input–output model in the following equation [Citation41]:

$y(k+1) = \dfrac{y(k)\, y(k-1)\, y(k-2)\, u(k-1)\big(y(k-2) - 1\big) + u(k)}{a_0(k) + a_1(k)\, y^2(k-1) + a_2(k)\, y^2(k-2)}$ (48)

where $y(k)$ and $u(k)$ are, respectively, the output and the input of the time-varying non-linear system at instant $k$; $a_0(k)$, $a_1(k)$ and $a_2(k)$ are given by

$a_0(k) = 1, \quad a_1(k) = 1 + 0.2\cos(k), \quad a_2(k) = 1 + 0.2\sin(k)$ (49)
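For reference, a direct transcription of the benchmark plant of Equations (48) and (49), as reconstructed above, reads:

    import numpy as np

    def plant_step(y_hist, u_hist, k):
        """One step of Eqs. (48)-(49).
        y_hist = (y(k), y(k-1), y(k-2)), u_hist = (u(k), u(k-1))."""
        a0 = 1.0
        a1 = 1.0 + 0.2 * np.cos(k)
        a2 = 1.0 + 0.2 * np.sin(k)
        num = y_hist[0] * y_hist[1] * y_hist[2] * u_hist[1] * (y_hist[2] - 1.0) + u_hist[0]
        den = a0 + a1 * y_hist[1] ** 2 + a2 * y_hist[2] ** 2
        return num / den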

The trajectories of $a_1(k)$ and $a_2(k)$ are given in Figure 4.

Figure 4. a1(k) and a2(k) trajectories.


In this section, in order to examine the effectiveness of the proposed dimensionality reduction algorithm, different performance criteria are used.

Indeed, the mean squared identification error (MSE$_e$) and the mean absolute identification error (MAE$_e$) are, respectively, given by

$\mathrm{MSE}_e = \dfrac{1}{N} \sum_{k=1}^{N} \big(y(k) - y_r(k)\big)^2$ (50)

$\mathrm{MAE}_e = \dfrac{1}{N} \sum_{k=1}^{N} \big|y(k) - y_r(k)\big|$ (51)

where $y(k)$ is the time-varying system output, $y_r(k)$ is the neural network model output and the number of observations $N$ is 100.

The mean squared tracking error (MSE$_{e_c}$) and the mean absolute tracking error (MAE$_{e_c}$) are, respectively, given by

$\mathrm{MSE}_{e_c} = \dfrac{1}{N} \sum_{k=1}^{N} \big(y(k) - r(k)\big)^2$ (52)

$\mathrm{MAE}_{e_c} = \dfrac{1}{N} \sum_{k=1}^{N} \big|y(k) - r(k)\big|$ (53)

where $r(k)$ is the desired value.
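These four criteria reduce to two helper functions, a straightforward transcription of Equations (50)-(53):

    import numpy as np

    def mse(y, target):
        """Eqs. (50) and (52): mean squared error over the N observations."""
        y, target = np.asarray(y), np.asarray(target)
        return float(np.mean((y - target) ** 2))

    def mae(y, target):
        """Eqs. (51) and (53): mean absolute error over the N observations."""
        y, target = np.asarray(y), np.asarray(target)
        return float(np.mean(np.abs(y - target)))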

In this section, we examine the effectiveness of the proposed dimensionality reduction of the neural network controller input vector in the adaptive indirect control system.

In the offline phase, a reduced number of observations ($M = 3$) is used to initialize the neural network parameters ($w_{1j}$, $w_{ji}$, $v_{1j}$, $v_{ji}$) and to compute the KPCA quantities: the matrix $C$, its eigenvalues and eigenvectors, and finally the reduced input vector $x_r(k)$ given by Equation (28), based on the $s$ retained principal components given by Equations (41) and (42).

In the online phase, at instant $(k+1)$, the input vector of the neural network controller is $x_1 = [x_r(k), x_r(k-1), x_r(k-2), x_r(k-3), x_r(k-4)]^T$.

In this case, the neural network model and the pre-processing neural network controller each consist of a single input, one hidden layer with 8 nodes and a single output node, with variable learning rates $\eta(k)$ for the model and $\eta_c(k)$ for the controller. The scaling coefficients are $\lambda = \lambda_c = 1$ and $\varepsilon_1 = \varepsilon_2 = 10^{-2}$.

To select a suitable kernel function, the kernels defined in Table 1 are compared; the sigmoid kernel gives the lowest MSE$_e$, indicating that it is the most reliable choice. The comparison results are given in Table 2.

Table 2. Comparison results of the kernel functions on the identification error.

As a baseline, the features are fed directly to a multilayer perceptron (MLP) neural network without any KPCA preprocessing, and the online MLP neural network model output and the plant output are compared. The input vector of the MLP neural network is $[r(k), r(k-1), r(k-2), r(k-3), r(k-4), r(k-5)]^T$, with one hidden layer of 23 nodes and a variable learning rate. An excellent concordance between the plant output and the desired value is observed, with a mean squared error equal to $6.9269 \times 10^{-7}$.

The output of the reduced online MLP neural network controller and the desired values are presented in Figure 5. In this case, the KPCA method is combined with the multilayer perceptron neural network: the KPCA technique is used as a preprocessing method to reduce the feature dimension, and the obtained reduced vector is fed to the online multilayer perceptron neural network, with one hidden layer and variable learning rates. A concordance between the desired values and the plant output is noticed in Figure 5. To assess the efficiency of this combination further, several kernel functions are tested, and the results are presented in Table 2.

As defined in Table 1, we use the sigmoid function as the kernel function in the KPCA technique; the tracking control aim is to follow the reference signal as closely as possible using the proposed pre-processing neural network controller.

In this simulation, the desired value $r(k)$ is given by

$r(k) = \begin{cases} 0.45 & \text{for } k \leq 25 \\ 0.20 & \text{for } 26 \leq k \leq 50 \\ 0.45 & \text{for } 51 \leq k \leq 75 \\ 0.20 & \text{for } k > 75 \end{cases}$ (54)

We examine the influence of the dimensionality reduction of the neural network controller input vector on the identification error in Table 3 and on the control error in Table 4.

Table 3. The influence of the dimensionality reduction on the identification error.

Table 4. The influence of the dimensionality reduction on the control error.

From Tables 3 and 4, we observe that, using KPCA as a pre-processing phase to reduce the input vector of the neural network controller, the KPCA neural network controller has the smallest performance criteria for both the identification error $e(k)$ and the control error $e_c(k)$. These results are shown in Figures 5, 6 and 7.

Indeed, Figure 5 presents the pre-processing control system output and the desired values. In this case, the KPCA method is combined with a multilayer perceptron neural network controller.

The KPCA technique is used as a preprocessing method to reduce the feature dimension, and the obtained reduced vector is fed to the neural network controller. A concordance between the desired values and the control system output is noticed in Figure 5, although the parameters vary over time. Figures 6 and 7 present, respectively, the control law and the control error.

These figures reveal that the NN controller using KPCA as a pre-processing technique has smaller errors than the controller without pre-processing.

Figure 5. The pre-processing control system output and the desired values.


Figure 6. The control law.


Figure 7. The control error.


Another desired value r(k), given by Equation (55), is used to examine the effectiveness of the proposed algorithm of the dimensionality reduction of the neural network controller input vector in the adaptive indirect control system for the time-varying non-linear system.

Indeed, the neural network model and the neural network controller each consist of a single input, one hidden layer with 23 nodes and a single output node. The scaling coefficients are $\lambda = \lambda_c = 1$ and $\varepsilon_1 = \varepsilon_2 = 10^{-2}$.

In this simulation, the desired value $r(k)$ is given by

$r(k) = \begin{cases} 0.45 & \text{for } k \leq 25 \\ 0.20 & \text{for } 26 \leq k \leq 30 \\ 0.40 & \text{for } 31 \leq k \leq 35 \\ 0.30 & \text{for } 36 \leq k \leq 80 \\ 0.20 & \text{for } k > 80 \end{cases}$ (55)

Figure 8 presents the pre-processing control system output and the desired values. In this case, the KPCA method is combined with a multilayer perceptron neural network controller. A concordance between the desired values and the control system output is noticed, despite the time-varying parameters.

Figures 9 and 10 present, respectively, the control law and the control error. These figures reveal that the NN controller using KPCA as a pre-processing technique has smaller errors than the controller without pre-processing.

Figure 8. The pre-processing control system output and the desired values.


Figure 9. The control law.


Figure 10. The control error.


Tables 5 and 6 present the influence of the dimensionality reduction on the identification error and on the control error.

Table 5. The influence of the dimensionality reduction on the identification error.

Table 6. The influence of the dimensionality reduction on the control error.

From Tables 5 and 6, we observe that, by using KPCA as a pre-processing phase to reduce the input vector of the neural network controller, the KPCA neural network controller has the smallest performance criteria for both the identification error $e(k)$ and the control error $e_c(k)$. These results are shown in Figures 8, 9 and 10.

4.2. Effect of disturbances

An additive noise $v(k)$ is injected into the output of the time-varying non-linear system given by Equation (48), in order to test the effectiveness of the pre-processing neural network controller.

To measure the correspondence between the system output and the desired value, a signal-to-noise ratio (SNR) is used, given by the following equation:

$\mathrm{SNR} = \dfrac{\sum_{k=0}^{N} \big(y(k) - \bar{y}\big)^2}{\sum_{k=0}^{N} \big(v(k) - \bar{v}\big)^2}$ (56)

where $v(k)$ is a measurement noise with symmetric bound $\delta$, $v(k) \in [-\delta, \delta]$, and $\bar{y}$ and $\bar{v}$ are the output average value and the noise average value, respectively. In this paper, the SNR is taken as 5%.
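A possible transcription of the disturbance test is sketched below; the uniform law for $v(k)$ and the squared deviations in the empirical SNR are assumptions of this reconstruction.

    import numpy as np

    rng = np.random.default_rng(0)

    def bounded_noise(n, delta):
        """Measurement noise v(k) in [-delta, delta] (uniform law assumed)."""
        return rng.uniform(-delta, delta, size=n)

    def snr(y, v):
        """Empirical ratio of Eq. (56), squared deviations assumed."""
        y, v = np.asarray(y), np.asarray(v)
        return float(np.sum((y - y.mean()) ** 2) / np.sum((v - v.mean()) ** 2))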

Using the first desired value $r(k)$, the sensitivity of the proposed pre-processing neural network controller is examined in Tables 7 and 8, respectively.

Table 7. The influence of the dimensionality reduction on the identification error.

Table 8. The influence of the dimensionality reduction on the control error.

From these tables, we observe that, by using KPCA as a pre-processing phase to reduce the input vector of the neural network controller, the KPCA neural network controller has the smallest performance criteria for both the identification error and the control error.

Using the second desired value, the sensitivity of the proposed pre-processing neural network controller is examined in Tables 9 and 10, respectively.

Table 9. The influence of the dimensionality reduction on the identification error.

Table 10. The influence of the dimensionality reduction on the control error.

According to the obtained simulation results, despite the presence of disturbances in the system output and the time-varying parameters, the lowest MSE$_{e_c}$, MAE$_{e_c}$ and $\max(e_c)$ are obtained using the combination of the neural network controller and the KPCA technique.

4.3. Example of multi-input multi-output system

In this section, in order to examine the effectiveness of the proposed dimensionality reduction algorithm, a multi-input multi-output (MIMO) non-linear system, given by the following equations, is used:

$y_1(k+1) = \dfrac{y_1(k)}{1 + y_2^2(k)} + u_1(k), \quad y_2(k+1) = \dfrac{y_1(k)\, y_2(k)}{1 + y_2^2(k)} + u_2(k)$ (57)

where $y_i(k)$ and $u_i(k)$, $i = 1, 2$, are, respectively, the outputs and the inputs of the MIMO non-linear system at instant $k$; $r_1(k)$ and $r_2(k)$ are the reference signals given by

$r_1(k) = \sin\!\Big(\dfrac{2 k \pi}{100}\Big), \quad r_2(k) = \begin{cases} 0.8 & \text{for } k \leq 50 \\ 0.4 & \text{for } 51 \leq k \leq 100 \\ 0.8 & \text{for } 101 \leq k \leq 150 \\ 0.4 & \text{for } 151 \leq k \leq 200 \end{cases}$ (58)
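A direct transcription of the MIMO benchmark of Equations (57) and (58), as reconstructed above:

    import numpy as np

    def mimo_step(y1, y2, u1, u2):
        """One step of Eq. (57)."""
        d = 1.0 + y2 ** 2
        return y1 / d + u1, (y1 * y2) / d + u2

    def r1(k):
        """Sinusoidal reference r1(k), Eq. (58)."""
        return np.sin(2.0 * np.pi * k / 100.0)

    def r2(k):
        """Piecewise-constant reference r2(k), Eq. (58)."""
        if k <= 50:
            return 0.8
        if k <= 100:
            return 0.4
        if k <= 150:
            return 0.8
        return 0.4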

The control system outputs, the desired values and the control errors are presented in Figure 11, while Figure 12 presents the control laws $u_1$ and $u_2$. These figures reveal that the NN controller combined with KPCA as a pre-processing technique gives an excellent concordance between the system outputs and the desired outputs, with small control errors.

Figure 11. The control system output, the desired values and the control error.


Figure 12. The control law u1 and u2 trajectories.


In this case, the neural network model and the pre-processing neural network controller each consist of a single input, one hidden layer with 28 nodes and two output nodes, with variable learning rates $\eta_i(k)$ for the model and $\eta_{ic}(k)$ for the controller. The scaling coefficients are $\lambda_i = \lambda_{ci} = 1$ and $\varepsilon_i = 10^{-2}$, $i = 1, 2$.

The input vector of the neural network controller is $x_1 = [x_{r1}(k), x_{r1}(k-1), x_{r1}(k-2), x_{r2}(k), x_{r2}(k-1), x_{r2}(k-2)]^T$. The influence of the dimensionality reduction on the model error and on the control error is shown in Tables 11 and 12.

Table 11. The influence of the dimensionality reduction on the model error.

Table 12. The influence of the dimensionality reduction on the control error.

5. Conclusion

In this paper, an online combination of a neural network controller and the KPCA method is proposed and applied successfully to indirect adaptive control. Different kernel functions are tested; the lowest MSE$_e$, MAE$_e$, $\max(e)$, MSE$_{e_c}$, MAE$_{e_c}$ and $\max(e_c)$ are obtained with the sigmoid kernel function, which proves to be the best. The proposed algorithm is first applied to a single-input single-output system, with and without disturbances, demonstrating its robustness in rejecting disturbances and in accelerating the learning phase of the neural model and neural controller. It is then applied to a MIMO system, where it also gives good results.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  • O. Mohareri, R. Dhaouadi, and A.B. Rad, Indirect adaptive tracking control of a nonholonomic mobile robot via neural networks, Neurocomputing 88 (2012), pp. 54–66. doi:10.1016/j.neucom.2011.06.035.
  • A.A. Bohari, W.M. Utomo, Z.A. Haron, N.M. Zin, S.Y. Sim, and R.M. Ariff, Speed tracking of indirect field oriented control induction motor using neural network, Procedia Technol. 11 (2013), pp. 141–146. doi:10.1016/j.protcy.2013.12.173.
  • S. Slama, A. Errachdi, and M. Benrejeb, Adaptive PID controller based on neural networks for MIMO nonlinear systems, J. Theor. Appl. Inf. Technol. 97 (2) (2019), pp. 361–371.
  • A. Errachdi and M. Benrejeb, Performance comparison of neural network training approaches in indirect adaptive control, Int. J. Control. Autom. Syst. 16 (3) (2018), pp. 1448–1458. doi:10.1007/s12555-017-0085-3.
  • N. Ben, W. Ding, D.A. Naif, and E.A. Fuad, Adaptive neural state-feedback tracking control of stochastic nonlinear switched systems: An average dwell-time method, IEEE Trans. Neural Networks Learn. Syst. 30 (4) (2018), pp. 1076–1087. doi:10.1109/TNNLS.2018.2860944.
  • N. Ben, L. Yanjun, Z. Wanlu, L. Haitao, D. Peiyong, and L. Junqing, Multiple lyapunov functions for adaptive neural tracking control of switched nonlinear non-lower-triangular systems, IEEE Trans. Cybern. 99 (2019). doi:10.1109/TCYB.2019.2906372
  • P.O. Hoyer and A. Hyvärinen, Independent component analysis applied to feature extraction from colour and stereo images, Network 11 (3) (2000), pp. 191–210. doi:10.1088/0954-898X_11_3_302.
  • L.J. Cao, K.S. Chua, W.K. Chong, H.P. Lee, and Q.M. Gu, A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine, Neurocomputing 55 (1–2) (2003), pp. 321–336. doi:10.1016/S0925-2312(03)00433-8.
  • I. Guyon and A. Eliseeff, An introduction to variable and feature selection, J. Mach. Learn. Res. 3 (2003), pp. 1157–1182.
  • J. Weston, S. Mukherjee, O. Chapelle, M. Pontil, T. Poggio, and V.N. Vapnik, Feature selection for SVMs, Adv. Neural Inform. Process. Syst. 13 (2001), pp. 668–674.
  • F.E.H. Tay and L.J. Cao, Saliency analysis of support vector machines for feature selection, Neural Network World 2 (1) (2001), pp. 153–166.
  • F.E.H. Tay and L.J. Cao, A comparative study of saliency analysis and genetic algorithm for feature selection in support vector machines, Intell. Data Anal. 5 (3) (2001), pp. 191–209. doi:10.3233/IDA-2001-5302.
  • K. Lee and V. Estivill-Castro, Feature extraction and gating techniques for ultrasonic shaft signal classification, Appl. Soft Comput. 7 (2007), pp. 156–165. doi:10.1016/j.asoc.2005.05.003.
  • H. Qinshu, L. Xinen, and X. Shifu, Comparison of PCA and model optimization algorithms for system identification using limited data, J. Appl. Sci. 13 (11) (2013), pp. 2082–2086. doi:10.3923/jas.2013.2082.2086.
  • R. Zhang, J. Tao, R. Lu, and Q. Jin, Decoupled ARX and RBF neural network modeling using PCA and GA optimization for nonlinear distributed parameter systems, IEEE Trans. Neural Networks Learn. Syst. 29 (2) (2018), pp. 457–469. doi:10.1109/TNNLS.2016.2631481.
  • M.L. Wang, X.D. Yan, and H.B. Shi, Spatiotemporal prediction for nonlinear parabolic distributed parameter system using an artificial neural network trained by group search optimization, Neurocomputing 113 (2013), pp. 234–240. doi:10.1016/j.neucom.2013.01.037.
  • S. Yin, S.X. Ding, A.H. Abandan Sari, and H.Y. Hao, Data-driven monitoring for stochastic systems and its application on batch process, Int. J. Syst. Sci. 44 (7) (2013), pp. 1366–1376. doi:10.1080/00207721.2012.659708.
  • E. Aggelogiannaki and H. Sarimveis, Nonlinear model predictive control for distributed parameter systems using data driven artificial neural network models, Comput. Chem. Eng. 32 (6) (2008), pp. 1225–1237. doi:10.1016/j.compchemeng.2007.05.002.
  • M. Madhusmita and H.S. Behera, Kohonen self organizing map with modified K-means clustering for high dimensional data set, Int. J. Appl. Inf. Syst. (IJAIS) 2 (3) (2012), pp. 34–39 (Foundation of Computer Science FCS, New York, USA).
  • S. Buchala, N. Davey, T.M. Gale, and R.J. Frank, Analysis of linear and nonlinear dimensionality reduction methods for gender classification of face images, Int. J. Syst. Sci. 36 (14) (2005), pp. 931–942. doi:10.1080/00207720500381573.
  • M. Lennon, G. Mercier, M.C. Mouchot, and L. Hubert-Moy, Curvilinear component analysis for nonlinear dimensionality reduction of hyperspectral images, Proc. SPIE Image Signal Process Remote Sens. VII 4541 (2001), pp. 157–168.
  • N.K. Batmanghelich, B. Taskar, and C. Davatzikos, Generative-discriminative basis learning for medical imaging, IEEE Trans. Med. Imaging 31 (2012), pp. 51–69. doi:10.1109/TMI.2011.2162961.
  • L. van der Maaten, E. Postma, and J. van den Herik, Dimensionality reduction: A comparative review, Tilburg Centre for Creative Computing, Tilburg University, Tilburg, The Netherlands, 2009.
  • K. Kuzniar and M. Zajac, Data pre-processing in the neural network identification of the modified walls natural frequencies, Proceedings of the 19th International Conference on Computer Methods in Mechanics CMM-2011, Warszawa, 9–12 May, 2011, pp. 295–296.
  • V.M. Janakiraman, X. Nguyen, and D. Assanis, Nonlinear identification of a gasoline HCCI engine using neural networks coupled with principal component analysis, Appl. Soft Comput. 13 (2013), pp. 2375–2389. doi:10.1016/j.asoc.2013.01.006.
  • K. Seerapu and R. Srinivas, Face recognition using robust PCA and radial basis function network, Int. J. Comput. Sci. Commun. Networks 2 (5) (2012), pp. 584–589.
  • N.M. Peleato, R.L. Legge, and R.C. Andrews, Neural networks for dimensionality reduction of fluorescence spectra and prediction of drinking water disinfection by-products, Water Res. 136 (2018), pp. 84–94. doi:10.1016/j.watres.2018.02.052
  • C. Sucheta and K.V. Prema, Effect of dimensionality reduction on performance in artificial neural network for user authentication, 3rd IEEE International Advance Computing Conference (IACC), Ghaziabad, India, 2013.
  • G.E. Hinton and R.R. Salakhutdinov, Reducing the dimensionality of data with neural networks, Science 313 (2006), pp. 504–507. doi:10.1126/science.1127647.
  • Q. Zhu and C. Li, Dimensionality reduction with input training neural network and its application in chemical process modelling, Chinese J. Chern. Eng. 14 (5) (2006), pp. 597–603. doi:10.1016/S1004-9541(06)60121-3.
  • C.-Y. Cheng, C.-C. Hsu, and M.-C. Chen, Adaptive kernel principal component analysis (KPCA) for monitoring small disturbances of nonlinear processes, Ind. Eng. Chem. Res. 49 (2010), pp. 2254–2262. doi:10.1021/ie900521b.
  • C. Chakour, M.F. Harkat, and M. Djeghaba, New adaptive kernel principal component analysis for nonlinear dynamic process monitoring, Appl. Math. Inf. Sci. 9 (4) (2015), pp. 1833–1845.
  • R. Fezai, M. Mansouri, O. Taouali, M.F. Harkat, and N. Bouguila, Online reduced kernel principal component analysis for process monitoring, J. Process Control 61 (2018), pp. 1–11. doi:10.1016/j.jprocont.2017.10.010.
  • Y. Xiao and Y. He, A novel approach for analog fault diagnosis based on neural networks and improved kernel PCA, Neurocomputing 74 (2011), pp. 1102–1115. doi:10.1016/j.neucom.2010.12.003.
  • A. Errachdi and M. Benrejeb, On-line identification using radial basis function neural network coupled with KPCA, Int. J. Gen. Syst. 45 (7) (2016), pp. 1–15.
  • K.N. Reddy and V. Ravi, Differential evolution trained kernel principal component WNN and kernel binary quantile regression: Application to banking, Knowledge-Based Syst. 39 (2013), pp. 45–56. doi:10.1016/j.knosys.2012.10.003.
  • I. Klevecka and J. Lelis, Pre-processing of input data of neural networks: The case of forecasting telecommunication network traffic, Telektronikk 104 (3/4) (2008), pp. 168–178.
  • M. Shirzadeh, A. Amirkhani, A. Jalali, and M.R. Mosavi, An indirect adaptive neural control of a visual-based quadrotor robot for pursuing a moving target, ISA Trans 59 (2015), pp. 290–302. doi:10.1016/j.isatra.2015.10.011.
  • S.J. Yoo, J.B. Park, and Y.H. Choi, Indirect adaptive control of nonlinear dynamic systems using self recurrent wavelet neural networks via adaptive learning rates, Inf. Sci. 177 (2007), pp. 3074–3098. doi:10.1016/j.ins.2007.02.009.
  • B. Scholkopf and A. Smola, Learning with Kernels, MIT Press, Cambridge, 2002.
  • K.S. Narendra and K. Parthasarthy, Identification and control of dynamical systems using neural networks, IEEE Trans. Neural Networks 1 (1) (1990), pp. 4–27. doi:10.1109/72.80202.
