Abstract
This paper develops a representation of multi-model based controllers using artificial intelligence techniques. These techniques will be graph theory, neural networks, genetic algorithms, and fuzzy logic. Thus, graph theory is used to describe in a formal and concise way the switching mechanism between the various plant parameterizations of the switched system. Moreover, the interpretation of multi-model controllers in an artificial intelligence frame will allow the application of each specific technique to the design of improved multi-model based controllers. The obtained artificial intelligence-based multi-model controllers are compared with classic single model-based ones. It is shown through simulation examples that a transient response improvement can be achieved by using multi-estimation based techniques. Furthermore, a method for synthesizing multi-model-based neural network controllers from already designed single model-based ones is presented, extending the applicability of this kind of technique to a more general type of controller. Also, some applications of genetic algorithms and fuzzy logic to multi-model controller design are proposed. In particular, the mutation operation from genetic algorithms inspires a robustness test, which consists of a random modification of the estimates which is used to select the one leading to the better identification performance towards parameterizing online the adaptive controller. Such a test is useful for plants operating in a noisy environment. The proposed robustness test improves the selection of the plant model used to parameterize the adaptive controller in comparison to classic multi-model schemes where the controller parameterization choice is basically taken based on the identification accuracy of each model. Moreover, the fuzzy logic approach suggests new ideas to the design of multi-estimation structures, which can be applied to a broad variety of adaptive controllers such as robotic manipulator controller design.
Adaptive control is a well-known control method which allows to stabilize dynamical system with partially or totally unknown parameters as well as the achievement of acceptable tracking performances (VanDoren Citation2003; Aström and Wittenmark Citation1995). The designed adaptive controllers have been successfully used when the plant has unknown but constant parameters as well as when the plant is time-varying provided that the variation of its parameters is slower than the adaptation rate of the adaptive scheme. Nevertheless, sudden changes in the plant operating point, which entail a sudden change in the plant parameter values, are not surprising in many industrial applications such as in certain chemical engineering processes or electric motors (Cezayirli and Ciliz Citation2004). If a single identification model is used, then the system will have to adapt itself to the new situation before an adequate control action can be taken. Hence, single model-based adaptive controllers may lead to a poor transient response in terms of the plant output deviation from a reference one in these situations. Furthermore, an inadequate initialization of the estimated plant parameters in the estimation algorithm may result in unacceptable large transient deviations. Thus, multi-estimation-based controllers appeared as a way to improve the transient response of adaptive systems (Narendra and Balakrishnan Citation1994; Gregorcic et al. Citation2001; Chang and Davison Citation1999; Narendra and Balakrishnan Citation1997; Mosca and Agnoloni Citation2001; Hocherman-Frommer et al. Citation1998; Ibeas et al. Citation2003) by considering a set of parameter estimation algorithms running in parallel, each one being initialized by a different set of estimated plant parameter values. This mechanism allows to quickly correct any potential erroneous choice of the initial values of the plant parameters selected to start to run the estimation algorithm due to a convenient parameterization of the adaptive controller by adequate switching between estimators. Moreover, the existence of a set of estimation algorithms running in parallel allows to improve the control action when the plant changes its operation point abruptly by a convenient switching to an estimator closer to the new plant parameterization (Ibeas et al. Citation2003; Cezayirli and Ciliz Citation2004). A general multi-model-based control scheme is composed of a set of different plant models running in parallel. These models, which may be fixed (Narendra and Balakrishnan Citation1994; Gregorcic et al. Citation2001; Chang and Davison Citation1999) or adaptive (Narendra and Balakrishnan Citation1997; Mosca and Agnoloni, Citation2001; Hocherman-Frommer et al. Citation1998; Ibeas et al. Citation2003) are different from each other in what is concerned with their structure and/or their parameter values. Then, a higher level switching structure between the various plant models chooses, according to a closed-loop performance index built for each one, the one used to calculate the control law at that time instant. In Mosca and Agnoloni (Citation2001) and Hocherman-Frommer et al. (Citation1998), several performance indexes for that purpose are discussed. Despite the specific form of the performance index is different from one work to another, it is common to define it based on the error between the real plant output and each model's one, which is a natural choice since that index reflects how far a specific model is from the real plant behavior. Thus, the switching law acts as a supervisor of the system behavior deciding in real time the estimator that will parameterize the adaptive controller. Hence, this kind of control architecture allows to develop control system schemes capable of achieving a good performance in terms of speed, accuracy, and stability for increasingly complex systems (Ibeas et al. Citation2003; Sun and Ge Citation2003; Cezayirli and Ciliz Citation2004; Chen and Narendra Citation2001).
The structure and operation of the switching law have been previously studied from an artificial intelligence point of view in an expert systems context (de la Sen et al. Citation2004; de la Sen and Almansa Citation2002). In addition, this paper proposes a formal interpretation of the multi-model (multi-estimation in the more general case concerned with adaptive models) switching structure in a more general artificial intelligence frame. First, the switching mechanism between plant models is described using a graph theory-based formalism. A graph containing information about the number of estimators running in parallel as well as the possibility of switching between them is introduced. The concept of extended adjacency matrix is proposed and used to describe the way in which the switching law between estimators is implemented and defined in a practical multiestimation scheme. An adaptive, being a more general case than that concerned with fixed models, will be used to represent the formalism. Thus, the switching rule is completely described by the proposed graph while the realization of the switching rule in a concrete simulation example leads to a walk in this graph. Once a general formalism representing the switching scheme is stated, the multi-estimation scheme is represented by using artificial intelligence typical structures such as: artificial neural networks (ANN), genetic algorithms (GA), and fuzzy logic (Da Ruan 1997; Fausett Citation1998; Etxebarria Citation1994; Beyers Citation1998; Tilli Citation1992; Ibeas and de la Sen Citation2004). This interpretation will allow the use of specific characteristics of each technique to the design of improved multi-model control schemes. Thus, a method for synthesizing multi-model-based neural network controllers from previously designed single model ones is proposed. Also, a complete multi-estimation-based control scheme built using ANN technology entirely is described. Since there are a number of estimation algorithms running in parallel, only one parameterizing the adaptive controller at each time instant, we can use the selected estimation algorithm to guide the rest of them to a possibly improved estimated parameter vector. The way for implementing this issue is to modify the estimated parameter vector updating equation to include a term which monitors the updating process in the direction of the selected current best estimated vector. Moreover, some applications of genetic algorithms and fuzzy logic to multi-model control design are presented. The genetic algorithm approach will inspire the incorporation of a robustness measure of each model in the performance index used for evaluating the closed-loop behavior of the set of estimation algorithms running in parallel. The robustness test consists of a random perturbation of the estimated plant parameter vector values in order to select which one of them leads to a better identification performance under random modification of its values. Thus, the resulting scheme will be able to improve its transient behavior in comparison to the classic multi-model schemes, where this robustness measure is not included, when the plant is operating in a noisy environment as computer simulations have shown. Furthermore, a fuzzy logic approach is presented to interpret the architecture of multi-estimation-based controllers. This problem focusing way can be applied to the design of multi-model controllers, as recent works (Ibeas and de la Sen Citation2004; de la Sen and Almansa Citation2002; Ibeas et al. Citation2004) use in the control of uncertain robotic manipulators. Each representation will be compared with the traditional single model-based adaptive controller design in order to illustrate the usefulness of the proposed schemes.
PRELIMINARIES ON SWITCHED LINEAR SYSTEMS
In this section, a brief description of the switched linear control scheme used for subsequent discussion is presented. The discussion has been restricted to linear systems to avoid the existence of finite escape times. Thus, a switched linear system is described (unifying the notation for the continuous and discrete time systems) by:
Architecture of an Adaptive Multi-Estimation Scheme for Discrete Linear Systems
In this section, a case study of a multi-estimation-based control scheme for linear discrete systems is presented. Even if multi-model control schemes can be applied either to continuous or discrete systems (as Equation (Equation1) highlights), the following discrete example will be used through the work to illustrate the presented artificial intelligence formalism in a concrete control scheme. Thus, our objective is to design a model reference following pole placement-based multi-estimation-based control for the discrete (the continuous case can be treated in the same way) time invariant linear SISO plant:
![](/cms/asset/a87a489a-e62b-47f4-8280-101130ce4b7b/uaai_a_190291_o_ilm0047.gif)
Then, this estimated parameter vector
k is used for control law calculations at each sample by using the certainty equivalence principle (Aström and Wittenmark Citation1995). If this estimated vector is initialized far from the real plant parameter vector, then the transient response will have large deviations from the desired output, resulting in a bad transient tracking performance. Furthermore, if during the system operation, the plant's operating point changes abruptly at certain time instants, then the system will exhibit an unsuitable transient response. These facts motivate the consideration of a set of estimation algorithms running in parallel, each one with its own estimated plant parameter vector:
Thus, there exists N
e
estimation algorithms running in parallel (i.e., at each sampling time t
k
= kT, T being the sampling period, every algorithm gives the estimate parameter vector and the estimated plant output
, based on past plant input and output measurements). Each algorithm is different from each other in what is concerned with the estimated parameter vector initialization and/or the kind of the estimation algorithm and integrates the so-called multi-estimation scheme. Denote by c
k
the identification algorithm which parameterizes the adaptive controller at time t
k
. A switching rule based on the identification errors
, of the N
e
estimation algorithms chooses at each sampling time t
k
the individual estimation scheme (identifier) c
k
, which parameterizes the active controller at time t
k
. Thus, any potential erroneous choice of the initial estimated plant parameter vector is quickly corrected by the switching structure via looking for the “best” estimation model. Moreover, when the system changes its operation point, and this fact is detected through a monitorizing index according to a prescribed threshold, then all the estimation algorithms are reset to their original locations and uniformly distributed over the known, convex, and compact parameter space. Hence, when a change in the plant parameters happens, then the system is able to select the most convenient parameterization of the adaptive controller to achieve an adequate closed-loop transient performance (Ibeas et al. Citation2003). The proposed identification performance supervisory index for each estimation algorithm used for selecting the current one to parameterize the adaptive controller (once the minimum residence time constraint is fulfilled) is:
Algorithm 1. (Switching Between Estimators) | |||||
{k = 0;N r > 0, c 0, selected by the designer, ST = Ø(The Initial Switching Samples Sequence Is Empty)} | |||||
k lastSwitch = 0 | |||||
for k > 0 do | |||||
if (k − k lastSwitch ) ≥ N r then | |||||
| |||||
c
k
← | |||||
if c k ≠ c k−1 then | |||||
ST ← ST ∪ {k}, klastSwitch ← k end_if | |||||
else | |||||
ck= ck − 1 | |||||
end_if | |||||
end_for | |||||
{the current estimation algorithm parameterizing the adaptive controller c k is determined}, |
GRAPH THEORY-BASED REPRESENTATION OF THE SWITCHING PROCESS
In the following, a graph theory-based representation of the switching process is stated. Previously to such a representation, a brief background on graph theory is introduced.
Brief Background and Notation on Graph Theory
A brief background and notation on graph theory is now given to make the subsequent representation clear (Chartrand and Lesniak Citation1979; Diestel Citation2000). A graph (digraph or directed graph) is a pair G = (V, E)(G = (V, A) for digraphs) consisting in a finite nonempty set V of objects called vertices together with a (possible empty) set of unordered pairs (ordered pairs for the case of digraphs) of distinct vertices of V called edges (arcs or directed edges). The edge e = {u, v} ∈ E is said to join the vertices u ∈ V and v ∈ V. If e = {u, v} is an edge of the graph G, then u and v are adjacent vertices. Moreover, a loop graph is defined as a nonempty set V of vertices together with a (possible empty) set E of edges consisting of one or two element subsets of V, each one-element subset is referred to as a loop. A loop graph that admits multiple edges (including multiple loops) is called a pseudograph. Furthermore, a pseudograph is extended to a pseudo-digraph by considering arcs instead of edges to join vertices. The number of vertices is called the cardinal of V (or the degree or power of the graph G). A walk in G is defined by:
The complete graph of degree n, denoted by K n , is defined as the graph in which every two of its vertices are adjacent (the complete symmetric digraph or order n, denoted by K n *, has both arcs (u, v) and (u, v) for every two distinct vertices u, v ∈ V). The complete pseudograph K pn and the complete symmetric pseudo-digraph K pn * are obtained by adding a loop (being an edge or an arc respectively) to all vertices in V. The adjacency matrix of a digraph G = (V, A) with n vertices is the n × n matrix [a ij ] defined by a ij = 1 if (v i , v j ) ∈ E(A) and a ij = 0 otherwise. A weighted graph (weighted digraph) is a graph (digraph) in which a positive real number, called the weight of e (a) and denoted by w(e) (w(a)) is assigned to each edge (arc) e ∈ E(a ∈ A). The adjacency matrix concept can be extended to weighted graphs by considering the weighted adjacency matrix [a ij ] defined by a ij = w(e(a)), if e(a) = (v i , v j ) ∈ E(A) and a ij = 0 otherwise.
Graph Theory-Based Representation of the Switching Law
A graph theory-based description of the switching subsystem described by Equations (Equation1) is now given which is then used to describe the general adaptive multiestimation scheme described previously. Basically, the vertices of the graph represent estimation algorithms (or equivalently parameterizations of the dynamical system (1)) while the arcs represent switches between different estimators. Auto loops are used to highlight the fact that a minimum residence time at each active estimation algorithm (or system parameterization) has to be respected in order to make the switching law well posed. Now, the switching law of the multi-estimation scheme displayed in Figure , composed of N e estimation algorithms running in parallel, can be represented by a complete symmetric pseudo-digraph, called a switching graph, constructed using the following rules:
Rule 1. Consider the complete symmetric pseudo-digraph of order N
e
, | |||||
Rule 2. Select one vertex as the starting vertex marking it with an arrow. This rule may be interpreted as the selection of the initial estimation algorithm chosen to start to run the switched linear system. | |||||
Rule 3. There is a vector weight associated with each arc between vertices whose first component is the (non-negative) supervisory index associated with the switching between both estimators, while its second component is the (positive) time associated with the residence time interval within which the system has been parameterized by the previous estimation scheme. | |||||
Rule 4. There exists a vector weight associated with each auto loop composed of two components: The first one is the supervisory index of the current parameterization of the system if it does not switch to another parameterization once the residence time constraint has been fulfilled. The second one is the minimum residence time the switching mechanism is bound to maintain the current system parameterization active in order to guarantee the switching rule is well posed. |
Remarks
1. | The proposed switching graph represents the operation mechanism of the switching process and, hence, it is a complete description of the switching rule. Thus, the graph represents a switching rule where the system is able to switch from one system parameterization to any other once the residence time constraint is fulfilled. | ||||
2. | A walk in this switching graph can be interpreted as a concrete realization of the switching rule applied to a particular case study. | ||||
3. | Note that the first superscript in the supervisory index (6) indicates the estimator which has been parameterizing the adaptive controller during the last residence time interval, while the second one labels the next estimator which potentially can parameterize the adaptive controller at the following time instant. |
In order to represent the graph in a more formal way by using concepts and descriptions from the graph theory, the following definition of extended adjacency matrix is introduced:
Definition 1
The extended adjacency matrix of a pseudo-digraph G = (V, A) with card(V) = n is the square n × n matrix [a
ij
] whose entries are defined by the vector if (v
i
) ∈ A,
if (v
i
, v
j
) ∈ A and a
ij
= [0, 0] = 01×2. Otherwise, where N
rmini
is the minimum residence time (minimum residence number of samples), a selected estimation algorithm must remain parameterizing the adaptive controller to guarantee the switching law is well posed, while N
ri
is the real residence time (real residence number of samples) the selected estimation algorithm has been parameterizing the adaptive controller before switching to another estimation algorithm. Hence, the switching rule can be formally determined by defining the switching graph together with the extended adjacency matrix. For example, the extended adjacency matrix for a switching graph composed of five identifiers running in parallel is:
Remarks
1. | The extended adjacency matrix is not symmetric despite the digraph is topologically symmetric and complete. The main reason for this stands on that the supervisory index | ||||
2. | Note that since the vector weights of the graph (in particular, the supervisory index |
Example of the Graph Representation of the Switching Law
An example of the switching graph representation for the discrete SISO linear control scheme defined previously is developed. Consider the following system described by the (unknown for control purposes) plant:
![](/cms/asset/a87a489a-e62b-47f4-8280-101130ce4b7b/uaai_a_190291_o_ilm0047.gif)
Figure shows simulations of the multi-estimation control scheme.
Moreover, the realization of the switching law over this concrete plant and multi-estimation scheme until time t = 50T has led the system to describe the path displayed in Figure .
ARTIFICIAL NEURAL NETWORKS
In this section, an artificial neural network (ANN) (Fausett Citation1998; Wang and Qing Citation2004; Etxebarria and de la Sen Citation1996; Melin and Castillo Citation2003) representation for the multi-estimation-based control scheme described previously is presented. First, we will introduce a possible representation of the parallel estimation scheme into a unique estimation neural network. Then, the architecture of the complete multi-estimation scheme will be formally described by using a set of different interconnected neural networks. Furthermore, the selected estimation algorithm chosen to parameterize the adaptive controller in real time is used for monitoring the evolution of the estimated parameter vectors of the remaining estimation algorithms running in parallel to the directions of this one which achieves a better closed-loop response.
ANN for Representing Single Estimation-Based Schemes
The following two-layered ANN is presented as a model for a discrete time single model-based adaptive control in Etxebarria (Citation1994). The difference equation (Equation2–Equation3) is implemented by the ANN displayed in Figure , where the activation functions are linear for all neurons.
The ANN output (whose role is played by the estimated plant output) can be written as:
![](/cms/asset/a87a489a-e62b-47f4-8280-101130ce4b7b/uaai_a_190291_o_ilm0047.gif)
ANN for Representing Multi-Estimation Schemes
Now, the multi-estimation scheme introduced previously can be represented using an ANN by increasing the number of neurons in the output layer of the given ANN to a number of neurons equal to the number of different estimators used in the multi-estimation scheme. Since the output layer has one single neuron in the previous network, a multiestimation scheme with N e estimators running in parallel will have N e neurons in its output layer, as Figure displays for the case of N e = 2.
Hence, the number of connections between neurons and the number of weights are increased. Thus, the proposed ANN is a unique structure containing itself the N e estimated parameter vectors (which are represented by the corresponding weights vectors). Furthermore, the ANN plant estimated outputs are obtained as:
From the weight values w
ij, k
, two estimated plant parameter vectors may be defined by . The target vector (with which the ANN is trained) is defined in this case by repeating the original target value as many times as the number of estimators used. Since the original target value was the real measured plant output, y
k
, in the general case with N
e
estimators, the new target vector will be:
Then the switching logic compares the performance indexes (6) associated to each output of the ANN and chooses the set of weights (estimated parameter vector) associated with the best estimated output, according to the controller selection algorithm described previously in order to calculate the control signal. The training rule is the generalization of the Widrow-Hoff single output training rule (8) to the multiple output case:
Rule 1. The new ANN has the same number of layers as the original single model ANN. | |||||
Rule 2. Increase the number of neurons in the output layer to a quantity of N o ′ = N e N o . | |||||
Rule 3. Build a new target vector by repeating the original one as many times as the original target vector used for training the network. Thus, the new training vector is given by: | |||||
Rule 4. Extend the original training algorithm to the new neural network by considering the new output set and the original input one. |
Note that the number of network outputs is increased N e times from the original neural network, leading to the possibility of selecting which ones of them are used later. Hence, these rules define a multi-model neural network which includes all the sets of estimation algorithms, while the switching logic acts as an intelligent supervisor deciding the set of weights and/or the set of outputs that will be used for control purposes at each time instant. In such an easy way, the multi-model structure can be integrated together with conventional neural network-based controllers in order to obtain more general ANN-based multi-model structures. The general multi-model neural network scheme is displayed in Figure .
Simulation Example for the ANN Representation
In this section, a simulation example containing two estimation algorithms and training rule (9) for the ANN displayed in Figure is presented. The switching logic is assumed to respect a minimum residence number of samples given by N r = 2 between successive switchings between estimation algorithms in order to guarantee closed-loop stability (Ibeas et al. Citation2003). The switching law and the controller design (based on a model following pole placement algorithm) are described with detail in Ibeas et al. (Citation2003). The discrete plant and reference model are described, respectively, by the parameter vectors:
It is taken ε = 0.001 and α = 1. The input signal is a unity square wave with a 20 sample period. The performance index to decide switches is given by Equation (Equation6) with λ = 0.95, β2 = 1, and β3 = 0. This choice for the parameters β indicates that the switching process only takes into account the identification error in order to choose the estimator which will parameterize the adaptive controller. The single adaptive control scheme is initialized with the first estimator. The following simulations comparing both the single estimation ANN and the multi-estimation one are obtained.
It can be concluded that a transient response improvement might be achieved by a using a multi-estimation scheme (see Figures and ). The transient overshot is less in the multi-estimation scheme than in the single-based one, while the real plant output tends to the desired output more smoothly in the multi-estimation case than in the single-based one as well. Also, note that the system improves its behavior by using the best weight set at each time (respecting the residence time constraint). The switching map c k illustrating the switching process between both sets of weights (estimated parameter vectors) is shown in Figure . The proposed ANN integrates in a unique structure the complete set of estimation models running in parallel and the updating equations for the weights, whose role is represented by the estimated parameter vectors of a classic multi-estimation scheme. Thus, a multi-model ANN has been designed from a previously designed single model-based one in an easy way and applied to a concrete problem.
ANN Description of the Architecture of the Multi-estimation Control Scheme
In this section, an ANN description of the architecture of the complete multi-estimation control scheme is stated. The key idea that develops the description is that ANN can be used to approximate (generate) any continuous nonlinear function (Fausett Citation1998). In particular, ANN will be able to generate the control signals and the supervision indexes that evaluate the identification performance of each estimation algorithm. Hence, we can describe the complete scheme by using a set of interconnected neural networks as displayed in Figure .
The scheme is composed of the following networks:
Network 1 (identification neural network). The aim of this network is to identify the parameters of the plant using the measurements of the output and the input to the plant y k , u k , respectively (obtained online for all time instants), in order to then use the estimated parameter values to calculate the control law by using the certainty equivalence principle. This set of estimation algorithms may be implemented either by N e independent identification neural networks running in parallel (which means that there is no information crossing between them despite all of them use the same input/output measurements to update their corresponding vectors), or by the complete ANN proposed in the previous section, where the evolution of some parameters modify the evolution of the remaining ones since all of them are trained together in a unique network. The entire network is trained by comparing the estimated output with the real plant output. Furthermore, it is not necessary to define the weights of the neural network as being the plant parameter vectors. The proposed architecture is applicable to any adaptive neural network controller, where the information that transmits the identification network to the next one is the network weights and/or outputs. | |||||
Network 2 (estimated control signals network). Once the identification neural network has supplied the set of estimated parameter vectors and estimated outputs (in general its weights and outputs), a new ANN is used to calculate the potential control input corresponding to each estimated parameter vector. This step is necessary since the performance index may include a term weighting the change in the control signal due to switching. If the performance index were chosen without this term, this network would be removed from the presented scheme. This network may also be organized as composed by a number of individual ANNs working in parallel or by a unique one calculating all potential control signals in parallel. Since this network is disposed to calculate the potential pole-placement based control signals, it is not necessary to train it since it is basically used for calculation purposes. | |||||
Network 3 (performance index calculation). Then the performance index can be calculated by using the estimated plant outputs and the estimated potential control inputs. Note that the performance index (6) includes a sum of these values over a window of previous samples. Hence, it would be necessary to include a battery of delays in the ANN first layer to hold the previous values of the estimated plant outputs and estimated plant parameter vectors. The length of this battery is the size of the supervision window H. Also, the performance index (6) involves a set of weights which act to modify the relative contribution of each addend to the global index. These weights could be supervised by alternative supervision algorithms as well (see, for instance, Ibeas and de la Sen [2004]). | |||||
Network 4 (network for supervising the weights of the performance index). The workings of the supervision algorithm are based on heuristic rules and dynamically modify the value of the weighting factors according the evolution of the system through time. If the values of the weighting factors were constant, then this network implementing the supervision could be removed from the scheme leading to a simpler one. | |||||
Network 5 (switching rule implementation). Finally, the switching rule selects one of the estimators to parameterize the adaptive controller using the values provided by the performance index, or in other words, it selects the estimated control signal that will be effectively applied to the plant. |
Thus, note that the system architecture can be implemented by using an ANN technology. The system can be simplified by removing the corresponding networks associated with different supervision levels in the overall scheme. Moreover, it has been considered that the identification neural network can be divided into N e independent neural networks, each one implementing one identification algorithm, while it being possible to possess a number of estimation algorithms running in parallel with mutual cross information. If we detect that an estimation algorithm is working better than the other ones, then we can aim the rest of them in the direction of the estimated parameter vector that is working in a better way. The following coupled (with mutual cross information between the estimation algorithms which compose the parallel scheme) multi-estimation algorithm is proposed.
Coupled Multi-estimation Scheme
The key idea that allows to define a coupled estimation algorithm lies on the fact that when the system detects that a concrete estimation algorithm leads to a better behavior than the others, then the scheme makes the remaining estimators evolve in the direction of the former. Thus, the following algorithm is proposed:
Algorithm 2. (Coupled identification algorithms)
for k > 0 do | |||||
if (k - lastSwitchingSample) < (residenceTime) then // the residence time constraint is not fulfilled | |||||
for i = 1, 2,…, N e do // Update estimated parameter vectors according to the standard algorithm | |||||
end_for | |||||
else | |||||
for i = 1, 2,…, N e do | |||||
if
| |||||
else | |||||
end_if | |||||
end_for | |||||
end_if | |||||
end_for, |
In the case when , the upper bound
, defined in the same way, goes to infinity and no condition over the system gain
value is stated since δ is finite. In practice, this condition means that the contribution of the new term to the dynamics of the estimation algorithm is small enough to maintain the asymptotic and stability properties of the original one. Note that the modification rule is only applied after the minimum residence time constraint is fulfilled. Therefore, the best estimation algorithm is known when the residence time constraint is respected. While in the other situation, we do not know which one is the best and thus the standard least squares algorithm has to be used. The following example illustrates the workings of this algorithm in a case study.
Case Study
Consider the following (unknown for control purposes) plant:
There are thirty estimation algorithms running in parallel initialized by:
This index is a measure of the accumulated deviation of the real closed-loop output from the reference one, which gives an idea of the transient behavior of the scheme. The single model-based adaptive controller is initialized by the first estimator, while the parameters of the standard multi-estimation scheme are λ = 0.95 and β1 = 0.95;β2 = 0.05 in the supervision index (6). The coupled estimation algorithm works with a constant value of for all i and all k. The reference input is a sine wave of amplitude ±1 and angular frequency of ω = 2πrad/s.
In can be concluded that the coupled estimation algorithm may improve the closed-loop response of the control scheme, reducing the mean deviation of the real output from the reference one as the performance index reveals in Figure .
GENETIC ALGORITHMS
Genetic algorithms are used usually as optimization tools in complex problems (Abdennour Citation2004; Wetter and Wright Citation2004; Jamshidi et al. Citation2003). However, they have been applied recently to the intelligent control of dynamic processes (Mwembeshi et al. Citation2004). In this section, a genetic algorithm representation for multi-estimation-based control schemes is given. The key idea is to use the natural selection and the genetics to obtain, at each generation, more accurate solutions to an original problem (Beyers Citation1998). The multi-estimation-based adaptive control architecture has a great similarity with the structure and operation process of a genetic algorithm as Figure illustrates.
Thus, typical operators in the genetic algorithm context, namely, selection, cross-over, and mutation, have an adaptive counterpart in this context. The mutation operation is used in the sequel to select the estimated plant parameter vectors, which have an improved robustness property in comparison to the remaining estimators. The role of the cross over operator is represented by the estimation algorithm, while the selection operator is represented by the estimation algorithm selection policy (switching rule). Note that since a set of N
e
estimation algorithms integrate the parallel multi-estimation scheme, it is obvious the usefulness of selecting the subset of them that better fits the identification performance to run the multi-estimation scheme with them while removing the rest, leading to worst behavior from the scheme. Hence, the mutation operation, typical in GA processes, can be applied to our problem in the following concrete way. At each time instant, mutate randomly the set of estimated parameter vectors to obtain a set of perturbed estimated parameter vectors
. This new set is obtained by summing a Gaussian random perturbation with zero mean, variance unity, and maximum amplitude A to each component of the estimated plant parameter vector. The amplitude of the mutation is expressed as a % of the values of the original estimates vector:
Thus, the perturbed estimated plant output can be compared with the unperturbed estimated plant one over a certain time interval to evaluate the robustness quality of each estimator. The proposed performance index is given by:
Genetic Robustness Refinement
Moreover, the number of estimation algorithms can be decreased through time in order to reduce the computational cost of the algorithm. Thus, the number of estimation algorithms running in parallel can be pruned according to the following algorithm.
Algorithm 3. (Prune of estimation algorithms from the multi-estimation parallel scheme)
{N e0, ΔN e > 0, N eThreshold , N test ≠ N r are selected by the designer} | |||||
N e (0) ← N e0, k lastTest ← 0 | |||||
or k > 0 do | |||||
if k = k lastTest + N test then | |||||
if N e (k) > N eThreshold then | |||||
Ne (k) ← Ne (k − 1) −)Δ Ne | |||||
klastTest ← k | |||||
else | |||||
Ne (k) ← Ne (k − 1) | |||||
end_if | |||||
else | |||||
Ne (k)← Ne (k − 1) | |||||
end_if | |||||
end_for , |
Simulation Example
Suppose that the plant input and output were affected by a Gaussian random perturbation with zero mean, unity variance, and a maximum amplitude of 4% of the original plant input and output signals. To generate the perturbed plant vectors , a Gaussian random perturbation is added to the estimated vectors as well. This test over the robustness quality of each estimator is used in the verification step, when the plant is working under the effect of a noise which will be simulated as a Gaussian random perturbation independent from the one used to define the perturbed plant outputs. In order to make a statistically meaningful interpretation of the proposed schemes, each scheme has been simulated 40 times affected by the random noise, and then the results have been averaged. The performance index (10) is used to compare the performance of all the schemes while the value of this index is averaged 40 times. The example is concerned with the same plant as in previous sections and the same reference model. There are 30 estimators running in parallel, while the single model-based adaptive control is initialized by the first one. The estimators are initialized by:
FIGURE 14 Comparison between the single model-based adaptive control and the multi-estimation schemes.
![FIGURE 14 Comparison between the single model-based adaptive control and the multi-estimation schemes.](/cms/asset/31d3c211-cbe8-4a97-aca3-557027acf877/uaai_a_190291_o_f0014g.gif)
It can be concluded that multi-estimation based-techniques may improve the transient response of the adaptive system via a convenient parameterization of the adaptive controller through time. Thus, the performance index (10) is smaller for the multi-estimation schemes than for the single model-based one. Furthermore, the robustness term of the performance index (11) allows to improve the behavior of the standard multi-estimation scheme in the presence of noise as Figure reveals for the performance index (12) for the robust multi-estimation scheme with a constant number of estimation algorithms. Also, if a prune over the multi-estimation scheme is performed, then the scheme improves its behavior in comparison to the single model-based one, but it gets worse in comparison to the original genetic algorithm one, which incorporates robustness issues. However, in this case, some computations are pruned, which reduces the memory storage requirements with the drawback that a worse behavior than the original complete genetic algorithm scheme, which maintains the number of estimators constant through time, is achieved.
FUZZY LOGIC APPROACH
In this section, a fuzzy logic approach for multi-estimation-based control schemes is given. As it is well known, fuzzy set theory is a generalization of the classical set theory (Tilli Citation1992; Alonso-Quesada et al. Citation2004; Ibeas and de la Sen Citation2004; de la Sen and Almansa Citation2002, Ibeas et al. Citation2004; de la Sen et al. Citation2003; Liutkevicius Citation2003; Wang and Qing Citation2004; Mwembeshi et al. 2003; Liutkevicius Citation2003; Wang and Qing Citation2004; Mwembeshi et al. 2003; Wetter and Wright Citation2004; Feng Citation2004). It allows a class of objects with a continuum grade of membership to a set. Such a set is characterized by a membership (characteristic) function which assigns to each object its grade of membership to the set ranging from one to zero. The classic set theory operations are extended to the fuzzy case as well. Inference relations over fuzzy set objects define the so-called fuzzy logic. Fuzzy logic, together with artificial neural networks and genetic algorithms, completes the set of techniques used in intelligent control (Liutkevicius Citation2003; Wang and Qing Citation2004; Mwembeshi et al. Citation2004; Wetter and Wright Citation2004; Feng Citation2004; Etxebarria and de la Sen Citation1996; Melin and Castillo Citation2003). Thus, in the multi-estimation scheme presented previously, an estimated parameter vector is chosen from a set of parameter estimated vectors
to parameterize the adaptive controller at each sampling time. However, instead of choosing a single estimated vector, it is also possible to define a combined estimated vector as:
In the standard cases (considered here and in Ibeas et al. [2003]), only one coefficient α
i, k
is different to zero and equal to unity while the rest of the parameters take a value of zero. However, it is also possible to let each coefficient α
i, k
take a value running from zero to unity. Then, we can interpret each one as a membership function of the combined estimated vector
k
to the corresponding estimation algorithm with vector
. According to a recent work (Alonso Quesada et al. 2004), the following membership function is proposed in order to clarify the interpretation:
This approach to multi-estimation schemes has been used in robotics (Ibeas and de la Sen Citation2004) to define a combined estimated plant parameter vector. In addition, combining different estimation algorithms to obtain a new one to parameterize the adaptive controller has been broadly used in the multi-model adaptive control literature (see, for instance, Cezaylirli and Ciliz [2004]), where the coefficients of the linear combination were updated by using a metric-based approach. Nevertheless, it has never been pointed out the relationship between this and the fuzzy logic/fuzzy set theory. Therefore, it may be possible to define fuzzy rules for determining online the values of the coefficients of the linear combination weights in order to achieve a predefined closed-loop behavior, opening a wide variety of applications of the fuzzy logic to multi-model control.
Simulation Example
The following simulations comparing the single model-based and the fuzzy logic-based multi-estimation schemes show the usefulness of the proposed scheme. The plant, the input signal, and the performance index used in (15) are the same as in the ANN example. The estimation algorithm is of standard least squares type for all the estimation algorithms. The residence time is two samples. There are five estimators running in parallel and initialized by:
It is apparent from Figures that the fuzzy logic-based combined multi-estimation scheme improves the transient response of the adaptive system. The transient over peak is reduced due to switching to a more convenient parameterization of the adaptive controller through the convex linear combination of the estimators of the multi-estimation scheme, according to the updating rule (Equation15). The transient response improvement is achieved once the residence number of samples constraint is fulfilled since the updating rule is not allowed to modify the weight values until then.
CONCLUSIONS
In this paper, an artificial intelligence representation of multi-estimation-based controllers has been developed. A graph theory-based representation of the switching process between the different estimation schemes is introduced. This representation allows to describe in a formal way the structure and workings of the switching process, providing a concise description of the switching logic. A neural network interpretation of multi-model based-controllers has been given, while a method for generating multi-model-based artificial neural network controllers from previously designed single model ones has been proposed. The ANN description of the overall system allows to implement the complete scheme using ANN technology. Also, a coupled estimation algorithm has been proposed in order to take opportunity of the information contained in the set of estimators running in parallel. The estimator selected to parameterize the adaptive controller guides the evolution of the remaining ones to its location. Moreover, the genetic algorithm and fuzzy logic-based approaches have been given to represent multi-estimation-based schemes. These artificial intelligence techniques have suggested new ideas to be incorporated into the classic multi-model controllers. The genetic algorithm approach has inspired a robustness test over the set of estimation algorithms which allows the improvement of the workings of the multi-estimation scheme in noisy environments. Also, the fuzzy logic approach suggests a way to combine the estimation algorithms in order to build an estimated parameter vector which parameterizes the adaptive controller. Some simulation examples show the usefulness of the proposed multi-estimation structures for improving the transient response of adaptive systems.
Acknowledgments
The authors are grateful to MEC and to the UPV-EHU by its partial support of this work through projects DPI 2003-00164, UPV 9/UPV 00I06.I06-15263/2003. Also, the authors are grateful to reviewers who have helped to improve the final version of this manuscript and to Professor J. M. Tarela whose comments about genetic algorithms helped to improve the paper. A. I. is very grateful to MECD by its support of this work through the FPU grant P2002-2727.
REFERENCES
- Abdennour , A. 2004 . Multi-objetive optimal tuning of power plant controls using genetic algorithms . Control and Intelligent Systems 32 ( 2 ): 80 – 88 . [CSA]
- Alonso-Quesada , S. , M. De la Sen , A. Bilbao-Guillerna , and A. Ibeas . 2004. A semiempirical identification method by using a multiestimation technique via reduced-order nominal models. In: Proceedings of the 2004 American Control Conference, pages 4372–4377.
- Aström , K. J. and B. Wittenmark , 1995 . Adaptive Control, Addison-Wesley Publishing Company. , 2nd Edition .
- Beyers , H. G. 1998 . The Theory of Evolution Strategies . Berlin: Springer-Verlag .
- Cezayirli , A. and K. Ciliz . Multiple model based adaptive control of a DC motor under load changes , Proc. Of the International Conference on Mechatronics 2004 , Istanbul, 1–3, June, 2004 .
- Chang , M. H. and E. J. Davison . 1999 . Adaptive switching control of LTI MIMO systems using a family of controllers approach . Automatica 35 : 453 – 465 . [CSA] [CROSSREF]
- Chartrand , G. and L. Lesniak 1979 . Graphs and Digraphs, , Second Edition . Pacific Grove , CA : The Wadsworth and Brooks/Cole .
- Chen , L. and K. S. Narendra . 2001 . Nonlinear adaptive control using neural networks and multiple models . Automatica 37 ( 8 ): 1245 – 1255 . [CSA] [CROSSREF]
- Ruan , D. , ed. 1997 . Intelligent Hybrid Systems, Fuzzy Logic, Neural Networks and Genetic Algorithms . Boston : Kluwer .
- de la Sen , M. and A. Almansa . 2002 . Adaptive stable control of manipulators with improved adaptation transients by using on-line supervision of the free parameters of the adaptation algorithm and sampling rate . Informatica Lithuanian Academy of Sciences 13 ( 3 ): 345 – 368 . [CSA]
- de la Sen , M. , S. Alonso-Quesada , A. Bilbao-Guillerna , and A. J. Garrido . 2003 . A supervised multiestimation scheme for discrete adaptive control which guarantees robust closed-loop stability . Int. Math. J. 3 ( 9 ): 907 – 937 . [CSA]
- de la Sen , M. , J. J. Miñambres , A. J. Garrido , A. Almansa , and J. C. Soto . 2004 . Basic theoretical results for expert systems. Application to the supervision of adaptation transients in planar robots . Artificial Intelligence 152 ( 2 ): 173 – 211 . [CSA] [CROSSREF]
- Diestel , R. 2000 . Graph Theory. , Second Edition . New York : Springer-Verlag .
- Etxebarria , V. 1994 . Adaptive control of discrete systems using neural networks . IEE Proc. Contr. Theory and Appl. Part D. 141 ( 4 ): 209 – 215 . [CSA] [CROSSREF]
- Etxebarria , V. and M. De La Sen . 1996 . An approach to adaptive neural control of robot manipulators . International Journal of System Science 27 ( 11 ): 1143 – 1152 . [CSA]
- Fausett , L. V. 1998 . Fundamentals of Neural Networks . Englewood Cliffs , NJ : Prentice Hall .
- Feng , G. 2004 . Stability Analysis of discrete-time fuzzy dynamic systems based on piecewise Lyapunov functions . IEEE Transactions on Fuzzy Systems 12 ( 1 ): 22 – 28 . [CSA] [CROSSREF]
- Gregorcic , G. , A. Mullane , and G. Lightbody . 2001 . Simulink implementation of adaptive control and multiple model network control . In: Proceedings of the Nordic Matlab Conference , pages 167 – 173 , October .
- Hocherman-Frommer , J. , S. R. Kulkarni , and P. J. Ramadge . 1998 . Controller switching based on output prediction errors . IEEE Transactions on Automatic Control 43 ( 5 ): 596 – 607 . [CSA] [CROSSREF]
- Ibeas , A. and M. de la Sen . 2004 . Intelligent control of robotic manipulators based on a multiestimation scheme . In Proceedings of the IASTED International Conference Artificial Intelligence and Applications , pages 694 – 699 .
- Ibeas , A. , M. de la Sen , and S. Alonso-Quesada . 2003 . Adaptive stabilization of discrete linear systems via a multiestimation scheme . Int. Math. J. 3 ( 4 ): 381 – 407 . [CSA]
- Ibeas , A. , M. de la Sen , and S. Alonso-Quesada . 2004 . Intelligent control of discrete linear systems based on a supervised adaptive multiestimation scheme . Journal of Intelligent and Robotic Systems 40 ( 4 ): 359 – 411 . [CSA] [CROSSREF]
- Jamshidi , M. , L. dos Santos Coelho , R. A. Krohling , and P. J. Flemming . 2003 . Robust Control Systems with Genetic Algorithms . Boca Raton , FL : CRC Press .
- Liutkevicius , R. 2003 . Coupled adaptive fuzzy control of nonlinear, time-varying plant . Informatica Lithuanian Academy of Sciences 14 ( 3 ): 323 – 336 . [CSA]
- Melin , P. and O. Castillo . 2003 . Adaptive intelligent control of aircraft systems with a hybrid approach combining neural networks, fuzzy logic and fractal theory . Applied Soft Computing 3 ( 4 ): 353 – 362 . [CSA] [CROSSREF]
- Mosca , E. and T. Agnoloni . 2001 . Inference of candidate loop perfomance and data filtering for switching supervisory control . Automatica 37 : 527 – 534 . [CSA] [CROSSREF]
- Mwembeshi , M. M. , C. A. Kent , and S. Salhi . 2004 . A genetic algorithm based approach to intelligent modelling and control of pH in reactors . Computers & Chemical Engineering 28 ( 9 ): 1743 – 1757 . [INFOTRIEVE] [CSA] [CROSSREF]
- Narendra , K. S. and J. Balakrishnan . 1994. Improving transient response of adaptive control systems using multiple models and switching. IEEE Transactions on Automatic Control 39(9):1861–1866. [CSA] [CROSSREF]
- Narendra , K. S. and J. Balakrishnan . 1997 . Adaptive control using multiple models . IEEE Transactions on Automatic Control 42 ( 2 ): 171 – 187 . [CSA] [CROSSREF]
- Sun , Z. 2004 . A robust stabilizing law for switched linear systems . Int. J. Control 77 ( 4 ): 389 – 398 . [CSA] [CROSSREF]
- Sun , Z. and S. S. Ge . 2003 . Dynamic output feedback stabilization of a class of switched linear systems . IEEE Trans. On Circuits and Systems 50 ( 8 ): 1111 – 1115 . [CSA] [CROSSREF]
- Tilli , T. 1992 . Fuzzy Logic . Munich : Franzis-Verlag .
- VanDoren V. , ed. 2003 . Techniques for Adaptive Control . Amsterdam : Elsevier Science .
- Wang , J. and M. Qing . 2004 . An output tracking control strategy for unknown nonlinear systems . Applied Artificial Intelligence 18 ( 1 ): 1 – 16 . [CSA]
- Wetter , M. and J. Wright . 2004 . A comparison of deterministic and probabilistic optimization algorithms for nonsmooth simulation-based optimization . Building and Environment 39 ( 8 ): 989 – 999 . [CSA] [CROSSREF]