Full article: Enhancing feedforward controller tuning via instrumental variables: with application to nanopositioning

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

Feedforward control enables high performance of a motion system. Recently, algorithms have been proposed that eliminate bias errors in tuning the parameters of a feedforward controller. The aim of this paper is to develop a new algorithm that combines unbiased parameter estimates with optimal accuracy in terms of variance. A simulation study is presented to illustrate the poor accuracy properties of pre-existing algorithms compared to the proposed approach. Experimental results obtained on an industrial nanopositioning system confirm the practical relevance of the proposed method.

KEYWORDS:

1. Introduction

Challenging requirements on positioning accuracy often necessitate the use of feedforward control for motion systems, since feedforward can effectively compensate for the error induced by known, repeating disturbances. Examples include atomic force microscopes (Butterworth, Pao, & Abramovitch, Citation2012; Clayton, Tien, Leang, Zou, & Devasia, Citation2009; Kara-Mohamed, Heath, & Lanzon, Citation2015), robotics (Khalil & Dombre, Citation2002, Chapter 14) and wafer scanners (Mishra, Coaplen, & Tomizuka, Citation2007; Oomen et al., Citation2014; van der Meulen, Tousain, & Bosgra, Citation2008). Traditional approaches that can potentially achieve these requirements on positioning accuracy include iterative learning control (ILC) (Bristow, Tharayil, & Alleyne, Citation2006; Gorinevsky, Citation2002) and model-based feedforward (Butterworth et al., Citation2012; Zhong, Pao, & de Callafon, Citation2012).

ILC algorithms update the feedforward signal by learning from previous tasks under the assumption that the task is repetitive. ILC consequently enables superior performance with respect to model-based feedforward for a specific task by compensating for all repetitive disturbances. However, changes in the reference signal typically result in significant performance deterioration (see, e.g. Hoelzle, Johnson, & Alleyne, Citation2014). Motion systems are typically confronted with similar yet slightly different reference signals (Lambrechts, Boerlage, & Steinbuch, Citation2005; Oomen et al., Citation2014). In contrast to ILC, model-based feedforward results in moderate performance for a class of reference signals instead of only one specific reference (Butterworth et al., Citation2012). Note that the performance for model-based feedforward is highly dependent on the model quality of the parametric model of the system and the accuracy of model-inversion (Devasia, Citation2002).

By introducing basis functions in ILC, the advantages of model-based feedforward and ILC are combined in van de Wijdeven and Bosgra (Citation2010). This approach is further improved in van der Meulen et al. (Citation2008), where the need for an approximate model of the system, as is common in ILC, is eliminated by exploiting results from iterative feedback tuning (Bazanella, Campestrini, & Eckhard, Citation2012). The iterative feedforward control approach in van der Meulen et al. (Citation2008) is extended to multivariable systems and input shaping in Heertjes, Hennekens, and Steinbuch (Citation2010) and Boeren, Bruijnen, van Dijk, and Oomen (Citation2014), respectively. However, in Boeren, Oomen, and Steinbuch (Citation2015), it is shown that the least-squares algorithm used in van der Meulen et al. (Citation2008) can lead to a bias error in the estimated parameters. A new algorithm is proposed in Boeren, Oomen, et al. (Citation2015) based on instrumental variable (IV) techniques that results in unbiased parameter estimates. However, accuracy in terms of variance of the estimate has not yet been investigated.

Although iterative feedforward control based on instrumental variables is promising for motion control, existing approaches suffer from poor accuracy properties in terms of variance. This severely limits the practical applicability of existing approaches in case of noisy signals. This paper aims to reveal non-optimal accuracy for the approaches in Boeren, Oomen, et al. (Citation2015) and van der Meulen et al. (Citation2008), and develop an algorithm that leads to optimal accuracy in the presence of noise.

The contributions of this paper are fourfold. First, an analysis is provided to show that the approaches in Boeren, Oomen, et al. (Citation2015) and van der Meulen et al. (Citation2008) lead to poor accuracy in terms of variance. Therefore, variance results developed in open-loop identification (Söderström & Stoica, Citation1983, Chapters 5 and 6) and closed-loop identification (Forssell, Citation1999; Gilson & Van den Hof, Citation2005) are extended towards iterative feedforward control. As a second contribution, this insight in the accuracy aspects is exploited to develop an algorithm that achieves optimal accuracy. The proposed algorithm (1) exploits an iterative refined instrumental variable (RIV) method that is similar to the approaches presented in Young (Citation1976, Citation2015), Young and Jakeman (Citation1979), Jakeman and Young (Citation1979), and Young and Jakeman (Citation1980), and (2) is closely connected to the estimation of inverse systems (see, e.g. Jung & Enqvist, Citation2013). Third, a simulation study is presented to (1) illustrate that the proposed algorithm leads to enhanced accuracy properties compared to pre-existing approaches and (2) confirm the significance of the accuracy of parameter estimates on the achievable performance. As a final contribution, experimental results confirm the practical relevance of the proposed algorithm. This paper significantly extends earlier results reported in Boeren, Bruijnen, and Oomen (Citation2014) and Boeren, Oomen, and Steinbuch (Citation2014) by thorough proofs and extended experimental results. Related data-driven tuning algorithms are available in Formentin, van Heusden, and Karimi (Citation2013b), Karimi, Butcher, and Longchamp (Citation2008), and Kim and Zou (Citation2013). Furthermore, instrumental variable approaches are often used for estimating the parameters of industrial robots (see, e.g. Janot, Vandanjon, & Gautier, Citation2014a, Citation2014b; Puthenpura & Sinha, Citation1986; Yoshida, Ikeda, & Mayeda, Citation1992).

This paper is organised as follows. In Section 2, the problem formulation is outlined. In Section 3, asymptotic expressions for optimal accuracy are developed for feedforward control. To provide a concise presentation of the contributions of this paper, the second contribution is presented before the first contribution is explained. That is, a new tuning algorithm for iterative feedforward control is proposed in Section 4. Then, in Section 5, the accuracy properties of existing approaches are analysed. In Section 6, a simulation study of a motion system is presented to compare the proposed and pre-existing approaches. In Section 7, the theoretical results are confirmed by experiments on an industrial nanopositioning system. Finally, conclusions are drawn in Section 8.

Notation:

The variable q denotes the forward shift operator qu(t) = u(t + 1). For a vector x, ||x||²_W = x^TWx. A positive-definite matrix A is denoted as A ≻ 0. Also, A − B ≻ 0 is denoted as A ≻ B. A positive-semidefinite matrix A is denoted as A ⪰ 0. Let $R [q]$ denotes the real polynomials in q. Also, $E (x) = \int_{- \infty}^{\infty} x f (x) d x,$ with probability density function f (x), $\overline{E} (x) = {lim}_{N \to \infty} \frac{1}{N} \sum_{t = 1}^{N} E (x),$ where N is the number of samples.

2. Problem definition

2.1 Problem setup

Consider the two degree-of-freedom control configuration as depicted in . The true unknown system P(q) is assumed to be discrete-time, single-input single-output, and linear time-invariant, with rational representation $\begin{matrix} P (q) = \frac{B_{0} (q)}{A_{0} (q)}, \end{matrix}$ where $B_{0} (q), A_{0} (q) \in R [q]$ . The control configuration consists of a given stabilising feedback controller C_fb(q), and a feedforward controller C^j_ff(q). The index j denotes the jth task in a sequence of finite time tasks of length N samples, where j = 0, 1, ..., M. Furthermore, T_s denotes the sampling time.

Figure 1. Two degree-of-freedom control configuration.

Let r denote the reference signal. Typically, r is designed as a known nth-order multi-segment polynomial trajectory with constraints on the first n derivatives, as in, e.g. Biagiotti and Melchiorri (Citation2012) and Lambrechts et al. (Citation2005). Also, w^j(t) = H(q)ε^j(t) denotes an unknown disturbance, where H(q) is a monic, asymptotically stable, proper system, and {ε^j(t)} is normally distributed white noise with zero mean and variance λ²_ϵ. Hence, w^j and r are uncorrelated. The feedforward signal is denoted by u^j_ff, while the measured signals e^j_m and y^j_m in the jth task are given by (1) $\begin{matrix} \begin{matrix} e_{m}^{j} (t) = e_{r}^{j} (t) - e_{w}^{j} (t), \\ y_{m}^{j} (t) = y_{r}^{j} (t) + y_{w}^{j} (t), \end{matrix} \end{matrix}$ (1) with (2) $\begin{matrix} \begin{matrix} e_{r}^{j} (t) = S (q) (1 - P (q) C_{f f}^{j} (q)) r (t), \\ e_{w}^{j} = S (q) w^{j} (t), \\ y_{r}^{j} (t) = S (q) P (q) (C_{f b} (q) + C_{f f}^{j} (q)) r (t), \\ y_{w}^{j} = S (q) w^{j} (t), \end{matrix} \end{matrix}$ (2) and sensitivity function S(q) = (1 + P(q)C_fb(q))⁻¹. Since P(q) is assumed to be unknown and w^j is an unknown disturbance, it is not possible to determine e^j_r and e^j_w (resp. y^j_r and y^j_w) based on the measured signal e^j_m (resp. y^j_m).

2.2 Iterative feedforward: batch-wise tuning

In iterative feedforward control, the measured signals e^j_m(t) and y^j_m(t), for t = 1, ..., N, are stored. Hence, the data set that is used for the estimation of the feedforward controller is given by $\begin{matrix} e_{m}^{j} = [e_{m}^{j} (1), e_{m}^{j} (2), \dots, e_{m}^{j} (N)], \\ y_{m}^{j} = [y_{m}^{j} (1), y_{m}^{j} (2), \dots, y_{m}^{j} (N)] . \end{matrix}$ After the jth task is finished, this batch of measured data is used to perform an offline update of the existing feedforward controller C^j_ff(q), i.e. $\begin{matrix} C_{f f}^{j + 1} (q) = C_{f f}^{j} (q) + C_{f f}^{Δ} (q), \end{matrix}$ before initiating the (j + 1)th task.

To establish the main ideas in this paper and provide a fair comparison, the feedforward controller C^{j + 1}_ff(q) is parametrised similar to Lambrechts et al. (Citation2005), van der Meulen et al. (Citation2008), Heertjes et al. (Citation2010), and Boeren, Bruijnen, van Dijk, et al. (Citation2014) as (3) $\begin{matrix} C_{f f}^{j + 1} (q, θ^{j + 1}) & = C_{f f}^{j} (q, θ^{j}) + C_{f f}^{Δ} (q, θ^{Δ}) \\ = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) (θ_{i}^{j} + θ_{i}^{Δ}), \end{matrix}$ (3) where $θ_{i}^{j + 1} = θ_{i}^{j} + θ_{i}^{Δ}$ , and ψ_i(q⁻¹) are basis functions. The update C^Δ_ff(q, θ^Δ) is given by (4) $\begin{matrix} C_{f f}^{Δ} (q, θ^{Δ}) & = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) θ_{i}^{Δ} = Ψ (q) θ^{Δ}, \end{matrix}$ (4) with parameters $θ^{Δ} = {[θ_{1}^{Δ} θ_{2}^{Δ} \dots θ_{n_{θ}}^{Δ}]}^{T} \in R^{n_{θ}},$ and polynomial basis functions $\begin{matrix} Ψ (q) = [ψ_{1} (q^{- 1}) ψ_{2} (q^{- 1}) \dots ψ_{n_{θ}} (q^{- 1})] . \end{matrix}$ To illustrate a typical selection of the parameter vector $θ^{Δ}$ and basis functions Ψ(q), consider the following example that is aimed at feedforward control for motion systems.

Example 2.1:

Let C^j_ff(q) = 0, i.e. only a feedback controller C_fb(q) is used in task j, and let r(t) be a fourth-order reference trajectory. A typical parametrisation of $C_{f f}^{Δ} (q, θ^{Δ})$ is given by (5) $\begin{matrix} C_{f f}^{Δ} (q, θ^{Δ}) = ψ_{a} (q^{- 1}) θ_{a}^{j} + ψ_{s} (q^{- 1}) θ_{s}^{j}, \end{matrix}$ (5) with basis functions $\begin{matrix} ψ_{a} (q^{- 1}) = {(\frac{1 - q^{- 1}}{T_{s}})}^{2}, ψ_{s} (q^{- 1}) = {(\frac{1 - q^{- 1}}{T_{s}})}^{4}, \end{matrix}$ parameters θ^j = [θ^j_a, θ_s^j]^T, and sampling time T_s. The parametrisation in (Equation5(5) $\begin{matrix} C_{f f}^{Δ} (q, θ^{Δ}) = ψ_{a} (q^{- 1}) θ_{a}^{j} + ψ_{s} (q^{- 1}) θ_{s}^{j}, \end{matrix}$ (5) ) consists of acceleration feedforward with acceleration a(t) = ψ_a(q⁻¹)r(t), i.e., the second derivative of r(t), and snap feedforward with snap s(t) = ψ_s(q⁻¹)r(t), i.e. the fourth derivative of r(t). Furthermore, θ_a denotes the mass of the system, while θ_s denotes the snap parameter. As such, C^Δ_ff(q, θ^Δ) in (Equation5(5) $\begin{matrix} C_{f f}^{Δ} (q, θ^{Δ}) = ψ_{a} (q^{- 1}) θ_{a}^{j} + ψ_{s} (q^{- 1}) θ_{s}^{j}, \end{matrix}$ (5) ) can compensate for the dominant component of the reference-induced error (see, e.g. Lambrechts et al., Citation2005). Note that double (and fourth) differentiation is possible since r(t) is a deterministic and known signal. The corresponding feedforward signal u_ff(t) is given by $\begin{matrix} u_{f f}^{j + 1} (t) = C_{f f}^{Δ} (q, θ^{Δ}) r (t) = θ_{a} a (t) + θ_{s} s (t) . \end{matrix}$ The approach proposed in this paper aims to estimate the parameters θ_a and θ_s based on measured data.

The use of feedforward is a standard approach to obtain a small error signal e^j_m(t) when tracking a reference trajectory r(t) (see, e.g. Steinbuch & Norg, Citation1998). By subdividing e^j_m(t) into e^j_r(t) and e^j_w(t), as defined in (Equation2(2) $\begin{matrix} \begin{matrix} e_{r}^{j} (t) = S (q) (1 - P (q) C_{f f}^{j} (q)) r (t), \\ e_{w}^{j} = S (q) w^{j} (t), \\ y_{r}^{j} (t) = S (q) P (q) (C_{f b} (q) + C_{f f}^{j} (q)) r (t), \\ y_{w}^{j} = S (q) w^{j} (t), \end{matrix} \end{matrix}$ (2) ), it is immediately clear that C^{j + 1}_ff(q, θ^{j + 1}) has no influence on e^j_w(t). Indeed, the goal of feedforward control is to determine a C^{j + 1}_ff(q, θ^{j + 1}) that minimises the reference-induced error e^{j + 1}_r(t, θ^{j + 1}) for t = 1, ..., N in a suitable sense. Given the definition of C^{j + 1}_ff(q, θ^{j + 1}) in (Equation3(3) $\begin{matrix} C_{f f}^{j + 1} (q, θ^{j + 1}) & = C_{f f}^{j} (q, θ^{j}) + C_{f f}^{Δ} (q, θ^{Δ}) \\ = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) (θ_{i}^{j} + θ_{i}^{Δ}), \end{matrix}$ (3) ), the aim of this paper is to determine θ^{j + 1} such that e^{j + 1}_r(t, θ^{j + 1}) is as small as possible. It directly follows from (Equation2(2) $\begin{matrix} \begin{matrix} e_{r}^{j} (t) = S (q) (1 - P (q) C_{f f}^{j} (q)) r (t), \\ e_{w}^{j} = S (q) w^{j} (t), \\ y_{r}^{j} (t) = S (q) P (q) (C_{f b} (q) + C_{f f}^{j} (q)) r (t), \\ y_{w}^{j} = S (q) w^{j} (t), \end{matrix} \end{matrix}$ (2) ) that (6) $\begin{matrix} e_{r}^{j + 1} (t, θ^{j + 1}) = S (q) (1 - P (q) C_{f f}^{j + 1} (q, θ^{j + 1})) r (t), \end{matrix}$ (6) and e^{j + 1}_r(t, θ^{j + 1}) = 0 for all t if C^{j + 1}_ff(q, θ^{j + 1}) = P^{− 1}(q). However, since P(q) is assumed to be unknown, it is not possible to determine either P⁻¹(q), or determine e^{j + 1}_r(t, θ^{j + 1}) before initiating the (j + 1)th task. Instead, the measured signal e^j_m(t) in the jth task, contaminated by w^j(t), is used in an optimisation problem to determine C^{j + 1}_ff(q, θ^{j + 1}). That is, the aim in iterative feedforward control is to determine the parameters ${\hat{θ}}^{Δ}$ in (Equation4(4) $\begin{matrix} C_{f f}^{Δ} (q, θ^{Δ}) & = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) θ_{i}^{Δ} = Ψ (q) θ^{Δ}, \end{matrix}$ (4) ), before starting the (j + 1)th task, from the optimisation problem (7) $\begin{matrix} {\hat{θ}}^{Δ} = arg min_{θ^{Δ}} V (θ^{Δ}), \end{matrix}$ (7) where the criterion V(θ^Δ) is based on the stored signals e^j_m(t) and y^j_m(t) for t = 1, ..., N, as measured in the jth task. Then, C^{j + 1}_ff(q, θ^{j + 1}) is determined according to (Equation3(3) $\begin{matrix} C_{f f}^{j + 1} (q, θ^{j + 1}) & = C_{f f}^{j} (q, θ^{j}) + C_{f f}^{Δ} (q, θ^{Δ}) \\ = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) (θ_{i}^{j} + θ_{i}^{Δ}), \end{matrix}$ (3) ). Next, two assumptions are introduced.

Assumption 2.1:

C_fb(q) is designed such that S(q)H(q) = 1, where the noise model is parametrised as $\begin{matrix} H (q) = \frac{D (q^{- 1})}{C (q^{- 1})} = \frac{1 + d_{1} q^{- 1} + \dots + d_{m} q^{- m}}{1 + c_{1} q^{- 1} + \dots + c_{m} q^{- m}} . \end{matrix}$

Assumption 2.2:

The true unknown system is given by $\begin{matrix} P_{0} = \frac{1}{A_{0} (q^{- 1})} = \frac{1}{1 + a_{1} q^{- 1} + \dots + a_{n} q^{- n}} . \end{matrix}$

Concerning Assumption 2.1, recall from (Equation6(6) $\begin{matrix} e_{r}^{j + 1} (t, θ^{j + 1}) = S (q) (1 - P (q) C_{f f}^{j + 1} (q, θ^{j + 1})) r (t), \end{matrix}$ (6) ) that C^j_ff(q, θ^j) aims to minimise e^j_r(t), i.e. the error induced by the known r(t). In the optimal case, it holds that e^j_r(t) = 0 for all t. In contrast, the main goal of the feedback controller C_fb(q) is to compensate for w^j(t) in view of minimising e^j_w(t). These disturbances are assumed to be stochastic with a certain spectrum. Typically, the dominant disturbances are in the low-frequency range, e.g. due to amplifier noise (Fleming, Citation2014), cable slab, commutation errors, or immersion water-flow in lithographic applications. Then, C_fb(q) is designed to compensate for these disturbances, in which case the optimal result is e^j_w(t) = ϵ^j(t), i.e. the error signal being white noise. Clearly, this corresponds to S(q)H(q) = 1. Similar approximations are used in the identification of robotics (see, e.g. Janot, Gautier, Jubien, & Vandanjon (Citation2014)). Assumption 2.1 can be achieved by, e.g. using traditional PID tuning, possibly with error-based retuning (van de Wal, van Baars, & Sperling, Citation2000), and common LQG control designs (Åström, Citation1970, Section 6.2). In a typical approach to design C_fb(q) such that S(q)H(q) = 1, e^j_m(t) is measured in an experiment where r(t) = 0 for all t. For this case, e^j_r(t) = 0 for all t, and consequently e^j_m(t) = e_w^j(t). Then, C_fb(q) is tuned until e^j_m(t) = ϵ^j(t), i.e. e^j_m(t) being white noise.

Concerning Assumption 2.2, the assumption that P(q) = 1/A₀(q) implies that there is a θ^{j + 1} such that C^{j + 1}_ff(q, θ^{j + 1}) = P^{− 1}(q). This result immediately follows by observing that P⁻¹(q) = A₀(q) and C^{j + 1}_ff(q, θ^{j + 1}) is restricted to a polynomial parametrisation as in (Equation3(3) $\begin{matrix} C_{f f}^{j + 1} (q, θ^{j + 1}) & = C_{f f}^{j} (q, θ^{j}) + C_{f f}^{Δ} (q, θ^{Δ}) \\ = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) (θ_{i}^{j} + θ_{i}^{Δ}), \end{matrix}$ (3) ). This assumption may appear as a stringent requirement on P(q). However, for a general class of motion systems as described in Lambrechts et al. (Citation2005), the reference signal r(t) has a dominant low-frequency signal content, and P⁻¹(q) can be accurately described by an acceleration-dependent and snap-dependent term (Boerlage, Tousain, & Steinbuch, Citation2004). That is, high-frequency dynamic aspects are concealed by the specific input design of r(t) (see also Hjalmarsson (Citation2009) for a further explanation of this aspect). These results will be corroborated by the simulation example in Section 6 and the experimental results in Section 7, where it is shown that the reference-induced contribution to the error signal can be almost completely compensated for by using C^{j + 1}_ff(q, θ^{j + 1}) as in (Equation3(3) $\begin{matrix} C_{f f}^{j + 1} (q, θ^{j + 1}) & = C_{f f}^{j} (q, θ^{j}) + C_{f f}^{Δ} (q, θ^{Δ}) \\ = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) (θ_{i}^{j} + θ_{i}^{Δ}), \end{matrix}$ (3) ) of low degree. The proposed approach can be extended by allowing a more general parametrisation for C^{j + 1}_ff(q, θ^{j + 1}), as in, e.g. Boeren, Blanken, Bruijnen, and Oomen (Citation2015) and Boeren, Bruijnen, van Dijk, et al. (Citation2014) if a reference is needed with a significant high-frequency signal content.

Throughout, measured data from a single task is used to determine ${\hat{θ}}^{Δ}$ according to (Equation7(7) $\begin{matrix} {\hat{θ}}^{Δ} = arg min_{θ^{Δ}} V (θ^{Δ}), \end{matrix}$ (7) ). This approach is pursued since it can effectively handle slow variations that are typically present in a motion system, for example, wear, by means of continuous adaptation of the feedforward parameters. Note that the presented approach can be directly extended to exploit data from multiple tasks (e.g. as in Gunnarsson and Norrlöf (Citation2001, Citation2006) and Kushner and Yin (Citation2003)).

3. Optimal feedforward based on instrumental variables

3.1 Iterative feedforward control

Based on the known r(t) and measured e^j_m(t) and y^j_m(t) in task j, the predicted error ${\hat{ϵ}}^{j + 1} (t, θ^{Δ})$ in task j + 1 can be determined as (see ): (8) $\begin{matrix} {\hat{ϵ}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - S (q) P (q) C_{f f}^{Δ} (q, θ^{Δ}) r (t) . \end{matrix}$ (8) In the proposed iterative feedforward control approach, the parameters θ^Δ should be estimated directly from measured data as is done in Hjalmarsson, Gevers, Gunnarsson, and Lequin (Citation1998), i.e. without estimating a model of P(q). As such, (Equation8(8) $\begin{matrix} {\hat{ϵ}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - S (q) P (q) C_{f f}^{Δ} (q, θ^{Δ}) r (t) . \end{matrix}$ (8) ) cannot be directly used to determine θ^Δ. Instead, the following estimate of ${\hat{ϵ}}^{j + 1} (t, θ^{Δ})$ is used (9) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - C_{f f}^{Δ} (q, θ^{Δ}) C^{- 1} y_{m}^{j} (t), \end{matrix}$ (9) where C = (C_fb(q) + C^j_ff(q)). To show that (Equation9(9) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - C_{f f}^{Δ} (q, θ^{Δ}) C^{- 1} y_{m}^{j} (t), \end{matrix}$ (9) ) is a suitable estimate of (Equation8(8) $\begin{matrix} {\hat{ϵ}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - S (q) P (q) C_{f f}^{Δ} (q, θ^{Δ}) r (t) . \end{matrix}$ (8) ), note that the commutative property of SISO systems enables rewriting y^j_m(t) in (Equation1(1) $\begin{matrix} \begin{matrix} e_{m}^{j} (t) = e_{r}^{j} (t) - e_{w}^{j} (t), \\ y_{m}^{j} (t) = y_{r}^{j} (t) + y_{w}^{j} (t), \end{matrix} \end{matrix}$ (1) ) as y^j_m(t) = (C_fb(q) + C^j_ff(q))S(q)P(q)r(t) + S(q)w^j(t). Rearranging terms leads to (10) $\begin{matrix} C^{- 1} (q) y_{m}^{j} (t) = S (q) P (q) r (t) + C^{- 1} (q) S (q) w^{j} (t) . \end{matrix}$ (10) Clearly, ${\hat{e}}^{j + 1} (t, θ^{Δ}) = {\hat{ϵ}}^{j + 1} (t, θ^{Δ})$ if w^j(t) is equal to zero. Moreover, by taking the expectation of (Equation10(10) $\begin{matrix} C^{- 1} (q) y_{m}^{j} (t) = S (q) P (q) r (t) + C^{- 1} (q) S (q) w^{j} (t) . \end{matrix}$ (10) ), it follows that (11) $\begin{matrix} E \{C^{- 1} (q) y_{m}^{j} (t)\} = E & {S (q) P (q) r (t) + C^{- 1} (q) S (q) w^{j} (t)} . \end{matrix}$ (11) By noting that r(t) is deterministic, it follows that $E S (q) P (q) r (t) = S (q) P (q) r (t)$ . Furthermore, for w^j(t) as defined in Section 2.1, it is immediately clear that $E C^{- 1} (q) S (q) w^{j} (t) = 0$ (see, e.g. Söderström, Citation2002, Lemma 4.1). By combining these results, (Equation11(11) $\begin{matrix} E \{C^{- 1} (q) y_{m}^{j} (t)\} = E & {S (q) P (q) r (t) + C^{- 1} (q) S (q) w^{j} (t)} . \end{matrix}$ (11) ) implies that (Equation10(10) $\begin{matrix} C^{- 1} (q) y_{m}^{j} (t) = S (q) P (q) r (t) + C^{- 1} (q) S (q) w^{j} (t) . \end{matrix}$ (10) ) is an unbiased estimator of S(q)P(q)r(t).

Figure 2. The update $C_{f f}^{Δ} (q, θ^{Δ})$ is determined based on the known r(t), and measured e^j_m(t) and y^j_m(t) in task j.

In the remainder of this paper, (Equation9(9) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - C_{f f}^{Δ} (q, θ^{Δ}) C^{- 1} y_{m}^{j} (t), \end{matrix}$ (9) ) is used as the predicted error in task j + 1. Substituting the parametrisation defined in (Equation4(4) $\begin{matrix} C_{f f}^{Δ} (q, θ^{Δ}) & = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) θ_{i}^{Δ} = Ψ (q) θ^{Δ}, \end{matrix}$ (4) ) into (Equation9(9) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - C_{f f}^{Δ} (q, θ^{Δ}) C^{- 1} y_{m}^{j} (t), \end{matrix}$ (9) ) and rearranging terms leads to the following estimation equation: (12) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (12) where (13) $\begin{matrix} ϕ^{j} (t) = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{m}^{j} (t) \in R^{n_{θ}} . \end{matrix}$ (13) Note that (Equation12(12) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (12) ) is linear in the parameters $θ^{Δ}$ .

Finally, the optimal parameters, denoted as $θ_{0}^{Δ}$ , are defined. By subdividing e^j_m(t) in (Equation12(12) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (12) ) into e^j_r(t) and e^j_w(t), as defined in (Equation2(2) $\begin{matrix} \begin{matrix} e_{r}^{j} (t) = S (q) (1 - P (q) C_{f f}^{j} (q)) r (t), \\ e_{w}^{j} = S (q) w^{j} (t), \\ y_{r}^{j} (t) = S (q) P (q) (C_{f b} (q) + C_{f f}^{j} (q)) r (t), \\ y_{w}^{j} = S (q) w^{j} (t), \end{matrix} \end{matrix}$ (2) ), and rearranging terms, ${\hat{e}}^{j + 1} (t, θ^{Δ})$ in (Equation12(12) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (12) ) is given by $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = [e_{r}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}] - e_{w}^{j} (t) . \end{matrix}$ Recall from Section 2.2 that the goal of feedforward control is to minimise the reference-induced error e^{j + 1}_r(t, θ^{j + 1}). Given the definition of e^j_r(t) in (Equation2(2) $\begin{matrix} \begin{matrix} e_{r}^{j} (t) = S (q) (1 - P (q) C_{f f}^{j} (q)) r (t), \\ e_{w}^{j} = S (q) w^{j} (t), \\ y_{r}^{j} (t) = S (q) P (q) (C_{f b} (q) + C_{f f}^{j} (q)) r (t), \\ y_{w}^{j} = S (q) w^{j} (t), \end{matrix} \end{matrix}$ (2) ) together with the parametrisation for C^{j + 1}_ff(q, θ^{j + 1}) in (Equation3(3) $\begin{matrix} C_{f f}^{j + 1} (q, θ^{j + 1}) & = C_{f f}^{j} (q, θ^{j}) + C_{f f}^{Δ} (q, θ^{Δ}) \\ = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) (θ_{i}^{j} + θ_{i}^{Δ}), \end{matrix}$ (3) ), it directly follows that the reference-induced error contribution in (Equation12(12) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (12) ) can be expressed as (14) $\begin{matrix} {\hat{e}}_{r}^{j + 1} (t, θ^{Δ}) = e_{r}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (14) Then, the optimal parameters $θ_{0}^{Δ}$ are defined such that (Equation14(14) $\begin{matrix} {\hat{e}}_{r}^{j + 1} (t, θ^{Δ}) = e_{r}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (14) ) is equal to zero for all t, i.e. ${\hat{e}}_{r}^{j + 1} (t, θ_{0}^{Δ}) = 0,$ which directly implies that e^j_r(t) = (ϕ^j(t))^Tθ₀^Δ, and consequently that (15) $\begin{matrix} e_{m}^{j} (t) = {(ϕ^{j} (t))}^{T} θ_{0}^{Δ} - e_{w}^{j} (t) . \end{matrix}$ (15)

Note that this definition of θ^Δ₀ implies that C^{j + 1}_ff(q, θ₀) = P^{− 1}(q), where θ₀ = θ^j + θ^Δ₀. This is in accordance with Assumption 2.2.

3.2 An instrumental variable approach to iterative feedforward control

A general framework is proposed in Boeren, Oomen, et al. (Citation2015) for iterative feedforward control based on instrumental variables (IV). The rationale is that unbiased estimates of ${\hat{θ}}^{Δ}$ are obtained without the need for a correct model of w^j, which is in contrast to least-squares estimation, including van der Meulen et al. (Citation2008). In IV-based approaches, V(θ^Δ) is typically selected as (16) $\begin{matrix} V (θ^{Δ}) = {||\frac{1}{N} \sum_{t = 1}^{N} z (t) L (q) {\hat{e}}^{j + 1} (t, θ^{Δ})||}_{W}^{2}, \end{matrix}$ (16) where $z (t) \in R^{n_{z}}$ are instrumental variables that are uncorrelated with w^j, W is a positive-definite weighting matrix, n_z ≥ n_θ, L(q) is a prefilter and ${\hat{e}}^{j + 1} (t, θ^{Δ})$ in (Equation12(12) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (12) ). Since r(t) is uncorrelated with w^j, the instrumental variables z(t) are in the remainder of this paper selected as a function of (derivatives of) r(t).

Since $V (θ^{Δ})$ is quadratic in θ^Δ, the minimiser ${\hat{θ}}^{Δ}$ of $V (θ^{Δ})$ follows from the necessary condition for optimality $\frac{\partial V (θ^{Δ})}{\partial θ^{Δ}} = 0$ since W is a positive-definite matrix. By substituting (Equation12(12) $\begin{matrix} {\hat{e}}^{j + 1} (t, θ^{Δ}) = e_{m}^{j} (t) - {(ϕ^{j} (t))}^{T} θ^{Δ}, \end{matrix}$ (12) ) in (Equation16(16) $\begin{matrix} V (θ^{Δ}) = {||\frac{1}{N} \sum_{t = 1}^{N} z (t) L (q) {\hat{e}}^{j + 1} (t, θ^{Δ})||}_{W}^{2}, \end{matrix}$ (16) ), it is straightforward to show that the minimiser ${\hat{θ}}^{Δ}$ of V(θ^Δ) in (Equation16(16) $\begin{matrix} V (θ^{Δ}) = {||\frac{1}{N} \sum_{t = 1}^{N} z (t) L (q) {\hat{e}}^{j + 1} (t, θ^{Δ})||}_{W}^{2}, \end{matrix}$ (16) ) is given by (17) $\begin{matrix} {\hat{θ}}^{Δ} = {({\hat{R}}_{z ϕ^{j}}^{T} W {\hat{R}}_{z ϕ^{j}})}^{- 1} {\hat{R}}_{z ϕ^{j}}^{T} W {\hat{R}}_{z e_{m}^{j}}, \end{matrix}$ (17) where ${\hat{R}}_{z ϕ^{j}} = \frac{1}{N} \sum_{t = 1}^{N} z (t) L (q) {(ϕ^{j} (t))}^{T}$ is nonsingular, and ${\hat{R}}_{z e_{m}^{j}} = \frac{1}{N} \sum_{t = 1}^{N} z (t) L (q) e_{m}^{j} (t)$ .

The key idea behind (Equation16(16) $\begin{matrix} V (θ^{Δ}) = {||\frac{1}{N} \sum_{t = 1}^{N} z (t) L (q) {\hat{e}}^{j + 1} (t, θ^{Δ})||}_{W}^{2}, \end{matrix}$ (16) ) is that high performance is obtained if z(t), consisting of (derivatives of) r(t), and ${\hat{e}}^{j + 1} (t, θ^{Δ})$ are uncorrelated. This concept is, in fact, very well known and finds its roots in traditional feedforward tuning techniques in control engineering, as illustrated in the following example.

Example 3.1:

Let C^j_ff(q) = 0, i.e. only a feedback controller is used in task j. Furthermore, $C_{f f}^{Δ} (q, θ^{Δ})$ is given by $\begin{matrix} C_{f f}^{Δ} (q, θ^{Δ}) = ψ_{a} (q^{- 1}) θ_{a}, \end{matrix}$ with basis function $\begin{matrix} ψ_{a} (q^{- 1}) = {(\frac{1 - q^{- 1}}{T_{s}})}^{2}, \end{matrix}$ and parameter θ_a. This parametrisation corresponds to acceleration feedforward with acceleration a(t) = ψ_a(q⁻¹)r(t), i.e. the second derivative of r(t), while θ_a denotes the mass of the system. Note that double differentiation is possible since r(t) is a known, noise-free signal. The instruments are selected as z(t) = a(t), while W = I, n_z = n_θ, and L(q) = I. For this specific case, (Equation16(16) $\begin{matrix} V (θ^{Δ}) = {||\frac{1}{N} \sum_{t = 1}^{N} z (t) L (q) {\hat{e}}^{j + 1} (t, θ^{Δ})||}_{W}^{2}, \end{matrix}$ (16) ) becomes (18) $\begin{matrix} V (θ_{a}) = {||\frac{1}{N} \sum_{t = 1}^{N} a (t) {\hat{e}}^{j + 1} (t, θ_{a})||}^{2}, \end{matrix}$ (18) where ${\hat{e}}^{j + 1} (t, θ_{a}) = e_{m}^{j} (t) - ψ_{a} (q^{- 1}) C^{- 1} y_{m}^{j} (t) θ_{a}$ . The aim of the IV approach is to determine θ_a such that a(t) and ${\hat{e}}^{j + 1} (t, θ_{a})$ are uncorrelated.

For the considered experimental setup in Section 7, the (normalised) a(t) of a typical r(t) is depicted in , together with ${\hat{e}}^{j + 1} (t, θ_{a})$ based on (1) θ_a = 0 (red), and (2) the minimiser ${\hat{θ}}_{a}$ of V(θ_a) in (Equation18(18) $\begin{matrix} V (θ_{a}) = {||\frac{1}{N} \sum_{t = 1}^{N} a (t) {\hat{e}}^{j + 1} (t, θ_{a})||}^{2}, \end{matrix}$ (18) ) (green). Clearly, high performance is obtained with ${\hat{θ}}_{a}$ , i.e. when a(t) and ${\hat{e}}^{j + 1} (t, θ_{a})$ are uncorrelated. In contrast, low performance is obtained for θ_a = 0, which corresponds to significant correlation between a(t) and ${\hat{e}}^{j + 1} (t, θ_{a})$ . This shows that the correlation between a(t) and ${\hat{e}}^{j + 1} (t, θ_{a})$ is a measure for the performance of a motion system.

Figure 3. Tuning of a feedforward controller for a motion system. Low performance in an instrumental variable framework: significant correlation between the acceleration a(t) (dashed black) and predicted error ${\hat{e}}^{j + 1} (t, θ_{a})$ (red) for θ_a = 0 (left). High performance in an instrumental variable framework: a(t) (dashed black) and ${\hat{e}}^{j + 1} (t, θ_{a})$ (green) are uncorrelated for the minimiser ${\hat{θ}}_{a}$ of the criterion V(θ_a) in (Equation18(18) $\begin{matrix} V (θ_{a}) = {||\frac{1}{N} \sum_{t = 1}^{N} a (t) {\hat{e}}^{j + 1} (t, θ_{a})||}^{2}, \end{matrix}$ (18) ) (right). (To view this figure in colour, please see the online version of the journal).

Next, the asymptotic covariance matrix P_IV is derived that is related to ${\hat{θ}}^{Δ}$ . Consider the asymptotic distribution of ${\hat{θ}}^{Δ}$ given by (19) $\begin{matrix} \sqrt{N} ({\hat{θ}}^{Δ} - θ_{0}^{Δ}) \overset{dist}{\to} N (0, P_{I V}), \end{matrix}$ (19) where θ^Δ₀ is the asymptotic parameter estimate as defined in (Equation15(15) $\begin{matrix} e_{m}^{j} (t) = {(ϕ^{j} (t))}^{T} θ_{0}^{Δ} - e_{w}^{j} (t) . \end{matrix}$ (15) ). It can be shown that ${\hat{θ}}^{Δ}$ is a consistent estimator, i.e. ${\hat{θ}}^{Δ}$ converges to θ^Δ₀ for N to infinity, along similar lines as in, e.g. Söderström, Stoica, and Trulsson (Citation1987). Then, the asymptotic covariance matrix P_IV in (Equation19(19) $\begin{matrix} \sqrt{N} ({\hat{θ}}^{Δ} - θ_{0}^{Δ}) \overset{dist}{\to} N (0, P_{I V}), \end{matrix}$ (19) ) is given by (20) $\begin{matrix} P_{I V} = {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- 1} R_{z ϕ^{j}}^{T} W J W^{T} R_{z ϕ^{j}} {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- T}, \end{matrix}$ (20) with (21) $\begin{matrix} \begin{matrix} R_{z ϕ^{j}} & = \overline{E} z (t) L (q) {(ϕ_{r}^{j} (t))}^{T}, \\ J & = λ_{ε}^{2} \overline{E} [L (q) z (t)] {[L (q) z (t)]}^{T}, \end{matrix} \end{matrix}$ (21) and reference-induced part ϕ^j_r(t) of ϕ^j(t) in (Equation13(13) $\begin{matrix} ϕ^{j} (t) = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{m}^{j} (t) \in R^{n_{θ}} . \end{matrix}$ (13) ) given by (22) $\begin{matrix} ϕ_{r}^{j} (t) = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{r}^{j} (t), \end{matrix}$ (22) with y^j_r(t) as in (Equation2(2) $\begin{matrix} \begin{matrix} e_{r}^{j} (t) = S (q) (1 - P (q) C_{f f}^{j} (q)) r (t), \\ e_{w}^{j} = S (q) w^{j} (t), \\ y_{r}^{j} (t) = S (q) P (q) (C_{f b} (q) + C_{f f}^{j} (q)) r (t), \\ y_{w}^{j} = S (q) w^{j} (t), \end{matrix} \end{matrix}$ (2) ). A derivation of (Equation20(20) $\begin{matrix} P_{I V} = {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- 1} R_{z ϕ^{j}}^{T} W J W^{T} R_{z ϕ^{j}} {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- T}, \end{matrix}$ (20) ) for iterative feedforward control follows the proof derived in Söderström and Stoica (Citation1989, App. A8.1) for open-loop identification, and Söderström et al. (Citation1987, Section 3) for closed-loop identification.

Note that P_IV in (Equation20(20) $\begin{matrix} P_{I V} = {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- 1} R_{z ϕ^{j}}^{T} W J W^{T} R_{z ϕ^{j}} {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- T}, \end{matrix}$ (20) ) depends on the design of z(t) and L(q). Next, a lower bound is derived for P_IV as a function of z(t) and L(q), which corresponds to the minimum variance achievable with an IV method for feedforward control.

3.3 Optimal design procedure for z(t) and L(q)

The covariance matrix P_IV in (Equation20(20) $\begin{matrix} P_{I V} = {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- 1} R_{z ϕ^{j}}^{T} W J W^{T} R_{z ϕ^{j}} {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- T}, \end{matrix}$ (20) ) depends on the design of z(t) and L(q). Optimal accuracy in terms of variance is obtained if z(t) and L(q) are designed such that P_IV is equal to an optimal covariance matrix $P_{I V}^{opt}$ , where for any z(t) and L(q), it holds that $P_{I V} ⪰ P_{I V}^{opt} .$ The optimal covariance matrix $P_{I V}^{opt}$ for iterative feedforward control based on instrumental variables is given by (23) $\begin{matrix} P_{I V}^{opt} = λ_{ε}^{2} {\{\overline{E} ϕ_{r}^{j} (t) {(ϕ_{r}^{j} (t))}^{T}\}}^{- 1}, \end{matrix}$ (23) where $P_{I V}^{opt} ≻ 0$ . A derivation of $P_{I V}^{opt}$ follows along similar lines as the derivation for open-loop identification in Söderström and Stoica (Citation1989, Chapter 8).

Equivalence between P_IV in (Equation20(20) $\begin{matrix} P_{I V} = {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- 1} R_{z ϕ^{j}}^{T} W J W^{T} R_{z ϕ^{j}} {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- T}, \end{matrix}$ (20) ) and $P_{I V}^{opt}$ in (Equation23(23) $\begin{matrix} P_{I V}^{opt} = λ_{ε}^{2} {\{\overline{E} ϕ_{r}^{j} (t) {(ϕ_{r}^{j} (t))}^{T}\}}^{- 1}, \end{matrix}$ (23) ) holds if z(t), L(q) and W are designed as (24) $\begin{matrix} z_{opt} (t) = ϕ_{r}^{j} (t), \\ L_{opt} (q) = 1, \\ W_{opt} = I, and n_{z} = n_{θ}, \end{matrix}$ (24) and ϕ^j_r(t) in (Equation22(22) $\begin{matrix} ϕ_{r}^{j} (t) = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{r}^{j} (t), \end{matrix}$ (22) ). This result follows by substituting z_opt(t), L_opt(q), and W_opt in (Equation20(20) $\begin{matrix} P_{I V} = {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- 1} R_{z ϕ^{j}}^{T} W J W^{T} R_{z ϕ^{j}} {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- T}, \end{matrix}$ (20) ). Note that this design is not unique, and closely related to the proposed instrumental variables in the RIV algorithms.

The following two observations are made based on z_opt(t), L_opt(q), and W_opt. First, the optimal design reveals that minimum variance is obtained when the number of instruments n_z is equal to the number of parameters n_θ, and uniform weighting (W = I) is applied for t = 1, ..., N. Based on this observation, W and n_z are furthermore selected as W = I and n_z = n_θ. Then, (Equation20(20) $\begin{matrix} P_{I V} = {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- 1} R_{z ϕ^{j}}^{T} W J W^{T} R_{z ϕ^{j}} {(R_{z ϕ^{j}}^{T} W R_{z ϕ^{j}})}^{- T}, \end{matrix}$ (20) ) becomes (25) $\begin{matrix} P_{I V} = R_{z ϕ^{j}}^{- 1} J R_{z ϕ^{j}}^{- T}, \end{matrix}$ (25) with $R_{z ϕ^{j}}$ and J as in (Equation21(21) $\begin{matrix} \begin{matrix} R_{z ϕ^{j}} & = \overline{E} z (t) L (q) {(ϕ_{r}^{j} (t))}^{T}, \\ J & = λ_{ε}^{2} \overline{E} [L (q) z (t)] {[L (q) z (t)]}^{T}, \end{matrix} \end{matrix}$ (21) ). Furthermore, (Equation17(17) $\begin{matrix} {\hat{θ}}^{Δ} = {({\hat{R}}_{z ϕ^{j}}^{T} W {\hat{R}}_{z ϕ^{j}})}^{- 1} {\hat{R}}_{z ϕ^{j}}^{T} W {\hat{R}}_{z e_{m}^{j}}, \end{matrix}$ (17) ) reduces to a basic IV method, i.e. (26) $\begin{matrix} {\hat{θ}}^{Δ} = {\hat{R}}_{z ϕ^{j}}^{- 1} {\hat{R}}_{z e_{m}^{j}} . \end{matrix}$ (26)

Second, the optimal instruments z_opt(t) cannot be determined since P(q) is assumed to be unknown in the developed framework. To see this, note that ϕ^j_r(t) in (Equation22(22) $\begin{matrix} ϕ_{r}^{j} (t) = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{r}^{j} (t), \end{matrix}$ (22) ), and in particular y^j_r(t), cannot be determined based on the known r(t), and measured e^j_m(t) and y^j_m(t) when P(q) is unknown. In the next section, a novel algorithm is proposed that alternates between iteratively updating the estimate of the noise-free z_opt(t), and determining ${\hat{θ}}^{Δ}$ , similar to the algorithms described in Young (Citation2015).

4. Proposed approach achieving optimal accuracy

4.1 Proposed RIV algorithm

The algorithm proposed in this section is an iterative method to achieve optimal accuracy by jointly updating the estimate of the optimal instruments z_opt(t) in (Equation24(24) $\begin{matrix} z_{opt} (t) = ϕ_{r}^{j} (t), \\ L_{opt} (q) = 1, \\ W_{opt} = I, and n_{z} = n_{θ}, \end{matrix}$ (24) ) and solving for ${\hat{θ}}^{Δ}$ as in (Equation26(26) $\begin{matrix} {\hat{θ}}^{Δ} = {\hat{R}}_{z ϕ^{j}}^{- 1} {\hat{R}}_{z e_{m}^{j}} . \end{matrix}$ (26) ). Note that measured data from a single task is sufficient. Related iterative RIV methods are developed in system identification (see Jakeman & Young (Citation1979); Young, Citation2015; Young & Jakeman (Citation1979, Citation1980).

Let the index i denote the ith computational iteration of the proposed algorithm. Furthermore, ${\hat{θ}}_{}^{Δ}$ denotes the parameter estimate in iteration i − 1. In the ith iteration, z_opt(t) is approximated by (27) $\begin{matrix} z_{p, } (t) & = {\hat{ϕ}}_{r}^{j} (t) : = Ψ (q) (C_{f b} (q) \\ + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))^{- 1} r (t) . \end{matrix}$ (27) Subsequently, z_{p, }(t) in (Equation27(27) $\begin{matrix} z_{p, } (t) & = {\hat{ϕ}}_{r}^{j} (t) : = Ψ (q) (C_{f b} (q) \\ + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))^{- 1} r (t) . \end{matrix}$ (27) ) is used to determine ${\hat{θ}}_{}^{Δ}$ in the ith iteration similar to (Equation26(26) $\begin{matrix} {\hat{θ}}^{Δ} = {\hat{R}}_{z ϕ^{j}}^{- 1} {\hat{R}}_{z e_{m}^{j}} . \end{matrix}$ (26) ): $\begin{matrix} {\hat{θ}}_{}^{Δ} = {({\hat{R}}_{z ϕ^{j}, })}^{- 1} {\hat{R}}_{z e_{m}^{j}, }, \end{matrix}$ where ${\hat{R}}_{z ϕ^{j}, } = \frac{1}{N} \sum_{t = 1}^{N} z_{p, } (t) {(ϕ^{j} (t))}^{T}$ with ϕ^j(t) in (Equation13(13) $\begin{matrix} ϕ^{j} (t) = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{m}^{j} (t) \in R^{n_{θ}} . \end{matrix}$ (13) ), and ${\hat{R}}_{z e_{m}^{j}, } = \frac{1}{N} \sum_{t = 1}^{N} z_{p, } (t) e_{m}^{j} (t)$ with e^j_m(t) in (Equation1(1) $\begin{matrix} \begin{matrix} e_{m}^{j} (t) = e_{r}^{j} (t) - e_{w}^{j} (t), \\ y_{m}^{j} (t) = y_{r}^{j} (t) + y_{w}^{j} (t), \end{matrix} \end{matrix}$ (1) ). The proposed algorithm to determine ${\hat{θ}}^{Δ}$ with optimal accuracy is summarised in Algorithm 4.1.

Algorithm 4.1:

Determine ${\hat{θ}}^{Δ}$ with optimal accuracy

(a)	Initialise ${\hat{θ}}_{< i - 1 >}^{Δ} = 0$ .
(b)	Construct $C_{f f, < i >}^{j} (q, {\hat{θ}}_{< i - 1 >}^{Δ}) = Ψ (q) (θ^{j} + {\hat{θ}}_{< i - 1 >}^{Δ}) .$
(c)	Construct instrumental variables $z_{p, < i >} (t) = Ψ (q) {(C_{f b} (q) + C_{f f, < i >}^{j} (q, {\hat{θ}}_{< i - 1 >}^{Δ}))}^{- 1} r (t)$ .
(d)	Determine ${\hat{R}}_{z ϕ^{j}, < i >} = \frac{1}{N} \sum_{t = 1}^{N} z_{p, < i >} (t) {(ϕ^{j} (t))}^{T}$ and ${\hat{R}}_{z e_{m}^{j}, < i >} = \frac{1}{N} \sum_{t = 1}^{N} z_{p, < i >} (t) e_{m}^{j} (t),$ based on ϕ^j(t) in (Equation13(13) $\begin{matrix} ϕ^{j} (t) = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{m}^{j} (t) \in R^{n_{θ}} . \end{matrix}$ (13) ) and e^j_m(t) in (Equation1(1) $\begin{matrix} \begin{matrix} e_{m}^{j} (t) = e_{r}^{j} (t) - e_{w}^{j} (t), \\ y_{m}^{j} (t) = y_{r}^{j} (t) + y_{w}^{j} (t), \end{matrix} \end{matrix}$ (1) ).
(e)	Solve for ${\hat{θ}}_{< i >}^{Δ}$ as in (Equation26(26) $\begin{matrix} {\hat{θ}}^{Δ} = {\hat{R}}_{z ϕ^{j}}^{- 1} {\hat{R}}_{z e_{m}^{j}} . \end{matrix}$ (26) ): ${\hat{θ}}_{< i >}^{Δ} = {({\hat{R}}_{z ϕ^{j}, < i >})}^{- 1} {\hat{R}}_{z e_{m}^{j}, < i >} .$
(f)	Set i → i + 1 and repeat from Step (b) until a stopping criterion is met.
(g)	Set ${\hat{θ}}^{Δ} = {\hat{θ}}_{< i >}^{Δ}$ .

Before initiating Algorithm 4.1, it is advised to determine an initial parameter θ^j by using a linear least squares estimation approach as in van der Meulen et al. (Citation2008). Similar to the RIV algorithms presented in Young (Citation2015), practical use has shown that such an initial estimate is typically sufficient for subsequent convergence of Algorithm 4.1.

Remark 4.1:

An approach to deal with possible instability of ${(C_{f b} (q) + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))}^{- 1}$ in computing z_{p, }(t) is given in Boeren, Oomen, et al. (Citation2015, Appendix A).

4.2 Accuracy analysis of the proposed approach

In this section, it is shown that optimal accuracy in terms of accuracy is obtained with Algorithm 4.1. Consider the covariance matrix P_{IV, p} corresponding to z_{p, }(t) as given in . The covariance P_{IV, p} follows by substituting (Equation27(27) $\begin{matrix} z_{p, } (t) & = {\hat{ϕ}}_{r}^{j} (t) : = Ψ (q) (C_{f b} (q) \\ + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))^{- 1} r (t) . \end{matrix}$ (27) ) in (Equation25(25) $\begin{matrix} P_{I V} = R_{z ϕ^{j}}^{- 1} J R_{z ϕ^{j}}^{- T}, \end{matrix}$ (25) ). Clearly, optimal accuracy for the proposed method, i.e. P_{IV, p} equal to $P_{I V}^{opt}$ in (Equation23(23) $\begin{matrix} P_{I V}^{opt} = λ_{ε}^{2} {\{\overline{E} ϕ_{r}^{j} (t) {(ϕ_{r}^{j} (t))}^{T}\}}^{- 1}, \end{matrix}$ (23) ), is obtained if z_{p, }(t) converges to z_opt(t) in Algorithm 4.1.

Table 1. The comparison study presented in Sections 4 and 5 of this paper shows that optimal accuracy is not achieved with the existing approaches in van der Meulen et al. (Citation2008) and Boeren, Oomen, et al. (Citation2015) while the proposed approach can achieve optimal accuracy upon convergence of the proposed RIV algorithm.

Display Table

To show that z_{p, }(t) converges to z_opt(t) in subsequent iterations of the proposed algorithm, substitute (Equation2(2) $\begin{matrix} \begin{matrix} e_{r}^{j} (t) = S (q) (1 - P (q) C_{f f}^{j} (q)) r (t), \\ e_{w}^{j} = S (q) w^{j} (t), \\ y_{r}^{j} (t) = S (q) P (q) (C_{f b} (q) + C_{f f}^{j} (q)) r (t), \\ y_{w}^{j} = S (q) w^{j} (t), \end{matrix} \end{matrix}$ (2) ) and (Equation22(22) $\begin{matrix} ϕ_{r}^{j} (t) = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{r}^{j} (t), \end{matrix}$ (22) ) in (Equation24(24) $\begin{matrix} z_{opt} (t) = ϕ_{r}^{j} (t), \\ L_{opt} (q) = 1, \\ W_{opt} = I, and n_{z} = n_{θ}, \end{matrix}$ (24) ) to obtain (28) $\begin{matrix} z_{opt} (t) & = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{r}^{j} (t) \\ = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} S (q) P (q) (C_{f b} (q) \\ + C_{f f}^{j} (q)) r (t) \\ = Ψ (q) S (q) P (q) r (t), \end{matrix}$ (28) where the last equality is obtained by using the commutative property of SISO systems. Then, the difference between z_opt(t) and z_{p, }(t) in the ith iteration can be expressed as $\begin{matrix} z_{opt} (t) - z_{p, } (t) \\ = & Ψ (q) (S (q) P (q) \\ - {(C_{f b} (q) + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))}^{- 1}) r (t), \end{matrix}$ and z_{p, }(t) = z_opt(t), i.e. optimal accuracy, is obtained if (29) $\begin{matrix} {(C_{f b} (q) + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))}^{- 1} = S (q) P (q) . \end{matrix}$ (29)

It remains to be shown that (Equation31(31) $\begin{matrix} P_{I V, 1} & = & λ_{ε}^{2} {[\overline{E} Ψ (q) r (t) {(ϕ_{r}^{j} (t))}^{T}]}^{- 1} \\ \times [\overline{E} Ψ (q) r (t) {(Ψ (q) r (t))}^{T}] \\ \times {[\overline{E} Ψ (q) r (t) {(ϕ_{r}^{j} (t))}^{T}]}^{- T} . \end{matrix}$ (31) ) holds, i.e. z_{p, }(t) converges to z_opt(t), to guarantee that Algorithm 4.1 results in optimal accuracy. Recall from Section 3 that ${\hat{θ}}_{< 1 >}^{Δ}$ in iteration i = 1 is a consistent estimator, i.e. ${\hat{θ}}_{< 1 >}^{Δ} = θ_{0}^{Δ}$ for N to infinity, and that consistency of ${\hat{θ}}_{< 1 >}^{Δ}$ implies that (30) $\begin{matrix} C_{f f, < 2 >}^{j} (q, {\hat{θ}}_{< 1 >}^{Δ}) = P^{- 1} (q) . \end{matrix}$ (30) Substituting (Equation29(29) $\begin{matrix} {(C_{f b} (q) + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))}^{- 1} = S (q) P (q) . \end{matrix}$ (29) ) in (Equation29(29) $\begin{matrix} {(C_{f b} (q) + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))}^{- 1} = S (q) P (q) . \end{matrix}$ (29) ) with i = 2 and rearranging terms illustrates that z_{p, <2 >}(t) = z_opt(t). This result shows that optimal accuracy in terms of accuracy is achieved with Algorithm 4.1.

Remark 4.2:

For finite N, ${\hat{θ}}_{< 1 >}^{Δ}$ is not exactly equal to θ^Δ₀ and multiple iterations are typically required to refine z_{p, }(t). Practical use of the algorithm shows good convergence properties. This is in accordance with the convergence analysis for closely related iterative RIV algorithms (Young, Citation2015).

4.3 Design procedure

Next, Algorithm 4.1 is embedded in a design procedure to determine C^{j + 1}_ff(q, θ^{j + 1}) according to (Equation7(7) $\begin{matrix} {\hat{θ}}^{Δ} = arg min_{θ^{Δ}} V (θ^{Δ}), \end{matrix}$ (7) ) with V(θ^Δ) in (Equation16(16) $\begin{matrix} V (θ^{Δ}) = {||\frac{1}{N} \sum_{t = 1}^{N} z (t) L (q) {\hat{e}}^{j + 1} (t, θ^{Δ})||}_{W}^{2}, \end{matrix}$ (16) ). This design procedure implements the main contribution of this paper, and is given next.

Procedure 4.1:

Estimation of C^{j + 1}_ff(q, θ^{j + 1}) after the jth task:

1.	Measure e^j_m(t) and y^j_m(t) for $t = 1, \dots, N$ , in the jth task.
2.	Construct ϕ^j(t) = Ψ(q)(C_fb(q) + C^j_ff(q, θ^j))^{− 1}y^j_m(t).
3.	Algorithm 4.1: Determine ${\hat{θ}}^{Δ}$ with optimal accuracy.
4.	Construct $C_{f f}^{j + 1} (q, θ^{j + 1}) = Ψ (q) (θ^{j} + {\hat{θ}}^{Δ}) .$
5.	Set j → j + 1 and go to Step 1.

Remark 4.3:

Procedure 4.1 is based on measured data from a single task. The proposed procedure can directly be implemented as a (batch-wise) recursive approach, where measured data from multiple tasks is used to reduce the variance of ${\hat{θ}}^{Δ}$ .

5. Accuracy analysis of existing approaches

In this section, the accuracy properties of the iterative feedforward tuning approaches in van der Meulen et al. (Citation2008) and Boeren, Oomen, et al. (Citation2015) are compared with the optimal approach derived in Section 3. An overview of this comparison is provided in .

5.1 Accuracy analysis of the approach in Boeren, Oomen, et al. (Citation2015)

The instrumental variable approach in Boeren, Oomen, et al. (Citation2015) uses as instruments z₁(t) = Ψ(q)r(t), with Ψ(q) the basis functions of C^j_ff(q), and L₁(q) = 1. The covariance matrix P_{IV, 1} corresponding to this design follows by substituting z₁(t) and L₁(q) = 1 in (Equation25(25) $\begin{matrix} P_{I V} = R_{z ϕ^{j}}^{- 1} J R_{z ϕ^{j}}^{- T}, \end{matrix}$ (25) ), and is given by (31) $\begin{matrix} P_{I V, 1} & = & λ_{ε}^{2} {[\overline{E} Ψ (q) r (t) {(ϕ_{r}^{j} (t))}^{T}]}^{- 1} \\ \times [\overline{E} Ψ (q) r (t) {(Ψ (q) r (t))}^{T}] \\ \times {[\overline{E} Ψ (q) r (t) {(ϕ_{r}^{j} (t))}^{T}]}^{- T} . \end{matrix}$ (31) Based on (Equation31(31) $\begin{matrix} P_{I V, 1} & = & λ_{ε}^{2} {[\overline{E} Ψ (q) r (t) {(ϕ_{r}^{j} (t))}^{T}]}^{- 1} \\ \times [\overline{E} Ψ (q) r (t) {(Ψ (q) r (t))}^{T}] \\ \times {[\overline{E} Ψ (q) r (t) {(ϕ_{r}^{j} (t))}^{T}]}^{- T} . \end{matrix}$ (31) ) and (Equation23(23) $\begin{matrix} P_{I V}^{opt} = λ_{ε}^{2} {\{\overline{E} ϕ_{r}^{j} (t) {(ϕ_{r}^{j} (t))}^{T}\}}^{- 1}, \end{matrix}$ (23) ), it can be shown that $P_{I V, 1} ≻ P_{I V}^{opt},$ i.e. the approach in Boeren, Oomen, et al. (Citation2015) results in non-optimal accuracy in terms of variance.

5.2 Accuracy analysis of the approach in van der Meulen et al. (Citation2008)

The iterative feedforward approach proposed in van der Meulen et al. (Citation2008) utilises L₂(q) = 1 and z₂(t) = ϕ₂(t) as instruments, where ϕ₂(t) = Ψ(q)(C_fb(q) + C^j_ff(q))^{− 1}y_m(t) is constructed based on measured data obtained in an additional task. As such, measured data from two tasks is required to determine ${\hat{θ}}^{Δ}$ , which results in a $1 / \sqrt{2}$ reduced accuracy compared to the optimal approach in Section 3. To see this, note that the asymptotic distribution of ${\hat{θ}}^{Δ}$ corresponding to z₂(t) yields (32) $\begin{matrix} \sqrt{N} ({\hat{θ}}^{Δ} - θ_{0}^{Δ}) \overset{dist}{\to} N (0, P_{I V, 2}), \end{matrix}$ (32) based on two tasks of each N samples, where P_{IV, 2} yields $\begin{matrix} P_{I V, 2} & = & λ_{ε}^{2} {[\overline{E} ϕ_{r}^{j} (t) {(ϕ_{r}^{j} (t))}^{T}]}^{- 1} \\ \times [\overline{E} ϕ_{2} (t) ϕ_{2}^{T} (t)] {[\overline{E} ϕ_{r}^{j} (t) {(ϕ_{r}^{j} (t))}^{T}]}^{- T} . \end{matrix}$ In contrast, the asymptotic distribution corresponding to the optimal instruments z_opt is given by (33) $\begin{matrix} \sqrt{2 N} ({\hat{θ}}^{Δ} - θ_{0}^{Δ}) \overset{dist}{\to} N (0, P_{I V}^{opt}), \end{matrix}$ (33) based on two tasks of each N samples. Comparing (Equation32(34) $\begin{matrix} {\overline{θ}}^{j} = \frac{1}{m} \sum_{l = 1}^{m} {\hat{θ}}_{l}^{j}, \end{matrix}$ (34) ) and (Equation33(33) $\begin{matrix} \sqrt{2 N} ({\hat{θ}}^{Δ} - θ_{0}^{Δ}) \overset{dist}{\to} N (0, P_{I V}^{opt}), \end{matrix}$ (33) ) reveals a $1 / \sqrt{2}$ reduced accuracy for the approach based on z₂(t) when compared to the optimal approach. This result confirms that non-optimal accuracy is obtained with the approach proposed in van der Meulen et al. (Citation2008).

6. Simulation example

In this section, a simulation study is presented to

(1)	Confirm that unbiased parameter estimates are obtained for all considered IV approaches in ,
(2)	Show that the proposed approach z_{p, <i >}(t) leads to enhanced accuracy of the parameter estimates compared to the pre-existing approaches based on z₁(t) and z₂(t),
(3)	Illustrate the close relation between the statistical accuracy of ${\hat{θ}}^{Δ}$ and the performance of a motion system.

6.1 System description

Consider the system P(q) given by $\begin{matrix} P (q) = \frac{1.761 \times 10^{- 9}}{1 - 3.69 q^{- 1} + 5.225 q^{- 2} - 3.38 q^{- 3} + 0.8451 q^{- 4}}, \end{matrix}$ which represents a two-mass spring damper system with non-collocated dynamics (see for a Bode plot). A schematic illustration of the two-mass spring damper system is depicted in . The feedback controller C_fb(q) is given by $\begin{matrix} C_{f b} = \frac{7.444 \times 10^{4} q^{- 1} - 1.47 \times 10^{5} q^{- 2} + 7.259 \times 10^{4} q^{- 3}}{1 - 2.736 q^{- 1} + 2.49 q^{- 2} - 0.7537 q^{- 3}} . \end{matrix}$ Recall from Section 2.2 that the unknown disturbance w^j(t) is assumed to be given by w^j(t) = H(q)ε^j(t). Here, H(q) is designed such that Assumption 2.1 holds, i.e. H(q) = 1 + P(q)C_fb(q), and {ε^j(t)} is normally distributed white noise with zero mean and standard deviation λ_ε = 2.5 × 10⁻⁸. The system is excited by a third-order reference signal, designed as in Biagiotti and Melchiorri (Citation2012). Furthermore, the feedforward controller C^j_ff(q, θ^j) is parametrised as $\begin{matrix} C_{f f}^{j} (q, θ^{j}) = ψ_{a} (q^{- 1}) θ_{a}^{j} + ψ_{s} (q^{- 1}) θ_{s}^{j}, \end{matrix}$ with basis functions $\begin{matrix} ψ_{a} (q^{- 1}) = {(\frac{1 - q^{- 1}}{T_{s}})}^{2}, ψ_{s} (q^{- 1}) = {(\frac{1 - q^{- 1}}{T_{s}})}^{4}, \end{matrix}$ parameters θ^j = [θ^j_a, θ_s^j]^T, and sampling time T_s = 5 × 10⁻⁴ s. Hence, C^j_ff(q, θ^j) consist of acceleration feedforward ψ_a(q^{− 1})θ^j_a and snap feedforward ψ_s(q^{− 1})θ^j_s. Straightforward computations reveal that the true parameter vector θ₀ of C^j_ff(q, θ^j), as defined in Section 3, is given by θ₀ = [22, 3 × 10⁻⁵]^T. This implies that the reference-induced contribution e^j_r(t) is equal to zero for all t when C^j_ff(q, θ₀) is used as feedforward controller.

Figure 4. Bode diagram of the system P.

Figure 5. Schematic illustration of a two-mass spring damper system.

A Monte Carlo simulation study is performed for the proposed approach with z_{p, }(t) in (Equation27(27) $\begin{matrix} z_{p, } (t) & = {\hat{ϕ}}_{r}^{j} (t) : = Ψ (q) (C_{f b} (q) \\ + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))^{- 1} r (t) . \end{matrix}$ (27) ) and the pre-existing approaches based on z₁(t) and z₂(t). The number of realisations is equal to m = 200, and a parameter estimate in the lth realisation is denoted ${\hat{θ}}_{l}$ . In a single realisation, M = 5 tasks are performed, consisting of N = 6000 samples each. After the jth task in the lth realisation, Procedure 4.1 is used to determine ${\hat{θ}}_{l}^{j + 1}$ based on θ^j_l, and the measured signals e^j_m and y^j_m in the jth task. The initial parameter vector is given by θ^init = [16, 1 × 10⁻⁵]^T. The sample mean corresponding to the jth task in the lth realisation is defined as (34) $\begin{matrix} {\overline{θ}}^{j} = \frac{1}{m} \sum_{l = 1}^{m} {\hat{θ}}_{l}^{j}, \end{matrix}$ (34) with ${\overline{θ}}^{j} = {[{\overline{θ}}_{a}^{j} {\overline{θ}}_{s}^{j}]}^{T} .$ The corresponding feedforward controller is denoted as $C_{f f}^{j} (q, {\overline{θ}}^{j})$ .

6.2 Simulation results: parameter estimation

The results of the Monte Carlo simulation study are given in and . The following observations are made:

Figure 6. Simulation results. Parameters θ^ as a function of tasks for m = 200 realisations for z1(t) (left), z2(t) (middle) and the proposed zp, <i >(t) (right) show that the standard deviation of θ^a is comparable for all approaches, while the standard deviation of θ^s is significantly smaller for the proposed zp, <i >(t) compared to z1(t) and z2(t).

Figure 6. Simulation results. Parameters $\hat{θ}$ as a function of tasks for m = 200 realisations for z₁(t) (left), z₂(t) (middle) and the proposed z_{p, <i >}(t) (right) show that the standard deviation of ${\hat{θ}}_{a}$ is comparable for all approaches, while the standard deviation of ${\hat{θ}}_{s}$ is significantly smaller for the proposed z_{p, <i >}(t) compared to z₁(t) and z₂(t).
(1)	Unbiased estimates of θ₀, i.e. ${\overline{θ}}^{j} = θ_{0}$ , are obtained for z_{p, <i >}(t), z₁(t), and z₂(t). This confirms that all considered IV approaches in lead to unbiased estimates.
(2)	The standard deviation of ${\overline{θ}}_{s}^{j}$ is significantly smaller for z_{p, <i >}(t) when compared to z₁(t) and z₂(t). This confirms that the proposed approach leads to improved accuracy in terms of variance.

Table 2. Summary of results of Monte Carlo simulation. The mean value of ${\overline{θ}}_{a}^{1}$ and ${\overline{θ}}_{s}^{1}$ in task j = 1 for z_{p, }(t), z₁(t) and z₂(t) confirm that unbiased parameter estimates are obtained for all methods, while the standard deviation of ${\overline{θ}}_{a}^{1}$ and ${\overline{θ}}_{s}^{1}$ confirms that an enhanced accuracy is obtained with the proposed approach z_{p, }(t) compared to z₁(t) and z₂(t).

Display Table

6.3 Simulation results: accuracy and performance

Next, the relation between the statistical accuracy of the estimated parameters and the obtained performance is analysed for the considered system. Recall from (Equation6(6) $\begin{matrix} e_{r}^{j + 1} (t, θ^{j + 1}) = S (q) (1 - P (q) C_{f f}^{j + 1} (q, θ^{j + 1})) r (t), \end{matrix}$ (6) ) that a high-performance C^j_ff(q, θ^j) minimises e^j_r(t, θ^j), i.e. the error induced by the known r(t). Since ${\overline{θ}}^{j}$ in (Equation34(34) $\begin{matrix} {\overline{θ}}^{j} = \frac{1}{m} \sum_{l = 1}^{m} {\hat{θ}}_{l}^{j}, \end{matrix}$ (34) ) is an unbiased estimate of θ₀ for all approaches in , it follows from Section 3 that $e_{r}^{j} (t, {\overline{θ}}^{j}) = 0$ for all t when $C_{f f}^{j} (q, {\overline{θ}}^{j})$ is used as feedforward controller. Consequently, (Equation1(1) $\begin{matrix} \begin{matrix} e_{m}^{j} (t) = e_{r}^{j} (t) - e_{w}^{j} (t), \\ y_{m}^{j} (t) = y_{r}^{j} (t) + y_{w}^{j} (t), \end{matrix} \end{matrix}$ (1) ) implies that (35) $\begin{matrix} e_{m}^{j} (t, {\overline{θ}}^{j}) = - e_{w}^{j} (t), \end{matrix}$ (35) which is the best possible result in the developed framework for a fixed C_fb(q). However, as already shown in Section 6.2, the estimate ${\hat{θ}}_{l}^{j}$ in realisation l can significantly deviate from the sample mean ${\overline{θ}}^{j}$ . In this section, the influence of this deviation on the achieved performance is analysed for the IV approaches in .

Suppose that ${\hat{θ}}_{l}^{j}$ , $e_{m}^{j} (t, {\hat{θ}}_{l}^{j})$ and $V ({\hat{θ}}_{l}^{j})$ are stored for all tasks, i.e. j = 1, …, 5, and all realisations, i.e. l = 1, …, 200. Let ${\hat{θ}}_{wc}^{j}$ denote the parameters such that $V ({\hat{θ}}_{wc}^{j}) \geq V ({\hat{θ}}_{l}^{j})$ for l = 1, …, 200. The corresponding worst-case error $e_{wc}^{j} (t, {\hat{θ}}_{wc}^{j})$ follows from (Equation1(1) $\begin{matrix} \begin{matrix} e_{m}^{j} (t) = e_{r}^{j} (t) - e_{w}^{j} (t), \\ y_{m}^{j} (t) = y_{r}^{j} (t) + y_{w}^{j} (t), \end{matrix} \end{matrix}$ (1) ) as (36) $\begin{matrix} e_{wc}^{j} (t, {\hat{θ}}_{wc}^{j}) = e_{r}^{j} (t, {\hat{θ}}_{wc}^{j}) - e_{w}^{j} (t), \end{matrix}$ (36) while the feedforward controller is denoted as $C_{f f}^{j} (q, {\hat{θ}}_{wc}^{j})$ . Next, it is shown that enhanced accuracy in terms of variance results in a smaller difference between $e_{wc}^{j} (t, {\hat{θ}}_{wc}^{j})$ in (Equation36(36) $\begin{matrix} e_{wc}^{j} (t, {\hat{θ}}_{wc}^{j}) = e_{r}^{j} (t, {\hat{θ}}_{wc}^{j}) - e_{w}^{j} (t), \end{matrix}$ (36) ) and $e_{m}^{j} (t, {\overline{θ}}^{j})$ in (Equation35(35) $\begin{matrix} e_{m}^{j} (t, {\overline{θ}}^{j}) = - e_{w}^{j} (t), \end{matrix}$ (35) ), i.e. improved worst-case performance.

The error $e_{wc}^{1} (t)$ in task j = 1 and cumulative power spectrum are depicted in and , respectively. The following observations are made:

Figure 7. Simulation results. The error signal e wc 1(t,θ^ wc 1) in task j = 1 shows that ewc1(t,θ^ wc 1) contains a significant reference-induced component er1(t,θ^ wc 1) for z1(t) (left), while ewc1(t,θ^ wc 1) is dominated by e1w(t) for zp, <i >(t) (right). For comparison, the peak value of the error signal with feedback only is given by 1 × 10−4 (m).

Figure 8. Simulation results. Cumulative power spectrum of em1(t,θ‾1) in task j = 1 for Cff1(q,θ‾1) (black), and ewc1(t,θ^ wc 1) corresponding to Cff1(q,θ^wc1) for z1(t) (red) , z2(t) (blue) and zp, <i >(t) (green). (To view this figure in colour, please see the online version of the journal.)

Figure 7. Simulation results. The error signal $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ in task j = 1 shows that $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ contains a significant reference-induced component $e_{r}^{1} (t, {\hat{θ}}_{wc}^{1})$ for z₁(t) (left), while $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ is dominated by e¹_w(t) for z_{p, <i >}(t) (right). For comparison, the peak value of the error signal with feedback only is given by 1 × 10⁻⁴ (m).
(1)	For z_{p, <i >}(t), the contribution of $e_{r}^{1} (t, {\hat{θ}}_{wc}^{1})$ to $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ is negligible compared to the contribution of e¹_w(t). Hence, $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ is similar to the optimal case $e_{m}^{1} (t, {\overline{θ}}^{1})$ .
(2)	For z₁(t), the contribution of $e_{r}^{1} (t, {\hat{θ}}_{wc}^{1})$ to $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ is significant. As a result, $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ is significantly degraded compared to the optimal case $e_{m}^{1} (t, {\overline{θ}}^{1})$ .

Similar results as provided for z₁(t) are obtained for z₂(t), and are omitted for brevity.

The provided simulation study showed that an enhanced accuracy of the parameter estimates results in a reduced difference between $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ in (Equation36(36) $\begin{matrix} e_{wc}^{j} (t, {\hat{θ}}_{wc}^{j}) = e_{r}^{j} (t, {\hat{θ}}_{wc}^{j}) - e_{w}^{j} (t), \end{matrix}$ (36) ) and $e_{m}^{1} (t, {\overline{θ}}^{1})$ in (Equation35(35) $\begin{matrix} e_{m}^{j} (t, {\overline{θ}}^{j}) = - e_{w}^{j} (t), \end{matrix}$ (35) ). Hence, the worst-case error $e_{wc}^{1} (t, {\hat{θ}}_{wc}^{1})$ based on a single set of measured data is improved by using the proposed instruments z_{p, }(t). This confirms that the statistical accuracy properties of ${\hat{θ}}^{j}$ are important for performance.

7. Experimental results

In this section, the proposed approach in Section 4 is applied to the prototype industrial nanopositioning system depicted in . The positioning stage, measurement system, and actuation system are placed on a vibration isolation table to isolate the system from external disturbances originating from the environment.

Figure 9. Experimental setup with $. 5 p t - . 9 p t 1$ measurement system, $. 5 p t - . 9 p t 2$ positioning stage, $. 5 p t - . 9 p t 3$ linear magnetic actuation system and $. 5 p t - . 9 p t 4$ vibration isolation table.

The positioning stage is magnetically levitated and actuated, and controlled in six motion degrees of freedom (DOFs): three translations (x, y, and z) and three rotations (R_x, R_y and R_z). Magnetically levitated stages exhibit contactless operation. Therefore, friction (which is typically an important disturbance in motion control) is eliminated.

The actuation system consists of six linear magnetic motors, with an added position offset such that each actuator can also generate a force in the perpendicular direction. The permanent magnets are connected to the vibration isolation table, while the coils are part of the positioning stage.

The measurement system consists of laser interferometers in conjunction with a mirror block, connected to the vibration isolation table and the positioning stage, respectively. This system enables high-accuracy position measurements in all six motion DOFs. In particular, subnanometer resolution position measurements are available for the translational DOFs x, y and z. Throughout, all systems and signals operate in discrete time with a sampling time of T_s = 2 × 10⁻⁴ s.

7.1 Control configuration and reference signal design

The proposed design procedure for the feedforward controller is applied to the x-direction of the system, i.e. the long-stroke (80 mm) direction of the setup in . A stabilising feedback controller $C_{f b}^{mimo}$ is designed for the multivariable system by means of sequential loopshaping (see Skogestad & Postlethwaite, Citation2005, Section ) for details. By closing the control loops for the remaining 5 DOFs, i.e. y, z, R_x, R_y and R_z, a single-input, single-output equivalent system P(q) is obtained for the x-direction. The frequency response function of the equivalent system P(q) for the x-direction is depicted in . The dynamical response of a linear motion system P(s), where s is the Laplace operator, with proportional damping can be written as a sum of N second-order subsystems (37) $\begin{matrix} \begin{matrix} P (s) & = \sum_{i = 1}^{n} \frac{c_{i}^{T} b_{i}}{s^{2}} + \sum_{i = n + 1}^{N} \frac{c_{i}^{T} b_{i}}{s^{2} + 2 ζ_{i} ω_{i} s + ω_{i}^{2}}, \end{matrix} \end{matrix}$ (37) with n the number of rigid-body modes, c^T_i the ith column of the output matrix $C \in R^{n \times N}$ , b_i the ith row of the input matrix $B \in R^{N \times n}$ , ζ_i the dimensionless damping constant, and ω_i the natural frequency of the ith second-order subsystem. Inspection reveals rigid-body behaviour (as described by the first term in (Equation37(37) $\begin{matrix} \begin{matrix} P (s) & = \sum_{i = 1}^{n} \frac{c_{i}^{T} b_{i}}{s^{2}} + \sum_{i = n + 1}^{N} \frac{c_{i}^{T} b_{i}}{s^{2} + 2 ζ_{i} ω_{i} s + ω_{i}^{2}}, \end{matrix} \end{matrix}$ (37) )) below approximately 300 Hz, while the first resonance phenomena (as described by the latter term in (Equation37(37) $\begin{matrix} \begin{matrix} P (s) & = \sum_{i = 1}^{n} \frac{c_{i}^{T} b_{i}}{s^{2}} + \sum_{i = n + 1}^{N} \frac{c_{i}^{T} b_{i}}{s^{2} + 2 ζ_{i} ω_{i} s + ω_{i}^{2}}, \end{matrix} \end{matrix}$ (37) )) appear at 480 and 860 Hz. For a motion system with dominant rigid-body dynamics in the frequency range of interest, parametric models are typically developed that only describe the rigid-body dynamics, i.e. the first term in (Equation37(37) $\begin{matrix} \begin{matrix} P (s) & = \sum_{i = 1}^{n} \frac{c_{i}^{T} b_{i}}{s^{2}} + \sum_{i = n + 1}^{N} \frac{c_{i}^{T} b_{i}}{s^{2} + 2 ζ_{i} ω_{i} s + ω_{i}^{2}}, \end{matrix} \end{matrix}$ (37) ).

Figure 10. Frequency response function of the considered system P(q) in x-direction.

The feedback controller C_fb(q) for the x-direction of the system is depicted in , and achieves a bandwidth (defined as the lowest frequency where |C_fbP| = 1) of 120 Hz. This bandwidth results in rejection of low-frequency disturbances, while having sufficient robustness against uncertainty in the resonances of P(q).

Figure 11. Bode diagram of the feedback controller C_fb(q) for the x-direction of the experimental setup.

The reference r(t) of the performed servo task is depicted in , together with its acceleration, jerk and snap, i.e. the second, third and fourth derivative of r(t).

Figure 12. Position r(t), velocity v(t), jerk j(t), and snap s(t) of the servo task.

7.2 Parametrisation feedforward controller

For the general parametrisation of C_ff(q, θ) in (Equation3(3) $\begin{matrix} C_{f f}^{j + 1} (q, θ^{j + 1}) & = C_{f f}^{j} (q, θ^{j}) + C_{f f}^{Δ} (q, θ^{Δ}) \\ = \sum_{i = 1}^{n_{θ}} ψ_{i} (q^{- 1}) (θ_{i}^{j} + θ_{i}^{Δ}), \end{matrix}$ (3) ), (1) the number of parameters n_θ and (2) the selection of basis functions Ψ(q) should be determined to obtain C_ff(q, θ). Here, the parametrisation of C_ff(q, θ) is chosen as in Lambrechts et al. (Citation2005). This parametrisation is developed for motion systems with dominant rigid-body dynamics as in , and is given by (38) $\begin{matrix} C_{f f} (q, θ) = Ψ (q) θ, \end{matrix}$ (38) with basis functions Ψ(q) = [ψ_v(q⁻¹), ψ_a(q⁻¹), ψ_j(q⁻¹), ψ_s(q⁻¹)], where (39) $\begin{matrix} \begin{matrix} ψ_{v} (q^{- 1}) = (\frac{1 - q^{- 1}}{T_{s}}), & ψ_{a} (q^{- 1}) = {(\frac{1 - q^{- 1}}{T_{s}})}^{2}, \\ ψ_{j} (q^{- 1}) = {(\frac{1 - q^{- 1}}{T_{s}})}^{3}, & ψ_{s} (q^{- 1}) = {(\frac{1 - q^{- 1}}{T_{s}})}^{4}, \end{matrix} \end{matrix}$ (39) and corresponding parameters $θ = {[θ_{v} θ_{a} θ_{j} θ_{s}]}^{T} \in R^{n_{θ}}$ . For C_ff(q, θ) in (40), it holds that $\begin{matrix} C_{f f} {(q, θ) |}_{q = 1} = 0, \end{matrix}$ i.e. the static gain of C_ff(q, θ) is equal to zero. This condition implies that the feedforward signal u_ff is equal to zero when the system is in stand-still, which is a desired property for motion systems with rigid-body dynamics. Furthermore, recall that the considered experimental setup operates contactless, thereby eliminating performance-deteriorating friction. As such, friction feedforward is not included in the feedforward controller in (40).

7.3 Experimental results for optimal IV method

In this section, the key experimental results of this paper are presented, which involve the application of Procedure 4.1 on the nanopositioning system in . Five tasks are performed of N = 2700 samples each, with r(t) as depicted in . In addition, r(t) filtered by the basis functions in (41) is shown in .

During the jth task, the measured signals e^j_m(t) and y^j_m(t), for t = 1, ..., N, are stored. Then, this batch of measured data is used to determine C^{j + 1}_ff(q, θ^{j + 1}). To initialise Procedure 4.1, C¹_ff(q, θ¹) in the first task has parameters θ¹ = [0, 0, 0, 0]^T, i.e., C¹_ff(q, θ¹) = 0. The performance obtained in task j = 1 is, therefore, the performance with only feedback control applied to the system. Alternatively, the linear least squares estimation approach in van der Meulen et al. (Citation2008) can be used to determine an initial parameter vector θ¹.

The measured error signal e^j_m(t) in the first, second and third task are depicted in , while the corresponding cumulative power spectrum is shown in . In addition, the two-norm of e^j_m(t), i.e. ‖e^j_m(t)‖₂², as a function of tasks is depicted in . The following observations are made:

Figure 13. Experimental results. The measured error signal ejm(t) in task j = 1 (red), j = 2 (blue) and j = 3 (green) shows that the peak value of the error signal for feedback only in task j = 1 is 4.5 × 10−6 (m), and is reduced by 97 % by using iterative feedforward control in j = 3. (To view this figure in colour, please see the online version of this journal.)

Figure 14. Experimental results. Cumulative power spectrum of the measured ejm(t) in task j = 1 (red), j = 2 (blue) and j = 3 (green). (To view this figure in colour, please see the online version of this journal.)

Figure 15. The two-norm of the measured error ejm(t) as a function of tasks shows convergence in two tasks.

Figure 13. Experimental results. The measured error signal *e^j_m*(t) in task j = 1 (red), j = 2 (blue) and j = 3 (green) shows that the peak value of the error signal for feedback only in task j = 1 is 4.5 × 10⁻⁶ (m), and is reduced by 97 % by using iterative feedforward control in j = 3. (To view this figure in colour, please see the online version of this journal.)
(1)	The peak value of e³_m(t) in task j = 3 is reduced by approximately 97% when compared to e¹_m(t) in the first task. This confirms that the proposed iterative feedforward approach significantly enhances the positioning accuracy of the system compared to only feedback control.
(2)	and show that the low-frequency contribution up to approximately 10 Hz is not compensated for by C³_ff(q, θ³) in task j = 3. This implies that the dynamical behaviour of the experimental setup in this frequency range is not captured by the parametrisation proposed in Section 7.2. This can be contributed to the dynamics of the cable connection between the fixed world and the (moving) positioning stage, which acts as a low-frequency disturbance on the system.
(3)	shows that ‖e^j_m(t)‖₂² converges in two tasks. This confirms that fast convergence in terms of ‖e^j_m(t)‖₂² is obtained with the proposed approach.

7.4 Analysis of Algorithm 4.1

The iterative refinement of z_{p, }(t) proposed in Algorithm 4.1 is an essential attribute of Procedure 4.1. In this section, Algorithm 4.1 is illustrated for the considered experimental setup. Recall from (Equation28(28) $\begin{matrix} z_{opt} (t) & = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} y_{r}^{j} (t) \\ = Ψ (q) {(C_{f b} (q) + C_{f f}^{j} (q))}^{- 1} S (q) P (q) (C_{f b} (q) \\ + C_{f f}^{j} (q)) r (t) \\ = Ψ (q) S (q) P (q) r (t), \end{matrix}$ (28) ) that the optimal instruments z_opt(t) can be expressed as $\begin{matrix} z_{opt} (t) = Ψ (q) S (q) P (q) r (t), \end{matrix}$ and optimal accuracy is obtained, i.e. z_{p, }(t) = z_opt(t), if $\begin{matrix} {(C_{f b} (q) + C_{f f, }^{j} (q, {\hat{θ}}_{}^{Δ}))}^{- 1} = S (q) P (q) . \end{matrix}$ This result enables a visual illustration of Algorithm 4.1 by comparing the identified frequency response function of S(q)P(q) with the frequency response of (C_fb(q) + C^j_{ff, }(q))^{− 1}.

The analysis in this section is based on the measured signals e¹_m(t) and y¹_m(t), for t = 1, …, N, in task j = 1 with C¹_ff(q, θ¹) implemented on the experimental setup. Furthermore, the number of computational iterations of Algorithm 4.1 is given by K = 3. reveals that (C_fb + C¹_{ff, <3 >})^{− 1} in iteration i = 3 of Algorithm 4.1 is a significantly improved approximation of S(q)P(q) in the frequency range up to 200 Hz, when compared to iteration i = 1. This confirms that iterative refinement of z_{p, }(t) by means of Algorithm 4.1 results in z_{p, }(t) that resemble the optimal z_opt(t).

Figure 16. Iterative refinement of the instruments z_{p, ** }(t) after task j = 1: (C_fb + C¹_{ff, <3 >})^{− 1} (green) corresponding to the i = 3 computational iteration of Algorithm 4.1 is an improved approximation of the frequency response function of the process sensitivity S(q)P(q) (black) compared to (C_fb + C¹_{ff, <1 >})^{− 1} (dashed red) in the i = 1 iteration of Algorithm 4.1. (To view this figure in colour, please see the online version of this journal.)

Figure 16. Iterative refinement of the instruments zp, ** < i >(t) after task j = 1: (Cfb + C1ff, <3 >)− 1 (green) corresponding to the i = 3 computational iteration of Algorithm 4.1 is an improved approximation of the frequency response function of the process sensitivity S(q)P(q) (black) compared to (Cfb + C1ff, <1 >)− 1 (dashed red) in the i = 1 iteration of Algorithm 4.1. (To view this figure in colour, please see the online version of this journal.)

7.5 Enhanced flexibility to reference variations and comparison with ILC

As is argued in Sections 1 and 2, the key motivation for the proposed feedforward approach compared to ILC algorithms is an enhanced flexibility with respect to changes in the reference trajectory. To demonstrate this flexibility, two similar yet slightly different reference trajectories are applied to the considered nanopositioning system. These reference trajectories are depicted in . Then, the optimal IV approach proposed in Section 4 is compared to the standard frequency-domain ILC-based approach (see, e.g. Bristow et al., Citation2006).

Figure 17. Reference trajectory r₁ for task j = 1, 2, …, 5 (blue) and reference trajectory r₂ for task j = 6, 7, …, 10 (dashed green). (To view this figure in colour, please see the online version of this journal.)

Figure 17. Reference trajectory r1 for task j = 1, 2, …, 5 (blue) and reference trajectory r2 for task j = 6, 7, …, 10 (dashed green). (To view this figure in colour, please see the online version of this journal.)

The measured error signals e₅(t) in task j = 5 and e₆(t) in task j = 6 as depicted in for ILC, and in for the proposed optimal IV approach confirm that (1) the servo performance obtained with ILC severely deteriorates when the reference is slightly changed and (2) the proposed approach is insensitive to reference changes. Note that the error in tasks j = 5 is smaller for ILC than for the proposed approach. Since motion systems are typically confronted with similar yet slightly different reference signals (Lambrechts et al., Citation2005; Oomen et al., Citation2014), the proposed optimal IV approach is preferred in industrial practice since learning transients are eliminated when the reference trajectory is changed.

Figure 18. Flexibility with respect to changes in the reference trajectory between tasks for ILC. Before task j = 6, the reference trajectory is changed from r₁ to r₂ (see ). For standard ILC, the measured error signal e₅(t) (blue) in task j = 5 is significantly smaller than for e₆(t) (green) in task j = 6. This confirms that for standard ILC, the servo performance is severely deteriorated if the reference is changed at j = 6. (To view this this figure in colour, please see the online version of this journal.)

Figure 19. Flexibility with respect to changes in the reference trajectory between tasks for the proposed approach in Section 4. Before task j = 6, the reference trajectory is changed from r₁ to r₂ (see ). The measured error signal e₅(t) (blue) in task j = 5 is similar to e₆(t) (green) in task j = 6, which shows that the servo performance for the proposed IV-approach is invariant to changes in the reference. (To view this figure in colour, please see the online version of this journal.)

8. Conclusions

In this paper, a new algorithm is proposed for iterative feedforward control based on instrumental variables. The key advantage of the proposed algorithm is that it achieves optimal accuracy in terms of variance, in contrast to existing approaches, which are shown to be non-optimal. To achieve optimal accuracy, the proposed algorithm iteratively updates an estimate of the optimal instruments. The assumptions that are introduced in Section 2.2 are for a large class of motion systems nonrestrictive for the achievable performance. If the considered motion system is subject to reference signals with high-frequency signal content, the proposed approach can be extended by allowing more general parametrisations by means of input shaping (Boeren, Bruijnen, van Dijk, et al. Citation2014) and rational feedforward (Bolder & Oomen, Citation2015). The proposed method is validated by means of a simulation example, showing improved accuracy compared to pre-existing approaches. Finally, the procedure is successfully applied to an industrial nanopositioning system. The presented experimental results confirm the practical relevance of the proposed approach.

Ongoing research focuses on extensions towards optimal input design (Formentin, Karimi, & Savaresi, Citation2013a), inferential control, positioning-varying effects (Groot Wassink, van de Wal, Scherer, & Bosgra, Citation2005) and multivariable systems. For the considered nanopositioning system in , improved performance can be obtained by compensating for the dynamics of the cable connection between the fixed world and the (moving) positioning stage by means of feedforward control.

Acknowledgements

The authors acknowledge Maarten Steinbuch and Leon van Breugel for their contribution to this work.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This research was supported by Philips Innovation Services and by the Innovational Research Incentives Scheme under the VENI grant Precision Motion: Beyond the Nanometer [grant number 13073] awarded by NWO (The Netherlands Organisation for Scientific Research) and STW (Dutch Science Foundation).

References

Åström, K. (1970). Introduction to stochastic control theory. Mathematics in science and engineering (Vol. 70). New York, NY: Academic Press.
Google Scholar
Bazanella, A.S., Campestrini, L., & Eckhard, D. (2012). Data-driven controller design: The $H_{2}$ approach (1st ed.). Amsterdam: Springer.
Google Scholar
Biagiotti, L., & Melchiorri, C. (2012). FIR filters for online trajectory planning with time- and frequency-domain specifications. Control Engineering Practice, 20, 1385–1399.
Web of Science ®Google Scholar
Boeren, F., Blanken, L., Bruijnen, D., & Oomen, T. (2015). Rational iterative feedforward control: Optimal instrumental variable approach for enhanced performance. Proceedings of the conference on decision and control (pp. 6058–6063), Osaka, Japan.
Google Scholar
Boeren, F., Bruijnen, D., & Oomen, T. (2014). Iterative feedforward tuning approach and experimental verification for nano-precision motion systems. Proceedings of the ASME dynamic systems and control conference, San Antonio, TX, USA.
Google Scholar
Boeren, F., Bruijnen, D., van Dijk, N., & Oomen, T. (2014). Joint input shaping and feedforward for point-to-point motion: Automated tuning for an industrial nanopositioning system. IFAC Mechatronics, 24, 572–581.
Web of Science ®Google Scholar
Boeren, F., Oomen, T., & Steinbuch, M. (2014). Accuracy aspects in motion feedforward tuning. Proceedings of the American control conference (pp. 2178–2183), Portland, OR, USA.
Google Scholar
Boeren, F., Oomen, T., & Steinbuch, M. (2015). Iterative motion feedforward tuning: A data-driven approach based on instrumental variable identification. Control Engineering Practice, 37, 11–19.
Web of Science ®Google Scholar
Boerlage, M., Tousain, R., & Steinbuch, M. (2004). Jerk derivative feedforward control for motion systems. Proceedings of the American control conference (pp. 4843–4848), Boston, MA, USA.
Google Scholar
Bolder, J., & Oomen, T. (2015). Rational basis functions in iterative learning control – with experimental verification on a motion system. IEEE Transactions on Control Systems Technology, 23, 722–729.
Web of Science ®Google Scholar
Bristow, D., Tharayil, M., & Alleyne, A. (2006). A survey of iterative learning control: A learning-based method for high-performance tracking control. IEEE Control Systems Magazine, 26, 96–114.
Web of Science ®Google Scholar
Butterworth, J., Pao, L., & Abramovitch, D. (2012). Analysis and comparison of three discrete-time feedforward model-inverse control techniques for nonminimum-phase systems. IFAC Mechatronics, 22, 577–587.
Web of Science ®Google Scholar
Clayton, G.M., Tien, S., Leang, K.K., Zou, Q., & Devasia, S. (2009). A review of feedforward control approaches in nanopositioning for high-speed SPM. Journal of Dynamic Systems, Measurement, and Control, 131, 0611011–06110119.
Web of Science ®Google Scholar
Devasia, S. (2002). Should model-based inverse inputs be used as feedforward under plant uncertainty ? IEEE Transactions on Automatic Control, 47, 1865–1871.
Web of Science ®Google Scholar
Fleming, A.J. (2014). Measuring and predicting resolution in nanopositioning systems. IFAC Mechatronics, 24, 605–618.
Web of Science ®Google Scholar
Formentin, S., Karimi, A., & Savaresi, S.M. (2013a). Optimal input design for direct data-driven tuning of model-reference controllers. Automatica, 49, 1874–1882.
Web of Science ®Google Scholar
Formentin, S., van Heusden, K., & Karimi, A. (2013b). A comparison of model-based and data-driven controller tuning. International Journal of Adaptive Control and Signal Processing, 28, 882–897.
Web of Science ®Google Scholar
Forssell, U. (1999). Closed-loop identification: Methods, theory and applications. Linköping: Linköping University.
Google Scholar
Gilson, M., & Van den Hof, P. (2005). Instrumental variable methods for closed-loop system identification. Automatica, 41, 241–249.
Web of Science ®Google Scholar
Gorinevsky, D. (2002). Loop-shaping for iterative control of batch processes. IEEE Control Systems Magazine, 22, 55–65.
Web of Science ®Google Scholar
Groot Wassink, M., van de Wal, M., Scherer, C., & Bosgra, O. (2005). LPV control for a wafer stage: Beyond the theoretical solution. Control Engineering Practice, 13, 231–245.
Web of Science ®Google Scholar
Gunnarsson, S., & Norrlöf, M. (2001). On the design of ILC algorithms using optimization. Automatica, 37, 2011–2016.
Web of Science ®Google Scholar
Gunnarsson, S., & Norrlöf, M. (2006). On the disturbance properties of high order iterative learning control algorithms. Automatica, 42, 2031–2034.
Web of Science ®Google Scholar
Heertjes, M., Hennekens, D., & Steinbuch, M. (2010). MIMO feed-forward design in wafer scanners using a gradient approximation-based algorithm. Control Engineering Practice, 18, 495–506.
Web of Science ®Google Scholar
Hjalmarsson, H. (2009). System identification of complex and structured systems. European Journal of Control, 3, 275–310.
Web of Science ®Google Scholar
Hjalmarsson, H., Gevers, M., Gunnarsson, S., & Lequin, O. (1998). Iterative feedback tuning: Theory and applications. IEEE Control Systems, 18, 26–41.
Web of Science ®Google Scholar
Hoelzle, D.J., Johnson, A.J.W., & Alleyne, A.G. (2014). Bumpless transfer filter for exogenous feedforward signals. IEEE Transactions on Control Systems Technology, 22, 1581–1588.
Web of Science ®Google Scholar
Jakeman, A.J., & Young, P.C. (1979). Refined instrumental variable methods of time-series analysis: Parts II. Multivariable systems. International Journal of Control, 29, 621–644.
Web of Science ®Google Scholar
Janot, A., Gautier, M., Jubien, A., & Vandanjon, P.O. (2014). Comparison between the CLOE method and the DIDIM method for robots identification. IEEE Transaction on Control Systems Technology, 22, 1935–1941.
Web of Science ®Google Scholar
Janot, A., Vandanjon, P.O., & Gautier, M. (2014a). A generic instrumental variable approach for industrial robot identification. IEEE Transaction on Control Systems Technology, 22, 132–145.
Web of Science ®Google Scholar
Janot, A., Vandanjon, P.O., & Gautier, M. (2014b). An instrumental variable approach for rigid industrial robots identification. Control Engineering Practice, 25, 85–101.
Web of Science ®Google Scholar
Jung, Y., & Enqvist, M. (2013). Estimating models of inverse systems. Proceedings of the conference on decision and control (pp. 7143–7148), Firenze, Italy.
Google Scholar
Kara-Mohamed, M., Heath, W.P., & Lanzon, A. (2015). Enhanced tracking for nanopositioning systems using feedforward/feedback multivariable control design. IEEE Transactions on Control Systems Technology, 23, 1003–1013.
Web of Science ®Google Scholar
Karimi, A., Butcher, M., & Longchamp, R. (2008). Model-free precompensator tuning based on the correlation approach. IEEE Transactions on Control Systems Technology, 16, 1013–1020.
Web of Science ®Google Scholar
Khalil, W., & Dombre, E. (2002). Modeling, identification, and control of robots. Bristol, PA: Taylor & Francis.
Google Scholar
Kim, K.S., & Zou, Q. (2013). A modeling-free inversion-based iterative feedforward control for precision output tracking of linear time-invariant systems. IEEE/ASME Transactions on Mechatronics, 18, 1767–1777.
Web of Science ®Google Scholar
Kushner, H.J., & Yin, G.G. (2003). Stochastic approximation and recursive algorithms and applications. Stochastic modelling and applied probability (Vol. 35). New York, NY: Springer-Verlag.
Google Scholar
Lambrechts, P., Boerlage, M., & Steinbuch, M. (2005). Trajectory planning and feedforward design for electromechanical motion systems. Control Engineering Practice, 13, 145–157.
Web of Science ®Google Scholar
Mishra, S., Coaplen, J., & Tomizuka, M. (2007). Precision positioning of wafer scanners: Segmented iterative learning control for nonrepetitive disturbances. IEEE Control Systems Magazine, 27, 20–25.
Web of Science ®Google Scholar
Oomen, T., van Herpen, R., Quist, S., van de Wal, M., Bosgra, O., & Steinbuch, M. (2014). Connecting system identification and robust control for next-generation motion control of a wafer stage. IEEE Transactions on Control Systems Technology, 22, 102–118.
Web of Science ®Google Scholar
Puthenpura, S.C., & Sinha, N.K. (1986). Identification of continuous-time systems using instrumental variables with application to an industrial robot. IEEE Transactions on Industrial Electronics, IE-33, 224–229.
Web of Science ®Google Scholar
Skogestad, S., & Postlethwaite, I. (2005). Multivariable feedback control: Analysis and design (2nd ed.). West Sussex: John Wiley & Sons.
Google Scholar
Söderström, T. (2002). Discrete-time stochastic systems – estimation and control. London: Springer-Verlag.
Google Scholar
Söderström, T., & Stoica, P. (1983). Instrumental variable methods for system identification. Lecture notes in control and information sciences (Vol. 57). Berlin: Springer-Verlag.
Google Scholar
Söderström, T., & Stoica, P. (1989). System identification. Hemel Hempstead: Prentice Hall.
Google Scholar
Söderström, T., Stoica, P., & Trulsson, E. (1987). Instrumental variable methods for closed loop systems. Proceedings of the IFAC World Congress on automatic control (pp. 363–368), Munich, Germany.
Google Scholar
Steinbuch, M., & Norg, M. (1998). Advanced motion control: An industrial perspective. European Journal of Control, 4, 278–293.
Web of Science ®Google Scholar
van de Wal, M., van Baars, G., & Sperling, F. (2000). Reduction of residual vibrations by H∞ feedback control. Proceedings of the fifth motion and vibration conference (MOVIC) (Vol. 2, pp. 481–486), Sydney, Australia.
Google Scholar
van de Wijdeven, J., & Bosgra, O. (2010). Using basis functions in iterative learning control: Analysis and design theory. International Journal of Control, 83, 661–675.
Web of Science ®Google Scholar
van der Meulen, S., Tousain, R., & Bosgra, O. (2008). Fixed structure feedforward controller design exploiting iterative trials: Application to a wafer stage and a desktop printer. Journal of Dynamic Systems, Measurement, and Control, 130, 0510061–05100616.
Web of Science ®Google Scholar
Yoshida, K., Ikeda, N., & Mayeda, H. (1992). Experimental study of the identification methods for an industrial robot manipulator. Proceedings of the international conference on intelligent robots and systems (pp. 263–270), Raleigh, NC, USA.
Google Scholar
Young, P.C. (1976). Some observations on instrumental variable methods of time-series analysis. International Journal of Control, 23, 293–612.
Web of Science ®Google Scholar
Young, P.C. (2015). Refined instrumental variable estimation: Maximum likelihood optimization of a unified Box-Jenkins model. Automatica, 52, 35–46.
Web of Science ®Google Scholar
Young, P.C., & Jakeman, A.J. (1979). Refined instrumental variable methods of time-series analysis: Parts I. Single input, single output systems. International Journal of Control, 29, 1–30.
Web of Science ®Google Scholar
Young, P.C., & Jakeman, A.J. (1980). Refined instrumental variable methods of time-series analysis: Parts III. Extensions. International Journal of Control, 31, 741–764.
Web of Science ®Google Scholar
Zhong, H., Pao, L.Y., & de Callafon, R.A. (2012). Feedforward control for disturbance rejection: Model matching and other methods. Proceedings of the Chinese conference on decision and control (pp. 3525–3533), Taiyuan, China.
Google Scholar

Enhancing feedforward controller tuning via instrumental variables: with application to nanopositioning

ABSTRACT

1. Introduction