Search in:

Mathematical and Computer Modelling of Dynamical Systems

Methods, Tools and Applications in Engineering and Related Sciences

Volume 22, 2016 - Issue 4: Model Order Reduction

Submit an article Journal homepage

Free access

921

Views

CrossRef citations to date

Altmetric

Listen

Articles

Tangential interpolation-based eigensystem realization algorithm for MIMO systems

B. KramerDepartment of Aeronautics and Astronautics, Massachusetts Institute of Technology, Cambridge, MA, USACorrespondence[email protected]
View further author information

S. GugercinDepartment of Mathematics and Interdisciplinary Center for Applied Mathematics, Virginia Tech, Blacksburg, VA, USAView further author information

Pages 282-306 | Received 03 Nov 2015, Accepted 02 Jun 2016, Published online: 22 Jun 2016

Cite this article
https://doi.org/10.1080/13873954.2016.1198389
CrossMark

In this article

ABSTRACT
1. Introduction
2. Partial realization and Kung’s algorithm
3. Proposed method: tangential interpolation-based ERA (TERA)
4. Numerical results
5. Conclusions
Disclosure statement
Additional information
Footnotes
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

The eigensystem realization algorithm (ERA) is a commonly used data-driven method for system identification and reduced-order modelling of dynamical systems. The main computational difficulty in ERA arises when the system under consideration has a large number of inputs and outputs, requiring to compute a singular value decomposition (SVD) of a large-scale dense Hankel matrix. In this work, we present an algorithm that aims to resolve this computational bottleneck via tangential interpolation. This involves projecting the original impulse response sequence onto suitably chosen directions. The resulting data-driven reduced model preserves stability and is endowed with an a priori error bound. Numerical examples demonstrate that the modified ERA algorithm with tangentially interpolated data produces accurate reduced models while, at the same time, reducing the computational cost and memory requirements significantly compared to the standard ERA. We also give an example to demonstrate the limitations of the proposed method.

KEYWORDS:

System identification
MIMO systems
eigensystem realization algorithm
interpolation
Hankel matrix

1. Introduction

Control of complex systems can be achieved by using low-dimensional surrogate models that approximate the input–output behaviour of the original system accurately, and are much faster to simulate. When access to the internal description of the model is not available, data-driven techniques are used to approximate the system response. The field of subspace-based system identification (SI) provides powerful tools for fitting a linear time-invariant (LTI) system to given input–output responses of the measured system. Applications of subspace-based SI arise in many engineering disciplines, such as in aircraft wing flutter assessment [Citation1,Citation2], vibration analysis for bridges [Citation3], structural health analysis for buildings [Citation4], modelling of indoor-air behaviour of energy efficient buildings [Citation5], flow control [Citation6–Citation8], seismic imaging [Citation9] and many more. In all applications, the identification of LTI systems was crucial for analysis and control of the plant. An overview of applications and methods for subspace-based SI can be found in [Citation10] and more recently in [Citation11,Citation12].

The eigensystem realization algorithm (ERA) by Kung [Citation13] offers one solution to the SI problem, while simultaneously involving a model reduction step. The algorithm uses discrete-time impulse response data to construct reduced-order models via a singular value decomposition (SVD). Importantly, the resulting reduced models retain stability, see Section 2. Starting with Kung’s work [Citation13], various applications and extensions of the algorithm appeared in the literature [Citation1,Citation3,Citation9,Citation14–Citation17]. As an interesting result, Rowley and co-authors showed in [Citation6] that ERA is the data-driven approximation to balanced truncation, and compared ERA to balanced proper orthogonal decomposition (POD) [Citation18,Citation19] for a flow past an inclined plate. Balanced POD was found to provide superior reduced-order models, yet assumes that system matrices and their adjoints are available. The authors in [Citation20] propose a randomized POD technique to reduce the computational cost of extracting the dominant modes of the Hankel matrix.

Mechanical systems with multiple sensors and actuators are modelled as multi-input multi-output (MIMO) dynamical systems. Such systems impose additional computational challenges for SI and for ERA in particular. For instance, ERA requires a full SVD of a structured Hankel matrix, whose size scales linearly with the input and output dimension. Moreover, large Hankel matrices can arise if the dynamics of the system decay slowly.

We propose a SI and model reduction algorithm for MIMO systems which reduces the computational effort and storage compared to the standard ERA, see Section 3. The new algorithm projects the full impulse response data onto smaller input and output subspaces along carefully chosen left and right tangential directions to minimize the effect of the neglected impulse response data. Computing the SVD of the projected Hankel matrix then becomes feasible and can be executed in shorter time with less storage. Moreover, we show that reduced models obtained via the ERA from tangentially interpolated data (TERA) retain stability. Numerical results in Section 4 demonstrate the accuracy and computational savings of the modified ERA with projected data. The error bound in Theorem 3.4 shows the individual contributions of both the data interpolation and Hankel matrix approximation on the reduced-order model.

For notational convenience, we adopt MATLABFootnote¹ notation. Given a vector $x \in R^{n}$ and r ≤ n, the vector x (1: r) denotes the vector of the first r components of x. Similarly, for a matrix $A \in R^{n \times n}$ , we denote by A (1: r, 1: r) the leading r × r submatrix of A.

Remark 1.1:

A wide range of excellent model reduction techniques for LTI systems exist in the literature, see [Citation21–Citation23] for an overview. In particular, we shall mention balanced truncation [Citation24,Citation25] and balanced POD [Citation18,Citation19], the iterative rational Krylov algorithm (IRKA) [Citation26], and Hankel norm approximations [Citation27]. We do not propose to use ERA as a model reduction technique when state space matrices are available. We rather suggest to use ERA for the combined task of SI and model reduction where only black-box code or experimental measurements are available. In this case, the aforementioned model reduction techniques are not applicable.

Remark 1.2:

In this paper, our data will be restricted to time-domain samples of the impulse response of the underlying dynamical systems. In the frequency domain, this corresponds to sampling the transfer function and its derivates around infinity. For the cases where one has the exibility in choosing the frequency samples, a variety of techniques become available such as the Loewner framework [Citation28], vector fitting [Citation29,Citation30], realization-independent IRKA (transfer function-IRKA [TF-IRKA]) [Citation31] and various rational least-squares fitting methodologies [Citation30,Citation32–Citation34]. However, as stated earlier, our focus here is ERA and to make it computationally more efficient for MIMO systems with large input and output dimensions.

2. Partial realization and Kung’s algorithm

In practice, experimental measurements and outputs of black-box simulations are sampled at discrete time instances. Therefore, consider the discrete-time LTI system in state–space formFootnote²

(1)

x (t + 1) = A x (t) + B u (t),

(1)

(2)

y (t) = C x (t) + D u (t),

(2)

where $t \in N_{0}^{+}$ is a discrete-time instance. The initial condition x (0) = x₀ is assumed to be zero – the system will be excited through external disturbances. In equations (1) and (2), $A \in R^{n \times n}, B \in R^{n \times m}, C \in R^{p \times n}$ and $D \in R^{p \times m}$ are, respectively, state-to-state, state-to-input, state-to-output and feedthrough system matrices. The inputs are $u (t) \in R^{m}$ and the outputs are $y (t) \in R^{p}$ . The system is completely determined by the matrices (A, B, C, D). It is common to define the Markov parameters

(3)

h_{k} := \{\begin{matrix} D, k = 0 \\ C A^{k - 1} B, k = 1, 2, \dots \end{matrix}\} \in R^{p \times m},

(3)

so the output response equation for system (1)–(2) becomes

(4)

y (k) = \sum_{i = 0}^{k} h_{i} u (k - i),

(4)

which is known as the external description of the system and is fully determined by the Markov parameters. Unfortunately, in several practical scenarios, the matrices $(A, B, C, D)$ are not available; instead one has access to the sequence of Markov parameters, describing the reaction of the system to external inputs. If only the Markov parameters (and therefore the external description (4)) are available, how can one reconstruct the internal description (1)–(2) of an LTI system? This is the classical problem of partial realization.

Definition 2.1:

[Citation21, Definition 4.46] Given the finite set of p × m matrices $h_{i}, i = 1, 2, \dots, 2 s - 1$ , the partial realization problem consists of finding a positive integer n and constant matrices $A \in R^{n \times n}, B \in R^{n \times m}, C \in R^{p \times n}$ and $D \in R^{p \times m}$ , such that (3) holds.

A finite sequence of Markov parameters is always realizable and there always exists a minimal realization of order n = rank ( $H$ ). Define the Hankel matrix, denoted by $H$ , constructed by the $2 s - 1$ sampled Markov parameters:

(5)

H := [\begin{matrix} h_{1} & h_{2} & \dots & h_{s} \\ h_{2} & h_{3} & \dots & h_{s + 1} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ h_{s} & h_{s + 1} & \dots & h_{2 s - 1} \end{matrix}] \in R^{p s \times m s} .

(5)

The size of the Hankel matrix grows linearly with m and p. In this work, we propose to construct a projected Hankel matrix that is independent of the input and output dimensions and therefore does not exhibit such growth. For a better understanding of the algorithms to follow, assume for a moment that the system matrices are known, so that the Hankel matrix reads as

H = [\begin{matrix} C B & C A B & \dots & C A^{s - 1} B \\ C A B & C A^{2} B & \dots & C A^{s} B \\ ⋮ & ⋮ & ⋱ & ⋮ \\ C A^{s - 1} B & C A^{s} B & \dots & C A^{2 s - 1} B \end{matrix}] .

It is well known (e.g. [Citation21, Lemma 4.39]) that for a realizable impulse response sequence, the Hankel matrix can be factored into the product of the observability matrix $O$ and the controllability matrix $C$ :

(6)

H = [\begin{matrix} C \\ C A \\ ⋮ \\ C A^{s - 1} \end{matrix}] [B A B \dots A^{s - 1} B] := O C .

(6)

The shifted observability matrix satisfies

(7)

O^{(f)} A = O^{(l)},

(7)

where $O^{(f)}$ and $O^{(l)}$ denote the first and last $s - 1$ block rows of $O$ . Similarly for the controllability matrix, we obtain $A C^{(f)} = C^{(l)}$ .

Silverman [Citation35] proposed an algorithm to construct a minimal realization, which requires finding a rank $n$ submatrix of the partially defined Hankel matrix. The algorithm determines the $n$ th order minimal realization directly, and does not involve a model reduction step. If only a degree- $r$ approximation is constructed where $r < n$ , the algorithm does not guarantee to retain stability.

Kung’s ERA [Citation13] on the other hand, can be divided into two steps, which are briefly reviewed below. To guarantee stability, the following assumption is made in [Citation13].

Assumption 2.2:

Assume that $2 s - 1$ Markov parameters are given and that the given impulse response sequence is convergent in the sense that

h_{i} \to 0 f o r i > s .

As Kung pointed out in his original work [Citation13], this assumption needs further explanation. Clearly for asymptotically stable dynamical systems, $h_{i} \to 0$ as $i \to \infty .$ However, in the case of ERA where only finite length is collected, this assumption means that $| | h_{1} | | ≫ | | h_{s + 1} | |$ ; in other words, the Markov parameters have decayed significantly after $s$ steps. Following the original work [Citation13], we shall refer to this assumption as the property of the Markov parameters.

Step 1 of ERA: Low-rank approximation of Hankel matrix. Construct the Hankel matrix (5) from the given impulse response sequence ${h_{1}, h_{2}, \dots, h_{2 s - 1}}$ and compute its economy-sized SVD

H = U Σ V^{T} \in R^{p s \times m s},

where $U \in R^{p s \times \hat{k}}$ and $V \in R^{m s \times \hat{k}}$ are orthogonal matrices with $\hat{k} = min {m s, p s}$ , and $Σ \in R^{\hat{k} \times \hat{k}}$ is a square matrix containing singular values, $Σ_{i i} = σ_{i}, i = 1, \dots, \hat{k}$ (called Hankel singular values).Footnote³ Per definition, the Hankel singular values are the singular values of the underlying Hankel operator; see, for example, [Citation21], which are ordered as $σ_{1} \geq σ_{2} \geq \dots \geq σ_{n} > σ_{n + 1} = 0$ . The rank of the Hankel matrix is $n$ , the minimal realization order with $n \leq \hat{k}$ . Choose $r \leq n$ and rewrite the decomposition as

(8)

H = [U_{r} {\hat{U}}_{r}] [\begin{matrix} Σ_{r} & 0 \\ 0 & {\hat{Σ}}_{r} \end{matrix}] [\begin{matrix} V_{r}^{T} \\ {\hat{V}}_{r}^{T} \end{matrix}],

(8)

where $U_{r} \in R^{p s \times r}$ contains the leading $r$ columns of $U$ , the square matrix $Σ_{r} = d i a g (σ_{1}, σ_{2}, \dots, σ_{r})$ and $V_{r} \in R^{p s \times r}$ . The matrices ${\hat{U}}_{r}, {\hat{Σ}}_{r}$ and ${\hat{V}}_{r}$ have appropriate dimensions. Consequently, $U_{r}^{T} U_{r} = I_{r}$ and $V_{r}^{T} V_{r} = I_{r}$ . It follows that

H_{r} = U_{r} Σ_{r} V_{r}^{T}

is the best rank $r$ approximation of the Hankel matrix $H$ in the $∥ \cdot ∥_{2}$ and $∥ \cdot ∥_{F}$ norm. The approximation errors are given by $∥ H - H_{r} ∥_{2} = σ_{r + 1}$ and $∥ H - H_{r} ∥_{F} = \sqrt{σ_{r + 1}^{2} + \dots + σ_{n}^{2}}$ .

Step 2 of ERA: Approximate Realization of LTI System. It is the goal of this step to find a realization $(A_{r}, B_{r}, C_{r})$ of the best approximate Hankel matrix $H_{r}$ . Kung [Citation13] suggested that $H_{r}$ should have ‘Hankel structure’ as well, so that it can be factored into a product of an approximate observability and controllability matrix as

H_{r} = O_{r} C_{r}, w h e r e O_{r} = U_{r} Σ_{r}^{1 / 2}, C_{r} = Σ_{r}^{1 / 2} V_{r}^{T} .

In light of Equation (6), if $O_{r}$ is the approximation to the observability matrix, then its first block row can be used to estimate $C_{r}$ , therefore

(9)

C_{r} = [I_{p} 0] U_{r} Σ_{r}^{1 / 2},

(9)

where $I_{p}$ is the $p \times p$ identity matrix. Similarly, the first block column of $C_{r}$ yields an approximation of the control input matrix $B_{r}$ :

(10)

B_{r} = Σ_{r}^{1 / 2} V_{r}^{T} [I_{m} 0]^{T} .

(10)

To estimate the system matrix $A_{r}$ , the shift invariance property (7) is imposed on the approximate controllability and observability matrices as

O_{r}^{(f)} A_{r} = O_{r}^{(l)}, A_{r} C_{r}^{(f)} = C_{r}^{(l)} .

The matrix $O_{r}^{(f)} = O_{r} (1 : (s - 1) p, :)$ again denotes the first $s - 1$ block rows of $O_{r}$ . Similarly, $O_{r}^{(l)}$ refers to the last $s - 1$ block rows of $O_{r}$ . Either equality can be used to solve the least squares problem for $A_{r}$ . Without loss of generality, we focus on the first equality involving the observability matrix. Since $O_{r}^{(f)}$ is a $p (s - 1) \times r$ matrix, a least squares problem to minimize $∥ O_{r}^{(f)} A_{r} - O_{r}^{(l)} ∥$ has to be solved. The minimizing solution is given by the Moore–Penrose pseudo inverse [Citation36, Chapter 5] as

A_{r} = [O_{r}^{(f)}]^{†} O_{r}^{(l)} .

Define the matrix $U_{r}^{(f)}$ via $O_{r}^{(f)} = U_{r}^{(f)} Σ_{r}^{1 / 2}$ , and similarly for $U_{r}^{(l)}$ , so that $A_{r}$ is computed as

(11)

A_{r} = Σ_{r}^{- 1 / 2} [U_{r}^{(f)}]^{T} U_{r}^{(l)} Σ_{r}^{1 / 2} .

(11)

Theorem 2.3:

[Citation13] If the Markov parameters satisfy Assumption 2.2, then the realization given by $(A_{r}, B_{r}, C_{r})$ from (9), (10), (11) provides a stable discrete-time dynamical system. In addition,

(12)

\sum_{i = 1}^{2 s - 1} ∥ C_{r} A_{r}^{i - 1} B_{r} - h_{i} ∥_{F}^{2} \leq σ_{r + 1} (H) \sqrt{r + m + p},

(12)

where $p$ is the number of outputs, $m$ is the number of inputs, $r$ is the order of the reduced model and $σ_{r + 1} (H)$ denotes the first neglected Hankel singular value.

Theorem 2.3 reveals that if the original model is stable, then reduced order models of any order $r$ obtained through ERA are stable, too, with an a priori error bound for the impulse response reconstruction. The rank $n$ of the Hankel matrix is the order of the minimal realization. However, $n$ can be very large and the resulting model too big for design and control purposes. Instead, one would like to obtain reduced-order models of order $r ≪ n$ . The choice of $r$ depends on many factors, such as accuracy of the reduced-order model, performance criteria, limitations on implementable model orders etc.

Example 2.4:

This work has been motivated by the need to generate reduced-order models for the indoor-air behaviour in buildings, see [Citation5, Section 4]. The original model of interest has a large number of inputs and outputs, in particular, we are given m = 26 control inputs and p = 42 measured outputs. The impulse response data are sampled over 3600[s] with a Markov parameter measured every 2[s]. With standard ERA, this requires computing an SVD of size 37,800 × 23,400, which is a computationally challenging problem on a standard desktop machine.

3. Proposed method: tangential interpolation-based ERA (TERA)

To circumvent the bottleneck of computing the SVD of a large Hankel matrix, we propose to project the data sequence along left and right tangential directions before assembling the Hankel matrix. The proposed algorithm, denoted by TERA henceforth, has three stages:

Compute tangential directions and project the impulse response data into smaller input and output dimensions, see Section 3.2.
Use ERA on the projected Hankel matrix to obtain an approximation for the smaller input/output dimension, see Section 3.3.
Lift the reduced realization back to the original input and output dimensions to obtain the final approximation, see Section 3.3.

Our approach is motivated by rational approximation by tangential interpolation, as illustrated in the next section.

3.1. Tangential interpolation from data

A thorough treatment of rational interpolation of a given data set along tangential directions can be found in [Citation23,Citation37]. To illustrate the idea, assume for a moment that a discrete-time dynamical system as in (1)–(2) is given. By taking the $z$ -transform of these equations, we obtain the transfer function $G (z) = C (z I - A)^{- 1} B + D,$ which maps the inputs to the outputs in the frequency domain via $\hat{y} (z) = G (z) \hat{u} (z)$ where $\hat{y} (z)$ and $\hat{u} (z)$ denote the $z$ -transforms of $y (t)$ and $u (t)$ , respectively. Model reduction through rational interpolation seeks a reduced-order transfer function $G_{r} (z) = C_{r} (z E_{r} - A_{r})^{- 1} B_{r} + D_{r}$ , with $A_{r} \in R^{r \times r}, B_{r} \in R^{r \times m}$ , $C_{r} \in R^{p \times r}$ and $D_{r} \in R^{p \times m}$ such that $G (z_{i}) = G_{r} (z_{i})$ for a set of interpolation points ${z_{i} : i = 1, 2, \dots, k}$ . However, for MIMO systems, this is too restrictive since it imposes $p \times m$ conditions for every interpolation point leading to unnecessarily high reduced orders. The concept of tangential interpolation eases those restrictions by only enforcing interpolation along certain directions. Assume that the transfer function $G (\cdot)$ is sampled at $r$ points ${θ_{i} : i = 1, 2, \dots, r}$ along the right tangential directions $w_{i} \in C^{m}$ and $r$ points ${μ_{i} : i = 1, 2, \dots, r}$ along the left tangential directions $v_{i} \in C^{p}$ ; that is, $G (θ_{i}) w_{i}$ and $v_{i}^{T} G (μ_{i})$ are measured. Then, the Loewner framework [Citation28] produces a reduced model $G_{r} (z)$ that tangentially interpolates the given data, that is,

v_{i}^{T} G (μ_{i}) = v_{i}^{T} G_{r} (μ_{i}) a n d G (θ_{i}) w_{i} = G_{r} (θ_{i}) w_{i} .

The details of how the interpolant $G_{r} (z) = C_{r} (z E_{r} - A_{r})^{- 1} B_{r}$ is constructed can be found in [Citation23,Citation28]; here we only show how $E_{r}$ is constructed:

(13)

E_{r} (i, j) = - \frac{v_{i}^{T} (G (μ_{i}) - G (θ_{j})) w_{j}}{μ_{i} - θ_{j}}, f o r i, j = 1, \dots, r .

(13)

The matrix $E_{r}$ is related to a divided difference matrix (called the Loewner matrix) corresponding to $G (\cdot)$ . However, in filling the entries of $E_{r}$ , neither the full-matrix data $G (μ_{i}) \in C^{m \times p}$ nor $G (θ_{i}) \in C^{m \times p}$ is used; instead the tangential data $v_{i}^{T} G (μ_{i}) w_{j} \in C$ and $v_{i}^{T} G (θ_{j}) w_{j} \in C$ are used. Thus dependence on the input and output dimensions are avoided. Without this modification, the reduced matrix $E_{r}$ would be of dimension $(r \cdot m) \times (r \cdot p)$ as opposed to $r \times r$ . This is the motivation for our modification to ERA.

Remark 3.1

The choice of interpolation points and tangential directions are of fundamental importance in model reduction by interpolation. The IRKA of [Citation26] provides a locally optimal strategy in the $H_{2}$ norm. In [Citation31], IRKA has been recently coupled with the Loewner approach to find optimal reduced models in a data-driven setting. However, this approach cannot be applied here since in the ERA setting, the available frequencies are fixed: one can only sample the Markov parameters, which corresponds to sampling the transfer function and its derivatives at infinity.

3.2. Projection of Markov parameters

Inspired by tangential interpolation in the Loewner framework, for systems with high dimensional input and output spaces we will project the impulse response samples $h_{i}$ onto low dimensional subspaces via multiplications by tangential directions. However, achieving this goal in the ERA set-up comes with major additional difficulties that do not appear in the Loewner framework. Therein, the elegant construction of the reduced-model quantities $B_{r}$ and $C_{r}$ guarantee that the number of rows and columns still match the original input and output dimensions even when the tangential interpolation is employed. In other words, only the system dimension is reduced without changing the input/output dimensions. However, in ERA, once the Markov parameters $h_{i} \in R^{p \times m}$ are replaced by the (tangentially) projected quantities ${\hat{h}}_{i} \in R^{ℓ_{1} \times ℓ_{2}}$ where $ℓ_{1} < p$ and $ℓ_{2} < m$ , the reduced model via ERA will have $ℓ_{2}$ inputs and $ℓ_{1}$ outputs; thus the original input and output dimensions will be lost. Therefore, one will need to carefully lift this reduced model back to the original $m$ -inputs and $p$ -outputs spaces. The second difficulty arises from the fact that sampling Markov parameters means sampling $G (\cdot)$ only around infinity. Since we are interested in approximating not only the first Markov parameter but also the higher-order ones (up to order $2 s - 1$ ), with an analogy to tangential interpolation, we need to choose the same tangential directions for every sample. Since selecting a single direction for all the Markov parameters will be extremely restrictive, we will pick multiple dominant tangential directions to project all the Markov parameters.

To deal with large input and output spaces, the authors in [Citation20] use a randomized selection of inputs and outputs and subsequently collect primal and dual simulation data reducing computational time and storage requirements for the SVD of the Hankel matrix. However, the method assumes that primal and dual simulations can be performed separately, which is not possible in several situations and which we will not assume. In [Citation6] and [Citation19], the authors consider fluid dynamical applications, where the output of interest is often the entire state, leading to an enormous output space. Hence, standard ERA is not feasible, especially since the complex dynamical behaviour of fluid systems makes it necessary to sample many Markov parameters. The authors suggest to project the output space onto a low-dimensional manifold and use ERA subsequently. However, this is mentioned as a rather short remark without any details or error analysis and an algorithm to recover the original output dimension is not given, a crucial difficulty arising in the ERA setup as mentioned above. Terminal (input/output) reduction algorithms for model reduction of linear (often circuit) systems are considered within the ESVDMOR framework [Citation38–Citation40]. There, it is assumed that the internal description of the systems is available, which can efficiently be exploited in the terminal reduction framework. Moreover, not only the inputs, but also the number of outputs can cause computational challenges. Recall Example 2.4, where both input and output dimensions are large ( $m = 26$ and $p = 42$ ), which leads to a challenging computation of the SVD. Therefore, we propose a modified ERA method that works with a two-sided interpolation version of the Markov parameters while guaranteeing stability of the reduced model endowed with an error bound. The minimization problem behind the proposed method is to find two projectors $P_{1}$ and $P_{2}$ that solve

(14)

min_{\binom{r a n k (P_{1}) = ℓ_{1}}{r a n k (P_{2}) = ℓ_{2}}} \sum_{i = 1}^{2 s - 1} ∥ P_{1} h_{i} P_{2} - h_{i} ∥_{F}^{2} .

(14)

Ideally, one would like to pick individual projectors $P_{1}^{(i)}$ and $P_{2}^{(i)}$ for every Markov parameter to produce the minimal error $\sum_{i = 1}^{2 s - 1} \sum_{j = ℓ + 1}^{min {m, p}} σ_{j}^{2} (h_{i})$ , where $ℓ = ℓ_{1} = ℓ_{2}$ . However this is impractical since, in an analogy to tangential interpolation, it would correspond to choosing different tangential directions for $G (\cdot)$ and $G^{'} (\cdot)$ . Therefore we restrict ourselves to finding two orthogonal projectors, which are used for the entire dataset of Markov parameters. In Sections 3.3 and 3.4, we will see that this choice of $P_{1}$ and $P_{2}$ preserves the structure of the Hankel matrix at the cost of a suboptimal approximation error.

Thus, $P_{1}$ and $P_{2}$ will be constructed such that

\begin{aligned} P_{1} & = W_{1} W_{1}^{T}, r a n k (P_{1}) = ℓ_{1}, \\ P_{2} & = W_{2} W_{2}^{T}, r a n k (P_{2}) = ℓ_{2}, \end{aligned}

where $W_{1}^{T} W_{1} = I_{ℓ_{1}}$ and $W_{2}^{T} W_{2} = I_{ℓ_{2}}$ . The goal is to compute $P_{1}$ and $P_{2}$ by considering data streams of Markov parameters. As opposed to solving (14) for $P_{1}$ and $P_{2}$ jointly, we will construct $P_{1}$ and $P_{2}$ by solving two separate optimization problems. The reason for this is once again due to the preservation of the Hankel structure, and will be clarified in Section 3.4 in the proof of Theorem 3.4. To compute $P_{1}$ , we arrange the impulse response sequence in a matrix

(15)

Θ_{L} := [h_{1} h_{2} \dots h_{2 s - 1}] \in R^{p \times m (2 s - 1)},

(15)

and solve the optimization problem

(16)

P_{1} = arg min_{r a n k ({\tilde{P}}_{1}) = ℓ_{1}} | | {\tilde{P}}_{1} Θ_{L} - Θ_{L} | |_{F}^{2} .

(16)

The optimal solution of (16) is given by the SVD of $Θ_{L} = U Σ V^{T}$ , and $P_{1} = W_{1} W_{1}^{T}$ where $W_{1} = U (:, 1 : ℓ_{1})$ denotes the leading $ℓ_{1}$ columns of $U$ . The corresponding minimum error is then given by $| | W_{1} W_{1}^{T} Θ_{L} - Θ_{L} | |_{F}^{2} = \sum_{i = ℓ_{1} + 1}^{p} σ_{i}^{2} (Θ_{L})$ where $σ_{i} (Θ_{L})$ denotes the $i$ th singular value of $Θ_{L}$ . To compute $P_{2}$ , we define

(17)

Θ_{R} := [\begin{matrix} h_{1} \\ h_{2} \\ ⋮ \\ h_{2 s - 1} \end{matrix}] \in R^{p (2 s - 1) \times m}

(17)

and consider the corresponding optimization problem

(18)

P_{2} = arg min_{r a n k ({\tilde{P}}_{2}) = ℓ_{2}} | | Θ_{R} {\tilde{P}}_{2} - Θ_{R} | |_{F}^{2} .

(18)

Similarly, compute the SVD of $Θ_{R} = \tilde{U} \tilde{Σ} {\tilde{V}}^{T}$ , and the optimal solution is $P_{2} = W_{2} W_{2}^{T}$ , where $W_{2} = \tilde{V} (:, 1 : ℓ_{2})$ . The minimal error is given by $| | W_{2} W_{2}^{T} Θ_{R} - Θ_{R} | |_{F}^{2} = \sum_{i = ℓ_{2} + 1}^{p} σ_{i}^{2} (Θ_{R})$ . Recall that our goal is to reduce the size of the Markov parameters, and consequently to lessen the cost of the SVD of the Hankel matrix. The factors $W_{1}$ and $W_{2}$ are employed to project the Markov parameters using

(19)

{\hat{h}}_{i} = W_{1}^{T} h_{i} W_{2} \in R^{ℓ_{1} \times ℓ_{2}} .

(19)

Equation (19) can be considered analogous to tangential interpolation where the transfer function $G (z_{i})$ ( $z_{i} = \infty$ in this case) and its derivatives are sampled along various tangential directions; the columns of $W_{1}$ and $W_{2}$ . The projected values ${\hat{h}}_{i}$ are subsequently used to construct a reduced size Hankel matrix $\hat{H}$ . For this, define the block diagonal matrices

(20)

W_{1} := d i a g (W_{1}, \dots, W_{1}), W_{2} := d i a g (W_{2}, \dots, W_{2}) .

(20)

Then the projected Hankel matrix becomes

(21)

\hat{H} = W_{1}^{T} H W_{2} \in R^{s ℓ_{1} \times s ℓ_{2}} .

(21)

Unlike the case for $H$ , the row and column dimensions of $\hat{H}$ are independent of the original input and output dimensions m and p.

3.3. ERA for projected Hankel matrix and recovering original input/output dimensions

Once the projected Hankel matrix (21) is computed, ERA can be applied. However due to the projected input and output dimensions, control and observation matrices are identified in the reduced output/reduced input spaces. Thus, the goal of TERA is to lift these spaces optimally back to the original dimension to recover the full input and output dimensions. The Hankel matrix from tangentially interpolated data is given by

(22)

\hat{H} = [\hat{h}]_{i j} = [\begin{matrix} W_{1}^{T} \\ ⋱ \\ W_{1}^{T} \end{matrix}] [\begin{matrix} h_{1} & h_{2} & \dots & h_{s} \\ h_{2} & h_{3} & \dots & h_{s + 1} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ h_{s} & h_{s + 1} & \dots & h_{2 s - 1} \end{matrix}] [\begin{matrix} W_{2} \\ ⋱ \\ W_{2} \end{matrix}] .

(22)

Using the definitions of $H$ , we can rewrite (22) as

\hat{H} = [\begin{matrix} W_{1}^{T} \\ ⋱ \\ W_{1}^{T} \end{matrix}] [\begin{matrix} C \\ C A \\ ⋮ \\ C A^{s - 1} \end{matrix}] [B A B \dots A^{s - 1} B] [\begin{matrix} W_{2} \\ ⋱ \\ W_{2} \end{matrix}],

and by defining $\hat{C} = W_{1}^{T} C$ and $\hat{B} = B W_{2}$ , the Hankel matrix from interpolated data can be decomposed such that

\hat{H} = [\begin{matrix} \hat{C} \\ \hat{C} A \\ ⋮ \\ \hat{C} A^{s - 1} \end{matrix}] [\hat{B} A \hat{B} \dots A^{s - 1} \hat{B}] .

This illustrates how to identify $(\hat{A}, \hat{B}, \hat{C})$ from the interpolated Hankel matrix. The best rank $r$ approximation of the projected Hankel matrix is given by the truncated SVD

{\hat{H}}_{r} = {\hat{U}}_{r} {\hat{Σ}}_{r} {\hat{V}}_{r}^{T} = {\hat{O}}_{r} {\hat{C}}_{r},

where ${\hat{O}}_{r} = {\hat{U}}_{r} {\hat{Σ}}_{r}^{1 / 2}$ and ${\hat{C}}_{r} = {\hat{Σ}}_{r}^{1 / 2} {\hat{V}}_{r}^{T}$ represent the approximate observability and controllability matrices, respectively. As before, the first block row of ${\hat{O}}_{r}$ gives an approximation for ${\hat{C}}_{r}$ , the observation matrix matching the interpolated impulse response, so

{\hat{C}}_{r} = [I_{ℓ_{1}} 0] {\hat{U}}_{r} {\hat{Σ}}_{r}^{1 / 2} \in R^{ℓ_{1} \times r} .

Analogously, the first block column of ${\hat{C}}_{r}$ yields an approximation for ${\hat{B}}_{r}$ , the control input matrix for the interpolated impulse response sequence, which reads as

{\hat{B}}_{r} = {\hat{Σ}}_{r}^{1 / 2} {\hat{V}}_{r}^{T} [I_{ℓ_{2}} 0]^{T} \in R^{r \times ℓ_{2}} .

To solve the least squares problem for the system matrix ${\hat{A}}_{r}$ , one proceeds as in the previous subsection, so that

(23)

{\hat{A}}_{r} = [{\hat{O}}_{r}^{(f)}]^{†} {\hat{O}}_{r}^{(l)} = {\hat{Σ}}_{r}^{- 1 / 2} [{\hat{U}}_{r}^{(f)}]^{T} {\hat{U}}_{r}^{(l)} {\hat{Σ}}_{r}^{1 / 2},

(23)

which is computed as in (11) with appropriate matrices. To illuminate the connection between $A_{r}$ in (11) obtained from standard ERA and ${\hat{A}}_{r}$ in (23) obtained from the projected sequence, let ${\tilde{W}}_{1}^{T}$ denote the matrix obtained from deleting the last block row and column from $W_{1}^{T}$ , and similarly for ${\tilde{W}}_{2}^{T}$ . Then we have that ${\hat{O}}_{r}^{(f)} = {\tilde{W}}_{1}^{T} O_{r}^{(f)}$ , and it readily follows that

{\hat{A}}_{r} = [{\tilde{W}}_{1}^{T} O_{r}^{(f)}]^{†} {\tilde{W}}_{1}^{T} O_{r}^{(l)} = [O_{r}^{(f)}]^{†} {\tilde{W}}_{1} {\tilde{W}}_{1}^{T} O_{r}^{(l)} .

Recall from (11) that $A_{r} = [O_{r}^{(f)}]^{†} O_{r}^{(l)}$ . Thus, ${\hat{A}}_{r}$ works with $O_{r}^{(l)}$ projected onto the range of ${\tilde{W}}_{1}$ . Note that ${\tilde{W}}_{1} {\tilde{W}}_{1}^{T} \neq I$ unless ${\tilde{W}}_{1}$ is square (i.e. when there is no reduction in input and output dimension in which case one recovers the standard ERA.) The identified system matrices ${\hat{A}}_{r}$ , ${\hat{B}}_{r}$ and ${\hat{C}}_{r}$ match the projected Markov parameters

{\hat{h}}_{i} \approx {\hat{C}}_{r} {\hat{A}}_{r}^{i - 1} {\hat{B}}_{r}, i = 1, \dots, 2 s - 1

with a similar type error bound as in the original ERA, see Corollary 3.3. below.

While ${\hat{A}}_{r}$ is an $r \times r$ matrix (matching the original ERA construction), ${\hat{B}}_{r}$ has $ℓ_{2}$ columns (as opposed to $m$ ) and ${\hat{C}}_{r}$ has $ℓ_{1}$ rows (as opposed to $p$ ). Therefore, we need to lift ${\hat{B}}_{r}$ and ${\hat{C}}_{r}$ to the original input/output dimensions. By virtue of the minimization problem (14), the original input–output dimension of the system can be recovered through injection of ${\hat{h}}_{i}$ to $R^{p \times m}$ . Recall that ${\hat{h}}_{i} = W_{1}^{T} h_{i} W_{2}$ . Therefore, ${{\hat{C}}_{r} {\hat{A}}_{r}^{i - 1} {\hat{B}}_{r}}_{i = 1}^{2 s - 1}$ approximates ${W_{1}^{T} h_{i} W_{2}}_{i = 1}^{2 s - 1}$ in the least-squares sense. To approximate the original sequence ${h_{i}}_{i = 1}^{2 s - 1}$ , replace ${\hat{C}}_{r}$ with $W_{1} {\hat{C}}_{r}$ and ${\hat{B}}_{r}$ with ${\hat{B}}_{r} W_{2}^{T}$ . In other words, the original impulse response sequence is approximated via

(24)

h_{i} \approx \underset{:= C_{r}}{\underset{⏟}{W_{1} {\hat{C}}_{r}}} \underset{:= A_{r}}{\underset{⏟}{{\hat{A}}_{r}}}^{i - 1} \underset{:= B_{r}}{\underset{⏟}{{\hat{B}}_{r} W_{2}^{T}}} = C_{r} A_{r}^{i - 1} B_{r},

(24)

yielding the final reduced-model quantities

(25)

\begin{matrix} A_{r} & = & {\hat{Σ}}_{r}^{- 1 / 2} {[{\hat{U}}_{r}^{(f)}]}^{T} {\hat{U}}_{r}^{(l)} {\hat{Σ}}_{r}^{1 / 2}, \\ B_{r} & = & {\hat{Σ}}_{r}^{1 / 2} {\hat{V}}_{r}^{T} {[I_{ℓ_{2}} 0]}^{T} W_{2}^{T}, \\ C_{r} & = & W_{1} [I_{ℓ_{1}} 0] {\hat{U}}_{r} {\hat{Σ}}_{r}^{1 / 2} . \end{matrix}

(25)

The modified ERA for tangentially interpolated data, henceforth denoted by TERA (tangential ERA), is given in Algorithm 1.

Algorithm 1. TERA

Display Table

3.4. Error analysis and stability

We first show that TERA retains stability.

Corollary 3.2

Let Assumption 2.2 hold (as in the case of the standard ERA). Then the reduced model given by the matrices $A_{r}, B_{r}, C_{r}$ in (25) obtained via TERA from the projected data is a stable dynamical system.

Proof. The projected Markov parameters are ${\hat{h}}_{i} = W_{1}^{T} h_{i} W_{2}$ , where $W_{1} \in R^{p \times ℓ_{1}}$ and $W_{2} \in R^{m \times ℓ_{2}}$ have orthonormal columns. It follows from Assumption 2.2, that $∥ h_{i} ∥_{F} \to 0$ when $i > s$ . Therefore,

∥ {\hat{h}}_{i} ∥_{F} =∥ W_{1}^{T} h_{i} W_{2} ∥_{F} \leq∥ W_{1} ∥_{F} ∥ h_{i} ∥_{F} ∥ W_{2} ∥_{F} \to 0 w h e n i > s .

Thus, it follows that ${\hat{h}}_{i} \to 0$ as $i > s$ , so that the projected impulse response satisfies the convergence to zero property. Since the reduced matrix $A_{r}$ for TERA is obtained by the standard ERA for the projected data, Theorem 2.3 yields stability of the extracted reduced-order model, which completes the proof.

Using Theorem 2.3, we can directly obtain an error bound for the interpolated Markov parameters.

Corollary 3.3:

With the number of left and right tangential directions $ℓ_{1}, ℓ_{2}$ , respectively, the error in the Markov parameter sequence generated by TERA is given by

\sum_{i = 1}^{2 s - 1} ∥ {\hat{C}}_{r} {\hat{A}}_{r}^{i - 1} {\hat{B}}_{r} - {\hat{h}}_{i} ∥_{F}^{2} \leq \sqrt{r + ℓ_{1} + ℓ_{2}} \cdot σ_{r + 1} (\hat{H}) .

Proof. Recall that when ERA is applied to ${\hat{h}}_{i}$ , it yields a stable reduced-order model as shown in Corollary 3.2. Using $m = ℓ_{1}$ and $p = ℓ_{2}$ in Theorem 2.3, the result follows directly, by replacing all quantities by the ‘hat’ quantities.

Corollary 3.3 gives a bound for the error in the interpolated (projected) Markov parameters. However, the real quantity of interest is the error in the reconstruction of the original full Markov parameter sequence ${h_{i}}$ . The next results answers this question.

Theorem 3.4:

Let ${h_{i}}_{i = 1, \dots, 2 s - 1}$ be the original sequence of Markov parameters, and let ${C_{r} A_{r}^{i - 1} B_{r}}_{i = 1, \dots, 2 s - 1}$ be the identified sequence via TERA in (24). The approximation error is given by

(26)

\begin{aligned} \sum_{i = 1}^{2 s - 1} ∥ h_{i} - C_{r} A_{r}^{i - 1} B_{r} ∥_{F}^{2} \\ \leq 4 (\sum_{i = ℓ_{1} + 1}^{p} σ_{i}^{2} (Θ_{L}) + \sum_{i = ℓ_{2} + 1}^{m} σ_{i}^{2} (Θ_{R})) + 2 \sqrt{r + ℓ_{1} + ℓ_{2}} \cdot σ_{r + 1} (\hat{H}), \end{aligned}

(26)

where $Θ_{L}$ and $Θ_{R}$ are as defined in (15) and (17), respectively.

Proof.

We use the definitions of $A_{r}, B_{r}$ and $C_{r}$ in (24) to rewrite the error in terms of ${\hat{B}}_{r}$ and ${\hat{C}}_{r}$ , $W_{1}$ and $W_{2}$ , and then split the error into two parts using the projectors $P_{1}$ in (16) and $P_{2}$ in (18):

\begin{aligned} \sum_{i = 1}^{2 s - 1} | | h_{i} - C_{r} A_{r}^{i - 1} B_{r} | |_{F}^{2} = \sum_{i = 1}^{2 s - 1} | | h_{i} - W_{1} {\hat{C}}_{r} {\hat{A}}_{r}^{i - 1} {\hat{B}}_{r} W_{2}^{T} | |_{F}^{2} \\ = \sum_{i = 1}^{2 s - 1} | | \underset{=: T_{i}}{\underset{⏟}{h_{i} - P_{1} h_{i} P_{2}}} + \underset{=: Z_{i}}{\underset{⏟}{P_{1} h_{i} P_{2} - W_{1} {\hat{C}}_{r} {\hat{A}}_{r}^{i - 1} {\hat{B}}_{r} W_{2}^{T}}} | |_{F}^{2} \\ = \sum_{i = 1}^{2 s - 1} | | T_{i} + Z_{i} | |_{F}^{2} \\ \leq 2 (\underset{ε_{1}}{\underset{⏟}{\sum_{i = 1}^{2 s - 1} | | T_{i} | |_{F}^{2}}} + \underset{ε_{2}}{\underset{⏟}{\sum_{i = 1}^{2 s - 1} | | Z_{i} | |_{F}^{2}}}) . \end{aligned}

Next, we give estimates for the two error terms

ε_{1}

and

ε_{2}

. We begin with

ε_{1}

(27)

\begin{aligned} ε_{1} = \sum_{i = 1}^{2 s - 1} | | h_{i} - P_{1} h_{i} P_{2} | |_{F}^{2} \\ = \sum_{i = 1}^{2 s - 1} | | h_{i} - P_{1} h_{i} + P_{1} (h_{i} - h_{i} P_{2}) | |_{F}^{2} = | | Θ_{L} - P_{1} Θ_{L} + P_{1} (Θ_{L} - Θ_{L} P_{2}) | |_{F}^{2} \\ \leq 2 (| | Θ_{L} - P_{1} Θ_{L} | |_{F}^{2} + | | Θ_{L} - Θ_{L} P_{2} | |_{F}^{2}), \end{aligned}

(27)

where

_{2} = diag (P_{2}, \dots, P_{2})

is block diagonal, and we used in the last equality that

P_{1}

is an orthogonal projector and thus

∥ P_{1} Z ∥_{F} \leq∥ Z ∥_{F}

. For the first term in the sum, it follows from the definition of

P_{1}

in (16) (and by the SVD) that

| | Θ_{L} - P_{1} Θ_{L} | |_{F}^{2} = \sum_{i = ℓ_{1} + 1}^{p} σ_{i}^{2} (Θ_{L}) .

The second term in the sum can be rewritten as

\begin{aligned} | | Θ_{L} - Θ_{L} P_{2} | |_{F}^{2} = | | [h_{1}, h_{2}, \dots, h_{2 s - 1}] - [h_{1} P_{2}, h_{2} P_{2}, \dots, h_{2 s - 1} P_{2}] | |_{F}^{2} \\ = | | Θ_{R} P_{2} - Θ_{R} | |_{F}^{2} \\ = \sum_{i = ℓ_{2} + 1}^{m} σ_{i}^{2} (Θ_{R}), \end{aligned}

where the last equality follows from the definition of

P_{2}

in (18). Collecting the terms yields

ε_{1} \leq 2 (\sum_{i = ℓ_{1} + 1}^{p} σ_{i}^{2} (Θ_{L}) + \sum_{i = ℓ_{2} + 1}^{m} σ_{i}^{2} (Θ_{R})) .

The term

ε_{2}

can be simplified using the orthogonality of

W_{1}

and

W_{2}

and by using Corollary 3.3; namely, we obtain

\begin{aligned} ε_{2} = \sum_{i = 1}^{2 s - 1} | | P_{1} h_{i} P_{2} - W_{1} {\hat{C}}_{r} {\hat{A}}_{r}^{i - 1} {\hat{B}}_{r} W_{2}^{T} | |_{F}^{2} \\ = \sum_{i = 1}^{2 s - 1} | | W_{1} W_{1}^{T} h_{i} W_{2} W_{2}^{T} - W_{1} {\hat{C}}_{r} {\hat{A}}_{r}^{i - 1} {\hat{B}}_{r} W_{2}^{T} | |_{F}^{2} \\ = \sum_{i = 1}^{2 s - 1} | | W_{1}^{T} h_{i} W_{2} - {\hat{C}}_{r} {\hat{A}}_{r}^{i - 1} {\hat{B}}_{r} | |_{F}^{2} \\ = \sqrt{r + ℓ_{1} + ℓ_{2}} \cdot σ_{r + 1} (\hat{H}) . \end{aligned}

Collecting the terms, we obtain

\begin{aligned} \sum_{i = 1}^{2 s - 1} ∥ h_{i} - C_{r} A_{r}^{i - 1} B_{r} ∥_{F}^{2} \leq 2 (ε_{1} + ε_{2}) \\ \leq 4 (\sum_{i = ℓ_{1} + 1}^{p} σ_{i}^{2} (Θ_{L}) + \sum_{i = ℓ_{2} + 1}^{m} σ_{i}^{2} (Θ_{R})) + 2 \sqrt{r + ℓ_{1} + ℓ_{2}} \cdot σ_{r + 1} (\hat{H}), \end{aligned}

which completes the proof.

Remark 3.5:

In addition to the need to preserve the Hankel structure in the projected data (so that ERA can be applied to the tangentially interpolated data), the error term $ε_{1}$ in (27) also reveals why the projections $P_{1}$ and $P_{2}$ were computed separately via (16) and (18), respectively; as opposed to the joint optimization problem (14).

4. Numerical results

In this section, we present numerical results for TERA (Algorithm 3.3) and Kung’s standard ERA. To test these algorithms, a mass spring damper model (MSD) and a cooling model for steel profiles (Rail) are considered. The main computational difference between ERA and TERA is the size of the SVD that needs to be computed. As we will illustrate, TERA offers significant computational savings by working with the SVD of a reduced Hankel matrix, see .

Table 1. Specifications, CPU times to execute, and time savings for the numerical examples. Solved on a cluster with a 6-core Intel Xeon X5680 CPU at 3.33GHz and 48GB RAM, with MATLAB2013b.

Download CSV Display Table

ERA assumes a discrete-time model. The examples we consider are continuous-time dynamical systems, that is, they have the form

(28)

\dot{x} (t) = A_{c} x (t) + B_{c} u (t), y (t) = C_{c} x (t),

(28)

with $t \in R^{+}$ , where the subscripts are used to emphasize the continuous-time setting. Therefore we convert these continuous-time models to discrete-time via a bilinear transformation [Citation41], mapping the left half-plane onto the unit circle. Once the reduced models are computed via ERA and TERA, we use the original system dynamics only for illustration purposes to present a more detailed comparison both in the time-domain by comparing time-domain simulations and in the frequency domain by comparing Bode plots. We emphasize that the matrices $A_{c}, B_{c}, C_{c}$ are never used in the algorithms. Both ERA and TERA have only access to impulse response data.

4.1. Mass spring damper system

This model is taken from [Citation42] and describes a mass spring damper system with masses $m_{i}$ , spring constants k_i and damping coefficients $c_{i}$ $\geq 0$ for $i = 1, 2, \dots, n / 2$ . The state variables are the displacement and momentum of the masses, and the outputs are the velocities of some selected masses. We refer to [Citation42, Section 6] for more details about the model. The model dimension is $n = 1, 000$ , which is equivalent to 500 mass spring damper elements. All masses are $m_{i} = 4$ , the spring constants are $k_{i} = 4$ and the damping coefficients are $c_{i} = 0.1$ for $i = 1, 2, \dots, 500$ . The number of inputs is equal to the number of outputs, namely $m = p = 30$ . We collect $2 s = 1, 000$ Markov parameters. In , the decay of the normalized Markov parameters is plotted, $∥ h_{i} ∥_{F} / ∥ h_{1} ∥_{F}$ . We observe a steep initial decay, followed by a slower decay. shows the singular values of $Θ_{L}$ and $Θ_{R}$ , which in turn help decide how many input/output directions $ℓ_{1}, ℓ_{2}$ are needed in the TERA framework.

Figure 1. MSD model: The Markov parameters decay slowly in time (a); the singular values of the matrices $Θ_{L}$ and $Θ_{R}$ help guide the decision about the number of input/output interpolation directions needed (b).

Application of the standard ERA requires computing an SVD of size $15, 000 \times 15, 000$ . On the other hand, in TERA, we pick $ℓ_{1} = ℓ_{2} = 7$ , reducing the SVD dimension to $3500 \times 3500$ . Even though we picked $ℓ_{1} = ℓ_{2} = 7$ for TERA, we illustrate the leading hundred normalized singular values of both the full Hankel matrix $H$ and several projected Hankel matrices $\hat{H}$ in for various $ℓ_{1}$ and $ℓ_{2}$ choices. There is a drastic difference in the decay of the singular values. At the truncation order $r = 30$ , the singular values of $\hat{H}$ have already dropped significantly. In contrast, the singular values of the full Hankel matrix start a rapid decay only after $r \approx 60$ .

Figure 2. MSD model: The normalized singular values of the Hankel matrices $H$ and $\hat{H}$ are shown in decreasing order. With an increasing number of interpolation directions $ℓ_{1} = ℓ_{2}$ , the singular value decay curves approach the singular value decay of the full Hankel matrix $H$ .

Figure 2. MSD model: The normalized singular values of the Hankel matrices H and Hˆ are shown in decreasing order. With an increasing number of interpolation directions ℓ1=ℓ2, the singular value decay curves approach the singular value decay of the full Hankel matrix H.

We choose the reduced model order as $r = 30$ , and apply both ERA and TERA. Theorem 3.4 via the upper bound in (26) can give valuable insight into the success of TERA. Choosing $r = 30$ , and $ℓ_{1} = ℓ_{2} = 7$ , the actual relative error in the Markov parameters is

\frac{\sum_{i = 1}^{2 s - 1} | | h_{i} - C_{r} A_{r}^{i - 1} B_{r} | |_{F}^{2}}{\sum_{i = 1}^{2 s - 1} | | h_{i} | |_{F}^{2}} = 1.13 \times 10^{- 1},

and the upper bound in (26) yields

\frac{4 (\sum_{i = ℓ_{1} + 1}^{p} σ_{i}^{2} (Θ_{L}) + \sum_{i = ℓ_{2} + 1}^{m} σ_{i}^{2} (Θ_{R})) + 2 \sqrt{r + ℓ_{1} + ℓ_{2}} \cdot σ_{r + 1} (\hat{H})}{\sum_{i = 1}^{2 s - 1} | | h_{i} | |_{F}^{2}} = 8.79 \times 10^{- 1} .

Notably, the error bound is in the same order of magnitude as the actual error. The main contribution to the upper bound results from the truncation of $Θ_{L}$ and $Θ_{R}$ . As a comparison, we also give the actual error and the error bound for the standard $E R A$ . While the actual relative error due to the standard ERA is

\frac{\sum_{i = 1}^{2 s - 1} ∥ C_{r} A_{r}^{i - 1} B_{r} - h_{i} ∥_{F}^{2}}{\sum_{i = 1}^{2 s - 1} ∥ h_{i} ∥_{F}^{2}} = 5.05 \times 10^{- 1},

the upper bound for ERA is

\frac{σ_{r + 1} (\hat{H}) \cdot \sqrt{r + m + p}}{\sum_{i = 1}^{2 s - 1} ∥ h_{i} ∥_{F}^{2}} = 1.33 \times 10^{0} .

As we shall see again in the next example, Kung’s error bound might not be sharp, and in fact may vary by several orders of magnitude from the true error. Thus, since the error bound is above 100%, it would have been hard to know a priori how well the ROM behaves.

To compare the reduced models due to ERA and TERA, both reduced models are converted back to continuous time, yielding

(29)

{\dot{x}}_{r} (t) = A_{r, c} x_{r} (t) + B_{r, c} u (t), y_{r} (t) = C_{r, c} x_{r} (t) + D_{r, c} u (t) .

(29)

The full model and the reduced systems (29) obtained from ERA and TERA, respectively, are simulated from zero to $50 s$ with zero initial conditions. The input functions were chosen as in [Citation42, Ex. 6.3] to be $u_{i} (t) = e^{- 0.05 t} sin (5 t)$ . shows outputs 6 and 11 of time domain simulations for the full model and both reduced models. Here, the TERA-based reduced model follows the full model outputs accurately. In contrast, the ERA-based reduced-order model responses are far from the actual output and produce erroneous results as shown in .

Figure 3. MSD model: Outputs of continuous-time simulations from the full model, and reduced models with $r = 30$ . In this case, TERA-ROM matches the output of the full order model better, whereas ERA shows strong deviations compared to the full order model.

These results illustrate that the reduced order $r = 30$ is too low for ERA to produce satisfactory results; thus we increase $r$ to study the performance of ERA more. Based on the plot of the Hankel singular values, , the singular values of the full Hankel matrix start decaying at order $r \approx 60$ . compares the continuous-time simulations of the full model, and both reduced-order models with $r = 60$ (the left and right interpolation directions for TERA are kept at $ℓ_{1} = ℓ_{2} = 7$ ). The outputs of the ERA model have now improved in accuracy and mimic the full model outputs accurately.

Figure 4. MSD model: Outputs of continuous-time simulations from the full model, and reduced models with $r = 60$ . At higher reduced order dimension, ERA performs better, and is visually indistinguishable from the full model.

For a better comparison of the performance across all inputs and outputs, we include below, where we show the Bode plots for the full model, ERA and TERA with $ℓ_{1} = ℓ_{2} = 10$ and $ℓ_{1} = ℓ_{2} = 15$ . We see that as we increase the interpolation directions, the frequency response of TERA approaches ERA as expected, and the Bode plot is better matched. Note, that the original system has $m = p = 30$ inputs and outputs. However, one should observe that even the original full ERA misses the sharp pick of the Bode plot around $ω = 2$ rad/sec. This is unfortunately the best one can do with an ERA setting here. Unlike other data-driven approaches, such as the Loewner framework [Citation28] or data-driven optimal $H_{2}$ interpolation method TF-IRKA [Citation31], we cannot simply sample the transfer function at various frequencies to have a better/optimal reduced model (and capture the sharp behaviour around $ω = 2$ ). State-space matrices are assumed not available to start with. All that is available are Markov parameters (data), and the goal is to get a reduced model from this data, which are stable, and have a bound for the sum of squares in the Frobenius norm of the error in the Markov parameters.

Figure 5. MSD model: Transfer function for ERA and TERA where ROMS have order $r = 80$ , and the TERA models were obtained by interpolating the data with a different number of directions. In plot (a), the original transfer function shows a peak around $ω = 2$ . In both (a) and (b), ERA remains unchanged, and we show that TERA approaches the transfer function obtained from ERA as we increase the number of tangential directions.

We have tried several lower order models and observed that we needed to increase the reduced order to around $r = 60$ to have a satisfactory reduced model from ERA. This was also reflected in the decay of the singular values in . Thus, for this example, TERA produced a better reduced-order model than ERA even with a smaller $r$ value at the same time reducing the effort for the SVD from a $15, 000 \times 15, 000$ matrix to a $3, 500 \times 3, 500$ matrix. For $r = 60$ , ERA provides a slightly better match in terms of the output of time-domain simulations, yet it still remains more expensive to compute and the advantage of the computational effort for TERA is still persistent. Moreover, the reader should note that a careful balancing of the number of interpolation directions $ℓ_{1}, ℓ_{2}$ , and the reduced-order model size $r$ , led to a satisfactory accuracy in the ROM, while saving computational time. We shall add though, that in general we do not expect TERA to outperform ERA in accuracy, as it did in this particular case. Nonetheless, since ERA is only optimal in reconstruction, the fact that an improvement in accuracy occurred here, is not contradictory.

4.2. Cooling of steel profiles – Rail model

The model is taken from the Oberwolfach benchmark collection for model reduction [Citation43] and is further described in [Citation44]. The process is modelled by a two-dimensional heat equation with boundary control input. A finite element discretization results in a model $(E, A, B, C)$ with $n = 1, 357$ states, $m = 7$ outputs and $p = 6$ outputs. The generalized eigenvalue of $A v = λ E v, v \neq 0$ , with largest real part is $λ_{m a x} = - 1.76 \times 10^{- 5}$ , which implies that the Markov parameters will decay slowly. It is therefore necessary to sample many Markov parameters to capture enough of the system dynamics. The model is converted to a discrete-time model through the bilinear transformation and simulated to construct $2 s = 2, 000$ Markov parameters. Once again, the original matrices are only used for Markov parameter generation and never enter into the algorithm. , shows the normalized decay of the Markov parameters over time, that is, $∥ h_{i} ∥_{F} / ∥ h_{1} ∥_{F}$ for $i = 1 : 20 : 2, 000$ . The plot can guide the choice of when to stop collecting data. Next, we investigate the performance of ERA/TERA, with reduced-model order $r = 20$ , unless indicated otherwise. Thus, the standard ERA is applied to the sequence of $2 s = 2, 000$ Markov parameters in $R^{6 \times 7}$ requiring an SVD of size $6, 000 \times 7, 000$ . Unless otherwise stated, the Markov parameters are projected with $ℓ_{1} = ℓ_{2} = 4$ tangential directions, so that ${\hat{h}}_{i} \in R^{4 \times 4}$ . Therefore, only a singular value decomposition of size $4, 000 \times 4, 000$ has to be computed.

Figure 6. Rail model: Norm (relative) decay of the Markov parameters over time, $| | h_{i} | |_{F} / | | h_{1} | |_{F}$ .

shows the normalized singular values of the matrices $Θ_{L}$ and $Θ_{R}$ , respectively. In addition to computational cost limitations, the decay of the singular values gives valuable insight for choosing the tangential truncation orders $ℓ_{1}$ and $ℓ_{2}$ , since they occur in the error bound in (26). Moreover, in , we see the convergence of the singular values of Hankel matrix from tangentially interpolated data for various values of $ℓ_{1}$ and $ℓ_{2}$ , as the dimension of the reduced-order model $r$ increases. As the size of the Hankel matrix grows the singular values converge to the full model. The first neglected singular value $σ_{r + 1} (\hat{H})$ enters into the upper bound of the TERA error in Equation (26).

Figure 7. Rail model: The singular values of the matrices $Θ_{L}, Θ_{R}$ as well as of the Hankel matrix provide insight into truncation of input/output directions, as well as the ROM dimension $r$ . In plot (b), for increasing numbers of tangential directions, the singular value decay approaches the behaviour of the ERA model, as expected.

compares the full transfer function $G (i ω)$ with the reduced-order transfer functions $G_{r} (i ω)$ of the ERA and TERA ROMs by showing the amplitude Bode plots. The $H_{\infty}$ error (where the Hardy-norm $||G| |_{H_{\infty}} = {sup}_{ω \in R}| |G (i ω)| |_{2}$ ) for TERA is $3.58 \times 10^{- 2}$ and similarly $1.36 \times 10^{- 2}$ for ERA, which is in the same order of magnitude.

Figure 8. Rail model: Bode plots (transfer function) for full model, and the reduced-order models through ERA and TERA. As seen from (a), the transfer functions in the original scaling are visually identical. Thus, plot (b) shows the error of TERA and ERA with respect to the original transfer function.

In the following, we shed some light on the behaviour of the error bound for ERA in (12) as well as the TERA error bound in (26). While the true relative error due to TERA with $ℓ_{1} = ℓ_{2} = 4$ is

\frac{\sum_{i = 1}^{2 s - 1} | | h_{i} - C_{r} A_{r}^{i - 1} {\hat{B}}_{r} | |_{F}^{2}}{\sum_{i = 1}^{2 s - 1} | | h_{i} | |_{F}^{2}} = 2.98 \times 10^{- 3},

the upper bound from Theorem 3.4 yields

\frac{4 (\sum_{i = ℓ_{1} + 1}^{p} σ_{i}^{2} (Θ_{L}) + \sum_{i = ℓ_{2} + 1}^{m} σ_{i}^{2} (Θ_{R})) + 2 \sqrt{r + ℓ_{1} + ℓ_{2}} \cdot σ_{r + 1} (\hat{H})}{\sum_{i = 1}^{2 s - 1} | | h_{i} | |_{F}^{2}} = 8.54 \times 10^{- 2} .

Even though the upper bound is not too pessimistic, it is not as tight as the previous example. This was expected from the slower decay of the Hankel singular values and the singular values of $Θ_{L}$ and $Θ_{R}$ . On the contrary, the error bound for the standard ERA is rather pessimistic. While the true relative error due to the standard ERA is

\frac{\sum_{i = 1}^{2 s - 1} ∥ C_{r} A_{r}^{i - 1} B_{r} - h_{i} ∥_{F}^{2}}{\sum_{i = 1}^{2 s - 1} ∥ h_{i} ∥_{F}^{2}} = 3.53 \times 10^{- 5},

the upper bound for ERA is

\frac{σ_{r + 1} (\hat{H}) \cdot \sqrt{r + m + p}}{\sum_{i = 1}^{2 s - 1} ∥ h_{i} ∥_{F}^{2}} = 2.50 \times 10^{- 1},

four orders of magnitude higher than the actual error. A more thorough look at the error bound of TERA for various reduced-order model sizes $r$ and interpolation directions $ℓ_{1}$ and $ℓ_{2}$ is given in . ERA obviously produces the lowest errors, yet the error bound is almost three orders of magnitudes higher than the true error. For TERA, we see that the error bound and actual error are within the same order of magnitude, yet the method produces higher errors. The error bound in Equation (26) is dominated by the truncated singular values of the matrices $Θ_{L}$ and $Θ_{R}$ , so that there is no significant decay trend of the error bound with increasing order $r$ .

Figure 9. Rail model: The error bound of ERA (normalized) from (12) and TERA from (26) versus the actual errors.

The continuous-time reduced-order models (29) are simulated with an input vector $u (t) \in R^{m}$ with $u_{i} (t) = 0.2 e^{- .005 t}$ , for $i = 1, \dots, m = 7$ . For time stepping, we used ode45 in MATLAB with standard error tolerances. The outputs are compared to the outputs of simulations of the full model. shows outputs No. 1, No. 2 and No. 5 computed from the full model as well as the reduced models obtained through both standard ERA and TERA. In addition to reducing the computational time and memory requirements of standard ERA, the TERA framework performs well in time domain simulations. Outputs No. 1 and No. 2 are captured very accurately. While one can observe a deviation in the approximation of output No. 5, the overall behaviour is still approximated well.

Figure 10. Rail model: The plot shows outputs of time domain simulations of the full and reduced-order models. In plot (a) and (b), all three models provide visually identical results, in plot (c), TERA shows a slight deviation for the first 20 s of simulation.

For both models, to show that the success of the selection of projection/tangential directions for TERA via SVD is not random, we generated tangential directions from a random normal distribution (as opposed to using the singular vectors of $Θ_{R}$ and $Θ_{L}$ ). This approach gave unsatisfactory results in all the test runs and we, therefore, safely exclude it as a choice for tangential directions. For illustration purposes, for the Rail model, we plotted the results of the continuous-time simulations for output two and six in , where random interpolation directions were used for $\hat{H}$ , leading to a rather poor model reduction performance.

Figure 11. Rail model: Time domain simulations of the full and reduced-order models, where the TERA model was obtained by interpolation with random directions. The full model and the ERA model are visually indistinguishable on this scale.

4.3. Indoor-air model for thermal fluid dynamics

We offer a brief explanation for the limitations of the proposed TERA approach by revisiting the indoor-air behaviour model [Citation5] from Example 2.4. The motivation is that it is often not possible to extract system matrices from commercial software, yet one can use SI methods to obtain reduced-order models for the dynamics. Given the size of the problem (input–output dimension as well as data), it would be computationally beneficial to use ERA with tangentially interpolated data. The tools developed earlier can help us decide whether TERA could be applied here.

We consider a similar model as illustrated in Example 2.4. For this problem, $1437$ Markov parameters were obtained by simulating the underlying dynamical system using the ANSYS FLUENT software with a spatial discretization of approximately $200, 000$ finite volume elements used in a three dimensional domain. The version of the model we consider here already has a reduced number of outputs, $p = 19$ , yet similar inputs, $m = 26$ . Consequently, this would mean that the standard ERA for this model is computationally demanding, requiring an SVD for a matrix of dimension $18, 668 \times 13, 642$ ; thus reflecting a limitation for the standard ERA itself. Since the internal representation is not available, we cannot provide the same level of detailed comparison as above.

First, we show in , the norm of the Markov parameters of the full model $| | h_{i} | |_{F}^{2}$ versus the norm of the interpolated Markov parameters $| | {\hat{h}}_{i} | |_{F}^{2}$ . This shows the information that is retained after the tangential interpolation procedure, which then enters the TERA algorithm. shows the decay of the singular values of $Θ_{L}$ and $Θ_{R}$ to determine the number of necessary interpolation directions. The reader should compare this to and (a), where a faster decay in the singular values is observed. Since the error bound in Theorem 3.4 contains the summed tail of the neglected singular values of $Θ_{R}$ and $Θ_{L}$ , the upper bound is expected to be loose.

Figure 12. Indoor-air model: A high number of interpolation directions is needed to retain enough information from the sequence of Markov parameters. Especially plot (b) shows that decay of the singular values for $Θ_{L}, Θ_{R}$ with respect to $ℓ_{1}, ℓ_{2}$ is very slow.

The other ingredient to the error bound in Theorem 3.4 is the first neglected singular value of the full Hankel matrix, $σ_{r + 1} (\hat{H})$ . The singular values of the Hankel matrix from interpolated data are shown in . As in the previous examples, the Hankel singular values converge to the true values as we increase the interpolation directions $ℓ_{1} = ℓ_{2} = ℓ$ . However, the convergence is noticeably slower than in the previous two examples. Taken together, one might expect TERA not to yield satisfactory results for small values of $ℓ$ ; which could hint at the fact that all inputs and outputs are highly relevant for this particular model as we shall see below. In contrast, for large $ℓ$ , the computational benefit of using TERA is negligible, in which case one would say that the methods ‘fails’.

Figure 13. Indoor-air model: The TERA models slowly approach the ERA model as the number of interpolation directions $ℓ_{1} = ℓ_{2}$ is increased, see plot (a). Part (b) shows that even the identified Markov parameters from a full ERA model do not match the original sequence well.

Following this a priori analysis, in , we compare the original and identified Markov parameters, using ERA (an expensive computation of the $18, 668 \times 13, 642$ SVD) and TERA with $ℓ_{1} = ℓ_{2} = 10$ . In both cases, $r = 300$ is chosen as the ROM model order. We note here that the relative error $\sum_{k} | | h_{i} - C_{r} A_{r}^{k} B_{r} | |_{F}^{2} / \sum_{k} | | h_{k} | |_{F}^{2}$ is $2.39 \times 10^{- 1}$ for ERA and $2.70 \times 10^{- 1}$ for TERA. Thus, ERA still performs better, but the errors are still too large for a reduced-order model with good predictive capabilities. In this example, due to several illustrated factors, TERA could not provide a computational benefit to ERA. This illustrates how to approach the problem of deciding whether tangential interpolation of the data prior to SI is beneficial.

5. Conclusions

We modified the standard ERA to handle MIMO systems more efficiently. After the input and output dimensions are reduced by tangential interpolation of the impulse response data, the standard ERA is used on the low dimensional input and output spaces. The observation and control matrices are subsequently lifted back to the original input and output dimensions. The resulting reduced-order model has the original input and output dimensions, and is guaranteed to retain stability. The computational savings for the necessary singular value decomposition are significant, in particular since the complexity of the SVD grows cubically with the size of the Hankel matrix. Moreover, we give criteria to guide the user whether in a particular model using TERA can be beneficial. The a priori error bound in Theorem 3.4 provides a clear picture regarding the contribution of the tangential interpolation error, and the truncation error of the Hankel matrix to the overall error. The numerical findings demonstrate the success of TERA. The algorithm can run with inputs from experiments or black-box code and accurately identify reduced order dynamics.

There are several interesting directions one can pursue. As showed in [Citation6], ERA can be considered a data-driven approximation to balanced truncation. Establishing and understanding a similar connection for TERA might yield other ways of choosing the left and right directions to project the impulse response data. This connection can also help handling the cases where, for example, only the output dimension is massive but there is only a single input. In the setting of balanced truncation, the authors in [Citation45] offered an effective methodology for those situations by employing a numerical quadrature in the computation of one of the gramians involving the other gramian. It will be beneficial to understand the implications for ERA and TERA as well. Moreover, understanding the effect of noise on the TERA computations will also be important.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

Research supported in part by the Energy Efficient Buildings Hub under DOE contract DE-EE0004261.

Notes

1. MATLAB and Statistics Toolbox Release 2015b, The MathWorks, Inc., Natick, Massachusetts, United States.

2. We follow the original ERA notation and assume a standard state-space, that is, the E-term is E = I. This makes the notation involving Markov parameters and the Hankel matrix much simpler. The theory can be extended to the general case. One of the numerical examples in Section 4 has E ≠ I.

3. We use this term to refer to the singular values of the Hankel matrix.

Related Research Data

A damage identification method for a thin plate structure based on PVDF sensors and strain mode

Source: SAGE Publications

Linking provided by

References

I. Houtzager, J. Van Wingerden, and M. Verhaegen, Recursive predictor-based subspace identification with application to the real-time closed-loop tracking of flutter, IEEE Trans. Control Syst. Technol. 20 (2012), pp. 934–949. doi:10.1109/TCST.2011.2157694
Web of Science ®Google Scholar
D.C. Rebolho, E.M. Belo, and F.D. Marques, Aeroelastic parameter identification in wind tunnel testing via the extended eigensystem realization algorithm, J. Vib. Control 20 (2014), pp. 1607–1621. doi:10.1177/1077546312474015
Web of Science ®Google Scholar
M. Döhler and L. Mevel, Fast multi-order computation of system matrices in subspace-based system identification, Control Eng. Pract. 20 (2012), pp. 882–894. doi:10.1016/j.conengprac.2012.05.005
Web of Science ®Google Scholar
J.M. Caicedo, S. Dyke, and E. Johnson, Natural excitation technique and eigensystem realization algorithm for phase I of the IASC-ASCE benchmark problem: Simulated data, J. Eng. Mech. 130 (2004), pp. 49–60. doi:10.1061/(ASCE)0733-9399(2004)130:1(49)
Web of Science ®Google Scholar
J. Borggaard, E. Cliff, and S. Gugercin, Model reduction for indoor-air behavior in control design for energy-efficient buildings, Proceedings of the American Control Conference, Montreal, 2012, pp. 2283–2288. ISBN 9781457710957
Google Scholar
Z. Ma, S. Ahuja, and C. Rowley, Reduced-order models for control of fluids using the eigensystem realization algorithm, Theor. Comput. Fluid Dyn. 25 (2011), pp. 233–247. doi:10.1007/s00162-010-0184-8
Web of Science ®Google Scholar
F. Juillet, P. Schmid, and P. Huerre, Control of amplifier flows using subspace identification techniques, J. Fluid. Mech. 725 (2013), pp. 522–565. doi:10.1017/jfm.2013.194
Web of Science ®Google Scholar
C. Wales, A. Gaitonde, and D. Jones, Stabilisation of reduced order models via restarting, Int. J. Numer. Methods Fluids 73 (2013), pp. 578–599. doi:10.1002/fld.v73.6
Web of Science ®Google Scholar
J. Mendel, Minimum-variance deconvolution, IEEE Trans. Geosci. Remote Sensing GE-19 (1981), pp. 161–171. doi:10.1109/TGRS.1981.350346
Web of Science ®Google Scholar
M. Viberg, Subspace-based methods for the identification of linear time-invariant systems, Automatica 31 (1995), pp. 1835–1851. doi:10.1016/0005-1098(95)00107-5
Web of Science ®Google Scholar
S. Qin, An overview of subspace identification, Comput. Chem. Eng. 30 (2006), pp. 1502–1513. doi:10.1016/j.compchemeng.2006.05.045
Web of Science ®Google Scholar
E. Reynders, System identification methods for (operational) modal analysis: Review and comparison, Arch. Comput. Methods Eng. 19 (2012), pp. 51–124. doi:10.1007/s11831-012-9069-x
Web of Science ®Google Scholar
S.Y. Kung, A new identification and model reduction algorithm via singular value decomposition, Proceedings of 12th Asilomar Conference on Circuits, Systems & Computers, Pacific Grove, CA, 1978, pp. 705–714. IEEE.
Google Scholar
J.-N. Juang and R. Pappa, An eigensystem realization algorithm for modal parameter identification and model reduction, J. Guidance, Control, Dyn. 8 (1985), pp. 620–627. doi:10.2514/3.20031
Web of Science ®Google Scholar
G. Pitstick, J. Cruz, and R. Mulholland, Approximate realization algorithms for truncated impulse response data, IEEE Trans. Acoust. Speech Signal Process. 34 (1986), pp. 1583–1588. doi:10.1109/TASSP.1986.1164997
Google Scholar
R. Longman and J.-N. Juang, Recursive form of the eigensystem realization algorithm for system identification, J. Guidance, Control, Dyn. 12 (1989), pp. 647–652. doi:10.2514/3.20458
Web of Science ®Google Scholar
J. Singler, Model reduction of linear PDE systems: A continuous time eigensystem realization algorithm, Proceedings of the American Control Conference, Montreal, 2012, pp. 1424–1429.
Google Scholar
K. Willcox and J. Peraire, Balanced model reduction via the proper orthogonal decomposition, AIAA J. 40 (2002), pp. 2323–2330. doi:10.2514/2.1570
Google Scholar
C. Rowley, Model reduction for fluids, using balanced proper orthogonal decomposition, Int. J. Bifur. Chaos 15 (2005), pp. 997–1013. doi:10.1142/S0218127405012429
Web of Science ®Google Scholar
D. Yu and S. Chakravorty, A randomized proper orthogonal decomposition technique, arXiv preprint arXiv:1312.3976, 2013.
Google Scholar
A.C. Antoulas, Approximation of Large-Scale Dynamical Systems, Advances in Design and Control Society for Industrial and Applied Mathematics, Philadelphia, PA, 2005.
Google Scholar
U. Baur, P. Benner, and L. Feng, Model order reduction for linear and nonlinear systems: A system-theoretic perspective, Arch. Comput. Methods Eng. 21 (2014), pp. 331–358. doi:10.1007/s11831-014-9111-2
Web of Science ®Google Scholar
A.C. Antoulas, C.A. Beattie, and S. Gugercin, Interpolatory model reduction of large-scale dynamical systems, in Efficient Modeling and Control of Large-Scale Systems, J. Mohammadpour and M.K. Grigoriadis, eds., Springer US, Boston, MA, 2010, pp. 3–58.
Google Scholar
B. Moore, Principal component analysis in linear systems: Controllability, observability, and model reduction, IEEE Trans. Automat. Contr. 26 (1981), pp. 17–32. doi:10.1109/TAC.1981.1102568
Web of Science ®Google Scholar
C. Mullis and R. Roberts, Synthesis of minimum roundoff noise fixed point digital filters, IEEE Trans. Circuits Syst. 23 (1976), pp. 551–562. doi:10.1109/TCS.1976.1084254
Google Scholar
S. Gugercin, A. Antoulas, and C.A. Beattie, model reduction for large-scale linear dynamical systems, SIAM J. Matrix Anal. Appl. 30 (2008), pp. 609–638. doi:10.1137/060666123
Web of Science ®Google Scholar
K. Glover, All optimal Hankel-norm approximations of linear multivariable systems and their L,∞-error bounds, Int. J. Control 39 (1984), pp. 1115–1193. doi:10.1080/00207178408933239
Web of Science ®Google Scholar
A. Mayo and A. Antoulas, A framework for the solution of the generalized realization problem, Linear Algebra Appl. 425 (2007), pp. 634–662. doi:10.1016/j.laa.2007.03.008
Web of Science ®Google Scholar
B. Gustavsen and A. Semlyen, Rational approximation of frequency domain responses by vector fitting, IEEE Trans. Power Deliv. 14 (1999), pp. 1052–1061. doi:10.1109/61.772353
Web of Science ®Google Scholar
Z. Drmač, S. Gugercin, and C. Beattie, Quadrature-based vector fitting for discretized approximation, SIAM J. Sci. Comput. 37 (2015), pp. A625–A652. doi:10.1137/140961511
Web of Science ®Google Scholar
C. Beattie and S. Gugercin, Realization–independent approximation, Proceedings of the 51st IEEE Conference on Decision & Control, Maui, HI, IEEE, 2012, pp. 4953–4958.
Google Scholar
C. Sanathanan and J. Koerner, Transfer function synthesis as a ratio of two complex polynomials, IEEE Trans. Autom. Control 8 (1963), pp. 56–58. doi:10.1109/TAC.1963.1105517
Web of Science ®Google Scholar
M. Berljafa and S. Güttel, Generalized rational Krylov decompositions with an application to rational approximation, SIAM J. Matrix Anal. Appl. 36 (2015), pp. 894–916. doi:10.1137/140998081
Web of Science ®Google Scholar
Z. Drmač, S. Gugercin, and C. Beattie, Vector fitting for matrix-valued rational approximation, SIAM J. Scientific Comput. 37 (2015), pp. A2346–A2379. doi:10.1137/15M1010774
Web of Science ®Google Scholar
L.M. Silverman, Realization of linear dynamical systems, IEEE Trans. Automat. Contr. 16 (1971), pp. 554–567. doi:10.1109/TAC.1971.1099821
Web of Science ®Google Scholar
G. Golub and C.F. Van Loan, Matrix Computations, Johns Hopkins University, Press, Baltimore, MD, 1996, pp. 374–426.
Google Scholar
C.A. Beattie and S. Gugercin, Model reduction by rational interpolation, in Model Reduction and Approximation for Complex Systems, P. Benner, A. Cohen, M. Ohlberger, and K. Willcox, eds., Marseille, 2014. Available at http://arxiv.org/abs/1409.2140.
Google Scholar
P. Feldmann and F. Liu, Sparse and efficient reduced order modeling of linear subcircuits with large number of terminals, in IEEE/ACM International Conference on Computer Aided Design, 2004. ICCAD-2004, 2004, pp. 88–92.
Google Scholar
P. Liu, S.X. Tan, B. Yan, and B. McGaughy, An extended SVD-based terminal and model order reduction algorithm, Proceedings of the 2006 IEEE International Behavioral Modeling and Simulation Workshop San Jose, CA, 2006, pp. 44–49.
Google Scholar
P. Benner and A. Schneider, Model reduction for linear descriptor systems with many ports, in Progress in Industrial Mathematics at ECMI 2010, M. Günther, A. Bartel, M. Brunk, S. Schöps, M. Striebel, eds., Springer, Berlin, 2012, pp. 137–143.
Google Scholar
U. Al-Saggaf and G. Franklin, Model reduction via balanced realizations: An extension and frequency weighting techniques, IEEE Trans. Automat. Contr. 33 (1988), pp. 687–692. doi:10.1109/9.1280
Web of Science ®Google Scholar
S. Gugercin, R. Polyuga, C. Beattie, and A. Van Der Schaft, Structure-preserving tangential interpolation for model reduction of port-Hamiltonian systems, Automatica 48 (2012), pp. 1963–1974. doi:10.1016/j.automatica.2012.05.052
Web of Science ®Google Scholar
IMTEK - Simulation, Oberwolfach benchmark collection, 2003. Available at http://www.simulation.uni-freiburg.de/downloads/benchmark.
Google Scholar
A. Unger and F. Tröltzsch, Fast solution of optimal control problems in the selective cooling of steel, ZAMM Z. Angew. Math. Mech. 81 (2001), pp. 447–456. doi:10.1002/(ISSN)1521-4001
Web of Science ®Google Scholar
P. Benner and A. Schneider, Balanced truncation for descriptor systems with many terminals, Max Planck Institute Magdeburg Preprint MPIMD/13-17, 2013. Available at http://www2.mpi-magdeburg.mpg.de/preprints/2013/MPIMD13-17.pdf.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Tangential interpolation-based eigensystem realization algorithm for MIMO systems

ABSTRACT

1. Introduction

2. Partial realization and Kung’s algorithm

3. Proposed method: tangential interpolation-based ERA (TERA)

3.1. Tangential interpolation from data

3.2. Projection of Markov parameters

3.3. ERA for projected Hankel matrix and recovering original input/output dimensions

Algorithm 1. TERA

3.4. Error analysis and stability

4. Numerical results

Table 1. Specifications, CPU times to execute, and time savings for the numerical examples. Solved on a cluster with a 6-core Intel Xeon X5680 CPU at 3.33GHz and 48GB RAM, with MATLAB2013b.

4.1. Mass spring damper system

4.2. Cooling of steel profiles – Rail model

4.3. Indoor-air model for thermal fluid dynamics

5. Conclusions

Disclosure statement

Related Research Data

References

Information for

Open access

Opportunities

Help and information

Tangential interpolation-based eigensystem realization algorithm for MIMO systems

ABSTRACT

1. Introduction

2. Partial realization and Kung’s algorithm

3. Proposed method: tangential interpolation-based ERA (TERA)

3.1. Tangential interpolation from data

3.2. Projection of Markov parameters

3.3. ERA for projected Hankel matrix and recovering original input/output dimensions

Algorithm 1. TERA

3.4. Error analysis and stability

4. Numerical results

Table 1. Specifications, CPU times to execute, and time savings for the numerical examples. Solved on a cluster with a 6-core Intel Xeon X5680 CPU at 3.33GHz and 48GB RAM, with MATLAB2013b.

4.1. Mass spring damper system

4.2. Cooling of steel profiles – Rail model

4.3. Indoor-air model for thermal fluid dynamics

5. Conclusions

Disclosure statement

Additional information

Funding

Notes

Related Research Data

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date