
Solving generalized inverse eigenvalue problems via L-BFGS-B method

Pages 1719-1746 | Received 30 Jun 2019, Accepted 22 Apr 2020, Published online: 17 Jun 2020

ABSTRACT

The parameterized generalized inverse eigenvalue problems, which include multiplicative and additive inverse eigenvalue problems as special cases, appear in vibrating system design, structural design, and inverse Sturm–Liouville problems. In this article, using the Cholesky factorization and the Jacobi method, we propose two efficient algorithms, based on Newton's method and the L-BFGS-B method, for solving these problems. To demonstrate the effectiveness of the algorithms, we present three numerical examples.


1. Introduction

In eigenvalue problems, the data consist of the entries of a matrix or a matrix pencil, and the unknowns are all or some of the eigenvalues. Effective methods for describing and solving such problems can be found in [Citation1,Citation2]. In the corresponding inverse problems, the data consist of complete or partial information about the eigenvalues or eigenvectors. In fact, inverse problems amount to finding a matrix or a matrix pencil that matches the given information under spectral and structural constraints [Citation3–6]. The current literature on inverse eigenvalue problems shows that they arise inevitably in practical applications, and their form depends on the application at hand. In [Citation3], a collection of inverse eigenvalue problems was identified and categorized according to their characteristics. Many inverse eigenvalue problems are generalized inverse eigenvalue problems. Since many physical problems can be modelled as generalized inverse eigenvalue problems, many different instances of these problems have appeared. For example, the generalized inverse eigenvalue problem over Hermitian–Hamiltonian matrices with a submatrix constraint was studied in [Citation7]. In [Citation8], Liu et al. considered the generalized inverse eigenvalue problem for centrohermitian matrices. In [Citation9], the generalized inverse eigenvalue problem for generalized snow-like matrices was investigated. A generalized inverse eigenvalue problem in structural dynamic model updating was studied in [Citation10]. In [Citation11], a generalized inverse eigenvalue problem for Jacobi matrices was considered. In these problems, a set of eigenvalues and a matrix pencil that depends on unknown parameters are given, and the goal is to find the unknown parameters under some conditions. Due to their many applications, these problems have always attracted the attention of researchers [Citation12,Citation13].

To simplify the discussion, we introduce the following notation. The symbols $\mathcal{H}_n$ and $\mathcal{U}_n$ stand for the sets of all $n\times n$ lower triangular matrices and $n\times n$ upper triangular matrices, respectively. We write $A>0$ and $A\geq 0$ when the matrix $A$ is positive definite and positive semi-definite, respectively. The symbol $\|\cdot\|$ denotes the Euclidean norm for vectors and the induced norm for matrices. In addition, we denote a matrix pencil pair briefly by $(A,B)$.

The parameterized generalized inverse eigenvalue problem (PGIEP) can be described as follows:

Definition 1.1

Let $A(c)=A(c_1,c_2,\ldots,c_n)$ and $B(c)=B(c_1,c_2,\ldots,c_n)$ be two given $n\times n$ matrices whose entries are analytic functions of the parameters $(c_1,c_2,\ldots,c_n)$. Given $n$ real numbers $\lambda_1,\lambda_2,\ldots,\lambda_n$, find $c\in\mathbb{R}^n$ such that the generalized eigenvalue problem $A(c)x=\lambda B(c)x$ has the prescribed eigenvalues $\lambda_1,\lambda_2,\ldots,\lambda_n$.

In many practical applications, such as structural design [Citation14–16], the matrix pencil pair $(A(c),B(c))$ is affine in $c$, which gives rise to special cases of the PGIEP. In this paper, we consider the special case of this problem in the following form:

Problem 1.1

As a special type of the PGIEP, in this problem we set
(1) $A(c)=A_0+\sum_{i=1}^{n}c_iA_i,\qquad B(c)=B_0+\sum_{i=1}^{n}c_iB_i,$
where $A_i$, $i=0,1,2,\ldots,n$, are $n\times n$ symmetric matrices, $B_0>0$, $B_i\geq 0$ for $i=1,2,\ldots,n$, and the given eigenvalues $\lambda_1,\lambda_2,\ldots,\lambda_n$ are ordered as $\lambda_1<\lambda_2<\cdots<\lambda_n$.
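For concreteness, the affine dependence in (1) can be assembled directly from the coefficient matrices. The following NumPy sketch (a minimal illustration; the helper name build_pencil and the list-based interface are our own conventions, not from the paper) forms A(c) and B(c) for a given parameter vector c.

```python
import numpy as np

def build_pencil(A_list, B_list, c):
    """Assemble A(c) = A_0 + sum_i c_i A_i and B(c) = B_0 + sum_i c_i B_i.

    A_list = [A_0, A_1, ..., A_n] and B_list = [B_0, B_1, ..., B_n] are
    symmetric n x n arrays; c is the length-n parameter vector of Problem 1.1.
    """
    A = A_list[0] + sum(ci * Ai for ci, Ai in zip(c, A_list[1:]))
    B = B_list[0] + sum(ci * Bi for ci, Bi in zip(c, B_list[1:]))
    return A, B
```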

The algebraic inverse eigenvalue problem [Citation17] and the additive inverse eigenvalue problems [Citation18] are two special types of Problem 1.1.

Problem 1.1 is one of the most frequently encountered inverse eigenvalue problems and appears in various areas of mathematical and numerical analysis as well as in engineering applications [Citation19–22]. In particular, Problem 1.1 arises in a variety of practical applications in structural engineering, mechanics, and physics. Some of these applications are as follows [Citation14,Citation23,Citation24]:

  • Studying vibrating strings and beams [Citation25,Citation26];

  • Nuclear spectroscopy;

  • The educational testing problem;

  • The graph partitioning problem;

  • Sturm–Liouville problems and preconditioning;

  • Factor analysis [Citation27] and the design of control systems [Citation18].

We will now describe several examples arising in various areas of applications.

Application 1.1

[Citation28]

Consider the inverse Sturm–Liouville problem
(2) $-\ddot{u}(x)+p(x)u(x)=\lambda u(x),$
(3) $u(0)=u(\pi)=0,$
where the purpose is to determine the density function $p(x)>0$ from the given eigenvalues $\{\lambda_i\}_{i\geq 1}$. Discretizing this problem by finite differences leads to the inverse eigenvalue problem
$Au+Du=\lambda_i u,\quad i=1,2,\ldots,n,$
in which the goal is to find the matrix $D=\mathrm{diag}(d_1,d_2,\ldots,d_n)=\mathrm{diag}(p(h),p(2h),\ldots,p(nh))>0$. This is a classical example of a parameterized generalized inverse eigenvalue problem.
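To make the discretization concrete, the following sketch (an illustrative forward computation only, with an assumed step size h = π/(n+1) and an arbitrary sample density p, neither taken from the paper) builds the standard second-difference matrix A and the diagonal matrix D, and returns the eigenvalues of A + D.

```python
import numpy as np

def sturm_liouville_forward(p, n):
    """Finite-difference eigenvalues of -u'' + p(x) u = lambda u on (0, pi).

    p is a callable density function; on the grid x_i = i*h with h = pi/(n+1),
    A is the scaled second-difference matrix and D = diag(p(x_i)).
    """
    h = np.pi / (n + 1)
    x = h * np.arange(1, n + 1)
    A = (np.diag(2.0 * np.ones(n)) - np.diag(np.ones(n - 1), 1)
         - np.diag(np.ones(n - 1), -1)) / h**2
    D = np.diag(p(x))
    return np.sort(np.linalg.eigvalsh(A + D))

# Example: eigenvalues for an illustrative density p(x) = 1 + x/10.
print(sturm_liouville_forward(lambda x: 1.0 + x / 10.0, n=5))
```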

Application 1.2

It is shown in [Citation29] that a vibrating mass–spring system can be modelled by a system of ordinary differential equations of the form $\dot{x}=Ax$, where the system matrix is a symmetric matrix of physical parameters,
$A=\begin{bmatrix} k_1+k_2 & -k_2 & & & \\ -k_2 & k_2+k_3 & -k_3 & & \\ & -k_3 & k_3+k_4 & -k_4 & \\ & & \ddots & \ddots & \ddots \\ & & & -k_n & k_n \end{bmatrix},$
and $x\in\mathbb{R}^n$. It is possible to write $A$ as the following linear combination of the physical parameters:
$A=A(c)=A_0+\sum_{i=1}^{n}c_iA_i,$
where $c_i=k_i$, $B_0=I$, $B_i=0$ for $i=1,2,\ldots,n$, $A_0=0$, $A_1=e_1e_1^{T}$ and $A_i=(e_{i-1}-e_i)(e_{i-1}-e_i)^{T}$, $i=2,3,\ldots,n$. When the natural frequencies of the system are given as the eigenvalues of $A$, finding the matrix reduces to determining the multipliers $c_i>0$, $i=1,2,\ldots,n$.
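This decomposition is easy to verify numerically. The sketch below (a small self-contained check with illustrative spring constants, not code from the paper) builds the basis matrices A_i, assembles A(c) with c_i = k_i, and compares the result with the tridiagonal stiffness matrix written directly.

```python
import numpy as np

def spring_basis(n):
    """Basis matrices A_1, ..., A_n of the mass-spring stiffness matrix."""
    e = np.eye(n)
    basis = [np.outer(e[0], e[0])]                      # A_1 = e_1 e_1^T
    for i in range(1, n):
        d = e[i - 1] - e[i]                             # A_i = (e_{i-1}-e_i)(e_{i-1}-e_i)^T
        basis.append(np.outer(d, d))
    return basis

n, k = 4, np.array([3.0, 1.0, 2.0, 5.0])                # illustrative spring constants
A = sum(ki * Ai for ki, Ai in zip(k, spring_basis(n)))  # A(c) with c_i = k_i

# Direct tridiagonal construction for comparison.
A_direct = (np.diag(np.append(k[:-1] + k[1:], k[-1]))
            - np.diag(k[1:], 1) - np.diag(k[1:], -1))
print(np.allclose(A, A_direct))                          # True
```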

Problem 1.1 also plays an important role in structural design and applied mechanics [Citation19].

Application 1.3

In many practical applications involving structures with $l$ elements and $n$ displacement degrees of freedom, such as truss structure design, assuming that $c=(c_1,c_2,\ldots,c_l)$ is a set of design parameters, the global stiffness matrix $A(c)$ and the global mass matrix $B(c)$ are defined as
$A(c)=c_1K_1+c_2K_2+\cdots+c_lK_l,\qquad B(c)=M_0+c_1M_1+c_2M_2+\cdots+c_lM_l,$
where $K_i$ and $M_i$ are the stiffness and mass matrices of the $i$th element of the structure. In these problems, the goal is to determine $c_i>0$, $i=1,2,\ldots,l$, such that the generalized eigenvalue problem $A(c)x=\lambda B(c)x$ has the prescribed set of eigenvalues $\{\lambda_1,\lambda_2,\ldots,\lambda_l\}$.

On the other hand, various applications of the PGIEP have made it a favourite topic for analysis. Many recent studies have focused on the numerical methods and existence theory for different categories of these problems [Citation3,Citation30,Citation31]. Sufficient and necessary conditions for the solvability of the specific types of the PGIEP were presented in [Citation18,Citation31–36]. In addition, in [Citation37], Dai et al. presented sufficient conditions for guaranteeing the existence of a solution for the parameterized generalized inverse eigenvalue problem. More results in this area can be found in [Citation38,Citation39].

Furthermore, a review of the recent literature on inverse eigenvalue problems shows that there are many results on computing solutions of the PGIEP. In general, different approaches have been used to solve these problems according to their type. These methods can be grouped into three main categories: direct methods, iterative methods and continuous methods [Citation3]. What these methods have in common is that they lead to a nonlinear system of equations or an optimization problem. Since the resulting nonlinear system of equations is simpler in iterative and continuous methods, these methods have received considerable attention in recent years [Citation3]. Iterative methods have also been developed for specific types of the PGIEP [Citation40–42]. Some of these methods are formulated as the solution of one of the following nonlinear systems:
(4) $F(c)=\begin{bmatrix}\lambda_1(c)-\lambda_1\\ \lambda_2(c)-\lambda_2\\ \vdots\\ \lambda_n(c)-\lambda_n\end{bmatrix}=0,$
(5) $F(c)=\begin{bmatrix}\sigma_{\min}(A(c)-\lambda_1B(c))\\ \sigma_{\min}(A(c)-\lambda_2B(c))\\ \vdots\\ \sigma_{\min}(A(c)-\lambda_nB(c))\end{bmatrix}=0,$
in which $\lambda_1(c),\lambda_2(c),\ldots,\lambda_n(c)$ are the eigenvalues of the generalized eigenvalue problem $A(c)x=\lambda B(c)x$, the scalars $\lambda_1,\lambda_2,\ldots,\lambda_n$ are the eigenvalues given in the problem, and $\sigma_{\min}(A(c)-\lambda_iB(c))$ is the smallest singular value of the matrix $A(c)-\lambda_iB(c)$. In these formulations, the system of nonlinear equations $F(c)=0$ has to be solved. In [Citation42], Newton's method for solving the system of nonlinear equations (4) was presented, by extending the ideas developed by Friedland et al. in [Citation34]. This method requires computing the complete solution of the generalized eigenvalue problem $A(c)x=\lambda B(c)x$ in each iteration of Newton's method. Also, in [Citation43], Shu et al. introduced a homotopy solution of the system of nonlinear equations (4). In [Citation41], another formulation was presented by using a QR-like decomposition:
(6) $F(c)=\begin{bmatrix}r_{nn}^{(1)}(c)\\ r_{nn}^{(2)}(c)\\ \vdots\\ r_{nn}^{(n)}(c)\end{bmatrix}=0,$
in which $r_{nn}^{(i)}(c)$ is obtained from a QR-like decomposition of $A(c)-\lambda_iB(c)$, for $i=1,2,\ldots,n$, of the form
$A(c)-\lambda_iB(c)=Q_i(c)R_i(c),\qquad R_i(c)=\begin{bmatrix}R_{11}^{(i)}(c) & R_{12}^{(i)}(c)\\ 0 & r_{nn}^{(i)}(c)\end{bmatrix}.$
In addition, Lancaster [Citation44] and Biegler-König [Citation45] presented a formulation based on determinant evaluation for the additive and multiplicative inverse eigenvalue problems, of the form
(7) $F(c)=\begin{bmatrix}\det(A(c)-\lambda_1I)\\ \det(A(c)-\lambda_2I)\\ \vdots\\ \det(A(c)-\lambda_nI)\end{bmatrix}=0.$
It should be noted that the formulations (4)–(7) are not computationally attractive. To avoid solving an eigenvalue problem in each iteration and to lower the computational complexity, we propose a numerical algorithm based on the Cholesky factorization and the Jacobi method. In this paper, we consider the formulation of both methods for solving Problem 1.1, assuming the existence of a solution.
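For reference, formulation (4) can be evaluated with a dense generalized symmetric-definite eigensolver. The sketch below is illustrative only; it uses SciPy's eigh (not any routine from the paper) and assumes the affine structure of Problem 1.1 with the prescribed eigenvalues given in ascending order.

```python
import numpy as np
from scipy.linalg import eigh

def F_eigs(c, A_list, B_list, target):
    """F(c) of formulation (4): sorted eigenvalues of A(c)x = lambda B(c)x
    minus the prescribed eigenvalues 'target' (assumed sorted ascending)."""
    A = A_list[0] + sum(ci * Ai for ci, Ai in zip(c, A_list[1:]))
    B = B_list[0] + sum(ci * Bi for ci, Bi in zip(c, B_list[1:]))
    lam = eigh(A, B, eigvals_only=True)   # generalized symmetric-definite problem
    return lam - np.asarray(target)
```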

This paper is organized as follows. We review some basic definitions and theorems for the Cholesky factorization and the Jacobi method in Section 2. In Section 3, first we consider necessary theories for the Cholesky factorization of a matrix dependent on several parameters and then two new algorithms based on the Cholesky factorization and the Jacobi method is proposed. Finally in Section 4 some numerical experiments are presented.

2. Some definitions and theorems

Before starting, we review some basic definitions and theorems for the Cholesky factorization and the Jacobi method quickly.

The Cholesky factorization is a factorization of a positive definite matrix into the product of a lower triangular matrix and its conjugate transpose, which is useful for efficient numerical solutions [Citation46]. Positive definite matrices possess numerous significant properties; in particular, they can be represented in the form $A=HH^{T}$ for a non-singular matrix $H$. The Cholesky factorization is the special case of this representation in which $H$ is lower triangular with positive diagonal elements. It can be computed by a form of Gaussian elimination [Citation46]. Equating the $(i,j)$ elements in the equation $A=HH^{T}$ gives
$a_{ii}=\sum_{k=1}^{i}h_{ik}^{2},\qquad a_{ij}=\sum_{k=1}^{j}h_{ik}h_{jk},\quad i>j.$
These equations can be solved to build the matrix $H$ one column at a time:
$h_{11}=\sqrt{a_{11}},$ and, for $i=2,\ldots,n$,
$h_{ii}=\Bigl(a_{ii}-\sum_{k=1}^{i-1}h_{ik}^{2}\Bigr)^{1/2},\qquad h_{ji}=\Bigl(a_{ji}-\sum_{k=1}^{i-1}h_{jk}h_{ik}\Bigr)\Big/h_{ii},\quad j=i+1,\ldots,n.$
There are theorems that guarantee the existence of a Cholesky factorization for symmetric positive definite matrices. According to one such theorem, if all leading principal submatrices of a matrix $A\in\mathbb{R}^{n\times n}$ are non-singular, then there exist a diagonal matrix $D$ and two unit lower triangular matrices $L$ and $M$ such that $A=LDM^{T}$; if, in addition, $A$ is symmetric, then $L=M$ and we can write $A=LDL^{T}$. The Jacobi method constructs a sequence of similar matrices by means of orthogonal transformations. Jacobi methods for the symmetric eigenvalue problem attract current attention because they are inherently parallel [Citation47]. The method uses Jacobi rotation matrices to diagonalize symmetric matrices, based on the following theorem:
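A direct transcription of this column-by-column recurrence is given below. This is a plain textbook Cholesky sketch, included only to make the indexing explicit; in practice one would simply call numpy.linalg.cholesky.

```python
import numpy as np

def cholesky_lower(A):
    """Lower-triangular H with A = H H^T, computed column by column."""
    A = np.asarray(A, dtype=float)
    n = A.shape[0]
    H = np.zeros_like(A)
    for i in range(n):
        # Diagonal entry: h_ii = sqrt(a_ii - sum_k h_ik^2).
        H[i, i] = np.sqrt(A[i, i] - np.dot(H[i, :i], H[i, :i]))
        # Entries below the diagonal in column i.
        for j in range(i + 1, n):
            H[j, i] = (A[j, i] - np.dot(H[j, :i], H[i, :i])) / H[i, i]
    return H

M = np.array([[4.0, 2.0], [2.0, 3.0]])
print(np.allclose(cholesky_lower(M) @ cholesky_lower(M).T, M))   # True
```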

Theorem 2.1

[Citation46]

Let $A$ be an $n\times n$ real symmetric matrix. For each pair of integers $(p,q)$, $1\leq p<q\leq n$, there exists a $\theta\in[-\pi/4,\pi/4]$ such that the $(p,q)$ element of the matrix $G(p,q,\theta)^{T}AG(p,q,\theta)$ is zero, where $G(p,q,\theta)$ is a rotation matrix.

Actually, the idea of the Jacobi method is to reduce the quantity
$\mathrm{off}(A)=\sum_{i=1}^{n}\sum_{\substack{j=1\\ j\neq i}}^{n}a_{ij}^{2}$
by repeatedly applying the above theorem with a sequence of rotation matrices.
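The rotation angle of a single (p,q) step follows the classical 2x2 symmetric Schur formulas of [Citation46]; the cyclic sweep below is an illustrative sketch rather than the implementation used in the paper.

```python
import numpy as np

def jacobi_eigenvalues(A, sweeps=30, tol=1e-12):
    """Diagonalize a symmetric matrix with cyclic Jacobi rotations.

    Returns (eigenvalues, V) with V^T A V approximately diagonal.
    """
    A = np.array(A, dtype=float)
    n = A.shape[0]
    V = np.eye(n)
    for _ in range(sweeps):
        off = np.sum(A**2) - np.sum(np.diag(A)**2)   # off(A): squared off-diagonal norm
        if off < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if A[p, q] == 0.0:
                    continue
                # 2x2 symmetric Schur step: choose c, s to zero A[p, q].
                tau = (A[q, q] - A[p, p]) / (2.0 * A[p, q])
                t = np.sign(tau) / (abs(tau) + np.sqrt(1.0 + tau**2)) if tau != 0 else 1.0
                c = 1.0 / np.sqrt(1.0 + t**2)
                s = t * c
                G = np.eye(n)
                G[p, p] = G[q, q] = c
                G[p, q], G[q, p] = s, -s
                A = G.T @ A @ G
                V = V @ G
    return np.diag(A), V
```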

3. Main results

In this section, we offer two iterative algorithms based on the Cholesky factorization and the Jacobi method for solving the Problem 1.1.

3.1. Smooth Cholesky factorization

In this subsection, we first review a theorem for the matrix pencil pair (A,B) [Citation48] and then extend it to the matrix-valued pencil (A(c),B(c)).

Theorem 3.1

Let $(A,B)$ be a matrix pencil pair in which $A$ is symmetric and $B$ is symmetric positive definite. Then there exists a real nonsingular matrix $X$ such that $X^{T}AX$ is a real diagonal matrix and $X^{T}BX=I$, and the eigenvalues of the matrix pencil pair $(A,B)$ are equal to the diagonal elements of $X^{T}AX$.

Since the matrices A(c) and B(c) are matrix-valued functions depending on several parameters, we present a smooth Cholesky factorization for differentiable matrix-valued functions of multiple parameters.

Let $B(c)=(b_{kl}(c))\in\mathbb{R}^{n\times n}$ be a differentiable matrix-valued function defined on $F$, where $F$ is a connected open subset of $\mathbb{R}^{n}$, so that
(8) $B(c)=B(c^{(0)})+\sum_{i=1}^{n}\frac{\partial B(c^{(0)})}{\partial c_i}\bigl(c_i-c_i^{(0)}\bigr)+O(\|c-c^{(0)}\|^{2}),$
in which
(9) $\frac{\partial B(c^{(0)})}{\partial c_i}=\Bigl(\frac{\partial b_{kl}(c)}{\partial c_i}\Big|_{c=c^{(0)}}\Bigr)\in\mathbb{R}^{n\times n}.$
Now, we present an existence result for the Cholesky factorization of the matrix $B(c)$, and then a smooth form of Theorem 3.1 for the matrix-valued pencil $(A(c),B(c))$. To this end, we need the following preparatory lemma:

Lemma 3.1

If $B(c)\in\mathbb{R}^{n\times n}$ is a differentiable matrix-valued function defined on an open connected domain $F\subset\mathbb{R}^{n}$ and all leading principal minors of $B(c)$ are nonzero for every $c\in F$, then there exist a unit lower-triangular matrix $L(c)$ and an upper-triangular matrix $U(c)$, both unique and differentiable on $F$, such that $B(c)=L(c)U(c)$. The lemma is proved by mathematical induction [Citation37].

Also, the Cholesky decomposition has continuity and smoothness properties as explained in the following theorem:

Theorem 3.2

Let $B(c)\in\mathbb{R}^{n\times n}$ be a differentiable matrix-valued function on a connected open subset $F\subset\mathbb{R}^{n}$ such that, at a given point $c^{(0)}$, $B(c^{(0)})$ is a full-rank matrix. Assume that there exists a permutation matrix $\Pi_r$ such that $\Pi_rB(c^{(0)})\Pi_r^{T}$ has a Cholesky factorization $\Pi_rB(c^{(0)})\Pi_r^{T}=H_0H_0^{T}$, where $H_0$ is a lower triangular matrix. Then there exists a neighbourhood of $c^{(0)}$, say $N(c^{(0)})$, such that for all $c\in N(c^{(0)})$ the matrix-valued function $\Pi_rB(c)\Pi_r^{T}$ has a Cholesky factorization
$\Pi_rB(c)\Pi_r^{T}=H(c)H^{T}(c),\quad c\in N(c^{(0)}),$
where $H(c)$ is a lower triangular matrix.

Proof.

From the $LDM^{T}$ factorization, there exist a diagonal matrix $D_0$ and a unit lower triangular matrix $L_0$ such that
$\Pi_rB(c^{(0)})\Pi_r^{T}=L_0D_0L_0^{T}=(L_0)(D_0L_0^{T})=(L_0)(U_0),$
where $U_0=D_0L_0^{T}$ is an upper triangular matrix. Defining $B_d(c)$ as
$B_d(c)=\sum_{k=1}^{n}\frac{\partial B(c^{(0)})}{\partial c_k}\bigl(c_k-c_k^{(0)}\bigr)+O(\|c-c^{(0)}\|^{2}),$
we have $B(c)=B(c^{(0)})+B_d(c)$ and
$L_0^{-1}\Pi_rB(c)\Pi_r^{T}=L_0^{-1}\Pi_rB(c^{(0)})\Pi_r^{T}+L_0^{-1}\Pi_rB_d(c)\Pi_r^{T}:=U_0+\tilde{B}(c),$
where
$\tilde{B}(c)=L_0^{-1}\Pi_r\Bigl(\sum_{k=1}^{n}\frac{\partial B(c^{(0)})}{\partial c_k}\bigl(c_k-c_k^{(0)}\bigr)+O(\|c-c^{(0)}\|^{2})\Bigr)\Pi_r^{T}.$
Let
$\tilde{B}(c)=\begin{bmatrix}\tilde{B}_{11}(c) & \tilde{B}_{12}(c)\\ \tilde{B}_{21}(c) & \tilde{b}_{nn}(c)\end{bmatrix},\qquad U_0=\begin{bmatrix}U_{11} & U_{12}\\ 0 & u_{nn}\end{bmatrix},$
where $\tilde{B}_{11}(c)\in\mathbb{R}^{(n-1)\times(n-1)}$ and $U_{11}\in\mathcal{U}_{n-1}$. Since the matrix-valued function $U_{11}+\tilde{B}_{11}(c)$ is invertible on a neighbourhood $N(c^{(0)})$ of $c^{(0)}$, and all of its leading principal minors are nonzero for any $c\in N(c^{(0)})$, we can define the matrix-valued function $L_r(c)$ as
$L_r(c)=\begin{bmatrix}I & 0\\ \tilde{B}_{21}(c)\bigl(U_{11}+\tilde{B}_{11}(c)\bigr)^{-1} & 1\end{bmatrix}.$
A straightforward computation shows that
$L_r(c)^{-1}L_0^{-1}\Pi_rB(c)\Pi_r^{T}=\begin{bmatrix}\tilde{U}_{11}(c) & \tilde{U}_{12}(c)\\ 0 & u_{nn}(c)\end{bmatrix}.$
It follows from Lemma 3.1 that there exists an LU decomposition of the matrix $\tilde{U}_{11}(c)$,
$\tilde{U}_{11}(c)=\tilde{L}_l(c)\tilde{U}_l(c),$
where $\tilde{L}_l(c)$ is unit lower-triangular and $\tilde{U}_l(c)$ is upper-triangular, both unique.

Setting
$L_\pi(c)=\begin{bmatrix}\tilde{L}_l(c) & 0\\ 0 & 1\end{bmatrix},\qquad U_\pi(c)=\begin{bmatrix}\tilde{U}_l(c) & \tilde{L}_l(c)^{-1}\tilde{U}_{12}(c)\\ 0 & u_{nn}(c)\end{bmatrix},$
let $L(c)=L_0L_r(c)L_\pi(c)$ and $U(c)=U_\pi(c)$. Then $L(c^{(0)})=L_0$, $U(c^{(0)})=U_0$, and $L(c)U(c)$ is the LU decomposition of $\Pi_rB(c)\Pi_r^{T}$,
$\Pi_rB(c)\Pi_r^{T}=L(c)U(c),\quad c\in N(c^{(0)}).$
Define the matrix-valued function $D(c)$ as $D(c)=\mathrm{diag}(u_{11}(c),u_{22}(c),\ldots,u_{nn}(c))$, where $u_{ii}(c)$, $i=1,2,\ldots,n$, are the diagonal entries of $U(c)$; then
$\Pi_rB(c)\Pi_r^{T}=L(c)U(c)=L(c)D(c)\bigl(D(c)^{-1}U(c)\bigr).$
Setting $M^{T}(c)=D(c)^{-1}U(c)$, we obtain $\Pi_rB(c)\Pi_r^{T}=L(c)D(c)M^{T}(c)$ and, by the symmetry of $\Pi_rB(c)\Pi_r^{T}$, $M(c)=L(c)$, so that
$\Pi_rB(c)\Pi_r^{T}=\bigl(L(c)D(c)^{1/2}\bigr)\bigl(D(c)^{1/2}L^{T}(c)\bigr).$
Let $H(c)=L(c)D^{1/2}(c)$; then $\Pi_rB(c)\Pi_r^{T}$ possesses the Cholesky decomposition
$\Pi_rB(c)\Pi_r^{T}=H(c)H^{T}(c),\quad c\in N(c^{(0)}).$
This completes the proof.

Corollary 3.1

Let $A(c),B(c)\in\mathbb{R}^{n\times n}$ be differentiable matrix-valued functions defined on a connected open subset $F\subset\mathbb{R}^{n}$ such that, at a given point $c^{(0)}\in F$, $\mathrm{rank}(B(c^{(0)}))\geq n-1$. Assume that $A(c^{(0)})$ is symmetric, $B(c^{(0)})$ is symmetric positive definite, and there exists a lower triangular matrix $H(c^{(0)})$ such that $B(c^{(0)})=H(c^{(0)})H^{T}(c^{(0)})$ is a Cholesky factorization of $B(c^{(0)})$. Then there exists a neighbourhood of $c^{(0)}$, say $N(c^{(0)})$, such that for all $c\in N(c^{(0)})$ there exists a nonsingular matrix-valued function $X(c)$ for which $X^{T}(c)A(c)X(c)$ is a real diagonal matrix and $X^{T}(c)B(c)X(c)=I$, and the eigenvalues of the matrix pencil pair $(A(c),B(c))$ are equal to the diagonal elements of $X^{T}(c)A(c)X(c)$.

3.2. Two algorithms based on the smooth Cholesky factorization

In this subsection, we present two algorithms based on the Cholesky factorization for Problem 1.1. First, we construct a system of nonlinear equations that is equivalent to Problem 1.1. To this end, we begin by computing the Cholesky factorization of $B(c)$ with complete pivoting,
(10) $\Pi^{T}B(c)\Pi=H(c)H^{T}(c),$
where $\Pi$ is a permutation matrix and $H$ is a lower triangular matrix. We know that
(11) $H^{-1}\Pi^{T}A(c)\Pi H^{-T}-\lambda H^{-1}\Pi^{T}B(c)\Pi H^{-T}=H^{-1}\Pi^{T}A(c)\Pi H^{-T}-\lambda I.$
Since $H^{-1}\Pi^{T}A(c)\Pi H^{-T}$ is symmetric, the Jacobi method yields an orthogonal matrix $Q=Q_1Q_2\cdots Q_k$ such that
(12) $Q^{T}H^{-1}\Pi^{T}A(c)\Pi H^{-T}Q=D_A,$
and, setting $P=\Pi H^{-T}Q$, we get
(13) $P^{T}A(c)P=D_A,\qquad P^{T}B(c)P=I,$
where
(14) $D_A=\begin{bmatrix}d_{11}(c) & 0 & \cdots & 0\\ 0 & d_{22}(c) & \cdots & 0\\ \vdots & & \ddots & \vdots\\ 0 & 0 & \cdots & d_{nn}(c)\end{bmatrix}.$
The diagonal elements of $D_A$ are not necessarily in ascending order, but this can be achieved by a simple sorting step or by using a sorted Jacobi algorithm [Citation49]. So let us assume $d_{11}(c)\leq d_{22}(c)\leq\cdots\leq d_{nn}(c)$. Based on Corollary 3.1, the pencils $A(c)-\lambda B(c)$ and $D_A-\lambda I$ are equivalent and therefore have the same eigenvalues. As a result, the generalized eigenvalue problem $A(c)x=\lambda B(c)x$ has the eigenvalues $\lambda_1,\lambda_2,\ldots,\lambda_n$ if and only if $d_{ii}(c)-\lambda_i=0$, $i=1,2,\ldots,n$. We therefore obtain a system of nonlinear equations for solving Problem 1.1:
(15) $F(c)=\begin{bmatrix}d_{11}(c)-\lambda_1\\ d_{22}(c)-\lambda_2\\ \vdots\\ d_{nn}(c)-\lambda_n\end{bmatrix}=0.$
We use Newton's method to solve the system of nonlinear equations (15). Suppose the current iterate $c^{(k)}$ is sufficiently close to a solution of the nonlinear system (15); then one step of Newton's method for the solution of (15) has the form
(16) $J_F(c^{(k)})\bigl(c^{(k+1)}-c^{(k)}\bigr)=-F(c^{(k)}),$
where the Jacobian matrix $J_F(c)$ has the form
(17) $J_F(c)=\begin{bmatrix}\dfrac{\partial d_{11}(c)}{\partial c_1} & \cdots & \dfrac{\partial d_{11}(c)}{\partial c_n}\\ \dfrac{\partial d_{22}(c)}{\partial c_1} & \cdots & \dfrac{\partial d_{22}(c)}{\partial c_n}\\ \vdots & & \vdots\\ \dfrac{\partial d_{nn}(c)}{\partial c_1} & \cdots & \dfrac{\partial d_{nn}(c)}{\partial c_n}\end{bmatrix},$
in which, using (13) and the results of Theorem 2.1 of [Citation50], the Jacobian matrix $J_F(c)$ has elements
(18) $\frac{\partial d_{ii}(c)}{\partial c_j}=p_i^{T}(c)\Bigl(\frac{\partial A(c)}{\partial c_j}-d_{ii}(c)\frac{\partial B(c)}{\partial c_j}\Bigr)p_i(c),\quad i,j=1,2,\ldots,n,$
where $p_i(c)$ denotes the $i$th column of $P$. Clearly, from the definition of the matrices $A(c)$ and $B(c)$ in (1), we obtain
(19) $\frac{\partial d_{ii}(c)}{\partial c_j}=p_i^{T}(c)\bigl(A_j-d_{ii}(c)B_j\bigr)p_i(c).$
Thus, this method for solving Problem 1.1 can be summarized as follows:
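The pieces of this formulation fit together in a few lines. The sketch below is an illustrative reimplementation that delegates the pivoted Cholesky and Jacobi steps to numpy.linalg routines (cholesky without pivoting and eigh in place of Jacobi sweeps); it returns F(c) of (15), the Jacobian (19), the diagonal of D_A and the transformation P.

```python
import numpy as np

def F_and_jacobian(c, A_list, B_list, target):
    """Evaluate F(c) of (15) and the Jacobian (19) for Problem 1.1.

    A_list = [A_0, ..., A_n], B_list = [B_0, ..., B_n]; target holds the
    prescribed eigenvalues in ascending order.
    """
    A = A_list[0] + sum(ci * Ai for ci, Ai in zip(c, A_list[1:]))
    B = B_list[0] + sum(ci * Bi for ci, Bi in zip(c, B_list[1:]))
    H = np.linalg.cholesky(B)                          # B = H H^T (no pivoting here)
    M = np.linalg.solve(H, np.linalg.solve(H, A).T).T  # H^{-1} A H^{-T}, symmetric
    d, Q = np.linalg.eigh(M)                           # ascending; plays the role of Jacobi
    P = np.linalg.solve(H.T, Q)                        # P = H^{-T} Q: P^T A P = D_A, P^T B P = I
    F = d - np.asarray(target)
    n = len(c)
    J = np.empty((len(d), n))
    for j in range(n):                                 # (19): dd_ii/dc_j = p_i^T (A_j - d_ii B_j) p_i
        for i in range(len(d)):
            pi = P[:, i]
            J[i, j] = pi @ (A_list[j + 1] - d[i] * B_list[j + 1]) @ pi
    return F, J, d, P
```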

Algorithm 3.1

The algorithm for finding a solution of Problem 1.1

Input: Given matrices $\{A_i\}_{i=0}^{n}$ and $\{B_i\}_{i=0}^{n}$, eigenvalues $\lambda_1,\lambda_2,\ldots,\lambda_n$ and an initial guess $c^{(0)}\geq 0$

Output: Computed solution c(k+1)

For $k=0,1,2,\ldots$ until the iteration sequence $\{c^{(k)}\}_{k=0}^{\infty}$ is convergent,

Step 1.

Compute A(c(k)) and B(c(k));

Step 2.

Compute the Cholesky factorization of $B(c^{(k)})$ with complete pivoting, $\Pi_rB(c^{(k)})\Pi_r^{T}=H(c^{(k)})H^{T}(c^{(k)})$, where $H(c^{(k)})$ is a lower triangular matrix;

Step 3.

Find the rotation matrices $Q_1,Q_2,\ldots,Q_k$ such that $\bigl(Q_k^{T}\cdots Q_1^{T}\bigr)H^{-1}(c^{(k)})\Pi_r^{T}A(c^{(k)})\Pi_rH^{-T}(c^{(k)})\bigl(Q_1\cdots Q_k\bigr)=D_A(c^{(k)})$, where $D_A(c^{(k)})$ is a diagonal matrix;

Step 4.

Compute $F(c^{(k)})$ using (15);

Step 5.

If $\|F(c^{(k)})\|_{2}^{2}=\sum_{i=1}^{n}\bigl(d_{ii}(c^{(k)})-\lambda_i\bigr)^{2}$ is small enough, then stop; otherwise go to the next step;

Step 6.

Compute the Jacobian matrix $J_F(c^{(k)})$ using (17);

Step 7.

Find $c^{(k+1)}$ by solving the linear system (16);

Step 8.

Go to Step 1.
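A compact sketch of the Newton iteration in Algorithm 3.1 follows; it reuses the F_and_jacobian helper sketched after (19), so it only illustrates the flow of Steps 1–8 rather than reproducing the authors' implementation.

```python
import numpy as np

def newton_pgiep(c0, A_list, B_list, target, tol=1e-12, max_iter=50):
    """Algorithm 3.1 (sketch): Newton iteration on F(c) of (15)."""
    c = np.asarray(c0, dtype=float)
    for _ in range(max_iter):
        F, J, _, _ = F_and_jacobian(c, A_list, B_list, target)
        if np.linalg.norm(F) <= tol:          # Step 5: stopping test
            break
        c = c + np.linalg.solve(J, -F)        # Steps 6-7: Newton step (16)
    return c
```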

Let Problem 1.1 have a solution $c^{*}$. We conclude that if the eigenvalues $\lambda_1(c),\lambda_2(c),\ldots,\lambda_n(c)$ depend smoothly on $c$ near $c=c^{*}$ and the Jacobian at $c^{*}$ is nonsingular, then Newton's method as given in Algorithm 3.1 converges.

Theorem 3.3

Suppose that Problem 1.1 has a solution $c^{*}$, that $\lambda_1,\lambda_2,\ldots,\lambda_n$ are given, and that $\lambda=\lambda(c^{*})$. Assume that the Jacobian matrix $J_F(c^{*})$ with elements as in (17) is nonsingular. Then there is a neighbourhood $N(c^{*})$ such that, for all $c^{(0)}\in N(c^{*})$, Algorithm 3.1 generates a well-defined sequence $c^{(k)}$ for which $c^{(k)}\to c^{*}$, and the convergence is locally quadratic.

Our numerical experiments with Algorithm 3.1 illustrate that quadratic convergence is indeed obtained in practice, but the strong hypotheses of Theorem 3.3, and the fact that the Cholesky factorization of the matrix $B(c)$ may not exist when some $c_j<0$, leave some degree of uncertainty about its performance. We therefore provide another formulation. In the first step, this formulation leads to a constrained optimization problem, which we then solve by using an improved BFGS method. Many algorithms have been proposed for solving constrained optimization problems. BFGS is a quasi-Newton approximation of Newton's method for nonlinear optimization problems, named after Broyden, Fletcher, Goldfarb and Shanno. In addition, Byrd et al. proposed a version of the L-BFGS algorithm for bound-constrained optimization, namely L-BFGS-B [Citation51].

In L-BFGS-B algorithm the idea of limited memory matrices to approximate the Hessian of the objective function [Citation52] is used. This algorithm does not require second derivatives or knowledge of the structure of the objective function and can therefore be feasible when the Hessian matrix is not practical to compute [Citation53].

Therefore, we form a constrained optimization problem of the form
(20) $\min h(c),\qquad h(c)=\|g(c)\|_{2}^{2},$
subject to $c_j\geq 0$, $j=1,2,\ldots,n$, where
(21) $g(c)=\begin{bmatrix}d_{11}(c)-\lambda_1\\ d_{22}(c)-\lambda_2\\ \vdots\\ d_{nn}(c)-\lambda_n\end{bmatrix}.$
Let the current iterate $c^{(k)}$ be sufficiently close to a solution $c^{*}$ of the nonlinear optimization problem (20). From (21), (12) and Corollary 3.1, we know that the functions $d_{ii}(c)-\lambda_i$ $(i=1,2,\ldots,n)$ are twice continuously differentiable at $c^{(k)}$, and their derivatives with respect to $c_j$ can be expressed as
(22) $\frac{\partial d_{ii}(c^{(k)})}{\partial c_j}=p_i^{T}\bigl(A_j-d_{ii}(c^{(k)})B_j\bigr)p_i.$
Therefore, by the definition of the function $h$, its gradient is
(23) $\nabla h(c)=2J(c)^{T}g(c),$
where $J_{ij}(c)=\partial d_{ii}(c)/\partial c_j$.
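Bound-constrained quasi-Newton solvers expect the objective value and the gradient together at each iterate, so in an implementation the two are returned as a pair. The sketch below (illustrative, building on the F_and_jacobian helper sketched above) packages h(c) = ||g(c)||² and its gradient 2 J(c)ᵀ g(c).

```python
import numpy as np

def objective_and_gradient(c, A_list, B_list, target):
    """Return h(c) = ||g(c)||_2^2 and its gradient 2 J(c)^T g(c) for (20)."""
    g, J, _, _ = F_and_jacobian(c, A_list, B_list, target)  # g(c) coincides with F(c) of (15)
    return float(g @ g), 2.0 * J.T @ g
```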

Now, we use the L-BFGS-B method to solve the nonlinear optimization problem (20) and obtain an iterative method for solving Problem 1.1. Considering the current iterate $c^{(k)}$ at the $k$th step of the L-BFGS-B method, the objective function is approximated by a quadratic model at the point $c^{(k)}$ of the form
(24) $m_k(c)=h(c^{(k)})+\nabla h(c^{(k)})^{T}(c-c^{(k)})+\tfrac{1}{2}(c-c^{(k)})^{T}H(c^{(k)})(c-c^{(k)}),$
where $H(c^{(k)})$ is the limited-memory BFGS matrix that approximates the Hessian at the point $c^{(k)}$. The algorithm then minimizes $m_k$ subject to the given bounds [Citation51]. This is done by finding an active set of bounds using the gradient projection method and minimizing $m_k$, treating those bounds as equality constraints.

To do this, we first find the generalized Cauchy point $c^{c}$, defined as the first local minimizer of $m_k(c(t))$ on the piecewise linear path $c(t)=P(c^{(k)}-t\nabla h(c^{(k)}),0)$, where
(25) $P(c,0)_i=\begin{cases}0 & \text{if }c_i<0,\\ c_i & \text{if }c_i\geq 0.\end{cases}$
Then, taking the variables whose values at $c^{c}$ are at their lower or upper bounds to form the active set $\mathcal{A}(c^{c})$, the following quadratic problem over the subspace of free variables is considered:
(26) $\min\{m_k(c):c_i=c_i^{c},\ i\in\mathcal{A}(c^{c})\}\quad\text{subject to}\quad c_i\geq 0,\ i\notin\mathcal{A}(c^{c}).$
In [Citation51], an algorithm is presented for the computation of the generalized Cauchy point. Three approaches for minimizing $m_k$ over the space of free variables are also introduced, namely a primal iterative method using the conjugate gradient method, a direct primal method based on the Sherman-Morrison-Woodbury formula, and a direct dual method using Lagrange multipliers. Global convergence of the L-BFGS algorithm is established in [Citation54], and similar analyses are possible for the L-BFGS-B algorithm [Citation52]. More details of the L-BFGS-B algorithm are presented in [Citation54].
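The projection (25) onto the nonnegative orthant is the only problem-specific ingredient needed for the stopping test of Algorithm 3.2 below; a one-line sketch (illustrative naming):

```python
import numpy as np

def project_nonneg(c):
    """P(c, 0) of (25): componentwise projection onto c_i >= 0."""
    return np.maximum(c, 0.0)

# Projected-gradient optimality measure used as the stopping test in Step 6:
# norm(project_nonneg(c - grad) - c) is small at a bound-constrained stationary point.
```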

Using the L-BFGS-B algorithm, an iterative method based on the Cholesky decomposition for solving Problem 1.1 is given as follows:

Algorithm 3.2

The algorithm for finding a solution of Problem 1.1

Input: Given matrices $\{A_i\}_{i=0}^{n}$ and $\{B_i\}_{i=0}^{n}$, eigenvalues $\lambda_1,\lambda_2,\ldots,\lambda_n$ and an initial guess $c^{(0)}\geq 0$

Output: Computed solution c(k+1)

For $k=0,1,2,\ldots$ until the iteration sequence $\{c^{(k)}\}_{k=0}^{\infty}$ is convergent,

Step 1.

Compute A(c(k)) and B(c(k));

Step 2.

Compute the Cholesky factorization of $B(c^{(k)})$ with complete pivoting, $\Pi_rB(c^{(k)})\Pi_r^{T}=H(c^{(k)})H^{T}(c^{(k)})$, where $H(c^{(k)})$ is a lower triangular matrix;

Step 3.

Find the rotation matrices $Q_1,Q_2,\ldots,Q_k$ such that $\bigl(Q_k^{T}\cdots Q_1^{T}\bigr)H^{-1}(c^{(k)})\Pi_r^{T}A(c^{(k)})\Pi_rH^{-T}(c^{(k)})\bigl(Q_1\cdots Q_k\bigr)=D_A(c^{(k)})$, where $D_A(c^{(k)})$ is a diagonal matrix;

Step 4.

Compute $g(c^{(k)})$ and $J(c^{(k)})$ using (21) and (22);

Step 5.

Compute $\nabla h(c^{(k)})$ using (23);

Step 6.

If $\|P(c^{(k)}-\nabla h(c^{(k)}),0)-c^{(k)}\|<\epsilon$ is satisfied (where $\epsilon$ is small enough), then stop; otherwise go to the next step;

Step 7.

Find $c^{(k+1)}$ by using the L-BFGS-B algorithm;

Step 8.

Go to Step 2.
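In practice, Steps 5–7 are handled by an off-the-shelf L-BFGS-B implementation. The sketch below is an illustrative driver (not the authors' code) that passes the objective and gradient of (20), via the objective_and_gradient helper sketched earlier, to SciPy's bound-constrained L-BFGS-B solver; the tolerance settings are arbitrary choices.

```python
import numpy as np
from scipy.optimize import minimize

def solve_pgiep_lbfgsb(c0, A_list, B_list, target, tol=1e-8):
    """Minimize h(c) = ||g(c)||_2^2 subject to c >= 0 with L-BFGS-B."""
    result = minimize(
        objective_and_gradient,          # returns (h, grad), so jac=True below
        x0=np.asarray(c0, dtype=float),
        args=(A_list, B_list, target),
        method="L-BFGS-B",
        jac=True,
        bounds=[(0.0, None)] * len(c0),  # c_j >= 0 as in (20)
        options={"ftol": tol, "gtol": tol},
    )
    return result.x, result.fun
```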

In the next section, we illustrate the behaviour of Algorithms 3.1 and 3.2 with numerical examples.

4. Numerical experiments

In this section, we present three numerical experiments to examine the convergence of Algorithms 3.1 and 3.2 and to show the effectiveness of these algorithms for iteratively computing a solution of Problem 1.1. All codes were run in MATLAB. In our implementations, the iterations of Algorithm 3.1 were terminated when the current iterate satisfied $\|F(c^{(k)})\|\leq 10^{-12}$, and in Algorithm 3.2 the iterations were stopped when $\|g(c^{(k)})\|_{2}^{2}$ was less than $10^{-8}$.

Example 4.1

In this example, we have a parameterized inverse eigenvalue problem with n = 5,
$A_0=\mathrm{diag}(9,11,10,8,14),\qquad B_0=\mathrm{diag}(11,13,15,11,10),\qquad A_1=B_1,$
$A_2=\begin{bmatrix}0&2&0&0&0\\2&0&1&0&0\\0&1&0&1&0\\0&0&1&0&1\\0&0&0&1&0\end{bmatrix},\qquad B_2=\begin{bmatrix}0&1&0&0&0\\1&0&1&0&0\\0&1&0&1&0\\0&0&1&0&1\\0&0&0&1&0\end{bmatrix},$
$A_3=\begin{bmatrix}0&0&3&0&0\\0&0&0&2&0\\3&0&0&0&1\\0&2&0&0&0\\0&0&1&0&0\end{bmatrix},\qquad A_4=\begin{bmatrix}0&0&0&1&0\\0&0&0&0&1\\0&0&0&0&0\\1&0&0&0&0\\0&1&0&0&0\end{bmatrix},$
$B_3=\begin{bmatrix}0&0&1&0&0\\0&0&0&1&0\\1&0&0&0&1\\0&1&0&0&0\\0&0&1&0&0\end{bmatrix},\qquad B_4=\begin{bmatrix}0&0&0&2&0\\0&0&0&0&1\\0&0&0&0&0\\2&0&0&0&0\\0&1&0&0&0\end{bmatrix},$
$A_5=\begin{bmatrix}0&0&0&0&1\\0&0&0&0&0\\0&0&0&0&0\\0&0&0&0&0\\1&0&0&0&0\end{bmatrix}=B_5,$
$A(c)=A_0+\sum_{i=1}^{5}c_iA_i,\qquad B(c)=B_0+\sum_{i=1}^{5}c_iB_i.$

The eigenvalues are defined to be $\lambda=(0.43278721102, 0.66366274839, 0.94385900467, 1.10928454002, 1.49235323254)^{T}$. Algorithm 3.1 is used to find the unknown vector $c$. Starting from the vectors
$c^{(0)}=\begin{cases}(1.25,1.15,1.05,0.9,1.05)^{T}, & \text{Case (a)},\\ (1.15,1.15,1.05,0.075,1.05)^{T}, & \text{Case (b)},\\ (1.1,1.2,1.3,1.4,1.5)^{T}, & \text{Case (c)},\end{cases}$
the algorithm converges to the same solution $c^{(*)}=(1,1,1,1,1)^{T}$. In addition, we solved this example using Algorithm 3.2 with these starting vectors and with $c^{(0)}=(1,2,3,4,5)^{T}$, and we obtained the same solution.

Our results show that linear convergence to the same solution is also achieved from the starting vector $c^{(0)}=(1,2,3,4,5)^{T}$, and the number of iterations with this starting vector is 24. The numerical results for Algorithms 3.1 and 3.2 are displayed in Tables 1 and 2, respectively. For simplicity, only every second iterate is displayed in Table 2. In Table 1, we also compare the numerical accuracy of Algorithm 3.1 and Algorithm 4.1 of [Citation37].

Table 1. Numerical results for Example 4.1 by using Algorithm 3.1 and 4.1 in [Citation37].

Table 2. Numerical results for Example 4.1 by using the Algorithm 3.2.

Example 4.2

In this example, a parameterized generalized inverse eigenvalue problem is presented which n = 6, λ=(1.46162105,1.5,1.5158215835,3.23414222485,19.2978957724,33.8769603728)T and A0=(216889.2135245.4141.6858.12889.266.3483.75820.39413.121598.29135483.75204.3752.425131.5541.325245.4820.392.425158.495367.365693.021141.6413.12131.5367.365209.035154.057858.121598.290541.325693.021316.63),B0=I,B1=diag[43,2222995816,43.5245534248,43.2978630424,43.6484775005,43.3099531961,43.6635901927], matrices Ak and Bk for k=1,2,3,,6 are determined Ak=ukukT,k=1,2,,6,Bk=j=k6vk1,j(ek1ejT+ejeTek1T),k=1,2,,6, where u1=(01200.500.2),u2=(1210.50.40.20.1),u3=(001210.50.4),u4=(0001200.5),u5=(0000120.1),u6=(0000012),v=(00.84675009590.42337504790.33870003840.846750095900.22465130952.87553676170.42337504790.224651309500.90045084460.33870003842.87553676170.900450844600.16935001920.08986052380.45022542230.60970952310.08467500961.12325654750.35796611456.87500048510.16935001920.08467500960.08986052381.12325654750.45022542230.35796611450.60970952316.875000485100.69119656330.69119656330), Algorithms 3.1 and 3.2, on the assumption that the starting vector c(0)=(3,15,3,15,1,18)T, converges to a solution c()=(3.311692,14.030946,2.226167,13.539465,0.953318,17.892122)T. Table  displays the residual for these two algorithms. Also, considering the starting vector c(0)=(3,13,3,13,3,13)T, Algorithm 3.2 converges the same solution. We report the obtained results in Table  (for convenience, only every second iterate is displayed).

Table 3. Numerical results for Example 4.2 by using Algorithms 3.1 and 3.2.

Table 4. Numerical results for Example 4.2 by using Algorithm 3.2.

Example 4.3

As the Third example, we consider a 10-bar truss problem and present some computational results to show the usefulness of the Problem 1.1 in structural design. Each bar consists of the following parameters: Young's modulus E=6.95×1010N/m2, eight density P=2650kg/m3, acceleration of gravity g=9.81m/s2, non-structural mass at all nodes m0=425kg and the length l=10m of horizontal and vertical bars. In this problem, the design variables are the areas of cross section of the bars for which the seventh and the eighth bars are fixed with the values 0.000865m2 and 0.000165m2. The stiffness and the mass matrices of the structure can be expreseed respectively as, A(c)=A0+i=18ciAi,B(c)=B0+i=18ciBi.A0=EL[0000000000000000000.0001650000.00016500000000000000000000000.00016500.000165000.0001650000.0001650000000.00016500.000165],A1=EL[1000000000000000000000000000000000000000000000000000000000000000],A2=EL[0000000001010000000000000101000000000000000000000000000000000000],A3=EL[0000000000000000001000000000000000000000000000000000000000000000],A4=EL[000000000000000000aa000000aa000000000000000000000000000000000000],A5=EL[aa000000aa000000001000000000000000000000000000000000000000000000],A6=EL[1000100000000000000000000000000010001000000000000000000000000000],A7=EL[aa0000aaaa0000aa00000000000000000000000000000000aa000000aa000000],A8=EL[000000000000000000aaaa0000aaaa0000aaaa0000aaaa000000000000000000], with a=122,b=pl/6g B0=425I8+b[0000000000000000000.000330000.0001650000000000000000.000330000000.00017300.000865000.0001650000.0001650000000.00086500.000173],B1=b[2000000000000000000000000000000000000000000000000000000000000000],B2=b[000000000201000000000000010200000000000000000000000000000000000],B3=b[0000000000000000002000000000000000000000000000000000000000000000],B4=b[0000000000000000002200000022000000000000000000000000000000000000],B5=b[2200000022000000001000000000000000000000000000000000000000000000],B6=b[2000100000000000000000000000000010002000000000000000000000000000],B7=2b[aa0000aaaa0000aa00000000000000000000000000000000aa000000aa000000],B8=2b[000000000000000000110.50.50000110.50.500000.50.51100000.50.511000000000000000000], We have to determine the areas of cross sections of the bars such that the given eigenvalues of the structure are λi=(2πωi)(i=1,2,,8), where ωi=5i(i=1,2,,8) are the specified natural frequencies of the structure.

We apply Algorithm 3.1 with the starting vectors
$c^{(0)}=\begin{cases}10^{-3}\times(1.7,0.4,1.7,1.6,0.7,1.1,0.4,1.1)^{T}, & \text{Case (a)},\\ 10^{-3}\times(1.7,0.4,1.6,1.6,0.6,1,0.5,1.3)^{T}, & \text{Case (b)},\\ 10^{-3}\times(2,0.3,1.6,1.7,0.7,0.9,0.6,1.3)^{T}, & \text{Case (c)};\end{cases}$
both Algorithm 3.1 and Algorithm 4.1 of [Citation37] correspondingly converge to the solutions
$c^{(*)}=\begin{cases}10^{-3}\times(1.702,0.420,1.700,1.606,0.702,1.120,0.439,1.115)^{T}, & \text{Case (a)},\\ 10^{-3}\times(1.714,0.385,1.585,1.569,0.732,1.075,0.459,1.289)^{T}, & \text{Case (b)},\\ 10^{-3}\times(1.955,0.264,1.489,1.758,0.664,0.945,0.647,1.286)^{T}, & \text{Case (c)}.\end{cases}$
We present the results in Table 5. It is known that there may be as many as $n!$ different solutions (see [Citation55]) and, in practice, one may choose among them the solution that is optimal in a certain sense. We also use Algorithm 3.2 with starting vectors parameterized by their first component,
$c^{(0)}=(c_1^{(0)},0.0003,0.0015,0.0017,0.0007,0.0009,0.0006,0.00136)^{T},$
and our experiments and numerical results show that Algorithm 3.2 always converges to the same solution when $0.01\leq c_1^{(0)}\leq 0.029$. In Table 6, the numerical results for Algorithm 3.2 are shown for the two choices $c_1^{(0)}=0.01$ and $c_1^{(0)}=0.02$. However, our implementation showed that the sequence generated by Algorithm 4.1 fails to converge in these cases and in many others.

Table 5. Numerical results for Example 4.3 by using Algorithm 3.1 and 4.1 in [Citation37].

Table 6. Numerical results for Example 4.3 by using Algorithm 3.2.

From the above three examples, we can see that Algorithms 3.1 and 3.2 are feasible for solving Problem 1.1.

On the other hand, a potential weakness of numerical methods for solving optimization problems such as problem (20) is their sensitivity to the initial guess. To alleviate this weakness, we consider a metaheuristic method as follows.

The particle swarm optimizer (PSO) is a stochastic population-based optimization algorithm inspired by social cooperative and competitive behaviour such as bird flocking and fish schooling. PSO was proposed by Kennedy and Eberhart [Citation56,Citation57] in 1995. Recently, PSO has emerged as a promising algorithm for solving various optimization problems in engineering and science [Citation58–60]. In PSO, a swarm consists of a number of particles, each of which represents a potential solution of the optimization task. Each particle moves to a new position according to its new velocity and its previous positions. PSO shows a powerful ability to find optimal solutions and is known for its low computational complexity, ease of implementation and the few parameters that need to be adjusted.

Each new position is evaluated by the cost function and compared with the best position the particle has generated so far, and the better one is kept. In this way, each particle accelerates in the direction not only of its local best solution but also of the global best position. If a particle discovers a new promising solution, the other particles move closer to it to explore the region more completely [Citation61]. In other words, the algorithm is guided by the personal experience (pbest), the overall experience (gbest) and the present movement of the particles to decide their next positions in the search space.

Let $S$ denote the swarm size (initial population) in PSO and $D$ denote its dimension. In general, each particle in the search space is described by three attributes: its current position $x_i$, its current velocity $v_i$ and its past best position $pbest_i$. Each particle in the swarm is iteratively updated according to these attributes. Assuming that $h$ is the objective function of the optimization problem and is to be minimized, the new velocity of every particle is updated by
(27) $v_{i,j}(r+1)=v_{i,j}(r)+\gamma_1\omega_{1i,j}(r)\bigl[pbest_{i,j}(r)-x_{i,j}(r)\bigr]+\gamma_2\omega_{2i,j}(r)\bigl[gbest_{j}(r)-x_{i,j}(r)\bigr],$
and the new position of a particle is calculated as
(28) $x_{i,j}(r+1)=x_{i,j}(r)+v_{i,j}(r+1),$
where $v_{i,j}$ is the velocity of the $j$th dimension of the $i$th particle for $i=1,2,\ldots,S$, $j=1,2,\ldots,D$, $\gamma_1$ and $\gamma_2$ denote the acceleration coefficients, $\omega_1$ and $\omega_2$ are elements of two uniform random sequences in the range $(0,1)$, and $r$ is the generation number. The past best position of each particle is updated by
$pbest_i(r+1)=\begin{cases}pbest_i(r), & \text{if }h(x_i(r+1))\geq h(pbest_i(r)),\\ x_i(r+1), & \text{if }h(x_i(r+1))<h(pbest_i(r)),\end{cases}$
and the global best position $gbest$ found by all particles during the previous steps is defined as
(29) $gbest(r+1)=\arg\min_{pbest_i}h(pbest_i(r+1)),\quad 1\leq i\leq S.$
Here, by presenting an example, we show that the PSO method can be used to determine the solution, or an initial guess, for problem (20).
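A bare-bones version of updates (27)–(29) is sketched below; it is a generic PSO loop under the stated update rules, and the swarm size, acceleration coefficients, box constraints and random seed are illustrative choices rather than the paper's settings.

```python
import numpy as np

def pso_minimize(h, dim, lo, hi, swarm=100, iters=200, g1=2.0, g2=2.0, seed=0):
    """Minimize h over [lo, hi]^dim with the basic PSO updates (27)-(29)."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(lo, hi, size=(swarm, dim))        # positions
    v = np.zeros((swarm, dim))                        # velocities
    pbest = x.copy()
    pbest_val = np.array([h(p) for p in x])
    gbest = pbest[np.argmin(pbest_val)].copy()
    for _ in range(iters):
        w1 = rng.uniform(size=(swarm, dim))           # omega_1, omega_2 in (27)
        w2 = rng.uniform(size=(swarm, dim))
        v = v + g1 * w1 * (pbest - x) + g2 * w2 * (gbest - x)   # velocity update (27)
        x = np.clip(x + v, lo, hi)                    # position update (28), kept in the box
        val = np.array([h(p) for p in x])
        better = val < pbest_val                      # pbest update
        pbest[better], pbest_val[better] = x[better], val[better]
        gbest = pbest[np.argmin(pbest_val)].copy()    # gbest update (29)
    return gbest, pbest_val.min()
```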

Example 4.4

In this example, we have a parameterized inverse eigenvalue problem with n = 5,
$A_0=\mathrm{diag}(9,11,10,8,14),\qquad B_0=\mathrm{diag}(11,13,15,11,10),\qquad A_1=B_1,$
$A_2=\begin{bmatrix}0&2&0&0&0\\2&0&1&0&0\\0&1&0&1&0\\0&0&1&0&1\\0&0&0&1&0\end{bmatrix},\qquad B_2=\begin{bmatrix}0&1&0&0&0\\1&0&1&0&0\\0&1&0&1&0\\0&0&1&0&1\\0&0&0&1&0\end{bmatrix},$
$A_3=\begin{bmatrix}0&0&3&0&0\\0&0&0&2&0\\3&0&0&0&1\\0&2&0&0&0\\0&0&1&0&0\end{bmatrix},\qquad B_3=\begin{bmatrix}0&0&1&0&0\\0&0&0&1&0\\1&0&0&0&1\\0&1&0&0&0\\0&0&1&0&0\end{bmatrix},$
$A_4=\begin{bmatrix}0&0&0&1&0\\0&0&0&0&1\\0&0&0&0&0\\1&0&0&0&0\\0&1&0&0&0\end{bmatrix},\qquad B_4=\begin{bmatrix}0&0&0&2&0\\0&0&0&0&1\\0&0&0&0&0\\2&0&0&0&0\\0&1&0&0&0\end{bmatrix},$
$A_5=B_5=\begin{bmatrix}0&0&0&0&1\\0&0&0&0&0\\0&0&0&0&0\\0&0&0&0&0\\1&0&0&0&0\end{bmatrix},$
$A(c)=A_0+\sum_{i=1}^{n}c_iA_i,\qquad B(c)=B_0+\sum_{i=1}^{n}c_iB_i.$
The eigenvalues are given by $\lambda=(0.43278721102, 0.66366274839, 0.94385900467, 1.10928454002, 1.49235323254)^{T}$.

In this example, the PSO method is run 20 times, each time with a random initial population of size 100 within the interval [0,5]. The results of these runs are reported in Table 7, where k is the index of the run. Figure 1 shows the corresponding behaviour of the objective function.

Figure 1. PSO convergence characteristic for Example 4.4.


Table 7. Numerical results for Example 4.4 by using PSO method for 20 times.

This example demonstrates that the PSO method is very effective for finding an initial guess.

5. Conclusions

In this work, two iterative methods based on the Cholesky decomposition and the Jacobi method were presented for solving problems that appear in vibrating system design and the design of control systems. In these methods, we first applied the Cholesky decomposition and rotation matrices to create a system of nonlinear equations or a constrained optimization problem, and then solved the resulting system of nonlinear equations and constrained optimization problem using Newton's method and the L-BFGS-B method, respectively. Numerical results have demonstrated the convergence of these methods and their effectiveness for solving Problem 1.1. In addition, the second algorithm can easily be applied to parameterized generalized inverse eigenvalue problems with constrained unknown parameters. Both algorithms use the Jacobi method, which is numerically stable and well suited to implementation on parallel processors.

Acknowledgements

The authors are deeply grateful to the editor and anonymous referees for helpful comments and suggestions which led to a significant improvement of the original manuscript of this paper.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  • Saad Y. Numerical methods for large eigenvalue problems. Vol. 66 of 2nd revised ed. Philadelphia: SIAM; 2011.
  • Güttel S, Tisseur F. The nonlinear eigenvalue problem. Acta Numer. 2017;26:1–94.
  • Chu MT, Golub GH. Inverse eigenvalue problems. Oxford: Oxford University Press; 2005. (Algorithms and Applications).
  • Hajarian M, Abbas H. Least squares solutions of quadratic inverse eigenvalue problem with partially bisymmetric matrices under prescribed submatrix constraints. Comput Math Appl. 2018;76(6):1458–1475.
  • Hajarian M. An efficient algorithm based on Lanczos type of BCR to solve constrained quadratic inverse eigenvalue problems. J Comput Appl Math. 2019;346:418–431.
  • Hajarian M. BCR algorithm for solving quadratic inverse eigenvalue problems with partially bisymmetric matrices. Asian J Control. 2020;22(2):687–695.
  • Cia J, Chen J. Least-squares solutions of generalized inverse eigenvalue problem over Hermitian-Hamiltonian matrices with a submatrix constraint. Comput Appl Math. 2018;37(1):593–603.
  • Liu ZY, Tan YX, Tian ZL. Generalized inverse eigenvalue problem for centrohermitian matrices. J Shanghai Univ Eng Ed. 2004;8(4):448–453.
  • Gu W, Li Z. Generalized inverse eigenvalue problem for generalized snow-like matrices. Fourth International Conference on Computational and Information Sciences; 2012; Chongqing, China. IEEE. p. 662–664.
  • Yuan YX, Dai H. A generalized inverse eigenvalue problem in structural dynamic model updating. J Comput Appl Math. 2009;226(1):42–49.
  • Ghanbari K, Parvizpour F. Generalized inverse eigenvalue problem with mixed eigendata. Linear Algebra Appl. 2012;437(8):2056–2063.
  • Gladwell GM. Inverse problems in vibration. Appl Mech Rev. 1986;39(7):1013–1018.
  • Gigola S, Lebtahi L, Thome N. Inverse eigenvalue problem for normal J-hamiltonian matrices. Appl Math Lett. 2015;48:36–40.
  • Chu MT, Golub GH. Structured inverse eigenvalue problems. Acta Numer. 2002;11:1–71.
  • Joseph KT. Inverse eigenvalue problem in structural design. Numer Linear Algebra Appl. 1992;30(12):2890–2896.
  • Jiang J, Dai H, Yuang Y. A symmetric generalized inverse eigenvalue problem in structural dynamics model updating. Linear Algebra Appl. 2013;439(5):1350–1363.
  • Zhang ZZ, Han XL. Solvability conditions for algebra inverse eigenvalue problem over set of anti-Hermitian generalized anti-Hamiltonian matrices. J Cent South Univ Technol. 2005;12(1):294–297.
  • Byrnes CI, Wang X. The additive inverse eigenvalue problem for Lie perturbations. SIAM J Matrix Anal Appl. 1993;14(1):113–117.
  • Majkut L. Eigenvalue based inverse model of beam for structural modification and diagnostics: examples of using. Lat Am J Solids Struct. 2010;7(4):437–456.
  • Cox SJ, Embree M, Hokanson JM. One can hear the composition of a string: experiments with an inverse eigenvalue problem. SIAM Rev. 2012;54(1):157–178.
  • Li K, Liu J, Han J, et al. Identification of oil-film coefficients for a rotor-journal bearing system based on equivalent load reconstruction. Tribol Int. 2016;104:285–293.
  • Liu J, Meng X, Zhang D, et al. An efficient method to reduce ill-posedness for structural dynamic load identification. Mech Syst Signal Process. 2017;95:273–285.
  • Liu J, Sun X, Han X, et al. Dynamic load identification for stochastic structures based on gegenbauer polynomial approximation and regularization method. Mech Syst Signal Process. 2015;56:3–54.
  • Bonnet M, Constantinescu A. Inverse problems in elasticity. Inverse Probl. 2005;21(2):R1.
  • Barcilon V. On the multiplicity of solutions of the inverse problem for a vibrating beam. SIAM J Appl Math. 1979;37(3):605–613.
  • Hasanov A, Baysal O. Identification of an unknown spatial load distribution in a vibrating cantilevered beam from final overdetermination. J Inverse Ill-posed Probl. 2015;23(1):85–102.
  • Mulaik SA. Fundamentals of common factor analysis. In: The Wiley Handbook of psychometric testing: a multidisciplinary reference on survey, scale and test development. New Jersey; 2018. p. 209–251.
  • Yang Y, Wei G. Inverse scattering problems for Sturm–Liouville operators with spectral parameter dependent on boundary conditions. Math Notes. 2018;103(1–2):59–66.
  • Jensen JS. Phononic band gaps and vibrations in one-and two-dimensional mass–spring structures. J Sound Vib. 2003;266(5):1053–1078.
  • Li LL. Sufficient conditions for the solvability of algebraic inverse eigenvalue problems. Linear Algebra Appl. 1995;221:117–129.
  • Xu SF. On the sufficient conditions for the solvability of algebraic inverse eigenvalue problems. J Comput Math. 1992;10:17–80.
  • Biegler-König FW. Sufficient conditions for the solubility of inverse eigenvalue problems. Linear Algebra Appl. 1981;40:89–100.
  • Alexander J. The additive inverse eigenvalue problem and topological degree. Proc Amer Math Soc. 1978;70:5–7.
  • Friedland S, Nocedal J, Overton ML. The formulation and analysis of numerical methods for inverse eigenvalue problems. SIAM J Numer Anal. 1987;24(3):634–667.
  • Xu S. On the necessary conditions for the solvability of algebraic inverse eigenvalue problems. J Comput Math. 1992;10:93–97.
  • Ji X. On matrix inverse eigenvalue problems. Inverse Probl. 1998;14(2):275–285.
  • Dai H, Bai ZZ, Wei Y. On the solvability condition and numerical algorithm for the parameterized generalized inverse eigenvalue problem. SIAM J Matrix Anal Appl. 2015;36(2):707–726.
  • Rojo O, Soto R. New conditions for the additive inverse eigenvalue problem for matrices. Comput Math Appl. 1992;23:41–46.
  • Li L. Some sufficient conditions for the solvability of inverse eigenvalue problems. Linear Algebra Appl. 1991;148:225–236.
  • Xu SF. A smallest singular value method for solving inverse eigenvalue problems. J Comput Math. 1996;1:23–31.
  • Dai H. An algorithm for symmetric generalized inverse eigenvalue problems. Linear Algebra Appl. 1999;296:79–98.
  • Dai H, Lancaster P. Newton's method for a generalized inverse eigenvalue problem. Numer Linear Algebra Appl. 1997;4:1–21.
  • Shu L, Wang B, Hu JZ. Homotopy solution of the inverse generalized eigenvalue problems in structural dynamics. Appl Math Mech. 2004;25(5):580–586.
  • Lancaster P. Algorithms for lambda-matrices. Numer Math. 1964;6(1):388–394.
  • Biegler-König FW. A newton iteration process for inverse eigenvalue problems. Numer Math. 1981;37(3):349–354.
  • Golub GH, VanLoan CF. Matrix computations. Vol. 3. Baltimore: JHU Press; 2012.
  • Yamamoto Y, Lan Z, Kudo S. Convergence analysis of the parallel classical block Jacobi method for the symmetric eigenvalue problem. JSIAM Lett. 2014;6:57–60.
  • Johnson C, Horn R. Matrix analysis. Cambridge: Cambridge university press; 1985.
  • Xu D, Liu Z, Xu Y, et al. A sorted jacobi algorithm and its parallel implementation. Trans Beijing Inst Technol. 2010;12:1470–1474.
  • Ji-guang S. Sensitivity analysis of multiple eigenvalues (i). Int J Comput Math. 1988;1:28–38.
  • Byrd RH, Lu P, Nocedal J, et al. A limited memory algorithm for bound constrained optimization. SIAM J Sci Comput. 1995;16(5):1190–1208.
  • Keskar N, Wächter A. A limited-memory quasi-newton algorithm for bound-constrained non-smooth optimization. Optim Methods Softw. 2019;34(1):150–171.
  • Haarala N, Miettinen K, Mäkelä MM. Globally convergent limited memory bundle method for large-scale nonsmooth optimization. Math Program. 2007;109(1):181–205.
  • Mokhtari A, Ribeiro A. Global convergence of online limited memory BFGS. J Mach Learn Res. 2015;16(1):3151–3181.
  • Friedland S. Inverse eigenvalue problems. Linear Algebra Appl. 1977;17(1):15–51.
  • Kennedy J, Eberhart R. Particle swarm optimization. Proceedings of the IEEE international conference on neural networks. Vol. 4; 1995. IEEE. p. 1942–1948.
  • Shi Y, Eberhart R. A modified particle swarm optimizer. 1998 IEEE international conference on evolutionary computation proceedings. IEEE World Congress on Computational Intelligence (Cat. No. 98TH8360); 1998. p. 69–73.
  • AlRashidi MR, El-Hawary ME. A survey of particle swarm optimization applications in electric power systems. IEEE Trans Evol Comput. 2008;13(4):913–918.
  • Robinson J, Rahmat-Samii Y. Particle swarm optimization in electromagnetics. IEEE Trans Antennas Propag. 2004;52(2):397–407.
  • Abousleiman R, Rawashdeh O. Electric vehicle modelling and energy-efficient routing using particle swarm optimisation. IET Intell Transp Syst. 2016;10(2):65–72.
  • Gudise VG, Venayagamoorthy GK. Comparison of particle swarm optimization and backpropagation as training algorithms for neural networks. Proceedings of the 2003 IEEE Swarm Intelligence Symposium. SIS'03 (Cat. No. 03EX706); 2003; Indianapolis, USA. p. 110–117.
