
A vector regularization method to solve linear inverse problems

Pages 765-786 | Received 27 Dec 2012, Accepted 14 Jun 2013, Published online: 01 Aug 2013

Abstract

The linear inverse problem is discretized into an $n$-dimensional ill-posed system of linear equations $Bx = b$. In the present paper, an invariant manifold defined in terms of the square norm of the residual vector $r := Bx - b$ is used to derive an iterative algorithm with a fast descent direction $Ar$, which is close to, but not exactly equal to, the best descent direction $B^{-1}r$. The matrix $A$ is obtained by using a vector regularization method together with a matrix conjugate gradient method to find the right inversion of $B$: $BA = I_n$. The vector regularization iterative algorithm is proven to be Lyapunov stable, and the direct inversion method, with solution expressed by $x = Ab$, converges fast. The accuracy and efficiency of both are verified through numerical tests of linear inverse problems under large random noise.

Introduction

It is known that an iterative method for solving a system of algebraic equations can be derived from the discretization of a certain ordinary differential equations (ODEs) system.[Citation1, Citation2] In particular, the descent methods can be interpreted as discretizations of gradient flows.[Citation3] Indeed, continuous algorithms have a long history of investigation in the literature; see, for example, Gavurin [Citation4], Alber [Citation5], and Hirsch and Smale [Citation6]. Chu [Citation7] developed a systematic approach to the continuous realization of several iterative algorithms in numerical linear algebra. Lyapunov methods have been applied to the analysis of iterative methods by Ortega and Rheinboldt [Citation8], and by Bhaya and Kaszkurewicz [Citation1, Citation9, Citation10].

The author and his co-workers have developed several methods to solve ill-posed systems of linear equations: the fictitious time integration method as a filter,[Citation11] a modified polynomial expansion method,[Citation12] the Laplacian preconditioners and postconditioners,[Citation13] a vector regularization method,[Citation14] a relaxed steepest descent method,[Citation15, Citation16] an optimal iterative algorithm with an optimal descent vector,[Citation17] an adaptive Tikhonov regularization method,[Citation18] the best vector iterative method,[Citation19] the globally optimal vector iterative method,[Citation20] the optimally scaled vector regularization method,[Citation21] an optimally generalized Tikhonov regularization method,[Citation22] as well as an optimal tri-vector iterative algorithm.[Citation23] As a continuation of these works, in this paper we propose a simpler and more robust method to solve (1) $Bx = b$, where $x \in \mathbb{R}^n$ is an unknown vector, to be determined from a given coefficient matrix $B \in \mathbb{R}^{n \times n}$, which might be non-symmetric, and an input $b \in \mathbb{R}^n$, which might be polluted by random noise. Many linear-type inverse problems can be discretized into the above form.

A measure of the ill-posedness of Equation (1) is the condition number of $B$: (2) $\mathrm{Cond}(B) = \|B\|_F \|B^{-1}\|_F$, where $\|B\|_F$ is the Frobenius norm of $B$.

For every matrix norm $\|\cdot\|$ we have $\rho(B) \le \|B\|$, where $\rho(B)$ is the spectral radius of $B$. The Householder theorem states that for every $\epsilon > 0$ and every matrix $B$ there exists a matrix norm $\|B\|$, depending on $B$ and $\epsilon$, such that $\|B\| \le \rho(B) + \epsilon$. Hence, the spectral condition number $\rho(B)\rho(B^{-1})$ can be used as an estimate of the condition number of $B$: (3) $\mathrm{Cond}(B) = \dfrac{\max_{\sigma(B)} |\lambda|}{\min_{\sigma(B)} |\lambda|}$, where $\sigma(B)$ is the collection of all the eigenvalues of $B$. If the Frobenius norm in Equation (2) is replaced by the operator norm, then Equations (2) and (3) lead to the same condition number. Turning back to the Frobenius norm, we have (4) $\|B\|_F \le \sqrt{n}\, \max_{\sigma(B)} |\lambda|$. In particular, for the symmetric case $\rho(B)\rho(B^{-1}) = \|B\|_2 \|B^{-1}\|_2$. Roughly speaking, the numerical solution of Equation (1) may lose the accuracy of $k$ decimal points when $\mathrm{Cond}(B) = 10^k$.

Instead of Equation (1), we can solve a normal linear system: (5) $Cx = b_1$, where (6) $b_1 = B^{\mathrm T} b$, $C = B^{\mathrm T} B > 0$. We consider an iterative method for solving Equation (5) and define, for any vector $x_k$, the steepest descent vector (7) $R_k := C x_k - b_1$. Ascher et al. [Citation24], and also Liu and Chang [Citation25], have viewed the gradient descent method (8) $x_{k+1} = x_k - \alpha_k R_k$ as a forward Euler scheme for the following system of ODEs: (9) $\dot{x} = b_1 - C x$. The absolute stability bound (10) $\alpha_k \le \dfrac{2}{\max_{\sigma(C)} \lambda}$ must be obeyed if a uniform stepsize is employed.
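The scheme (8)-(10) can be made concrete before turning to conjugate gradients. The following minimal NumPy sketch (the function name, the test-free structure and the particular stepsize choice are mine, not prescribed by the paper) integrates the gradient flow (9) by forward Euler with a uniform stepsize safely inside the bound (10):

```python
import numpy as np

def gradient_descent(B, b, iters=1000):
    """Solve B x = b via the normal equations (5)-(6), discretizing the
    gradient flow (9) with a uniform stepsize obeying the bound (10)."""
    C = B.T @ B                                  # C = B^T B > 0, Equation (6)
    b1 = B.T @ b                                 # b1 = B^T b
    alpha = 1.0 / np.linalg.eigvalsh(C).max()    # well inside alpha < 2 / max eigenvalue
    x = np.zeros_like(b1)
    for _ in range(iters):
        R = C @ x - b1                           # steepest descent vector, Equation (7)
        x = x - alpha * R                        # forward Euler step, Equation (8)
    return x
```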

Specifically, Equation (8) presents a steepest descent method (SDM) if (11) $\alpha_k = \dfrac{\|R_k\|^2}{R_k^{\mathrm T} C R_k}$. When $R_k$ is small, the calculated $R_k$ may deviate from the real steepest descent direction to a great extent due to machine round-off error, which usually leads to numerical instability of the SDM. An improvement of the SDM is the conjugate gradient method (CGM), which enhances the search direction for the minimum of a quadratic functional by imposing an orthogonality on the residual vector at each iterative step. The algorithm of the CGM for solving Equation (5) is summarized as follows.

1. Give an initial $x_0$, compute $R_0 = C x_0 - b_1$ and then set $p_0 = R_0$.

2. For $k = 0, 1, 2, \ldots$, repeat the following iterations: (12) $\eta_k = \dfrac{\|R_k\|^2}{p_k^{\mathrm T} C p_k}$, $x_{k+1} = x_k - \eta_k p_k$, $R_{k+1} = C x_{k+1} - b_1$, $\alpha_{k+1} = \dfrac{\|R_{k+1}\|^2}{\|R_k\|^2}$, $p_{k+1} = \alpha_{k+1} p_k + R_{k+1}$.

If $x_{k+1}$ converges according to a given stopping criterion $\|R_{k+1}\| < \varepsilon$, then stop; otherwise, go to step 2.
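For concreteness, the steps above translate directly into NumPy; this is a minimal sketch of the CGM (12), under the assumption that $C = B^{\mathrm T} B$ is symmetric positive-definite as in Equation (6) (the function name and defaults are mine):

```python
import numpy as np

def cgm(C, b1, eps=1e-10, kmax=1000):
    """Conjugate gradient method for C x = b1 (Equation (5)), following (12)."""
    x = np.zeros_like(b1)
    R = C @ x - b1                           # R0 = C x0 - b1
    p = R.copy()                             # p0 = R0
    for _ in range(kmax):
        eta = (R @ R) / (p @ (C @ p))        # eta_k = ||R_k||^2 / (p_k^T C p_k)
        x = x - eta * p                      # x_{k+1} = x_k - eta_k p_k
        R_new = C @ x - b1                   # R_{k+1} = C x_{k+1} - b1
        if np.linalg.norm(R_new) < eps:      # stopping criterion
            break
        alpha = (R_new @ R_new) / (R @ R)    # alpha_{k+1} = ||R_{k+1}||^2 / ||R_k||^2
        p = alpha * p + R_new                # p_{k+1} = alpha_{k+1} p_k + R_{k+1}
        R = R_new
    return x
```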

Even though the CGM works very well for most linear systems, it loses some of its luster for certain linear inverse problems. The gradient descent methods admit a fictitious time integration with different stepsizes, which is particularly useful for image deblurring. The concept of an optimal descent vector was first developed by Liu and Atluri [Citation26] for solving non-linear algebraic equations. In this paper, we explore the concept of the best descent vector [Citation19] and use it as guidance to develop a new method to solve linear algebraic equations.

The remaining parts of this paper are arranged as follows. The vector regularization method for inverting a non-singular matrix is introduced in Section 2. In Section 3, we start from an invariant manifold to derive a non-linear ODEs system for the numerical solution of Equation (1), with the descent direction $u = Ar$, where $A$ is solved by the vector regularization method as a right inversion of the coefficient matrix $B$ given in Equation (1). Then a Lyapunov-stable dynamics on the invariant manifold is constructed in Section 4, resulting in a vector regularization iterative algorithm (VRIA). Linear inverse problems are solved in Section 5 to display some advantages of the VRIA and of the direct inversion method (DIM). Finally, conclusions are drawn in Section 6.

A vector regularization method

The vector regularization method for inverting non-singular matrices was first developed by Liu et al. [Citation14]. Recently, Liu [Citation21] has developed a systematic method to determine the vector regularization parameter.

Let us begin with the following matrix equation: (13) $V^{\mathrm T} U^{\mathrm T} = I_m$, i.e. $(UV)^{\mathrm T} = I_m$, which holds if $U$ is the inversion of a given non-singular $m \times m$ matrix $V$, where $I_m$ is the $m \times m$ identity matrix. Numerically, we can say that the above $U$ is a left inversion of $V$, owing to $UV = I_m$. Then, multiplying the above equation by $V$ from the left side, we have (14) $D U^{\mathrm T} = V$, $D := V V^{\mathrm T}$, from which we can solve for $C := U^{\mathrm T}$ by the following matrix conjugate gradient method (MCGM):

1. Assume an initial $C_0$.

2. Calculate $R_0 = V - D C_0$ and set $P_1 = R_0$.

3. For $k = 1, 2, \ldots$, repeat the following iterations: (15) $\alpha_k = \dfrac{\|R_{k-1}\|^2}{P_k \cdot (D P_k)}$, $C_k = C_{k-1} + \alpha_k P_k$, $R_k = V - D C_k$, $\eta_k = \dfrac{\|R_k\|^2}{\|R_{k-1}\|^2}$, $P_{k+1} = R_k + \eta_k P_k$. If $C_k$ converges according to a given stopping criterion (16) $\|R_k\| < \varepsilon_2$, then stop; otherwise, go to step 3. In the above, the capital boldfaced letters denote $m \times m$ matrices, the norm $\|R_k\|$ is the Frobenius norm (analogous to the Euclidean norm for a vector), and the inner product is the matrix inner product. When $C$ is calculated, the left inversion of $V$ is given by $U = C^{\mathrm T}$.
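A minimal NumPy sketch of the MCGM follows, reading the iterations (15) and the stopping test (16) literally, with the matrix inner product realized as the componentwise (Frobenius) sum; the function name and iteration cap are my own choices:

```python
import numpy as np

def mcgm(V, eps2=1e-6, kmax=5000):
    """Matrix conjugate gradient method (15)-(16): solve D C = V with
    D = V V^T for C = U^T, so that U = C^T is a left inversion of V."""
    m = V.shape[0]
    D = V @ V.T                                   # D := V V^T, Equation (14)
    C = np.zeros((m, m))                          # initial C0
    R = V - D @ C                                 # R0 = V - D C0
    P = R.copy()                                  # P1 = R0
    for _ in range(kmax):
        a = np.sum(R * R) / np.sum(P * (D @ P))   # alpha_k, Frobenius inner products
        C = C + a * P                             # C_k = C_{k-1} + alpha_k P_k
        R_new = V - D @ C                         # R_k = V - D C_k
        if np.linalg.norm(R_new, 'fro') < eps2:   # stopping criterion (16)
            break
        eta = np.sum(R_new * R_new) / np.sum(R * R)   # eta_k
        P = R_new + eta * P                           # P_{k+1} = R_k + eta_k P_k
        R = R_new
    return C.T                                    # left inversion U = C^T
```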

The above MCGM is suitable for finding the inversion of a well-conditioned matrix; however, for a given ill-conditioned matrix with a large condition number, more care is needed to find its inversion. Let (17) $V x_0 = y_0$, through which, given $x_0$, say $x_0 = \mathbf{1} = [1, \ldots, 1]^{\mathrm T}$, we can readily calculate $y_0$, because $V$ is a given matrix. Hence, multiplying the above equation by $U$ from the left side and using Equation (13), we have (18) $y_0^{\mathrm T} U^{\mathrm T} = x_0^{\mathrm T}$, i.e. $x_0 = U y_0$. Together, Equations (13) and (18) constitute an over-determined system for $U^{\mathrm T}$. This over-determined system can be written as (19) $W U^{\mathrm T} = \begin{bmatrix} I_m \\ x_0^{\mathrm T} \end{bmatrix}$, where (20) $W := \begin{bmatrix} V^{\mathrm T} \\ y_0^{\mathrm T} \end{bmatrix}$ is an $n \times m$ matrix with $n = m + 1$. Multiplying Equation (19) by $W^{\mathrm T}$, we obtain an $m \times m$ matrix equation again: (21) $[V V^{\mathrm T} + y_0 y_0^{\mathrm T}]\, U^{\mathrm T} = V + y_0 x_0^{\mathrm T}$, which, like Equation (14), is solved by the MCGM with $D = V V^{\mathrm T} + y_0 y_0^{\mathrm T}$ and with $V$ replaced by $V + y_0 x_0^{\mathrm T}$. This algorithm for solving the left inversion $U$ of an ill-conditioned matrix $V$ is labelled the MCGML method.

The above algorithm is suitable for finding the left inversion of $V$, i.e. $UV = I_m$; however, we need to solve (22) $VU = I_m$ when we want $U$ to be a right inversion of $V$. Mathematically, the left inversion equals the right inversion; numerically, however, they are hardly equal, especially for ill-conditioned matrices.

For the right inversion, we can supplement, as in Equation (17), another equation: (23) $y_0^{\mathrm T} U = x_0^{\mathrm T}$, i.e. $y_0 = V^{\mathrm T} x_0$. Then, the combination of Equations (22) and (23) leads to the following over-determined system: (24) $\begin{bmatrix} V \\ y_0^{\mathrm T} \end{bmatrix} U = \begin{bmatrix} I_m \\ x_0^{\mathrm T} \end{bmatrix}$. Multiplying both sides by the transpose of the leading matrix, we obtain an $m \times m$ matrix equation: (25) $[V^{\mathrm T} V + y_0 y_0^{\mathrm T}]\, U = V^{\mathrm T} + y_0 x_0^{\mathrm T}$, which is then solved by the following MCGMR:

1. Let $D = V^{\mathrm T} V + y_0 y_0^{\mathrm T}$ and $V_1 = V^{\mathrm T} + y_0 x_0^{\mathrm T}$.

2. Assume an initial value $U_0$.

3. Calculate $R_0 = V_1 - D U_0$ and set $P_0 = R_0$.

4. For $k = 1, 2, \ldots$, repeat the following iterations: (26) $\alpha_k = \dfrac{\|R_{k-1}\|^2}{P_k \cdot (D P_k)}$, $U_k = U_{k-1} + \alpha_k P_k$, $R_k = V_1 - D U_k$, $\eta_k = \dfrac{\|R_k\|^2}{\|R_{k-1}\|^2}$, $P_{k+1} = R_k + \eta_k P_k$. If $U_k$ converges according to a given stopping criterion (27) $\|R_k\| < \varepsilon_2$, then stop; otherwise, go to step 4.
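The MCGMR admits the same transcription. The sketch below assembles the regularized system (25) with the suggested choice $x_0 = [1, \ldots, 1]^{\mathrm T}$ and $y_0 = V^{\mathrm T} x_0$ from Equation (23); again, the names and defaults are mine:

```python
import numpy as np

def mcgmr(V, x0=None, eps2=1e-6, kmax=5000):
    """MCGMR (26)-(27): solve the regularized equation (25),
    [V^T V + y0 y0^T] U = V^T + y0 x0^T, for a right inversion U of V."""
    m = V.shape[0]
    if x0 is None:
        x0 = np.ones(m)                       # suggested choice x0 = [1,...,1]^T
    y0 = V.T @ x0                             # y0 = V^T x0, Equation (23)
    D = V.T @ V + np.outer(y0, y0)            # regularized coefficient matrix
    V1 = V.T + np.outer(y0, x0)               # regularized right-hand side
    U = np.zeros((m, m))                      # initial U0
    R = V1 - D @ U                            # R0 = V1 - D U0
    P = R.copy()                              # P0 = R0
    for _ in range(kmax):
        a = np.sum(R * R) / np.sum(P * (D @ P))
        U = U + a * P
        R_new = V1 - D @ U
        if np.linalg.norm(R_new, 'fro') < eps2:   # stopping criterion (27)
            break
        eta = np.sum(R_new * R_new) / np.sum(R * R)
        P = R_new + eta * P
        R = R_new
    return U                                  # right inversion: V U close to I_m
```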

Equations (21) and (25) are two regularized and non-perturbed algebraic equations for finding, respectively, the left and right inversions of a given ill-conditioned matrix $V$. Owing to the appearance of $y_0$, we have a chance to reduce the condition numbers of the coefficient matrices of these two equations; we refer to the paper [Citation21] for a detailed description of how to find the best value of $y_0$.

Invariant manifold method

The simplest way to solve the linear equations system (1) is (28) $x = B^{-1} b$, where we can apply the MCGML to find $B^{-1}$ from the given matrix $B$. This inversion method is usually quite dangerous and unstable when there is noise on $b$. The noise error is amplified by $B^{-1}$, causing a large error in $x$, when the singular values of $B$ cluster near zero. In Section 5.1, we give a numerical example revealing this phenomenon.

Instead of Equation (28), we seek an iterative method based on the idea of the MCGMR. For the linear equations system (1), which is expressed as $r = 0$ in terms of the residual vector (29) $r = Bx - b$, we can introduce a scalar homotopy function: (30) $h(x, t) = \dfrac{Q(t)}{2}\|r(x)\|^2 - \dfrac{1}{2}\|r(x_0)\|^2 = 0$. We expect $h(x, t) = 0$ to be an invariant manifold in the space-time domain $(x, t)$ for a dynamical system $h(x(t), t) = 0$ to be specified further. When $Q > 0$, the manifold defined by Equation (30) is continuous and differentiable, and thus the following differential operation makes sense: (31) $\dfrac{1}{2}\dot{Q}(t)\|r(x)\|^2 + Q(t)\, r \cdot (B\dot{x}) = 0$, which is obtained by taking the time derivative of Equation (30) with respect to $t$, considering $x = x(t)$ and using Equation (29).

We suppose that the evolution of $x$ is governed by a vector $u$: (32) $\dot{x} = \lambda u$, where (33) $u = A r$. We hope that $Ar$ is near $B^{-1} r$ for fast convergence (see Section 4.2), i.e. (34) $Ar \approx B^{-1} r$. Then, we can apply the MCGMR to find the right inversion $A$ of $B$ from (35) $BA = I_n$. Inserting Equation (32) into Equation (31), we can derive (36) $\dot{x} = -q(t)\, \dfrac{\|r\|^2}{r^{\mathrm T} v}\, u$, where (37) $v := B u = B A r$, $q(t) := \dfrac{\dot{Q}(t)}{2 Q(t)}$. Hence, if $Q(t)$ can be guaranteed to be a monotonically increasing function of $t$, our algorithm has an absolutely convergent property in solving the linear equations system (1), decreasing $\|r\|^2$ according to (38) $\|r(x)\|^2 = \dfrac{C}{Q(t)}$, where (39) $C = \|r(x_0)\|^2$ is determined by the initial value $x_0$.
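For completeness, the step from Equations (31) and (32) to Equation (36) can be spelled out: inserting $\dot{x} = \lambda u$ into (31) and solving for $\lambda$ gives
$$\frac{1}{2}\dot{Q}\|r\|^2 + Q\,\lambda\, r^{\mathrm T} B u = 0 \quad\Longrightarrow\quad \lambda = -\frac{\dot{Q}}{2Q}\,\frac{\|r\|^2}{r^{\mathrm T} v} = -q(t)\,\frac{\|r\|^2}{r^{\mathrm T} v},$$
so that $\dot{x} = \lambda u$ is exactly Equation (36).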

Dynamics on the invariant manifold

In order to keep $x$ on the manifold (38), we consider the evolution of $r$ along the path $x(t)$: (40) $\dot{r} = B\dot{x} = -q(t)\, \dfrac{\|r\|^2}{r^{\mathrm T} v}\, v$. Because $q(t) > 0$, we can introduce a new independent variable $\tau = \int_0^t q(\xi)\, d\xi$, such that the above equation can be written as (41) $\dfrac{dr}{d\tau} = -\dfrac{\|r\|^2}{r^{\mathrm T} B A r}\, B A r$, where we have inserted Equation (37) for $v$. This is an autonomous non-linear dynamical system for $r$.

First, we note that $r = 0$ is an equilibrium point of Equation (41). It is known that Lyapunov's second, or direct, method provides a strong stability condition for the equilibrium point of a nonlinear dynamical system.[Citation27] To state the Lyapunov theorem, we need to define a positive-definite function. A real scalar function $V(r)$ is positive-definite in some closed bounded region $D$ of the state space if, for all $r$ in $D$,

1. $V(r)$ is continuously differentiable with respect to $r$,

2. $V(0) = 0$, and

3. $V(r) > 0$ for all $r \neq 0$.

Then $V(r)$ is a Lyapunov function in a neighbourhood $D$ of the origin if

1. $V(r)$ is positive-definite, and

2. $dV(r)/d\tau$ is negative-semidefinite for all $r \in D$.

The Lyapunov theorem of stability says that the origin is stable if there exists a Lyapunov function $V(r)$ throughout $D$, and that the origin is asymptotically stable if there exists a Lyapunov function $V(r)$ throughout $D$ such that $dV(r)/d\tau$ is negative-definite for all $r \in D \setminus \{0\}$.

By using the Lyapunov function $V = \|r\|^2/2$ and Equation (40), it is easy to prove that (42) $V(0) = 0$; $V(r) = \dfrac{\|r\|^2}{2} > 0$ for all $r \neq 0$; $\dot{V} = r \cdot \dot{r} = -q(t)\|r\|^2 < 0$ for all $r \neq 0$. In Section 4.2, we will prove that $\eta = q\Delta t > 0$. Then, by the Lyapunov stability theory, the ODEs in Equation (36) are asymptotically stable at $r = 0$, and the iterative algorithm derived from them with a suitable selection of $q > 0$ is stable.

Discretizing, yet keeping x on the manifold

Now we discretize the foregoing continuous-time dynamics (36) into a discrete-time dynamics by applying the forward Euler scheme: (43) $x(t + \Delta t) = x(t) - \eta\, \dfrac{\|r\|^2}{r^{\mathrm T} v}\, u$, where (44) $\eta = q(t)\, \Delta t$ is a steplength.

Similarly, we use the forward Euler scheme to integrate Equation (40): (45) $r(t + \Delta t) = r(t) - \eta\, \dfrac{\|r\|^2}{r^{\mathrm T} v}\, v$. Taking the square norms of both sides and using Equation (38), we obtain (46) $\dfrac{C}{Q(t + \Delta t)} = \dfrac{C}{Q(t)} - 2\eta \dfrac{C}{Q(t)} + \eta^2 \dfrac{C}{Q(t)}\, \dfrac{\|r\|^2 \|v\|^2}{(r^{\mathrm T} v)^2}$. Thus the following scalar equation is derived: (47) $a_0 \eta^2 - 2\eta + 1 - \dfrac{Q(t)}{Q(t + \Delta t)} = 0$, where (48) $a_0 := \dfrac{\|r\|^2 \|v\|^2}{(r^{\mathrm T} v)^2} \ge 1$ by the Cauchy–Schwarz inequality $r^{\mathrm T} v \le \|r\| \|v\|$. As a result, $h(x, t) = 0$ remains an invariant manifold in the space-time domain $(x, t)$ for the discrete-time dynamical system $h(x(t), t) = 0$, $t \in \{0, 1, 2, \ldots\}$, which will be explored further in the next section.

An iterative dynamics

Let (49) $s := \dfrac{Q(t)}{Q(t + \Delta t)} = \dfrac{\|r(x(t + \Delta t))\|^2}{\|r(x(t))\|^2}$, which is an important quantity for assessing the convergence of our numerical algorithm for solving the linear equations system (1).

From Equations (47) and (49), it follows that (50) $a_0 \eta^2 - 2\eta + 1 - s = 0$, of which we take the preferred solution (51) $\eta = \dfrac{1 - \sqrt{1 - (1 - s) a_0}}{a_0}$, if $1 - (1 - s) a_0 \ge 0$. Let (52) $1 - (1 - s) a_0 = \gamma^2 \ge 0$, i.e. $s = 1 - \dfrac{1 - \gamma^2}{a_0}$; then the condition $1 - (1 - s) a_0 \ge 0$ in Equation (51) is automatically satisfied, and from Equations (51) and (52) it follows that (53) $\eta = \dfrac{1 - \gamma}{a_0} > 0$, where (54) $0 \le \gamma < 1$ is a parameter. Finally, from Equations (43), (48) and (53), we can obtain the following algorithm: (55) $x(t + \Delta t) = x(t) - (1 - \gamma)\, \dfrac{r^{\mathrm T} v}{\|v\|^2}\, u$. Under conditions (48) and (54), from Equations (49) and (52) we can prove that the new algorithm satisfies (56) $\dfrac{\|r(t + \Delta t)\|}{\|r(t)\|} = \sqrt{s} < 1$, which means that the residual error is absolutely decreased. In other words, the convergence rate of the present iterative algorithm is greater than one: (57) $\text{Convergence Rate} := \dfrac{\|r(t)\|}{\|r(t + \Delta t)\|} = \dfrac{1}{\sqrt{s}} > 1$. The property in Equation (57) is very important, since it guarantees that the new algorithm converges absolutely to the true solution.
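The algebra behind (55) is a one-line substitution: with $\eta = (1-\gamma)/a_0$ from (53) and $a_0 = \|r\|^2\|v\|^2/(r^{\mathrm T} v)^2$ from (48), the coefficient of $u$ in (43) becomes
$$\eta\,\frac{\|r\|^2}{r^{\mathrm T} v} = \frac{1-\gamma}{a_0}\,\frac{\|r\|^2}{r^{\mathrm T} v} = (1-\gamma)\,\frac{(r^{\mathrm T} v)^2}{\|r\|^2\|v\|^2}\,\frac{\|r\|^2}{r^{\mathrm T} v} = (1-\gamma)\,\frac{r^{\mathrm T} v}{\|v\|^2}.$$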

From Equations (42), (44) and (53), we can derive (58) $\Delta V = -\dfrac{1 - \gamma}{a_0}\, \|r\|^2$. By the Lyapunov stability, the best choice of $a_0$ is the one that makes $\Delta V$ as negative as possible, which can be achieved by making $a_0 \ge 1$ as small as possible. In principle, if we could use $u = B^{-1} r$ as the search direction, then by Equations (37) and (48) we would have $a_0 = 1$, and $\Delta V = -(1 - \gamma)\|r\|^2$ would be the minimum of $\Delta V$. This is, however, impossible because the exact inverse $B^{-1}$ is hard to find; instead, we use $u = A r$ as the search direction and solve for an $A$ near $B^{-1}$ by using the MCGMR of Section 2. In doing so, we have a fast-converging iterative algorithm to solve the ill-posed linear problem (1) on one hand; on the other hand, because $A$ is not exactly equal to $B^{-1}$, we can prevent the ill-posedness from magnifying the noise error through $Ar$. Indeed, the present iterative algorithm is a trade-off between accuracy and regularization. We point out that for an ill-posed linear system (1), the solution given by $B^{-1} b$ is unstable. Our algorithms find $A$, not $B^{-1}$, and finding $A$ is less ill-conditioned than finding $B^{-1}$. The descent direction detected by $Ar$ is not exactly equal to the best descent direction $B^{-1} r$; it is an approximation. Because we have taken the direction $B^{-1} r$ into account, the iterative algorithm based on $Ar$ can converge quite fast.

Vector regularization iterative algorithm

Since the fictitious time variable is now discrete, $t \in \{0, 1, 2, \ldots\}$, we let $x_k$ denote the numerical value of $x$ at the $k$-th step. Thus, we arrive at a purely iterative algorithm from Equation (55): (59) $x_{k+1} = x_k - (1 - \gamma)\, \dfrac{r_k^{\mathrm T} v_k}{\|v_k\|^2}\, u_k$. Then, the following VRIA is available:

  1. For a given B, find A by using the MCGMR.

2. Select $\gamma$ and give an initial value $x_0$, or take $x_0 = A b$.

3. For $k = 0, 1, 2, \ldots$, repeat the following iterations: (60) $r_k = B x_k - b$, $u_k = A r_k$, $v_k = B A r_k$, $x_{k+1} = x_k - (1 - \gamma)\, \dfrac{r_k \cdot v_k}{\|v_k\|^2}\, u_k$. If $x_{k+1}$ converges according to a given stopping criterion $\|r_{k+1}\| < \varepsilon_1$, then stop; otherwise, go to step 3. Here, $\gamma$ is a relaxation parameter chosen by the user.
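A compact NumPy sketch of steps 1-3, assuming the right inversion $A$ has already been computed, e.g. by the MCGMR sketch of Section 2 (the function name and defaults are mine):

```python
import numpy as np

def vria(B, b, A, gamma=0.05, eps1=1e-4, kmax=100):
    """Vector regularization iterative algorithm (60), with A a right
    inversion of B obtained beforehand (e.g. by MCGMR)."""
    x = A @ b                            # suggested initial guess x0 = A b
    for _ in range(kmax):
        r = B @ x - b                    # residual r_k = B x_k - b
        if np.linalg.norm(r) < eps1:     # stopping criterion
            break
        u = A @ r                        # descent direction u_k = A r_k
        v = B @ u                        # v_k = B A r_k
        x = x - (1.0 - gamma) * (r @ v) / (v @ v) * u   # update (60)
    return x
```

A typical call mirroring the settings reported in the examples below would be `x = vria(B, b, A, gamma=0.05)`.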

Linear inverse problems

In this section, we apply the method of fundamental solutions (MFS) to discretize some inverse problems into sets of linear algebraic equations. In recent years, a few different meshless boundary collocation methods, such as the singular boundary method, have been proposed and developed; they are different from, but highly related to, the MFS examined in this paper.[Citation28, Citation29] The regularization method proposed in this paper can also be an efficient alternative for these methods in solving ill-posed linear inverse problems.[Citation30, Citation31]

Example 1

When the backward heat conduction problem (BHCP) is considered in a spatial interval $0 < x < \ell$, subject to the boundary conditions at the two ends of a slab: (61) $u_t(x, t) = \alpha u_{xx}(x, t)$, $0 < t < T$, $0 < x < \ell$; $u(0, t) = u_0(t)$, $u(\ell, t) = u_\ell(t)$, we solve $u$ under a final time condition: (62) $u(x, T) = u_T(x)$. The fundamental solution of Equation (61) is given as follows: (63) $K(x, t) = \dfrac{H(t)}{2\sqrt{\alpha \pi t}} \exp\!\left(\dfrac{-x^2}{4\alpha t}\right)$, where $H(t)$ is the Heaviside function.

The method of fundamental solutions (MFS) has broad application in engineering computations. However, the MFS has a serious drawback in that the resulting system of linear equations is always highly ill-conditioned. In the MFS, the solution $u$ at the field point $z = (x, t)$ is expressed as a linear combination of the fundamental solutions $U(z, s_j)$: (64) $u(z) = \sum_{j=1}^{N} c_j U(z, s_j)$, $s_j = (\eta_j, \tau_j) \in \Omega^c$, where $N$ is the number of source points, $c_j$ are unknown coefficients and the $s_j$ are source points located in the complement $\Omega^c$ of $\Omega = [0, \ell] \times [0, T]$. For the heat conduction equation, we have the basis functions (65) $U(z, s_j) = K(x - \eta_j, t - \tau_j)$. It is known that the location of the source points in the MFS has a great influence on accuracy and stability. In a practical application of the MFS to solve the BHCP, the source points are uniformly located on two straight lines parallel to the $t$-axis and not over $t = T$; this placement was adopted by Hon and Li [Citation32] and Liu [Citation33], and shows a large improvement over locating the source points on a line below the initial time. After imposing the boundary conditions and the final time condition on Equation (64), we obtain a linear equations system: (66) $Bx = b$, where (67) $B_{ij} = U(z_i, s_j)$, $x = (c_1, \ldots, c_N)^{\mathrm T}$, $b = (u_\ell(t_i),\, i = 1, \ldots, m_1;\ u_T(x_j),\, j = 1, \ldots, m_2;\ u_0(t_k),\, k = m_1, \ldots, 1)^{\mathrm T}$. The number $n = 2 m_1 + m_2$ of collocation points does not necessarily equal the number $N$ of source points.
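To make the discretization concrete, the following sketch assembles the collocation matrix of Equation (67) from the fundamental solution (63). The vectorized layout and the small-time guard are my own choices, and the placement of the collocation and source points is left to the caller, following the description above only schematically:

```python
import numpy as np

def K(x, t, alpha=1.0):
    """Fundamental solution (63); the Heaviside factor makes it vanish for t <= 0."""
    tt = np.maximum(t, 1e-30)                    # guard against division by zero
    val = np.exp(-x**2 / (4.0 * alpha * tt)) / (2.0 * np.sqrt(alpha * np.pi * tt))
    return np.where(t > 0.0, val, 0.0)

def assemble_bhcp(z, s, alpha=1.0):
    """Collocation matrix (67), B_ij = U(z_i, s_j) = K(x_i - eta_j, t_i - tau_j),
    for collocation points z (n x 2 array of (x, t)) and source points s (N x 2)."""
    dx = z[:, 0][:, None] - s[:, 0][None, :]     # x_i - eta_j
    dt = z[:, 1][:, None] - s[:, 1][None, :]     # t_i - tau_j
    return K(dx, dt, alpha)
```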

Since the BHCP is highly ill-posed,[Citation34] the ill-conditioning of the coefficient matrix $B$ in Equation (66) is serious. To overcome the ill-posedness of Equation (66), we employ both the DIM and the VRIA to solve this problem. Here, we compare the numerical solution with the exact solution $u(x, t) = \cos(\pi x) \exp(-\pi^2 t)$. For $T = 1$, the final data are of the order $10^{-4}$, which is small in comparison with the initial temperature $u_0(x) = \cos(\pi x)$ to be retrieved, which is $O(1)$.

In order to test the stability of the proposed algorithm, we add random noise to the final time data: (68) $b_i = u(x_i, T) + \sigma R(i)$, where $\sigma$ denotes the level of noise and $R(i)$ are random numbers in $[-1, 1]$.

Fig. 1 For example 1 of a BHCP under an added noise 0.001, comparing the exact solution with the numerical solutions obtained by the inversions from the MCGML and the MCGMR.


Fig. 2 For example 1 of a BHCP under a large relative noise 0.1: (a) the residual error, (b) the value of $a_0$ and (c) comparison of the numerical errors.


We take $m_1 = 15$ and $m_2 = 10$, hence $n = 40$, and noise with intensity $\sigma = 10^{-3}$ is added on the right-hand side. We first apply the MCGML to find the left inversion of $B$ under the convergence criterion $\varepsilon_2 = 10^{-6}$; it converges within 714 iterations to the inverse matrix $B^{-1}$, such that the solution of $x$ can be written as (69) $x = B^{-1} b$. Unfortunately, the noise error is greatly amplified in the above solution, which causes an incorrect solution, as shown in Figure 1 by the red dashed line, with the maximum error being 2.34. Conversely, when we apply the MCGMR to find the right inversion $A$ of $B$, which converges within 914 iterations, and give the solution by (70) $x = Ab$, which is named the DIM in this paper, the numerical solution is close to the exact solution, as shown in Figure 1 by the blue dashed-dotted line, with the maximum error being 0.0478.

The MCGML is a feasible algorithm for finding the left inversion of an ill-conditioned matrix. Although the MCGML can provide a very accurate $B^{-1}$ in Equation (69), the noise is also magnified, which pollutes the numerical solution of $x$. Hence, we do not suggest directly using Equation (69) to find the solution of $x$ when the data $b$ are noisy and $B$ is ill-conditioned: the noise in $b$ is enlarged when the elements of $B^{-1}$ are quite large.

We take $m_1 = 15$ and $m_2 = 10$, hence $n = 40$, and a relative random noise with intensity $\sigma_r = 10\%$ is added to the final time data: (71) $b_i = u(x_i, T)[1 + \sigma_r R(i)]$. We first apply the MCGMR to find the right inversion of $B$ by imposing the convergence criterion $\varepsilon_2 = 10^{-3}$; it converges within 74 iterations to the right inverse matrix $A$. Then we use the DIM, as shown in Equation (70), to solve this BHCP, for which the maximum error of the initial condition is 0.088. Moreover, when $\varepsilon_2$ is reduced to $10^{-4}$, the MCGMR converges within 128 iterations to find the inverse matrix $A$, and the DIM with this $A$ in Equation (70) leads to a maximum error of $1.085 \times 10^{-3}$, as shown in Figure 2(c). To the best knowledge of the author, such accuracy has never been seen before for this highly ill-posed BHCP under the large noise $\sigma_r = 0.1$.

With $\gamma = 0.05$, we also apply the VRIA of Section 4.3 to solve this problem, where we take $\varepsilon_1 = 10^{-4}$ and $\varepsilon_2 = 10^{-4}$, with $Ab$ as the initial guess. In Figure 2(a), we show the residual error obtained by the VRIA, which converges in only 5 iterations. Adding the 127 iterations needed to find $A$, the total number of iterations for the VRIA is 132, which is larger than the 128 iterations of the DIM. The values of $a_0$ shown in Figure 2(b) are very small, which causes the fast convergence of the VRIA: in only five iterations its residual error is reduced from 2.5 to $4.448 \times 10^{-5}$. We found that a smaller convergence criterion $\varepsilon_1$ would not lead to a more accurate solution. The numerical error is compared with that obtained by the DIM in Figure 2(c), with the maximum error being 0.014, which is greater than that obtained by the DIM. This result is much better than those obtained by Liu [Citation21, Citation22], and also than that obtained by Liu et al. [Citation35].

Example 2

Let us consider the following inverse problem: recover the external force $F(t)$ for the ODE (72) $\ddot{y}(t) + \dot{y}(t) + y(t) = F(t)$. In a time interval $t \in [0, t_f]$, the discretized data $y_i = y(t_i)$ are supposed to be measurable and are subjected to random noise with intensity $\sigma = 0.01$. Usually, it is very difficult to recover the external force $F(t_i)$ from Equation (72) by direct differentiation of the noisy displacement data, because differentiation is an ill-posed linear operation.

To approach this inverse problem by polynomial interpolation, we begin with (73) $p_m(x) = c_0 + \sum_{k=1}^{m} c_k x^k$. Now the coefficient $c_k$ is split into two coefficients $a_k$ and $b_k$ to absorb more interpolation points; meanwhile, $\cos(k\theta_k)$ and $\sin(k\theta_k)$ are introduced to reduce the condition number of the coefficient matrix.[Citation36] We suppose that (74) $c_k = \dfrac{a_k \cos(k\theta_k)}{R_{2k}^{\,k}} + \dfrac{b_k \sin(k\theta_k)}{R_{2k+1}^{\,k}}$ and (75) $\theta_k = \dfrac{2k\pi}{m}$, $k = 1, \ldots, m$. The problem domain is $[a, b]$, and the interpolating points are (76) $a = x_0 < x_1 < x_2 < \cdots < x_{2m-1} < x_{2m} = b$. Substituting Equation (74) into Equation (73), we obtain (77) $p(x) = a_0 + \sum_{k=1}^{m}\left[a_k \left(\dfrac{x}{R_{2k}}\right)^{k} \cos(k\theta_k) + b_k \left(\dfrac{x}{R_{2k+1}}\right)^{k} \sin(k\theta_k)\right]$, where we let $c_0 = a_0$. Here, $a_k$ and $b_k$ are unknown coefficients. In order to obtain them, we impose the following $n$ interpolation conditions: (78) $p(x_i) = y_i$, $i = 0, \ldots, n - 1$. Thus, we obtain a linear equations system to determine $a_k$ and $b_k$: (79) $$\begin{bmatrix} 1 & \dfrac{x_0 \cos\theta_1}{R_2} & \dfrac{x_0 \sin\theta_1}{R_3} & \cdots & \left(\dfrac{x_0}{R_{2m}}\right)^{m}\cos m\theta_m & \left(\dfrac{x_0}{R_{2m+1}}\right)^{m}\sin m\theta_m \\ 1 & \dfrac{x_1 \cos\theta_1}{R_2} & \dfrac{x_1 \sin\theta_1}{R_3} & \cdots & \left(\dfrac{x_1}{R_{2m}}\right)^{m}\cos m\theta_m & \left(\dfrac{x_1}{R_{2m+1}}\right)^{m}\sin m\theta_m \\ \vdots & \vdots & \vdots & & \vdots & \vdots \\ 1 & \dfrac{x_{2m} \cos\theta_1}{R_2} & \dfrac{x_{2m} \sin\theta_1}{R_3} & \cdots & \left(\dfrac{x_{2m}}{R_{2m}}\right)^{m}\cos m\theta_m & \left(\dfrac{x_{2m}}{R_{2m+1}}\right)^{m}\sin m\theta_m \end{bmatrix} \begin{bmatrix} a_0 \\ a_1 \\ b_1 \\ \vdots \\ a_m \\ b_m \end{bmatrix} = \begin{bmatrix} y_0 \\ y_1 \\ \vdots \\ y_{2m} \end{bmatrix}.$$ We note that the norm of the first column of the above coefficient matrix is $\sqrt{2m+1}$. According to the concept of an equilibrated matrix,[Citation37] we can derive the optimal scales for the current interpolation with a half-order technique: (80) $R_{2k} = \beta_0 \left(\dfrac{1}{2m+1} \sum_{j=0}^{2m} x_j^{2k} (\cos k\theta_k)^2\right)^{1/(2k)}$, $R_{2k+1} = \beta_0 \left(\dfrac{1}{2m+1} \sum_{j=0}^{2m} x_j^{2k} (\sin k\theta_k)^2\right)^{1/(2k)}$, $k = 1, 2, \ldots, m$, where $\beta_0$ is a scaling factor.[Citation38] The improved method uses an $m$-th order polynomial to interpolate $n = 2m + 1$ data nodes, whereas the regular full-order method can only interpolate $m + 1$ data points.
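A sketch of the assembly of the coefficient matrix in Equation (79) with the half-order scales of Equation (80), under my reading of those formulas (the function name and the NumPy vectorization are assumptions, not the author's code):

```python
import numpy as np

def half_order_matrix(xs, m, beta0=5.0):
    """Assemble the (2m+1) x (2m+1) coefficient matrix of Equation (79),
    with the equilibrated half-order scales R_{2k}, R_{2k+1} of Equation (80)."""
    xs = np.asarray(xs, dtype=float)        # n = 2m+1 interpolation points
    ks = np.arange(1, m + 1)
    theta = 2.0 * ks * np.pi / m            # theta_k = 2 k pi / m, Equation (75)
    xpow = xs[:, None] ** (2 * ks)          # x_j^{2k}, shape (n, m)
    R2k  = beta0 * np.mean(xpow * np.cos(ks * theta) ** 2, axis=0) ** (1.0 / (2 * ks))
    R2k1 = beta0 * np.mean(xpow * np.sin(ks * theta) ** 2, axis=0) ** (1.0 / (2 * ks))
    B = np.ones((len(xs), 2 * m + 1))       # first column: the constant term a_0
    for k in ks:
        B[:, 2 * k - 1] = (xs / R2k[k - 1]) ** k * np.cos(k * theta[k - 1])
        B[:, 2 * k]     = (xs / R2k1[k - 1]) ** k * np.sin(k * theta[k - 1])
    return B
```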

Fig. 3 For example 2 under a large noise 0.01, (a) comparing the numerical and exact solutions, and (b) the numerical errors.


Now, we fix $m = 25$ and $t_f = 5$, and consider the exact solution $F(t) = \omega \cos(\omega t) + (1 - \omega^2)\sin(\omega t)$, obtained by inserting the exact $y(t) = \sin(\omega t)$ into Equation (72). The parameters used are $\omega = 0.5$ and $\beta_0 = 5$. When we use the MCGMR, it converges in nine iterations under $\varepsilon_2 = 10^{-4}$. We compare the numerical solution obtained by the DIM with the data given by $F(t) = \omega \cos(\omega t) + (1 - \omega^2)\sin(\omega t)$ in Figure 3(a); its maximum error is found to be 0.0196, as shown in Figure 3(b) by the solid line. Then we apply the VRIA to solve this problem under $\gamma = 0.05$ and $\beta_0 = 5$, where we first use the MCGMR to find $A$, which converges in five iterations under $\varepsilon_2 = 10^{-3}$. Then, using the initial guess $x_0 = A b$, we let the VRIA run for 20 iterations. We compare the numerical solution obtained by the VRIA with the exact data in Figure 3(a) by the dashed-dotted line; its maximum error is 0.0106, as shown in Figure 3(b) by the dashed line.

Example 3

We solve the Cauchy problem of the Laplace equation under incomplete boundary conditions: (81) $\Delta u = u_{rr} + \dfrac{1}{r} u_r + \dfrac{1}{r^2} u_{\theta\theta} = 0$, $r < \rho$, $0 \le \theta \le 2\pi$; $u(\rho, \theta) = h(\theta)$, $0 \le \theta \le \pi$; $u_n(\rho, \theta) = g(\theta)$, $0 \le \theta \le \pi$, where $h(\theta)$ and $g(\theta)$ are given functions, and $\rho = \rho(\theta)$ is a given contour describing the boundary shape. The contour in polar coordinates is specified by $\Gamma = \{(r, \theta) \mid r = \rho(\theta),\ 0 \le \theta \le 2\pi\}$, which is the boundary of the problem domain $\Omega$, and $n$ denotes the outward normal direction. We need to find the boundary data on the lower half of the contour for the completeness of the boundary data.

In the MFS, the trial solution of $u$ at the field point $z = (r\cos\theta, r\sin\theta)$ is expressed as a linear combination of the fundamental solutions $U(z, s_j)$: (82) $u(z) = \sum_{j=1}^{n} c_j U(z, s_j)$, $s_j \in \Omega^c$, where $n$ is the number of source points, $c_j$ are the unknown coefficients, $s_j$ are the source points and $\Omega^c$ is the complement of $\Omega$. For the Laplace Equation (81), we have the fundamental solutions (83) $U(z, s_j) = \ln r_j$, $r_j = \|z - s_j\|$. In the practical application of the MFS, the source points are usually distributed uniformly on a circle with radius $R$, such that, after imposing the boundary conditions in Equation (81) on Equation (82), we obtain a linear equations system: (84) $Bx = b$, where (85) $z_i = (z_{i1}, z_{i2}) = (\rho(\theta_i)\cos\theta_i, \rho(\theta_i)\sin\theta_i)$, $s_j = (s_{j1}, s_{j2}) = (R\cos\theta_j, R\sin\theta_j)$; $B_{ij} = \ln\|z_i - s_j\|$ if $i$ is odd, and $B_{ij} = \dfrac{\eta(\theta_i)}{\|z_i - s_j\|^2}\left(\rho(\theta_i) - s_{j1}\cos\theta_i - s_{j2}\sin\theta_i - \dfrac{\rho'(\theta_i)}{\rho(\theta_i)}\,[s_{j1}\sin\theta_i - s_{j2}\cos\theta_i]\right)$ if $i$ is even; $x = (c_1, \ldots, c_n)^{\mathrm T}$, $b = (h(\theta_1), g(\theta_1), \ldots, h(\theta_m), g(\theta_m))^{\mathrm T}$, in which $n = 2m$, and (86) $\eta(\theta) = \dfrac{\rho(\theta)}{\sqrt{\rho^2(\theta) + [\rho'(\theta)]^2}}$. This example poses a great challenge for testing the efficiency of a linear equations solver, because the Cauchy problem is highly ill-posed. A noise with intensity $\sigma = 10\%$ is imposed on the given data. We fix $n = 30$ and take a circle with radius $R = 500$ on which to distribute the source points. Under $\varepsilon_2 = 10^{-4}$, we can find $A$ in six iterations. Then we start from the initial guess $x_0 = A b$ and apply the VRIA to solve Equation (84) in 5 iterations, as shown in Figure 4(a). In Figure 4(b), we compare the numerical solution with the exact data given by $u = \rho^2 \cos(2\theta)$, $\pi \le \theta < 2\pi$, where $\rho = \sqrt{10 - 6\cos(2\theta)}$; the maximum error is 0.106. The DIM converges in 25 iterations under the convergence criterion $\varepsilon_2 = 10^{-5}$. As shown in Figure 4(b), the maximum error of the DIM is 0.0957.
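Under the stated geometry, the entries of Equations (85)-(86) might be assembled as follows; the uniform placement of the source angles on $[0, 2\pi)$ and the row ordering (Dirichlet data in odd rows, Neumann data in even rows, 1-based) are assumptions of this sketch:

```python
import numpy as np

def assemble_cauchy_laplace(theta_c, rho, drho, R=500.0):
    """Assemble B of Equation (85): m collocation angles theta_c on the upper
    boundary, with rho = rho(theta_c) and drho = rho'(theta_c) sampled there;
    n = 2m source points lie on a circle of radius R."""
    m = len(theta_c)
    n = 2 * m
    theta_s = 2.0 * np.pi * np.arange(n) / n           # uniform source angles (assumed)
    s1, s2 = R * np.cos(theta_s), R * np.sin(theta_s)  # s_j = (R cos, R sin), Eq. (85)
    B = np.zeros((n, n))
    for i in range(m):
        ct, st = np.cos(theta_c[i]), np.sin(theta_c[i])
        z1, z2 = rho[i] * ct, rho[i] * st              # z_i on the contour
        r2 = (z1 - s1)**2 + (z2 - s2)**2               # ||z_i - s_j||^2
        B[2 * i, :] = 0.5 * np.log(r2)                 # ln||z_i - s_j||, Dirichlet rows
        eta = rho[i] / np.sqrt(rho[i]**2 + drho[i]**2) # eta(theta), Equation (86)
        B[2 * i + 1, :] = eta / r2 * (rho[i] - s1 * ct - s2 * st
                          - (drho[i] / rho[i]) * (s1 * st - s2 * ct))  # Neumann rows
    return B
```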

Fig. 4 For example 3 of a Cauchy problem under a large noise 0.1, (a) showing residual error, and (b) comparing the numerical and exact solutions.


Fig. 5 For example 4 of an inverse heat source problem under a large noise 0.1, (a) the residual error, (b) the value of a0 and (c) comparing the numerical errors.


Example 4

In this section, we apply the VRIA and the DIM to identify an unknown space-dependent heat source function $H(x)$ for a one-dimensional heat conduction equation: (87) $u_t(x, t) = u_{xx}(x, t) + H(x)$, $0 < x < \ell$, $0 < t < t_f$; $u(0, t) = u_0(t)$, $u(\ell, t) = u_\ell(t)$, $u(x, 0) = f(x)$. In order to identify $H(x)$, we impose an extra condition: (88) $u_x(0, t) = q(t)$. We propose a numerical differentiation method by letting $v = u_t$. Taking the time differentials of the equations in (87) and of Equation (88), we can derive (89) $v_t(x, t) = v_{xx}(x, t)$, $0 < x < \ell$, $0 < t < t_f$; $v(0, t) = \dot{u}_0(t)$, $v(\ell, t) = \dot{u}_\ell(t)$, $v_x(0, t) = \dot{q}(t)$. This is an inverse heat conduction problem (IHCP) for $v(x, t)$ that does not use the initial condition.

Therefore, as a numerical method, we can first solve the above IHCP for $v(x, t)$ by using the MFS of Section 5.1 to obtain a linear equations system; then the method introduced in Section 4 is used to solve the resulting system. Hence, we can construct $u(x, t)$ by (90) $u(x, t) = \int_0^t v(x, \xi)\, d\xi + f(x)$, which automatically satisfies the initial condition in Equation (87).

From Equation (90), it follows that (91) $u_{xx}(x, t) = \int_0^t v_{xx}(x, \xi)\, d\xi + f''(x)$, which, together with $u_t = v$ inserted into Equation (87), leads to (92) $v(x, t) = \int_0^t v_{xx}(x, \xi)\, d\xi + f''(x) + H(x)$. Inserting Equation (89) for $v_{xx} = v_t$ into the above equation and integrating, we derive the following equation to recover $H(x)$: (93) $H(x) = v(x, 0) - f''(x)$. For the purpose of comparison we consider the following exact solutions: (94) $u(x, t) = x^2 + 2xt + \sin(2\pi x)$, $H(x) = 2x - 2 + 4\pi^2 \sin(2\pi x)$. In Equation (93), we disregard the ill-posedness of computing $f''(x)$ and suppose that these data are given exactly. We solve this problem by the DIM and the VRIA with $\gamma = 0.05$ and the convergence criterion $\varepsilon_2 = 10^{-4}$. A random noise with intensity $\sigma = 10\%$ is added to the data $\dot{q}(t)$. In the solution of $A$, the DIM converges within 57 iterations, and its numerical error, shown in Figure 5(c) by the dashed line, is small, with a maximum error of 0.023. We compute $A$ under $\varepsilon_2 = 10^{-3}$, which converges within 28 iterations. Then, starting from the initial guess $x_0 = A b$, we let the VRIA run five iterations, whose residual error and $a_0$ are shown in Figures 5(a) and 5(b), respectively. The numerical error obtained by the VRIA is shown in Figure 5(c) by the solid line, with a maximum error of 0.0086. It is remarkable that the accuracy of the VRIA is so good, given that the maximum value of $H(x)$ is about 38. This example shows again that the proposed DIM and VRIA can solve linear inverse problems with good efficiency and accuracy, as well as robustness against the large noise with intensity $\sigma = 0.1$ imposed on the data on the right-hand side.

Conclusions

For solving an ill-posed linear equations system $Bx = b$ under a large noisy disturbance, with $r = Bx - b$ the residual vector, we have derived a very simple iterative algorithm including a parameter $\gamma$: (95) $x_{k+1} = x_k - (1 - \gamma)\, \dfrac{r_k \cdot v_k}{\|v_k\|^2}\, u_k$, where (96) $u_k = A r_k$, $v_k = B u_k$. The Lyapunov stability theorem guarantees that the present algorithm is stable and converges fast, because the search direction $Ar$ is close to the best descent direction $B^{-1} r$, with $A$ a right inversion of $B$. The new algorithm is a VRIA, which has superior computational efficiency and accuracy in solving ill-posed linear problems. This assertion was further confirmed by four linear inverse problems, revealing that the proposed DIM and VRIA can solve linear inverse problems very well, with high robustness against the large noise $\sigma = 0.1$ imposed on the data on the right-hand side.

Acknowledgments

First, the author is grateful to the anonymous referees for their comments, which substantially improved the quality of this paper. Taiwan's National Science Council project NSC-99-2221-E-002-074-MY3, granted to the author, is highly appreciated. The author would also like to acknowledge his promotion to Lifetime Distinguished Professor of National Taiwan University in 2013.

References

  • Bhaya A, Kaszkurewicz E. Control perspectives on numerical algorithms and matrix problems. Philadelphia, PA: Society for Industrial and Applied Mathematics; 2006.
  • Liu C-S, Atluri SN. A novel time integration method for solving a large system of non-linear algebraic equations. Comput. Model. Eng. Sci. 2008;31:71–83.
  • Helmke U, Moore JB. Optimization and dynamical systems. Berlin: Springer; 1994.
  • Gavurin MK. Nonlinear functional equations and continuous analogs of iterative methods. Izv. Vyssh. Uchebn. Zaved. 1958;5:18–31.
  • Alber YI. Continuous processes of the Newton type. Differ. Equ. 1971;7:1461–1471.
  • Hirsch MW, Smale S. On algorithms for solving f(x) = 0. Commun. Pure Appl. Math. 1979;32:281–312.
  • Chu MT. On the continuous realization of iterative processes. SIAM Rev. 1988;30:375–387.
  • Ortega JM, Rheinboldt WC. Iterative solution of nonlinear equations in several variables. New York: Academic Press; 1970.
  • Bhaya A, Kaszkurewicz E. Steepest descent with momentum for quadratic functions is a version of the conjugate gradient method. Neural Networks. 2004;17:65–71.
  • Bhaya A, Kaszkurewicz E. A control-theoretic approach to the design of zero finding numerical methods. IEEE Trans. Autom. Control. 2007;52:1014–1026.
  • Liu C-S, Atluri SN. A Fictitious time integration method for the numerical solution of the Fredholm integral equation and for numerical differentiation of noisy data, and its relation to the filter theory. Comput. Model. Eng. Sci. 2009;41:243–261.
  • Liu C-S, Atluri SN. A highly accurate technique for interpolations using very high-order polynomials, and its applications to some ill-posed linear problems. Comput. Model. Eng. Sci. 2009;43:253–276.
  • Liu C-S, Yeih W, Atluri SN. On solving the ill-conditioned system Ax = b: general-purpose conditioners obtained from the boundary-collocation solution of the Laplace equation, using Trefftz expansions with multiple length scales. Comput. Model. Eng. Sci. 2009;44:281–311.
  • Liu C-S, Hong HK, Atluri SN. Novel algorithms based on the conjugate gradient method for inverting ill-conditioned matrices, and a new regularization method to solve ill-posed linear systems. Comput. Model. Eng. Sci. 2010;60:279–308.
  • Liu C-S. A revision of relaxed steepest descent method from the dynamics on an invariant manifold. Comput. Model. Eng. Sci. 2011;80:57–86.
  • Liu C-S. Modifications of steepest descent method and conjugate gradient method against noise for ill-posed linear systems. Commun. Numer. Anal. 2012;Article ID cna-00115:24.
  • Liu C-S, Atluri SN. An iterative method using an optimal descent vector, for solving an ill-conditioned system Bx = b, better and faster than the conjugate gradient method. Comput. Model. Eng. Sci. 2012;80:275–298.
  • Liu C-S. A dynamical Tikhonov regularization for solving ill-posed linear algebraic systems. Acta Appl. Math. 2013;123:285–307.
  • Liu C-S. The concept of best vector used to solve ill-posed linear inverse problems. Comput. Model. Eng. Sci. 2012;83:499–525.
  • Liu C-S. A globally optimal iterative algorithm to solve an ill-posed linear system. Comput. Model. Eng. Sci. 2012;84:383–403.
  • Liu C-S. Optimally scaled vector regularization method to solve ill-posed linear problems. Appl. Math. Comp. 2012;218:10602–10616.
  • Liu C-S. Optimally generalized regularization methods for solving linear inverse problems. Compu. Mater. Contin. 2012;29:103–127.
  • Liu C-S. An optimal tri-vector iterative algorithm for solving ill-posed linear inverse problems. Inv. Prob. Sci. Eng. 2013;21:650–681.
  • Ascher U, van den Doel K, Huang H, Svaiter B. Gradient descent and fast artificial time integration. M2AN. 2009;43:689–708.
  • Liu C-S, Chang CW. Novel methods for solving severely ill-posed linear equations system. J. Marine Sci. Tech. 2009;17:216–227.
  • Liu C-S, Atluri SN. An iterative algorithm for solving a system of nonlinear algebraic equations, F(x) = b, using the system of ODEs with an optimum α in ẋ = λ[αF + (1 − α)BFF]; Bij = ∂Fi/∂xj. Comput. Model. Eng. Sci. 2011;73:395–431.
  • Mohler RR. Nonlinear systems, Volume 1: dynamics and control. Englewood Cliffs (NJ): Prentice-Hall; 1991.
  • Chen W, Gu Y. An improved formulation of singular boundary method. Adv. Appl. Math. Mech. 2012;4:543–558.
  • Gu Y, Chen W, He X-Q. Singular boundary method for steady-state heat conduction in three dimensional general anisotropic media. Int. J. Heat Mass Transfer. 2012;55:4837–4848.
  • Lin J, Chen W, Wang F. A new investigation into regularization techniques for the method of fundamental solutions. Math. Comput. Simul. 2011;81:1144–1152.
  • Fu Z, Chen W, Zhang C. Boundary particle method for Cauchy inhomogeneous potential problems. Inv. Prob. Sci. Eng. 2012;20:189–207.
  • Hon YC, Li M. A discrepancy principle for the source points location in using the MFS for solving the BHCP. Int. J. Comput. Meth. 2009;6:181–197.
  • Liu C-S. The method of fundamental solutions for solving the backward heat conduction problem with conditioning by a new post-conditioner. Num. Heat Transfer B: Fundam. 2011;60:57–72.
  • Liu C-S. Group preserving scheme for backward heat conduction problems. Int. J. Heat Mass Transfer. 2004;47:2567–2576.
  • Liu C-S, Zhang SY, Atluri SN. The Jordan structure of residual dynamics used to solve linear inverse problems. Comput. Model. Eng. Sci. 2012;88:29–47.
  • Liu C-S. A highly accurate multi-scale full/half-order polynomial interpolation. Comput. Mater. Contin. 2011;25:239–263.
  • Liu C-S. An equilibrated method of fundamental solutions to choose the best source points for the Laplace equation. Eng. Anal. Bound. Elem. 2012;36:1235–1245.
  • Liu C-S. A two-side equilibration method to reduce the condition number of an ill-posed linear system. Comput. Model. Eng. Sci. 2013;91:17–42.
