Search in:

Inverse Problems in Science and Engineering Volume 29, 2021 - Issue 12

Submit an article Journal homepage

Free access

585

Views

CrossRef citations to date

Altmetric

Listen

Research Article

An inverse source identification by nonlinear optimization in a two-dimensional hyperbolic problem

Murat SubaşıScience Faculty, Department of Mathematics, Atatürk University, Erzurum, TurkeyCorrespondence[email protected]

Faika Derya ŞendurScience Faculty, Department of Mathematics, Atatürk University, Erzurum, Turkey

Cavide YaşarScience Faculty, Department of Mathematics, Atatürk University, Erzurum, Turkey

Pages 2110-2130 | Received 09 Sep 2020, Accepted 08 Mar 2021, Published online: 11 Apr 2021

Cite this article
https://doi.org/10.1080/17415977.2021.1904235
CrossMark

In this article

1. Introduction
2. Weak formulation and finite element method for direct problem
3. Solution of inverse identification problem
4. Numerical illustrations
5. Conclusions
Acknowledgements
Disclosure statement
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

This study deals with the identification of source function from final time state observation in a two-dimensional hyperbolic problem. The solution to the direct problem is obtained by the weak solution approach and finite element method. In the part of the inverse problem, the trust-region method and Levenberg–Marquardt method, which are nonlinear least-squares optimization methods, are used for the identification of source function. The findings are presented with numerical examples.

KEYWORDS:

Inverse problems
hyperbolic equations
finite elements method
optimization

Mathematical Subject classifications:

34M50
35L50
65M60
65K10

1. Introduction

In abstract and applied mathematics, the hyperbolic problems related to wave equations have attracted the attention of many scientists. The desire to get some knowledge about the internal dynamics of the wave from its traceable behaviour has been the reason for this interest. In hyperbolic equations, solutions of the corresponding direct problems may have discontinuities, singular components of complicated structure. As it is known, the more stable an operator is, the harder it is to work with its inverse. In this respect, inverse problems for hyperbolic equations have always been popular.

Considering the spatial variables in the region $x \in Ω \subset R^{2}$ and time variable in the interval $t \in (0, T]$ , let us write the hyperbolic equation on the domain $Q_{T} := (x, t) \in Ω \times (0, T]$ in the form (1.1) $\begin{aligned} \frac{\partial^{2} u}{\partial t^{2}} - \nabla \cdot (c (x) \nabla u) + a (x, t) u = σ (t) f (x), (x, t) \in Q_{T} \end{aligned}$ (1.1) (1.2) $\begin{aligned} u (x, 0) = u_{0} (x), u_{t} (x, 0) = v_{0} (x), x \in Ω \end{aligned}$ (1.2) (1.3) $\begin{aligned} u (x, t) = g (t), x \in \partial Ω, t \in (0, T] \end{aligned}$ (1.3) where (1.4) $\begin{aligned} c (x) \in L_{\infty} (Ω) and c (x) > 0, a (x, t) \in L_{\infty} (Q_{T}), σ (t) \in L_{2} (0, T], f (x) \in L_{2} (Ω) \\ u_{0} (x) \in H^{1} (Ω), v (x) \in L_{2} (Ω), g (t) \in L_{2} (0, T] . \end{aligned}$ (1.4) In vibration modelling, the external forces with the form of separation of variables $σ (t) f (x)$ have special meanings. For example, the selection of $σ (t) = \cos w t$ describes a harmonic spatial force. Generally, the problem (1.1)–(1.3) is admitted as a model for flexible waves corresponding to a point slipping source. This kind of point source can be connected with models in ground-penetrating radar, reflection seismology, oil and gas exploration and many other physical systems [Citation1].

For a given source $σ (t) f (x)$ , obtaining the solution $u (x, t)$ of the initial boundary value problem (1.1)–(1.3) is defined as the direct problem. We can conclude from [Citation2] that under the conditions (1.4), the problem (1.1)–(1.3) has a weak solution $u \in C ([0, T], H^{1} (Ω))$ with the weak derivatives $u_{t} \in C ([0, T], L_{2} (Ω))$ and $u_{t t} \in C ([0, T], H^{- 1} (Ω))$ . Also, this solution satisfies the following estimate: (1.5) $\begin{aligned} ‖ u ‖_{L_{\infty} (0, T; H^{1} (Ω))} + ‖ u_{t} ‖_{L_{\infty} (0, T; L_{2} (Ω))} \\ \leq c ({‖ σ ‖}_{L_{2} (0, T]} {‖ f ‖}_{L_{2} (Ω)} + {‖ u_{0} ‖}_{H^{1} (Ω)} + {‖ v ‖}_{L_{2} (Ω)} + ‖ g ‖_{L_{2} (0, T]}) \end{aligned}$ (1.5) Besides, the identification of any unknown data from a given observation related to the solution is defined as the inverse problem.

In this study, we consider the following inverse problem: (1.6) $find the source function f (x) from final time state observation w (x) := u (x, T; f (x))$ (1.6) where $u (x, T; f (x))$ represents the solution of the direct problem for a given source $f (x) \in L_{2} (Ω)$ .

In the last decades, inverse source problems with the final time observation for hyperbolic problems have attracted great attention due to both their significance in engineering applications and significance in the theory of inverse problems [Citation3–10].

The solution of the (1.6) inverse problem is achieved by minimizing the following least-squares functional (1.7) $F (f (x)) = ‖ u (x, T; f (x)) - w (x) ‖^{2}$ (1.7) On the other hand, these types of inverse problems are ill-posed according to Hadamard requirements which are existence, uniqueness and stability of the solution. There are many studies to overcome the ill-posedness by regularization techniques [Citation4,Citation11–14]

The layout of this article is as follows. In Section 2, we have applied the finite element method for weak solution to direct problem. In Section 3, we have implemented a trust-region algorithm and Levenberg–Marquardt method, which are interrelated methods, except for one point, for the identification problem. In the last section, we have tested the proposed methods on two numerical examples.

2. Weak formulation and finite element method for direct problem

Multiplying the equality of (1.1) by a test function $v$ , which is zero on the boundary and integrating over the domain, we get the following integral equality for $t \in (0, T]$ : $\int_{Ω} \ddot{u} v dx - \int_{Ω} \nabla \cdot (c \nabla u) v dx + \int_{Ω} a u v dx = \int_{Ω} σ (t) f (x) v dx$ and by Green’s formula, we have (2.1) $\int_{Ω} \ddot{u} v dx + \int_{Ω} (c \nabla u) \cdot \nabla v dx - \int_{\partial Ω} (c \nabla u) \cdot n v ds + \int_{Ω} a u v dx = \int_{Ω} σ (t) f (x) v dx$ (2.1) By the spaces $H_{g}^{1} = {v : v_{L_{2} (Ω)} + \nabla v_{L_{2} (Ω)} < \infty, {v |}_{\partial Ω} = g}$ and $H_{0}^{1} = {v : v_{L_{2} (Ω)} + \nabla v_{L_{2} (Ω)} < \infty, {v |}_{\partial Ω} = 0}$ , the variational formulation of the considered problem is:

find $u \in H_{g}^{1}$ such that for every fixed $t \in (0, T]$ and (2.2) $\int_{Ω} \ddot{u} v dx + \int_{Ω} (c \nabla u) \cdot \nabla v dx + \int_{Ω} a u v dx = \int_{Ω} σ (t) f (x) v dx, \forall v \in H_{0}^{1} .$ (2.2) Let us apply the finite element method to the direct problem. The domain $Ω$ is approximated by the union of $N_{T}$ elements $Ω \approx τ^{h} \equiv \cup_{i = 1}^{N_{T}} τ_{i}$ where $τ_{i}$ is the $i$ th element. The union of the elements is called $τ^{h}$ , and it is the finite element mesh for the domain $Ω$ . The number $h$ indicates the size of the elements.

We seek the approximate solution in the form (2.3) $u^{h} (x, y, t) = \sum_{j = 1}^{N_{p}} U_{j} (t) ϕ_{j} (x, y)$ (2.3) where $N_{p}$ is the number of the nodes. The basis function $ϕ_{j} (x, y)$ is the second-order Lagrangian shape functions $ϕ_{j} (x, y) = a_{1} + a_{2} x + a_{3} y + a_{4} x^{2} + a_{5} x y + a_{6} y^{2} .$ The six coefficients $a_{j}$ of this polynomial are determined by requiring $ϕ_{j} (x_{k}, y_{k}) = δ_{j k}, j, k = 1, 2, \dots, N_{p} .$ With these six parameters, we consider constructing a quadratic Lagrange polynomial by placing nodes at the vertices and mid-sides of a triangular element [Citation15].

Now, considering the approximate solution $u^{h}$ in (2.2) $\int_{Ω} {\ddot{u}}_{h} v dx + \int_{Ω} (c \nabla u^{h}) \cdot \nabla v dx + \int_{Ω} a u^{h} v dx = \int_{Ω} σ (t) f (x) v dx$ We get $\begin{aligned} \sum_{j = 1}^{N_{p}} {\ddot{u}}_{j} (t) \int_{Ω} ϕ_{j} v dx + \sum_{j = 1}^{N_{p}} U_{j} (t) \int_{Ω} (c \nabla ϕ_{j}) \cdot \nabla v dx + \sum_{j = 1}^{N_{p}} U_{j} (t) \int_{Ω} a ϕ_{j} v dx = \int_{Ω} σ (t) f (x) v dx \end{aligned}$ Choosing $v$ to be each of the basis functions $ϕ_{i}, i = 1, \dots, N_{p}$ , we write $\begin{aligned} \sum_{j = 1}^{N_{p}} {\ddot{u}}_{j} (t) \int_{Ω} ϕ_{j} ϕ_{i} dx + \sum_{j = 1}^{N_{p}} U_{j} (t) \int_{Ω} (c \nabla ϕ_{j}) \cdot \nabla ϕ_{i} dx + \sum_{j = 1}^{N_{p}} U_{j} (t) \int_{Ω} a ϕ_{j} ϕ_{i} dx \\ = \int_{Ω} σ (t) f (x) ϕ_{i} dx, i = 1, \dots, N_{p} . \end{aligned}$ So, we have $N_{p}$ unknowns and $N_{p}$ equations above, which will give a unique solution $U_{1}, U_{2}, \dots, U_{N_{p}}$ .

For solution vector $U = [U_{1}, U_{2}, \dots, U_{N_{p}}]^{T}$ , we can write the following time-dependent matrix equations: (2.4) $M \frac{d^{2} U}{d t^{2}} + KU + A (t) U = F (t)$ (2.4) where $M = M_{i j} = \int_{Ω} ϕ_{j} ϕ_{i} dx, i = 1, \dots, N_{p}, j = 1, \dots, N_{p},$ $K = K_{i j} = \int_{Ω} (c \nabla ϕ_{j}) \cdot \nabla ϕ_{i} dx, i = 1, \dots, N_{p}, j = 1, \dots, N_{p},$ $A (t) = A_{i, j} = \int_{Ω} a ϕ_{j} ϕ_{i} dx, i = 1, \dots, N_{p}, j = 1, \dots, N_{p},$

and $F (t) = F_{i} = \int_{Ω} σ (t) f (x) ϕ_{i} dx, i = 1, \dots, N_{p} .$ The equality of (2.4) is the system of second-order ordinary differential equation and the initial conditions come from (1.2) such as $U_{j} (0) = u_{0} (P_{j}), \frac{d U_{j}}{d t} (0) = v_{0} (P_{j}), j = 1, \dots, N_{p}$ and by vector form $u_{0} = U_{j} (0) and v_{0} = \frac{d U_{j}}{d t} (0) .$ Hence, we have the system (2.5) $\begin{aligned} M \frac{d^{2} U}{d t^{2}} + KU + A (t) U = F (t) \\ u_{0} = U (0), v_{0} = \frac{dU}{d t} (0) \end{aligned}$ (2.5) This system can be solved on the time interval $(0, T]$ by any of the numerical methods like Euler's method, Runge–Kutta method or an adaptive scheme. Although the requirement on $Δ t$ is not so stringent by an implicit method when solving the hyperbolic problems, we must not forget stability issues. In applying (2.5) to approximate a solution to the hyperbolic equation, we realize that the approximate solution is growing implausible in amplitude, and then the time step must be decreased.

The approximate solution $u^{h}$ satisfies the a priori estimate [Citation15] (2.6) $‖ u (t) - u^{h} (t) ‖ \leq C h^{2} (‖ \ddot{u} (t) ‖ + \int_{0}^{t} ‖ \ddot{u} (., s) ‖ d s)$ (2.6)

3. Solution of inverse identification problem

In order to get the unknown source function from the given final time observation by (1.7), we will use the nonlinear least-squares optimization for the problem (3.1) $f^{*} = \underset{f}{argmin} F (f (x))$ (3.1) where $F (f (x)) = ‖ u (x, T; f (x)) - w (x) ‖^{2} = ‖ r (f) ‖^{2} = \sum_{j = 1}^{N_{p}} [u^{h} (P_{j}, T; f (P_{j})) - w (P_{j})]^{2}$ and the components of $r (f)$ are defined as $r_{j} (f) = u^{h} (P_{j}, T; f (P_{j})) - w (P_{j}), j = 1, 2, \dots, N_{p}$ .

Considering the compatibility conditions, the structure of identified function $f (x)$ is estimated and appropriate parameters $c = {c_{1}, c_{2}, \dots, c_{m}}$ are included to this function. So, it is assumed that we want to get $c^{*}$ , which is the minimum of the objective function $F (c) : R^{m} \overset{r}{\to} R^{N_{p}} \to R - .$ Namely, the problem (3.1) is reformulated as follows: (3.2) $f^{*} := F^{*} (c^{*}) = \underset{c}{argmin} ‖ u^{h} (P_{j}, T; f (P_{j})) - w (P_{j}) ‖^{2}$ (3.2) Let us write the Taylor approximation by the degree 2 of $F$ around any $c$ : $F (c + Δ c) ≅ L (Δ c) = F (c) + (Δ c)^{T} g + \frac{1}{2} (Δ c)^{T} B (Δ c)$ where $L (Δ c)$ is the quadratic approximation of $F (c)$ around $c$ , $g$ the gradient of $F (c)$ computed at $c$ , $B$ an approximation of the Hessian matrix $H$ of $F (c)$ at $c$ and $H$ the real hessian matrix of $F (c)$ at $c$ .

Also, we know, from the optimization theory that if $c^{*}$ is a local minimizer, then $g (c^{*}) = 0$ (Necessary condition for a local minimizer) and $(Δ c)^{T} B (c^{*}) (Δ c) > δ ‖ Δ c ‖^{2}$ (Positive definiteness – Sufficient condition for a local minimizer), with some number $δ > 0$ . So, we must always construct the positive definite $B$ matrix.

In the remaining of the paper, we use the 2-norm, $‖ c ‖ = \sqrt{c_{1}^{2} + \dots + c_{m}^{2}}$ .

On the other hand, we know, from the vector analysis that the gradient at $c$ is $g (c) = J (c)^{T} r (c)$ where $r (c)$ is the vector whose components are defined as follows: $r_{j} (c) = u^{h} (P_{j}, T; f (P_{j}; c)) - w (P_{j}), j = 1, 2, \dots, N_{p} .$ Hessian at $c$ is $H (c) = J (c)^{T} J (c) + \sum_{j = 1}^{N_{p}} r_{j} (c) r_{j}^{^{''}} (c)$ and approximation of Hessian at $c$ is $B (c) = J (c)^{T} J (c)$ . Here, $J (c) = [J_{i j}]_{N_{p} \times m} = \frac{\partial r_{i}}{\partial c_{j}} (c)$ is the partial derivative (Jacobian) of $r_{i}$ with respect to $c_{j}$ .

According to the Gauss–Newton method, the computation of the step $Δ c$ involves the solution of the linear system (3.3) $(J {(c)}^{T} J (c)) Δ c = - J (c)^{T} r (c) .$ (3.3) One difficulty of using the Gauss–Newton step is that the Jacobi matrix $J (c)$ may be ill-conditioned, which normally leads to a very big step $Δ c$ . And a very long step $Δ c$ usually causes the algorithm to break down, because of either numerical overflows or failure in line searches. So, when the Gauss–Newton method is used for ill-conditioned problems, it is not efficient about convergence relations. The main difficulty is that the step size $Δ c$ is too large and goes in a ‘bad’ direction that gives little reduction in the function.

We can handle this issue via the use of a Lagrange multiplier and thus replace the problem (3.4) $(J {(c)}^{T} J (c) + λ I) Δ c = - J (c)^{T} r (c),$ (3.4) where $λ > 0$ is the Lagrange multiplier for the constraint at the $k$ th iteration.

The parameter $λ$ affects both the direction and the length of the step $Δ c$ . Depending on the size of $λ$ , the correction $Δ c$ can vary from a Gauss–Newton step for $λ = 0$ to a very short step approximately in the steepest descent direction for large values of $λ$ . As we see from these considerations, this parameter acts similar to the step control for the damped Gauss–Newton method, but it also changes the direction of the correction.

There are two effective methods for solving such nonlinear equations: Levenberg–Marquardt method, first given by Levenberg [Citation16] and re-derived by Marquardt [Citation17], and trust-region method [Citation18,Citation19]. They are both Gauss–Newton-based methods and expose quadratic speed of convergence near $c^{*}$ .

The Levenberg–Marquardt step of (3.4) can be interpreted as solving the normal equations used in the Gauss–Newton method, but ‘shifted’ by a scaled identity matrix, so as to convert the problem from having an ill-conditioned (or positive semi-definite) matrix $J (c)^{T} J (c)$ into a positive definite one. Notice that the positive definiteness implies that the Levenberg–Marquardt direction has always descent and, therefore, the method is well defined. During the process, the size of $λ$ is updated by controlling the gain ratio $ρ = \frac{actual decrease}{predicted decrease} = \frac{F (c) - F (c + Δ c)}{L (0) - L (Δ c)}$ A large value of $ρ$ implies that $L (Δ c)$ is a good approximation to $F (c + Δ c)$ and we can decrease $λ$ so that the next Levenberg–Marquardt step is closer to the Gauss–Newton step. If $ρ$ is small or negative then $L (Δ c)$ is a poor approximation and we must increase $λ$ to get steepest descent direction and reduce the step length.

The Levenberg–Marquardt method’s algorithm is as follows;

Levenberg–Marquardt method’s algorithm

Table

Display Table

The choice of

ζ

should be a small value, e.g.

ζ = 10^{- 6} .

a_{i i}

is computed by the diagonal components in

J (c^{0})^{T} r (c^{0})

ε_{1}

ε_{2}

and

k_{max}

are chosen by the user. The inequalities

‖ J {(c^{k})}^{T} r (c^{k}) ‖ \leq ε_{1}

and

‖ Δ c^{k} ‖ \leq ε_{2} (‖ c^{k} ‖ + ε_{2})

are appropriate stopping criteria and

k < k_{max}

is a safeguard against an infinite loop.

On the other hand, reduction in the step size $‖ Δ c ‖$ is important. It can be proven that, if we apply a proper limitation on the step size $‖ Δ c ‖ \leq δ$ , we sustain global convergence even if $J (c)^{T} J (c)$ is an indefinite matrix. The trust-region algorithm is based on this rule ( $δ$ is called the region radius).

In a trust-region method, it is assumed that the model is sufficiently accurate inside a ball with radius $δ$ (trust-region radius) centred at $c$ and the step $Δ c$ is to solution to a constrained optimization problem (3.5) $\begin{matrix} minimize & L [Δ (c)] = F (c) + {(Δ c)}^{T} g + \frac{1}{2} {(Δ c)}^{T} B (Δ c) \\ subject to & ‖ Δ c ‖ \leq δ \end{matrix}$ (3.5) Indeed, a trust-region algorithm for nonlinear least-squares and Levenberg–Marquardt method are similar, apart from the bound $δ$ is updated from iteration to iteration directly instead of updating parameter $λ$ .

Trust-region method’s algorithm is as follows:

Trust-region method’s algorithm

Table

Display Table

Modifying

δ

directly has the advantage of controlling and observing the length of

Δ c

easily. Hence, nowadays, it is regarded that the trust-region approach is better than the original Levenberg–Marquardt method. A detailed information for the trust-region method can be found in [Citation20,Citation21].

4. Numerical illustrations

Now, we aim to test the explained process by giving on two problems. In solving these problems, we have used the MATLAB-2019 software.

Example 1 Consider the material occupying $Ω = {(x, y) : (x, y) \in (1, 4) \times (1, 5) ∖ (2, 3) \times (3, 5)}$ .

On the time interval $t \in (0, 1]$ , we have the problem (4.1) $\begin{aligned} \frac{\partial^{2} u}{\partial t^{2}} - \nabla \cdot (\nabla u) = \cos t f (x), (x, t) \in Q_{1} \end{aligned}$ (4.1) (4.2) $\begin{aligned} u (x, 0) = 0, u_{t} (x, 0) = 0, x \in Ω \end{aligned}$ (4.2) (4.3) $\begin{aligned} u (x, t) = 0, x \in \partial Ω, t \in (0, 1] \end{aligned}$ (4.3)

and want to identify the $f (x)$ source function from the final time state information (4.4) $w (x) = u (x, 1) = \sin π x \sin π y (\cos \sqrt{2} π - \cos 1) .$ (4.4) The exact source function is $f (x) = (1 - 2 π^{2}) \sin π x \sin π y$ .

Firstly, let us compute the solution of direct problem $f (x) \to u (x, 1)$ by the finite element method.

The domain $Ω$ is approximated by the union of $N_{T}$ elements such as $Ω \approx τ^{h} \equiv \cup_{i = 1}^{N_{T}} τ_{i} .$ In the following, there are the figures of the domain and a finite element mesh for the problem.

The numbers of elements and nodes are 1033 and 2188, respectively, for the mesh size $h = 0.15$ .

The exact solution $u (x, 1)$ and the calculated solution of the finite element method $u^{h} (x, 1)$ by (2.5) are presented in Figures .

Figure 1. The domain and finite element mesh for $h = 0.15$ .

Figure 2. Exact wave function at $T = 1$ .

Figure 3. Calculated wave function at $T = 1$ .

Some error norm values corresponding different $h$ maximum mesh size are given in Table . Due to ill-conditioning of some matrices in (2.5), it can be seen by $h = 0.15$ and $h = 0.1$ in Table that a smaller $h$ value gives a bigger error norm. Hence, we take the $h = 0.15$ mesh size for this example. Figure shows the error of the computation for this mesh size.

Figure 4. Error values for $h = 0.15$ .

Table 1. Some $h$ and corresponding error norm values.

Display Table

Secondly, we deal with the solution of the inverse identification problem $w (x) \to f (x)$ by nonlinear least-squares optimization techniques.

From the given final time state function $w (x) = \sin π x \sin π y (\cos \sqrt{2} π - \cos 1)$ , we can estimate that the source function may have the form (4.5) $f (x) = c \sin π x \sin π y .$ (4.5) On the other hand, the exact source function leading to the given final time state function is $f_{exact} (x) = (1 - 2 π^{2}) \sin π x \sin π y ≅ - 18.7392 \sin π x \sin π y .$ The minimization is carried out by the single-parameter functional (4.6) $F (c) = \underset{c}{argmin} u^{0.15} (P_{j}, 1; f (P_{j})) - w (P_{j})^{2} .$ (4.6) If minimization of this functional by the Levenberg–Marquardt method and trust-region method is carried out using MATLAB then the following outcomes are obtained.

For the Levenberg–Marquardt method, we get Table .

Table 2. Levenberg–Marquardt method.

Download CSV Display Table

Optimization is completed by the Levenberg–Marquardt method since the relative norm of the current step $(3.67 e - 09)$ is less than the value of the step tolerance $(1 e - 06)$ . The found optimal parameter is $c_{Levenberg - Marquardt}^{*} = - 18.72911864359083$ . Hence, the identified source function is (4.7) $f_{Levenberg - Marquardt}^{*} (x) = - 18.72911864359083 \sin π x \sin π y$ (4.7) The residual norm given by (4.6) for this function is $F^{*} = 1.857800410424216 e - 05$ .

For the trust-region method, we get Table .

Table 3. Trust-region method.

Download CSV Display Table

Optimization is completed by the trust-region method since the size of the gradient $(4.08 e - 07)$ , which is the first-order optimality measure, is less than the value of the optimality tolerance $(1 e - 06)$ . The found optimal parameter is $c_{trust - region}^{*} = - 18.72911914110111$ . Hence, the identified source function is (4.8) $f_{trust - region}^{*} (x) = - 18.72911914110111 \sin π x \sin π y$ (4.8) The residual norm given by (4.6) for this function is $F^{*} = 1.857800460571223 e - 05$ .

The difference between the optimal parameter values of two methods is $‖ c_{Levenberg - Marquardt}^{*} - c_{trust - region}^{*} ‖ = 4.9751 e - 07.$ As seen the parameters of these methods are too close to each other. If we take the optimal parameter as $c_{}^{*} ≅ - 18.7291$ then we get the following error norm (4.9) $‖ f_{exact} (x) - f_{}^{*} (x) ‖ = 0.2310.$ (4.9) As seen in this example, the Levenberg–Marquardt method was able to calculate the result with the same success using less iteration.

Now, we will examine that the problem needs regularization process or not. To do this, we generate random noisy data $w_{noisy} (P_{j})$ . The noisy data are simulated as (4.10) $w_{noisy} (P_{j}) = w (P_{j}) + ρ (P_{j}) w (P_{j})$ (4.10) where $ρ (P_{j})$ are random variables with the mean zero and standard deviation $σ$ given by $σ = γ$ , where $γ$ represents the percentage of noise. The random variables $ρ (P_{j}) = normrnd (0, σ, length (P_{j}))$ is generated by normrnd MATLAB function.

Firstly, we generate random noisy data with the percentage of 10% in the observation function and measure the difference between the source functions caused by this noise.

The optimal parameters accompanying with noisy source function are $c_{trust - region}^{* (noisy)} - 18.701749615205685$ and $c_{Levenberg - Marquardt}^{* (noisy)} = - 18.701748831381636$ .

The 10% noise causes the $\frac{‖ w_{noisy} (P_{j}) - w (P_{j}) ‖}{‖ w (P_{j}) ‖} = 0.1587$ norm between observation functions and $\frac{‖ f^{*} (x; w (P_{j})) - f^{*} (x; w_{noisy} (P_{j})) ‖}{‖ f^{*} (x; w (P_{j})) ‖} = 0.0018$ norm between source functions.

Secondly, we generate random noisy data with the percentage of 2% in the observation function and measure the difference between the source functions caused by this noise (Figures ).

Figure 5. The difference between the exact and 10% noisy observation function.

Figure 6. The difference between source functions corresponding to the exact and 10% noisy observation function.

Figure 7. The difference between the exact and 2% noisy observation function.

Figure 8. The difference between source functions corresponding to the exact and 2% noisy observation function.

The optimal parameters accompanying with noisy source function are $c_{trust - region}^{* (noisy)} = - 18.712029543267924$ and $c_{Levenberg - Marquardt}^{* (noisy)} = - 18.712029495721413$ .

The 2% noise causes the $\frac{‖ w_{noisy} (P_{j}) - w (P_{j}) ‖}{‖ w (P_{j}) ‖} = 0.0159$ norm between observation functions and $\frac{‖ f^{*} (x; w (P_{j})) - f^{*} (x; w_{noisy} (P_{j})) ‖}{‖ f^{*} (x; w (P_{j})) ‖} = 0.00091$ norm between source functions.

In Table , we present norms of some noisy observations and corresponding source function differences.

Table 4. Some noisy observations and source function differences.

Display Table

As seen from Table , while noise levels decrease, the corresponding source function differences also decrease. Hence, there is no need for regularization process.

Example 2 Consider the material occupying the domain bounded by the lines $y = \mp \frac{1}{\sqrt{2}} x$ and $y = \mp 1$ .

On the time interval $t \in (0, 2]$ , we have the problem (4.11) $\begin{aligned} \frac{\partial^{2} u}{\partial t^{2}} - \nabla \cdot ((x + y) \nabla u) + 2 u = e^{\frac{t}{2}} f (x), (x, t) \in Q_{2} \end{aligned}$ (4.11) (4.12) $\begin{aligned} u (x, 0) = x^{2} - 2 y^{2}, u_{t} (x, 0) = \frac{1}{2} (x^{2} - 2 y^{2}), x \in Ω \end{aligned}$ (4.12) (4.13) $\begin{aligned} u (x, t) = {\begin{array}{cc} 0 & on y = \mp \frac{1}{\sqrt{2}} x \\ e^{\frac{t}{2}} (x^{2} - 2) & on y = \mp 1 \end{array}, t \in (0, 2] . \end{aligned}$ (4.13)

and want to identify the $f (x)$ source function from the final time state information (4.14) $w (x) = u (x, 2) = e^{1} (x^{2} - 2 y^{2}) .$ (4.14) The exact source function is $f (x) = 0.25 x^{2} + 5.5 y^{2}$ .

Firstly, let us compute the solution of direct problem $f (x) \to u (x, 2)$ by finite element method.

In the following, there are the figures of domain and a finite element mesh for the problem (Figure ).

Figure 9. The domain and finite element mesh for $h = 0.15$ .

The numbers of elements and nodes are 292 and 671, respectively, for the mesh size $h = 0.15$ .

The exact solution $u (x, 2)$ and the calculated solution of the finite element method $u^{h} (x, 2)$ by (2.5) are presented in Figures .

Figure 10. Exact wave function at $T = 2$ .

Figure 11. Calculated wave function at $T = 2$ .

Figure 12. Error values for $h = 0.15$ _.

Figure 13. The difference between exact and 10% noisy observation function.

Figure 14. The difference between source functions corresponding to the exact and 10% noisy observation function.

Figure 15. The difference between the exact and 2% noisy observation function.

Figure 16. The difference between source functions corresponding to the exact and 2% noisy observation function.

Table 5. Some $h$ and corresponding error norm values.

Display Table

Secondly, we deal with the solution of inverse identification problem $w (x) \to f (x)$ by nonlinear least-squares optimization techniques.

From a given final time state function $w (x) = e^{1} (x^{2} - 2 y^{2})$ , we can estimate that the source function may have the form (4.15) $f (x) = c_{1} x^{2} + c_{2} y^{2} .$ (4.15) On the other hand, the exact source function leading to a given final time state function is (4.16) $f_{exact} (x) = 0.25 x^{2} + 5.5 y^{2}$ (4.16) The minimization is carried out by the single-parameter functional (4.17) $F (c_{1}, c_{2}) = \underset{(c_{1}, c_{2})}{argmin} ‖ u^{0.15} (P_{j}, 2; f (P_{j})) - w (P_{j}) ‖^{2}$ (4.17) If minimization of this functional by the Levenberg–Marquardt method and trust-region method is carried out using MATLAB then the following outcomes are obtained.

For the Levenberg–Marquardt method, we get Table .

Table 6. Levenberg–Marquardt method.

Download CSV Display Table

Optimization is completed by the Levenberg–Marquardt method since the final change in the sum of squares relative to its initial value is less than the value of the function tolerance $(1 e - 06)$ . The found optimal parameter is $c_{Levenberg - Marquardt}^{*} = [0.246809936178489, 5.500964113399660]$ . Hence, the identified source function is (4.18) $f_{Levenberg - Marquardt}^{*} (x) = 0.246809936178489 x^{2} + 5.500964113399660 y^{2}$ (4.18) The residual norm given by (4.16) for this function is $F^{*} = 3.381226871464835 e - 05$ .

For the trust-region method, we get Table .

Table 7. Trust-region method.

Download CSV Display Table

Optimization is completed by the trust-region method since the final change in the sum of squares relative to its initial value is less than the value of the function tolerance $(1 e - 06)$ . The found optimal parameter is $c_{trust - region}^{*} = [0.246811657528541, 5.500963848795268]$ . Hence, the identified source function is (4.19) $f_{trust - region}^{*} (x) = 0.246811657528541 x^{2} + 5.500963848795268 y^{2}$ (4.19) The residual norm given by (4.6) for this function is $F^{*} = 3.381227400848656 e - 05$ .

The difference between the optimal parameter values of two methods is $‖ c_{Levenberg - Marquardt}^{*} - c_{trust - region}^{*} ‖ = 1.741568 e - 06.$ As seen, the parameters of these methods are too close to each other. If we take the optimal parameter as $c_{}^{*} ≅ [0.24681, 5.50096]$ then we get the following error norm (4.20) $‖ f_{exact} (x) - f_{}^{*} (x) ‖ = 0.0398$ (4.20) As seen in this example, the trust-region method was able to calculate the result with the same success using less iteration.

Now, we generate random noisy data $w_{noisy} (P_{j})$ by (4.10) to decide the regularization process is needed or not (Figures ).

Firstly, we generate random noisy data with the percentage of 10% in the observation function and measure the difference between the source functions caused by this noise.

The optimal parameters accompanying with noisy source function are $c_{trust - region}^{* (noisy)} = [1.034134676584133, 5.123458167646760]$ and $c_{Levenberg - Marquardt}^{* (noisy)} = [1.01697568379749 3, 5.127563889242101]$ .

The 10% noise causes the $\frac{‖ w_{noisy} (P_{j}) - w (P_{j}) ‖}{‖ w (P_{j}) ‖} = 0.0930$ norm between observation functions and $\frac{‖ f^{*} (x; w (P_{j})) - f^{*} (x; w_{noisy} (P_{j})) ‖}{‖ f^{*} (x; w (P_{j})) ‖} = 0.0986$ norm between source functions (for the trust-region method).

Secondly, we generate random noisy data with the percentage of 2% in the observation function and measure the difference between the source functions caused by this noise.

The optimal parameters accompanying with noisy source function are $c_{trust - region}^{* (noisy)} = [- 0.006929996321541, 5.548445550509578]$ and $c_{Levenberg - Marquardt}^{* (noisy)} = [- 0.00647253387 5535, 5.548249867102941]$ .

The 2% noise causes the $\frac{‖ w_{noisy} (P_{j}) - w (P_{j}) ‖}{‖ w (P_{j}) ‖} = 0.0194$ norm between observation functions and $\frac{‖ f^{*} (x; w (P_{j})) - f^{*} (x; w_{noisy} (P_{j})) ‖}{‖ f^{*} (x; w (P_{j})) ‖} = 0.0387$ norm between source functions (for the trust-region method).

In Table , we present norms of some noisy observations and corresponding source function differences.

Table 8. Some noisy observations and source function differences

Display Table

As seen from Table , while noise levels decrease, the corresponding source function differences also decrease. Hence, there is no need for regularization process.

5. Conclusions

The trust-region and Levenberg–Marquardt methods which are nonlinear least-squares optimization methods are powerful tools for identification problems. The methods known to give successful results in one-dimensional problems have also yielded successful results in two-dimensional problems. The fact that the regularization operations are carried out within the methods eliminates the need for adding an extra regularization term to the objective function. This situation can be seen from Tables and .

Although the difference between two methods is not much in the first example with one identified parameter, in the second example where two identified parameters are included, the trust-region method yielded less iterations than the Levenberg–Marquardt method, in compatible with general predictions.

Acknowledgements

We are very grateful to the reviewers for their comments and suggestions.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

Aki K, Richards PG. Quantitative seismology theory and methods. New York: Freeman; 1980.
Google Scholar
Evans LC. Partial differential equations, 2nd edn. Graduate studies in mathematics, vol. 19. New York: American Mathematical Society; 2002.
Google Scholar
Kuliev GF. Problem of optimal control of the coefficients for hyperbolic equations. Izvestiya Vysshikh Uchebnykh Zavedenii. Matematika. 1985;3:39–44.
Google Scholar
Feng X, Sutton B, Leuhart S, et al. Identification problem for the wave equation with Neumann data input and Dirichlet data abservating. Nonlinear Anal. 2003;52(7):1777–1795.
Google Scholar
Maciąg A. The usage of wave polynomials in solving direct and inverse problems for two-dimensional wave equation. Int J Numer Method Biomed Eng. 2011;27(7):1107–1125.
Google Scholar
Tagiyev RK. On optimal control of the hyperbolic equation coefficients. Autom Remote Control. 2012;73(7):1145–1155.
Google Scholar
Hasanov A, Mukanova B. Relationship between representation formulas for unique regularized solutions of inverse source problems with final overdetermination and singular value decomposition of input–output operators. IMA J Appl Math. 2015;80:676–696.
Google Scholar
Subaşı M, Kaçar A. A variational technique for optimal boundary control in a hyperbolic problem. Appl Math Comput. 2012;218:6629–6636.
Google Scholar
Deiveegan A, Prakash P, Nieto JJ. Optimization method for identifying the source term in an inverse Wave equation. Electron J Diff Equat. 2017;2017(200):1–15.
Google Scholar
Subaşı M, Araz S. Numerical regularization of optimal control for the coefficient function in a wave equation. Iran J Sci Technol Trans A Sci. 2019;43:2325–2333.
Google Scholar
Eng HW, Scherzer O, Yamamato M. Uniqueness and stable determination of forcing terms in linear partial differential equations with overspecified boundary data. Inverse Probl. 1994;10:1253–1276.
Google Scholar
Yamamato M. On ill-posedness and a Tikhonov regularization for a multidimensional inverse hyperbolic problem. J Math Kyoto Univ. 1996;36:825–856.
Google Scholar
Cheng J, Yamamato M. One new strategy for a priori choice of regularization parameters in Tikhonov’s regularization. Inverse Probl. 2000;16:L31–L38.
Google Scholar
Kabanikhin SI, Satybaev AD, Shishlenin MA. Direct methods of solving multidimensional inverse hyperbolic problems. Utrecht: VSP Science Press; 2005.
Google Scholar
Larson MG, Bengzon F. The finite element method: theory, implementation, and applications. Berlin: Springer; 2010.
Google Scholar
Levenberg K. A method for the solution of certain nonlinear problems in least squares. Qart Appl Math. 1944;2:164–166.
Google Scholar
Marquardt DW. An algorithm for least-squares estimation of nonlinear inequalities. SIAM J Appl Math. 1963;11:431–441.
Google Scholar
Goldfeld SM, Quandt RE, Trotter HF. Maximization by quadratic hill-climbing. Econometrica. 1966;34(3):541–551.
Google Scholar
Sorensen DC. Newton's method with a model trust region modification. SIAM J Numer Anal. 1982;19(2):409–426.
Google Scholar
Conn AR, Gould NIM, Toint PL. Trust-region methods. Philadelphia: Society for Industrial and Applied Mathematics; 2000.
Google Scholar
Madsen K, Nielsen HB, Tingleff O. Methods for non-linear least squares problems, informatics and mathematical modelling. Kopenhag: Technical University of Denmark; 2004.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

An inverse source identification by nonlinear optimization in a two-dimensional hyperbolic problem

Abstract

1. Introduction

2. Weak formulation and finite element method for direct problem

3. Solution of inverse identification problem

4. Numerical illustrations