Search in:

Inverse Problems in Science and Engineering Volume 29, 2021 - Issue 5

Submit an article Journal homepage

Free access

406

Views

CrossRef citations to date

Altmetric

Listen

Research Article

Chebyshev pseudospectral method in the reconstruction of orthotropic conductivity

Everton Boosa Department of Mathematics, Federal University of Santa Catarina, Florianópolis, BrazilCorrespondence[email protected]
View further author information

Vanda M. Luchesia Department of Mathematics, Federal University of Santa Catarina, Florianópolis, Brazil;b Institute of Exact and Natural Sciences of Pontal, Federal University of Uberlândia, Ituiutaba, BrazilCorrespondence[email protected]
View further author information

Fermín S. V. Bazána Department of Mathematics, Federal University of Santa Catarina, Florianópolis, BrazilCorrespondence[email protected]
View further author information

Pages 681-711 | Received 20 Sep 2019, Accepted 13 Jul 2020, Published online: 11 Aug 2020

Cite this article
https://doi.org/10.1080/17415977.2020.1801675
CrossMark

In this article

1. Introduction
2. Chebyshev pseudospectral method for forward problem – isotropic case
3. Numerical method for the inverse problem – isotropic case
4. Numerical method for the inverse problem – orthotropic case
5. Conclusion
Disclosure statement
Additional information
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

In this paper, we present a method to reconstruct the spatially varying conductivity tensor in isotropic and orthotropic materials, involved in a two-dimensional transient anisotropic model with Robin boundary conditions. For the reconstruction, the partial differential equation is solved by a semi-discrete method that combines a pseudospectral collocation method for spatial variables and Crank–Nicolson for time. The conductivity tensor is reconstructed through a non-linear least-squares problem solved by Levenberg–Marquardt method (LMM), along with Morozov's discrepancy principle as stopping rule to cope with noise in the data. Unlike classic LMM implementations that mitigate poor conditioning in calculating iterates using nonsingular diagonal scaling matrices, in this paper, singular regularization matrices are used. The impact of such a modification is illustrated with numerical experiments using discrete differential operators as scaling matrices. Numerical results show that accurate conductivity values can be obtained using a fairly small number of discretization points at a very low computational cost.

Keywords:

Orthotropic conductivity reconstruction
Chebyshev pseudospectral method
Levenberg–Marquardt method
Morozov's discrepancy principle
inverse problem

2010 Mathematics Subject Classifications:

65M70
65M32

1. Introduction

Materials whose physical properties, such as elasticity moduli, Poisson coefficients, heat conductivity, etc., vary depending on spatial orientation of the physical body are referred to as anisotropic, while those materials that do not change with spatial orientation are referred to as isotropic [Citation1]. Orthotropic material is a type of the anisotropic material whose characteristics remain unchanged along its planes of elastic symmetry. In nature, there are many materials that can be considered anisotropic such as crystal, woods, geological sediments and biological tissues. With the advent of new technologies, new anisotropic materials have been manufactured by industrial engineering, making it necessary to know their driving properties. These properties can be roughly defined by the difference in physical material or mechanical attributes when measured along different axes, such as absorbance, refraction, conductivity, tensile strength, etc.

In this paper, we study anisotropy in a two-dimensional transient conduction problem, with spatially varying conductivity tensor, aiming at a practical method for conductivity estimation based on experimental transient data.

Consider a two-dimensional anisotropic transport problem in the finite domain $Γ \times (0, \bar{t}]$ , $\bar{t} > 0$ , with $Γ = (0, 1) \times (0, 1) \subseteq R^{2}$ , where $R$ denotes the set of real numbers, described by the partial differential equation (1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) constrained to boundary conditions (2) $\begin{aligned} - (K \nabla T) \cdot n = f_{1} (y, t), & on x = 0, y \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (2) (3) $\begin{aligned} - (K \nabla T) \cdot n + h_{2} (y) T (1, y, t) = f_{2} (y, t), & on x = 1, y \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (3) (4) $\begin{aligned} - (K \nabla T) \cdot n = f_{3} (x, t), & on y = 0, x \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (4) (5) $\begin{aligned} - (K \nabla T) \cdot n + h_{4} (x) T (x, 1, t) = f_{4} (x, t), & on y = 1, x \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (5) and initial condition (6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) In this model, $T (x, y, t)$ stands for temperature, C>0 is the heat capacity, K is the conductivity matrix, (7) $K (x, y) = [\begin{matrix} k_{11} (x, y) & k_{12} (x, y) \\ k_{21} (x, y) & k_{22} (x, y) \end{matrix}]$ (7) and g is a source term. In addition, $h_{i}$ , i = 2, 4, are heat transfer functions, $f_{i}$ , $i = 1, \dots, 4$ , are heat flux functions, and $n = [n_{x} n_{y}]^{T}$ is the outward unit normal vector to the boundary of Γ. Here and elsewhere superscript $T$ stands for transpose of a vector or a matrix. For future development and to be clear with the notation, observe that $- (K \nabla T) \cdot n = - (k_{11} \frac{\partial T}{\partial x} + k_{12} \frac{\partial T}{\partial y}) n_{x} - (k_{21} \frac{\partial T}{\partial x} + k_{22} \frac{\partial T}{\partial y}) n_{y} .$ The conductivity K is assumed continuous and positive definite matrix, which means $k_{11} > 0$ and $k_{11} k_{22} - k_{12} k_{21} > 0$ . To classify the problem, K is said to be isotropic ( $k_{11} = k_{22}, k_{12} = k_{21} = 0$ ), orthotropic ( $k_{11} \neq k_{22}, k_{12} = k_{21} = 0$ ) or anisotropic ( $k_{11} \neq k_{22}, k_{12} \neq 0$ , $k_{21} \neq 0$ ). We note in passing that model (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) is also encountered in other applications areas in which K does not necessarily stand for thermal conductivity and where $T (x, y, t)$ describes transient data of other nature as, e.g. electrostatic potential [Citation2–4]. Irrespective of the application under consideration, several problems associated with model (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) are found in literature and we are concerned with two of them. The first one, referred to as forward or direct problem, devoted to determine the function $T (x, y, t)$ satisfying (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) where the remaining parameters are regarded as input data, and the second one referred to as an inverse problem, where the conductivity K is regarded as unknown and has to be estimated from measured temperature data. In this regard, it is known that, while the forward problem is governed by a linear elliptic well-posed operator, this is not the case with the inverse problem, which is non-linear and is ill-posed [Citation5, Chapter 9]. In fact, it is not obvious whether the conductivity K is (uniquely) determined by the data T. However, it is known that the uniqueness of the solution is only valid in some special cases [Citation5,Citation6]. For example, in the isotropic case, we quote the recovery of a discontinuous conductivity in a multidimensional problem with null source and homogeneous boundary conditions [Citation7], the identification problem of aquifer parameters [Citation8], and a 1D identification problem under certain Hölder smoothness conditions [Citation9]; for the orthotropic case, we quote the recovery of thermal conductivity from heat flux measurements [Citation10], whereas for the anisotropic case, we highlight the conductivity reconstruction problem on the homogeneous steady-state case under the constraint that the conductor is known up to an unknown multiple scalar [Citation11]. Anyway, even if the reconstruction problem allows for a (unique) solution, as the problem is ill-posed, small data errors can result in major disturbances in the solution, i.e. the solution does not depend continuously on the data.

Many studies to identify anisotropic conductivity from experimental data have been developed and reported in the literature [Citation12–14]. However, the limited availability of observational measures as well as the indirect nature of the estimation itself makes the problem solution difficult and thus more effective inverse solution techniques should be utilized. It is noteworthy that, in particular for the conduction problem addressed in this paper, Equations (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ), closed form solutions exist only for the orthotropic conductivity case, which does not happen for the anisotropic case [Citation15]. On the other hand, the inverse problem of reconstructing an anisotropic conductivity based on the steady-state assumption, in general, is not unique, even knowing the complete heat/current flux at the boundary [Citation11]. A similar observation holds true for the isotropic case, where conductivity is uniquely identified when the full Dirichlet-to-Neumann boundary map is known [Citation16], which is not always available in practice. To overcome this difficulty, other schemes were hence developed, including more information on the problem such as evaluations of temperature/potential inside the solution domain. As an example, Mera et al. [Citation17] applied the boundary element method (BEM) to solve the Cauchy steady-state heat conduction problem in an anisotropic medium. Other BEM-like techniques may be found in [Citation18,Citation19]. Recently, in connection with a time-dependent model, a finite difference method (FDM) has been employed together with a regularized non-linear minimization process to determine the orthotropic inhomogeneous tensor [Citation20]. In this case, numeric experiments were accomplished using the MATLAB lsqnonlin toolbox routine. In a similar flavour, a method for reconstructing orthotropic thermal conductivity based on the combination of FDM based on an alternating-direction-implicit scheme for solving the direct problem and the conjugate gradient method (CGM) as optimization tool, has been recently reported in [Citation21]. The novelty here is that the optimization problem is formulated in the $L_{2}$ infinite-dimensional environment, which is interesting, but can be expensive due to the calculation of the involved $L_{2}$ -norm.

As mentioned above, mathematically equivalent applications for the conduction problem include reconstruction of electrical conductivity. For instance, in medical imaging applications, the numerical reconstructions of anisotropic conductivity arises in electrical impedance tomography [Citation2–4,Citation22]. Several techniques for computing anisotropic conductivity in this context have been used. In particular, in thermal tomography [Citation23], where the coefficient K is characterized by a finite number of parameters in a high-dimensional subset of $R^{p}$ referred to as parameter domain, the direct problem is solved by coupling the finite element methods (FEM) in the spatial domain and a semi-implicit Euler method in the time interval. On its turn, the inverse problem is formulated as a non-linear least squares problem coupled with Tikhonov regularization. In contrast, in [Citation24], the parameter identification problem is modelled as a variational problem over stochastic Sobolev spaces, where a spectral approximation of the observation field is used to estimate the solution problem using a finite noisy model. In resume, most of these techniques approach the partial differential equation given by (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) ), with Neumann, Dirichlet or mixed boundary conditions, by using one or more numerical methods, then an objective function is formulated and an optimization technique is chosen to find an optimal parameter.

The Chebyshev pseudospectral method (CPM), also referred to as orthogonal collocation, is a method for computing approximate solutions of ordinary or partial differential equations. It differs from other methods such as finite difference methods in that the solution is represented as a continuous or piecewise continuous function [Citation25]. Further, compared to other methods such as BEM or FEM, CPM is known to produce highly accurate solutions at lower computation cost, see, e.g. [Citation26–31]. The purpose of this paper is to offer an alternative method to reconstruct the orthotropic conductivity tensor, which is simple to implement and generalize to the anisotropic case, and relatively inexpensive when compared to existing ones. The main idea behind the method is to combine an efficient solver for the direct problem and the acknowledged value of the Levenberg–Marquardt method in solving non-linear last squares methods. Specifically, the solver involves a spatial discretization of the conduction problem (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ), through Chebyshev pseudospectral method and the solution of the resulting semi-discrete model using the trapezoidal rule in time. The inverse conduction problem is solved through Levenberg–Marquardt method based on a version of Yamashita and Fukushima [Citation32], along with Morozov's discrepancy principle [Citation33] as stopping rule to cope with noisy data. In this respect, as opposed to classic LMM implementations that mitigate poor conditioning in calculating iterates using nonsingular diagonal scaling matrices, in this paper singular regularization matrices are used. This marks a significant difference from the version of LMM implemented in lsqnonlin used in [Citation20]. In addition, since the chosen regularization matrices correspond to discrete versions of differential operators, they produce, without additional cost, a smoothing effect in the computed solution similar to that produced by the Sobolev gradient used in [Citation21] (at the expense of additional work).

This paper is organized as follows. In Section 2, we consider the conduction model (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) in the isotropic case, with the conductivity as input data, and derive a semi-discrete model for the forward problem using CPM in order to compute function values T (temperature or electrostatic potential). Then, in Section 3, we describe and analyse a strategy to solve the inverse problem of finding conductivity values based on measured values of T at a few time steps, including in particular, a method for computing Jacobians as well as a precise description of LMM with singular regularization matrices instead of scaling diagonal matrices. Numerical results are also included. Section 4 follows the same organization as Section 3 but now in connection with the orthotropic scenario. We finish the paper with conclusions and references.

2. Chebyshev pseudospectral method for forward problem – isotropic case

In this section, we will describe how to solve (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) with the Chebyshev pseudospectral method. This method has become popular in solving ordinary or partial differential equations (PDEs) due to its high accuracy and lower computational cost compared with finite difference or finite element-based methods and has been successfully applied in solving forward and inverse heat conduction problems, see, e.g. [Citation27–29]. The construction of highly accurate solutions for PDEs lies on the fact that spatial derivatives can be approximated to high precision by means of the Chebyshev differentiation matrix.

Roughly speaking, pseudospectral methods construct approximate solutions in a space of algebraic polynomials of lower degree so that the differential equation being solved is satisfied in a specified number of points referred to as collocation points. For linear stationary and unidimensional problems on a bounded interval, for instance, if the collocation points are denoted by $x_{j}$ , $j = 0, \dots, n$ , and if the approximate solution is denoted by $u_{n}$ and expressed as (8) $u_{n} (x) = \sum_{k = 0}^{n} α_{k} ℓ_{k} (x),$ (8) where $α_{k} ≐ u_{n} (x_{k}), k = 0, \dots, n$ , and $ℓ_{k} (x)$ denotes the related Lagrange polynomial, then the coefficients $α_{j}$ become the unknowns of a linear system obtained by discretizing the differential equation in such a way that derivatives of the exact solution are estimated at the collocation points by differentiating (Equation8(8) $u_{n} (x) = \sum_{k = 0}^{n} α_{k} ℓ_{k} (x),$ (8) ) and evaluating the result at $x_{j}$ . As a matter of fact, note that derivative values $u_{n}^{'} (x_{j})$ , $u_{n}^{'} (x_{j}) = \sum_{k = 0}^{n} α_{k} ℓ_{k}^{'} (x_{j}), j = 0, \dots n,$ can be expressed in matrix form as (9) $u_{n}^{(1)} = D u_{n},$ (9) with $u_{n}^{(1)} = [u_{n}^{'} (x_{0}), \dots, u_{n}^{'} (x_{n})]^{T}, u_{n} = [u_{n} (x_{0}), \dots, u_{n} (x_{n})]^{T},$ where D denotes the $(n + 1) \times (n + 1)$ differentiation matrix whose entries are (10) $D_{j, k} = ℓ_{k}^{'} (x_{j}), j = 0, \dots, n, k = 0, \dots, n .$ (10) A consequence of (Equation9(9) $u_{n}^{(1)} = D u_{n},$ (9) ) is that if $u_{n} (x)$ interpolates $u (x)$ at the collocation points, derivative values $u^{'} (x_{j})$ can be approximated as $u^{'} (x_{j}) \approx u_{n}^{'} (x_{j})$ [Citation30], or in vector form, with abuse of notation, as (11) $[\begin{matrix} u^{'} (x_{0}) \\ ⋮ \\ u^{'} (x_{n}) \end{matrix}] \approx u_{n}^{(1)} = D [\begin{matrix} u (x_{0}) \\ ⋮ \\ u (x_{n}) \end{matrix}] .$ (11) The Chebyshev pseudospectral method takes as collocation points the Chebyshev–Gauss–Lobatto points, (12) $x_{j} = \cos (\frac{π j}{n}), j = 0, \dots, n,$ (12) and for this case, explicit formulas for $D_{j, k}$ can be found in several references, see e.g. [Citation30]. Without getting into any details, we note that for the Chebyshev differentiation matrix the following $L_{2}$ discrete estimate for $u^{'}$ at the collocation points is known to hold [Citation34, Chapter 9, estimate 9.5.22] (13) $‖ u^{'} - u_{n}^{'} ‖ = O (n^{2 - m})$ (13) where m denotes the regularity (smoothness) of u. This result implies that (Equation11(11) $[\begin{matrix} u^{'} (x_{0}) \\ ⋮ \\ u^{'} (x_{n}) \end{matrix}] \approx u_{n}^{(1)} = D [\begin{matrix} u (x_{0}) \\ ⋮ \\ u (x_{n}) \end{matrix}] .$ (11) ) can lead to exponential order of convergence as far as u is sufficiently smooth. In words, the smoother the function u, the smaller the approximation error. We will end this very short explanation on pseudospectral methods by noting that multidimensional problems are treated similarly as above. For 2D cases for instance, if for the sake of simplicity we use the same number of collocation points, n + 1, in the x and y directions, then the Chebyshev pseudospectral method looks as approximate solution for a polynomial $u_{n} (x, y)$ , $u_{n} (x, y) = \sum_{j = 0}^{n} \sum_{k = 0}^{n} u_{n} (x_{j}, y_{k}) ℓ_{j} (x) ℓ_{k} (y),$ where $u_{n} (x_{j}, y_{j})$ are unknowns to be determined by requiring that the given equation (and boundary condition) is satisfied at the grid points $(x_{j}, y_{k})$ . For PDEs with time and space variables, on the other hand, the construction of the approximate solution is as above but with the sole difference that the problem is now transformed into a time-dependent system of ordinary differential equations for which several methods exist.

Returning to the problem we are concerned with, we will first focus on the isotropic case to describe more clearly how CPM works. As mentioned above this case means $k_{11} = k_{22} =: k$ and $k_{12} = k_{21} = 0$ , which simplifies the numerical analysis. Therefore we can rewrite (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) ) as (14) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \frac{\partial}{\partial x} (k (x, y) \frac{\partial T}{\partial x} (x, y, t)) + \frac{\partial}{\partial y} (k (x, y) \frac{\partial T}{\partial y} (x, y, t)) + g (x, y, t) .$ (14) For simplicity, we shall consider a mesh consisting of $(n + 1) \times (n + 1)$ points on the reference domain Γ based on $(n + 1)$ Chebyshev–Gauss–Lobatto points in the horizontal and vertical directions, (15) $x_{i} = \frac{1}{2} [1 - \cos (\frac{i π}{n})], i = 0, \dots, n, y_{j} = \frac{1}{2} [1 - \cos (\frac{j π}{n})], j = 0, \dots, n,$ (15) and assume that grid points are numbered in lexicographic ordering. Assume also that the Chebyshev differentiation matrix is expressed in row-wise (resp. column-wise) form, (16) $D = [\begin{matrix} r_{0}^{T} \\ ⋮ \\ r_{n}^{T} \end{matrix}] = [d_{0}, \dots, d_{n}], r_{i}, d_{i} \in R^{n + 1}, i = 0, \dots, n .$ (16) The next step is to approximate spatial derivatives in (Equation14(14) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \frac{\partial}{\partial x} (k (x, y) \frac{\partial T}{\partial x} (x, y, t)) + \frac{\partial}{\partial y} (k (x, y) \frac{\partial T}{\partial y} (x, y, t)) + g (x, y, t) .$ (14) ) by using matrix D. In fact, along the horizontal direction, we have (17) $\frac{\partial}{\partial x} (k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t)) \approx e_{i + 1}^{T} D K_{j}^{x} {\dot{T}}_{j}, i = 0, \dots, n, j = 0, 1, \dots, n,$ (17) where $e_{i}$ , denotes the i-th canonical vector in $R^{n + 1}$ , $K_{j}^{x}$ is the diagonal matrix (18) $K_{j}^{x} = diag (k (x_{0}, y_{j}), k (x_{1}, y_{j}), \dots, k (x_{n}, y_{j})), j = 0, 1, \dots, n,$ (18) and ${\dot{T}}_{j}$ is the column vector (19) ${\dot{T}}_{j} = {[\frac{\partial T}{\partial x} (x_{0}, y_{j}, t), \frac{\partial T}{\partial x} (x_{1}, y_{j}, t), \dots, \frac{\partial T}{\partial x} (x_{n}, y_{j}, t)]}^{T}, j = 0, 1, \dots, n .$ (19) Clearly, provided the functions k and T are sufficiently regular, the order of approximation of (Equation17(17) $\frac{\partial}{\partial x} (k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t)) \approx e_{i + 1}^{T} D K_{j}^{x} {\dot{T}}_{j}, i = 0, \dots, n, j = 0, 1, \dots, n,$ (17) ) is analogous to that given in (Equation13(13) $‖ u^{'} - u_{n}^{'} ‖ = O (n^{2 - m})$ (13) ). Now let $T_{j}^{x}$ be defined by (20) $T_{j}^{x} = [\begin{matrix} \frac{\partial}{\partial x} (k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t)) \\ ⋮ \\ \frac{\partial}{\partial x} (k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t)) \end{matrix}], j = 0, 1, \dots, n .$ (20) With this notation, observe that the approximation (Equation17(17) $\frac{\partial}{\partial x} (k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t)) \approx e_{i + 1}^{T} D K_{j}^{x} {\dot{T}}_{j}, i = 0, \dots, n, j = 0, 1, \dots, n,$ (17) ) can be expressed in vector form as (21) $T_{j}^{x} \approx D K_{j}^{x} {\dot{T}}_{j} = \sum_{i = 0}^{n} d_{i} k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t),$ (21) and that, since the boundary conditions (Equation2(2) $\begin{aligned} - (K \nabla T) \cdot n = f_{1} (y, t), & on x = 0, y \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (2) )–(Equation3(3) $\begin{aligned} - (K \nabla T) \cdot n + h_{2} (y) T (1, y, t) = f_{2} (y, t), & on x = 1, y \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (3) ) and (Equation16(16) $D = [\begin{matrix} r_{0}^{T} \\ ⋮ \\ r_{n}^{T} \end{matrix}] = [d_{0}, \dots, d_{n}], r_{i}, d_{i} \in R^{n + 1}, i = 0, \dots, n .$ (16) ) imply (22) $\begin{aligned} \sum_{i = 0}^{n} d_{i} k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t) & = d_{0} f_{1} (y_{j}, t) + \sum_{i = 1}^{n - 1} d_{i} k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t) \\ + d_{n} h_{2} (y_{j}) T (x_{n}, y_{j}, t) - d_{n} f_{2} (y_{j}, t) \\ = [D_{1} {\overset{ˇ}{K}}_{j} D_{2} + h_{2} (y_{j}) d_{n} e_{n + 1}^{T}] T_{j} (t) \\ + d_{0} f_{1} (y_{j}, t) - d_{n} f_{2} (y_{j}, t), \end{aligned}$ (22) where (23) $\begin{aligned} T_{j} (t) & = [T (x_{0}, y_{j}, t), T (x_{1}, y_{j}, t), \dots, T (x_{n}, y_{j}, t)]^{T}, \end{aligned}$ (23) $\begin{aligned} D_{1} & = [d_{1}, \dots, d_{n - 1}], D_{2} = [\begin{matrix} r_{1}^{T} \\ ⋮ \\ r_{n - 1}^{T} \end{matrix}], {\overset{ˇ}{K}}_{j} = diag (k (x_{1}, y_{j}), \dots, k (x_{n - 1}, y_{j})), \end{aligned}$ we conclude that (24) $T_{j}^{x} \approx [D_{1} {\overset{ˇ}{K}}_{j} D_{2} + h_{2} (y_{j}) d_{n} e_{n + 1}^{T}] T_{j} + d_{0} f_{1} (y_{j}, t) - d_{n} f_{2} (y_{j}, t) .$ (24) Before proceeding, note that although $T_{j}$ is a vector function in $R^{n + 1}$ that depends on time, to simplify the notation, $T_{j} (t)$ will be denoted by $T_{j}$ . With this convention and based on (Equation24(24) $T_{j}^{x} \approx [D_{1} {\overset{ˇ}{K}}_{j} D_{2} + h_{2} (y_{j}) d_{n} e_{n + 1}^{T}] T_{j} + d_{0} f_{1} (y_{j}, t) - d_{n} f_{2} (y_{j}, t) .$ (24) ), discretization of $\frac{\partial}{\partial x} (k (x, y) \frac{\partial T}{\partial x} (x, y, t))$ along all grid points with $T (x_{i}, y_{j}, t)$ enumerated in lexicographic order leads to (25) $[\begin{matrix} T_{0}^{x} \\ T_{1}^{x} \\ ⋮ \\ T_{n}^{x} \end{matrix}] \approx (\begin{matrix} F_{0} \\ F_{1} \\ ⋱ \\ F_{n} \end{matrix}) [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + [\begin{matrix} d_{0} f_{1} (y_{0}, t) - d_{n} f_{2} (y_{0}, t) \\ d_{0} f_{1} (y_{1}, t) - d_{n} f_{2} (y_{1}, t) \\ ⋮ \\ d_{0} f_{1} (y_{n}, t) - d_{n} f_{2} (y_{n}, t) \end{matrix}],$ (25) where we have set $F_{j} = D_{1} {\overset{ˇ}{K}}_{j} D_{2} + h_{2} (y_{j}) d_{n} e_{n + 1}^{T}$ , $j = 0, 1, \dots, n .$

To discretize spatial derivatives with respect to y, let $T_{j}^{y}$ be defined as in (Equation20(20) $T_{j}^{x} = [\begin{matrix} \frac{\partial}{\partial x} (k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t)) \\ ⋮ \\ \frac{\partial}{\partial x} (k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t)) \end{matrix}], j = 0, 1, \dots, n .$ (20) ) with $\frac{\partial}{\partial x}$ replaced by $\frac{\partial}{\partial y}$ . Define also (26) ${\hat{T}}_{i}^{y} = [\begin{matrix} \frac{\partial}{\partial y} (k (x_{i}, y_{0}) \frac{\partial T}{\partial y} (x_{i}, y_{0})) \\ ⋮ \\ \frac{\partial}{\partial y} (k (x_{i}, y_{n}) \frac{\partial T}{\partial y} (x_{i}, y_{n})) \end{matrix}], i = 0, 1, \dots, n .$ (26) Then, using the respective spatial approximation with respect to y, analogous to (Equation17(17) $\frac{\partial}{\partial x} (k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t)) \approx e_{i + 1}^{T} D K_{j}^{x} {\dot{T}}_{j}, i = 0, \dots, n, j = 0, 1, \dots, n,$ (17) ), and boundary conditions (Equation4(4) $\begin{aligned} - (K \nabla T) \cdot n = f_{3} (x, t), & on y = 0, x \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (4) )–(Equation5(5) $\begin{aligned} - (K \nabla T) \cdot n + h_{4} (x) T (x, 1, t) = f_{4} (x, t), & on y = 1, x \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (5) ), an approximation result similar to (Equation25(25) $[\begin{matrix} T_{0}^{x} \\ T_{1}^{x} \\ ⋮ \\ T_{n}^{x} \end{matrix}] \approx (\begin{matrix} F_{0} \\ F_{1} \\ ⋱ \\ F_{n} \end{matrix}) [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + [\begin{matrix} d_{0} f_{1} (y_{0}, t) - d_{n} f_{2} (y_{0}, t) \\ d_{0} f_{1} (y_{1}, t) - d_{n} f_{2} (y_{1}, t) \\ ⋮ \\ d_{0} f_{1} (y_{n}, t) - d_{n} f_{2} (y_{n}, t) \end{matrix}],$ (25) ) is (27) $[\begin{matrix} {\hat{T}}_{0}^{y} \\ {\hat{T}}_{1}^{y} \\ ⋮ \\ {\hat{T}}_{n}^{y} \end{matrix}] \approx (\begin{array}{cccc} G_{0} \\ G_{1} \\ ⋱ \\ G_{n} \end{array}) [\begin{matrix} {\hat{T}}_{0} \\ {\hat{T}}_{1} \\ ⋮ \\ {\hat{T}}_{n} \end{matrix}] + [\begin{matrix} d_{0} f_{3} (x_{0}, t) - d_{n} f_{4} (x_{0}, t) \\ d_{0} f_{3} (x_{1}, t) - d_{n} f_{4} (x_{1}, t) \\ ⋮ \\ d_{0} f_{3} (x_{n}, t) - d_{n} f_{4} (x_{n}, t) \end{matrix}],$ (27) where $G_{i} = D_{1} {\hat{K}}_{i} D_{2} + h_{4} (x_{i}) d_{n} e_{n + 1}^{T},$ with ${\hat{K}}_{i} = diag (k (x_{i}, y_{1}), k (x_{i}, y_{2}), \dots, k (x_{i}, y_{n - 1})), i = 0, 1, \dots, n,$ and ${\hat{T}}_{i} = [T (x_{i}, y_{0}, t), T (x_{i}, y_{1}, t), \dots, T (x_{i}, y_{n}, t)]^{T}, i = 0, 1, \dots, n .$ Observe that the entries of ${\hat{T}}_{i}$ follow a different ordering as that described in (Equation23(23) $\begin{aligned} T_{j} (t) & = [T (x_{0}, y_{j}, t), T (x_{1}, y_{j}, t), \dots, T (x_{n}, y_{j}, t)]^{T}, \end{aligned}$ (23) ). To fix this, it suffices to observe that there exists a permutation matrix $P \in R^{(n + 1) \times (n + 1)}$ such that (28) $[\begin{matrix} {\hat{T}}_{0} \\ {\hat{T}}_{1} \\ ⋮ \\ {\hat{T}}_{n} \end{matrix}] = P^{T} [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] and [\begin{matrix} {\hat{T}}_{0}^{y} \\ {\hat{T}}_{1}^{y} \\ ⋮ \\ {\hat{T}}_{n}^{y} \end{matrix}] = P^{T} [\begin{matrix} T_{0}^{y} \\ T_{1}^{y} \\ ⋮ \\ T_{n}^{y} \end{matrix}] .$ (28) Recall that a permutation matrix is a matrix obtained by reordering the rows (or columns) of an identity matrix. Finally, mixing (Equation27(27) $[\begin{matrix} {\hat{T}}_{0}^{y} \\ {\hat{T}}_{1}^{y} \\ ⋮ \\ {\hat{T}}_{n}^{y} \end{matrix}] \approx (\begin{array}{cccc} G_{0} \\ G_{1} \\ ⋱ \\ G_{n} \end{array}) [\begin{matrix} {\hat{T}}_{0} \\ {\hat{T}}_{1} \\ ⋮ \\ {\hat{T}}_{n} \end{matrix}] + [\begin{matrix} d_{0} f_{3} (x_{0}, t) - d_{n} f_{4} (x_{0}, t) \\ d_{0} f_{3} (x_{1}, t) - d_{n} f_{4} (x_{1}, t) \\ ⋮ \\ d_{0} f_{3} (x_{n}, t) - d_{n} f_{4} (x_{n}, t) \end{matrix}],$ (27) ) and (Equation28(28) $[\begin{matrix} {\hat{T}}_{0} \\ {\hat{T}}_{1} \\ ⋮ \\ {\hat{T}}_{n} \end{matrix}] = P^{T} [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] and [\begin{matrix} {\hat{T}}_{0}^{y} \\ {\hat{T}}_{1}^{y} \\ ⋮ \\ {\hat{T}}_{n}^{y} \end{matrix}] = P^{T} [\begin{matrix} T_{0}^{y} \\ T_{1}^{y} \\ ⋮ \\ T_{n}^{y} \end{matrix}] .$ (28) ) along with the fact that $P P^{T}$ is the identity matrix as $P$ is an orthogonal matrix, we have (29) $[\begin{matrix} T_{0}^{y} \\ T_{1}^{y} \\ ⋮ \\ T_{n}^{y} \end{matrix}] \approx P (\begin{array}{cccc} G_{0} \\ G_{1} \\ ⋱ \\ G_{n} \end{array}) P^{T} [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + P [\begin{matrix} d_{0} f_{3} (x_{0}, t) - d_{n} f_{4} (x_{0}, t) \\ d_{0} f_{3} (x_{1}, t) - d_{n} f_{4} (x_{1}, t) \\ ⋮ \\ d_{0} f_{3} (x_{n}, t) - d_{n} f_{4} (x_{n}, t) \end{matrix}] .$ (29) To simplify the notation, here and throughout we use $T (x_{i}, y_{j}, t)$ to denote both the exact value and the corresponding approximation associated with the discretized problem. With this assumption, (Equation14(14) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \frac{\partial}{\partial x} (k (x, y) \frac{\partial T}{\partial x} (x, y, t)) + \frac{\partial}{\partial y} (k (x, y) \frac{\partial T}{\partial y} (x, y, t)) + g (x, y, t) .$ (14) ) together with (Equation25(25) $[\begin{matrix} T_{0}^{x} \\ T_{1}^{x} \\ ⋮ \\ T_{n}^{x} \end{matrix}] \approx (\begin{matrix} F_{0} \\ F_{1} \\ ⋱ \\ F_{n} \end{matrix}) [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + [\begin{matrix} d_{0} f_{1} (y_{0}, t) - d_{n} f_{2} (y_{0}, t) \\ d_{0} f_{1} (y_{1}, t) - d_{n} f_{2} (y_{1}, t) \\ ⋮ \\ d_{0} f_{1} (y_{n}, t) - d_{n} f_{2} (y_{n}, t) \end{matrix}],$ (25) ) and (Equation29(29) $[\begin{matrix} T_{0}^{y} \\ T_{1}^{y} \\ ⋮ \\ T_{n}^{y} \end{matrix}] \approx P (\begin{array}{cccc} G_{0} \\ G_{1} \\ ⋱ \\ G_{n} \end{array}) P^{T} [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + P [\begin{matrix} d_{0} f_{3} (x_{0}, t) - d_{n} f_{4} (x_{0}, t) \\ d_{0} f_{3} (x_{1}, t) - d_{n} f_{4} (x_{1}, t) \\ ⋮ \\ d_{0} f_{3} (x_{n}, t) - d_{n} f_{4} (x_{n}, t) \end{matrix}] .$ (29) ) yield the initial value problem (30) ${\begin{cases} C \frac{d T}{d t} = H (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (30) where $T^{0}$ has entries $T_{0} (x_{i}, y_{j})$ with $T_{0}$ defined in (Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) and grid points $(x_{i}, y_{j})$ being ordered in lexicographic form, (31) $C = (\begin{array}{cccc} C_{0} \\ C_{1} \\ ⋱ \\ C_{n} \end{array}), C_{j} = diag (C (x_{0}, y_{j}), \dots, C (x_{n}, y_{j})), j = 0, \dots, n,$ (31) and (32) $H (t, T) = [(\begin{array}{cccc} F_{0} \\ F_{1} \\ ⋱ \\ F_{n} \end{array}) + P (\begin{array}{cccc} G_{0} \\ G_{1} \\ ⋱ \\ G_{n} \end{array}) P^{T}] [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + S (t),$ (32) with $S (t)$ being used to accommodate terms that do not involve unknowns in (Equation25(25) $[\begin{matrix} T_{0}^{x} \\ T_{1}^{x} \\ ⋮ \\ T_{n}^{x} \end{matrix}] \approx (\begin{matrix} F_{0} \\ F_{1} \\ ⋱ \\ F_{n} \end{matrix}) [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + [\begin{matrix} d_{0} f_{1} (y_{0}, t) - d_{n} f_{2} (y_{0}, t) \\ d_{0} f_{1} (y_{1}, t) - d_{n} f_{2} (y_{1}, t) \\ ⋮ \\ d_{0} f_{1} (y_{n}, t) - d_{n} f_{2} (y_{n}, t) \end{matrix}],$ (25) )–(Equation29(29) $[\begin{matrix} T_{0}^{y} \\ T_{1}^{y} \\ ⋮ \\ T_{n}^{y} \end{matrix}] \approx P (\begin{array}{cccc} G_{0} \\ G_{1} \\ ⋱ \\ G_{n} \end{array}) P^{T} [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + P [\begin{matrix} d_{0} f_{3} (x_{0}, t) - d_{n} f_{4} (x_{0}, t) \\ d_{0} f_{3} (x_{1}, t) - d_{n} f_{4} (x_{1}, t) \\ ⋮ \\ d_{0} f_{3} (x_{n}, t) - d_{n} f_{4} (x_{n}, t) \end{matrix}] .$ (29) ) as well as source term $g (x, y, t)$ values along the grid. Clearly, $T ≐ T (t)$ is the vector formed by temperature values $T (x_{i}, y_{j}, t)$ with grid points enumerated in lexicographic form, i.e, $T = [\begin{matrix} T_{0} \\ ⋮ \\ T_{n} \end{matrix}] .$ The semidiscrete model (Equation30(30) ${\begin{cases} C \frac{d T}{d t} = H (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (30) ), and hence, its solution, does not depend on the conductivity k at the corners. Therefore, if we try to recover conductivity values from temperature values that satisfy (30), difficulties should certainly appear because in these points, there is no explicit connection between k and temperature values. An alternative semi-discrete model that circumvents the above difficulty can be described as follows. Choose positive numbers $α, β$ such that $α + β = 1$ and then split the first and last term in the sum (Equation22(22) $\begin{aligned} \sum_{i = 0}^{n} d_{i} k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t) & = d_{0} f_{1} (y_{j}, t) + \sum_{i = 1}^{n - 1} d_{i} k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t) \\ + d_{n} h_{2} (y_{j}) T (x_{n}, y_{j}, t) - d_{n} f_{2} (y_{j}, t) \\ = [D_{1} {\overset{ˇ}{K}}_{j} D_{2} + h_{2} (y_{j}) d_{n} e_{n + 1}^{T}] T_{j} (t) \\ + d_{0} f_{1} (y_{j}, t) - d_{n} f_{2} (y_{j}, t), \end{aligned}$ (22) ) into (33) $\begin{aligned} d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) & = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) + β d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) \\ = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) + β d_{0} f_{1} (y_{j}, t), \end{aligned}$ (33) (34) $\begin{aligned} d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) & = α d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) + β d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) \\ = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) \\ + β [h_{2} (y_{j}) T (x_{n}, y_{j}, t) d_{n} - f_{2} (y_{j}, t) d_{n}] . \end{aligned}$ (34) The above equalities come on using the boundary conditions (Equation2(2) $\begin{aligned} - (K \nabla T) \cdot n = f_{1} (y, t), & on x = 0, y \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (2) ) and (Equation3(3) $\begin{aligned} - (K \nabla T) \cdot n + h_{2} (y) T (1, y, t) = f_{2} (y, t), & on x = 1, y \in (0, 1), t \in (0, \bar{t}], \end{aligned}$ (3) ). Thanks to this splitting, Equation (Equation22(22) $\begin{aligned} \sum_{i = 0}^{n} d_{i} k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t) & = d_{0} f_{1} (y_{j}, t) + \sum_{i = 1}^{n - 1} d_{i} k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t) \\ + d_{n} h_{2} (y_{j}) T (x_{n}, y_{j}, t) - d_{n} f_{2} (y_{j}, t) \\ = [D_{1} {\overset{ˇ}{K}}_{j} D_{2} + h_{2} (y_{j}) d_{n} e_{n + 1}^{T}] T_{j} (t) \\ + d_{0} f_{1} (y_{j}, t) - d_{n} f_{2} (y_{j}, t), \end{aligned}$ (22) ) can be rewritten as (35) $\sum_{i = 0}^{n} d_{i} k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t) = [\hat{D} K_{j}^{x} D + β h_{2} (y_{j}) d_{n} e_{n + 1}^{T}] T_{j} + β [d_{0} f_{1} (y_{j}, t) - d_{n} f_{2} (y_{j}, t)],$ (35) where $\hat{D} = [α d_{0}, d_{1}, \dots, d_{n - 1}, α d_{n}]$ , and this serves as an alternative approximation for spatial derivatives with respect to x deduced in (Equation24(24) $T_{j}^{x} \approx [D_{1} {\overset{ˇ}{K}}_{j} D_{2} + h_{2} (y_{j}) d_{n} e_{n + 1}^{T}] T_{j} + d_{0} f_{1} (y_{j}, t) - d_{n} f_{2} (y_{j}, t) .$ (24) ). Proceeding this way, a similar expression for derivatives with respect to y can be deduced and then used to write down the following semi-discrete model (36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) where (37) $\hat{H} (t, T) = M T + \hat{S} (t),$ (37) (38) $M = (\begin{array}{cccc} {\hat{F}}_{0} \\ {\hat{F}}_{1} \\ ⋱ \\ {\hat{F}}_{n} \end{array}) + P (\begin{array}{cccc} {\hat{G}}_{0} \\ {\hat{G}}_{1} \\ ⋱ \\ {\hat{G}}_{n} \end{array}) P^{T},$ (38) with $\begin{array}{l} {\hat{F}}_{j} = [\hat{D} K_{j}^{x} D + β h_{2} (y_{j}) d_{n} e_{n + 1}^{T}], j = 0, 1, \dots n, \\ {\hat{G}}_{i} = [\hat{D} K_{i}^{y} D + β h_{4} (x_{i}) d_{n} e_{n + 1}^{T}], K_{i}^{y} = diag (k (x_{i}, y_{0}), \dots, k (x_{i}, y_{n})), i = 0, 1, \dots n . \end{array}$ As before, $\hat{S} (t)$ accommodates terms that do not include unknowns. Observe that by taking $α = 0$ , the Initial Value Problem (IVP) (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ) reduces to the IVP (Equation30(30) ${\begin{cases} C \frac{d T}{d t} = H (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (30) ).

When using a pseudospectral method to construct approximate solutions of PDEs, the error of the approximate solution in relation to the exact one is naturally of great interest. Theoretical error estimates regarding 1D and 2D problems are well documented in several places, e.g. Canuto et al. [Citation34] and Bernardi and Maday [Citation35]. In most cases, the behaviour of the error estimate is similar to that given in (Equation13(13) $‖ u^{'} - u_{n}^{'} ‖ = O (n^{2 - m})$ (13) ). It depends essentially on the smoothness of the solution, the rule of thumb being: the more the exact solution is smooth, the more the approximate solution is accurate. The rationale behind this is as follows. Since the spatial derivatives of smooth solutions are approximated to high precision, the solution of the associated semi-discrete problem should be close to that of the original problem.

Next step is to apply a numerical method for IVPs to (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ) such as explicit and implicit Euler methods, trapezoidal rule method (Crank–Nicolson method), Runge–Kutta class methods, or any other. Among them, due to its simplicity, second-order accuracy, and mainly because it is absolutely stable, we choose Crank–Nicolson (CN) method [Citation36]. CN method becomes relevant in our context since the inversion procedure, to be described later on, requires the semi-discrete model (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ) to be solved many times for distinct conductivity values and with guaranteed stability. Having said that, we now proceed to describe how CN method works when applied to (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ). In fact, for a positive integer N, we define the time step $Δ t = \bar{t} / N$ and discrete times $t_{q} = q Δ t$ , $q = 0, 1, \dots, N$ , so that we have a uniformly spaced mesh on $[0, \bar{t}]$ . Recalling that the solution of the semidiscrete model (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ) depends on t, $T ≐ T (t)$ , CN method generates approximate solutions $T^{q}$ to $T (t_{q})$ given by $C T^{(q + 1)} = C T^{(q)} + \frac{Δ t}{2} (M T^{(q)} + S (t_{q}) + M T^{(q + 1)} + S (t_{q + 1})), q = 0, 1, 2, \dots,$ where $T^{(0)} = T^{0}$ is the initial condition (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ). Rearranging terms, the CN method provides a fully discrete method characterized by being high-order accurate in space and second-order in time, i.e. temporal errors should dominate. It is given as (39) $M_{m} T^{(q + 1)} = M_{p} T^{(q)} + \frac{Δ t}{2} [S (t_{q}) + S (t_{q + 1})], q = 0, 1, 2, \dots,$ (39) where $M_{m} := C - \frac{Δ t}{2} M$ and $M_{p} := C + \frac{Δ t}{2} M$ . Consequently, to calculate approximate solutions at each time step, the system of linear equations (Equation39(39) $M_{m} T^{(q + 1)} = M_{p} T^{(q)} + \frac{Δ t}{2} [S (t_{q}) + S (t_{q + 1})], q = 0, 1, 2, \dots,$ (39) ) must be solved as efficiently as possible. One way to do this is to explore the sparse structure of the matrix $M_{m}$ to calculate its LU or QR factorization and then use the factors to solve the system. In this case, the factorization is calculated only once at the beginning of the CN iteration. The sparse structure of $M_{m}$ can be seen in Figure .

Figure 1. Sparse structure of matrix $M_{m}$ , for n + 1 = 16 grid points on both directions.

As an example to illustrate the effectiveness of the Chebyshev pseudospectral method in conjunction with Crank–Nicolson method, we consider a heat conduction model (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) on $Γ = (0, 1) \times (0, 1)$ , $0 \leq t \leq \bar{t} = 1$ , extracted from [Citation21], whose solution is given by (40) $T (x, y, t) = (1 - e^{- t}) (\sin (π x) \sin (π y) + (π + 1) (x + y)),$ (40) for input data $\begin{aligned} f_{1} (y, t) & = \frac{1 + y}{12} (1 - e^{- t}) (π \sin (π y) + π + 1), \\ f_{2} (y, t) & = - \frac{2 + y}{12} (1 - e^{- t}) (- π \sin (π y) + π + 1), \\ f_{3} (x, t) & = \frac{1 + x}{12} (1 - e^{- t}) (π \sin (π x) + π + 1), \\ f_{4} (x, t) & = - \frac{2 + x}{12} (1 - e^{- t}) (- π \sin (π x) + π + 1), \\ h_{2} (y) & = h_{4} (x) = 0, \end{aligned}$ source term $\begin{aligned} g (x, y, t) & = (e^{- t} + \frac{π^{2}}{6} (1 + x + y) (1 - e^{- t})) \sin (π x) \sin (π y) + \\ + (π + 1) e^{- t} (x + y) - \frac{1 - e^{- t}}{12} (π \sin (π (x + y)) + 2 π + 2), \end{aligned}$ capacity C = 1, and conductivity given by $k (x, y) = \frac{1 + x + y}{12} .$ For this example, we calculate approximate solutions as well as the associated errors with respect to the exact solution of (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ). CPM is implemented with n + 1 = 11 grid points in both horizontal and vertical directions, $α = 0$ and CN runs with time step $Δ t = 0.01$ . Numerical results for three time stages are shown in Figure . The accuracy of the approximate solutions is evident when we inspect the pointwise errors at the bottom of the figure.

Figure 2. Numerical solutions for three time stages (top) and associated pointwise absolute error (bottom).

To better evaluate the accuracy of the fully discrete method, we calculate normwise errors in the maximum norm defined by (41) $E_{q} = | | T^{(q)} - T (t_{q}) ‖_{\infty} ≐ max_{k} | [T^{(q)} - T (t_{q})]_{k} |,$ (41) where $T (t)$ has entries $T (x_{i}, y_{j}, t)$ with gridpoints $(x_{i}, y_{j})$ in lexicographic order. For $q = 10, q = 50$ and $q = 100,$ which correspond to $t = 0.1, t = 0.5$ and t = 1, respectively, we obtain $E_{10} = 5.9071 \times 10^{- 6}, E_{50} = 2.1342 \times 10^{- 5} and E_{100} = 3.0597 \times 10^{- 5} .$ The errors not only verify the theoretical accuracy predicted in theory, but also the potentiality of the proposed method of providing approximate solutions, with relatively high accuracy, using few grid points and therefore at low cost.

3. Numerical method for the inverse problem – isotropic case

Based on the solver for the direct problem described in the previous section, we will now introduce a method for solving the associated inverse problem, namely the conductivity estimation problem from measured transient data. For the numerical treatment of the inverse problem, unknowns values $k (x_{i}, y_{j})$ are arranged in a vector, say $k$ , and function values T (temperature or electrostatic potential) are regarded as functions of $k$ . Assuming that the unknowns are ordered like functions values, let (42) $ℓ = i + j (n + 1) + 1, i = 0, 1, \dots, n, j = 0, 1, \dots, n .$ (42) It is clear that for each unknown $k (x_{i}, y_{j})$ there is a unique entry $k_{ℓ}$ of $k$ so that the lexicographic ordering of the grid $(x_{i}, y_{j})$ , as illustrated in Figure , is preserved in the entries of $k$ . The ordering in $k_{ℓ}$ is taken into account when implementing the inverse problem approach to be described below. The objective of this section is to determine conductivity values $k$ from measured temperature values $\tilde{T} = T + e$ , where e denotes an unknown perturbation term and $T$ is a vector of exact temperature values on the grid at time steps $t_{q}, q = 1, \dots, N$ , (43) $T = [\begin{matrix} T (t_{1}) \\ ⋮ \\ T (t_{N}) \end{matrix}] \in R^{N (n + 1)^{2}},$ (43) where $T (t_{q}) \in R^{(n + 1)^{2}}$ has entries $T_{ℓ}^{(t_{q})} ≐ T (x_{i}, y_{j}, t_{q})$ with ℓ defined in (Equation42(42) $ℓ = i + j (n + 1) + 1, i = 0, 1, \dots, n, j = 0, 1, \dots, n .$ (42) ) and $(x_{i}, y_{j})$ following the lexicographic ordering. Similar observations apply to $\tilde{T}$ . We will assume that an estimate of the error e is available such that (44) $‖ \tilde{T} - T ‖ = ‖ e ‖ \leq δ,$ (44) though, of course, this may not be the case when dealing with reconstruction problems based on experimental measurements. If not, it is essential that the error $‖ e ‖$ be estimated either based on prior knowledge or by other means. In addition, observe that, under mild conditions, for each $k$ we can solve the direct problem (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ) to produce a related solution $T (k, t)$ which contains approximations to $T (x_{i}, y_{j}, t)$ on the grid, and hence, a vector $T (k)$ with entries $T_{ℓ}^{(t_{q})} (k)$ . With this observation in mind the conductivity estimation problem can be formulated as one of estimating a vector $k$ such that the difference between computed solutions $T (k)$ , and measured transient data $\tilde{T}$ is small enough in some sense. More precisely, the estimation problem can be formulated as the non-linear least squares problem of finding $k^{*}$ such that (45) $k^{*} = \underset{k \in R^{(n + 1)^{2}}}{argmin} ϕ (k), ϕ (k) = \frac{1}{2} ‖ T (k) - \tilde{T} ‖_{2}^{2} = \frac{1}{2} \sum_{j = 1}^{N} \sum_{ℓ = 1}^{(n + 1)^{2}} {(T_{ℓ}^{(t_{j})} (k) - {\tilde{T}}_{ℓ}^{(t_{j})})}^{2},$ (45) where $k = [k_{1}, \dots, k_{(n + 1)^{2}}]^{T}$ is the vector of unknowns. The non-linear problem (Equation45(45) $k^{*} = \underset{k \in R^{(n + 1)^{2}}}{argmin} ϕ (k), ϕ (k) = \frac{1}{2} ‖ T (k) - \tilde{T} ‖_{2}^{2} = \frac{1}{2} \sum_{j = 1}^{N} \sum_{ℓ = 1}^{(n + 1)^{2}} {(T_{ℓ}^{(t_{j})} (k) - {\tilde{T}}_{ℓ}^{(t_{j})})}^{2},$ (45) ) can be handled in a number of ways, e.g. by using Tikhonov regularization, trust region methods, nonmonotone line search methods, Levenberg–Marquardt method (LMM), or others, see, e.g. [Citation37–40], though clearly the presence of multiple minima cannot be ruled out.

Figure 3. Example of grid, for n = 4, enumerated in accordance with Equation (Equation42(42) $ℓ = i + j (n + 1) + 1, i = 0, 1, \dots, n, j = 0, 1, \dots, n .$ (42) ), i.e, lexicographic order.

Figure 3. Example of grid, for n = 4, enumerated in accordance with Equation (Equation42(42) ℓ=i+j(n+1)+1,i=0,1,…,n,j=0,1,…,n.(42) ), i.e, lexicographic order.

In this work, we choose to apply the Levenberg–Marquardt method (LMM), following a version described by Yamashita and Fukushima [Citation32] who consider LMM with line search and Armijo stepsize rule. To simplify the notation, we set $ϕ (k) = \frac{1}{2} ‖ F (k) ‖_{2}^{2}$ , and observe that the first-order necessary condition for minimization of φ, $\nabla ϕ (k) = 0,$ reads (46) ${F^{'}}^{T} (k) [T (k) - \tilde{T}] = 0,$ (46) where $F^{'} (k)$ denotes derivative of $F (k)$ . LMM minimizes iteratively $ϕ (k)$ (hence solves approximately (Equation46(46) ${F^{'}}^{T} (k) [T (k) - \tilde{T}] = 0,$ (46) )) by determining an estimate, say $\hat{k}$ , such that $‖ {F^{'}}^{T} (\hat{k}) [T (\hat{k}) - \tilde{T}] ‖_{2}$ is small enough. Roughly speaking, chosen an initial guess $k^{0}$ , the underlying idea of LMM is to determine a minimizer of $ϕ (k)$ iteratively by solving a sequence of linear problems of the form (47) $(F^{' T} (k^{j}) F^{'} (k^{j}) + μ^{j} Ω^{j}) d = - F^{' T} (k^{j}) F (k^{j}), j = 0, 1, \dots,$ (47) where $μ^{j}$ is the so-called damping parameter and $Ω^{j}$ is a diagonal matrix with positive entries. Yamashita and Fukushima choose $Ω^{j}$ as the identity matrix and the damping parameter $μ^{j}$ as the squared residual norm at iteration j. This version can be described as follows.

LMM with line search
Step 0: choose parameters $ν, η, γ \in (0, 1)$ and a initial guess $k^{0} \in R^{(n + 1)^{2}}$ . Set j = 0 and $μ^{0} = ‖ F (k^{0}) ‖_{2}^{2}$ .
Step 1: If $k^{j}$ satisfies some termination criterion, stop. Else, go to Step 2.
Step 2: Find the solution $d^{j}$ of the system of linear equations (Equation47(47) $(F^{' T} (k^{j}) F^{'} (k^{j}) + μ^{j} Ω^{j}) d = - F^{' T} (k^{j}) F (k^{j}), j = 0, 1, \dots,$ (47) ). If $d^{j}$ satisfies $‖ F (k^{j} + d^{j}) ‖_{2} \leq γ ‖ F (k^{j}) ‖_{2},$ then $k^{j + 1} = k^{j} + d^{j}$ and go to Step 4. Otherwise, go to Step 3.
Step 3: (Armijo's rule) Take m as the smallest nonnegative integer such that $ϕ (k^{j} + η^{m} d^{j}) - ϕ (k^{j}) \leq ν η^{m} \nabla ϕ (k^{j})^{T} d^{j}$ and set $k^{j + 1} = k^{j} + η^{m} d^{j}$ .
Step 4: Set $μ^{j + 1} = ‖ F (k^{j + 1}) ‖_{2}^{2}$ and j = j + 1. Go to Step 1.

As already mentioned, in this work $Ω^{j}$ in (Equation47(47) $(F^{' T} (k^{j}) F^{'} (k^{j}) + μ^{j} Ω^{j}) d = - F^{' T} (k^{j}) F (k^{j}), j = 0, 1, \dots,$ (47) ) is replaced by a product of the form $R^{T} R$ , in a way that $d^{j}$ is computed by solving the ‘scaled’ system (48) $(F^{' T} (k^{j}) F^{'} (k^{j}) + μ^{j} R^{T} R) d = - F^{' T} (k^{j}) F (k^{j}),$ (48) where $R$ is a 2D discrete differentiation operator referred to as regularization matrix, $R ≐ L_{i},$ (49) $L_{i} = [\begin{matrix} I_{n + 1} \otimes L_{i} (n + 1) \\ L_{i} (n + 1) \otimes I_{n + 1} \end{matrix}], i = 1, 2,$ (49) with matrices $L_{1} (m) \in R^{(m - 1) \times m}$ and $L_{2} (m) \in R^{(m - 2) \times m}$ given by $L_{1} (m) = [\begin{array}{cccc} - 1 & 1 \\ ⋱ & ⋱ \\ - 1 & 1 \end{array}] and L_{2} (m) = [\begin{array}{ccccc} 1 & - 2 & 1 \\ ⋱ & ⋱ & ⋱ \\ 1 & - 2 & 1 \end{array}] .$ Matrices $L_{1}$ and $L_{2}$ represent, respectively, the 1D first and second order discrete differentiation operator; they are often employed in image reconstruction problems [Citation41]. Such a modification is introduced to incorporate in the iterative process smoothing properties of conductivity in the horizontal and vertical directions, as well as to provide the reader with further insight into the role that the regularizer matrix plays into the stabilization of the inverse problem. Illustrative numerical examples are postponed to next sections.

3.1. Jacobian matrix

To apply Step 2 above, it is mandatory the calculation of the derivative of $F$ with respect to $k$ , that depends, implicitly, on the derivative of $T$ with respect to the same variable. Variation of $T$ as a function of $k$ at time step t is determined by the Jacobian of $T$ denoted here by $J (t)$ , $J (t) = [\frac{\partial T}{\partial k_{1}} (k, t), \dots, \frac{\partial T}{\partial k_{(n + 1)^{2}}} (k, t)] .$ It can be calculated as follows. Taking derivative with respect to $k_{ℓ}$ on both sides of (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ) and assuming continuity of $T$ leads to (50) $C \frac{d}{d t} (\frac{\partial T}{\partial k_{ℓ}} (k, t)) = \frac{\partial M}{\partial k_{ℓ}} T (k, t) + M \frac{\partial T}{\partial k_{ℓ}} (k, t),$ (50) where $M$ denotes the matrix in (Equation38(38) $M = (\begin{array}{cccc} {\hat{F}}_{0} \\ {\hat{F}}_{1} \\ ⋱ \\ {\hat{F}}_{n} \end{array}) + P (\begin{array}{cccc} {\hat{G}}_{0} \\ {\hat{G}}_{1} \\ ⋱ \\ {\hat{G}}_{n} \end{array}) P^{T},$ (38) ). Now, since initial values of $T$ do not depend on $k$ , it follows that the ℓ-th column of the Jacobian matrix solves the IVP (51) ${\begin{cases} C \frac{d}{d t} (\frac{\partial T}{\partial k_{ℓ}} (k, t)) = M \frac{\partial T}{\partial k_{ℓ}} (k, t) + W_{ℓ} (t) \\ \frac{\partial T}{\partial k_{ℓ}} (k, 0) = 0 \end{cases},$ (51) where (52) $W_{ℓ} (t) = \frac{\partial M}{\partial k_{ℓ}} T (k, t), ℓ = 1, 2, \dots, (n + 1)^{2} .$ (52) Therefore, computing the columns of the Jacobian matrix requires solving a sequence of IVPs with distinct source terms $W_{ℓ} (t)$ , the computation of which is straightforward. In fact, it is not difficult to see that $\begin{aligned} \frac{\partial M}{\partial k_{1}} & = [(\begin{array}{clll} α d_{0} r_{0}^{T} \\ 0 \\ ⋱ \\ 0 \end{array}) + P (\begin{array}{clll} α d_{0} r_{0}^{T} \\ 0 \\ ⋱ \\ 0 \end{array}) P^{T}], \\ \frac{\partial M}{\partial k_{2}} & = [(\begin{aligned} d_{1} r_{1}^{T} \\ 0 \\ ⋱ \\ 0 \end{aligned}) + P (\begin{aligned} 0 \\ α d_{0} r_{0}^{T} \\ ⋱ \\ 0 \end{aligned}) P^{T}], \\ ⋮ \\ \frac{\partial M}{\partial k_{n + 1}} & = [(\begin{aligned} α d_{n} r_{n}^{T} \\ 0 \\ ⋱ \\ 0 \end{aligned}) + P (\begin{aligned} 0 \\ 0 \\ ⋱ \\ α d_{0} r_{0}^{T} \end{aligned}) P^{T}] . \end{aligned}$ Derivatives with respect to the remaining parameters are calculated similarly. Summarizing, to apply LMM we compute $F^{'} (k)$ at every needed time step, which is done by solving IVPs (Equation51(51) ${\begin{cases} C \frac{d}{d t} (\frac{\partial T}{\partial k_{ℓ}} (k, t)) = M \frac{\partial T}{\partial k_{ℓ}} (k, t) + W_{ℓ} (t) \\ \frac{\partial T}{\partial k_{ℓ}} (k, 0) = 0 \end{cases},$ (51) ) for all source terms. For clarity purposes, observe that, because $T$ is vector in $R^{N (n + 1)^{2}}$ with block entries $T (t_{i})$ , $i = 1, \dots, N$ , as described in (Equation43(43) $T = [\begin{matrix} T (t_{1}) \\ ⋮ \\ T (t_{N}) \end{matrix}] \in R^{N (n + 1)^{2}},$ (43) ), it is easy to see that $F^{'} (k) = [\begin{matrix} J_{1} \\ J_{2} \\ ⋮ \\ J_{N} \end{matrix}],$ where $J_{q} = J (t_{q})$ represents the Jacobian of $T$ with respect to $k$ at time step $t_{q}$ , for $q = 1, 2, \dots, N$ . We recall that the computation of $J_{q}$ demands to solve a sequence of $(n + 1)^{2}$ IVPs and, in this sense, for large problems, it may be interesting to use parallel computing, in order to reduce operational time along with memory usage.

3.2. Numerical experiments

We report the outcome of numerical experiments to illustrate the effectiveness of the proposed reconstruction method on some test problems using synthetic data. To this end, noisy measurements are simulated by adding random noise to actual data $T (x_{i}, y_{j}, t_{q})$ and arranged in a vector data $\tilde{T} = T + e$ , with $T$ defined in (Equation43(43) $T = [\begin{matrix} T (t_{1}) \\ ⋮ \\ T (t_{N}) \end{matrix}] \in R^{N (n + 1)^{2}},$ (43) ), where e is a vector containing zero-mean random numbers scaled in such a way that (53) $‖ \tilde{T} - T ‖ = NL ‖ T ‖ \leq δ,$ (53) where $NL$ stands for relative noise level in the data and δ is assumed to be available.

As it is often difficult to get exact solutions of problem (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ), in our numerical experiments we will use synthetic data generated by solving the transport model by the finite element method (FEM). For this, the input data will be extracted from [Citation20]. Before proceeding, it is instructive to compare the quality of the solutions obtained by CPM and FEM. To do this, we will use the test problem described at the end of Section 2 whose solution is available. Recall that, when using FEM the domain Γ is discretized using a triangular mesh with $\tilde{N}$ nodes and approximate solutions are determined as (54) $T_{h} (x, y, t) = \sum_{i = 1}^{\tilde{N}} c_{i} (t) φ_{i} (x, y), (x, y, t) \in Γ \times (0, \bar{t}],$ (54) where $c_{i} (t)$ are unknowns to be determined and $φ_{i}$ are frequently called hat functions, every one being continuous, piecewise linear (a plane inside every element/triangle of the mesh), taking a unit value at the ith node and zero elsewhere. Parameter h stands for the global mesh size, defined as the maximum distance between each pair of nodes in the mesh. Considering $C (x, y) = C$ constant in (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) ), FEM turns Equations (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) into the semidiscrete model (55) ${\begin{cases} C \tilde{M} \frac{d c}{d t} + \tilde{A} c = \tilde{f} (t), t \in (0, \bar{t}] \\ c (0) = 0 \end{cases},$ (55) where $c (t) = (c_{1} (t), \dots, c_{\tilde{N}} (t))^{T}$ , $\tilde{M}$ and $\tilde{A}$ are $\tilde{N} \times \tilde{N}$ known as mass and stiffness matrices, resp., and $\tilde{f} (t)$ contains other independent terms. For more information on FEM discretization, see, for example, Johnson [Citation42] and Larson and Bengzon [Citation43].

To proceed, in a similar way as the semi-discrete problem associated with CPM was solved in Section 2, here the IVP (Equation55(55) ${\begin{cases} C \tilde{M} \frac{d c}{d t} + \tilde{A} c = \tilde{f} (t), t \in (0, \bar{t}] \\ c (0) = 0 \end{cases},$ (55) ) is also solved by the CN method. This guarantees second-order approximations in space and time, the order in space being in relation to the global mesh size h [Citation42]. We calculate approximate solutions $T_{h}$ for $h = 0.1$ and $h = 0.01$ with $Δ t = 0.01$ for both cases. In practice, for chosen h, a structured mesh with $\tilde{N}$ nodes is generated using MATLAB's PDE toolbox via function initmesh. It is noteworthy mentioning that for the above values of h correspond meshes consisting of $\tilde{N} = 185$ and $\tilde{N} = 17361$ nodes, respectively. Note that the refinement made on the mesh results in a significant increase in computational cost, since the unknowns of the linear system grow from $\tilde{N} = 183$ to $\tilde{N} = 17361$ , which in addition to making the application of the CN method more expensive, allows for the potential growth of rounding errors. Figure shows a simple mesh, the respective approximate solution $T_{h}$ at time t = 1 and also the structure of mass matrix, which is repeated in the stiffness matrix, both with density around $3.53 %$ .

Figure 4. Example of FEM mesh's structure with 185 nodes, 328 triangles and global mesh size h = 0.1 (left), respective solution with CN ( $Δ t = 0.01$ ) at time t = 1 (middle) and sparse structure of mass matrix $\tilde{M}$ (right), for data used in Figure .

Normwise errors in the maximum norm are also calculated and defined by ${\hat{E}}_{h} (t) = max_{(x_{i}, y_{j})} | T_{h} (x_{i}, y_{j}, t) - T (x_{i}, y_{j}, t) |,$ with $(x_{i}, y_{j})$ varying along the nodes. The errors associated with the approximate solutions $T_{h}$ for three time stages are shown in Table , in which, for comparison purposes, the errors of CPM-based solutions reported in the previous section are also included. The superior performance of CPM is clearly seen in this table, even in the case h = 0.01, which involves working with linear systems of size $17361 \times 17361$ , as opposed to CPM that deals with systems $121 \times 121$ . This ends the desired comparison.

Table 1. Errors associated with FEM-based solutions for two global mesh sizes and errors associated with CPM.

Display Table

We are now in a position to describe some examples of conductivity reconstruction. As already mentioned, in practice, it is difficult to get exact temperature data $T (x, y, t)$ satisfying the conduction model. So, to challenge the reconstruction method, we shall use synthetic data generated by solving the direct problem through the combination of FEM and CN method. All our numerical experiments are conducted using the following input data: (56) $\begin{aligned} g (x, y, t) & = 0, f_{1} (y, t) = - 1, f_{2} (y, t) = 1, \end{aligned}$ (56) (57) $\begin{aligned} f_{3} (x, t) & = - 1, f_{4} (x, t) = 1, and h_{2} (y) = h_{4} (x) = 0, \end{aligned}$ (57) with $Γ = (0, 1) \times (0, 1)$ , $\bar{t} = 1$ , capacity $C (x, y) = 1$ , initial condition $T_{0} (x, y) = 0$ , with conductivity $k (x, y)$ to be described in consonance with the case/example under consideration. For all examples, we choose $h = 0.01 = Δ t$ ; Thus, $T_{h}$ is $O (h^{2}) = O (10^{- 4})$ at each time step and we accept it as a reliable alternative to the lack of exact T. We emphasize that our decision of using FEM instead of CPM to generate data obeys the need to prevent the inverse crime [Citation44,Citation45]: the inverse crime occurs when the same numerical model is used to synthesize as well as to invert data in an inverse problem. For this, in our numerical examples, what we call ‘exact’ data, supposed to satisfy (Equation43(43) $T = [\begin{matrix} T (t_{1}) \\ ⋮ \\ T (t_{N}) \end{matrix}] \in R^{N (n + 1)^{2}},$ (43) ), are values obtained by interpolating $T_{h}$ in the grid generated by Chebyshev points.

All the reconstruction examples in this section are performed using noisy data $\tilde{T}$ consisting of temperature values on a grid of n + 1 = 16 Chebyshev points associated with N = 5 time steps equally distributed from $0.1$ to $0.5$ . We emphasize that, since the number of unknowns is $(n + 1)^{2}$ and as the temperature in the entire mesh generates $(n + 1)^{2}$ non-linear relations at each stage of time, temperature values of only N = 2 stages are sufficient to make the minimization problem overdetermined. In this case, $N = 2$ would be sufficient to determine conductivity estimates. Our choice N = 5 aims to make the minimization problem highly overdetermined to, on the one hand, allow better noise filtering and, on the other hand, expect that the minimization process results in a unique solution. As for LMM, it starts with $k^{0}$ constant equals to 1/12 along the grid. Other parameters, necessary to LMM, are taken as $η = γ = 0.7$ and $ν = 10^{- 2}$ . For more information about LMM parameters the reader is referred to [Citation46, Section ].

To terminate the estimation process, LMM is stopped by means of the discrepancy principle (DP) [Citation33], i.e. the iterates stop at the first j such that (58) $‖ T (k^{j}) - \tilde{T} ‖_{2} \leq τ δ, τ > 1,$ (58) where τ is a safeward parameter intended to prevent underestimation of the data error $‖ \tilde{T} - T ‖$ , see (Equation44(44) $‖ \tilde{T} - T ‖ = ‖ e ‖ \leq δ,$ (44) ). In agreement with this, τ is often chosen close to 1, though the option τ close to 2 be also found in literature, see e.g. [Citation47, Section ]. In this paper, all implementations consider $τ = 1.1$ . DP is commonly used in solving ill-posed problems because the presence of noise in the data generates during the iterations the so-called semi-convergence behaviour [Citation48]. Roughly speaking, while early iterations capture relevant information about the sought solution, as j grows the iterations capture information about the noise, corrupting gradually the iterative process, see Figure . Thus, DP is used to select an iterate capable of balancing accuracy and stability on the approximate solution.

Figure 5. Residual $‖ T (k^{j}) - \tilde{T} ‖_{2}$ (left) and relative error between $k^{j}$ and the exact conductivity $k (x, y) = (1 + x + y) / 12$ (right), $NL = 1 %$ .

Figure 5. Residual ‖T(kj)−T~‖2 (left) and relative error between kj and the exact conductivity k(x,y)=(1+x+y)/12 (right), NL=1%.

As for notation on tables and figures, RE denotes the relative error between exact k and the approximation and IE represents the relative error considering just grid points inside the grid, i.e. without nodes on the boundary. The error IE is calculated to evaluate the behaviour of the solutions close to the boundary.

Example 1

Consider conductivity $k (x, y) = (1 + x + y) / 12$ and noise level $NL = 0.1 %$ . This first example aims to show the influence of different scaling matrices $R^{T} R$ introduced in (Equation48(48) $(F^{' T} (k^{j}) F^{'} (k^{j}) + μ^{j} R^{T} R) d = - F^{' T} (k^{j}) F (k^{j}),$ (48) ) on the LMM performance in solving the inverse problem, for different values of α in the modified model (Equation36(36) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (36) ). Figure compares generated solutions. Note that the effect of increasing α, Figure (b, c), is seen as a correction of discontinuities at the edge of the reconstructions. This can be explained as follows. As $α + β = 1$ with $α, β \geq 0$ , a consequence of (Equation33(33) $\begin{aligned} d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) & = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) + β d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) \\ = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) + β d_{0} f_{1} (y_{j}, t), \end{aligned}$ (33) )–(Equation34(34) $\begin{aligned} d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) & = α d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) + β d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) \\ = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) \\ + β [h_{2} (y_{j}) T (x_{n}, y_{j}, t) d_{n} - f_{2} (y_{j}, t) d_{n}] . \end{aligned}$ (34) ) is that, if on the one hand $α \approx 1$ strengthens the conductivity values $k (x, y)$ on the boundaries, on the other hand, $β = 1 - α \approx 0$ weakens the influence of both the source term and boundary conditions in the model, see again (Equation33(33) $\begin{aligned} d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) & = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) + β d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) \\ = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) + β d_{0} f_{1} (y_{j}, t), \end{aligned}$ (33) )–(Equation34(34) $\begin{aligned} d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) & = α d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) + β d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) \\ = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) \\ + β [h_{2} (y_{j}) T (x_{n}, y_{j}, t) d_{n} - f_{2} (y_{j}, t) d_{n}] . \end{aligned}$ (34) ). Clearly, a reverse effect should occur when $α \approx 0$ . This suggests that an appropriate balance between α and β is needed, and this is what the numerical results seem to show as α increases starting from values close to zero. However, finding such a balance is difficult since, as it can be seen in Table , as α increases, the number of iterations increases as well, bringing instability to the iterates and not significantly reducing the relative error. On the other hand, Figure (d,e) presents the regularizing effect of $L_{1}$ and $L_{2}$ from (Equation49(49) $L_{i} = [\begin{matrix} I_{n + 1} \otimes L_{i} (n + 1) \\ L_{i} (n + 1) \otimes I_{n + 1} \end{matrix}], i = 1, 2,$ (49) ), smoothing up the solutions, both in interior and boundary, even with $α = 0.1$ , at a fairly small number of iterations. Tests for bigger/smaller α brought small changes and were omitted. Table summarizes the results. In resume, the information we emphasize is that a good choice of the regularizer $R$ can lead to good results, while reducing computational effort.

Figure 6. Comparison between solutions for $k (x, y) = (1 + x + y) / 12$ calculated with different choices of $R$ and α. For (d) and (e), $α = 0.1$ .

Table 2. Errors and number of iterations until DP is satisfied, for different choices of $R$ and α, considering as exact $k (x, y) = (1 + x + y) / 12$ .

Display Table

Example 2

Consider again noise level $NL = 0.1 %$ and take $α = 0.1$ fixed. To assess the effectiveness of the proposed method we choose four different conductivities, namely: $\begin{aligned} k_{1} (x, y) & = (1 + x^{2} + y^{2}) / 12, \\ k_{2} (x, y) & = (1 + y^{3}) / 12, \\ k_{3} (x, y) & = (1 + \sin (x) \sin (y)) / 12 and \\ k_{4} (x, y) & = (1 + \sin (2 π x)^{2} \sin (2 π y)^{2}) / 12, \end{aligned}$ where the regularizer is taken as $R = L_{2}$ . Considering the results in Figure , it is observed that the use of $R$ produces smoothing effect on the calculated solutions $k$ , while reducing unwanted artefacts both at the boundary and inside Γ. Results are summarized in Figure and Table . Observe the good fitting between the sought solution and the reconstruction, with small number of iterations, contributing on the numerical applications, especially bearing in mind that calculating the Jacobian may become expensive.

Figure 7. Contour plots for every exact $k_{i} (x, y)$ (top) and the respective approximation (bottom), $i = 1, \dots, 4$ .

Figure 7. Contour plots for every exact ki(x,y) (top) and the respective approximation (bottom), i=1,…,4.

Table 3. Relative errors RE, interior error IE and number of LMM iterations until DP is satisfied.

Display Table

Example 3

This last example refers to the reconstruction of three conductivities namely, k used in Example 1 and $k_{2}$ , $k_{3}$ of the previous example, for different noise levels in the data. To this end we set $R = L_{2}$ , $α = 0.1$ and choose three values for NL: 2.5%, 1.5% and 0.5%. In Figure , top, we show reconstructions of the lines $k (0.5, y)$ , $k_{2} (0.5, y)$ and $k_{3} (0.5, y),$ while at the bottom, we show relative reconstruction errors on the whole mesh. The numerical results confirm what is often seen in solving inverse problems: as NL gets smaller, so does the reconstruction error.

Figure 8. Behavior of reconstruction results solution for different noise levels. Top: exact conductivities and respective approximations along the line x = 0.5. Bottom: absolute value of reconstruction errors along the line x = 0.5 and relative error RE.

4. Numerical method for the inverse problem – orthotropic case

We now consider the conduction problem (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) )–(Equation6(6) $T (x, y, 0) = T_{0} (x, y) in region Γ .$ (6) ) focusing on the orthotropic case, i.e.with conductivity tensor $K (x, y) = [\begin{array}{cc} k_{11} (x, y) & 0 \\ 0 & k_{22} (x, y) \end{array}],$ and we first describe the Chebyshev pseudospectral method used to solve the direct problem. To this end rewrite (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) ) as (59) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \frac{\partial}{\partial x} (k_{11} (x, y) \frac{\partial T}{\partial x} (x, y, t)) + \frac{\partial}{\partial y} (k_{22} (x, y) \frac{\partial T}{\partial y} (x, y, t)) + g (x, y, t)$ (59) and then note that along the horizontal direction we have a process identical to (Equation17(17) $\frac{\partial}{\partial x} (k (x_{i}, y_{j}) \frac{\partial T}{\partial x} (x_{i}, y_{j}, t)) \approx e_{i + 1}^{T} D K_{j}^{x} {\dot{T}}_{j}, i = 0, \dots, n, j = 0, 1, \dots, n,$ (17) ), changing k by $k_{11}$ in (Equation20(20) $T_{j}^{x} = [\begin{matrix} \frac{\partial}{\partial x} (k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t)) \\ ⋮ \\ \frac{\partial}{\partial x} (k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t)) \end{matrix}], j = 0, 1, \dots, n .$ (20) ), which leads to (60) $[\begin{matrix} T_{0}^{x} \\ T_{1}^{x} \\ ⋮ \\ T_{n}^{x} \end{matrix}] \approx (\begin{array}{cccc} F_{0} \\ F_{1} \\ ⋱ \\ F_{n} \end{array}) [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + [\begin{matrix} d_{0} f_{1} (y_{0}, t) - d_{n} f_{2} (y_{0}, t) \\ d_{0} f_{1} (y_{1}, t) - d_{n} f_{2} (y_{1}, t) \\ ⋮ \\ d_{0} f_{1} (y_{n}, t) - d_{n} f_{2} (y_{n}, t) \end{matrix}]$ (60) where we set $F_{j} = D_{1} {\overset{ˇ}{K}}_{j} D_{2} + h_{2} (y_{j}) d_{n} e_{n + 1}^{T}$ with ${\overset{ˇ}{K}}_{j} = diag (k_{11} (x_{1}, y_{j}), \dots, k_{11} (x_{n - 1}, y_{j})), j = 0, 1, \dots, n,$ and $T (x_{i}, y_{j})$ enumerated in lexicographic order. For approximations in the vertical direction we change k by $k_{22}$ and proceed as before, leading to the initial value problem (61) ${\begin{cases} C \frac{d T}{d t} = H (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (61) where (62) $H (t, T) = [(\begin{array}{cccc} F_{0} \\ F_{1} \\ ⋱ \\ F_{n} \end{array}) + P (\begin{array}{cccc} G_{0} \\ G_{1} \\ ⋱ \\ G_{n} \end{array}) P^{T}] [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + S (t),$ (62) for $G_{i} = D_{1} {\hat{K}}_{i} D_{2} + h_{4} (x_{i}) d_{n} e_{n + 1}^{T},$ with ${\hat{K}}_{i} = diag (k_{22} (x_{i}, y_{1}), k_{22} (x_{i}, y_{2}), \dots, k_{22} (x_{i}, y_{n - 1})), i = 0, 1, \dots, n,$ and $S (t)$ being used to accommodate terms which do not involve unknowns. Essentially, blocks of $F$ and $G$ contain information related to $k_{11}$ and $k_{22}$ , respectively, and the rest is kept unchanged, exactly as in the isotropic case.

It is easy to see that function T does not depend on the values of $k_{11} (x, y)$ when $x = x_{0}$ and $x = x_{n}$ , due to the incorporation of boundary conditions. The same happens for $k_{22} (x, y)$ at $y = y_{0}$ and $y = y_{n}$ , and again, an alternative discretization can be applied, for example, by considering a strategy such as in (Equation33(33) $\begin{aligned} d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) & = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) + β d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) \\ = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) + β d_{0} f_{1} (y_{j}, t), \end{aligned}$ (33) ) and (Equation34(34) $\begin{aligned} d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) & = α d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) + β d_{n} k (x_{n}, y_{j}) \frac{\partial T}{\partial x} (x_{n}, y_{j}, t) \\ = α d_{0} k (x_{0}, y_{j}) \frac{\partial T}{\partial x} (x_{0}, y_{j}, t) \\ + β [h_{2} (y_{j}) T (x_{n}, y_{j}, t) d_{n} - f_{2} (y_{j}, t) d_{n}] . \end{aligned}$ (34) ). Therefore, the semidiscrete model takes the form (63) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (63) where (64) $\hat{H} (t, T) = M T + \hat{S} (t),$ (64) and, again following previous notation, (65) $M = (\begin{array}{cccc} {\hat{F}}_{0} \\ {\hat{F}}_{1} \\ ⋱ \\ {\hat{F}}_{n} \end{array}) + P (\begin{array}{cccc} {\hat{G}}_{0} \\ {\hat{G}}_{1} \\ ⋱ \\ {\hat{G}}_{n} \end{array}) P^{T},$ (65) for $\begin{array}{l} {\hat{F}}_{j} = [\hat{D} K_{j}^{x} D + β h_{2} (y_{j}) d_{n} e_{n + 1}^{T}], K_{j}^{x} = diag (k_{11} (x_{0}, y_{j}), \dots, k_{11} (x_{n}, y_{j})), j = 0, 1, \dots n, \\ {\hat{G}}_{i} = [\hat{D} K_{i}^{y} D + β h_{4} (x_{i}) d_{n} e_{n + 1}^{T}], K_{i}^{y} = diag (k_{22} (x_{i}, y_{0}), \dots, k_{22} (x_{i}, y_{n})), i = 0, 1, \dots n, \end{array}$ and $\hat{S} (t)$ accommodating independent terms. Although the notation here is the same as for the isotropic case, the reader should be aware that the variables involved are different. In fact, in the previous sections, $K_{j}^{x}$ and $K_{i}^{y}$ were related exclusively to k, while in the orthotropic case they involve $k_{11}$ and $k_{22}$ . Ultimately, above relations recover the isotropic case by taking $k_{11} = k_{22} = k$ .

Remark 1

Although beyond the scope of this paper, we note that a semi-discrete model similar to (Equation63(63) ${\begin{cases} C \frac{d T}{d t} = \hat{H} (t, T), t \in (0, \bar{t}] \\ T (0) = T^{0} \end{cases},$ (63) ) for the anisotropic case can be developed analogously. In fact, in this case, Equation (Equation1(1) $C (x, y) \frac{\partial T}{\partial t} (x, y, t) = \nabla \cdot [K (x, y) \nabla T (x, y, t)] + g (x, y, t) in region Γ \times (0, \bar{t}],$ (1) ) takes the form $C \frac{\partial T}{\partial t} = \frac{\partial}{\partial x} (k_{11} \frac{\partial T}{\partial x} + k_{12} \frac{\partial T}{\partial y}) + \frac{\partial}{\partial y} (k_{21} \frac{\partial T}{\partial x} + k_{22} \frac{\partial T}{\partial y}) + g$ and every derivative can be approximated using CPM as in (Equation25(25) $[\begin{matrix} T_{0}^{x} \\ T_{1}^{x} \\ ⋮ \\ T_{n}^{x} \end{matrix}] \approx (\begin{matrix} F_{0} \\ F_{1} \\ ⋱ \\ F_{n} \end{matrix}) [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + [\begin{matrix} d_{0} f_{1} (y_{0}, t) - d_{n} f_{2} (y_{0}, t) \\ d_{0} f_{1} (y_{1}, t) - d_{n} f_{2} (y_{1}, t) \\ ⋮ \\ d_{0} f_{1} (y_{n}, t) - d_{n} f_{2} (y_{n}, t) \end{matrix}],$ (25) ) and (Equation29(29) $[\begin{matrix} T_{0}^{y} \\ T_{1}^{y} \\ ⋮ \\ T_{n}^{y} \end{matrix}] \approx P (\begin{array}{cccc} G_{0} \\ G_{1} \\ ⋱ \\ G_{n} \end{array}) P^{T} [\begin{matrix} T_{0} \\ T_{1} \\ ⋮ \\ T_{n} \end{matrix}] + P [\begin{matrix} d_{0} f_{3} (x_{0}, t) - d_{n} f_{4} (x_{0}, t) \\ d_{0} f_{3} (x_{1}, t) - d_{n} f_{4} (x_{1}, t) \\ ⋮ \\ d_{0} f_{3} (x_{n}, t) - d_{n} f_{4} (x_{n}, t) \end{matrix}] .$ (29) ).

As for the inverse problem for the orthotropic case, the goal now is to recover conductivities $k_{11}$ and $k_{22}$ based on data values of T contaminated by noise. We start by enumerating variables $k_{11} (x_{i}, y_{j})$ and $k_{22} (x_{i}, y_{j})$ as done before (see Section 3) and arranging them in a vector (66) $k = [\begin{matrix} k^{11} \\ k^{22} \end{matrix}], k^{11}, k^{22} \in R^{(n + 1)^{2}},$ (66) such that function values $T$ depend on $k$ . The computation of the Jacobian matrix $\begin{aligned} J (t) & = [\frac{\partial T}{\partial k_{1}} (k, t), \dots, \frac{\partial T}{\partial k_{2 (n + 1)^{2}}} (k, t)] \\ = [\frac{\partial T}{\partial k_{1}^{11}} (k, t), \dots, \frac{\partial T}{\partial k_{(n + 1)^{2}}^{11}} (k, t), \frac{\partial T}{\partial k_{1}^{22}} (k, t), \dots, \frac{\partial T}{\partial k_{(n + 1)^{2}}^{22}} (k, t)] \end{aligned}$ remains essentially the same, but noting that the problem now has $2 (n + 1)^{2}$ unknown parameters and that the source terms in (Equation52(52) $W_{ℓ} (t) = \frac{\partial M}{\partial k_{ℓ}} T (k, t), ℓ = 1, 2, \dots, (n + 1)^{2} .$ (52) ) change accordingly. Of course, if $M$ is the matrix in (Equation65(65) $M = (\begin{array}{cccc} {\hat{F}}_{0} \\ {\hat{F}}_{1} \\ ⋱ \\ {\hat{F}}_{n} \end{array}) + P (\begin{array}{cccc} {\hat{G}}_{0} \\ {\hat{G}}_{1} \\ ⋱ \\ {\hat{G}}_{n} \end{array}) P^{T},$ (65) ), its derivatives with respect to $k$ involve only ${\hat{F}}_{j}$ blocks for the first $(n + 1)^{2}$ entries of $k$ and ${\hat{G}}_{i}$ : $\frac{\partial M}{\partial k_{1}^{11}} = (\begin{array}{clll} α d_{0} r_{0}^{T} \\ 0 \\ ⋱ \\ 0 \end{array}), \frac{\partial M}{\partial k_{2}^{11}} = (\begin{array}{clll} d_{1} r_{1}^{T} \\ 0 \\ ⋱ \\ 0 \end{array}), \dots,$ $\frac{\partial M}{\partial k_{(n + 1)^{2}}^{11}} = (\begin{array}{clll} 0 \\ ⋱ \\ 0 \\ α d_{n} r_{n}^{T} \end{array}), \frac{\partial M}{\partial k_{1}^{22}} = P (\begin{array}{clll} α d_{0} r_{0}^{T} \\ 0 \\ ⋱ \\ 0 \end{array}) P^{T},$ $\frac{\partial M}{\partial k_{2}^{22}} = P (\begin{array}{clll} d_{1} r_{1}^{T} \\ 0 \\ ⋱ \\ 0 \end{array}) P^{T}, \dots, \frac{\partial M}{\partial k_{(n + 1)^{2}}^{22}} = P (\begin{array}{clll} 0 \\ ⋱ \\ 0 \\ α d_{n} r_{n}^{T} \end{array}) P^{T},$ hence the computation of the source term in (Equation52(52) $W_{ℓ} (t) = \frac{\partial M}{\partial k_{ℓ}} T (k, t), ℓ = 1, 2, \dots, (n + 1)^{2} .$ (52) ) is straightforward. Thus in this case, computing $J$ requires solving $2 (n + 1)^{2}$ IVPs. Nevertheless, we anticipate that in practice only a small number of Jacobians are necessary to produce good results, which in a way encourages the application of the proposed method.

Next step is to apply LMM coupled with the discrepancy principle, as described in Section 3, to the problem of finding $k^{*}$ such that (67) $k^{*} = \underset{k \in R^{2 (n + 1)^{2}}}{argmin} ϕ (k), ϕ (k) = \frac{1}{2} ‖ T (k) - \tilde{T} ‖_{2}^{2},$ (67) with $\tilde{T}$ as in (Equation53(53) $‖ \tilde{T} - T ‖ = NL ‖ T ‖ \leq δ,$ (53) ).

As before, for the scaling matrix used in LMM we choose a regularizer $R$ , that now has the form (68) $R = [\begin{array}{cc} R^{11} & 0 \\ 0 & R^{22} \end{array}],$ (68) where $R^{11}$ and $R^{22}$ are introduced to enforce smoothness on $k^{11}$ and $k^{22}$ separately. To see this effect more clearly, it suffices to note that LMM solves (Equation48(48) $(F^{' T} (k^{j}) F^{'} (k^{j}) + μ^{j} R^{T} R) d = - F^{' T} (k^{j}) F (k^{j}),$ (48) ), (69) $\underset{k \in R^{2 (n + 1)^{2}}}{argmin} {‖ F^{'} (k^{j}) d - F (k^{j}) ‖_{2}^{2} + (μ^{j})^{2} ‖ R d ‖_{2}^{2}},$ (69) with $d$ partitioned as $k$ in (Equation66(66) $k = [\begin{matrix} k^{11} \\ k^{22} \end{matrix}], k^{11}, k^{22} \in R^{(n + 1)^{2}},$ (66) ), i.e, $d$ possessing two blocks $d^{11}$ and $d^{22}$ , with the same size and position of $k^{11}$ and $k^{22}$ , respectively, so that $R d = [\begin{matrix} R^{11} d^{11} \\ R^{22} d^{22} \end{matrix}],$ with every block of the regularizer acting in each part of the unknown vector $d$ during the minimization process (Equation69(69) $\underset{k \in R^{2 (n + 1)^{2}}}{argmin} {‖ F^{'} (k^{j}) d - F (k^{j}) ‖_{2}^{2} + (μ^{j})^{2} ‖ R d ‖_{2}^{2}},$ (69) ). It is important to note that there is no formula for choosing $R$ and that the choice is related to the properties we want to preserve in our solutions. For numerical computations we use $R^{11} = R^{22} = L_{i}$ , i = 1, 2, as described in (Equation49(49) $L_{i} = [\begin{matrix} I_{n + 1} \otimes L_{i} (n + 1) \\ L_{i} (n + 1) \otimes I_{n + 1} \end{matrix}], i = 1, 2,$ (49) ); this choice incorporates smoothing properties directly in $k^{11}$ and $k^{22}$ .

4.1. Numerical experiments

Similar to the isotropic case, we report numerical results to illustrate the effectiveness of the proposed method using synthetic data generated by FEM. All FEM and LMM parameters are as in Section 3.2. For CPM we take n + 1 = 16 grid points in each direction and consider noise level $NL = 0.1 %$ and $α = 0.1$ . Data comprises T values at N = 5 time steps equally distributed from 0.1 to 0.5 and initial data $k^{0}$ is taken again as the constant 1/12. It is important to mention, however, that temperature measurements at 3 time stages turn the problem to be overdetermined, in which case we could expect a unique solution. As for the regularization matrices, due to the good fitting presented in our previous numerical results, we choose $R^{11} = R^{22} = L_{2}$ for all tests of this section, with $R$ in the form of (Equation68(68) $R = [\begin{array}{cc} R^{11} & 0 \\ 0 & R^{22} \end{array}],$ (68) ). Finally, our tests are divided into three cases, described in Table , with respective results displayed in Table and contour plots in Figure .

Table 4. Orthotropic test cases.

Display Table

Table 5. Orthotropic results for test cases in Table .

Display Table

Figure 9. Contour plots for every test case: 1 (top row), 2 (middle row) and 3 (bottom row).

Observe that the number of unknowns is $2 (n + 1)^{2} = 512$ , which is exactly the number of IVPs solved every time the Jacobian is needed. However, as it can be seen in Table , good quality results are obtained in a fairly small number of iterations (and so Jacobian computations as well). This shows that the proposed strategy is able to produce suitable reconstructions with acceptable computational effort, and so it can be useful in real-world applications.

Finally, to assess the quality of recovered conductivities, we solve the forward problem with conductivities determined with LMM as input data and compare the obtained solution T against the considered true function values T (generated via FEM). Results of the comparison as well as absolute value of the errors are displayed in Figure . Good fitting between data provided by FEM and approximations given by our strategy is evident. We point out that FEM solutions may contain imprecisions due to discretization and/or numerical treatment, in a way that our solutions could be even better if compared with the exact (yet unavailable) data.

Figure 10. Computed temperature T (above) at time t = 1 using the reconstructions of $k_{11}$ and $k_{22}$ and the absolute value of the error between FEM's T and its approximation (below).

5. Conclusion

We proposed a method for reconstructing orthotropic conductivity combining a solver for the direct problem based on a Chebyshev pseudospectral method and Levenberg–Marquardt method for solving a related non-linear least squares problem. The main feature of the proposed method is that instead of scaling diagonal matrices often used in classical LMM implementations, we use singular regularization matrices. The impact of such a replacement in terms of the quality of the reconstruction was illustrated with numerical experiments. Further, to cope with typical instabilities in inverse reconstruction problems due to noise in the data, we use the Morozov discrepancy principle as stopping rule. Numerical results show a good fit between exact and recovered conductivity at a very low computational cost, and exceptional stability when inverting noisy data due to the use of Morozov's stopping rule. The reason for this is because the method is simple to implement as well as because CPM builds high-order approximations using a rather small set of discretization points, contributing greatly for the reduction of computational effort. Future work includes the extension to simultaneous reconstruction of capacity and conductivity, as well as the LMM convergence analysis under the assumption that scaling matrices are replaced by singular matrices, as made in this paper.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

EB was supported by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Brazil [grant number 88887.178114/2018-00] and Fundação de Amparo à Pesquisa e Inovação do Estado de Santa Catarina (FAPESC), Brazil [grant number 88887.178114/2018-00]. FSVB was supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Brazil [grant number 308523/2017-2].

References

Özişik MN. Boundary value problems of heat conduction. Mineola (NY): ITC; 1968.
Google Scholar
Alessandrini G, de Hoop MV, Gaburro R. Uniqueness for the electrostatic inverse boundary value problem with piecewise constant anisotropic conductivities. Inverse Probl. 2017;33(12):125013.
Google Scholar
Astala K, Päivärinta L. Calderón's inverse problem for anisotropic conductivity in the plane. Commun Par Diff Eq. 2005;30(1–2):207–224.
Google Scholar
Gaburro R, Sincich E. Lipschitz stability for the inverse conductivity problem for a conformal class of anisotropic conductivities. Inverse Probl. 2015;31(1):015008.
Google Scholar
Isakov V. Inverse problems for partial differential equations. 2nd ed. Applied Mathematical Sciences. New York (NY): Springer; 2006.
Google Scholar
Alifanov OM. Inverse heat transfer problems. 1st ed. International Series in Heat and Mass Transfer. Berlin: Springer-Verlag; 1994.
Google Scholar
Elayyan A, Isakov V. On uniqueness of recovery of the discontinuous conductivity coefficient of a parabolic equation. SIAM J Math Anal. 1997;28(1):49–59.
Google Scholar
Carrera J, Neuman SP. Estimation of aquifer parameters under transient and steady-state conditions: 2. Uniqueness, stability and solution algorithms. Water Resour Res. 1986;22:211–227.
Google Scholar
Gol'dman NL. Inverse problems with final overdetermination for parabolic equations with unknown coefficients multiplying the highest derivative. Dokl Math. 2011;83:316–320.
Google Scholar
Huntul MJ, Hussein MS, Lesnic D, et al. Reconstruction of an orthotropic thermal conductivity from non-local heat flux measurements. Int J Math Model Numer Optim. 2020;10(1):102–122.
Google Scholar
Lionheart WRB. Conformal uniqueness results in anisotropic electrical impedance imaging. Inverse Probl. 1997;13(1):125–134.
Google Scholar
Mejias MM, Orlande HRB, Özişik MN. Effects of the heating process and body dimensions on the estimation of the thermal conductivity components of orthotropic solids. Inverse Probl Sci Eng. 2003;11(1):75–89.
Google Scholar
Rodrigues FA, Orlande HRB, Mejias MM. Use of a single heated surface for the estimation of thermal conductivity components of orthotropic 3D solids. Inverse Probl Sci Eng. 2004;12(5):501–517.
Google Scholar
Yen RH, Chen CY, Huang CT, et al. Numerical study of anisotropic thermal conductivity fabrics with heating elements. Int J Numer Method Heat Fluid Flow. 2013;23(5):750–771.
Google Scholar
Pasdunkorale J, Turner IW. A second order finite volume technique for simulating transport in anisotropic media. Int J Numer Method Heat Fluid Flow. 2003;13(1):31–56.
Google Scholar
Kohn R, Vogelius M. Determining conductivity by boundary measurements II. Interior results. Commun Pure Appl Math. 1985;38(5):643–667.
Google Scholar
Mera NS, Elliott L, Ingham DB, et al. A comparison of different regularization methods for a Cauchy problem in anisotropic heat conduction. Int J Numer Method Heat Fluid Flow. 2003;13(5):528–546.
Google Scholar
Mera NS, Elliott L, Ingham DB, et al. An iterative BEM for the Cauchy steady state heat conduction problem in an anisotropic medium with unknown thermal conductivity tensor. Inverse Probl Eng. 2000;8(6):579–607.
Google Scholar
Zhou HL, Xiao X, Chen HL, et al. Identification of thermal conductivity for orthotropic FGMs by DT-DRBEM and L-M algorithm. Inverse Probl Sci Eng. 2020;28(2):196–219.
Google Scholar
Mahmood MS, Lesnic D. Identification of conductivity in inhomogeneous orthotropic media. Int J Numer Method Heat Fluid Flow. 2019;29(1):165–183.
Google Scholar
Cao K, Lesnic D, Cola MJ. Determination of thermal conductivity of inhomogeneous orthotropic materials from temperature measurements. Inverse Probl Sci Eng. 2019;27(10):1372–1398.
Google Scholar
Monard F, Rim D. Imaging of isotropic and anisotropic conductivities from power densities in three dimensions. Inverse Probl. 2018;34(7):075005.
Google Scholar
Mustonen L. Numerical study of a parametric parabolic equation and a related inverse boundary value problem. Inverse Probl. 2016;32(10):105008.
Google Scholar
Borggaard J, van Wyk HW. Gradient-based estimation of uncertain parameters for elliptic partial differential equations. Inverse Probl. 2015;31(6):065008.
Google Scholar
Young LC. Orthogonal collocation revisited. Comput Methods Appl Mech Engrg. 2019;345:1033–1076.
Google Scholar
Berntsson F. A spectral method for solving the sideways heat equation. Inverse Probl. 1999;15(4):891–906.
Google Scholar
Ismailov MI, Bazán FSV, Bedin L. Time-dependent perfusion coefficient estimation in a bioheat transfer problem. Comput Phys Commun. 2018;230:50–58.
Google Scholar
Bazán FSV. Chebyshev pseudospectral method for wave equation with absorbing boundary conditions that does not use a first order hyperbolic system. Math Comput Simul. 2010;80(11):2124–2133.
Google Scholar
Bazán FSV, Bedin L, Bozzoli F. New methods for numerical estimation of convective heat transfer coefficient in circular ducts. Int J Therm Sci. 2019;139:387–402.
Google Scholar
Trefethen LN. Spectral methods in MATLAB. Vol. 10. Philadelphia (PA): Society for Industrial and Applied Mathematics; 2000.
Google Scholar
Gottlieb D, Orzag SA. Numerical analysis of spectral methods: theory and applications. Philadelphia (PA): SIAM; 1977.
Google Scholar
Yamashita N, Fukushima M. On the rate of convergence of the Levenberg-Marquardt method. In: Alefeld G, Chen X, editors. Topics in numerical analysis. Vol. 15. Vienna: Springer; 2001. p. 239–249.
Google Scholar
Morozov VA. Regularization methods for solving incorrectly posed problems. 1st ed. New York (NY): Springer-Verlag; 1984.
Google Scholar
Canuto C, Hussaini MY, Quarteroni A. Spectral methods in fluid dynamics. 1st ed. Springer Series in Computational Physics. Berlin: Springer-Verlag; 1988.
Google Scholar
Bernardi C, Maday Y. Approximations spectrales de problèmes methods aux limites elliptiques. 1st ed. Berlin: Springer-Verlag; 1992.
Google Scholar
Crank J, Nicolson P. A practical method for numerical evaluation of solutions of partial differential equations of the heat-conduction type. Math Proc Cambridge Philos Soc. 1947;43(1):50–67.
Google Scholar
Tikhonov AN, Goncharsky AV, Stepanov VV. Numerical methods for the solution of ill-posed problems. 1st ed. Vol. 328. Dordrecht: Springer Science & Business Media; 1995.
Google Scholar
Toint PL. Non-monotone trust region algorithm for nonlinear optimization subject to convex constraints. Math Program. 1997;77(3):69–94.
Google Scholar
Francisco JB, Bazán FSV. Nonmonotone algorithm for minimization on closed sets with application to minimization on Stiffen manifolds. J Comput Appl Math. 2012;236(10):2717–2727.
Google Scholar
Moré JJ. The Levenberg-Marquardt algorithm: implementation and theory. In: Watson GA, editor. Numerical analysis. Lecture Notes in Mathematics; Vol. 630. Berlin: Springer; 1978. p. 105–116.
Google Scholar
Bazán FSV, Cunha MCC, Borges LS. Extension of GKB-FP algorithm to large-scale general-form Tikhonov regularization. Numer Linear Algebra. 2014;21(3):316–339.
Google Scholar
Johnson C. Numerical solution of partial differential equations by the finite element method. 1st ed. Cambridge: Cambridge University Press; 1987.
Google Scholar
Larson MG, Bengzon F. The finite element method: theory, implementation, and applications. 1st ed. Berlin: Springer; 2013; Texts in Computational Science and Engineering.
Google Scholar
Mueller J, Siltanen S. Linear and nonlinear inverse problems with practical applications. Vol. 10. Philadelphia (PA): Society for Industrial and Applied Mathematics; 2012.
Google Scholar
Chávez CE, Atienza FA, Álvarez D. Avoiding the inverse crime in the inverse problem of electrocardiography: estimating the shape and location of cardiac ischemia. In: Computing in cardiology 2013; 2013. p. 687–690.
Google Scholar
Kelley CT. Iterative methods for optimization. Philadelphia (PA): Society for Industrial and Applied Mathematics; 1999.
Google Scholar
Kaltenbacher B, Neubauer A, Scherzer O. Iterative regularization methods for nonlinear ill-posed problems. Radon Series on Computational and Applied Mathematics; Vol. 6. Berlin: De Gruyter; 2008.
Google Scholar
Natterer F. The mathematics of computerized tomography. Stuttgart: Wiley; 1986.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Chebyshev pseudospectral method in the reconstruction of orthotropic conductivity

Abstract

1. Introduction

2. Chebyshev pseudospectral method for forward problem – isotropic case