406
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Chebyshev pseudospectral method in the reconstruction of orthotropic conductivity

, &
Pages 681-711 | Received 20 Sep 2019, Accepted 13 Jul 2020, Published online: 11 Aug 2020

Abstract

In this paper, we present a method to reconstruct the spatially varying conductivity tensor in isotropic and orthotropic materials, involved in a two-dimensional transient anisotropic model with Robin boundary conditions. For the reconstruction, the partial differential equation is solved by a semi-discrete method that combines a pseudospectral collocation method for spatial variables and Crank–Nicolson for time. The conductivity tensor is reconstructed through a non-linear least-squares problem solved by Levenberg–Marquardt method (LMM), along with Morozov's discrepancy principle as stopping rule to cope with noise in the data. Unlike classic LMM implementations that mitigate poor conditioning in calculating iterates using nonsingular diagonal scaling matrices, in this paper, singular regularization matrices are used. The impact of such a modification is illustrated with numerical experiments using discrete differential operators as scaling matrices. Numerical results show that accurate conductivity values can be obtained using a fairly small number of discretization points at a very low computational cost.

2010 Mathematics Subject Classifications:

1. Introduction

Materials whose physical properties, such as elasticity moduli, Poisson coefficients, heat conductivity, etc., vary depending on spatial orientation of the physical body are referred to as anisotropic, while those materials that do not change with spatial orientation are referred to as isotropic [Citation1]. Orthotropic material is a type of the anisotropic material whose characteristics remain unchanged along its planes of elastic symmetry. In nature, there are many materials that can be considered anisotropic such as crystal, woods, geological sediments and biological tissues. With the advent of new technologies, new anisotropic materials have been manufactured by industrial engineering, making it necessary to know their driving properties. These properties can be roughly defined by the difference in physical material or mechanical attributes when measured along different axes, such as absorbance, refraction, conductivity, tensile strength, etc.

In this paper, we study anisotropy in a two-dimensional transient conduction problem, with spatially varying conductivity tensor, aiming at a practical method for conductivity estimation based on experimental transient data.

Consider a two-dimensional anisotropic transport problem in the finite domain Γ×(0,t¯], t¯>0, with Γ=(0,1)×(0,1)R2, where R denotes the set of real numbers, described by the partial differential equation (1) C(x,y)Tt(x,y,t)=[K(x,y)T(x,y,t)]+g(x,y,t)inregionΓ×(0,t¯],(1) constrained to boundary conditions (2) (KT)n=f1(y,t),onx=0,y(0,1),t(0,t¯],(2) (3) (KT)n+h2(y)T(1,y,t)=f2(y,t),onx=1,y(0,1),t(0,t¯],(3) (4) (KT)n=f3(x,t),ony=0,x(0,1),t(0,t¯],(4) (5) (KT)n+h4(x)T(x,1,t)=f4(x,t),ony=1,x(0,1),t(0,t¯],(5) and initial condition (6) T(x,y,0)=T0(x,y)inregionΓ.(6) In this model, T(x,y,t) stands for temperature, C>0 is the heat capacity, K is the conductivity matrix, (7) K(x,y)=[k11(x,y)k12(x,y)k21(x,y)k22(x,y)](7) and g is a source term. In addition, hi, i = 2, 4, are heat transfer functions, fi, i=1,,4, are heat flux functions, and n=[nxny]T is the outward unit normal vector to the boundary of Γ. Here and elsewhere superscript T stands for transpose of a vector or a matrix. For future development and to be clear with the notation, observe that (KT)n=(k11Tx+k12Ty)nx(k21Tx+k22Ty)ny. The conductivity K is assumed continuous and positive definite matrix, which means k11>0 and k11k22k12k21>0. To classify the problem, K is said to be isotropic (k11=k22,k12=k21=0), orthotropic (k11k22,k12=k21=0) or anisotropic (k11k22,k120, k210). We note in passing that model (Equation1)–(Equation6) is also encountered in other applications areas in which K does not necessarily stand for thermal conductivity and where T(x,y,t) describes transient data of other nature as, e.g. electrostatic potential [Citation2–4]. Irrespective of the application under consideration, several problems associated with model (Equation1)–(Equation6) are found in literature and we are concerned with two of them. The first one, referred to as forward or direct problem, devoted to determine the function T(x,y,t) satisfying (Equation1)–(Equation6) where the remaining parameters are regarded as input data, and the second one referred to as an inverse problem, where the conductivity K is regarded as unknown and has to be estimated from measured temperature data. In this regard, it is known that, while the forward problem is governed by a linear elliptic well-posed operator, this is not the case with the inverse problem, which is non-linear and is ill-posed [Citation5, Chapter 9]. In fact, it is not obvious whether the conductivity K is (uniquely) determined by the data T. However, it is known that the uniqueness of the solution is only valid in some special cases [Citation5,Citation6]. For example, in the isotropic case, we quote the recovery of a discontinuous conductivity in a multidimensional problem with null source and homogeneous boundary conditions [Citation7], the identification problem of aquifer parameters [Citation8], and a 1D identification problem under certain Hölder smoothness conditions [Citation9]; for the orthotropic case, we quote the recovery of thermal conductivity from heat flux measurements [Citation10], whereas for the anisotropic case, we highlight the conductivity reconstruction problem on the homogeneous steady-state case under the constraint that the conductor is known up to an unknown multiple scalar [Citation11]. Anyway, even if the reconstruction problem allows for a (unique) solution, as the problem is ill-posed, small data errors can result in major disturbances in the solution, i.e. the solution does not depend continuously on the data.

Many studies to identify anisotropic conductivity from experimental data have been developed and reported in the literature [Citation12–14]. However, the limited availability of observational measures as well as the indirect nature of the estimation itself makes the problem solution difficult and thus more effective inverse solution techniques should be utilized. It is noteworthy that, in particular for the conduction problem addressed in this paper, Equations  (Equation1)–(Equation6), closed form solutions exist only for the orthotropic conductivity case, which does not happen for the anisotropic case [Citation15]. On the other hand, the inverse problem of reconstructing an anisotropic conductivity based on the steady-state assumption, in general, is not unique, even knowing the complete heat/current flux at the boundary [Citation11]. A similar observation holds true for the isotropic case, where conductivity is uniquely identified when the full Dirichlet-to-Neumann boundary map is known [Citation16], which is not always available in practice. To overcome this difficulty, other schemes were hence developed, including more information on the problem such as evaluations of temperature/potential inside the solution domain. As an example, Mera et al. [Citation17] applied the boundary element method (BEM) to solve the Cauchy steady-state heat conduction problem in an anisotropic medium. Other BEM-like techniques may be found in [Citation18,Citation19]. Recently, in connection with a time-dependent model, a finite difference method (FDM) has been employed together with a regularized non-linear minimization process to determine the orthotropic inhomogeneous tensor [Citation20]. In this case, numeric experiments were accomplished using the MATLAB lsqnonlin toolbox routine. In a similar flavour, a method for reconstructing orthotropic thermal conductivity based on the combination of FDM based on an alternating-direction-implicit scheme for solving the direct problem and the conjugate gradient method (CGM) as optimization tool, has been recently reported in [Citation21]. The novelty here is that the optimization problem is formulated in the L2 infinite-dimensional environment, which is interesting, but can be expensive due to the calculation of the involved L2-norm.

As mentioned above, mathematically equivalent applications for the conduction problem include reconstruction of electrical conductivity. For instance, in medical imaging applications, the numerical reconstructions of anisotropic conductivity arises in electrical impedance tomography [Citation2–4,Citation22]. Several techniques for computing anisotropic conductivity in this context have been used. In particular, in thermal tomography [Citation23], where the coefficient K is characterized by a finite number of parameters in a high-dimensional subset of Rp referred to as parameter domain, the direct problem is solved by coupling the finite element methods (FEM) in the spatial domain and a semi-implicit Euler method in the time interval. On its turn, the inverse problem is formulated as a non-linear least squares problem coupled with Tikhonov regularization. In contrast, in [Citation24], the parameter identification problem is modelled as a variational problem over stochastic Sobolev spaces, where a spectral approximation of the observation field is used to estimate the solution problem using a finite noisy model. In resume, most of these techniques approach the partial differential equation given by (Equation1), with Neumann, Dirichlet or mixed boundary conditions, by using one or more numerical methods, then an objective function is formulated and an optimization technique is chosen to find an optimal parameter.

The Chebyshev pseudospectral method (CPM), also referred to as orthogonal collocation, is a method for computing approximate solutions of ordinary or partial differential equations. It differs from other methods such as finite difference methods in that the solution is represented as a continuous or piecewise continuous function [Citation25]. Further, compared to other methods such as BEM or FEM, CPM is known to produce highly accurate solutions at lower computation cost, see, e.g. [Citation26–31]. The purpose of this paper is to offer an alternative method to reconstruct the orthotropic conductivity tensor, which is simple to implement and generalize to the anisotropic case, and relatively inexpensive when compared to existing ones. The main idea behind the method is to combine an efficient solver for the direct problem and the acknowledged value of the Levenberg–Marquardt method in solving non-linear last squares methods. Specifically, the solver involves a spatial discretization of the conduction problem (Equation1)–(Equation6), through Chebyshev pseudospectral method and the solution of the resulting semi-discrete model using the trapezoidal rule in time. The inverse conduction problem is solved through Levenberg–Marquardt method based on a version of Yamashita and Fukushima [Citation32], along with Morozov's discrepancy principle [Citation33] as stopping rule to cope with noisy data. In this respect, as opposed to classic LMM implementations that mitigate poor conditioning in calculating iterates using nonsingular diagonal scaling matrices, in this paper singular regularization matrices are used. This marks a significant difference from the version of LMM implemented in lsqnonlin used in [Citation20]. In addition, since the chosen regularization matrices correspond to discrete versions of differential operators, they produce, without additional cost, a smoothing effect in the computed solution similar to that produced by the Sobolev gradient used in [Citation21] (at the expense of additional work).

This paper is organized as follows. In Section 2, we consider the conduction model (Equation1)–(Equation6) in the isotropic case, with the conductivity as input data, and derive a semi-discrete model for the forward problem using CPM in order to compute function values T (temperature or electrostatic potential). Then, in Section 3, we describe and analyse a strategy to solve the inverse problem of finding conductivity values based on measured values of T at a few time steps, including in particular, a method for computing Jacobians as well as a precise description of LMM with singular regularization matrices instead of scaling diagonal matrices. Numerical results are also included. Section 4 follows the same organization as Section 3 but now in connection with the orthotropic scenario. We finish the paper with conclusions and references.

2. Chebyshev pseudospectral method for forward problem – isotropic case

In this section, we will describe how to solve (Equation1)–(Equation6) with the Chebyshev pseudospectral method. This method has become popular in solving ordinary or partial differential equations (PDEs) due to its high accuracy and lower computational cost compared with finite difference or finite element-based methods and has been successfully applied in solving forward and inverse heat conduction problems, see, e.g. [Citation27–29]. The construction of highly accurate solutions for PDEs lies on the fact that spatial derivatives can be approximated to high precision by means of the Chebyshev differentiation matrix.

Roughly speaking, pseudospectral methods construct approximate solutions in a space of algebraic polynomials of lower degree so that the differential equation being solved is satisfied in a specified number of points referred to as collocation points. For linear stationary and unidimensional problems on a bounded interval, for instance, if the collocation points are denoted by xj, j=0,,n, and if the approximate solution is denoted by un and expressed as (8) un(x)=k=0nαkk(x),(8) where αkun(xk),k=0,,n, and k(x) denotes the related Lagrange polynomial, then the coefficients αj become the unknowns of a linear system obtained by discretizing the differential equation in such a way that derivatives of the exact solution are estimated at the collocation points by differentiating (Equation8) and evaluating the result at xj. As a matter of fact, note that derivative values un(xj), un(xj)=k=0nαkk(xj),j=0,n, can be expressed in matrix form as (9) un(1)=Dun,(9) with un(1)=[un(x0),,un(xn)]T,un=[un(x0),,un(xn)]T, where D denotes the (n+1)×(n+1) differentiation matrix whose entries are (10) Dj,k=k(xj),j=0,,n,k=0,,n.(10) A consequence of (Equation9) is that if un(x) interpolates u(x) at the collocation points, derivative values u(xj) can be approximated as u(xj)un(xj) [Citation30], or in vector form, with abuse of notation, as (11) [u(x0)u(xn)]un(1)=D[u(x0)u(xn)].(11) The Chebyshev pseudospectral method takes as collocation points the Chebyshev–Gauss–Lobatto points, (12) xj=cos(πjn),j=0,,n,(12) and for this case, explicit formulas for Dj,k can be found in several references, see e.g. [Citation30]. Without getting into any details, we note that for the Chebyshev differentiation matrix the following L2 discrete estimate for u at the collocation points is known to hold [Citation34, Chapter 9, estimate 9.5.22] (13) uun=O(n2m)(13) where m denotes the regularity (smoothness) of u. This result implies that (Equation11) can lead to exponential order of convergence as far as u is sufficiently smooth. In words, the smoother the function u, the smaller the approximation error. We will end this very short explanation on pseudospectral methods by noting that multidimensional problems are treated similarly as above. For 2D cases for instance, if for the sake of simplicity we use the same number of collocation points, n + 1, in the x and y directions, then the Chebyshev pseudospectral method looks as approximate solution for a polynomial un(x,y), un(x,y)=j=0nk=0nun(xj,yk)j(x)k(y), where un(xj,yj) are unknowns to be determined by requiring that the given equation (and boundary condition) is satisfied at the grid points (xj,yk). For PDEs with time and space variables, on the other hand, the construction of the approximate solution is as above but with the sole difference that the problem is now transformed into a time-dependent system of ordinary differential equations for which several methods exist.

Returning to the problem we are concerned with, we will first focus on the isotropic case to describe more clearly how CPM works. As mentioned above this case means k11=k22=:k and k12=k21=0, which simplifies the numerical analysis. Therefore we can rewrite (Equation1) as (14) C(x,y)Tt(x,y,t)=x(k(x,y)Tx(x,y,t))+y(k(x,y)Ty(x,y,t))+g(x,y,t).(14) For simplicity, we shall consider a mesh consisting of (n+1)×(n+1) points on the reference domain Γ based on (n+1) Chebyshev–Gauss–Lobatto points in the horizontal and vertical directions, (15) xi=12[1cos(iπn)],i=0,,n,yj=12[1cos(jπn)],j=0,,n,(15) and assume that grid points are numbered in lexicographic ordering. Assume also that the Chebyshev differentiation matrix is expressed in row-wise (resp. column-wise) form, (16) D=[r0TrnT]=[d0,,dn],ri,diRn+1,i=0,,n.(16) The next step is to approximate spatial derivatives in (Equation14) by using matrix D. In fact, along the horizontal direction, we have (17) x(k(xi,yj)Tx(xi,yj,t))ei+1TDKjxT˙j,i=0,,n,j=0,1,,n,(17) where ei, denotes the i-th canonical vector in Rn+1, Kjx is the diagonal matrix (18) Kjx=diag(k(x0,yj),k(x1,yj),,k(xn,yj)),j=0,1,,n,(18) and T˙j is the column vector (19) T˙j=[Tx(x0,yj,t),Tx(x1,yj,t),,Tx(xn,yj,t)]T,j=0,1,,n.(19) Clearly, provided the functions k and T are sufficiently regular, the order of approximation of (Equation17) is analogous to that given in (Equation13). Now let Tjx be defined by (20) Tjx=[x(k(x0,yj)Tx(x0,yj,t))x(k(xn,yj)Tx(xn,yj,t))],j=0,1,,n.(20) With this notation, observe that the approximation (Equation17) can be expressed in vector form as (21) TjxDKjxT˙j=i=0ndik(xi,yj)Tx(xi,yj,t),(21) and that, since the boundary conditions (Equation2)–(Equation3) and (Equation16) imply (22) i=0ndik(xi,yj)Tx(xi,yj,t)=d0f1(yj,t)+i=1n1dik(xi,yj)Tx(xi,yj,t)+dnh2(yj)T(xn,yj,t)dnf2(yj,t)=[D1KˇjD2+h2(yj)dnen+1T]Tj(t)+d0f1(yj,t)dnf2(yj,t),(22) where (23) Tj(t)=[T(x0,yj,t),T(x1,yj,t),,T(xn,yj,t)]T,(23) D1=[d1,,dn1],D2=[r1Trn1T],Kˇj=diag(k(x1,yj),,k(xn1,yj)), we conclude that (24) Tjx[D1KˇjD2+h2(yj)dnen+1T]Tj+d0f1(yj,t)dnf2(yj,t).(24) Before proceeding, note that although Tj is a vector function in Rn+1 that depends on time, to simplify the notation, Tj(t) will be denoted by Tj. With this convention and based on (Equation24), discretization of x(k(x,y)Tx(x,y,t)) along all grid points with T(xi,yj,t) enumerated in lexicographic order leads to (25) [T0xT1xTnx](F0F1Fn)[T0T1Tn]+[d0f1(y0,t)dnf2(y0,t)d0f1(y1,t)dnf2(y1,t)d0f1(yn,t)dnf2(yn,t)],(25) where we have set Fj=D1KˇjD2+h2(yj)dnen+1T, j=0,1,,n.

To discretize spatial derivatives with respect to y, let Tjy be defined as in (Equation20) with x replaced by y. Define also (26) T^iy=[y(k(xi,y0)Ty(xi,y0))y(k(xi,yn)Ty(xi,yn))],i=0,1,,n.(26) Then, using the respective spatial approximation with respect to y, analogous to (Equation17), and boundary conditions (Equation4)–(Equation5), an approximation result similar to (Equation25) is (27) [T^0yT^1yT^ny](G0G1Gn)[T^0T^1T^n]+[d0f3(x0,t)dnf4(x0,t)d0f3(x1,t)dnf4(x1,t)d0f3(xn,t)dnf4(xn,t)],(27) where Gi=D1K^iD2+h4(xi)dnen+1T, with K^i=diag(k(xi,y1),k(xi,y2),,k(xi,yn1)),i=0,1,,n, and T^i=[T(xi,y0,t),T(xi,y1,t),,T(xi,yn,t)]T,i=0,1,,n. Observe that the entries of T^i follow a different ordering as that described in (Equation23). To fix this, it suffices to observe that there exists a permutation matrix PR(n+1)×(n+1) such that (28) [T^0T^1T^n]=PT[T0T1Tn]and[T^0yT^1yT^ny]=PT[T0yT1yTny].(28) Recall that a permutation matrix is a matrix obtained by reordering the rows (or columns) of an identity matrix. Finally, mixing (Equation27) and (Equation28) along with the fact that PPT is the identity matrix as P is an orthogonal matrix, we have (29) [T0yT1yTny]P(G0G1Gn)PT[T0T1Tn]+P[d0f3(x0,t)dnf4(x0,t)d0f3(x1,t)dnf4(x1,t)d0f3(xn,t)dnf4(xn,t)].(29) To simplify the notation, here and throughout we use T(xi,yj,t) to denote both the exact value and the corresponding approximation associated with the discretized problem. With this assumption, (Equation14) together with (Equation25) and (Equation29) yield the initial value problem (30) {CdTdt=H(t,T),t(0,t¯]T(0)=T0,(30) where T0 has entries T0(xi,yj) with T0 defined in (Equation6) and grid points (xi,yj) being ordered in lexicographic form, (31) C=(C0C1Cn),Cj=diag(C(x0,yj),,C(xn,yj)),j=0,,n,(31) and (32) H(t,T)=[(F0F1Fn)+P(G0G1Gn)PT][T0T1Tn]+S(t),(32) with S(t) being used to accommodate terms that do not involve unknowns in (Equation25)–(Equation29) as well as source term g(x,y,t) values along the grid. Clearly, TT(t) is the vector formed by temperature values T(xi,yj,t) with grid points enumerated in lexicographic form, i.e, T=[T0Tn]. The semidiscrete model (Equation30), and hence, its solution, does not depend on the conductivity k at the corners. Therefore, if we try to recover conductivity values from temperature values that satisfy (30), difficulties should certainly appear because in these points, there is no explicit connection between k and temperature values. An alternative semi-discrete model that circumvents the above difficulty can be described as follows. Choose positive numbers α,β such that α+β=1 and then split the first and last term in the sum (Equation22) into (33) d0k(x0,yj)Tx(x0,yj,t)=αd0k(x0,yj)Tx(x0,yj,t)+βd0k(x0,yj)Tx(x0,yj,t)=αd0k(x0,yj)Tx(x0,yj,t)+βd0f1(yj,t),(33) (34) dnk(xn,yj)Tx(xn,yj,t)=αdnk(xn,yj)Tx(xn,yj,t)+βdnk(xn,yj)Tx(xn,yj,t)=αd0k(x0,yj)Tx(x0,yj,t)+β[h2(yj)T(xn,yj,t)dnf2(yj,t)dn].(34) The above equalities come on using the boundary conditions (Equation2) and (Equation3). Thanks to this splitting, Equation (Equation22) can be rewritten as (35) i=0ndik(xi,yj)Tx(xi,yj,t)=[D^KjxD+βh2(yj)dnen+1T]Tj+β[d0f1(yj,t)dnf2(yj,t)],(35) where D^=[αd0,d1,,dn1,αdn], and this serves as an alternative approximation for spatial derivatives with respect to x deduced in (Equation24). Proceeding this way, a similar expression for derivatives with respect to y can be deduced and then used to write down the following semi-discrete model (36) {CdTdt=H^(t,T),t(0,t¯]T(0)=T0,(36) where (37) H^(t,T)=MT+S^(t),(37) (38) M=(F^0F^1F^n)+P(G^0G^1G^n)PT,(38) with F^j=[D^KjxD+βh2(yj)dnen+1T],j=0,1,n,G^i=[D^KiyD+βh4(xi)dnen+1T],Kiy=diag(k(xi,y0),,k(xi,yn)),i=0,1,n. As before, S^(t) accommodates terms that do not include unknowns. Observe that by taking α=0, the Initial Value Problem (IVP) (Equation36) reduces to the IVP (Equation30).

When using a pseudospectral method to construct approximate solutions of PDEs, the error of the approximate solution in relation to the exact one is naturally of great interest. Theoretical error estimates regarding 1D and 2D problems are well documented in several places, e.g. Canuto et al. [Citation34] and Bernardi and Maday [Citation35]. In most cases, the behaviour of the error estimate is similar to that given in (Equation13). It depends essentially on the smoothness of the solution, the rule of thumb being: the more the exact solution is smooth, the more the approximate solution is accurate. The rationale behind this is as follows. Since the spatial derivatives of smooth solutions are approximated to high precision, the solution of the associated semi-discrete problem should be close to that of the original problem.

Next step is to apply a numerical method for IVPs to (Equation36) such as explicit and implicit Euler methods, trapezoidal rule method (Crank–Nicolson method), Runge–Kutta class methods, or any other. Among them, due to its simplicity, second-order accuracy, and mainly because it is absolutely stable, we choose Crank–Nicolson (CN) method [Citation36]. CN method becomes relevant in our context since the inversion procedure, to be described later on, requires the semi-discrete model (Equation36) to be solved many times for distinct conductivity values and with guaranteed stability. Having said that, we now proceed to describe how CN method works when applied to (Equation36). In fact, for a positive integer N, we define the time step Δt=t¯/N and discrete times tq=qΔt, q=0,1,,N, so that we have a uniformly spaced mesh on [0,t¯]. Recalling that the solution of the semidiscrete model (Equation36) depends on t, TT(t), CN method generates approximate solutions Tq to T(tq) given by CT(q+1)=CT(q)+Δt2(MT(q)+S(tq)+MT(q+1)+S(tq+1)),q=0,1,2,, where T(0)=T0 is the initial condition (Equation36). Rearranging terms, the CN method provides a fully discrete method characterized by being high-order accurate in space and second-order in time, i.e. temporal errors should dominate. It is given as (39) MmT(q+1)=MpT(q)+Δt2[S(tq)+S(tq+1)],q=0,1,2,,(39) where Mm:=CΔt2M and Mp:=C+Δt2M. Consequently, to calculate approximate solutions at each time step, the system of linear equations (Equation39) must be solved as efficiently as possible. One way to do this is to explore the sparse structure of the matrix Mm to calculate its LU or QR factorization and then use the factors to solve the system. In this case, the factorization is calculated only once at the beginning of the CN iteration. The sparse structure of Mm can be seen in Figure .

Figure 1. Sparse structure of matrix Mm, for n + 1 = 16 grid points on both directions.

Figure 1. Sparse structure of matrix Mm, for n + 1 = 16 grid points on both directions.

As an example to illustrate the effectiveness of the Chebyshev pseudospectral method in conjunction with Crank–Nicolson method, we consider a heat conduction model (Equation1)–(Equation6) on Γ=(0,1)×(0,1), 0tt¯=1, extracted from [Citation21], whose solution is given by (40) T(x,y,t)=(1et)(sin(πx)sin(πy)+(π+1)(x+y)),(40) for input data f1(y,t)=1+y12(1et)(πsin(πy)+π+1),f2(y,t)=2+y12(1et)(πsin(πy)+π+1),f3(x,t)=1+x12(1et)(πsin(πx)+π+1),f4(x,t)=2+x12(1et)(πsin(πx)+π+1),h2(y)=h4(x)=0, source term g(x,y,t)=(et+π26(1+x+y)(1et))sin(πx)sin(πy)++(π+1)et(x+y)1et12(πsin(π(x+y))+2π+2), capacity C = 1, and conductivity given by k(x,y)=1+x+y12. For this example, we calculate approximate solutions as well as the associated errors with respect to the exact solution of (Equation1)–(Equation6). CPM is implemented with n + 1 = 11 grid points in both horizontal and vertical directions, α=0 and CN runs with time step Δt=0.01. Numerical results for three time stages are shown in Figure . The accuracy of the approximate solutions is evident when we inspect the pointwise errors at the bottom of the figure.

Figure 2. Numerical solutions for three time stages (top) and associated pointwise absolute error (bottom).

Figure 2. Numerical solutions for three time stages (top) and associated pointwise absolute error (bottom).

To better evaluate the accuracy of the fully discrete method, we calculate normwise errors in the maximum norm defined by (41) Eq=||T(q)T(tq)maxk|[T(q)T(tq)]k|,(41) where T(t) has entries T(xi,yj,t) with gridpoints (xi,yj) in lexicographic order. For q=10,q=50 and q=100, which correspond to t=0.1,t=0.5 and t = 1, respectively, we obtain E10=5.9071×106,E50=2.1342×105andE100=3.0597×105. The errors not only verify the theoretical accuracy predicted in theory, but also the potentiality of the proposed method of providing approximate solutions, with relatively high accuracy, using few grid points and therefore at low cost.

3. Numerical method for the inverse problem – isotropic case

Based on the solver for the direct problem described in the previous section, we will now introduce a method for solving the associated inverse problem, namely the conductivity estimation problem from measured transient data. For the numerical treatment of the inverse problem, unknowns values k(xi,yj) are arranged in a vector, say k, and function values T (temperature or electrostatic potential) are regarded as functions of k. Assuming that the unknowns are ordered like functions values, let (42) =i+j(n+1)+1,i=0,1,,n,j=0,1,,n.(42) It is clear that for each unknown k(xi,yj) there is a unique entry k of k so that the lexicographic ordering of the grid (xi,yj), as illustrated in Figure , is preserved in the entries of k. The ordering in k is taken into account when implementing the inverse problem approach to be described below. The objective of this section is to determine conductivity values k from measured temperature values T~=T+e, where e denotes an unknown perturbation term and T is a vector of exact temperature values on the grid at time steps tq,q=1,,N, (43) T=[T(t1)T(tN)]RN(n+1)2,(43) where T(tq)R(n+1)2 has entries T(tq)T(xi,yj,tq) with ℓ defined in (Equation42) and (xi,yj) following the lexicographic ordering. Similar observations apply to T~. We will assume that an estimate of the error e is available such that (44) T~T=eδ,(44) though, of course, this may not be the case when dealing with reconstruction problems based on experimental measurements. If not, it is essential that the error e be estimated either based on prior knowledge or by other means. In addition, observe that, under mild conditions, for each k we can solve the direct problem (Equation36) to produce a related solution T(k,t) which contains approximations to T(xi,yj,t) on the grid, and hence, a vector T(k) with entries T(tq)(k). With this observation in mind the conductivity estimation problem can be formulated as one of estimating a vector k such that the difference between computed solutions T(k), and measured transient data T~ is small enough in some sense. More precisely, the estimation problem can be formulated as the non-linear least squares problem of finding k such that (45) k=argminkR(n+1)2ϕ(k),ϕ(k)=12T(k)T~22=12j=1N=1(n+1)2(T(tj)(k)T~(tj))2,(45) where k=[k1,,k(n+1)2]T is the vector of unknowns. The non-linear problem (Equation45) can be handled in a number of ways, e.g. by using Tikhonov regularization, trust region methods, nonmonotone line search methods, Levenberg–Marquardt method (LMM), or others, see, e.g. [Citation37–40], though clearly the presence of multiple minima cannot be ruled out.

Figure 3. Example of grid, for n = 4, enumerated in accordance with Equation (Equation42), i.e, lexicographic order.

Figure 3. Example of grid, for n = 4, enumerated in accordance with Equation (Equation42(42) ℓ=i+j(n+1)+1,i=0,1,…,n,j=0,1,…,n.(42) ), i.e, lexicographic order.

In this work, we choose to apply the Levenberg–Marquardt method (LMM), following a version described by Yamashita and Fukushima [Citation32] who consider LMM with line search and Armijo stepsize rule. To simplify the notation, we set ϕ(k)=12F(k)22, and observe that the first-order necessary condition for minimization of φ, ϕ(k)=0, reads (46) FT(k)[T(k)T~]=0,(46) where F(k) denotes derivative of F(k). LMM minimizes iteratively ϕ(k) (hence solves approximately (Equation46)) by determining an estimate, say k^, such that FT(k^)[T(k^)T~]2 is small enough. Roughly speaking, chosen an initial guess k0, the underlying idea of LMM is to determine a minimizer of ϕ(k) iteratively by solving a sequence of linear problems of the form (47) (FT(kj)F(kj)+μjΩj)d=FT(kj)F(kj),j=0,1,,(47) where μj is the so-called damping parameter and Ωj is a diagonal matrix with positive entries. Yamashita and Fukushima choose Ωj as the identity matrix and the damping parameter μj as the squared residual norm at iteration j. This version can be described as follows.

  • LMM with line search

  • Step 0: choose parameters ν,η,γ(0,1) and a initial guess k0R(n+1)2. Set j = 0 and μ0=F(k0)22.

  • Step 1: If kj satisfies some termination criterion, stop. Else, go to Step 2.

  • Step 2: Find the solution dj of the system of linear equations (Equation47). If dj satisfies F(kj+dj)2γF(kj)2,then kj+1=kj+dj and go to Step 4. Otherwise, go to Step 3.

  • Step 3: (Armijo's rule) Take m as the smallest nonnegative integer such that ϕ(kj+ηmdj)ϕ(kj)νηmϕ(kj)Tdjand set kj+1=kj+ηmdj.

  • Step 4: Set μj+1=F(kj+1)22 and j = j + 1. Go to Step 1.

As already mentioned, in this work Ωj in (Equation47) is replaced by a product of the form RTR, in a way that dj is computed by solving the ‘scaled’ system (48) (FT(kj)F(kj)+μjRTR)d=FT(kj)F(kj),(48) where R is a 2D discrete differentiation operator referred to as regularization matrix, RLi, (49) Li=[In+1Li(n+1)Li(n+1)In+1],i=1,2,(49) with matrices L1(m)R(m1)×m and L2(m)R(m2)×m given by L1(m)=[1111]andL2(m)=[121121]. Matrices L1 and L2 represent, respectively, the 1D first and second order discrete differentiation operator; they are often employed in image reconstruction problems [Citation41]. Such a modification is introduced to incorporate in the iterative process smoothing properties of conductivity in the horizontal and vertical directions, as well as to provide the reader with further insight into the role that the regularizer matrix plays into the stabilization of the inverse problem. Illustrative numerical examples are postponed to next sections.

3.1. Jacobian matrix

To apply Step 2 above, it is mandatory the calculation of the derivative of F with respect to k, that depends, implicitly, on the derivative of T with respect to the same variable. Variation of T as a function of k at time step t is determined by the Jacobian of T denoted here by J(t), J(t)=[Tk1(k,t),,Tk(n+1)2(k,t)]. It can be calculated as follows. Taking derivative with respect to k on both sides of (Equation36) and assuming continuity of T leads to (50) Cddt(Tk(k,t))=MkT(k,t)+MTk(k,t),(50) where M denotes the matrix in (Equation38). Now, since initial values of T do not depend on k, it follows that the ℓ-th column of the Jacobian matrix solves the IVP (51) {Cddt(Tk(k,t))=MTk(k,t)+W(t)Tk(k,0)=0,(51) where (52) W(t)=MkT(k,t),=1,2,,(n+1)2.(52) Therefore, computing the columns of the Jacobian matrix requires solving a sequence of IVPs with distinct source terms W(t), the computation of which is straightforward. In fact, it is not difficult to see that Mk1=[(αd0r0T00)+P(αd0r0T00)PT],Mk2=[(d1r1T00)+P(0αd0r0T0)PT],Mkn+1=[(αdnrnT00)+P(00αd0r0T)PT]. Derivatives with respect to the remaining parameters are calculated similarly. Summarizing, to apply LMM we compute F(k) at every needed time step, which is done by solving IVPs (Equation51) for all source terms. For clarity purposes, observe that, because T is vector in RN(n+1)2 with block entries T(ti), i=1,,N, as described in  (Equation43), it is easy to see that F(k)=[J1J2JN], where Jq=J(tq) represents the Jacobian of T with respect to k at time step tq, for q=1,2,,N. We recall that the computation of Jq demands to solve a sequence of (n+1)2 IVPs and, in this sense, for large problems, it may be interesting to use parallel computing, in order to reduce operational time along with memory usage.

3.2. Numerical experiments

We report the outcome of numerical experiments to illustrate the effectiveness of the proposed reconstruction method on some test problems using synthetic data. To this end, noisy measurements are simulated by adding random noise to actual data T(xi,yj,tq) and arranged in a vector data T~=T+e, with T defined in (Equation43), where e is a vector containing zero-mean random numbers scaled in such a way that (53) T~T=NLTδ,(53) where NL stands for relative noise level in the data and δ is assumed to be available.

As it is often difficult to get exact solutions of problem (Equation1)–(Equation6), in our numerical experiments we will use synthetic data generated by solving the transport model by the finite element method (FEM). For this, the input data will be extracted from [Citation20]. Before proceeding, it is instructive to compare the quality of the solutions obtained by CPM and FEM. To do this, we will use the test problem described at the end of Section 2 whose solution is available. Recall that, when using FEM the domain Γ is discretized using a triangular mesh with N~ nodes and approximate solutions are determined as (54) Th(x,y,t)=i=1N~ci(t)φi(x,y),(x,y,t)Γ×(0,t¯],(54) where ci(t) are unknowns to be determined and φi are frequently called hat functions, every one being continuous, piecewise linear (a plane inside every element/triangle of the mesh), taking a unit value at the ith node and zero elsewhere. Parameter h stands for the global mesh size, defined as the maximum distance between each pair of nodes in the mesh. Considering C(x,y)=C constant in (Equation1), FEM turns Equations (Equation1)–(Equation6) into the semidiscrete model (55) {CM~dcdt+A~c=f~(t),t(0,t¯]c(0)=0,(55) where c(t)=(c1(t),,cN~(t))T, M~ and A~ are N~×N~ known as mass and stiffness matrices, resp., and f~(t) contains other independent terms. For more information on FEM discretization, see, for example, Johnson [Citation42] and Larson and Bengzon [Citation43].

To proceed, in a similar way as the semi-discrete problem associated with CPM was solved in Section 2, here the IVP (Equation55) is also solved by the CN method. This guarantees second-order approximations in space and time, the order in space being in relation to the global mesh size h [Citation42]. We calculate approximate solutions Th for h=0.1 and h=0.01 with Δt=0.01 for both cases. In practice, for chosen h, a structured mesh with N~ nodes is generated using MATLAB's PDE toolbox via function initmesh. It is noteworthy mentioning that for the above values of h correspond meshes consisting of N~=185 and N~=17361 nodes, respectively. Note that the refinement made on the mesh results in a significant increase in computational cost, since the unknowns of the linear system grow from N~=183 to N~=17361, which in addition to making the application of the CN method more expensive, allows for the potential growth of rounding errors. Figure  shows a simple mesh, the respective approximate solution Th at time t = 1 and also the structure of mass matrix, which is repeated in the stiffness matrix, both with density around 3.53%.

Figure 4. Example of FEM mesh's structure with 185 nodes, 328 triangles and global mesh size h = 0.1 (left), respective solution with CN (Δt=0.01) at time t = 1 (middle) and sparse structure of mass matrix M~ (right), for data used in Figure .

Figure 4. Example of FEM mesh's structure with 185 nodes, 328 triangles and global mesh size h = 0.1 (left), respective solution with CN (Δt=0.01) at time t = 1 (middle) and sparse structure of mass matrix M~ (right), for data used in Figure 2.

Normwise errors in the maximum norm are also calculated and defined by E^h(t)=max(xi,yj)|Th(xi,yj,t)T(xi,yj,t)|, with (xi,yj) varying along the nodes. The errors associated with the approximate solutions Th for three time stages are shown in Table , in which, for comparison purposes, the errors of CPM-based solutions reported in the previous section are also included. The superior performance of CPM is clearly seen in this table, even in the case h = 0.01, which involves working with linear systems of size 17361×17361, as opposed to CPM that deals with systems 121×121. This ends the desired comparison.

Table 1. Errors associated with FEM-based solutions for two global mesh sizes and errors associated with CPM.

We are now in a position to describe some examples of conductivity reconstruction. As already mentioned, in practice, it is difficult to get exact temperature data T(x,y,t) satisfying the conduction model. So, to challenge the reconstruction method, we shall use synthetic data generated by solving the direct problem through the combination of FEM and CN method. All our numerical experiments are conducted using the following input data: (56) g(x,y,t)=0,f1(y,t)=1,f2(y,t)=1,(56) (57) f3(x,t)=1,f4(x,t)=1,andh2(y)=h4(x)=0,(57) with Γ=(0,1)×(0,1), t¯=1, capacity C(x,y)=1, initial condition T0(x,y)=0, with conductivity k(x,y) to be described in consonance with the case/example under consideration. For all examples, we choose h=0.01=Δt; Thus, Th is O(h2)=O(104) at each time step and we accept it as a reliable alternative to the lack of exact T. We emphasize that our decision of using FEM instead of CPM to generate data obeys the need to prevent the inverse crime [Citation44,Citation45]: the inverse crime occurs when the same numerical model is used to synthesize as well as to invert data in an inverse problem. For this, in our numerical examples, what we call ‘exact’ data, supposed to satisfy (Equation43), are values obtained by interpolating Th in the grid generated by Chebyshev points.

All the reconstruction examples in this section are performed using noisy data T~ consisting of temperature values on a grid of n + 1 = 16 Chebyshev points associated with N = 5 time steps equally distributed from 0.1 to 0.5. We emphasize that, since the number of unknowns is (n+1)2 and as the temperature in the entire mesh generates (n+1)2 non-linear relations at each stage of time, temperature values of only N = 2 stages are sufficient to make the minimization problem overdetermined. In this case, N=2 would be sufficient to determine conductivity estimates. Our choice N = 5 aims to make the minimization problem highly overdetermined to, on the one hand, allow better noise filtering and, on the other hand, expect that the minimization process results in a unique solution. As for LMM, it starts with k0 constant equals to 1/12 along the grid. Other parameters, necessary to LMM, are taken as η=γ=0.7 and ν=102. For more information about LMM parameters the reader is referred to [Citation46, Section ].

To terminate the estimation process, LMM is stopped by means of the discrepancy principle (DP) [Citation33], i.e. the iterates stop at the first j such that (58) T(kj)T~2τδ,τ>1,(58) where τ is a safeward parameter intended to prevent underestimation of the data error T~T, see (Equation44). In agreement with this, τ is often chosen close to 1, though the option τ close to 2 be also found in literature, see e.g. [Citation47, Section ]. In this paper, all implementations consider τ=1.1. DP is commonly used in solving ill-posed problems because the presence of noise in the data generates during the iterations the so-called semi-convergence behaviour [Citation48]. Roughly speaking, while early iterations capture relevant information about the sought solution, as j grows the iterations capture information about the noise, corrupting gradually the iterative process, see Figure . Thus, DP is used to select an iterate capable of balancing accuracy and stability on the approximate solution.

Figure 5. Residual T(kj)T~2 (left) and relative error between kj and the exact conductivity k(x,y)=(1+x+y)/12 (right), NL=1%.

Figure 5. Residual ‖T(kj)−T~‖2 (left) and relative error between kj and the exact conductivity k(x,y)=(1+x+y)/12 (right), NL=1%.

As for notation on tables and figures, RE denotes the relative error between exact k and the approximation and IE represents the relative error considering just grid points inside the grid, i.e. without nodes on the boundary. The error IE is calculated to evaluate the behaviour of the solutions close to the boundary.

Example 1

Consider conductivity k(x,y)=(1+x+y)/12 and noise level NL=0.1%. This first example aims to show the influence of different scaling matrices RTR introduced in (Equation48) on the LMM performance in solving the inverse problem, for different values of α in the modified model (Equation36). Figure  compares generated solutions. Note that the effect of increasing α, Figure (b, c), is seen as a correction of discontinuities at the edge of the reconstructions. This can be explained as follows. As α+β=1 with α,β0, a consequence of (Equation33)–(Equation34) is that, if on the one hand α1 strengthens the conductivity values k(x,y) on the boundaries, on the other hand, β=1α0 weakens the influence of both the source term and boundary conditions in the model, see again (Equation33)–(Equation34). Clearly, a reverse effect should occur when α0. This suggests that an appropriate balance between α and β is needed, and this is what the numerical results seem to show as α increases starting from values close to zero. However, finding such a balance is difficult since, as it can be seen in Table , as α increases, the number of iterations increases as well, bringing instability to the iterates and not significantly reducing the relative error. On the other hand, Figure (d,e) presents the regularizing effect of L1 and L2 from (Equation49), smoothing up the solutions, both in interior and boundary, even with α=0.1, at a fairly small number of iterations. Tests for bigger/smaller α brought small changes and were omitted. Table  summarizes the results. In resume, the information we emphasize is that a good choice of the regularizer R can lead to good results, while reducing computational effort.

Figure 6. Comparison between solutions for k(x,y)=(1+x+y)/12 calculated with different choices of R and α. For (d) and (e), α=0.1.

Figure 6. Comparison between solutions for k(x,y)=(1+x+y)/12 calculated with different choices of R and α. For (d) and (e), α=0.1.

Table 2. Errors and number of iterations until DP is satisfied, for different choices of R and α, considering as exact k(x,y)=(1+x+y)/12.

Example 2

Consider again noise level NL=0.1% and take α=0.1 fixed. To assess the effectiveness of the proposed method we choose four different conductivities, namely: k1(x,y)=(1+x2+y2)/12,k2(x,y)=(1+y3)/12,k3(x,y)=(1+sin(x)sin(y))/12andk4(x,y)=(1+sin(2πx)2sin(2πy)2)/12, where the regularizer is taken as R=L2. Considering the results in Figure , it is observed that the use of R produces smoothing effect on the calculated solutions k, while reducing unwanted artefacts both at the boundary and inside Γ. Results are summarized in Figure  and Table . Observe the good fitting between the sought solution and the reconstruction, with small number of iterations, contributing on the numerical applications, especially bearing in mind that calculating the Jacobian may become expensive.

Figure 7. Contour plots for every exact ki(x,y) (top) and the respective approximation (bottom), i=1,,4.

Figure 7. Contour plots for every exact ki(x,y) (top) and the respective approximation (bottom), i=1,…,4.

Table 3. Relative errors RE, interior error IE and number of LMM iterations until DP is satisfied.

Example 3

This last example refers to the reconstruction of three conductivities namely, k used in Example 1 and k2, k3 of the previous example, for different noise levels in the data. To this end we set R=L2, α=0.1 and choose three values for NL: 2.5%, 1.5% and 0.5%. In Figure , top, we show reconstructions of the lines k(0.5,y), k2(0.5,y) and k3(0.5,y), while at the bottom, we show relative reconstruction errors on the whole mesh. The numerical results confirm what is often seen in solving inverse problems: as NL gets smaller, so does the reconstruction error.

Figure 8. Behavior of reconstruction results solution for different noise levels. Top: exact conductivities and respective approximations along the line x = 0.5. Bottom: absolute value of reconstruction errors along the line x = 0.5 and relative error RE.

Figure 8. Behavior of reconstruction results solution for different noise levels. Top: exact conductivities and respective approximations along the line x = 0.5. Bottom: absolute value of reconstruction errors along the line x = 0.5 and relative error RE.

4. Numerical method for the inverse problem – orthotropic case

We now consider the conduction problem (Equation1)–(Equation6) focusing on the orthotropic case, i.e.with conductivity tensor K(x,y)=[k11(x,y)00k22(x,y)], and we first describe the Chebyshev pseudospectral method used to solve the direct problem. To this end rewrite (Equation1) as (59) C(x,y)Tt(x,y,t)=x(k11(x,y)Tx(x,y,t))+y(k22(x,y)Ty(x,y,t))+g(x,y,t)(59) and then note that along the horizontal direction we have a process identical to (Equation17), changing k by k11 in (Equation20), which leads to (60) [T0xT1xTnx](F0F1Fn)[T0T1Tn]+[d0f1(y0,t)dnf2(y0,t)d0f1(y1,t)dnf2(y1,t)d0f1(yn,t)dnf2(yn,t)](60) where we set Fj=D1KˇjD2+h2(yj)dnen+1T with Kˇj=diag(k11(x1,yj),,k11(xn1,yj)),j=0,1,,n, and T(xi,yj) enumerated in lexicographic order. For approximations in the vertical direction we change k by k22 and proceed as before, leading to the initial value problem (61) {CdTdt=H(t,T),t(0,t¯]T(0)=T0,(61) where (62) H(t,T)=[(F0F1Fn)+P(G0G1Gn)PT][T0T1Tn]+S(t),(62) for Gi=D1K^iD2+h4(xi)dnen+1T, with K^i=diag(k22(xi,y1),k22(xi,y2),,k22(xi,yn1)),i=0,1,,n, and S(t) being used to accommodate terms which do not involve unknowns. Essentially, blocks of F and G contain information related to k11 and k22, respectively, and the rest is kept unchanged, exactly as in the isotropic case.

It is easy to see that function T does not depend on the values of k11(x,y) when x=x0 and x=xn, due to the incorporation of boundary conditions. The same happens for k22(x,y) at y=y0 and y=yn, and again, an alternative discretization can be applied, for example, by considering a strategy such as in (Equation33) and (Equation34). Therefore, the semidiscrete model takes the form (63) {CdTdt=H^(t,T),t(0,t¯]T(0)=T0,(63) where (64) H^(t,T)=MT+S^(t),(64) and, again following previous notation, (65) M=(F^0F^1F^n)+P(G^0G^1G^n)PT,(65) for F^j=[D^KjxD+βh2(yj)dnen+1T],Kjx=diag(k11(x0,yj),,k11(xn,yj)),j=0,1,n,G^i=[D^KiyD+βh4(xi)dnen+1T],Kiy=diag(k22(xi,y0),,k22(xi,yn)),i=0,1,n, and S^(t) accommodating independent terms. Although the notation here is the same as for the isotropic case, the reader should be aware that the variables involved are different. In fact, in the previous sections, Kjx and Kiy were related exclusively to k, while in the orthotropic case they involve k11 and k22. Ultimately, above relations recover the isotropic case by taking k11=k22=k.

Remark 1

Although beyond the scope of this paper, we note that a semi-discrete model similar to (Equation63) for the anisotropic case can be developed analogously. In fact, in this case, Equation (Equation1) takes the form CTt=x(k11Tx+k12Ty)+y(k21Tx+k22Ty)+g and every derivative can be approximated using CPM as in (Equation25) and (Equation29).

As for the inverse problem for the orthotropic case, the goal now is to recover conductivities k11 and k22 based on data values of T contaminated by noise. We start by enumerating variables k11(xi,yj) and k22(xi,yj) as done before (see Section 3) and arranging them in a vector (66) k=[k11k22],k11,k22R(n+1)2,(66) such that function values T depend on k. The computation of the Jacobian matrix J(t)=[Tk1(k,t),,Tk2(n+1)2(k,t)]=[Tk111(k,t),,Tk(n+1)211(k,t),Tk122(k,t),,Tk(n+1)222(k,t)] remains essentially the same, but noting that the problem now has 2(n+1)2 unknown parameters and that the source terms in (Equation52) change accordingly. Of course, if M is the matrix in (Equation65), its derivatives with respect to k involve only F^j blocks for the first (n+1)2 entries of k and G^i: Mk111=(αd0r0T00),Mk211=(d1r1T00),, Mk(n+1)211=(00αdnrnT),Mk122=P(αd0r0T00)PT, Mk222=P(d1r1T00)PT,,Mk(n+1)222=P(00αdnrnT)PT, hence the computation of the source term in (Equation52) is straightforward. Thus in this case, computing J requires solving 2(n+1)2 IVPs. Nevertheless, we anticipate that in practice only a small number of Jacobians are necessary to produce good results, which in a way encourages the application of the proposed method.

Next step is to apply LMM coupled with the discrepancy principle, as described in Section 3, to the problem of finding k such that (67) k=argminkR2(n+1)2ϕ(k),ϕ(k)=12T(k)T~22,(67) with T~ as in (Equation53).

As before, for the scaling matrix used in LMM we choose a regularizer R, that now has the form (68) R=[R1100R22],(68) where R11 and R22 are introduced to enforce smoothness on k11 and k22 separately. To see this effect more clearly, it suffices to note that LMM solves (Equation48), (69) argminkR2(n+1)2{F(kj)dF(kj)22+(μj)2Rd22},(69) with d partitioned as k in (Equation66), i.e, d possessing two blocks d11 and d22, with the same size and position of k11 and k22, respectively, so that Rd=[R11d11R22d22], with every block of the regularizer acting in each part of the unknown vector d during the minimization process (Equation69). It is important to note that there is no formula for choosing R and that the choice is related to the properties we want to preserve in our solutions. For numerical computations we use R11=R22=Li, i = 1, 2, as described in (Equation49); this choice incorporates smoothing properties directly in k11 and k22.

4.1. Numerical experiments

Similar to the isotropic case, we report numerical results to illustrate the effectiveness of the proposed method using synthetic data generated by FEM. All FEM and LMM parameters are as in Section 3.2. For CPM we take n + 1 = 16 grid points in each direction and consider noise level NL=0.1% and α=0.1. Data comprises T values at N = 5 time steps equally distributed from 0.1 to 0.5 and initial data k0 is taken again as the constant 1/12. It is important to mention, however, that temperature measurements at 3 time stages turn the problem to be overdetermined, in which case we could expect a unique solution. As for the regularization matrices, due to the good fitting presented in our previous numerical results, we choose R11=R22=L2 for all tests of this section, with R in the form of (Equation68). Finally, our tests are divided into three cases, described in Table , with respective results displayed in Table  and contour plots in Figure .

Table 4. Orthotropic test cases.

Table 5. Orthotropic results for test cases in Table .

Figure 9. Contour plots for every test case: 1 (top row), 2 (middle row) and 3 (bottom row).

Figure 9. Contour plots for every test case: 1 (top row), 2 (middle row) and 3 (bottom row).

Observe that the number of unknowns is 2(n+1)2=512, which is exactly the number of IVPs solved every time the Jacobian is needed. However, as it can be seen in Table , good quality results are obtained in a fairly small number of iterations (and so Jacobian computations as well). This shows that the proposed strategy is able to produce suitable reconstructions with acceptable computational effort, and so it can be useful in real-world applications.

Finally, to assess the quality of recovered conductivities, we solve the forward problem with conductivities determined with LMM as input data and compare the obtained solution T against the considered true function values T (generated via FEM). Results of the comparison as well as absolute value of the errors are displayed in Figure . Good fitting between data provided by FEM and approximations given by our strategy is evident. We point out that FEM solutions may contain imprecisions due to discretization and/or numerical treatment, in a way that our solutions could be even better if compared with the exact (yet unavailable) data.

Figure 10. Computed temperature T (above) at time t = 1 using the reconstructions of k11 and k22 and the absolute value of the error between FEM's T and its approximation (below).

Figure 10. Computed temperature T (above) at time t = 1 using the reconstructions of k11 and k22 and the absolute value of the error between FEM's T and its approximation (below).

5. Conclusion

We proposed a method for reconstructing orthotropic conductivity combining a solver for the direct problem based on a Chebyshev pseudospectral method and Levenberg–Marquardt method for solving a related non-linear least squares problem. The main feature of the proposed method is that instead of scaling diagonal matrices often used in classical LMM implementations, we use singular regularization matrices. The impact of such a replacement in terms of the quality of the reconstruction was illustrated with numerical experiments. Further, to cope with typical instabilities in inverse reconstruction problems due to noise in the data, we use the Morozov discrepancy principle as stopping rule. Numerical results show a good fit between exact and recovered conductivity at a very low computational cost, and exceptional stability when inverting noisy data due to the use of Morozov's stopping rule. The reason for this is because the method is simple to implement as well as because CPM builds high-order approximations using a rather small set of discretization points, contributing greatly for the reduction of computational effort. Future work includes the extension to simultaneous reconstruction of capacity and conductivity, as well as the LMM convergence analysis under the assumption that scaling matrices are replaced by singular matrices, as made in this paper.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

EB was supported by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Brazil [grant number 88887.178114/2018-00] and Fundação de Amparo à Pesquisa e Inovação do Estado de Santa Catarina (FAPESC), Brazil [grant number 88887.178114/2018-00]. FSVB was supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Brazil [grant number 308523/2017-2].

References

  • Özişik MN. Boundary value problems of heat conduction. Mineola (NY): ITC; 1968.
  • Alessandrini G, de Hoop MV, Gaburro R. Uniqueness for the electrostatic inverse boundary value problem with piecewise constant anisotropic conductivities. Inverse Probl. 2017;33(12):125013.
  • Astala K, Päivärinta L. Calderón's inverse problem for anisotropic conductivity in the plane. Commun Par Diff Eq. 2005;30(1–2):207–224.
  • Gaburro R, Sincich E. Lipschitz stability for the inverse conductivity problem for a conformal class of anisotropic conductivities. Inverse Probl. 2015;31(1):015008.
  • Isakov V. Inverse problems for partial differential equations. 2nd ed. Applied Mathematical Sciences. New York (NY): Springer; 2006.
  • Alifanov OM. Inverse heat transfer problems. 1st ed. International Series in Heat and Mass Transfer. Berlin: Springer-Verlag; 1994.
  • Elayyan A, Isakov V. On uniqueness of recovery of the discontinuous conductivity coefficient of a parabolic equation. SIAM J Math Anal. 1997;28(1):49–59.
  • Carrera J, Neuman SP. Estimation of aquifer parameters under transient and steady-state conditions: 2. Uniqueness, stability and solution algorithms. Water Resour Res. 1986;22:211–227.
  • Gol'dman NL. Inverse problems with final overdetermination for parabolic equations with unknown coefficients multiplying the highest derivative. Dokl Math. 2011;83:316–320.
  • Huntul MJ, Hussein MS, Lesnic D, et al. Reconstruction of an orthotropic thermal conductivity from non-local heat flux measurements. Int J Math Model Numer Optim. 2020;10(1):102–122.
  • Lionheart WRB. Conformal uniqueness results in anisotropic electrical impedance imaging. Inverse Probl. 1997;13(1):125–134.
  • Mejias MM, Orlande HRB, Özişik MN. Effects of the heating process and body dimensions on the estimation of the thermal conductivity components of orthotropic solids. Inverse Probl Sci Eng. 2003;11(1):75–89.
  • Rodrigues FA, Orlande HRB, Mejias MM. Use of a single heated surface for the estimation of thermal conductivity components of orthotropic 3D solids. Inverse Probl Sci Eng. 2004;12(5):501–517.
  • Yen RH, Chen CY, Huang CT, et al. Numerical study of anisotropic thermal conductivity fabrics with heating elements. Int J Numer Method Heat Fluid Flow. 2013;23(5):750–771.
  • Pasdunkorale J, Turner IW. A second order finite volume technique for simulating transport in anisotropic media. Int J Numer Method Heat Fluid Flow. 2003;13(1):31–56.
  • Kohn R, Vogelius M. Determining conductivity by boundary measurements II. Interior results. Commun Pure Appl Math. 1985;38(5):643–667.
  • Mera NS, Elliott L, Ingham DB, et al. A comparison of different regularization methods for a Cauchy problem in anisotropic heat conduction. Int J Numer Method Heat Fluid Flow. 2003;13(5):528–546.
  • Mera NS, Elliott L, Ingham DB, et al. An iterative BEM for the Cauchy steady state heat conduction problem in an anisotropic medium with unknown thermal conductivity tensor. Inverse Probl Eng. 2000;8(6):579–607.
  • Zhou HL, Xiao X, Chen HL, et al. Identification of thermal conductivity for orthotropic FGMs by DT-DRBEM and L-M algorithm. Inverse Probl Sci Eng. 2020;28(2):196–219.
  • Mahmood MS, Lesnic D. Identification of conductivity in inhomogeneous orthotropic media. Int J Numer Method Heat Fluid Flow. 2019;29(1):165–183.
  • Cao K, Lesnic D, Cola MJ. Determination of thermal conductivity of inhomogeneous orthotropic materials from temperature measurements. Inverse Probl Sci Eng. 2019;27(10):1372–1398.
  • Monard F, Rim D. Imaging of isotropic and anisotropic conductivities from power densities in three dimensions. Inverse Probl. 2018;34(7):075005.
  • Mustonen L. Numerical study of a parametric parabolic equation and a related inverse boundary value problem. Inverse Probl. 2016;32(10):105008.
  • Borggaard J, van Wyk HW. Gradient-based estimation of uncertain parameters for elliptic partial differential equations. Inverse Probl. 2015;31(6):065008.
  • Young LC. Orthogonal collocation revisited. Comput Methods Appl Mech Engrg. 2019;345:1033–1076.
  • Berntsson F. A spectral method for solving the sideways heat equation. Inverse Probl. 1999;15(4):891–906.
  • Ismailov MI, Bazán FSV, Bedin L. Time-dependent perfusion coefficient estimation in a bioheat transfer problem. Comput Phys Commun. 2018;230:50–58.
  • Bazán FSV. Chebyshev pseudospectral method for wave equation with absorbing boundary conditions that does not use a first order hyperbolic system. Math Comput Simul. 2010;80(11):2124–2133.
  • Bazán FSV, Bedin L, Bozzoli F. New methods for numerical estimation of convective heat transfer coefficient in circular ducts. Int J Therm Sci. 2019;139:387–402.
  • Trefethen LN. Spectral methods in MATLAB. Vol. 10. Philadelphia (PA): Society for Industrial and Applied Mathematics; 2000.
  • Gottlieb D, Orzag SA. Numerical analysis of spectral methods: theory and applications. Philadelphia (PA): SIAM; 1977.
  • Yamashita N, Fukushima M. On the rate of convergence of the Levenberg-Marquardt method. In: Alefeld G, Chen X, editors. Topics in numerical analysis. Vol. 15. Vienna: Springer; 2001. p. 239–249.
  • Morozov VA. Regularization methods for solving incorrectly posed problems. 1st ed. New York (NY): Springer-Verlag; 1984.
  • Canuto C, Hussaini MY, Quarteroni A. Spectral methods in fluid dynamics. 1st ed. Springer Series in Computational Physics. Berlin: Springer-Verlag; 1988.
  • Bernardi C, Maday Y. Approximations spectrales de problèmes methods aux limites elliptiques. 1st ed. Berlin: Springer-Verlag; 1992.
  • Crank J, Nicolson P. A practical method for numerical evaluation of solutions of partial differential equations of the heat-conduction type. Math Proc Cambridge Philos Soc. 1947;43(1):50–67.
  • Tikhonov AN, Goncharsky AV, Stepanov VV. Numerical methods for the solution of ill-posed problems. 1st ed. Vol. 328. Dordrecht: Springer Science & Business Media; 1995.
  • Toint PL. Non-monotone trust region algorithm for nonlinear optimization subject to convex constraints. Math Program. 1997;77(3):69–94.
  • Francisco JB, Bazán FSV. Nonmonotone algorithm for minimization on closed sets with application to minimization on Stiffen manifolds. J Comput Appl Math. 2012;236(10):2717–2727.
  • Moré JJ. The Levenberg-Marquardt algorithm: implementation and theory. In: Watson GA, editor. Numerical analysis. Lecture Notes in Mathematics; Vol. 630. Berlin: Springer; 1978. p. 105–116.
  • Bazán FSV, Cunha MCC, Borges LS. Extension of GKB-FP algorithm to large-scale general-form Tikhonov regularization. Numer Linear Algebra. 2014;21(3):316–339.
  • Johnson C. Numerical solution of partial differential equations by the finite element method. 1st ed. Cambridge: Cambridge University Press; 1987.
  • Larson MG, Bengzon F. The finite element method: theory, implementation, and applications. 1st ed. Berlin: Springer; 2013; Texts in Computational Science and Engineering.
  • Mueller J, Siltanen S. Linear and nonlinear inverse problems with practical applications. Vol. 10. Philadelphia (PA): Society for Industrial and Applied Mathematics; 2012.
  • Chávez CE, Atienza FA, Álvarez D. Avoiding the inverse crime in the inverse problem of electrocardiography: estimating the shape and location of cardiac ischemia. In: Computing in cardiology 2013; 2013. p. 687–690.
  • Kelley CT. Iterative methods for optimization. Philadelphia (PA): Society for Industrial and Applied Mathematics; 1999.
  • Kaltenbacher B, Neubauer A, Scherzer O. Iterative regularization methods for nonlinear ill-posed problems. Radon Series on Computational and Applied Mathematics; Vol. 6. Berlin: De Gruyter; 2008.
  • Natterer F. The mathematics of computerized tomography. Stuttgart: Wiley; 1986.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.