282
Views
1
CrossRef citations to date
0
Altmetric
Articles

Quasi-optimal Tikhonov penalization and parameterization coarseness in space-dependent function estimation

, , , &
Pages 465-481 | Received 04 Jul 2014, Accepted 28 Apr 2015, Published online: 01 Jun 2015

Abstract

The determination of space-dependent functions from boundary measurements or inner pointwise measurements is ill-posed inverse problems that require regularization tools to be stabilized. Among numerous regularization strategies, the Tikhonov penalization is one of the most used in the field of space-dependent function estimation. Its efficient use relies on the Tikhonov parameter value for which search is time consuming although necessary, especially in the field of nonlinear inversion. Other strategies, such as appropriate parameterization, have recently proven to be very efficient to cope with the ill-posedness of such problems. This paper shows that the optimal Tikhonov parameter is almost independent of the mesh used to project the functions to be retrieved. As a consequence, this value should be seeked using a coarse mesh even though reconstructions could further be done on finer meshes. This conclusion is validated by numerical means.

AMS Subject Classifications:

1. Introduction

The reconstruction of space-dependent functions from pointwise measurements belongs to inverse problems known to be difficult to be solved due to their ill-posed character. More specifically, the reconstruction of maps of radiative properties by illuminating a semi-transparent medium with near infrared radiation and measuring emerging radiation is by definition of the classical optical tomography inverse problem.

From a mathematical point of view, within the diffuse approximation framework in the frequency domain,[Citation1] the geometry (D) being fixed, the total inward flux (I) of the amplitude-modulated diffuse source at the frequency ν being known as well as the speed of light (c) in the medium, the knowledge of the space-dependent absorption (κ) and reduced scattering (σ) coefficients are sufficient to simulate the photon density distribution (φ) in each location of the computational domain. Such modelling is the forward problem which is known to be well-posed mathematically.

Contrarily, the reconstruction of physical properties from the knowledge of photon density on sensors constitutes an inverse problem. Unfortunately, such a problem is ill-posed in the sense of Hadamard [Citation2]. Hence, regularization must be used to stabilize the solution. Several regularization tools have been introduced in the field of optical tomography in these last decades among which the use of appropriate control space parameterization.[Citation3Citation6] Dealing with matrix-based inversion, the Tikhonov regularization method, which is a popular method, consists in adding a penalization term to the cost function to be minimized; this method actually relies on a weight parameter that is to be determined very carefully since: (i) under-regularization leads to small cost functions at the end but at the price of highly fluctuating property maps; and (ii) over-regularization leads to stable property maps around priors, leading to biased solutions.[Citation7]

In the following, the optimal Tikhonov parameter, ϑ, is defined as the one that minimizes the distance between the actual solution and the noise-free solution. This distance can be computed with E(ϑ)=γϑ-γ¯γ¯, where ϑ is the Tikhonov parameter, γϑ is the solution given for the Tikhonov parameter ϑ, γ¯ is the exact solution and · is a norm. The point is that one cannot, in practice, compute the optimal Tikhonov parameter, γ¯ being unknown. However, numerous heuristic methods have been introduced to compute quasi-optimal Tikhonov parameters. Some methods use the assumed to be known standard deviation such as in the discrepancy principle.[Citation7, Citation8] Others [Citation9Citation11] use less information about the noise properties, such as, for instance, the generalized cross validation (GCV), or even no information about the noise level present in the system, such as the well-known and controversial L-curve. The methods used to find the optimal Tikhonov parameter or at least a quasi-optimal parameter is beyond of the scope of this paper. Moreover, even though methods that do not use error informations are widely used for solving practical problems in various engineering areas, let us recall that in general the L-curve method, for instance, because it does not rely on any noise information, is, as shown by [Citation12, Citation13], not convergent and introduces a nonremovable bias.

In this paper, the diffuse optical tomography problem is formulated as a nonlinear least squares problem and solved by the Gauss–Newton method. Though this method has been widely used in these last years to solve this inverse problem,[Citation14Citation19] it is worth mentioning that the Gauss–Newton method is only locally convergent, and this is so, only when the initial guess is not too far from the minimum and the inverse problem is mildly nonlinear.[Citation20] This means that some choices of the initial guess far from the solution lead to inaccurate results whatever the regularization method used. To illustrate this property, the dependence of the accuracy of the numerical solution with respect to the initial guess is discussed in the numerical results section. Although the study of the initial guess of the solution on the convergence of the optimizer is not the main issue of this paper, it should be noted that globally convergent methods, which overcome the major drawback of the Gauss–Newton optimizer, have been developed and applied to the diffuse optical tomography problem.[Citation21Citation26]

The remainder of the paper will show that the optimal Tikhonov parameter is almost independent of the mesh coarsening used for the parameterization. This assertion is demonstrated theoretically and verified numerically based on two different under-determined inverse problems of space-dependent function estimation: a linear inverse heat conduction problem of flux estimation and a nonlinear diffuse optical tomography problem. The studies are performed on synthetic data and therefore the errors on reconstructions could be calculated. For this reason, one can somehow consider that the proposed approach is heuristic.

Before dealing with the nonlinear inverse problem of diffuse optical tomography (which is the main objective of this paper), a steady-state two-dimensional (2D) inverse heat conduction problem of space-dependent heat flux estimation is dealt with in Section 2. Such inverse problem has recently been studied.[Citation27Citation30] This space-dependent heat flux estimation problem is highly different from the diffuse optical tomography problem because (i) it deals with a different physics: heat conduction; (ii) measurements are performed within the domain rather than on a boundary; (iii) the heat flux on a boundary is the function to be retrieved rather than physical properties within the medium; and (iv) the forward problem is linear with respect to the coefficients involved in the space-dependent finite element heat flux parameterization. The interest of this first study is above all to illustrate that the quasi-independence of the optimal Tikhonov parameter with the control space parameterization is found in a less heuristic way since only measurement errors are used in the whole process. Indeed, the determination of the Tikhonov parameter can be performed straightforwardly through the knowledge of measurement errors only. In this case, the discrepancy principle is the major recipe to compute the parameters involved in the space-dependent heat flux parameterization. It is known that this method gives satisfactory results and other methods found in the literature may give a Tikhonov parameter closer to the optimal one [Citation9Citation11], but such consideration is out of the scope of this paper. Moreover, the statement of this article is also validated on this linear inverse problem in the sense of the optimal Tikhonov parameter ϑ previously defined as argminE(ϑ).

Section 3 deals with the nonlinear diffuse optical tomography problem. The forward model of light propagation in a highly diffuse medium is first presented along with the inverse problem settings. First- and second-order cost function derivatives used afterwards in optimization are then detailed. The finite element parameterization of, on the one side, the state and its derivative and, on the other side, the properties to be retrieved, is presented. The optimization is then written down based on the proposed finite dimensional control space. This section then presents the most usual Tikhonov regularization and proves that, under some weak hypothesis, the optimal Tikhonov parameter is quasi-independent of the dimension of the control space. A numerical verification is performed on the nonlinear diffuse optical tomography problem based on the theoretical demonstration.

Overall, the conclusion is that, when dealing with nonlinear inverse problems demanding heavy computational time, the determination of the Tikhonov parameter, whatever the method chosen among those of [Citation31] for instance, should be preferably performed on a highly coarse mesh in order to lower computational time.

2. Space-dependent heat flux estimation

2.1. Problem statement

In this first study, a linear steady-state 2D inverse heat conduction problem is considered to illustrate the quasi-independence of the Tikhonov parameter with mesh coarsening used for the finite element projection of the quantity of interest. In the present case, already studied in different contexts,[Citation27Citation30] a space-dependent heat flux on a part of the boundary is to be retrieved from pointwise measurements within a bounded domain D=[0,1]2. This application is treated first because of its simplicity: as a matter of fact, the response being linear with respect to the input flux,[Citation30] the cost function to be minimized is purely quadratic, and specific tools such as the singular value decomposition coupled with the maximum discrepancy principle [Citation7, Citation8] can be used to compute straightforwardly a quasi-optimal Tikhonov parameter. In this case, one can speak of ‘Tikhonov parameter found according to the discrepancy principle’. It is shown elsewhere [Citation11] that this quasi-optimal Tikhonov parameter is likely to be close to the optimal one.

Homogeneous steady-state heat conduction without source term but with mixed boundary conditions is described such that:(1) -ΔT=0(x1,x2)D=]0,1[×]0,1[T=0x1=0T·n=0x2=0andx2=1-λT·n=φx1=1(1)

where T is the temperature, λ is the thermal conductivity and φ is the space-dependent heat flux. The forward problem consists in solving Equation (Equation1) for T assuming λ and φ(x2) are known. In contrast, the inverse problem consists in estimating the space-dependent heat flux φ(x2) on the basis of suitable measurement data T˘l in D, under the assumption that the value of λ is known.

Let us consider k inner pointwise measurements (x1l,x2l), l=1,,k. Discrepancies between predictions Tl=T(x1l,x2l) and associated measurements T˘l, l=1,,k, are integrated to the cost function:(2) jϑ(φ)=J(T)+J+(φ)=T-T˘Rk2+ϑϕRΞ2(2)

with aRk2=l=1kal2 and aRΞ2=l=1Ξal2. ϑ is the Tikhonov parameter to be searched according to the noise level ϵ. ϕ is the vector obtained after finite element parameterization of the heat flux φ, that is φ(x2)=l=1ΞΘl(x2)φ(x2l)=l=1ΞΘl(x2)ϕl where Θll=1Ξ is a finite element basis of [0,1]. The singular value decomposition of the matrix A:ϕRΞTRk such that A=WΛVt enables us to rewrite the cost function such that:(3) jϑ(ξ)=Λξ-ξ˘Rk2+ϑξRΞ2(3)

where ξ=Vtϕ and ξ˘=WtT˘. With such a decomposition, the solution of ξϑ=argminjϑ(ξ) is given in terms of singular values of A, ηii=1Ξ:(4) ξϑi=ηiξ˘iηi2+ϑi=1,,Ξ(4)

On the other hand, the discrepancy principle leads to determine ξϑ such that:(5) Λξϑ-ξ˘Rk2=ϵ2(5)

where ϵ is the sum of variances of noise on all sensors, i.e. ϵ2=l=1kϵl2. Combining this latter relationship with Equation (Equation4) leads to determine the quasi-optimal Tikhonov parameter found according to the discrepancy principle, ϑdp solution of:(6) R(ϑdp)=i=1kϑdpξ˘iηi2+ϑdp2-ϵ2=0(6)

Figure 1. Distance from the solution to the target E(ϑ) as a function of the Tikhonov parameter ϑ, for three control space dimensions Ξ equal to 7, 15 and 20. It is seen that ϑ=argminE(ϑ) is independent of Ξ. The quasi-optimal Tikhonov parameter found according to the discrepancy principle has also been added.

Figure 1. Distance from the solution to the target E(ϑ) as a function of the Tikhonov parameter ϑ, for three control space dimensions Ξ equal to 7, 15 and 20. It is seen that ϑ∗=argminE(ϑ) is independent of Ξ. The quasi-optimal Tikhonov parameter found according to the discrepancy principle has also been added.

2.2. Numerical results

The numerical study shows that the solution of Equation (Equation6) is almost independent of the dimension Ξ used in the parameterization of the flux. To do so, let the synthetic data be generated with the flux φ¯(x2)=φ0sinπx22-1, φ0=-104 W/m2 and the thermal conductivity λ=30 W/m K. Five pointwise temperature measurements (k=5) are performed at locations x1=0.9, and x2=0.1, 0.3, 0.5, 0.7 and 0.9. The data are then perturbated according to a Gaussian white noise with variance ϵl2 ranging from 10-4 to 10-1 K2, l=1,,k, yielding to T˘. The finite element method is used to solve the forward problem Equation (Equation1) based on a regular mesh of the bounded domain D and a discretization of the temperature T with Lagrange P1 elements. The regular grid associated to D is chosen sufficiently fine to ensure that errors due to the approximation method are negligible when compared to measurement errors. The basis functions Θll=1Ξ used in the parameterization of the unknown flux φ are also continuous first-order Lagrange functions, i.e. linear functions per element satisfying Θl(x2p)=1 if and only if p=l.

Table presents the quasi-optimal Tikhonov parameter ϑdp for several uniform Lagrange parameterizations with Ξ ranging from 7 to 25, and several variances of noise in the data, from ϵl2=10-4 to ϵl2=10-1. It is seen that for a given level of noise, the Tikhonov parameter ϑdp depends only very slightly on the flux discretization. The small fluctuations that remain may come from all numerical approximations: finite element computations to build the state matrix A, the singular value decomposition WΛVt=A and the numerical optimization when solving the nonlinear problem Equation (Equation6), ϑdp=argminR(ϑdp). Moreover, it is worth noting the linear dependency of ϑdp with the noise variance ϵl2 (Table ).

Figure presents the distance from the actual solution to the expected target distribution, E(ϑ)=φϑ-φ¯φ¯, as a function of the Tikhonov parameter ϑ, for the variance noise ϵl=10-3, l. The norm was chosen to be defined as φ=x1=1φ2dx212. Results obtained for other noise variances gave similar curves to the one given in Figure and are thus not presented here. Figure shows that for low Tikhonov parameters ϑ, the distance from the solution to the target decreases when the control space dimension Ξ decreases. On the contrary, for large Tikhonov parameters, the distance from the solution to the target is independent of the control space dimension. In between, there is a plateau, and the minimum point is found to be independent of Ξ.

In order to illustrate the effect of regularization, different reconstructions are presented in Figure . This figure shows how parameterization influences the reconstructions.[Citation5, Citation6] It is also observed from Figure that despite the very limited number of steady-state measurements (a study on the optimal number and location of inner pointwise measurements is out of the scope of this paper), the space-dependent heat flux could be retrieved for appropriate Tikhonov regularization combined with appropriate control space parameterization.

Table 1. Value of the Tikhonov parameter ϑdp solution of (Equation6) for different discretizations and different noise magnitudes.

Figure 2. Reconstructions φϑ(x2) for Ξ=15 (top), Ξ=10 (middle) and Ξ=7 (bottom) for under-regularization (ϑ=10-15), over-regularization (ϑ=1) and appropriate regularization (ϑ=510-10ϑ). Note that points for Ξ=15 and Ξ=10 (over-parameterization) with ϑ=10-15 (under-regularization) are not presented because of divergence. The noise variance ϵl2=0.001 was used after generating the synthetic data.

Figure 2. Reconstructions φϑ(x2) for Ξ=15 (top), Ξ=10 (middle) and Ξ=7 (bottom) for under-regularization (ϑ=10-15), over-regularization (ϑ=1) and appropriate regularization (ϑ=510-10≈ϑ∗). Note that points for Ξ=15 and Ξ=10 (over-parameterization) with ϑ=10-15 (under-regularization) are not presented because of divergence. The noise variance ϵl2=0.001 was used after generating the synthetic data.

3. Optical tomography

3.1. Forward model and inversion setting

Solving the optical tomography inverse problem is usually based on the minimization of a cost function which depends on the discrepancy between some measurements and the related predictions, the latter being a solution of the forward model which is, in the present case, based on the 2D diffuse approximation. The complex photon density, φ:DC, is mathematically described by the diffuse approximation model expressed in the frequency domain [Citation1]:(7) -·2κ+σ-1φ+κ+2πiνcφ=0inD(7) (8) φ+A2π-12(κ+σ)-1φ·n=Iπ-11ζDsonD(8)

where κ and σ are the absorption and reduced scattering coefficients, respectively, c is the speed of light in the medium, n is the unit outward normal vector, I is the total inward flux of the amplitude-modulated diffuse source at the frequency ν, 1[·] denotes the indicator function, Ds depicts the light source location, i is the imaginary unit and A is the parameter which characterizes the reflection at the boundary and can be derived from Fresnel’s law if specular reflection is considered [Citation18] or from experimental set-ups.[Citation32]

The forward problem consists in solving Equations (Equation7)–(Equation8) for the photon density φ assuming the absorption coefficient κ, the reduced scattering coefficient σ, the location of the light source Ds and values of ν, c, A and I are known. In contrast, the inverse problem consists in estimating the space-dependent radiative properties κ and σ on the basis of photon density measurements φ˘ on D, under the assumption that the location of the light source Ds and values of ν, c, A and I are known.

More specifically, the forward model leads to compute the state φ that depends on radiative properties γ=(κ,σ). The difference (in the least squares sense) between this density and the measured one is integrated to the cost function to be minimized j(γ)=J(φ):(9) J(φ)=12k=1Kd=1Dφ(k;d)-φ˘(k;d)φ˘(k;d)2(9)

where the index k defines the source (test) number and the index d defines the detection number. The solution of the inverse problem requires an optimization problem formulation of the kind: ‘Find the functions κ(x) and σ(x) such that j(κ,σ)=minj(κ,σ)’.

3.2. Cost function derivative

In order to derive the Gauss–Newton algorithm, one first needs to recall basic definitions of directional derivatives. Following,[Citation33] let us denote the point (κ,σ)=γΛ2[L2(D)]2 and directions η and ζΛ. The directional derivative of the state φ at point γ towards η is(10) φ(γ;η):=limε0+φ(γ+εη)-φ(γ)ε(10)

An analogous definition can be stated for the directional derivative of the cost function along with its second-order directional derivative such that:(11) j(γ;η,ζ):=limε0+j(γ+εζ;η)-j(γ;η)ε(11)

Next, the operators involved in Newton’s method are extracted through the following equations:(12) j(γ;η)=j(γ),ηL2(D)(12) (13) j(γ;η,ζ)=2j(γ)η,ζL2(D)(13)

with the inner product is defined as γ,ηL2(D)=Dγηdx.

3.3. Parameterization

The solution of the optimization problem relies on iterative minimization algorithms for which control and state spaces must be finite in practice. To do so, let the functions to be determined α=κ, σ and φ be approximated using finite element basis functions ψξξ=1Ξα and Θξξ=1Ξφ, respectively:(14) φ(x)=ξ=1ΞφΘξ(x)φ(xξ)(14) (15) α(x)=αapξ=1Ξαψξ(x)α~(xξ),α=κ,σ(15)

where it should be noted that the dimension of the control space Ξα is chosen less or equal to that of the state space Ξφ. As a consequence, projections of functions of the expression (Equation15) into the state functional space must be used to make the solution of the system (Equation7) and (Equation8) possible. Moreover, parameters are adimensionalized with a priori properties αap=κap,σap such that α=αapα~, in order to make both functions α~=ϰ(x),ς(x) to be searched about unity.

Let us note γ~=(ϰ,ς). Parameterization of radiative property map functions allows the construction of the Gauss–Newton matrix system which is written as:(16) ~2j(γ~)δγ~=-~j(γ~)(16)

with state second-order derivatives assumed to be negligible when compared to first-order state derivatives and where vector ~j and matrix ~2j represent continuous gradient j and approached Hessian matrix 2j of (Equation12) and (Equation13) decomposed on finite element basis functions ψξξ=1Ξα. Finally, the Gauss–Newton matrix system to be solved in order to update radiative properties at each iteration is expressed as:(17) RSS¯δγ~=-RSR¯(17)

where S and S¯ are the transposed and the conjugate of S, respectively, and R(·) denotes the real part of the imaginary vector or matrix. The forward model behaving nonlinearly with respect to properties γ~=(ϰ,ς), the increment δγ~l=γ~l-γ~l-1 given by the solution of the Gauss–Newton Equation (Equation17) is solved several times until convergence is attained. With the nomenclature defined earlier, the expressions for S and R are obtained such that:(18) S=φϰ(k;d)ξφ˘(k;d),φς(k;d)ξφ˘(k;d)(k;d)=(1,,K;1,,D)ξ=1,,Ξα(18) (19) R=φ(k;d)-φ˘(k;d)φ˘(k;d)(k;d)=(1,,K;1,,D)(19)

where K and D are the number of sources and sensors, respectively.

SC(K×D)×2Ξα, RC(K×D), and derivative functions φϰ and φς can be obtained directly by differentiating the forward model (Equation7) and (Equation8) with respect to the adimensionalized radiative properties ϰ and ς, respectively. φϰ and φς are solutions of the following partial differential equations:(20) -·[2(κ+σ)]-1φϰ+κ+2πiνcφϰ=-κapϰφ-κap·ϰφ2(κ+σ)2inD(20) (21) φϰ+A2π-1[2(κ+σ)]-1φϰ·n=κapA2π-1ϰφ2(κ+σ)2φ·nonD(21)

and(22) -·[2(κ+σ)]-1φς+κ+2πiνcφς=-σap·ςφ2(κ+σ)2inD(22) (23) φς+A2π-1[2(κ+σ)]-1φς·n=σapA2π-1ςφ2(κ+σ)2φ·nonD(23)

where ϰ, ς denote the perturbations. Algorithm 1 gives some explanations about the construction of vector R and matrix S involved in (Equation17).

3.4. Tikhonov regularization

The Tikhonov regularization method consists in adding a penalization term ϑJ+ which is based on priors to the cost to be minimized (Equation9):(24) J+(ϰ,ς)=12α~=ϰ,ςDα~-α~ap2dx(24) ϑ being the Tikhonov parameter, and α~ap being equal to 1 as explained above. This has been successfully used in the optical tomography area, e.g. by [Citation14Citation19] when using the Gauss–Newton optimizer. Doing so, the optimization problem becomes ‘Find the functions ϰϑ(x) and ςϑ(x) such that jϑ(ϰϑ,ςϑ)=minjϑ(ϰ,ς)’ with jϑ=J+ϑJ+.

The simultaneous use of several regularizations together has been studied in [Citation5, Citation6], among which the reduction of the control space dimension [Citation4] with Tikhonov regularization. It is shown here that the control space reduction modifies only slightly the optimal Tikhonov parameter.

The control space mesh is assumed to be chosen sufficiently fine to ensure that the spatial fluctuations of the radiative properties are well taken into account. Thus, the projections from the control functional space to the state functional space are such that the state is almost independent of the number of degrees of freedom Ξα, and thus the cost function J is also almost independent of Ξα. Next, to simplify the presentation, let us consider only one parameter, say ϰ.

Proposition 3.1

Under the following assumptions:

(H1)

ψξ are basis functions constant per element on a regular mesh, i.e. Lagrange P0 elements are used for the discretization of the function ϰ,

(H2)

ϰN1,σϰ2,

then the optimal Tikhonov parameter is asymptotically independent of the dimension Ξα.

Proof

Let ϰϑ be a solution of the optimization problem(25) ϰϑ=argminϰJ(φ)+ϑJ+(ϰ)(25)

with ϑ representing the best compromise between both costs J(φ) and J+(ϰ). Then, showing J+ does not depend on the parameterization yields ϑ independent of Ξα. Let ϑ~=σϰ2ϑ and Δx=|D|/Ξα be the area of one element of the regular mesh. With H1), we have ϑJ+(ϰ)=ϑ~|D|21Ξαi=1Ξαϰi-1σϰ2. With H2), ϰi-1σϰN0,1i, thus Υ=i=1Ξαϰi-1σϰ2χΞα2. Then, using the fact that EΥ=Ξα, we have EJ+=σϰ2|D|2 which is independent of Ξα. This completes the proof.

Remark 3.2

This demonstration can be easily extended to several parameters, say with a function of the type of (Equation24) assuming that σςσϰ.

Remark 3.3

It could also be extended to other finite element parameterizations, but with much heavier calculations.

3.5. Numerical validation

The ability to determine on a coarse mesh a reasonable parameter for the Tikhonov regularization is illustrated on a circular test medium. The rationale behind this choice is to reduce computational time required for getting a Tikhonov parameter so that the solution is close enough to the true solution, at least from an engineering point of view.

Synthetic data and predicted values are collected with the help of eight equally distributed sources and sensors about the disk perimeter (K=8). Sources and sensors are alternatively distributed as schematically depicted in Figure . Each sensor contains seven pointwise photon density measurements (D=56). Radiative property maps to be recovered are also given in Figure . The areas to be detected are located at about 2.5 cm in opposite locations at 45  and 225, respectively. The defects are 4/3 cm in diameters, while the whole medium has a 4 cm radius. Each probe is 0.8 cm in length. The other physical parameters involved in (Equation7) and (Equation8) are fixed to: n=1.4, ν=1×108 s-1, c=c0/n with c0=3×1010 cm s-1, I=0.1 W cm-2 and A is derived from Fresnel’s law (see [Citation18]).

Figure 3. Test medium geometry representation. Eight sources are located on the boundary. For each source (which constitutes a test), the emerging radiation is measured on all sensors.

Figure 3. Test medium geometry representation. Eight sources are located on the boundary. For each source (which constitutes a test), the emerging radiation is measured on all sensors.

The radiative parameters are initialized to the radiative properties of the background considered as priors: κ0=κap=0.8cm-1 and σ0=σap=20cm-1 all over the medium D. Thus, initial control variables ϰ0=κ0/κap and ς0=σ0/σap are initialized to 1. Synthetic data φ˘ and initial adimensionalized radiative properties ϰ and ς are inputs of the inverse problem. The way in which synthetic data are generated is given hereafter.

Within the inverse procedure, the forward model (Equation7) and (Equation8) is solved by a finite element method with Lagrange P1 elements, that is, with φVhφ=VMhφ,P1. The state mesh Mhφ is chosen fine enough to ensure that numerical predictions can fit the synthetic data. Synthetic data φ˘ are also generated with the finite element method with Lagrange P1 elements but using a much finer mesh Mhφ˘ in order to avoid the inverse crime.[Citation34] After projecting the synthetic data in the functional space Vhφ, φ˘ are corrupted with a multiplicative white Gaussian noise of signal-to-noise ratio SNR=30 dB: φ˘noisy=φ˘1+10-SNR/10×ε where εN(0,1).

Next, the control space Vhα=VMhα,P1 for α=κ,σ relies on a more or less coarse mesh Mhα according to cases (see Table ). The FreeFem++ environment has been used to perform all computations.[Citation35]

Table 2. Dimensions of finite element spaces for φ, φ˘ and α.

Table 3. CPU time comparisons for the generation of the error curves E(ϑ).

The error curves E(ϑ) are built at the 20th iteration for the three cases described in Table : a fine mesh for the state (dimVhφ=2924), and successively a fine mesh, a medium mesh and a coarse mesh with respectively dimVhα equal to 2924, 1382 and 384 nodes for the parameters. The three error curves are presented all together in Figure . Figure first shows that for ϑ>0.4, the calculated errors are independent of the mesh coarseness. Otherwise, for ϑ<0.3, it is seen that the reduction of the control space reduces dramatically the errors. Such reduction of control space thus regularizes the inverse problem preconditioning the optimization.[Citation4] In between, Figure shows that the optimal Tikhonov parameters are found to be quasi-independent of the number of degrees of freedom of the finite control space. Within the range [10-2,10+2], the optimal Tikhonov parameter ϑ is found to be equal to 0.3 for Ξα=384 and 0.4 for Ξα=1382 and 2924. This means that the minimal residual is indeed quasi-independent of the mesh coarsening, i.e. of dimVhα. This is what has been shown in Proposition 3.1.

Table also provides the relative CPU time needed to build such curves for cases 1, 2 and 3 described in Table : the construction of this error curve with the coarsest mesh needs, in this case, roughly the 17th of the initial CPU time related to the construction with the finest mesh. This can be easily understood by examining the construction scheme of the Gauss–Newton matrix system presented above (see Algorithm 1).

Figures and present the reconstructions of both κ and σ obtained with the Gauss–Newton optimizer at the 20th iteration, with Ξα=2924, and with the Tikhonov weight equal to 0.2 and 0.4, respectively. Figure shows the divergence of the reconstruction with negative values for σ due to under-regularization: the weight parameter ϑ is too low. Figure presents the same reconstruction but with the Tikhonov parameter ϑ chosen at the minimum of the error curve. Reconstructions are good even in the presence of a 30 dB noise.

Figure 4. Distance from the solution to the target E(ϑ) as a function of the Tikhonov parameter ϑ, for κap,σap=0.08,20cm-1 and three control space dimensions Ξα equal to 384, 1382 and 2924. It is seen that ϑ=argminE(ϑ) is quasi-independent of Ξ.

Figure 4. Distance from the solution to the target E(ϑ) as a function of the Tikhonov parameter ϑ, for κap,σap=0.08,20cm-1 and three control space dimensions Ξα equal to 384, 1382 and 2924. It is seen that ϑ∗=argminE(ϑ) is quasi-independent of Ξ.

Figure 5. Reconstruction of κ (left) and σ (right) with ϑ=0.2, κap,σap=0.08,20cm-1 and Ξα=2924. It should be noted the divergence for the reduced diffusion coefficient.

Figure 5. Reconstruction of κ (left) and σ (right) with ϑ=0.2, κap,σap=0.08,20cm-1 and Ξα=2924. It should be noted the divergence for the reduced diffusion coefficient.

Figure 6. Reconstruction of κ (left) and σ (right) with ϑ=0.4, κap,σap=0.08,20cm-1 and Ξα=2924.

Figure 6. Reconstruction of κ (left) and σ (right) with ϑ=0.4, κap,σap=0.08,20cm-1 and Ξα=2924.

Figure 7. Reconstruction of κ (left) and σ (right) with ϑ=1.4, κap,σap=0.088,24cm-1 and Ξα=2924.

Figure 7. Reconstruction of κ (left) and σ (right) with ϑ=1.4, κap,σap=0.088,24cm-1 and Ξα=2924.

Two supplementary numerical tests are considered in order to investigate the behaviour of the inverse method with respect to the initial guess. As previously, initial control variables ϰ0 and ς0 are initialized to 1, so that the initial radiative properties are equal to the a priori properties κap and σap. For the first test, a priori properties are fixed to κap,σap=0.088,24cm-1 (10 and 20However, it should be noted that, as for the previous case where a priori properties equal the radiative properties of the background, the optimal Tikhonov parameter is found to be slightly smaller for Ξα=384 (1382) than for Ξα=1382 (2924). This can be explained by the fact that the reduction of the control space dimension acts as a regularization method so that the smaller the control space dimension Ξα, the smaller the optimal Tikhonov parameter ϑ. As a result, the optimal Tikhonov parameter provided by a strongly reduced control space should not be considered as such for the original control space, but has to be interpreted as a very good indicator: the optimal Tikhonov parameter for the original control space will be slightly larger. Figure presents the reconstructions of both κ and σ obtained with the Gauss–Newton optimizer at the 20th iteration, with ϑ=1.4 and Ξα=2924. It is observed that the reconstructions are less accurate than those obtained when a priori properties equal the radiative properties of the background, but the inclusions can still be distinguished.

Finally, for the second test, the a priori properties are fixed to κap,σap=0.096,28cm-1 (20 and 40% larger than radiative properties of the background medium for the absorption and reduced scattering coefficients, respectively). It was observed that the error curves associated to this initial guess are very irregular, so that it is difficult to find an optimal Tikhonov parameter. In fact, the results show that the Gauss–Newton optimizer fails to reconstruct the target properties whatever the value of the Tikhonov parameter. The initial guess is too far from the target so that the descent direction of the Gauss–Newton algorithm is never pointed towards the minimum and leads inevitably to very inaccurate reconstructions whatever the Tikhonov weight. Thus, for such choice of the initial guess, the use of more sophisticated methods, such as the globally convergent ones developed in [Citation21Citation26], becomes necessary.

4. Conclusion

This paper has demonstrated that the optimal Tikhonov parameter, defined as that which minimizes the distance between the actual solution and the noise-free solution, is almost independent of the control space parameterization. This has first been verified on a standard linear inverse heat conduction problem in which, in phase one, the Tikhonov parameter has been approximated with the singular value decomposition and the maximum discrepancy principle and then, in a second phase, in the sense of the optimal Tikhonov parameter previously defined. Next, this validation has been extended to the nonlinear inverse problem of diffuse optical tomography. The error curves with respect to the Tikhonov parameter showed that their minimum is quasi-independent of the control space parameterization. This may result in CPU time reduction when searching for the optimal Tikhonov parameter, whatever the method used to find it. Of course, this is highly recommended for large size (three-dimensional) objects when the Gauss–Newton optimizer is chosen. Finally, note that the two inverse problems dealt with in this paper were under-determined, as it is the case in most space-dependent function estimation problems. Consequently, the theory has been numerically validated only on these two specific inverse problems. This may be viewed as a limitation, but the need of control space dimension reduction is much less useful for over-determined inverse problems than for space-dependent ones which are usually under-determined.

Acknowledgements

The authors would like to thank Pr. Hecht for providing the very efficient finite element tool FreeFem++.[35] The authors would also like to thank the Levis’s town, the Regional Conference of Elected Representatives of Chaudière-Appalaches, Énergie Valero Inc. and Ecosystem for their financial support to the industrial research chair t3e. The authors also thank the Centre Régional de Calcul Intensif des Pays de la Loire (CCIPL), financed by the French Research Ministry, the Région Pays de la Loire and the University of Nantes, allowing computations on their supercomputers. Finally, the authors would like to sincerely thank the reviewers for their constructive and relevant comments formulated on the first version that significantly enhanced the quality of this paper.

Notes

No potential conflict of interest was reported by the authors.

References

  • Arridge SR. Optical tomography in medical imaging. Inverse Probl. 1999;15:R41.
  • Hadamard J. Sur les problèmes aux dérivées partielles et leur signification physique [On partial derivative problems and their physical meaning]. Princeton Univ. Bull. 1902;13:49–52.
  • Gu X, Xu Y, Jiang H. Mesh-based enhancement schemes in diffuse optical tomography. Med. Phys. 2003;30:861–869.
  • Chavent G. Nonlinear least squares for inverse problems: theoretical foundations and step-by-step guide for applications. Springer; 2010.
  • Favennec Y, Dubot F, Rousseau B, Rousse D. Mixing regularization tools for enhancing regularity in optical tomography applications. In: IPDO-2007-Inverse Problems, Design and Optimization Symposium; Albi; 2013.
  • Balima O, Favennec Y, Rousse D. Optical tomography reconstruction algorithm with the finite element method: an optimal approach with regularization tools. J. Comput. Phys. 2013;251:461–479.
  • Morozov VA, Nashed Z, Aries A. Methods for solving incorrectly posed problems. New York (NY): Springer; 1984.
  • Goncharskii A, Leonov AS, Yagola AG. A generalized discrepancy principle. USSR Comput. Math. Math. Phys. 1973;13:25–37.
  • Golub GH, Heath M, Wahba G. Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics. 1979;21:215–223.
  • Hansen PC, O’Leary DP. The use of the L-curve in the regularization of discrete ill-posed problems. SIAM J. Sci. Comput. 1993;14:1487–1503.
  • O’Leary DP. Near-optimal parameters for Tikhonov and other regularization methods. SIAM J. Sci. Comput. 2001;23:1161–1171.
  • Yagola A, Leonov A, Titarenko V. Data errors and an error estimation for ill-posed problems. Inverse Probl. Eng. 2002;10:117–129.
  • Vogel CR. Non-convergence of the l-curve regularization parameter selection method. Inverse Probl. 1996;12:535.
  • Paulsen KD, Jiang H. Spatially varying optical property reconstruction using a finite element diffusion equation approximation. Med. Phys. 1995;22:691–701.
  • Schweiger M, Arridge SR, Nissilä I. Gauss–Newton method for image reconstruction in diffuse optical tomography. Phys. Med. Biol. 2005;50:2365.
  • Niu H, Guo P, Ji L, Zhao Q, Jiang T. Improving image quality of diffuse optical tomography with a projection-error-based adaptive regularization method. Optics Exp. 2008;16:12423–12434.
  • Tarvainen T, Vauhkonen M, Arridge S. Gauss–Newton reconstruction method for optical tomography using the finite element solution of the radiative transfer equation. J. Quant. Spect. Rad. Trans. 2008;109:2767–2778.
  • Dehghani H, Srinivasan S, Pogue BW, Gibson A. Numerical modelling and image reconstruction in diffuse optical tomography. Philos. Trans. Roy. Soc. A: Math. Phys. Eng. Sci. 2009;367:3073–3093.
  • Dehghani H, Eames ME, Yalavarthy PK, Davis SC, Srinivasan S, Carpenter CM, Pogue BW, Paulsen KD. Near infrared optical tomography using nirfast: algorithm for numerical model and image reconstruction. Commun. Numer. Meth. Eng. 2009;25:711–732.
  • Björck A. Numerical methods for least squares problems. SIAM; 1996.
  • Su J, Shan H, Liu H, Klibanov MV. Reconstruction method with data from a multiple-site continuous-wave source for three-dimensional optical tomography. J. Optical Soc. Am. A. 2006;23:2388–2395.
  • Shan H, Klibanov MV, Liu H, Pantong N, Su J. Numerical implementation of the convexification algorithm for an optical diffusion tomograph. Inverse Probl. 2008;24:025006.
  • Shan H, Klibanov MV, Su J, Pantong N, Liu H. A globally accelerated numerical method for optical tomography with continuous wave source. J. Inverse Ill-posed Probl. 2008;16:763–790.
  • Beilina L, Klibanov MV. A globally convergent numerical method for a coefficient inverse problem. SIAM J. Sci. Comput. 2008;31:478–509.
  • Beilina L, Klibanov MV. Approximate global convergence and adaptivity for coefficient inverse problems. Springer Science & Business Media; 2012.
  • Su J, Liu Y, Lin Z, Teng S, Rhoden A, Pantong N, Liu H. Reconstructions for continuous-wave diffuse optical tomography by a globally convergent method. J. Appl. Math. Phys. 2014;2:204–213.
  • Hensel E, Hills R. Steady-state two-dimensional inverse heat conduction. Numer. Heat Trans. Part B Fund. 1989;15:227–240.
  • Taler J. Nonlinear steady-state inverse heat conduction problem with space-variable boundary conditions. J. Heat Trans. 1992;114:1048–1051.
  • Dulikravich G, Martin T. Inverse shape and boundary condition problems and optimization in heat conduction. Adv. Numer. Heat Trans. 1996;1:381–426.
  • Jarny Y. 2011. Lecture 9: Inverse problems & regularized solutions. Eurotherm Spring School METTI 2011: Thermal measurements and inverse techniques. Available from: http://www.sft.asso.fr/document.php?pagendx=12299&project=sft;Roscoff.
  • Tikhonov A, Leonov A, Yagola A. Nonlinear ill-posed problems. Vol. 1 and 2. London: Chapman and Hall; 1998.
  • Schweiger M, Arridge S, Hiraoka M, Delpy D. The finite element method for the propagation of light in scattering media: boundary and source conditions. Med. Phys. 1995;22:1779–1792.
  • Lions JL, Faurre P. Cours d’analyse numérique [Lectures on numerical analysis]. Paris: École Polytechnique; 1982.
  • Kaipio J. Statistical and computational inverse problems. Vol. 160. Springer; 2005.
  • Hecht F. New development in freefem++. J. Numer. Math. 2012;20:251–266.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.