Search in:

Inverse Problems in Science and Engineering Volume 24, 2016 - Issue 3

Submit an article Journal homepage

Free access

282

Views

CrossRef citations to date

Altmetric

Listen

Articles

Quasi-optimal Tikhonov penalization and parameterization coarseness in space-dependent function estimation

F. DubotLaboratoire de Thermocinétique de Nantes (LTN), UMR CNRS 6607, Nantes Cedex 3, France.;Chaire de recherche industrielle en technologies de l’énergie et en efficacité énergétique (t3e), École de technologie supérieure, Montréal, Canada.

Y. FavennecLaboratoire de Thermocinétique de Nantes (LTN), UMR CNRS 6607, Nantes Cedex 3, France.Correspondence[email protected]

B. RousseauLaboratoire de Thermocinétique de Nantes (LTN), UMR CNRS 6607, Nantes Cedex 3, France.

Y. JarnyLaboratoire de Thermocinétique de Nantes (LTN), UMR CNRS 6607, Nantes Cedex 3, France.

D.R. RousseChaire de recherche industrielle en technologies de l’énergie et en efficacité énergétique (t3e), École de technologie supérieure, Montréal, Canada.

Pages 465-481 | Received 04 Jul 2014, Accepted 28 Apr 2015, Published online: 01 Jun 2015

Cite this article
https://doi.org/10.1080/17415977.2015.1047362
CrossMark

In this article

1. Introduction
2. Space-dependent heat flux estimation
3. Optical tomography
4. Conclusion
Acknowledgements
Footnotes
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

The determination of space-dependent functions from boundary measurements or inner pointwise measurements is ill-posed inverse problems that require regularization tools to be stabilized. Among numerous regularization strategies, the Tikhonov penalization is one of the most used in the field of space-dependent function estimation. Its efficient use relies on the Tikhonov parameter value for which search is time consuming although necessary, especially in the field of nonlinear inversion. Other strategies, such as appropriate parameterization, have recently proven to be very efficient to cope with the ill-posedness of such problems. This paper shows that the optimal Tikhonov parameter is almost independent of the mesh used to project the functions to be retrieved. As a consequence, this value should be seeked using a coarse mesh even though reconstructions could further be done on finer meshes. This conclusion is validated by numerical means.

Keywords:

inverse problem
Tikhonov regularization
Gauss–Newton
finite element parameterization
ill-posed problems

AMS Subject Classifications:

35R30
65F22

1. Introduction

The reconstruction of space-dependent functions from pointwise measurements belongs to inverse problems known to be difficult to be solved due to their ill-posed character. More specifically, the reconstruction of maps of radiative properties by illuminating a semi-transparent medium with near infrared radiation and measuring emerging radiation is by definition of the classical optical tomography inverse problem.

From a mathematical point of view, within the diffuse approximation framework in the frequency domain,[Citation1] the geometry ( $D$ ) being fixed, the total inward flux ( $I$ ) of the amplitude-modulated diffuse source at the frequency $ν$ being known as well as the speed of light ( $c$ ) in the medium, the knowledge of the space-dependent absorption ( $κ$ ) and reduced scattering ( $σ$ ) coefficients are sufficient to simulate the photon density distribution ( $φ$ ) in each location of the computational domain. Such modelling is the forward problem which is known to be well-posed mathematically.

Contrarily, the reconstruction of physical properties from the knowledge of photon density on sensors constitutes an inverse problem. Unfortunately, such a problem is ill-posed in the sense of Hadamard [Citation2]. Hence, regularization must be used to stabilize the solution. Several regularization tools have been introduced in the field of optical tomography in these last decades among which the use of appropriate control space parameterization.[Citation3–Citation6] Dealing with matrix-based inversion, the Tikhonov regularization method, which is a popular method, consists in adding a penalization term to the cost function to be minimized; this method actually relies on a weight parameter that is to be determined very carefully since: (i) under-regularization leads to small cost functions at the end but at the price of highly fluctuating property maps; and (ii) over-regularization leads to stable property maps around priors, leading to biased solutions.[Citation7]

In the following, the optimal Tikhonov parameter, $ϑ^{*}$ , is defined as the one that minimizes the distance between the actual solution and the noise-free solution. This distance can be computed with $E (ϑ) = \frac{‖ γ_{ϑ} - \bar{γ} ‖}{‖ \bar{γ} ‖}$ , where $ϑ$ is the Tikhonov parameter, $γ_{ϑ}$ is the solution given for the Tikhonov parameter $ϑ$ , $\bar{γ}$ is the exact solution and $‖ \cdot ‖$ is a norm. The point is that one cannot, in practice, compute the optimal Tikhonov parameter, $\bar{γ}$ being unknown. However, numerous heuristic methods have been introduced to compute quasi-optimal Tikhonov parameters. Some methods use the assumed to be known standard deviation such as in the discrepancy principle.[Citation7, Citation8] Others [Citation9–Citation11] use less information about the noise properties, such as, for instance, the generalized cross validation (GCV), or even no information about the noise level present in the system, such as the well-known and controversial L-curve. The methods used to find the optimal Tikhonov parameter or at least a quasi-optimal parameter is beyond of the scope of this paper. Moreover, even though methods that do not use error informations are widely used for solving practical problems in various engineering areas, let us recall that in general the L-curve method, for instance, because it does not rely on any noise information, is, as shown by [Citation12, Citation13], not convergent and introduces a nonremovable bias.

In this paper, the diffuse optical tomography problem is formulated as a nonlinear least squares problem and solved by the Gauss–Newton method. Though this method has been widely used in these last years to solve this inverse problem,[Citation14–Citation19] it is worth mentioning that the Gauss–Newton method is only locally convergent, and this is so, only when the initial guess is not too far from the minimum and the inverse problem is mildly nonlinear.[Citation20] This means that some choices of the initial guess far from the solution lead to inaccurate results whatever the regularization method used. To illustrate this property, the dependence of the accuracy of the numerical solution with respect to the initial guess is discussed in the numerical results section. Although the study of the initial guess of the solution on the convergence of the optimizer is not the main issue of this paper, it should be noted that globally convergent methods, which overcome the major drawback of the Gauss–Newton optimizer, have been developed and applied to the diffuse optical tomography problem.[Citation21–Citation26]

The remainder of the paper will show that the optimal Tikhonov parameter is almost independent of the mesh coarsening used for the parameterization. This assertion is demonstrated theoretically and verified numerically based on two different under-determined inverse problems of space-dependent function estimation: a linear inverse heat conduction problem of flux estimation and a nonlinear diffuse optical tomography problem. The studies are performed on synthetic data and therefore the errors on reconstructions could be calculated. For this reason, one can somehow consider that the proposed approach is heuristic.

Before dealing with the nonlinear inverse problem of diffuse optical tomography (which is the main objective of this paper), a steady-state two-dimensional (2D) inverse heat conduction problem of space-dependent heat flux estimation is dealt with in Section 2. Such inverse problem has recently been studied.[Citation27–Citation30] This space-dependent heat flux estimation problem is highly different from the diffuse optical tomography problem because (i) it deals with a different physics: heat conduction; (ii) measurements are performed within the domain rather than on a boundary; (iii) the heat flux on a boundary is the function to be retrieved rather than physical properties within the medium; and (iv) the forward problem is linear with respect to the coefficients involved in the space-dependent finite element heat flux parameterization. The interest of this first study is above all to illustrate that the quasi-independence of the optimal Tikhonov parameter with the control space parameterization is found in a less heuristic way since only measurement errors are used in the whole process. Indeed, the determination of the Tikhonov parameter can be performed straightforwardly through the knowledge of measurement errors only. In this case, the discrepancy principle is the major recipe to compute the parameters involved in the space-dependent heat flux parameterization. It is known that this method gives satisfactory results and other methods found in the literature may give a Tikhonov parameter closer to the optimal one [Citation9–Citation11], but such consideration is out of the scope of this paper. Moreover, the statement of this article is also validated on this linear inverse problem in the sense of the optimal Tikhonov parameter $ϑ^{*}$ previously defined as $arg min E (ϑ)$ .

Section 3 deals with the nonlinear diffuse optical tomography problem. The forward model of light propagation in a highly diffuse medium is first presented along with the inverse problem settings. First- and second-order cost function derivatives used afterwards in optimization are then detailed. The finite element parameterization of, on the one side, the state and its derivative and, on the other side, the properties to be retrieved, is presented. The optimization is then written down based on the proposed finite dimensional control space. This section then presents the most usual Tikhonov regularization and proves that, under some weak hypothesis, the optimal Tikhonov parameter is quasi-independent of the dimension of the control space. A numerical verification is performed on the nonlinear diffuse optical tomography problem based on the theoretical demonstration.

Overall, the conclusion is that, when dealing with nonlinear inverse problems demanding heavy computational time, the determination of the Tikhonov parameter, whatever the method chosen among those of [Citation31] for instance, should be preferably performed on a highly coarse mesh in order to lower computational time.

2. Space-dependent heat flux estimation

2.1. Problem statement

In this first study, a linear steady-state 2D inverse heat conduction problem is considered to illustrate the quasi-independence of the Tikhonov parameter with mesh coarsening used for the finite element projection of the quantity of interest. In the present case, already studied in different contexts,[Citation27–Citation30] a space-dependent heat flux on a part of the boundary is to be retrieved from pointwise measurements within a bounded domain $D = {[0, 1]}^{2}$ . This application is treated first because of its simplicity: as a matter of fact, the response being linear with respect to the input flux,[Citation30] the cost function to be minimized is purely quadratic, and specific tools such as the singular value decomposition coupled with the maximum discrepancy principle [Citation7, Citation8] can be used to compute straightforwardly a quasi-optimal Tikhonov parameter. In this case, one can speak of ‘Tikhonov parameter found according to the discrepancy principle’. It is shown elsewhere [Citation11] that this quasi-optimal Tikhonov parameter is likely to be close to the optimal one.

Homogeneous steady-state heat conduction without source term but with mixed boundary conditions is described such that:(1) $\begin{matrix} \{\begin{matrix} - Δ T = 0 & (x_{1}, x_{2}) \in D =] 0, 1 [\times] 0, 1 [ \\ T = 0 & x_{1} = 0 \\ \nabla T \cdot n = 0 & x_{2} = 0 and x_{2} = 1 \\ - λ \nabla T \cdot n = φ & x_{1} = 1 \end{matrix} \end{matrix}$ (1)

where $T$ is the temperature, $λ$ is the thermal conductivity and $φ$ is the space-dependent heat flux. The forward problem consists in solving Equation (Equation1(1) $\begin{matrix} \{\begin{matrix} - Δ T = 0 & (x_{1}, x_{2}) \in D =] 0, 1 [\times] 0, 1 [ \\ T = 0 & x_{1} = 0 \\ \nabla T \cdot n = 0 & x_{2} = 0 and x_{2} = 1 \\ - λ \nabla T \cdot n = φ & x_{1} = 1 \end{matrix} \end{matrix}$ (1) ) for $T$ assuming $λ$ and $φ (x_{2})$ are known. In contrast, the inverse problem consists in estimating the space-dependent heat flux $φ (x_{2})$ on the basis of suitable measurement data ${\overset{˘}{T}}^{l}$ in $D$ , under the assumption that the value of $λ$ is known.

Let us consider $k$ inner pointwise measurements $(x_{1}^{l}, x_{2}^{l})$ , $l = 1, \dots, k$ . Discrepancies between predictions $T^{l} = T (x_{1}^{l}, x_{2}^{l})$ and associated measurements ${\overset{˘}{T}}^{l}$ , $l = 1, \dots, k$ , are integrated to the cost function:(2) $\begin{matrix} j_{ϑ} (φ) = J (T) + J^{+} (φ) = {∥T - \overset{˘}{T}∥}_{R^{k}}^{2} + ϑ {∥ϕ∥}_{R^{Ξ}}^{2} \end{matrix}$ (2)

with ${∥a∥}_{R^{k}}^{2} = \sum_{l = 1}^{k} a_{l}^{2}$ and ${∥a∥}_{R^{Ξ}}^{2} = \sum_{l = 1}^{Ξ} a_{l}^{2}$ . $ϑ$ is the Tikhonov parameter to be searched according to the noise level $ϵ$ . $ϕ$ is the vector obtained after finite element parameterization of the heat flux $φ$ , that is $φ (x_{2}) = \sum_{l = 1}^{Ξ} Θ_{l} (x_{2}) φ ({x_{2}}_{l}) = \sum_{l = 1}^{Ξ} Θ_{l} (x_{2}) ϕ_{l}$ where ${(Θ_{l})}_{l = 1}^{Ξ}$ is a finite element basis of $[0, 1]$ . The singular value decomposition of the matrix $A : ϕ \in R^{Ξ} \mapsto T \in R^{k}$ such that $A = W Λ V^{t}$ enables us to rewrite the cost function such that:(3) $\begin{matrix} j_{ϑ} (ξ) = {∥Λ ξ - \overset{˘}{ξ}∥}_{R^{k}}^{2} + ϑ {∥ξ∥}_{R^{Ξ}}^{2} \end{matrix}$ (3)

where $ξ = V^{t} ϕ$ and $\overset{˘}{ξ} = W^{t} \overset{˘}{T}$ . With such a decomposition, the solution of $ξ_{ϑ}^{*} = arg min j_{ϑ} (ξ)$ is given in terms of singular values of $A$ , ${(η_{i})}_{i = 1}^{Ξ}$ :(4) $\begin{matrix} {ξ_{ϑ}^{*}}_{i} = \frac{η_{i} {\overset{˘}{ξ}}_{i}}{η_{i}^{2} + ϑ} \forall i = 1, \dots, Ξ \end{matrix}$ (4)

On the other hand, the discrepancy principle leads to determine $ξ_{ϑ}^{*}$ such that:(5) $\begin{matrix} {∥Λ ξ_{ϑ}^{*} - \overset{˘}{ξ}∥}_{R^{k}}^{2} = ϵ^{2} \end{matrix}$ (5)

where $ϵ$ is the sum of variances of noise on all sensors, i.e. $ϵ^{2} = \sum_{l = 1}^{k} ϵ_{l}^{2}$ . Combining this latter relationship with Equation (Equation4(4) $\begin{matrix} {ξ_{ϑ}^{*}}_{i} = \frac{η_{i} {\overset{˘}{ξ}}_{i}}{η_{i}^{2} + ϑ} \forall i = 1, \dots, Ξ \end{matrix}$ (4) ) leads to determine the quasi-optimal Tikhonov parameter found according to the discrepancy principle, $ϑ_{dp}^{*}$ solution of:(6) $\begin{matrix} R (ϑ_{dp}^{*}) = \sum_{i = 1}^{k} {(\frac{ϑ_{dp}^{*} {\overset{˘}{ξ}}_{i}}{η_{i}^{2} + ϑ_{dp}^{*}})}^{2} - ϵ^{2} = 0 \end{matrix}$ (6)

Figure 1. Distance from the solution to the target $E (ϑ)$ as a function of the Tikhonov parameter $ϑ$ , for three control space dimensions $Ξ$ equal to $7$ , $15$ and $20$ . It is seen that $ϑ^{*} = arg min E (ϑ)$ is independent of $Ξ$ . The quasi-optimal Tikhonov parameter found according to the discrepancy principle has also been added.

2.2. Numerical results

The numerical study shows that the solution of Equation (Equation6(6) $\begin{matrix} R (ϑ_{dp}^{*}) = \sum_{i = 1}^{k} {(\frac{ϑ_{dp}^{*} {\overset{˘}{ξ}}_{i}}{η_{i}^{2} + ϑ_{dp}^{*}})}^{2} - ϵ^{2} = 0 \end{matrix}$ (6) ) is almost independent of the dimension $Ξ$ used in the parameterization of the flux. To do so, let the synthetic data be generated with the flux $\bar{φ} (x_{2}) = φ_{0} (sin (\frac{π x_{2}}{2}) - 1)$ , $φ_{0} = - 10^{4}$ W/m $^{2}$ and the thermal conductivity $λ = 30$ W/m K. Five pointwise temperature measurements ( $k = 5$ ) are performed at locations $x_{1} = 0.9$ , and $x_{2} = 0.1$ , $0.3$ , $0.5$ , $0.7$ and $0.9$ . The data are then perturbated according to a Gaussian white noise with variance $ϵ_{l}^{2}$ ranging from $10^{- 4}$ to $10^{- 1}$ K $^{2}$ , $\forall l = 1, \dots, k$ , yielding to $\overset{˘}{T}$ . The finite element method is used to solve the forward problem Equation (Equation1(1) $\begin{matrix} \{\begin{matrix} - Δ T = 0 & (x_{1}, x_{2}) \in D =] 0, 1 [\times] 0, 1 [ \\ T = 0 & x_{1} = 0 \\ \nabla T \cdot n = 0 & x_{2} = 0 and x_{2} = 1 \\ - λ \nabla T \cdot n = φ & x_{1} = 1 \end{matrix} \end{matrix}$ (1) ) based on a regular mesh of the bounded domain $D$ and a discretization of the temperature $T$ with Lagrange $P_{1}$ elements. The regular grid associated to $D$ is chosen sufficiently fine to ensure that errors due to the approximation method are negligible when compared to measurement errors. The basis functions ${(Θ_{l})}_{l = 1}^{Ξ}$ used in the parameterization of the unknown flux $φ$ are also continuous first-order Lagrange functions, i.e. linear functions per element satisfying $Θ_{l} ({x_{2}}_{p}) = 1$ if and only if $p = l$ .

Table presents the quasi-optimal Tikhonov parameter $ϑ_{dp}^{*}$ for several uniform Lagrange parameterizations with $Ξ$ ranging from 7 to 25, and several variances of noise in the data, from $ϵ_{l}^{2} = 10^{- 4}$ to $ϵ_{l}^{2} = 10^{- 1}$ . It is seen that for a given level of noise, the Tikhonov parameter $ϑ_{dp}^{*}$ depends only very slightly on the flux discretization. The small fluctuations that remain may come from all numerical approximations: finite element computations to build the state matrix $A$ , the singular value decomposition $W Λ V^{t} = A$ and the numerical optimization when solving the nonlinear problem Equation (Equation6(6) $\begin{matrix} R (ϑ_{dp}^{*}) = \sum_{i = 1}^{k} {(\frac{ϑ_{dp}^{*} {\overset{˘}{ξ}}_{i}}{η_{i}^{2} + ϑ_{dp}^{*}})}^{2} - ϵ^{2} = 0 \end{matrix}$ (6) ), $ϑ_{dp}^{*} = arg min R (ϑ_{dp})$ . Moreover, it is worth noting the linear dependency of $ϑ_{dp}^{*}$ with the noise variance $ϵ_{l}^{2}$ (Table ).

Figure presents the distance from the actual solution to the expected target distribution, $E (ϑ) = \frac{‖ φ_{ϑ} - \bar{φ} ‖}{‖ \bar{φ} ‖}$ , as a function of the Tikhonov parameter $ϑ$ , for the variance noise $ϵ_{l} = 10^{- 3}$ , $\forall l$ . The norm was chosen to be defined as $‖ φ ‖ = {(\int_{x_{1} = 1} φ^{2} d x_{2})}^{\frac{1}{2}}$ . Results obtained for other noise variances gave similar curves to the one given in Figure and are thus not presented here. Figure shows that for low Tikhonov parameters $ϑ$ , the distance from the solution to the target decreases when the control space dimension $Ξ$ decreases. On the contrary, for large Tikhonov parameters, the distance from the solution to the target is independent of the control space dimension. In between, there is a plateau, and the minimum point is found to be independent of $Ξ$ .

In order to illustrate the effect of regularization, different reconstructions are presented in Figure . This figure shows how parameterization influences the reconstructions.[Citation5, Citation6] It is also observed from Figure that despite the very limited number of steady-state measurements (a study on the optimal number and location of inner pointwise measurements is out of the scope of this paper), the space-dependent heat flux could be retrieved for appropriate Tikhonov regularization combined with appropriate control space parameterization.

Table 1. Value of the Tikhonov parameter $ϑ_{dp}^{}$ solution of (Equation6(6) $\begin{matrix} R (ϑ_{dp}^{}) = \sum_{i = 1}^{k} {(\frac{ϑ_{dp}^{} {\overset{˘}{ξ}}_{i}}{η_{i}^{2} + ϑ_{dp}^{}})}^{2} - ϵ^{2} = 0 \end{matrix}$ (6) ) for different discretizations and different noise magnitudes.

Display Table

Figure 2. Reconstructions $φ_{ϑ} (x_{2})$ for $Ξ = 15$ (top), $Ξ = 10$ (middle) and $Ξ = 7$ (bottom) for under-regularization ( $ϑ = 10^{- 15}$ ), over-regularization ( $ϑ = 1$ ) and appropriate regularization ( $ϑ = 5 10^{- 10} \approx ϑ^{*}$ ). Note that points for $Ξ = 15$ and $Ξ = 10$ (over-parameterization) with $ϑ = 10^{- 15}$ (under-regularization) are not presented because of divergence. The noise variance $ϵ_{l}^{2} = 0.001$ was used after generating the synthetic data.

3. Optical tomography

3.1. Forward model and inversion setting

Solving the optical tomography inverse problem is usually based on the minimization of a cost function which depends on the discrepancy between some measurements and the related predictions, the latter being a solution of the forward model which is, in the present case, based on the 2D diffuse approximation. The complex photon density, $φ : D \mapsto C$ , is mathematically described by the diffuse approximation model expressed in the frequency domain [Citation1]:(7) $\begin{matrix} - \nabla \cdot ({[2 (κ + σ)]}^{- 1} \nabla φ) + (κ + \frac{2 π i ν}{c}) φ & = 0 i n D \end{matrix}$ (7) (8) $\begin{matrix} φ + \frac{A}{2 π^{- 1}} {[2 (κ + σ)]}^{- 1} \nabla φ \cdot n & = \frac{I}{π^{- 1}} 1_{[ζ \in \partial D_{s}]} o n \partial D \end{matrix}$ (8)

where $κ$ and $σ$ are the absorption and reduced scattering coefficients, respectively, $c$ is the speed of light in the medium, $n$ is the unit outward normal vector, $I$ is the total inward flux of the amplitude-modulated diffuse source at the frequency $ν$ , $1_{[\cdot]}$ denotes the indicator function, $\partial D_{s}$ depicts the light source location, $i$ is the imaginary unit and $A$ is the parameter which characterizes the reflection at the boundary and can be derived from Fresnel’s law if specular reflection is considered [Citation18] or from experimental set-ups.[Citation32]

The forward problem consists in solving Equations (Equation7(7) $\begin{matrix} - \nabla \cdot ({[2 (κ + σ)]}^{- 1} \nabla φ) + (κ + \frac{2 π i ν}{c}) φ & = 0 i n D \end{matrix}$ (7) )–(Equation8(8) $\begin{matrix} φ + \frac{A}{2 π^{- 1}} {[2 (κ + σ)]}^{- 1} \nabla φ \cdot n & = \frac{I}{π^{- 1}} 1_{[ζ \in \partial D_{s}]} o n \partial D \end{matrix}$ (8) ) for the photon density $φ$ assuming the absorption coefficient $κ$ , the reduced scattering coefficient $σ$ , the location of the light source $\partial D_{s}$ and values of $ν$ , $c$ , $A$ and $I$ are known. In contrast, the inverse problem consists in estimating the space-dependent radiative properties $κ$ and $σ$ on the basis of photon density measurements $\overset{˘}{φ}$ on $\partial D$ , under the assumption that the location of the light source $\partial D_{s}$ and values of $ν$ , $c$ , $A$ and $I$ are known.

More specifically, the forward model leads to compute the state $φ$ that depends on radiative properties $γ = (κ, σ)$ . The difference (in the least squares sense) between this density and the measured one is integrated to the cost function to be minimized $j (γ) = J (φ)$ :(9) $\begin{matrix} J (φ) = \frac{1}{2} \sum_{k = 1}^{K} \sum_{d = 1}^{D} {|\frac{φ_{(k; d)} - {\overset{˘}{φ}}_{(k; d)}}{{\overset{˘}{φ}}_{(k; d)}}|}^{2} \end{matrix}$ (9)

where the index $k$ defines the source (test) number and the index $d$ defines the detection number. The solution of the inverse problem requires an optimization problem formulation of the kind: ‘Find the functions $κ^{*} (x)$ and $σ^{*} (x)$ such that $j (κ^{*}, σ^{*}) = min j (κ, σ)$ ’.

3.2. Cost function derivative

In order to derive the Gauss–Newton algorithm, one first needs to recall basic definitions of directional derivatives. Following,[Citation33] let us denote the point $(κ, σ) = γ \in Λ^{2} \subset {[L_{2} (D)]}^{2}$ and directions $η$ and $ζ$ $\in Λ$ . The directional derivative of the state $φ$ at point $γ$ towards $η$ is(10) $\begin{matrix} φ^{'} (γ; η) : = lim_{ε \to 0^{+}} \frac{φ (γ + ε η) - φ (γ)}{ε} \end{matrix}$ (10)

An analogous definition can be stated for the directional derivative of the cost function along with its second-order directional derivative such that:(11) $\begin{matrix} j^{″} (γ; η, ζ) : = lim_{ε \to 0^{+}} \frac{j^{'} (γ + ε ζ; η) - j^{'} (γ; η)}{ε} \end{matrix}$ (11)

Next, the operators involved in Newton’s method are extracted through the following equations:(12) $\begin{matrix} j^{'} (γ; η) = {(\nabla j (γ), η)}_{L_{2} (D)} \end{matrix}$ (12) (13) $\begin{matrix} j^{″} (γ; η, ζ) = {(\nabla^{2} j (γ) η, ζ)}_{L_{2} (D)} \end{matrix}$ (13)

with the inner product is defined as ${(γ, η)}_{L_{2} (D)} = \int_{D} γ η d x$ .

3.3. Parameterization

The solution of the optimization problem relies on iterative minimization algorithms for which control and state spaces must be finite in practice. To do so, let the functions to be determined $α = κ$ , $σ$ and $φ$ be approximated using finite element basis functions ${(ψ_{ξ})}_{ξ = 1}^{Ξ_{α}}$ and ${(Θ_{ξ})}_{ξ = 1}^{Ξ_{φ}}$ , respectively:(14) $\begin{matrix} φ (x) & = \sum_{ξ = 1}^{Ξ_{φ}} Θ_{ξ} (x) φ (x_{ξ}) \end{matrix}$ (14) (15) $\begin{matrix} α (x) & = α_{a p} \sum_{ξ = 1}^{Ξ_{α}} ψ_{ξ} (x) \tilde{α} (x_{ξ}), α = κ, σ \end{matrix}$ (15)

where it should be noted that the dimension of the control space $Ξ_{α}$ is chosen less or equal to that of the state space $Ξ_{φ}$ . As a consequence, projections of functions of the expression (Equation15(15) $\begin{matrix} α (x) & = α_{a p} \sum_{ξ = 1}^{Ξ_{α}} ψ_{ξ} (x) \tilde{α} (x_{ξ}), α = κ, σ \end{matrix}$ (15) ) into the state functional space must be used to make the solution of the system (Equation7(7) $\begin{matrix} - \nabla \cdot ({[2 (κ + σ)]}^{- 1} \nabla φ) + (κ + \frac{2 π i ν}{c}) φ & = 0 i n D \end{matrix}$ (7) ) and (Equation8(8) $\begin{matrix} φ + \frac{A}{2 π^{- 1}} {[2 (κ + σ)]}^{- 1} \nabla φ \cdot n & = \frac{I}{π^{- 1}} 1_{[ζ \in \partial D_{s}]} o n \partial D \end{matrix}$ (8) ) possible. Moreover, parameters are adimensionalized with a priori properties $α_{a p} = κ_{a p}, σ_{a p}$ such that $α = α_{a p} \tilde{α}$ , in order to make both functions $\tilde{α} = ϰ (x), ς (x)$ to be searched about unity.

Let us note $\tilde{γ} = (ϰ, ς)$ . Parameterization of radiative property map functions allows the construction of the Gauss–Newton matrix system which is written as:(16) $\begin{matrix} {\tilde{\nabla}}^{2} j (\tilde{γ}) δ \tilde{γ} = - \tilde{\nabla} j (\tilde{γ}) \end{matrix}$ (16)

with state second-order derivatives assumed to be negligible when compared to first-order state derivatives and where vector $\tilde{\nabla} j$ and matrix ${\tilde{\nabla}}^{2} j$ represent continuous gradient $\nabla j$ and approached Hessian matrix $\nabla^{2} j$ of (Equation12(12) $\begin{matrix} j^{'} (γ; η) = {(\nabla j (γ), η)}_{L_{2} (D)} \end{matrix}$ (12) ) and (Equation13(13) $\begin{matrix} j^{″} (γ; η, ζ) = {(\nabla^{2} j (γ) η, ζ)}_{L_{2} (D)} \end{matrix}$ (13) ) decomposed on finite element basis functions ${(ψ_{ξ})}_{ξ = 1}^{Ξ_{α}}$ . Finally, the Gauss–Newton matrix system to be solved in order to update radiative properties at each iteration is expressed as:(17) $\begin{matrix} R (S^{⊤} \bar{S}) δ \tilde{γ} = - R (S^{⊤} \bar{R}) \end{matrix}$ (17)

where $S^{⊤}$ and $\bar{S}$ are the transposed and the conjugate of $S$ , respectively, and $R (\cdot)$ denotes the real part of the imaginary vector or matrix. The forward model behaving nonlinearly with respect to properties $\tilde{γ} = (ϰ, ς)$ , the increment $δ {\tilde{γ}}^{l} = {\tilde{γ}}^{l} - {\tilde{γ}}^{l - 1}$ given by the solution of the Gauss–Newton Equation (Equation17(17) $\begin{matrix} R (S^{⊤} \bar{S}) δ \tilde{γ} = - R (S^{⊤} \bar{R}) \end{matrix}$ (17) ) is solved several times until convergence is attained. With the nomenclature defined earlier, the expressions for $S$ and $R$ are obtained such that:(18) $\begin{matrix} S & = {(\frac{{(φ_{ϰ}^{'})}_{(k; d)}^{ξ}}{|{\overset{˘}{φ}}_{(k; d)}|}, \frac{{(φ_{ς}^{'})}_{(k; d)}^{ξ}}{|{\overset{˘}{φ}}_{(k; d)}|})}_{(k; d) = (1, \dots, K; 1, \dots, D)}^{ξ = 1, \dots, Ξ_{α}} \end{matrix}$ (18) (19) $\begin{matrix} R & = {(\frac{φ_{(k; d)} - {\overset{˘}{φ}}_{(k; d)}}{|{\overset{˘}{φ}}_{(k; d)}|})}_{(k; d) = (1, \dots, K; 1, \dots, D)} \end{matrix}$ (19)

where $K$ and $D$ are the number of sources and sensors, respectively.

$S \in C^{(K \times D) \times 2 Ξ_{α}}$ , $R \in C^{(K \times D)}$ , and derivative functions $φ_{ϰ}^{'}$ and $φ_{ς}^{'}$ can be obtained directly by differentiating the forward model (Equation7(7) $\begin{matrix} - \nabla \cdot ({[2 (κ + σ)]}^{- 1} \nabla φ) + (κ + \frac{2 π i ν}{c}) φ & = 0 i n D \end{matrix}$ (7) ) and (Equation8(8) $\begin{matrix} φ + \frac{A}{2 π^{- 1}} {[2 (κ + σ)]}^{- 1} \nabla φ \cdot n & = \frac{I}{π^{- 1}} 1_{[ζ \in \partial D_{s}]} o n \partial D \end{matrix}$ (8) ) with respect to the adimensionalized radiative properties $ϰ$ and $ς$ , respectively. $φ_{ϰ}^{'}$ and $φ_{ς}^{'}$ are solutions of the following partial differential equations:(20) $\begin{matrix} - \nabla \cdot ({[2 (κ + σ)]}^{- 1} \nabla φ_{ϰ}^{'}) + (κ + \frac{2 π i ν}{c}) φ_{ϰ}^{'} & = - κ_{a p} ϰ^{'} φ - κ_{a p} \nabla \cdot (\frac{ϰ^{'} \nabla φ}{2 {(κ + σ)}^{2}}) i n D \end{matrix}$ (20) (21) $\begin{matrix} φ_{ϰ}^{'} + \frac{A}{2 π^{- 1}} {[2 (κ + σ)]}^{- 1} \nabla φ_{ϰ}^{'} \cdot n & = κ_{a p} \frac{A}{2 π^{- 1}} \frac{ϰ^{'} \nabla φ}{2 {(κ + σ)}^{2}} \nabla φ \cdot n o n \partial D \end{matrix}$ (21)

and(22) $\begin{matrix} - \nabla \cdot ({[2 (κ + σ)]}^{- 1} \nabla φ_{ς}^{'}) + (κ + \frac{2 π i ν}{c}) φ_{ς}^{'} & = - σ_{a p} \nabla \cdot (\frac{ς^{'} \nabla φ}{2 {(κ + σ)}^{2}}) i n D \end{matrix}$ (22) (23) $\begin{matrix} φ_{ς}^{'} + \frac{A}{2 π^{- 1}} {[2 (κ + σ)]}^{- 1} \nabla φ_{ς}^{'} \cdot n & = σ_{a p} \frac{A}{2 π^{- 1}} \frac{ς^{'} \nabla φ}{2 {(κ + σ)}^{2}} \nabla φ \cdot n o n \partial D \end{matrix}$ (23)

where $ϰ^{'}$ , $ς^{'}$ denote the perturbations. Algorithm 1 gives some explanations about the construction of vector $R$ and matrix $S$ involved in (Equation17(17) $\begin{matrix} R (S^{⊤} \bar{S}) δ \tilde{γ} = - R (S^{⊤} \bar{R}) \end{matrix}$ (17) ).

3.4. Tikhonov regularization

The Tikhonov regularization method consists in adding a penalization term $ϑ J^{+}$ which is based on priors to the cost to be minimized (Equation9(9) $\begin{matrix} J (φ) = \frac{1}{2} \sum_{k = 1}^{K} \sum_{d = 1}^{D} {|\frac{φ_{(k; d)} - {\overset{˘}{φ}}_{(k; d)}}{{\overset{˘}{φ}}_{(k; d)}}|}^{2} \end{matrix}$ (9) ):(24) $\begin{matrix} J^{+} (ϰ, ς) = \frac{1}{2} \sum_{\tilde{α} = ϰ, ς} \int_{D} {(\tilde{α} - {\tilde{α}}_{ap})}^{2} d x \end{matrix}$ (24) $ϑ$ being the Tikhonov parameter, and ${\tilde{α}}_{ap}$ being equal to 1 as explained above. This has been successfully used in the optical tomography area, e.g. by [Citation14–Citation19] when using the Gauss–Newton optimizer. Doing so, the optimization problem becomes ‘Find the functions $ϰ_{ϑ}^{*} (x)$ and $ς_{ϑ}^{*} (x)$ such that $j_{ϑ} (ϰ_{ϑ}^{*}, ς_{ϑ}^{*}) = min j_{ϑ} (ϰ, ς)$ ’ with $j_{ϑ} = J + ϑ J^{+}$ .

The simultaneous use of several regularizations together has been studied in [Citation5, Citation6], among which the reduction of the control space dimension [Citation4] with Tikhonov regularization. It is shown here that the control space reduction modifies only slightly the optimal Tikhonov parameter.

The control space mesh is assumed to be chosen sufficiently fine to ensure that the spatial fluctuations of the radiative properties are well taken into account. Thus, the projections from the control functional space to the state functional space are such that the state is almost independent of the number of degrees of freedom $Ξ_{α}$ , and thus the cost function $J$ is also almost independent of $Ξ_{α}$ . Next, to simplify the presentation, let us consider only one parameter, say $ϰ$ .

Proposition 3.1

Under the following assumptions:

(H1)	$ψ_{ξ}$ are basis functions constant per element on a regular mesh, i.e. Lagrange $P_{0}$ elements are used for the discretization of the function $ϰ$ ,
(H2)	$ϰ \sim N (1, σ_{ϰ}^{2})$ ,

then the optimal Tikhonov parameter is asymptotically independent of the dimension

Ξ_{α}

Proof

Let $ϰ_{ϑ}^{*}$ be a solution of the optimization problem(25) $\begin{matrix} ϰ_{ϑ}^{*} = arg min_{ϰ} (J (φ) + ϑ J^{+} (ϰ)) \end{matrix}$ (25)

with $ϑ$ representing the best compromise between both costs $J (φ)$ and $J^{+} (ϰ)$ . Then, showing $J^{+}$ does not depend on the parameterization yields $ϑ$ independent of $Ξ_{α}$ . Let $\tilde{ϑ} = σ_{ϰ}^{2} ϑ$ and $Δ x = | D | / Ξ_{α}$ be the area of one element of the regular mesh. With H1), we have $ϑ J^{+} (ϰ) = \frac{\tilde{ϑ} | D |}{2} \frac{1}{Ξ_{α}} \sum_{i = 1}^{Ξ_{α}} {(\frac{ϰ_{i} - 1}{σ_{ϰ}})}^{2}$ . With H2), $\frac{ϰ_{i} - 1}{σ_{ϰ}} \sim N (0, 1)$ $\forall i$ , thus $Υ = \sum_{i = 1}^{Ξ_{α}} {(\frac{ϰ_{i} - 1}{σ_{ϰ}})}^{2} \sim χ_{Ξ_{α}}^{2}$ . Then, using the fact that $E (Υ) = Ξ_{α}$ , we have $E (J^{+}) = \frac{σ_{ϰ}^{2} | D |}{2}$ which is independent of $Ξ_{α}$ . This completes the proof. $□$

Remark 3.2

This demonstration can be easily extended to several parameters, say with a function of the type of (Equation24(24) $\begin{matrix} J^{+} (ϰ, ς) = \frac{1}{2} \sum_{\tilde{α} = ϰ, ς} \int_{D} {(\tilde{α} - {\tilde{α}}_{ap})}^{2} d x \end{matrix}$ (24) ) assuming that $σ_{ς} \approx σ_{ϰ}$ .

Remark 3.3

It could also be extended to other finite element parameterizations, but with much heavier calculations.

3.5. Numerical validation

The ability to determine on a coarse mesh a reasonable parameter for the Tikhonov regularization is illustrated on a circular test medium. The rationale behind this choice is to reduce computational time required for getting a Tikhonov parameter so that the solution is close enough to the true solution, at least from an engineering point of view.

Synthetic data and predicted values are collected with the help of eight equally distributed sources and sensors about the disk perimeter $(K = 8)$ . Sources and sensors are alternatively distributed as schematically depicted in Figure . Each sensor contains seven pointwise photon density measurements $(D = 56)$ . Radiative property maps to be recovered are also given in Figure . The areas to be detected are located at about 2.5 cm in opposite locations at $45^{\circ}$ and $225^{\circ}$ , respectively. The defects are $4 / 3$ cm in diameters, while the whole medium has a 4 cm radius. Each probe is 0.8 cm in length. The other physical parameters involved in (Equation7(7) $\begin{matrix} - \nabla \cdot ({[2 (κ + σ)]}^{- 1} \nabla φ) + (κ + \frac{2 π i ν}{c}) φ & = 0 i n D \end{matrix}$ (7) ) and (Equation8(8) $\begin{matrix} φ + \frac{A}{2 π^{- 1}} {[2 (κ + σ)]}^{- 1} \nabla φ \cdot n & = \frac{I}{π^{- 1}} 1_{[ζ \in \partial D_{s}]} o n \partial D \end{matrix}$ (8) ) are fixed to: $n = 1.4$ , $ν = 1 \times 10^{8}$ s $^{- 1}$ , $c = c_{0} / n$ with $c_{0} = 3 \times 10^{10}$ cm s $^{- 1}$ , $I = 0.1$ W cm $^{- 2}$ and $A$ is derived from Fresnel’s law (see [Citation18]).

Figure 3. Test medium geometry representation. Eight sources are located on the boundary. For each source (which constitutes a test), the emerging radiation is measured on all sensors.

The radiative parameters are initialized to the radiative properties of the background considered as priors: $κ^{0} = κ_{a p} = 0.8 {cm}^{- 1}$ and $σ^{0} = σ_{a p} = 20 {cm}^{- 1}$ all over the medium $D$ . Thus, initial control variables $ϰ^{0} = κ^{0} / κ_{a p}$ and $ς^{0} = σ^{0} / σ_{a p}$ are initialized to 1. Synthetic data $\overset{˘}{φ}$ and initial adimensionalized radiative properties $ϰ$ and $ς$ are inputs of the inverse problem. The way in which synthetic data are generated is given hereafter.

Within the inverse procedure, the forward model (Equation7(7) $\begin{matrix} - \nabla \cdot ({[2 (κ + σ)]}^{- 1} \nabla φ) + (κ + \frac{2 π i ν}{c}) φ & = 0 i n D \end{matrix}$ (7) ) and (Equation8(8) $\begin{matrix} φ + \frac{A}{2 π^{- 1}} {[2 (κ + σ)]}^{- 1} \nabla φ \cdot n & = \frac{I}{π^{- 1}} 1_{[ζ \in \partial D_{s}]} o n \partial D \end{matrix}$ (8) ) is solved by a finite element method with Lagrange $P_{1}$ elements, that is, with $φ \in V_{h}^{φ} = V (M_{h}^{φ}, P_{1})$ . The state mesh $M_{h}^{φ}$ is chosen fine enough to ensure that numerical predictions can fit the synthetic data. Synthetic data $\overset{˘}{φ}$ are also generated with the finite element method with Lagrange $P_{1}$ elements but using a much finer mesh $M_{h}^{\overset{˘}{φ}}$ in order to avoid the inverse crime.[Citation34] After projecting the synthetic data in the functional space $V_{h}^{φ}$ , $\overset{˘}{φ}$ are corrupted with a multiplicative white Gaussian noise of signal-to-noise ratio $S N R = 30$ dB: ${\overset{˘}{φ}}_{n o i s y} = \overset{˘}{φ} (1 + 10^{- S N R / 10} \times ε)$ where $ε \sim N (0, 1)$ .

Next, the control space $V_{h}^{α} = V (M_{h}^{α}, P_{1})$ for $α = κ, σ$ relies on a more or less coarse mesh $M_{h}^{α}$ according to cases (see Table ). The FreeFem++ environment has been used to perform all computations.[Citation35]

Table 2. Dimensions of finite element spaces for $φ$ , $\overset{˘}{φ}$ and $α$ .

Display Table

Table 3. CPU time comparisons for the generation of the error curves $E (ϑ)$ .

Display Table

The error curves $E (ϑ)$ are built at the 20th iteration for the three cases described in Table : a fine mesh for the state ( $dim V_{h}^{φ} = 2924$ ), and successively a fine mesh, a medium mesh and a coarse mesh with respectively $dim V_{h}^{α}$ equal to 2924, 1382 and 384 nodes for the parameters. The three error curves are presented all together in Figure . Figure first shows that for $ϑ > 0.4$ , the calculated errors are independent of the mesh coarseness. Otherwise, for $ϑ < 0.3$ , it is seen that the reduction of the control space reduces dramatically the errors. Such reduction of control space thus regularizes the inverse problem preconditioning the optimization.[Citation4] In between, Figure shows that the optimal Tikhonov parameters are found to be quasi-independent of the number of degrees of freedom of the finite control space. Within the range $[10^{- 2}, 10^{+ 2}]$ , the optimal Tikhonov parameter $ϑ^{*}$ is found to be equal to $0.3$ for $Ξ_{α} = 384$ and $0.4$ for $Ξ_{α} = 1382$ and $2924$ . This means that the minimal residual is indeed quasi-independent of the mesh coarsening, i.e. of $dim V_{h}^{α}$ . This is what has been shown in Proposition 3.1.

Table also provides the relative CPU time needed to build such curves for cases 1, 2 and 3 described in Table : the construction of this error curve with the coarsest mesh needs, in this case, roughly the 17th of the initial CPU time related to the construction with the finest mesh. This can be easily understood by examining the construction scheme of the Gauss–Newton matrix system presented above (see Algorithm 1).

Figures and present the reconstructions of both $κ$ and $σ$ obtained with the Gauss–Newton optimizer at the 20th iteration, with $Ξ_{α} = 2924$ , and with the Tikhonov weight equal to 0.2 and 0.4, respectively. Figure shows the divergence of the reconstruction with negative values for $σ$ due to under-regularization: the weight parameter $ϑ$ is too low. Figure presents the same reconstruction but with the Tikhonov parameter $ϑ$ chosen at the minimum of the error curve. Reconstructions are good even in the presence of a 30 dB noise.

Figure 4. Distance from the solution to the target $E (ϑ)$ as a function of the Tikhonov parameter $ϑ$ , for $(κ_{a p}, σ_{a p}) = (0.08, 20) c m^{- 1}$ and three control space dimensions $Ξ_{α}$ equal to $384$ , $1382$ and $2924$ . It is seen that $ϑ^{*} = arg min E (ϑ)$ is quasi-independent of $Ξ$ .

Figure 5. Reconstruction of $κ$ (left) and $σ$ (right) with $ϑ = 0.2$ , $(κ_{a p}, σ_{a p}) = (0.08, 20) {cm}^{- 1}$ and $Ξ_{α} = 2924$ . It should be noted the divergence for the reduced diffusion coefficient.

Figure 6. Reconstruction of $κ$ (left) and $σ$ (right) with $ϑ = 0.4$ , $(κ_{a p}, σ_{a p}) = (0.08, 20) {cm}^{- 1}$ and $Ξ_{α} = 2924$ .

Figure 7. Reconstruction of $κ$ (left) and $σ$ (right) with $ϑ = 1.4$ , $(κ_{a p}, σ_{a p}) = (0.088, 24) {cm}^{- 1}$ and $Ξ_{α} = 2924$ .

Two supplementary numerical tests are considered in order to investigate the behaviour of the inverse method with respect to the initial guess. As previously, initial control variables $ϰ^{0}$ and $ς^{0}$ are initialized to 1, so that the initial radiative properties are equal to the a priori properties $κ_{a p}$ and $σ_{a p}$ . For the first test, a priori properties are fixed to $(κ_{a p}, σ_{a p}) = (0.088, 24) {cm}^{- 1}$ (10 and 20However, it should be noted that, as for the previous case where a priori properties equal the radiative properties of the background, the optimal Tikhonov parameter is found to be slightly smaller for $Ξ_{α} = 384$ ( $1382$ ) than for $Ξ_{α} = 1382$ ( $2924$ ). This can be explained by the fact that the reduction of the control space dimension acts as a regularization method so that the smaller the control space dimension $Ξ_{α}$ , the smaller the optimal Tikhonov parameter $ϑ^{*}$ . As a result, the optimal Tikhonov parameter provided by a strongly reduced control space should not be considered as such for the original control space, but has to be interpreted as a very good indicator: the optimal Tikhonov parameter for the original control space will be slightly larger. Figure presents the reconstructions of both $κ$ and $σ$ obtained with the Gauss–Newton optimizer at the 20th iteration, with $ϑ^{*} = 1.4$ and $Ξ_{α} = 2924$ . It is observed that the reconstructions are less accurate than those obtained when a priori properties equal the radiative properties of the background, but the inclusions can still be distinguished.

Finally, for the second test, the a priori properties are fixed to $(κ_{ap}, σ_{ap}) = (0.096, 28) {cm}^{- 1}$ (20 and 40% larger than radiative properties of the background medium for the absorption and reduced scattering coefficients, respectively). It was observed that the error curves associated to this initial guess are very irregular, so that it is difficult to find an optimal Tikhonov parameter. In fact, the results show that the Gauss–Newton optimizer fails to reconstruct the target properties whatever the value of the Tikhonov parameter. The initial guess is too far from the target so that the descent direction of the Gauss–Newton algorithm is never pointed towards the minimum and leads inevitably to very inaccurate reconstructions whatever the Tikhonov weight. Thus, for such choice of the initial guess, the use of more sophisticated methods, such as the globally convergent ones developed in [Citation21–Citation26], becomes necessary.

4. Conclusion

This paper has demonstrated that the optimal Tikhonov parameter, defined as that which minimizes the distance between the actual solution and the noise-free solution, is almost independent of the control space parameterization. This has first been verified on a standard linear inverse heat conduction problem in which, in phase one, the Tikhonov parameter has been approximated with the singular value decomposition and the maximum discrepancy principle and then, in a second phase, in the sense of the optimal Tikhonov parameter previously defined. Next, this validation has been extended to the nonlinear inverse problem of diffuse optical tomography. The error curves with respect to the Tikhonov parameter showed that their minimum is quasi-independent of the control space parameterization. This may result in CPU time reduction when searching for the optimal Tikhonov parameter, whatever the method used to find it. Of course, this is highly recommended for large size (three-dimensional) objects when the Gauss–Newton optimizer is chosen. Finally, note that the two inverse problems dealt with in this paper were under-determined, as it is the case in most space-dependent function estimation problems. Consequently, the theory has been numerically validated only on these two specific inverse problems. This may be viewed as a limitation, but the need of control space dimension reduction is much less useful for over-determined inverse problems than for space-dependent ones which are usually under-determined.

Acknowledgements

The authors would like to thank Pr. Hecht for providing the very efficient finite element tool FreeFem++.[35] The authors would also like to thank the Levis’s town, the Regional Conference of Elected Representatives of Chaudière-Appalaches, Énergie Valero Inc. and Ecosystem for their financial support to the industrial research chair t3e. The authors also thank the Centre Régional de Calcul Intensif des Pays de la Loire (CCIPL), financed by the French Research Ministry, the Région Pays de la Loire and the University of Nantes, allowing computations on their supercomputers. Finally, the authors would like to sincerely thank the reviewers for their constructive and relevant comments formulated on the first version that significantly enhanced the quality of this paper.

Notes

No potential conflict of interest was reported by the authors.

References

Arridge SR. Optical tomography in medical imaging. Inverse Probl. 1999;15:R41.
Web of Science ®Google Scholar
Hadamard J. Sur les problèmes aux dérivées partielles et leur signification physique [On partial derivative problems and their physical meaning]. Princeton Univ. Bull. 1902;13:49–52.
Google Scholar
Gu X, Xu Y, Jiang H. Mesh-based enhancement schemes in diffuse optical tomography. Med. Phys. 2003;30:861–869.
PubMed Web of Science ®Google Scholar
Chavent G. Nonlinear least squares for inverse problems: theoretical foundations and step-by-step guide for applications. Springer; 2010.
Google Scholar
Favennec Y, Dubot F, Rousseau B, Rousse D. Mixing regularization tools for enhancing regularity in optical tomography applications. In: IPDO-2007-Inverse Problems, Design and Optimization Symposium; Albi; 2013.
Google Scholar
Balima O, Favennec Y, Rousse D. Optical tomography reconstruction algorithm with the finite element method: an optimal approach with regularization tools. J. Comput. Phys. 2013;251:461–479.
Web of Science ®Google Scholar
Morozov VA, Nashed Z, Aries A. Methods for solving incorrectly posed problems. New York (NY): Springer; 1984.
Google Scholar
Goncharskii A, Leonov AS, Yagola AG. A generalized discrepancy principle. USSR Comput. Math. Math. Phys. 1973;13:25–37.
Google Scholar
Golub GH, Heath M, Wahba G. Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics. 1979;21:215–223.
Web of Science ®Google Scholar
Hansen PC, O’Leary DP. The use of the L-curve in the regularization of discrete ill-posed problems. SIAM J. Sci. Comput. 1993;14:1487–1503.
Web of Science ®Google Scholar
O’Leary DP. Near-optimal parameters for Tikhonov and other regularization methods. SIAM J. Sci. Comput. 2001;23:1161–1171.
Web of Science ®Google Scholar
Yagola A, Leonov A, Titarenko V. Data errors and an error estimation for ill-posed problems. Inverse Probl. Eng. 2002;10:117–129.
Web of Science ®Google Scholar
Vogel CR. Non-convergence of the l-curve regularization parameter selection method. Inverse Probl. 1996;12:535.
Web of Science ®Google Scholar
Paulsen KD, Jiang H. Spatially varying optical property reconstruction using a finite element diffusion equation approximation. Med. Phys. 1995;22:691–701.
PubMed Web of Science ®Google Scholar
Schweiger M, Arridge SR, Nissilä I. Gauss–Newton method for image reconstruction in diffuse optical tomography. Phys. Med. Biol. 2005;50:2365.
PubMed Web of Science ®Google Scholar
Niu H, Guo P, Ji L, Zhao Q, Jiang T. Improving image quality of diffuse optical tomography with a projection-error-based adaptive regularization method. Optics Exp. 2008;16:12423–12434.
PubMed Web of Science ®Google Scholar
Tarvainen T, Vauhkonen M, Arridge S. Gauss–Newton reconstruction method for optical tomography using the finite element solution of the radiative transfer equation. J. Quant. Spect. Rad. Trans. 2008;109:2767–2778.
Web of Science ®Google Scholar
Dehghani H, Srinivasan S, Pogue BW, Gibson A. Numerical modelling and image reconstruction in diffuse optical tomography. Philos. Trans. Roy. Soc. A: Math. Phys. Eng. Sci. 2009;367:3073–3093.
PubMed Web of Science ®Google Scholar
Dehghani H, Eames ME, Yalavarthy PK, Davis SC, Srinivasan S, Carpenter CM, Pogue BW, Paulsen KD. Near infrared optical tomography using nirfast: algorithm for numerical model and image reconstruction. Commun. Numer. Meth. Eng. 2009;25:711–732.
Web of Science ®Google Scholar
Björck A. Numerical methods for least squares problems. SIAM; 1996.
Google Scholar
Su J, Shan H, Liu H, Klibanov MV. Reconstruction method with data from a multiple-site continuous-wave source for three-dimensional optical tomography. J. Optical Soc. Am. A. 2006;23:2388–2395.
Google Scholar
Shan H, Klibanov MV, Liu H, Pantong N, Su J. Numerical implementation of the convexification algorithm for an optical diffusion tomograph. Inverse Probl. 2008;24:025006.
Web of Science ®Google Scholar
Shan H, Klibanov MV, Su J, Pantong N, Liu H. A globally accelerated numerical method for optical tomography with continuous wave source. J. Inverse Ill-posed Probl. 2008;16:763–790.
Web of Science ®Google Scholar
Beilina L, Klibanov MV. A globally convergent numerical method for a coefficient inverse problem. SIAM J. Sci. Comput. 2008;31:478–509.
Web of Science ®Google Scholar
Beilina L, Klibanov MV. Approximate global convergence and adaptivity for coefficient inverse problems. Springer Science & Business Media; 2012.
Google Scholar
Su J, Liu Y, Lin Z, Teng S, Rhoden A, Pantong N, Liu H. Reconstructions for continuous-wave diffuse optical tomography by a globally convergent method. J. Appl. Math. Phys. 2014;2:204–213.
Google Scholar
Hensel E, Hills R. Steady-state two-dimensional inverse heat conduction. Numer. Heat Trans. Part B Fund. 1989;15:227–240.
Web of Science ®Google Scholar
Taler J. Nonlinear steady-state inverse heat conduction problem with space-variable boundary conditions. J. Heat Trans. 1992;114:1048–1051.
Web of Science ®Google Scholar
Dulikravich G, Martin T. Inverse shape and boundary condition problems and optimization in heat conduction. Adv. Numer. Heat Trans. 1996;1:381–426.
Google Scholar
Jarny Y. 2011. Lecture 9: Inverse problems & regularized solutions. Eurotherm Spring School METTI 2011: Thermal measurements and inverse techniques. Available from: http://www.sft.asso.fr/document.php?pagendx=12299&project=sft;Roscoff.
Google Scholar
Tikhonov A, Leonov A, Yagola A. Nonlinear ill-posed problems. Vol. 1 and 2. London: Chapman and Hall; 1998.
Google Scholar
Schweiger M, Arridge S, Hiraoka M, Delpy D. The finite element method for the propagation of light in scattering media: boundary and source conditions. Med. Phys. 1995;22:1779–1792.
PubMed Web of Science ®Google Scholar
Lions JL, Faurre P. Cours d’analyse numérique [Lectures on numerical analysis]. Paris: École Polytechnique; 1982.
Google Scholar
Kaipio J. Statistical and computational inverse problems. Vol. 160. Springer; 2005.
Google Scholar
Hecht F. New development in freefem++. J. Numer. Math. 2012;20:251–266.
Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Quasi-optimal Tikhonov penalization and parameterization coarseness in space-dependent function estimation

Abstract

1. Introduction