
Reconstruction of refractive indices from spectral measurements of monodisperse aerosols

Pages 323-361 | Received 28 Oct 2016, Accepted 17 Dec 2016, Published online: 19 Jan 2017

Abstract

For the investigation of two-component aerosols, one needs to know the refractive indices of the two aerosol components. One problem is that these depend on temperature and pressure, so their determination requires a robust measurement instrument such as the FASP device, which can cope with harsh environmental conditions. In this article, we show that the FASP device is capable of measuring the needed refractive indices, provided that monodisperse aerosols of the pure components are available. We determine the particle radii of the monodisperse aerosols needed for this task and investigate how accurate the measurements have to be in order to retrieve refractive indices of sufficient quality for investigations of two-component aerosols.


1. Set-up of the experiment

This paper provides an algorithm for the reconstruction of refractive indices from spectral measurements of monodisperse aerosols. The experiments we are conducting are similar to those presented in [Citation1], with the difference that we use air as the surrounding medium for the aerosol particles and that temperature and pressure may approach 200 °C and 8 bar, respectively. For such harsh conditions, reliable databases of refractive indices do not yet exist. These refractive index databases are needed for the measurement of particle size distributions of polydisperse aerosols using the FASP.

As outlined in [Citation2], the FASP measures light intensities $I_{\mathrm{long}}(\lambda)$ and $I_{\mathrm{short}}(\lambda)$ having passed a long and a short measurement path of lengths $L_{\mathrm{long}}$ and $L_{\mathrm{short}}$, respectively. The evaluation of the FASP measurements is based on the relation
$$\int_0^{\infty} k(r,\lambda)\,n(r)\,\mathrm{d}r = e(\lambda) \quad\text{with}\quad e(\lambda) = -\frac{\log\big(I_{\mathrm{long}}(\lambda)\big)-\log\big(I_{\mathrm{short}}(\lambda)\big)}{L_{\mathrm{long}}-L_{\mathrm{short}}}, \tag{1.1}$$

where $k(r,\lambda) := \pi r^2\, Q_{\mathrm{ext}}\big(m_{\mathrm{med}}(\lambda), m_{\mathrm{part}}(\lambda), r, \lambda\big)$ is the so-called kernel function, $\lambda$ is the wavelength of the incident light, $r$ is the radius of the spherical scattering particle, and $m_{\mathrm{med}}(\lambda)$ and $m_{\mathrm{part}}(\lambda)$ are the refractive indices of the surrounding medium and of the particle material, both depending on the wavelength $\lambda$. The function $Q_{\mathrm{ext}}\big(m_{\mathrm{med}}(\lambda), m_{\mathrm{part}}(\lambda), r, \lambda\big)$ is the Mie extinction efficiency from [Citation3]. The function $n(r)$ is the size distribution of the scattering particles. The right-hand side $e(\lambda)$ in (1.1) is called the spectral extinction.

Now if $n(r)$ is the size distribution of a monodisperse aerosol, in which all particles possess the same radius $r_m$, it is given by $n(r) = n\,\delta(r - r_m)$, where $n$ is the total number of particles and $\delta(r - r_m)$ is a Dirac delta distribution truncated to the positive half-axis. Inserting this into (1.1) gives
$$n\,\pi r_m^2\, Q_{\mathrm{ext}}\big(m_{\mathrm{med}}(\lambda), m_{\mathrm{part}}(\lambda), r_m, \lambda\big) = e(\lambda), \tag{1.2}$$

hence the Mie extinction efficiency is measured directly at the radius rm.

The Mie extinction efficiency is given as an infinite series, i.e.
$$Q_{\mathrm{ext}}\big(m_{\mathrm{med}}(\lambda), m_{\mathrm{part}}(\lambda), r, \lambda\big) = \sum_{n=1}^{\infty} q_n\big(m_{\mathrm{med}}(\lambda), m_{\mathrm{part}}(\lambda), r, \lambda\big).$$

The computation of the coefficient functions qn(mmed(l),mpart(l),r,l) will be discussed in Section 2.

It is clear that in practical computations Qext(mmed(l),mpart(l),r,l) can only be approximated by a truncated series, because only the computation of a finite number of the qn(mmed(l),mpart(l),r,l)’s is practically feasible.

We now fix a wavelength $\lambda$. The complex refractive index $m_{\mathrm{part}}(\lambda)$ for this wavelength is reconstructed from FASP measurements of several monodisperse aerosols with particle radii $r_1, \ldots, r_N$. Let $q(r_1,\lambda), \ldots, q(r_N,\lambda)$ denote the measured spectral extinctions $e(\lambda)$ corresponding to the particle radii $r_1, \ldots, r_N$. We assume that they are contaminated by additive Gaussian noise, i.e. $q(r_i,\lambda) = q_{\mathrm{true}}(r_i,\lambda) + \delta_i$ with $\delta_i \sim \mathcal{N}(0, s_i^2)$ for $i = 1, \ldots, N$. Furthermore, we assume that the standard deviations $s_i$ can be estimated sufficiently accurately from measurements, such that we can regard them as known. We have
$$q_{\mathrm{true}}(r_i,\lambda) = n_i\, \pi r_i^2 \sum_{n=1}^{\infty} q_n\big(m_{\mathrm{med}}(\lambda), m_{\mathrm{part}}(\lambda), r_i, \lambda\big),$$

where $n_i$ is the number of particles having the radius $r_i$. A reconstruction of $m_{\mathrm{part}}(\lambda)$ is then obtained from the set of solutions $M(\lambda)$ of the nonlinear regression problem
$$M(\lambda) := \arg\min_{m \in \mathbb{C}} \sum_{i=1}^{N} \frac{1}{2\left(\frac{s_i}{n_i}\right)^2} \left( \pi r_i^2 \sum_{n=1}^{N_{\mathrm{tr}}} q_n\big(m_{\mathrm{med}}(\lambda), m, r_i, \lambda\big) - \frac{q(r_i,\lambda)}{n_i} \right)^2. \tag{1.3}$$

Note that M(l) contains in general more than one solution, especially when q(ri,l) is perturbed by measurement noise. We discuss nonlinear regression problems with truncated series expansions such as (Equation1.3) in Section 4.

For solving (1.3), we use a global optimization strategy, presented in Section 5, to generate reasonable candidates for start values for a local solver applied to a regularized version of (1.3). Section 8 provides a selection method to find a unique start value out of the candidates. In order to apply a gradient-based local solver, we need the derivatives of the Mie extinction efficiency series, which are discussed in Section 3.
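To make the per-wavelength fitting step concrete, the following Python sketch sets up the weighted residuals of (1.3) for a single wavelength. The helper `q_terms`, which is assumed to return the first $N_{\mathrm{tr}}$ Mie series terms $q_n$ from Section 2, is a hypothetical placeholder and not part of any library; the normalisation by the particle numbers $n_i$ follows Section 5.

```python
import numpy as np
from scipy.optimize import least_squares

def residuals(x, q_meas, radii, n_part, s, m_med, lam, n_tr, q_terms):
    """Weighted residuals of the regression problem (1.3) for one wavelength.

    x = (Re m_part, Im m_part); q_meas[i] is the measured extinction for
    radius radii[i], n_part[i] the particle number, s[i] the noise std.
    q_terms(m_med, m, r, lam, n_tr) is a hypothetical Mie-series helper.
    """
    m = x[0] + 1j * x[1]
    res = np.empty(len(radii))
    for i, (r, q, n, si) in enumerate(zip(radii, q_meas, n_part, s)):
        model = np.pi * r ** 2 * np.sum(q_terms(m_med, m, r, lam, n_tr))
        # normalised data q/n with standard deviation s/n, cf. Section 5;
        # least_squares minimises 0.5 * sum(res**2), matching the 1/2 in (1.3)
        res[i] = (model - q / n) / (si / n)
    return res

# example call (all arguments are illustrative):
# sol = least_squares(residuals, x0=[1.5, 0.1],
#                     args=(q_meas, radii, n_part, s, m_med, lam, 30, q_terms))
```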

2. Mie theory

We recapitulate Mie theory in an absorbing medium as presented in [Citation3]. Our first step is to introduce the complex-valued Riccati–Bessel functions $\xi_n:\mathbb{C}\to\mathbb{C}$ and $\psi_n:\mathbb{C}\to\mathbb{C}$ given by
$$\xi_n(z) = \sqrt{\tfrac{\pi z}{2}}\, J_{n+\frac{1}{2}}(z) \tag{2.1}$$

and
$$\psi_n(z) = \sqrt{\tfrac{\pi z}{2}}\, J_{n+\frac{1}{2}}(z) + i\,\sqrt{\tfrac{\pi z}{2}}\, Y_{n+\frac{1}{2}}(z), \tag{2.2}$$

with the Bessel functions $J_{n+\frac12}:\mathbb{C}\to\mathbb{C}$ and $Y_{n+\frac12}:\mathbb{C}\to\mathbb{C}$ of order $n+\frac12$ of the first and second kind. We define the size parameter $\rho = \frac{2\pi r}{\lambda}$. Then we set $z_{\mathrm{med}} := \rho\, m_{\mathrm{med}}$ and $z_{\mathrm{part}} := \rho\, m_{\mathrm{part}}$. Here and in the following, we omit the wavelength dependence of $m_{\mathrm{med}}$ and $m_{\mathrm{part}}$ for better readability. We write $m_{\mathrm{med}} = n_{\mathrm{med}} + i\,k_{\mathrm{med}}$ and $m_{\mathrm{part}} = n_{\mathrm{part}} + i\,k_{\mathrm{part}}$.

We introduce the so-called Mie coefficients:
$$a_n := \frac{m_{\mathrm{part}}\,\dot\xi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\xi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{part}})}{m_{\mathrm{part}}\,\dot\psi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\psi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{part}})},\qquad
b_n := \frac{m_{\mathrm{part}}\,\xi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\dot\xi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}})}{m_{\mathrm{part}}\,\psi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\dot\psi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}})},$$
$$c_n := \frac{m_{\mathrm{part}}\,\psi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{med}}) - m_{\mathrm{part}}\,\dot\psi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{med}})}{m_{\mathrm{part}}\,\psi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\dot\psi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}})},\qquad
d_n := \frac{m_{\mathrm{part}}\,\dot\psi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{med}}) - m_{\mathrm{part}}\,\psi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{med}})}{m_{\mathrm{part}}\,\dot\psi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\psi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{part}})}. \tag{2.3}$$

With the Mie coefficients, we can express the coefficient functions
$$A_n(\rho, m_{\mathrm{med}}, m_{\mathrm{part}}) := \frac{\lambda}{2\pi m_{\mathrm{part}}} \left( |c_n|^2\, \xi_n(z_{\mathrm{part}})\,\overline{\dot\xi_n(z_{\mathrm{part}})} - |d_n|^2\, \dot\xi_n(z_{\mathrm{part}})\,\overline{\xi_n(z_{\mathrm{part}})} \right) \tag{2.4}$$

and
$$B_n(\rho, m_{\mathrm{med}}, m_{\mathrm{part}}) := \frac{\lambda}{2\pi m_{\mathrm{med}}} \left( |a_n|^2\, \dot\psi_n(z_{\mathrm{med}})\,\overline{\psi_n(z_{\mathrm{med}})} - |b_n|^2\, \psi_n(z_{\mathrm{med}})\,\overline{\dot\psi_n(z_{\mathrm{med}})} \right), \tag{2.5}$$

which finally occur in the series expansion
$$Q_{\mathrm{ext}}(r, \lambda, m_{\mathrm{med}}, m_{\mathrm{part}}) = \frac{\lambda}{2\, c\, I(r,\lambda)} \sum_{n=1}^{\infty} (2n+1)\, \mathrm{Im}\big(A_n(\rho, m_{\mathrm{med}}, m_{\mathrm{part}}) + B_n(\rho, m_{\mathrm{med}}, m_{\mathrm{part}})\big). \tag{2.6}$$

Here the quantity $I(r,\lambda)$ is the average incident intensity of light with wavelength $\lambda$ for a spherical particle with radius $r$, and $c$ denotes the speed of light in vacuum. The function $I(r,\lambda)$ is given by
$$I(r,\lambda) = \begin{cases} \dfrac{\lambda^2\, n_{\mathrm{med}}^2}{8\pi\, k_{\mathrm{med}}^2\, c} \left( 1 + \left( \dfrac{4\pi k_{\mathrm{med}} r}{\lambda} - 1 \right) e^{\frac{4\pi k_{\mathrm{med}} r}{\lambda}} \right), & k_{\mathrm{med}} \neq 0, \\[2ex] \dfrac{\pi r^2\, n_{\mathrm{med}}^2}{c}, & k_{\mathrm{med}} = 0. \end{cases} \tag{2.7}$$

Obviously we cannot evaluate (2.6) exactly, because we cannot compute an infinite sum with limited processing resources. Therefore, we have to truncate this series expansion. In [Citation4], a commonly used truncation index $N_{\mathrm{trunc}}$ is presented, which is given by
$$N_{\mathrm{trunc}} = M + 4.05\, M^{1/3} + 2, \quad\text{with}\quad M = \max\big\{\rho,\ |\rho\, m_{\mathrm{med}}|,\ |\rho\, m_{\mathrm{part}}|\big\}. \tag{2.8}$$
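A direct implementation of the truncation criterion (2.8) might look as follows (a minimal Python sketch; rounding the result up to the next integer is our assumption, since a series can only be truncated at an integer index):

```python
import numpy as np

def truncation_index(r, lam, m_med, m_part):
    """Truncation index N_trunc from Eq. (2.8).

    r and lam are given in the same length unit; m_med and m_part are the
    complex refractive indices of medium and particle.
    """
    rho = 2.0 * np.pi * r / lam                      # size parameter
    M = max(abs(rho), abs(rho * m_med), abs(rho * m_part))
    return int(np.ceil(M + 4.05 * M ** (1.0 / 3.0) + 2.0))
```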

3. Derivatives of the truncated Mie efficiency series

The Bessel functions $J_\alpha(z)$ and $Y_\alpha(z)$ for an arbitrary order $\alpha$ fulfil the recurrence relations
$$\frac{\mathrm{d}}{\mathrm{d}z}\big(z^{\alpha} J_{\alpha}(z)\big) = z^{\alpha} J_{\alpha-1}(z) \quad\text{and}\quad \frac{\mathrm{d}}{\mathrm{d}z}\big(z^{\alpha} Y_{\alpha}(z)\big) = z^{\alpha} Y_{\alpha-1}(z), \tag{3.1}$$

cf. [Citation5]. For the Bessel functions occurring in the Riccati–Bessel functions $\xi_n(z)$ and $\psi_n(z)$, it follows with the order $\alpha = n + \frac12$ that
$$\dot\xi_n(z) = \sqrt{\tfrac{\pi z}{2}} \left( J_{n-\frac12}(z) - \frac{n}{z} J_{n+\frac12}(z) \right) \tag{3.2}$$
$$\text{and}\quad \dot\psi_n(z) = \sqrt{\tfrac{\pi z}{2}} \left( J_{n-\frac12}(z) - \frac{n}{z} J_{n+\frac12}(z) \right) + i\,\sqrt{\tfrac{\pi z}{2}} \left( Y_{n-\frac12}(z) - \frac{n}{z} Y_{n+\frac12}(z) \right) \tag{3.3}$$

for $z \neq 0$.

We apply (3.1) a second time to obtain $\ddot\xi_n(z)$, which yields
$$\ddot\xi_n(z) = \frac{\sqrt{\tfrac{\pi z}{2}}}{z^2} \Big( n(n+1)\, J_{n+\frac12}(z) + (1-2n)\, z\, J_{n-\frac12}(z) + z^2\, J_{n-\frac32}(z) \Big).$$

For an arbitrary order $\alpha$, we have the recurrence relation
$$J_{\alpha-1}(z) = \frac{2\alpha}{z} J_{\alpha}(z) - J_{\alpha+1}(z),$$

see [Citation5], and we use it to eliminate the term $J_{n-\frac32}(z)$ in the expression for $\ddot\xi_n(z)$. Then $J_{n-\frac12}(z)$ cancels out as well, such that we obtain the representation
$$\ddot\xi_n(z) = \frac{\sqrt{\tfrac{\pi z}{2}}}{z^2} \big( n(n+1) - z^2 \big)\, J_{n+\frac12}(z) \tag{3.4}$$

only involving $J_{n+\frac12}(z)$.
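For reference, here is a short Python sketch of the Riccati–Bessel functions (2.1)–(2.2) and the derivative formulas (3.2)–(3.4), assuming that scipy.special.jv and scipy.special.yv are acceptable for the complex arguments $z_{\mathrm{med}}$ and $z_{\mathrm{part}}$ occurring here:

```python
import numpy as np
from scipy.special import jv, yv

def _pref(z):
    # common prefactor sqrt(pi*z/2) of the Riccati-Bessel functions
    return np.sqrt(np.pi * z / 2.0)

def xi(n, z):
    """Riccati-Bessel function xi_n, Eq. (2.1)."""
    return _pref(z) * jv(n + 0.5, z)

def psi(n, z):
    """Riccati-Bessel function psi_n, Eq. (2.2)."""
    return _pref(z) * (jv(n + 0.5, z) + 1j * yv(n + 0.5, z))

def xi_dot(n, z):
    """First derivative of xi_n, Eq. (3.2)."""
    return _pref(z) * (jv(n - 0.5, z) - n / z * jv(n + 0.5, z))

def psi_dot(n, z):
    """First derivative of psi_n, Eq. (3.3)."""
    return (_pref(z) * (jv(n - 0.5, z) - n / z * jv(n + 0.5, z))
            + 1j * _pref(z) * (yv(n - 0.5, z) - n / z * yv(n + 0.5, z)))

def xi_ddot(n, z):
    """Second derivative of xi_n in the reduced form of Eq. (3.4)."""
    return _pref(z) / z ** 2 * (n * (n + 1) - z ** 2) * jv(n + 0.5, z)
```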

We recapitulate the Cauchy–Riemann equations in their complex form. For a holomorphic function $f:\mathbb{C}\to\mathbb{C}$ with $f(z) = f(x+iy) = u(x,y) + i\,v(x,y)$, it holds that
$$\dot f(z) = \frac{\mathrm{d}}{\mathrm{d}z} f(z) = \frac{\partial}{\partial x} f(x+iy) = -i\, \frac{\partial}{\partial y} f(x+iy). \tag{3.5}$$

From this it follows that
$$\frac{\partial u}{\partial x} = \mathrm{Re}\big(\dot f(z)\big), \quad \frac{\partial u}{\partial y} = -\mathrm{Im}\big(\dot f(z)\big), \quad \frac{\partial v}{\partial x} = \mathrm{Im}\big(\dot f(z)\big) \quad\text{and}\quad \frac{\partial v}{\partial y} = \mathrm{Re}\big(\dot f(z)\big). \tag{3.6}$$

Using the latter relations and $z_{\mathrm{part}} = \rho\,(n_{\mathrm{part}} + i\,k_{\mathrm{part}})$, we can deduce
$$\frac{\partial}{\partial n_{\mathrm{part}}} \mathrm{Re}\big(\xi_n(z_{\mathrm{part}})\big) = \rho\, \mathrm{Re}\big(\dot\xi_n(z_{\mathrm{part}})\big), \qquad \frac{\partial}{\partial k_{\mathrm{part}}} \mathrm{Re}\big(\xi_n(z_{\mathrm{part}})\big) = -\rho\, \mathrm{Im}\big(\dot\xi_n(z_{\mathrm{part}})\big),$$
$$\frac{\partial}{\partial n_{\mathrm{part}}} \mathrm{Im}\big(\xi_n(z_{\mathrm{part}})\big) = \rho\, \mathrm{Im}\big(\dot\xi_n(z_{\mathrm{part}})\big), \qquad \frac{\partial}{\partial k_{\mathrm{part}}} \mathrm{Im}\big(\xi_n(z_{\mathrm{part}})\big) = \rho\, \mathrm{Re}\big(\dot\xi_n(z_{\mathrm{part}})\big)$$

and analogously
$$\frac{\partial}{\partial n_{\mathrm{part}}} \mathrm{Re}\big(\dot\xi_n(z_{\mathrm{part}})\big) = \rho\, \mathrm{Re}\big(\ddot\xi_n(z_{\mathrm{part}})\big), \qquad \frac{\partial}{\partial k_{\mathrm{part}}} \mathrm{Re}\big(\dot\xi_n(z_{\mathrm{part}})\big) = -\rho\, \mathrm{Im}\big(\ddot\xi_n(z_{\mathrm{part}})\big),$$
$$\frac{\partial}{\partial n_{\mathrm{part}}} \mathrm{Im}\big(\dot\xi_n(z_{\mathrm{part}})\big) = \rho\, \mathrm{Im}\big(\ddot\xi_n(z_{\mathrm{part}})\big), \qquad \frac{\partial}{\partial k_{\mathrm{part}}} \mathrm{Im}\big(\dot\xi_n(z_{\mathrm{part}})\big) = \rho\, \mathrm{Re}\big(\ddot\xi_n(z_{\mathrm{part}})\big).$$

The Bessel function values
$$J_{0+\frac12}(z_{\mathrm{med}}), \ldots, J_{N_{\mathrm{trunc}}+\frac12}(z_{\mathrm{med}}), \quad Y_{0+\frac12}(z_{\mathrm{med}}), \ldots, Y_{N_{\mathrm{trunc}}+\frac12}(z_{\mathrm{med}}) \quad\text{and}\quad J_{0+\frac12}(z_{\mathrm{part}}), \ldots, J_{N_{\mathrm{trunc}}+\frac12}(z_{\mathrm{part}})$$

already computed for a function evaluation of the truncated Mie extinction efficiency can be reused for its derivatives.

Now everything is prepared to differentiate the squared magnitude of the Mie coefficient $a_n$ with respect to $n_{\mathrm{part}}$ and $k_{\mathrm{part}}$. It is sufficient to discuss only $|a_n|^2$ in the following, because the differentiation of $|b_n|^2$, $|c_n|^2$ and $|d_n|^2$ works analogously.

First we write the squared norm of the Mie coefficient $a_n$ as $|a_n|^2 = a_n \overline{a_n}$, which gives
$$\frac{\partial}{\partial n_{\mathrm{part}}} |a_n|^2 = \frac{\partial a_n}{\partial n_{\mathrm{part}}}\, \overline{a_n} + a_n\, \frac{\partial \overline{a_n}}{\partial n_{\mathrm{part}}} \quad\text{and}\quad \frac{\partial}{\partial k_{\mathrm{part}}} |a_n|^2 = \frac{\partial a_n}{\partial k_{\mathrm{part}}}\, \overline{a_n} + a_n\, \frac{\partial \overline{a_n}}{\partial k_{\mathrm{part}}}.$$

We write
$$a_n = \frac{E}{D} \quad\text{with}\quad E := m_{\mathrm{part}}\,\dot\xi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\xi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{part}}) \quad\text{and}\quad D := m_{\mathrm{part}}\,\dot\psi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\psi_n(z_{\mathrm{med}})\,\dot\xi_n(z_{\mathrm{part}}),$$

which yields
$$\frac{\mathrm{d}}{\mathrm{d}m_{\mathrm{part}}} a_n = \frac{1}{D^2} \left( \frac{\mathrm{d}E}{\mathrm{d}m_{\mathrm{part}}}\, D - E\, \frac{\mathrm{d}D}{\mathrm{d}m_{\mathrm{part}}} \right)$$
with
$$\frac{\mathrm{d}E}{\mathrm{d}m_{\mathrm{part}}} = \dot\xi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}}) + m_{\mathrm{part}}\,\dot\xi_n(z_{\mathrm{med}})\,\rho\,\dot\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\xi_n(z_{\mathrm{med}})\,\rho\,\ddot\xi_n(z_{\mathrm{part}})$$
and
$$\frac{\mathrm{d}D}{\mathrm{d}m_{\mathrm{part}}} = \dot\psi_n(z_{\mathrm{med}})\,\xi_n(z_{\mathrm{part}}) + m_{\mathrm{part}}\,\dot\psi_n(z_{\mathrm{med}})\,\rho\,\dot\xi_n(z_{\mathrm{part}}) - m_{\mathrm{med}}\,\psi_n(z_{\mathrm{med}})\,\rho\,\ddot\xi_n(z_{\mathrm{part}}).$$

Furthermore, from (3.5) follow the relations
$$\frac{\partial a_n}{\partial n_{\mathrm{part}}} = \frac{\mathrm{d}a_n}{\mathrm{d}m_{\mathrm{part}}} \quad\text{and}\quad \frac{\partial a_n}{\partial k_{\mathrm{part}}} = \frac{\mathrm{d}a_n}{\mathrm{d}m_{\mathrm{part}}}\, i.$$

Although $\overline{a_n}$ is not holomorphic with respect to $m_{\mathrm{part}}$, we can still compute the partial derivatives $\frac{\partial \overline{a_n}}{\partial n_{\mathrm{part}}}$ and $\frac{\partial \overline{a_n}}{\partial k_{\mathrm{part}}}$. Using (3.6), we obtain the relations
$$\frac{\partial \overline{a_n}}{\partial n_{\mathrm{part}}} = \overline{\frac{\partial a_n}{\partial n_{\mathrm{part}}}} \quad\text{and}\quad \frac{\partial \overline{a_n}}{\partial k_{\mathrm{part}}} = \overline{\frac{\partial a_n}{\partial k_{\mathrm{part}}}}.$$

This completes the computation of $\frac{\partial}{\partial n_{\mathrm{part}}} |a_n|^2$ and $\frac{\partial}{\partial k_{\mathrm{part}}} |a_n|^2$.

For a holomorphic function $f(x+iy)$, we can easily deduce from (3.6) that
$$\frac{\partial}{\partial x}\, \mathrm{Im}\, f(x+iy) = \mathrm{Im}\, \frac{\partial}{\partial x} f(x+iy) \quad\text{and}\quad \frac{\partial}{\partial y}\, \mathrm{Im}\, f(x+iy) = \mathrm{Im}\, \frac{\partial}{\partial y} f(x+iy).$$

Together with (3.5), this gives
$$\frac{\partial}{\partial n_{\mathrm{part}}}\, \mathrm{Im}\, A_n = \mathrm{Im}\, \frac{\partial A_n}{\partial n_{\mathrm{part}}} \quad\text{with}\quad \frac{\partial A_n}{\partial n_{\mathrm{part}}} = \frac{\lambda}{2\pi} \left( \frac{\partial |c_n|^2}{\partial n_{\mathrm{part}}}\, U_1 + |c_n|^2\, \frac{\partial U_1}{\partial n_{\mathrm{part}}} - \frac{\partial |d_n|^2}{\partial n_{\mathrm{part}}}\, U_2 - |d_n|^2\, \frac{\partial U_2}{\partial n_{\mathrm{part}}} \right),$$
where
$$U_1 = \frac{\xi_n(z_{\mathrm{part}})\,\overline{\dot\xi_n(z_{\mathrm{part}})}}{m_{\mathrm{part}}}, \qquad \frac{\partial U_1}{\partial n_{\mathrm{part}}} = \frac{1}{m_{\mathrm{part}}^2} \left( \Big( \rho\,\dot\xi_n(z_{\mathrm{part}})\,\overline{\dot\xi_n(z_{\mathrm{part}})} + \xi_n(z_{\mathrm{part}})\,\overline{\rho\,\ddot\xi_n(z_{\mathrm{part}})} \Big)\, m_{\mathrm{part}} - \xi_n(z_{\mathrm{part}})\,\overline{\dot\xi_n(z_{\mathrm{part}})} \right),$$
and
$$U_2 = \frac{\dot\xi_n(z_{\mathrm{part}})\,\overline{\xi_n(z_{\mathrm{part}})}}{m_{\mathrm{part}}}, \qquad \frac{\partial U_2}{\partial n_{\mathrm{part}}} = \frac{1}{m_{\mathrm{part}}^2} \left( \Big( \rho\,\ddot\xi_n(z_{\mathrm{part}})\,\overline{\xi_n(z_{\mathrm{part}})} + \dot\xi_n(z_{\mathrm{part}})\,\overline{\rho\,\dot\xi_n(z_{\mathrm{part}})} \Big)\, m_{\mathrm{part}} - \dot\xi_n(z_{\mathrm{part}})\,\overline{\xi_n(z_{\mathrm{part}})} \right).$$

Analogously we obtain
$$\frac{\partial}{\partial k_{\mathrm{part}}}\, \mathrm{Im}\, A_n = \mathrm{Im}\, \frac{\partial A_n}{\partial k_{\mathrm{part}}} \quad\text{with}\quad \frac{\partial A_n}{\partial k_{\mathrm{part}}} = \frac{\lambda}{2\pi} \left( \frac{\partial |c_n|^2}{\partial k_{\mathrm{part}}}\, U_1 + |c_n|^2\, \frac{\partial U_1}{\partial k_{\mathrm{part}}} - \frac{\partial |d_n|^2}{\partial k_{\mathrm{part}}}\, U_2 - |d_n|^2\, \frac{\partial U_2}{\partial k_{\mathrm{part}}} \right),$$
where
$$\frac{\partial U_1}{\partial k_{\mathrm{part}}} = \frac{1}{m_{\mathrm{part}}^2} \left( \Big( \rho\,\dot\xi_n(z_{\mathrm{part}})\, i\,\overline{\dot\xi_n(z_{\mathrm{part}})} + \xi_n(z_{\mathrm{part}})\,\overline{\rho\,\ddot\xi_n(z_{\mathrm{part}})\, i} \Big)\, m_{\mathrm{part}} - \xi_n(z_{\mathrm{part}})\,\overline{\dot\xi_n(z_{\mathrm{part}})}\, i \right)$$
and
$$\frac{\partial U_2}{\partial k_{\mathrm{part}}} = \frac{1}{m_{\mathrm{part}}^2} \left( \Big( \rho\,\ddot\xi_n(z_{\mathrm{part}})\, i\,\overline{\xi_n(z_{\mathrm{part}})} + \dot\xi_n(z_{\mathrm{part}})\,\overline{\rho\,\dot\xi_n(z_{\mathrm{part}})\, i} \Big)\, m_{\mathrm{part}} - \dot\xi_n(z_{\mathrm{part}})\,\overline{\xi_n(z_{\mathrm{part}})}\, i \right).$$

The derivatives of $\mathrm{Im}\, B_n$ are much easier to compute, since the dependence on $n_{\mathrm{part}}$ and $k_{\mathrm{part}}$ lies only in $|a_n|^2$ and $|b_n|^2$ here. So we can deduce
$$\frac{\partial}{\partial n_{\mathrm{part}}}\, \mathrm{Im}\, B_n = \mathrm{Im}\, \frac{\partial B_n}{\partial n_{\mathrm{part}}} \quad\text{with}\quad \frac{\partial B_n}{\partial n_{\mathrm{part}}} = \frac{\lambda}{2\pi} \left( \frac{\partial |a_n|^2}{\partial n_{\mathrm{part}}}\, \frac{\dot\psi_n(z_{\mathrm{med}})\,\overline{\psi_n(z_{\mathrm{med}})}}{m_{\mathrm{med}}} - \frac{\partial |b_n|^2}{\partial n_{\mathrm{part}}}\, \frac{\psi_n(z_{\mathrm{med}})\,\overline{\dot\psi_n(z_{\mathrm{med}})}}{m_{\mathrm{med}}} \right).$$

Analogously we get
$$\frac{\partial}{\partial k_{\mathrm{part}}}\, \mathrm{Im}\, B_n = \mathrm{Im}\, \frac{\partial B_n}{\partial k_{\mathrm{part}}} \quad\text{with}\quad \frac{\partial B_n}{\partial k_{\mathrm{part}}} = \frac{\lambda}{2\pi} \left( \frac{\partial |a_n|^2}{\partial k_{\mathrm{part}}}\, \frac{\dot\psi_n(z_{\mathrm{med}})\,\overline{\psi_n(z_{\mathrm{med}})}}{m_{\mathrm{med}}} - \frac{\partial |b_n|^2}{\partial k_{\mathrm{part}}}\, \frac{\psi_n(z_{\mathrm{med}})\,\overline{\dot\psi_n(z_{\mathrm{med}})}}{m_{\mathrm{med}}} \right).$$

4. Nonlinear regression using truncated series expansions

We wish to reconstruct the refractive indices of a particle material from spectral measurements by solving a nonlinear regression problem of the form
$$X_{t,\delta} := \arg\min_{x \in \mathbb{R}^D} \sum_{i=1}^{N} \frac{1}{2\sigma_i^2} \left( \sum_{n=1}^{t} a_n^i(x) - \sum_{n=1}^{\infty} a_n^i(x_{\mathrm{true}}) - \delta_i \right)^2, \tag{4.1}$$

where $t \in \mathbb{N}$ is a finite truncation index and $\delta_i \sim \mathcal{N}(0, \sigma_i^2)$. Remember that $N$ represents the number of particle radii $r_i$ of the different monodisperse aerosols we are investigating. We still assume that for each radius $r_i$ the standard deviation $\sigma_i$ is determined well enough from a set of experiments, such that it can be regarded as known. We have to confine ourselves to a finite truncation index $t$, because it is practically not feasible to compute all coefficient functions $a_n^i(x)$, $n \in \mathbb{N}$, for $i = 1, \ldots, N$. Throughout this paper, we assume that the feasible set $\Omega$ is compact.

We define the functions $f_t:\mathbb{R}^D \to \mathbb{R}^N$ and $f:\mathbb{R}^D \to \mathbb{R}^N$ by
$$f_t(x) := \left( \sum_{n=1}^{t} a_n^1(x), \ldots, \sum_{n=1}^{t} a_n^N(x) \right)^T \quad\text{and}\quad f(x) := \left( \sum_{n=1}^{\infty} a_n^1(x), \ldots, \sum_{n=1}^{\infty} a_n^N(x) \right)^T.$$

We set $e := f(x_{\mathrm{true}}) + \delta$ with $\delta := (\delta_1, \ldots, \delta_N)^T$. Then the observed probability density is given by
$$p_{\mathrm{observed}}(e\,|\,x) := (2\pi)^{-\frac{N}{2}}\, |\det(\Sigma_\sigma)|^{-\frac12}\, \exp\!\left( -\tfrac12 \big\| \Sigma_\sigma^{-\frac12} \big( f_t(x) - e \big) \big\|_2^2 \right)$$

with the covariance matrix $\Sigma_\sigma := \mathrm{diag}\big(\sigma_1^2, \ldots, \sigma_N^2\big)$. We know a priori that the vector $x$ specifying our model $f_t(x)$ lies within the set $\Omega$. This knowledge can be expressed with the prior probability density
$$p_{\mathrm{prior}}(x) := \mathrm{vol}(\Omega)^{-1}\, \mathbb{I}_\Omega(x),$$

where $\mathbb{I}_\Omega$ is the indicator function of $\Omega$. Now $X_{t,\delta}$ is the set of MAP estimators of the posterior probability density, i.e.
$$X_{t,\delta} := \arg\max_{x}\, p_{\mathrm{posterior}}(x\,|\,e) \quad\text{with}\quad p_{\mathrm{posterior}}(x\,|\,e) \propto p_{\mathrm{observed}}(e\,|\,x)\, p_{\mathrm{prior}}(x) \propto \exp\!\left( -\tfrac12 \big\| \Sigma_\sigma^{-\frac12} \big( f_t(x) - e \big) \big\|_2^2 \right) \mathbb{I}_\Omega(x). \tag{4.2}$$

We carry out all the following investigations under the next assumption on the covariance matrix:

Assumption 4.1:

The covariance matrix $\Sigma_\sigma$ has the simple form
$$\Sigma_\sigma = \delta^2 \cdot \mathrm{diag}\big(\sigma_1^2, \ldots, \sigma_N^2\big) =: \delta^2 \cdot \Sigma,$$

where $\delta > 0$ is an arbitrary but fixed noise level and $\sigma_1, \ldots, \sigma_N$ are fixed.

To simplify notation, we introduce the two functions $f_t:\mathbb{R}^D \to \mathbb{R}^N$ and $g_t:\mathbb{R}^D \to \mathbb{R}^N$ depending on the truncation index $t$ and defined by
$$f_t(x)_i := \sum_{n=1}^{\lfloor t \rfloor} a_n^i(x) + (t - \lfloor t \rfloor)\, a_{\lfloor t \rfloor + 1}^i(x) \quad\text{and}\quad g_t(x)_i := f(x)_i - f_t(x)_i, \quad\text{for } i = 1, \ldots, N.$$

In the following, we will investigate how an element $x_{t,\delta}$ of the set $X_{t,\delta}$ depends on the truncation index $t$. We change to a continuous truncation index here, i.e. from now on we pass from (4.1) to the new regression problem
$$X_{t,\delta} := \arg\min_{x \in \mathbb{R}^D} F_{t,\delta}(x) \quad\text{s.t.}\quad x \in \Omega, \qquad\text{with}\quad F_{t,\delta}(x) := \big\| \Sigma^{-\frac12} \big( f_t(x) - f(x_{\mathrm{true}}) - \delta \big) \big\|_2^2, \tag{4.3}$$

where the truncation index $t \geq 0$ is allowed to be non-integer.
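The continuous truncation can be implemented directly. In the Python sketch below, coeff_fun(n, x) is a hypothetical stand-in for the vector of coefficient functions $(a_n^1(x), \ldots, a_n^N(x))$ and is not part of any library:

```python
import numpy as np

def f_t(x, t, coeff_fun, N):
    """Continuously truncated model f_t(x) as defined above."""
    t_floor = int(np.floor(t))
    total = np.zeros(N)
    for n in range(1, t_floor + 1):
        total += coeff_fun(n, x)
    # the fractional part of the (floor(t)+1)-th term realises the
    # non-integer truncation index
    total += (t - t_floor) * coeff_fun(t_floor + 1, x)
    return total
```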

As a preparation we prove the following technical lemma, which will form the basis of our continuity and convergence results.

Lemma 4.2:

Let the twice continuously differentiable function $F:\mathbb{R}^N \to \mathbb{R}$ have a strict local minimum $x_0$ inside a compact set $S \subset \mathbb{R}^N$. Let the function $h:\mathbb{R}^N \times \mathbb{R} \to \mathbb{R}$ have the property $\lim_{\varepsilon \to 0} h(x,\varepsilon) = 0$ for all $x \in S$ and let $h(x,\varepsilon)$ be twice continuously differentiable with respect to $x$ and continuous in $\varepsilon$. Furthermore, we assume that the local minima $x_\varepsilon$ of $F_\varepsilon(x) := F(x) + h(x,\varepsilon)$ are strict for any $\varepsilon > 0$. Then there exists a sequence of local minima $x_\varepsilon$ of $F_\varepsilon(x)$ with $\lim_{\varepsilon \to 0} x_\varepsilon = x_0$.

The strategy of the proof is to construct for given ε a neighbourhood of x0 which must contain a local minimizer xε of the perturbed function Fε(x). By sending ε to 0, this neighbourhood shrinks down to the local minimum x0 itself, thus yielding the convergence of xε to x0. To have this neighbourhood shrink down to x0, it is crucially important that x0 must be a strict local minimum.

We define $d(\varepsilon) := \sup_{x \in S} |h(x,\varepsilon)|$. From $\lim_{\varepsilon \to 0} h(x,\varepsilon) = 0$ for all $x \in S$ follows $\lim_{\varepsilon \to 0} d(\varepsilon) = 0$. Let us now introduce the function $F^-(x) := F(x) - d(\varepsilon)$. Obviously, $x_0$ is also a local minimum of $F^-(x)$, so for $\varepsilon$ sufficiently small there exists a neighbourhood $U_{2d(\varepsilon)}(x_0) \subset S$ of $x_0$ with
$$F^-(x) \geq F^-(x_0) \quad\text{and}\quad F^-(x) - F^-(x_0) \leq 2d(\varepsilon) \quad\text{for all}\quad x \in U_{2d(\varepsilon)}(x_0).$$

In particular, we have
$$\forall x \in \partial U_{2d(\varepsilon)}(x_0):\quad F^-(x) = F^-(x_0) + 2d(\varepsilon) = F(x_0) + d(\varepsilon).$$

Let us assume that there exists an $x \in \partial U_{2d(\varepsilon)}(x_0)$ with
$$F_\varepsilon(x) < F(x_0) + d(\varepsilon) = F^-(x) = F(x) - d(\varepsilon).$$

Then $F_\varepsilon(x) = F(x) + h(x,\varepsilon)$ implies $-d(\varepsilon) > h(x,\varepsilon)$, hence $-h(x,\varepsilon) > d(\varepsilon) \geq -h(x,\varepsilon)$ by the definition of $d(\varepsilon)$, a contradiction. Therefore we conclude
$$\forall x \in \partial U_{2d(\varepsilon)}(x_0):\quad F_\varepsilon(x) \geq F(x_0) + d(\varepsilon). \tag{4.4}$$

Since $F_\varepsilon(x)$ is continuous and $\overline{U}_{2d(\varepsilon)}(x_0)$ is compact for $\varepsilon$ small enough, there exists an $x_\varepsilon \in \overline{U}_{2d(\varepsilon)}(x_0)$ with
$$F_\varepsilon(x_\varepsilon) = \min_{x \in \overline{U}_{2d(\varepsilon)}(x_0)} F_\varepsilon(x).$$

Let us assume $F_\varepsilon(x_\varepsilon) > F(x_0) + d(\varepsilon)$. Then by the definition of $x_\varepsilon$, we get in particular
$$F(x_0) + h(x_0,\varepsilon) = F_\varepsilon(x_0) \geq F_\varepsilon(x_\varepsilon) > F(x_0) + d(\varepsilon),$$

i.e. $h(x_0,\varepsilon) > d(\varepsilon) \geq h(x_0,\varepsilon)$, a contradiction. It follows that
$$F_\varepsilon(x_\varepsilon) \leq F(x_0) + d(\varepsilon) \quad\text{and}\quad F_\varepsilon(x_0) \leq F(x_0) + d(\varepsilon), \tag{4.5}$$

where the latter follows with a proof by contradiction as well.

If it happens that $F_\varepsilon(x_\varepsilon) = F(x_0) + d(\varepsilon)$, then we also have $F_\varepsilon(x_0) = F(x_0) + d(\varepsilon)$. Otherwise we have $F_\varepsilon(x_\varepsilon) < F(x_0) + d(\varepsilon)$, and then (4.4) implies that $x_\varepsilon$ cannot lie on $\partial U_{2d(\varepsilon)}(x_0)$, thus it must lie within the interior of $U_{2d(\varepsilon)}(x_0)$. So in any case (4.5) gives that $U_{2d(\varepsilon)}(x_0)$ must contain a local minimizer $x_\varepsilon$ of $F_\varepsilon(x)$.

Now $\lim_{\varepsilon \to 0} d(\varepsilon) = 0$ gives $\lim_{\varepsilon \to 0} x_\varepsilon = x_0$. The existence of the last limit is guaranteed by the fact that $x_0$ is strict, and the claim is proved.

Proposition 4.3:

Let all coefficient functions $a_n^i(x)$ be twice continuously differentiable and bounded on $\Omega$. We assume that each local minimum $x_{t,\delta}$ of the right-hand side function $F_{t,\delta}(x)$ in (4.3) is strict and lies in the interior of $\Omega$. Then each local minimum depends continuously on the truncation index $t$.

To prove the claim, one could be tempted to apply the implicit function theorem to the equation $d(t, x_{t,\delta}) = 0$ with $d(s,x) := \nabla F_{s,\delta}(x)$. This would give that the local minima are parameterized by a function $m(s)$ with the property $m(t) = x_{t,\delta}$, where $s$ is from a neighbourhood $U(t)$ of $t$. The problem with this approach is that it requires continuous differentiability of $d(s,x)$ in the truncation parameter $s$. Thus the continuous truncation we are using would need more complicated methods such as spline interpolation of the partial sums, which would increase the overall computational effort.

Therefore, we use in the following a more direct approach to prove the claim. Let $\varepsilon > 0$ be arbitrary. First we consider an integer truncation index $t \in \mathbb{N}$, i.e. we have $\lfloor t \rfloor = t$. Now for $\varepsilon$ small enough, we get $\lfloor t + \varepsilon \rfloor = t$ and $\lfloor t - \varepsilon \rfloor = t - 1$. This gives
$$f_{t+\varepsilon}(x)_i = f_t(x)_i + \varepsilon\, a_{t+1}^i(x) \quad\text{and}\quad f_{t-\varepsilon}(x)_i = f_{t-1}(x)_i + (1-\varepsilon)\, a_t^i(x) = f_t(x)_i - \varepsilon\, a_t^i(x).$$

As the next step we turn to a non-integer truncation index $t$. In this case, we can always select $\varepsilon$ small enough such that $\lfloor t + \varepsilon \rfloor = \lfloor t \rfloor$ and $\lfloor t - \varepsilon \rfloor = \lfloor t \rfloor$, respectively, hold. This yields
$$f_{t+\varepsilon}(x)_i = f_t(x)_i + \varepsilon\, a_{\lfloor t \rfloor+1}^i(x) \quad\text{and}\quad f_{t-\varepsilon}(x)_i = f_t(x)_i - \varepsilon\, a_{\lfloor t \rfloor+1}^i(x).$$

Now we introduce the function
$$a(x) := \begin{cases} \big( a_t^1(x), \ldots, a_t^N(x) \big)^T, & \text{for } t-\varepsilon,\; t \in \mathbb{N}, \\ \big( a_{\lfloor t \rfloor+1}^1(x), \ldots, a_{\lfloor t \rfloor+1}^N(x) \big)^T, & \text{else}. \end{cases}$$

For $F_{t,\delta}(x) = \| \Sigma^{-\frac12}( f_t(x) - f(x_{\mathrm{true}}) - \delta ) \|_2^2$, this yields
$$F_{t+\varepsilon,\delta}(x) = F_{t,\delta}(x) + 2\varepsilon\, \big\langle \Sigma^{-\frac12} a(x),\; \Sigma^{-\frac12}\big( f_t(x) - f(x_{\mathrm{true}}) - \delta \big) \big\rangle + \varepsilon^2\, \big\| \Sigma^{-\frac12} a(x) \big\|_2^2$$
and
$$F_{t-\varepsilon,\delta}(x) = F_{t,\delta}(x) - 2\varepsilon\, \big\langle \Sigma^{-\frac12} a(x),\; \Sigma^{-\frac12}\big( f_t(x) - f(x_{\mathrm{true}}) - \delta \big) \big\rangle + \varepsilon^2\, \big\| \Sigma^{-\frac12} a(x) \big\|_2^2.$$

Therefore, we obtain both for $F_{t+\varepsilon,\delta}(x)$ and $F_{t-\varepsilon,\delta}(x)$ a decomposition of the form $F_{t+\varepsilon,\delta}(x) = F_{t,\delta}(x) + h_{t,\delta}^{\varepsilon}(x)$ and $F_{t-\varepsilon,\delta}(x) = F_{t,\delta}(x) + h_{t,\delta}^{\varepsilon}(x)$, respectively, where the function $h_{t,\delta}^{\varepsilon}(x)$ is selected appropriately according to the above findings. We can readily check $\lim_{\varepsilon \to 0} |h_{t,\delta}^{\varepsilon}(x)| = 0$ for all $x \in \Omega$ from the boundedness of the $a_n^i(x)$'s. The result then follows from Lemma 4.2.

Corollary 4.4:

Let $t_1$ and $t_2$ be truncation indices with $t_1 < t_2$. Let $x_{t_1,\delta}$ be a local minimizer of (4.1). Let $\gamma \in [0,1]$ and define $t_\gamma := t_1 + \gamma\,(t_2 - t_1)$. Then, beginning at $\gamma = 0$, one can successively find local minimizers $x_{t_\gamma,\delta}$ for the truncation index $t_\gamma$ using numerical continuation, see [Citation6]. Here, for $\gamma_1 < \gamma_2$, the minimizer $x_{t_{\gamma_1},\delta}$ is used as a start vector to compute the next minimizer $x_{t_{\gamma_2},\delta}$. The next parameter $\gamma_2$ has to be sufficiently close to $\gamma_1$, such that the start vector $x_{t_{\gamma_1},\delta}$ still lies within the domain of convergence of Newton's method.

We use Corollary 4.4 to compute $x_{t,\delta}$ for increasing truncation index $t$ in a stable way. If we kept $t$ integer and increased it in integer steps, we might leave the domain of convergence in the continuation method. Therefore, we increase it using a smaller step width.
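A minimal sketch of this continuation strategy in Python, assuming objective(x, t) evaluates $F_{t,\delta}(x)$ and that an unconstrained local solver is adequate in the interior of $\Omega$ (the step width 0.1 anticipates the choice reported in Section 5):

```python
import numpy as np
from scipy.optimize import minimize

def continuation(x_start, t1, t2, objective, step=0.1):
    """Numerical continuation in the truncation index (Corollary 4.4).

    Each converged minimizer is reused as start vector for the next,
    slightly larger truncation index t.
    """
    x = np.asarray(x_start, dtype=float)
    for t in np.arange(t1, t2 + 1e-12, step):
        res = minimize(objective, x, args=(t,), method="BFGS")
        x = res.x    # stay inside the domain of convergence of the next step
    return x
```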

In the following, we investigate how well the minimizers $x_{t,\delta}$ of the noise-contaminated regression problem (4.3) with truncated series expansions approximate the minimizers $x_{\infty,0}$ of the noise-free and untruncated problem
$$x_{\infty,0} := \arg\min_{x \in \mathbb{R}^D} \sum_{i=1}^{N} \frac{1}{2\sigma_i^2} \left( \sum_{n=1}^{\infty} a_n^i(x) - \sum_{n=1}^{\infty} a_n^i(x_{\mathrm{true}}) \right)^2 \quad\text{s.t.}\quad x \in \Omega. \tag{4.6}$$

Proposition 4.5:

Let the noise vector $\delta$ fulfil $\lim_{\delta \to 0} \| \Sigma^{-\frac12}\delta \|_2^2 = 0$ and let the functions $f(x)$ and $f_{t_\delta}(x)$ be bounded on $\Omega$. Assume $\lim_{\delta \to 0} \| \Sigma^{-\frac12} g_{t_\delta}(x) \|_2^2 = 0$ for all $x \in \Omega$. Then for any strict minimizer $x_{\infty,0}$ of the right-hand side function of (4.6) in the interior of $\Omega$ there exist minimizers $x_{t_\delta,\delta}$ of (4.3) with $\lim_{\delta \to 0} x_{t_\delta,\delta} = x_{\infty,0}$. Here we also assume the $x_{t_\delta,\delta}$'s to be strict for all $\delta > 0$.

With the notation introduced before, we can write
$$x_{\infty,0} \in X_{\infty,0} := \arg\min_{x \in \mathbb{R}^D} F_{\infty,0}(x) \quad\text{s.t.}\quad x \in \Omega, \qquad\text{with}\quad F_{\infty,0}(x) := \big\| \Sigma^{-\frac12} \big( f(x) - f(x_{\mathrm{true}}) \big) \big\|_2^2.$$

From the decomposition $f_{t_\delta}(x) = f(x) - g_{t_\delta}(x)$ we obtain
$$F_{t_\delta,\delta}(x) = \big\| \Sigma^{-\frac12}\big( f(x) - f(x_{\mathrm{true}}) - \delta \big) \big\|_2^2 - 2\big\langle \Sigma^{-\frac12} g_{t_\delta}(x),\; \Sigma^{-\frac12}\big( f(x) - f(x_{\mathrm{true}}) - \delta \big) \big\rangle + \big\| \Sigma^{-\frac12} g_{t_\delta}(x) \big\|_2^2.$$

Then a further decomposition of the first term on the right-hand side yields
$$F_{t_\delta,\delta}(x) = F_{\infty,0}(x) - 2\big\langle \Sigma^{-\frac12}\delta,\; \Sigma^{-\frac12}\big( f(x) - f(x_{\mathrm{true}}) \big) \big\rangle + \big\| \Sigma^{-\frac12}\delta \big\|_2^2 - 2\big\langle \Sigma^{-\frac12} g_{t_\delta}(x),\; \Sigma^{-\frac12}\big( f(x) - f(x_{\mathrm{true}}) - \delta \big) \big\rangle + \big\| \Sigma^{-\frac12} g_{t_\delta}(x) \big\|_2^2 =: F_{\infty,0}(x) + H_{t_\delta,\delta}(x).$$

From the limit $\lim_{\delta \to 0} \| \Sigma^{-\frac12} g_{t_\delta}(x) \|_2^2 = 0$, the limit $\lim_{\delta \to 0} \| \Sigma^{-\frac12}\delta \|_2^2 = 0$ and the boundedness of $f(x)$ follows $\lim_{\delta \to 0} |H_{t_\delta,\delta}(x)| = 0$ for arbitrary but fixed $x \in \Omega$.

Then the existence of the $x_{t_\delta,\delta}$'s follows from Lemma 4.2.

Finally, we study how the minimizers $x_{t_\delta,\delta}$ of (4.3) behave for $\delta \to 0$. We begin with a preparatory corollary.

Corollary 4.6:

Let the assumptions of Proposition 4.5 hold. Then for any local minimizer $x_{t_\delta,\delta}$ of (4.3) approximating, for $\delta \to 0$, a local minimizer $x_{\infty,0}$ of (4.6) with $\| \Sigma^{-\frac12}( f(x_{\infty,0}) - f(x_{\mathrm{true}}) ) \|_2 = 0$, we have
$$\lim_{\delta \to 0} \big\| \Sigma^{-\frac12}\big( f_{t_\delta}(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) - \delta \big) \big\|_2 = 0.$$

The assumptions of Proposition 4.5 give
$$\lim_{\delta \to 0} \| \Sigma^{-\frac12}\delta \|_2^2 = 0 \quad\text{and}\quad \lim_{\delta \to 0} \| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}) \|_2^2 = 0.$$

We have by continuity of $f(x)$ that $\lim_{\delta \to 0} \| \Sigma^{-\frac12}( f(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) ) \|_2 = 0$. Then
$$\big\| \Sigma^{-\frac12}\big( f(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) \big) \big\|_2 + \big\| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}) \big\|_2 + \big\| \Sigma^{-\frac12}\delta \big\|_2 \;\geq\; \big\| \Sigma^{-\frac12}\big( f_{t_\delta}(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) - \delta \big) \big\|_2$$

gives the desired result.

Proposition 4.7:

Let the assumptions of Proposition 4.5 hold. Assume that the local minimizers $x_{\infty,0}$ of (4.6) with $\| \Sigma^{-\frac12}( f(x_{\infty,0}) - f(x_{\mathrm{true}}) ) \|_2 = 0$ form a discrete set $S_{\infty,0}$. Then the set $L_{\infty,0}$, consisting of the limits $\lim_{\delta \to 0} x_{t_\delta,\delta}$ of local minimizers $x_{t_\delta,\delta}$ of (4.3) with $\lim_{\delta \to 0} \| \Sigma^{-\frac12}( f_{t_\delta}(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) - \delta ) \|_2 = 0$, coincides with $S_{\infty,0}$, and there exists a noise level $\delta_{\max}$ such that all minimizers $x_{t_\delta,\delta}$ approximating $S_{\infty,0}$ are isolated for all $\delta \leq \delta_{\max}$.

On the one hand, from Proposition 4.5 we know that there exists a sequence $x_{t_\delta,\delta}$ of minimizers of (4.3) with $\lim_{\delta \to 0} x_{t_\delta,\delta} = x_{\infty,0}$. Then $\| \Sigma^{-\frac12}( f(x_{\infty,0}) - f(x_{\mathrm{true}}) ) \|_2 = 0$ and Corollary 4.6 give $\lim_{\delta \to 0} \| \Sigma^{-\frac12}( f_{t_\delta}(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) - \delta ) \|_2 = 0$, which implies $S_{\infty,0} \subseteq L_{\infty,0}$.

On the other hand, for $x_{t_\delta,\delta}$ with $\lim_{\delta \to 0} \| \Sigma^{-\frac12}( f_{t_\delta}(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) - \delta ) \|_2 = 0$ it holds that
$$\big\| \Sigma^{-\frac12}\big( f_{t_\delta}(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) - \delta \big) \big\|_2 + \big\| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}) \big\|_2 + \big\| \Sigma^{-\frac12}\delta \big\|_2 \;\geq\; \big\| \Sigma^{-\frac12}\big( f(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) \big) \big\|_2,$$

which implies $\lim_{\delta \to 0} \| \Sigma^{-\frac12}( f(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) ) \|_2 = 0$. In particular, this means by continuity of $f(x)$ that the vector $\lim_{\delta \to 0} x_{t_\delta,\delta}$ must be a local minimizer of $\| \Sigma^{-\frac12}( f(x) - f(x_{\mathrm{true}}) ) \|_2$. Thus we have also shown $L_{\infty,0} \subseteq S_{\infty,0}$.

In the following, we number all elements of $S_{\infty,0}$ with the index $k$, i.e. we write $x_{\infty,0}^k$ for $k = 1, \ldots, |S_{\infty,0}|$. Similarly, we number all minimizers $x_{t_\delta,\delta}$ with $\lim_{\delta \to 0} \| \Sigma^{-\frac12}( f_{t_\delta}(x_{t_\delta,\delta}) - f(x_{\mathrm{true}}) - \delta ) \|_2 = 0$ approximating the $x_{\infty,0}^k$'s as $x_{t_\delta,\delta}^k$, i.e. $\lim_{\delta \to 0} x_{t_\delta,\delta}^k = x_{\infty,0}^k$ for $k = 1, \ldots, |S_{\infty,0}|$. Define
$$D_{\min} := \min_{i \neq j} \big\| x_{\infty,0}^i - x_{\infty,0}^j \big\|_2.$$

Since $\lim_{\delta \to 0} x_{t_\delta,\delta}^k = x_{\infty,0}^k$, we can find error levels $\delta_{\max}^k$ such that
$$\big\| x_{t_\delta,\delta}^k - x_{\infty,0}^k \big\|_2 < \tfrac12 D_{\min} \quad\text{for}\quad k = 1, \ldots, |S_{\infty,0}|,$$

which holds for all $0 \leq \delta \leq \delta_{\max}^k$ for each $k$. Then for all $0 \leq \delta \leq \delta_{\max} := \min_k\{\delta_{\max}^k\}$ the $x_{t_\delta,\delta}^k$'s must have pairwise mutual distances greater than zero.

Now Proposition 4.7 gives that the number of local minima $x_{t_\delta,\delta}^k$ remains constant if the noise level $\delta$ is small enough. It also yields that these local minima then form a set of separated continuous curves parametrized in $\delta$.

Finally, we wish to have an estimate of the convergence of the local minima $x_{t_\delta,\delta}^k$ of the truncated and noise-contaminated problem to the local minima $x_{\infty,0}^k$ of the noise-free and untruncated problem, which is useful for practical computations.

Proposition 4.8:

For the noise level $\delta$ small enough, we can bound, for any local minimum $x_{t_\delta,\delta}^k$, the approximation error $\| x_{\infty,0}^k - x_{t_\delta,\delta}^k \|_2$ by a positively weighted linear combination of the residual $\| \Sigma^{-\frac12}( f_{t_\delta}(x_{t_\delta,\delta}^k) - f(x_{\mathrm{true}}) - \delta ) \|_2$, the truncation error $\| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}^k) \|_2$ and the noise estimate $\| \Sigma^{-\frac12}\delta \|_2$.

The first-order necessary conditions for a local minimum of $F_{t_\delta,\delta}(x) = F_{\infty,0}(x) + H_{t_\delta,\delta}(x)$ at $x_{t_\delta,\delta}^k$ and for a local minimum of $F_{\infty,0}(x)$ at $x_{\infty,0}^k$ yield in particular
$$\big\langle \nabla F_{\infty,0}(x_{t_\delta,\delta}^k) + \nabla H_{t_\delta,\delta}(x_{t_\delta,\delta}^k),\; x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\rangle \geq 0 \quad\text{and}\quad \big\langle \nabla F_{\infty,0}(x_{\infty,0}^k),\; x_{t_\delta,\delta}^k - x_{\infty,0}^k \big\rangle \geq 0.$$

Adding the last two inequalities yields
$$\big\langle \nabla H_{t_\delta,\delta}(x_{t_\delta,\delta}^k),\; x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\rangle \geq \big\langle \nabla F_{\infty,0}(x_{\infty,0}^k) - \nabla F_{\infty,0}(x_{t_\delta,\delta}^k),\; x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\rangle. \tag{4.7}$$

Since $\nabla F_{\infty,0}(x)$ is totally differentiable at $x_{\infty,0}^k$, we obtain
$$\nabla F_{\infty,0}(x_{t_\delta,\delta}^k) = \nabla F_{\infty,0}(x_{\infty,0}^k) + \mathrm{Hess}\,F_{\infty,0}(x_{\infty,0}^k)\,\big( x_{t_\delta,\delta}^k - x_{\infty,0}^k \big) + w_{\infty,0}\big(x_{t_\delta,\delta}^k, x_{\infty,0}^k\big),$$

where $w_{\infty,0}(x, x_{\infty,0}^k)$ fulfils
$$\big\| w_{\infty,0}(x, x_{\infty,0}^k) \big\|_2 \leq \big\| x - x_{\infty,0}^k \big\|_2\; \epsilon_{\infty,0}(x, x_{\infty,0}^k) \quad\text{with}\quad \lim_{x \to x_{\infty,0}^k} \epsilon_{\infty,0}(x, x_{\infty,0}^k) = 0.$$

Since $\mathrm{Hess}\,F_{\infty,0}(x_{\infty,0}^k)$ is positive definite, the expression $\langle x,\, \mathrm{Hess}\,F_{\infty,0}(x_{\infty,0}^k)\, x \rangle^{\frac12}$ defines a norm on $\mathbb{R}^D$. Because of the equivalence of all norms on $\mathbb{R}^D$, there exists a constant $C_{\infty,0}^k > 0$ with
$$\big\langle x,\, \mathrm{Hess}\,F_{\infty,0}(x_{\infty,0}^k)\, x \big\rangle^{\frac12} \geq C_{\infty,0}^k\, \| x \|_2 \quad\text{for all}\quad x \in \mathbb{R}^D.$$

Since $\lim_{\delta \to 0} x_{t_\delta,\delta}^k = x_{\infty,0}^k$, we can find a noise level $\rho_{\max}^k$ such that $|\epsilon_{\infty,0}(x_{t_\delta,\delta}^k, x_{\infty,0}^k)| \leq d_{\infty,0}^k$ for all $\delta \leq \rho_{\max}^k$, where $d_{\infty,0}^k$ is a constant with $0 \leq d_{\infty,0}^k < (C_{\infty,0}^k)^2$. Then, using
$$\big\langle w_{\infty,0}(x_{t_\delta,\delta}^k, x_{\infty,0}^k),\; x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\rangle \leq \epsilon_{\infty,0}(x_{t_\delta,\delta}^k, x_{\infty,0}^k)\, \big\| x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\|_2^2$$

and (4.7), we can estimate
$$\big\| \nabla H_{t_\delta,\delta}(x_{t_\delta,\delta}^k) \big\|_2\, \big\| x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\|_2 \;\geq\; \big\langle \nabla F_{\infty,0}(x_{\infty,0}^k) - \nabla F_{\infty,0}(x_{t_\delta,\delta}^k),\; x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\rangle = \big\langle \mathrm{Hess}\,F_{\infty,0}(x_{\infty,0}^k)\big( x_{\infty,0}^k - x_{t_\delta,\delta}^k \big) - w_{\infty,0}(x_{t_\delta,\delta}^k, x_{\infty,0}^k),\; x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\rangle \;\geq\; \Big( (C_{\infty,0}^k)^2 - \epsilon_{\infty,0}(x_{t_\delta,\delta}^k, x_{\infty,0}^k) \Big) \big\| x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\|_2^2,$$
i.e. this gives
$$\big\| x_{\infty,0}^k - x_{t_\delta,\delta}^k \big\|_2 \leq \Big( (C_{\infty,0}^k)^2 - d_{\infty,0}^k \Big)^{-1} \big\| \nabla H_{t_\delta,\delta}(x_{t_\delta,\delta}^k) \big\|_2 \tag{4.8}$$

for all $\delta \leq \rho_{\max}^k$.

We have
$$\nabla H_{t_\delta,\delta}(x) = 2\Big( \mathrm{Jac}_{g_{t_\delta}}^T(x)\, \Sigma^{-1} g_{t_\delta}(x) - \mathrm{Jac}_{g_{t_\delta}}^T(x)\, \Sigma^{-1}\big( f(x) - f(x_{\mathrm{true}}) - \delta \big) - \mathrm{Jac}_{f}^T(x)\, \Sigma^{-1} g_{t_\delta}(x) - \mathrm{Jac}_{f}^T(x)\, \Sigma^{-1}\delta \Big),$$

i.e. we can find constants $K_1^k$ and $K_2^k$ such that
$$\big\| \nabla H_{t_\delta,\delta}(x_{t_\delta,\delta}^k) \big\|_2 \leq 2\Big( K_1^k\, \big\| \mathrm{Jac}_{g_{t_\delta}}^T(x_{t_\delta,\delta}^k)\, \Sigma^{-\frac12} \big\|\, \big\| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}^k) \big\|_2 + K_1^k\, \big\| \mathrm{Jac}_{g_{t_\delta}}^T(x_{t_\delta,\delta}^k)\, \Sigma^{-\frac12} \big\| \Big( \big\| \Sigma^{-\frac12}\big( f_{t_\delta}(x_{t_\delta,\delta}^k) - f(x_{\mathrm{true}}) - \delta \big) \big\|_2 + \big\| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}^k) \big\|_2 \Big) + K_2^k\, \big\| \mathrm{Jac}_{f}^T(x_{t_\delta,\delta}^k)\, \Sigma^{-\frac12} \big\|\, \big\| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}^k) \big\|_2 + K_2^k\, \big\| \mathrm{Jac}_{f}^T(x_{t_\delta,\delta}^k)\, \Sigma^{-\frac12} \big\|\, \big\| \Sigma^{-\frac12}\delta \big\|_2 \Big),$$

which gives the result.

Corollary 4.9:

Let the truncation indices $t_\delta$ depend on the vector of independent Gaussian random variables $\delta$ with $\lim_{\delta \to 0} \mathbb{E}\big( \| \Sigma^{-\frac12}\delta \|_2^2 \big) = 0$ such that $\lim_{\delta \to 0} \mathbb{E}\big( \| \Sigma^{-\frac12} g_{t_\delta}(x) \|_2^2 \big) = 0$ holds for all arbitrary but fixed $x \in \Omega$. Then for all minimizers $x_{\infty,0}^k$ of (4.6) with $\| \Sigma^{-\frac12}( f(x_{\infty,0}^k) - f(x_{\mathrm{true}}) ) \|_2 = 0$ and for $\delta$ sufficiently small, there exist minimizers $x_{t_\delta,\delta}^k$ of (4.3) with
$$\lim_{\delta \to 0} \mathbb{E}\big( \| x_{\infty,0}^k - x_{t_\delta,\delta}^k \|_2 \big) = 0.$$

Proposition 4.5 establishes the existence of the $x_{t_\delta,\delta}^k$'s. Corollary 4.6 gives
$$\lim_{\delta \to 0} \mathbb{E}\big( \| \Sigma^{-\frac12}\big( f_{t_\delta}(x_{t_\delta,\delta}^k) - f(x_{\mathrm{true}}) - \delta \big) \|_2 \big) = 0.$$

Proposition 4.8 gives that for $\delta$ sufficiently small there exists a constant $K_{\infty,0}^k$ with
$$\mathbb{E}\big( \| x_{\infty,0}^k - x_{t_\delta,\delta}^k \|_2 \big) \leq K_{\infty,0}^k\, \mathbb{E}\big( \| \nabla H_{t_\delta,\delta}(x_{t_\delta,\delta}^k) \|_2 \big).$$

Set $S_1^k := \sup_{x \in \Omega} \| \mathrm{Jac}_{g_{t_\delta}}^T(x)\, \Sigma^{-\frac12} \| < \infty$ and $S_2^k := \sup_{x \in \Omega} \| \mathrm{Jac}_{f}^T(x)\, \Sigma^{-\frac12} \| < \infty$. Then the estimate for $\| \nabla H_{t_\delta,\delta}(x_{t_\delta,\delta}^k) \|_2$ in the proof of Proposition 4.8 gives
$$\mathbb{E}\big( \| \nabla H_{t_\delta,\delta}(x_{t_\delta,\delta}^k) \|_2 \big) \leq 2\Big( K_1^k S_1^k\, \mathbb{E}\big( \| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}^k) \|_2 \big) + K_1^k S_1^k\Big( \mathbb{E}\big( \| \Sigma^{-\frac12}\big( f_{t_\delta}(x_{t_\delta,\delta}^k) - f(x_{\mathrm{true}}) - \delta \big) \|_2 \big) + \mathbb{E}\big( \| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}^k) \|_2 \big) \Big) + K_2^k S_2^k\, \mathbb{E}\big( \| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}^k) \|_2 \big) + K_2^k S_2^k\, \mathbb{E}\big( \| \Sigma^{-\frac12}\delta \|_2 \big) \Big),$$
which proves the claim, since Assumption 4.1 gives $\mathbb{E}\big( \| \Sigma^{-\frac12}\delta \|_2 \big) \leq \sqrt{N}\,\delta$.

The strategy of our retrieval algorithm is to start with an initial guess $t_{\mathrm{start}}$ for the truncation index and to try to find all local minima $x_{t_{\mathrm{start}},\delta}^k$. Then the truncation index is gradually increased, and starting from $x_{t_{\mathrm{start}},\delta}^k$ the continuation method is applied to finally find the local minima $x_{t_\delta,\delta}^k$. Motivated by Proposition 4.8 and Corollary 4.9, only those local minima are considered as possible approximations to our sought-after refractive index for which the residual $\| \Sigma^{-\frac12}( f_{t_\delta}(x_{t_\delta,\delta}^k) - f(x_{\mathrm{true}}) - \delta ) \|_2$ and an estimate of the truncation error $\| \Sigma^{-\frac12} g_{t_\delta}(x_{t_\delta,\delta}^k) \|_2$ are both reasonably small. The latter also serves as a stopping criterion for the continuation method.

The initial guess $t_{\mathrm{start}}$ has to be selected with care. On the one hand, if it is too small, the model is too inaccurate and the retrieval of the sought-after local minima cannot be guaranteed. On the other hand, if it is too big, computational effort is wasted, since too many Mie coefficient functions with almost vanishing magnitudes, which thus essentially do not change the local minima, are computed.
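The following Python fragment sketches this strategy under stated assumptions: local_minimize(x, t) returns a local minimizer of $F_{t,\delta}$ started at x, trunc_estimate(x, t) approximates the truncation error $\| \Sigma^{-1/2} g_t(x) \|_2$ and residual(x, t) the residual norm; none of these helpers is defined in the paper, they are placeholders.

```python
def retrieve_with_increasing_truncation(x_start, t_start, local_minimize,
                                        trunc_estimate, residual,
                                        tol_trunc, tol_res,
                                        step=0.1, t_max=200.0):
    """Sketch of the retrieval strategy described above."""
    t, x = t_start, x_start
    x = local_minimize(x, t)
    # increase the truncation index until the estimated truncation error is small
    while trunc_estimate(x, t) > tol_trunc and t < t_max:
        t += step
        x = local_minimize(x, t)   # continuation step from the previous minimizer
    # accept the minimum only if both residual and truncation error are small
    accepted = residual(x, t) <= tol_res and trunc_estimate(x, t) <= tol_trunc
    return x, t, accepted
```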

Remark 4.10:

We did not explicitly verify the assumption that the local minima of (4.3) are strict, as demanded in Proposition 4.5. However, we could always determine a set of strict local minima for any truncation index in our simulations.

5. The reconstruction algorithm

We now return to our regression problem (1.3). For $i = 1, \ldots, N$ we see that the measured extinction normalized by the number of particles $n_i$ with radius $r_i$, i.e. the quantity $e_i/n_i$, is Gaussian-distributed with mean $\frac{1}{n_i} q_{\mathrm{true}}(r_i,\lambda)$ and standard deviation $\sigma_i := s_i/n_i$. In the following we fix a wavelength $\lambda$, i.e. we reconstruct the sought-after particle refractive index $m_{\mathrm{part}}(\lambda)$ wavelength by wavelength. The unit both for particle radii and light wavelengths is μm.

We make use of the function $q^{N_{\mathrm{tr}}}:\mathbb{R}^2 \to \mathbb{R}^N$ defined by
$$q^{N_{\mathrm{tr}}}(x) := \left( \pi r_1^2 \sum_{n=1}^{N_{\mathrm{tr}}} q_n(x, r_1, \lambda),\; \ldots,\; \pi r_N^2 \sum_{n=1}^{N_{\mathrm{tr}}} q_n(x, r_N, \lambda) \right)^T, \tag{5.1}$$

where $R = \{r_1, \ldots, r_N\}$ is the particle radius grid and $N_{\mathrm{tr}}$ the truncation index to be used. We allow non-integer truncation indices $N_{\mathrm{tr}}$ as well, where the non-integer truncation is done as in Proposition 4.3. Here the expression $q_n(x, r_k, \lambda)$ is short notation for $q_n\big(m_{\mathrm{med}}(\lambda), (x)_1 + (x)_2\, i, r_k, \lambda\big)$ from Section 1, where the sought-after refractive index $m_{\mathrm{part}}(\lambda)$ is identified with the vector $x$, i.e. $m_{\mathrm{part}}(\lambda) = (x)_1 + (x)_2\, i$. Its computation follows Section 2.

In the following, the refractive index search area is given by the rectangle $\Omega := [0,20] \times [0,40]$, which means that we only consider refractive indices of particle materials whose real parts lie in the interval [0, 20] and whose imaginary parts lie in the interval [0, 40]. This rather large search area makes the algorithm suitable for a wide range of aerosol materials.

In the first loop from lines 14–25, a search for local minima of the fit function $F(x)$ defined in line 16 is performed for the truncation index $N_{\mathrm{tr}} = 3$. The loop runs through all grid points $(c_i, d_j)^T$ of the search grid defined in lines 5 and 6. If the Hessian of $F(x)$ at some grid point $(c_i, d_j)^T$ is positive definite, this point might lie in the vicinity of a local minimum. The Hessian is computed exactly, where the second partial derivatives of the Mie extinction efficiency with respect to the real and imaginary part of the refractive index of the scattering material are computed using the product rule approach from Section 3. In this case we use $(c_i, d_j)^T$ as a start point for a local solver. In line 20, we only accept a new local minimum if it is sufficiently different from the local minima already found. It is then stored in the container $S_{\mathrm{start}}$. This simple global search strategy can find all local minima if the search grid is fine enough.
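A condensed Python sketch of this first loop, under the assumption that F, grad_F and hess_F evaluate the fit function and its exact first and second derivatives from Section 3 (the function names and the distance tolerance are illustrative, not the paper's implementation):

```python
import numpy as np
from scipy.optimize import minimize

def global_search(F, grad_F, hess_F, c_grid, d_grid, tol_dist=1e-3):
    """Grid search for local minima of the fit function F."""
    minima = []
    for c in c_grid:
        for d in d_grid:
            x0 = np.array([c, d])
            H = hess_F(x0)
            # a positive definite Hessian indicates the vicinity of a local minimum
            if np.all(np.linalg.eigvalsh(H) > 0):
                res = minimize(F, x0, jac=grad_F, method="BFGS")
                if res.success and all(np.linalg.norm(res.x - m) > tol_dist
                                       for m in minima):
                    minima.append(res.x)   # accept only sufficiently new minima
    return minima
```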

Remark 5.1:

Of course, other well-established global search heuristics can be applied here as well. In test runs, we compared genetic algorithms with our sequential search strategy on the two-dimensional refractive index search area, but their computational effort and reliability were the same or even worse. In [Citation7], the technique of simulated annealing was used to retrieve aerosol refractive indices, which could be a promising alternative here. In our study, the measurement noise was so high that a unique global minimum of our fit function could not be determined. Instead, our focus lay on effectively finding all local minima with small values of the fit function, and we regarded them all as possible approximations to the sought-after refractive index.

The second loop from lines 29–49 uses the local minima found in the first loop as start points for the continuation method following Proposition 4.3 and Corollary 4.4. We found that a step width of 0.1 is, for our problem, a well-balanced choice between too large step widths rendering the continuation method unstable and too small step widths making it computationally inefficient. With the stopping criterion $D_{\mathrm{rel}} \leq \mathrm{Tol}_{\mathrm{rel}}$ of the while-loop, it is approximately checked whether the magnitude of the remainder term is small enough. Finally, in line 44, it is checked whether the residual is small enough. In our implementation, we did another run of lines 44–48 with $\tau = 5$ and $\tau = 7$, respectively, if none of the reconstructions had a squared residual smaller than $\tau N_r \delta^2$ for the previous $\tau$. This had to be done because the parameter $\tau$ has to be selected carefully in order to estimate the bound on $\mathbb{E}(\| x_{\infty,0}^k - x_{t_\delta,\delta}^k \|_2)$ derived in the proof of Corollary 4.9 correctly.

6. Comparison of the Numerical Continuation Approach with Established Truncation Index Heuristics

As solution of the forward problem, we generated, for a discrete set of wavelengths $\lambda_1, \ldots, \lambda_{N_\lambda}$, unperturbed spectral extinctions normalized by the number of particles of the monodisperse aerosol by computing
$$e_{\mathrm{true}}^{i,j} := \pi r_i^2 \sum_{n=1}^{N_{\mathrm{tr}}} q_n\big(m_{\mathrm{med}}(\lambda_j), m_{\mathrm{part}}(\lambda_j), r_i, \lambda_j\big), \quad\text{for } i = 1, \ldots, N,\; j = 1, \ldots, N_\lambda,$$

with $m_{\mathrm{part}}(\lambda_j)$ taken as the refractive indices of Ag, H2O and CsI. Here we used the truncation index
$$\rho = \frac{2\pi r}{\lambda}, \quad M = \max\big\{ |\rho|,\; |\rho\, m_{\mathrm{part}}(\lambda)|,\; |\rho\, m_{\mathrm{med}}(\lambda)| \big\}, \quad N_{\mathrm{tr}} := \big\lfloor M + 4.05\, M^{1/3} + 2 \big\rfloor \tag{6.1}$$

introduced in [Citation4].

For particle size distribution reconstructions as outlined in [Citation2], we need particle refractive indices for five optical windows, see [Citation8], so the wavelength grid of interest consists of five ranges. These ranges are given by 8 linearly spaced wavelengths from 0.6–0.8 μm, 8 from 1.1–1.3 μm, 8 from 1.6–1.8 μm, 16 from 2.1–2.5 μm and 8 from 3.1–3.3 μm, so we have in total $N_\lambda = 48$ wavelengths.

For each of the 48 wavelengths, we generated noisy spectral extinctions $e$ by adding zero-mean Gaussian noise to $e_{\mathrm{true}}$, i.e.
$$e^{i,j} = e_{\mathrm{true}}^{i,j} + \delta^{i,j} \quad\text{with}\quad \delta^{i,j} \sim \mathcal{N}\big(0, (0.05 \cdot e_{\mathrm{true}}^{i,j})^2\big), \quad i = 1, \ldots, N,\; j = 1, \ldots, N_\lambda.$$

Here the standard deviations were taken to be 5% of the original extinction values. We computed each mean $e_{\mathrm{real}}^{i,j}$ of the noisy spectral extinctions with a sample size of $N_s = 300$.
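The simulated data of this section can be reproduced along the following lines (a Python sketch; the random seed and function names are our own choices, not the paper's code):

```python
import numpy as np

rng = np.random.default_rng(0)

# wavelength grid of the five optical windows (in micrometres), 48 in total
wavelengths = np.concatenate([
    np.linspace(0.6, 0.8, 8), np.linspace(1.1, 1.3, 8),
    np.linspace(1.6, 1.8, 8), np.linspace(2.1, 2.5, 16),
    np.linspace(3.1, 3.3, 8),
])

def simulate_mean_extinctions(e_true, rel_std=0.05, n_samples=300):
    """Add zero-mean Gaussian noise with std = rel_std * e_true and average
    over n_samples realisations, as done for the means e_real in the text."""
    noise = rng.normal(0.0, 1.0, size=(n_samples,) + e_true.shape)
    samples = e_true * (1.0 + rel_std * noise)
    return samples.mean(axis=0)
```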

6.1. Run Time Comparison

In the following, Algorithm 1 is referred to as method 1. On the same simulated spectral extinctions, we let Algorithm 1 run up to line 25, but with the difference that at each evaluation of $q^{N_{\mathrm{tr}}}(x)$ we directly took the truncation index from (6.1). We denote this approach as method 2. We now display the average run times of method 1 and method 2 for 10 sweeps through all 48 wavelengths.

6.1.1. Results for Ag

6.1.2. Results for CsI

6.1.3. Results for H2O

6.2. Maximal relative deviations

For the 10 simulation runs, we list the maximal relative deviations
$$100 \cdot \frac{\big\| \big( n_{\mathrm{part}}^1(\lambda_j), k_{\mathrm{part}}^1(\lambda_j) \big)^T - \big( n_{\mathrm{part}}^2(\lambda_j), k_{\mathrm{part}}^2(\lambda_j) \big)^T \big\|_2}{\big\| \big( n_{\mathrm{part}}^1(\lambda_j), k_{\mathrm{part}}^1(\lambda_j) \big)^T \big\|_2}$$

of the method 2 reconstructions $\big( n_{\mathrm{part}}^2(\lambda_j), k_{\mathrm{part}}^2(\lambda_j) \big)^T$ from the method 1 reconstructions $\big( n_{\mathrm{part}}^1(\lambda_j), k_{\mathrm{part}}^1(\lambda_j) \big)^T$ for $j = 1, \ldots, 48$. At each wavelength, multiple local minima can be detected by both methods. For the relative deviations, we always selected the local minima forming the smoothest reconstructions on each optical window in the sense of Section 8.

6.2.1. Results for Ag

6.2.2. Results for CsI

6.2.3. Results for H2O

6.3. Conclusion

For Ag, the average total run time over all 48 wavelengths for method 1 was 44.0167% less than for method 2, for H2O 43.9808% and for CsI 44.7322%, i.e. method 1 is almost two times faster than method 2. The results are of the same quality, since their relative deviations are just small fractions of a percent.

The continuation method approach thus saves significant run time while delivering results of the same quality, compared to using the truncation index (6.1) throughout.

7. Nonlinear Tikhonov regularization

So far we have solved the regression problem (4.3) without any regularization, thus the obtained refractive index reconstructions might still be too error-contaminated to be of practical use. A widely used regularization strategy for nonlinear regression problems is Tikhonov regularization, which yields the regularized regression problem
$$x_\gamma := \arg\min_{x \in \mathbb{R}^D} \big\| \Sigma^{-\frac12}\big( f_t(x) - f(x_{\mathrm{true}}) - \delta \big) \big\|_2^2 + \gamma\, \| x - x_* \|_2^2 \quad\text{s.t.}\quad x \in \Omega \tag{7.1}$$

when we apply it to (4.3), cf. [Citation9]. Here $\gamma$ is a regularization parameter and $x_*$ is an estimate of the sought-after true solution. In many cases, the unregularized problem has a whole set of minimizers, thus the vector $x_*$ also acts as a selection criterion. Now if a reasonable $x_*$ is found, the regularization parameter $\gamma$ can be determined with the discrepancy principle, i.e. $\gamma$ is computed such that
$$\big\| \Sigma^{-\frac12}\big( f_t(x_\gamma) - f(x_{\mathrm{true}}) - \delta \big) \big\|_2 = R(\delta),$$

where $R(\delta)$ is an estimate of the residual of the 'true' solution, which depends on the noise level $\delta$. For this task, monotonicity of the residual of $x_\gamma$ in $\gamma$ is established in [Citation9].

The problem of finding a good estimate $x_*$ still remains. In [Citation10], an alternative implementable parameter choice strategy without the need for an $x_*$ is derived. Applied to our problem, it gives
$$\gamma\, \Big\langle \Sigma^{-\frac12}\big( f_t(x_\gamma) - f(x_{\mathrm{true}}) - \delta \big),\; J_\gamma^{-1}\, \Sigma^{-\frac12}\big( f_t(x_\gamma) - f(x_{\mathrm{true}}) - \delta \big) \Big\rangle = R(\delta), \quad\text{with}\quad J_\gamma := \gamma I + \Sigma^{-\frac12}\, \mathrm{Jac}_{f_t}(x_\gamma)\, \mathrm{Jac}_{f_t}(x_\gamma)^T\, \Sigma^{-\frac12}.$$

This method has the drawback that the matrix Jγ needs to be inverted, which may lead to instabilities.

Nevertheless, the quality of the regularized solutions still depends strongly on the start values for solving (7.1). We know about our sought-after refractive indices that they form smooth curves on each of the five optical windows. The complex refractive index curves of most materials can be described using the so-called Lorentz oscillator model, cf. [Citation11]. Here points with larger curvature only occur at so-called resonance frequencies corresponding to some isolated resonance wavelengths. Motivated by these facts, we derive in the following a method to find reasonable start values, out of the results of Algorithm 1, for the Phillips–Twomey regularization which will be outlined in Section 9.

8. Finding the smoothest coupled solutions

We have the problem of identifying the best approximation to the sought-after true particle material refractive index xtrue out of a set of multiple solutions obtained with Algorithm 1. This problem can be solved by coupling the solutions, which means that we combine solutions from neighbouring wavelengths l in each of the five optical windows in order to obtain a unique solution for every optical window. We know about the complex refractive index curves to be retrieved that they are smooth; hence, we expect their sum of the squared second finite differences both in the real and imaginary parts to be small.

Let $\lambda_1, \ldots, \lambda_s$ denote the wavelengths of any of our five wavelength ranges. Let $N_1, \ldots, N_s$ be the numbers of solutions found for these wavelengths. We denote by $x_j^i$ the $j$th solution found for wavelength $\lambda_i$, for $i = 1, \ldots, s$ and $j = 1, \ldots, N_i$. Now we wish to find the smoothest combined solution from all possible combinations $x_{j_1}^1, \ldots, x_{j_s}^s$ for $j_i = 1, \ldots, N_i$, hence we have a total number of $\prod_{i=1}^{s} N_i$ combinations. Here we measure the smoothness of a combination $x_{j_1}^1, \ldots, x_{j_s}^s$ with the sum
$$S := \sum_{i=2}^{s-1} \Big( (x_{j_{i-1}}^{i-1})_1 - 2\, (x_{j_i}^{i})_1 + (x_{j_{i+1}}^{i+1})_1 \Big)^2 + \Big( (x_{j_{i-1}}^{i-1})_2 - 2\, (x_{j_i}^{i})_2 + (x_{j_{i+1}}^{i+1})_2 \Big)^2$$

of its second finite differences, both in the real parts $(x_{j_i}^i)_1$ and in the imaginary parts $(x_{j_i}^i)_2$, which means that we regard a combination as smoother the smaller its sum $S$ is.
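In Python, the smoothness measure S of a candidate combination can be evaluated in a few lines (a sketch; each row of xs holds the real and imaginary part of one reconstruction):

```python
import numpy as np

def smoothness(xs):
    """Sum S of squared second finite differences of the real and imaginary
    parts of a combination x_{j_1}^1, ..., x_{j_s}^s."""
    xs = np.asarray(xs, dtype=float)          # shape (s, 2)
    d2 = xs[:-2] - 2.0 * xs[1:-1] + xs[2:]    # second differences, shape (s-2, 2)
    return float(np.sum(d2 ** 2))
```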

We encounter the problem that the total number of possible combinations $\prod_{i=1}^{s} N_i$ might become too big to iterate through all combinations in the search for the smoothest one in acceptable time. Therefore, we propose a greedy algorithm which uses each second finite difference as a starting point to find a smooth combination.

The main loop spanning lines 5–52 iterates through all positions $z = 2, \ldots, s-1$ of a middle point for a second finite difference, both in the real and imaginary part. At each position $z$, the inner loops beginning in lines 6–8 iterate through all possible second finite differences which can be formed out of the vectors $x_{c_1}^{z-1}$, $x_{c_2}^{z}$ and $x_{c_3}^{z+1}$ for $c_1 = 1, \ldots, N_{z-1}$, $c_2 = 1, \ldots, N_z$ and $c_3 = 1, \ldots, N_{z+1}$, i.e. they loop through all of their middle points and left and right neighbours at position $z$. In lines 9–12, the variable $S_{\mathrm{cur}}$ is initialized with the sum of the squared second finite differences in the real and imaginary parts of the current vectors $x_{c_1}^{z-1}$, $x_{c_2}^{z}$ and $x_{c_3}^{z+1}$, and the positions $z-1$, $z$ and $z+1$ of the array Comb are filled with the current vectors. For $z \geq 3$, the loop in lines 13–28 successively fills the positions $k = z-2, z-3, \ldots, 1$ of the array Comb. At each new position $k$, the minimal sum $D_{\min}$ of the two squared second finite differences in the real and imaginary part is determined in lines 19–25, where the middle and right points are fixed and taken as the leftmost two vectors from the array Comb, and the left point runs through all $x_j^k$ for $j = 1, \ldots, N_k$. After the vector $x_{\min}$ giving the minimal sum $D_{\min}$ out of all the $x_j^k$'s is found, $D_{\min}$ is added to $S_{\mathrm{cur}}$ and $x_{\min}$ is stored in the $k$th entry Comb($k$). In a similar way, the loop in lines 29–44 successively fills the positions $k = z+2, z+3, \ldots, s$ for $z \leq s-2$. This time the left and middle points are fixed and taken as the rightmost two points of the array Comb, whereas the right point iterates through all $x_j^k$ for $j = 1, \ldots, N_k$. Again the vector $x_{\min}$ is the one of the $x_j^k$'s giving the minimal sum $D_{\min}$, and it is stored in Comb($k$). The sum $D_{\min}$ is added to $S_{\mathrm{cur}}$ as well.

In the above procedure, every triple of neighbouring vectors from the results of Algorithm 1 is considered as possibly lying on the sought-after smoothest combination with the smallest sum of all squared second differences. The three vectors are used as start points to find a smooth combination with a greedy strategy, where a vector is only added to the current combination if it gives the smallest sum $D_{\min}$ at the left or right end of the growing set of vectors already added, until the first and last positions are reached.

Finally, from all of the combinations constructed this way, the smoothest one, i.e. the one with the smallest sum $S_{\min}$ out of all $S_{\mathrm{cur}}$'s, is selected in lines 45–48 to be the final output SmoothestCombination.

Define $N_{\mathrm{total}} := \sum_{j=1}^{s} N_j$. Then the total number of operations needed for Algorithm 2 can be estimated by $\mathcal{O}\big( N_{\mathrm{total}} \sum_{j=2}^{s-1} N_{j-1} N_j N_{j+1} \big)$, which is considerably less than the $\mathcal{O}\big( \prod_{j=1}^{s} N_j \big)$ operations needed by the naive method of iterating through all possible combinations.
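The following Python sketch mirrors the greedy strategy of Algorithm 2 in simplified form; it is not the paper's listing, but it grows each seed triple to the left and right exactly as described above (solutions[i] is the list of candidate reconstructions at wavelength i, each a length-2 numpy array):

```python
import numpy as np
from itertools import product

def second_diff_sq(a, b, c):
    # squared second finite difference, summed over real and imaginary part
    d = a - 2.0 * b + c
    return float(np.sum(d ** 2))

def smoothest_combination(solutions):
    """Simplified greedy search over all seed triples (sketch of Algorithm 2)."""
    s = len(solutions)
    best, best_S = None, np.inf
    for z in range(1, s - 1):
        for left, mid, right in product(solutions[z - 1], solutions[z],
                                        solutions[z + 1]):
            comb = [None] * s
            comb[z - 1], comb[z], comb[z + 1] = left, mid, right
            S = second_diff_sq(left, mid, right)
            for k in range(z - 2, -1, -1):            # grow to the left
                costs = [second_diff_sq(x, comb[k + 1], comb[k + 2])
                         for x in solutions[k]]
                j = int(np.argmin(costs))
                comb[k], S = solutions[k][j], S + costs[j]
            for k in range(z + 2, s):                 # grow to the right
                costs = [second_diff_sq(comb[k - 2], comb[k - 1], x)
                         for x in solutions[k]]
                j = int(np.argmin(costs))
                comb[k], S = solutions[k][j], S + costs[j]
            if S < best_S:
                best, best_S = list(comb), S
    return best, best_S
```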

9. Further regularization of coupled solutions

The coupled view on the solutions is beneficial not only for determining the smoothest refractive index curve reconstructions formed from the results of Algorithm 1; it also leads to a further improvement of the results by Twomey regularization. Let us investigate the coupled approach in a probability-theoretic setting. Here we reuse the notation introduced in Section 4, i.e. we let $x^1, \ldots, x^s$ denote a set of solutions for any of the five optical windows. Then the joint posterior probability density of $x^1, \ldots, x^s$ is given by
$$p(x^1, \ldots, x^s \,|\, e^1, \ldots, e^s) = \prod_{j=1}^{s} p(x^j \,|\, e^j) \propto \exp\!\left( -\frac12 \sum_{j=1}^{s} \big\| \Sigma_j^{-\frac12}\big( f_{t_j}(x^j) - e^j \big) \big\|_2^2 \right) \times \prod_{j=1}^{s} \mathbb{I}_\Omega(x^j), \tag{9.1}$$
where $e^j$ is the data vector for the $j$th wavelength $\lambda_j$, having $N$ entries with $N$ being the size of the radius grid. Moreover, $\Sigma_j$ is the scaled covariance matrix for $\lambda_j$ and $f_{t_j}(x)$ is the applied model depending on $\lambda_j$. Note that we initially have differing truncation indices $t_1, \ldots, t_s$. Since the coefficient functions of the truncated model function $f_{t_j}(x)$ decay fast for each $t_j$, it is convenient to change to the same truncation index $t := \max\{t_1, \ldots, t_s\}$ for all wavelengths $\lambda_1, \ldots, \lambda_s$. The errors introduced by doing so are negligible. It is easy to show that maximizing the joint density (9.1) is equivalent to maximizing all single densities $p(x^j \,|\, e^j)$ independently, i.e. a joint MAP estimator
$$\big( x_{\mathrm{opt}}^1, \ldots, x_{\mathrm{opt}}^s \big) \in X_{\mathrm{opt}} := \arg\max_{x^1, \ldots, x^s}\, p(x^1, \ldots, x^s \,|\, e^1, \ldots, e^s)$$

consists of the single MAP estimators
$$x_{\mathrm{opt}}^j \in X_{\mathrm{opt}}^j := \arg\max_{x}\, p(x \,|\, e^j)$$

for $j = 1, \ldots, s$. This means that the results of Algorithm 1 can be used to construct MAP estimators for the joint posterior probability density.

This behaviour changes when we replace the joint prior probability density
$$p_{\mathrm{prior}}(x^1, \ldots, x^s) = \mathrm{vol}(\Omega)^{-s} \prod_{j=1}^{s} \mathbb{I}_\Omega(x^j)$$

with
$$p_{\mathrm{prior}}(x^1, \ldots, x^s) \propto \exp\!\left( -\tfrac12 \gamma\, S(x^1, \ldots, x^s) \right) \prod_{j=1}^{s} \mathbb{I}_\Omega(x^j),$$

where
$$S(x^1, \ldots, x^s) := \sum_{i=2}^{s-1} \Big( (x^{i-1})_1 - 2\, (x^{i})_1 + (x^{i+1})_1 \Big)^2 + \Big( (x^{i-1})_2 - 2\, (x^{i})_2 + (x^{i+1})_2 \Big)^2 + \rho \sum_{i=1}^{s} \Big( (x^{i})_1^2 + (x^{i})_2^2 \Big),$$

where $\gamma$ is a regularization parameter and $\rho$ is a parameter specifying the amount of Tikhonov regularization.

In the new prior distribution, we use a combination of Tikhonov and Phillips–Twomey regularization both in the real and imaginary parts. Here we apply a small amount of Tikhonov regularization by setting $\rho = 10^{-8}$, such that the resulting regularization operator becomes regular. This means that the regularized regression problem (9.2) can be transformed into standard Tikhonov form and that the monotonicity results from [Citation9] are still valid. Each second finite difference is clearly a function of three neighbouring points; therefore, a decoupled computation of the joint MAP estimator
$$\big( x_{\mathrm{opt}}^1, \ldots, x_{\mathrm{opt}}^s \big) \in X_{\mathrm{opt}} := \arg\max_{x^1, \ldots, x^s}\, p(x^1, \ldots, x^s \,|\, e^1, \ldots, e^s) \tag{9.2}$$

with
$$p(x^1, \ldots, x^s \,|\, e^1, \ldots, e^s) \propto \exp\!\left( -\frac12 \sum_{j=1}^{s} \big\| \Sigma_j^{-\frac12}\big( f_{t_j}(x^j) - e^j \big) \big\|_2^2 - \frac12 \gamma\, S(x^1, \ldots, x^s) \right) \times \prod_{j=1}^{s} \mathbb{I}_\Omega(x^j)$$

for each wavelength separately is no longer possible after changing to the new prior density. However, the result vectors $x^1, \ldots, x^s$ from Algorithm 2 form a good start vector for solving the nonlinear regression problem (9.2).
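A compact way to write the coupled, regularized objective that is minimized in (9.2) is sketched below in Python; data_misfit is a placeholder for the sum of weighted squared residuals over the optical window and is not a library routine:

```python
import numpy as np

def coupled_objective(xs_flat, s, data_misfit, gamma, rho=1e-8):
    """Negative log of the coupled posterior (9.2): data misfit plus the
    Phillips-Twomey/Tikhonov penalty gamma * S(x^1, ..., x^s).

    xs_flat stacks the s two-dimensional refractive index vectors;
    data_misfit(xs) is assumed to return
    sum_j ||Sigma_j^{-1/2}(f_{t_j}(x^j) - e^j)||_2^2.
    """
    xs = xs_flat.reshape(s, 2)
    d2 = xs[:-2] - 2.0 * xs[1:-1] + xs[2:]        # second finite differences
    penalty = np.sum(d2 ** 2) + rho * np.sum(xs ** 2)
    return data_misfit(xs) + gamma * penalty
```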

We selected the regularization parameter $\gamma$ using the discrepancy principle, i.e. we compute $\gamma$ such that the regularized solution
$$\big( x_\gamma^1, \ldots, x_\gamma^s \big) \in X_\gamma := \arg\min_{x^1, \ldots, x^s} \sum_{j=1}^{s} \big\| \Sigma_j^{-\frac12}\big( f_{t_j}(x^j) - e^j \big) \big\|_2^2 + \gamma\, S(x^1, \ldots, x^s) \quad\text{s.t.}\quad x^j \in \Omega,\; j = 1, \ldots, s,$$

fulfils a relation of the form
$$\sum_{j=1}^{s} \big\| \Sigma_j^{-\frac12}\big( f_{t_j}(x_\gamma^j) - e^j \big) \big\|_2^2 = R(\delta),$$

where $R(\delta)$ is a proposed residual value depending on the noise level $\delta$. In [Citation2], several different residual values are proposed for a fixed model discretization, and a set of regularization parameters is obtained from those using the discrepancy principle. The pairings of model discretizations and regularization parameters obtained this way are compared by their Bayesian posterior probabilities. For these probabilities, a set of integrals of the different model posterior densities needs to be computed, which can be done with Monte Carlo integration methods. Due to the highly nonlinear behaviour of our model $f_t(x)$, such integration methods are not available here. Therefore, we simplified the posterior exploration in such a way that only one residual value is proposed.

Since each observed probability density $p(e^j \,|\, x^j)$ for $j = 1, \ldots, s$ is Gaussian, the joint observed density $p(e^1, \ldots, e^s \,|\, x^1, \ldots, x^s) = \prod_{j=1}^{s} p(e^j \,|\, x^j)$ is Gaussian as well. We have $x^j \in \mathbb{R}^2$, thus the sum of residuals $\sum_{j=1}^{s} \| \Sigma_j^{-\frac12}( f_{t_j}(x^j) - e^j ) \|_2^2$, running through all wavelengths in the optical window, is $\chi^2(2s)$-distributed. This yields
$$\mathbb{E}\left( \sum_{j=1}^{s} \big\| \Sigma_j^{-\frac12}\big( f_{t_j}(x^j) - e^j \big) \big\|_2^2 \right) = 2s.$$

Now a widely proposed residual value for the discrepancy principle is $\tau \cdot 2s$, where $\tau = 1.1$ is the so-called Morozov safety factor. This choice is prone to under- or over-regularization, since the residual value corresponding to the 'true' solution might be much smaller or bigger than $2\tau s$. Therefore, we propose a residual value which depends more dynamically on the observed behaviour of the residual.

Let $x_0^1, \ldots, x_0^s$ denote the unregularized solutions, i.e. the results of Algorithm 2. Then their squared residual is given by $R_0 := \sum_{j=1}^{s} \| \Sigma_j^{-\frac12}( f_{t_j}(x_0^j) - e^j ) \|_2^2$. We first proposed
$$R(\delta) = \max\big\{ 2\tau_1 s,\; \tau_1 R_0 \big\},$$

where we selected $\tau_1 = 1.1$. This means that the residual of the regularized solution, starting from $R_0$, is at least increased by the factor $\tau_1$, which avoids under-regularization. If it then happens that $\frac{R(\delta)}{R_0} > \theta$ with $\theta = 1.5$, the proposed residual is most likely too big and over-regularization occurs. In this case, we corrected $R(\delta)$ by setting
$$R(\delta) = \max\big\{ 2\tau_2 s,\; \theta R_0 \big\}$$

with τ2=0.9.
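The residual value proposed here can be computed in a few lines (a Python sketch of the rule just described; R0 is assumed to be positive):

```python
def proposed_residual(R0, s, tau1=1.1, tau2=0.9, theta=1.5):
    """Residual value R(delta) for the discrepancy principle: start from
    max(2*tau1*s, tau1*R0) and fall back to max(2*tau2*s, theta*R0) if the
    first proposal would over-regularize."""
    R = max(2.0 * tau1 * s, tau1 * R0)
    if R / R0 > theta:              # proposed residual most likely too large
        R = max(2.0 * tau2 * s, theta * R0)
    return R
```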

10. Numerical results

To see how reliable our proposed reconstruction algorithm is, we performed for each of the scatterer materials Ag, CsI and H2O a numerical study with 100 sweeps through all 48 wavelengths of the five optical windows with the same settings as in Section 6. We found that the radii $r_1 := 0.1\,\mu$m, $r_2 := 0.2\,\mu$m and $r_3 := 0.3\,\mu$m contain the most information about the refractive indices. This was found by keeping our 48 wavelengths fixed and comparing the quality of inversion results under varying aerosol particle radii. Bigger radii did not improve the results in our simulations, and refractive index reconstructions using only bigger radii even turned out to be too unstable. A more thorough treatment of this problem can be found in [Citation7], where a covariance eigenvalue analysis is used. Although not directly comparable with our study of uncoated particles, the coated radii 0.0975 μm, 0.2305 μm and 0.11 μm found in that study to carry the most information content are roughly comparable to our radii.

We computed original spectral extinctions
$$e_{\mathrm{true}}^{i,j} := \pi r_j^2 \sum_{n=1}^{N_{\mathrm{tr}}} q_n\big(m_{\mathrm{med}}(\lambda_i), m_{\mathrm{part}}(\lambda_i), r_j, \lambda_i\big), \quad i = 1, \ldots, 48,\; j = 1, \ldots, 3$$

for Ag, H2O and CsI and added zero-mean Gaussian noise to obtain the simulated noisy spectral extinctions
$$e^{i,j} = e_{\mathrm{true}}^{i,j} + \delta^{i,j} \quad\text{with}\quad \delta^{i,j} \sim \mathcal{N}\big(0, (0.05 \cdot e_{\mathrm{true}}^{i,j})^2\big), \quad i = 1, \ldots, 48,\; j = 1, \ldots, 3.$$

The standard deviations were taken to be 5% of the true spectral extinctions. Real experiments using 500 wavelengths were contaminated by Gaussian noise with standard deviations of 30% of the true spectral extinctions. We expect that switching to 48 wavelengths, and thus increasing the time resolution of the measurements, will lower the standard deviations to a small percentage. We used a sample size of $N_s = 300$ to compute each mean $e_{\mathrm{real}}^{i,j}$ of noisy spectral extinctions.

In the following, the results are presented separately for each of the three materials. The uppermost plot presents the relative errors of the unregularized solutions obtained with Algorithm 2 from the original scatterer refractive indices. The next plot displays the run times of Algorithm 1, which returned all local minima of (4.3). These candidate solutions served as input for Algorithm 2. Then the relative errors of the regularized solutions are presented. Finally, the relative errors of the average of the regularized solutions are shown.

10.1. Results for Ag

10.1.1. Results of Algorithms 1 and 2

10.1.2. Relative errors of the regularized solutions

10.1.3. Relative errors of the average of the regularized solutions

10.2. Results for CsI

10.2.1. Results of Algorithms 1 and 2

10.2.2. Relative errors of the regularized solutions

10.2.3. Relative errors of the average of the regularized solutions

10.3. Results for H2O

10.3.1. Results of Algorithms 1 and 2

10.3.2. Relative errors of the regularized solutions

10.3.3. Relative errors of the average of the regularized solutions

10.4. Conclusion

The severest relative errors can be observed for Ag. For the initial unregularized solutions, they lie between 1 and 5% on average and can go up to ca. 53% in the extreme cases as one can see in the leftmost subplot for the first optical window. The run times of Algorithm 1 lie between 30 and 50 s in the average case and can rise up to 200 s in the extreme cases. A typical sweep through all 48 wavelengths needed ca. 30 minutes in total and this value was very much the same for all three materials. For Ag, the regularization procedure effectively reduced the relative errors such that they are in the range between 0.5 and 2.2% on average and are below 10% in the extreme cases. Finally, one can see in the last plot that the relative errors of the average of all 100 regularized solutions are all below 0.4%.

For CsI, the relative errors of the unregularized solutions are already quite small and lie between 0.03 and 0.065% on average and rise only up to 0.3% in the extreme cases. The run times of Algorithm 1 are typically in the range from 25 to 55 s and are always below 95 s. The regularization of the solutions brought only a small improvement of the results here such that the relative errors did not change much. They are still in the same range from 0.03 to 0.065% on average but only reach up to ca. 0.2% now. The relative errors of the average of the 100 regularized solutions are between 0.01 and 0.055%.

Also for H2O, the relative errors of the unregularized solutions are comparably small and below 0.35% on average and still below 1.3% in the extreme cases. Especially, the rightmost subplot for the last optical window shows the biggest relative errors, whereas for all the other optical windows the relative errors are below 0.03% on average and below 0.15% in the extreme cases. A similar behaviour can be observed for the run times of Algorithm 1. For the first four optical windows, they are between 20 and 45 s on average and below 100 s in the extreme cases, whereas for the last optical window they are between 30 and 170 s on average and can even rise up to 350 s. For H2O, the regularization procedure improves the relative errors only slightly for the first four optical windows and even increases them for the last optical window such that they can rise up to ca. 0.06% on average and 1.4% in the extreme cases. The relative errors of the average of the 100 regularized solutions are virtually zero for the first four optical windows and below 0.55% for the last optical window.

11. Higher noise levels

To see how our proposed reconstruction algorithm behaves for higher noise levels, we performed for each of the scatterer materials Ag, H2O and CsI two numerical studies with 10 sweeps through all 48 wavelengths of the five optical windows with the same settings as in Section 6. We computed original spectral extinctions
$$e_{\mathrm{true}}^{i,j} := \pi r_j^2 \sum_{n=1}^{N_{\mathrm{tr}}} q_n\big(m_{\mathrm{med}}(\lambda_i), m_{\mathrm{part}}(\lambda_i), r_j, \lambda_i\big), \quad i = 1, \ldots, 48,\; j = 1, \ldots, 3$$

for all wavelengths and added zero-mean Gaussian noise to obtain the simulated noisy spectral extinctions
$$e^{i,j} = e_{\mathrm{true}}^{i,j} + \delta^{i,j} \quad\text{with}\quad \delta^{i,j} \sim \mathcal{N}\big(0, (0.15 \cdot e_{\mathrm{true}}^{i,j})^2\big), \quad i = 1, \ldots, 48,\; j = 1, \ldots, 3,$$

for the first study and
$$e^{i,j} = e_{\mathrm{true}}^{i,j} + \delta^{i,j} \quad\text{with}\quad \delta^{i,j} \sim \mathcal{N}\big(0, (0.3 \cdot e_{\mathrm{true}}^{i,j})^2\big), \quad i = 1, \ldots, 48,\; j = 1, \ldots, 3,$$

for the second. The standard deviations were taken to be 15% and 30%, respectively, of the true spectral extinctions. We used a sample size of $N_s = 300$ to compute each mean $e_{\mathrm{real}}^{i,j}$ of noisy spectral extinctions.

For brevity, we only present the relative errors of the average of the 10 regularized solutions.

11.1. Results for Ag

11.1.1. Standard deviation of 15%

11.1.2. Standard deviation of 30%

11.2. Results for CsI

11.2.1. Standard deviation of 15%

11.2.2. Standard deviation of 30%

11.3. Results for H2O

11.3.1. Standard deviation of 15%

11.3.2. Standard deviation of 30%

11.4. Conclusion

Whereas the relative errors for CsI and H2O are still below 1%, they can rise up to ca. 53% for Ag. Therefore, the reconstructed refractive indices for Ag under this noise level are most likely not of practical use. This shows that the FASP measurements of monodisperse aerosols must be sufficiently accurate in order to retrieve the scatterer refractive indices from them.

12. Numerical study

We performed four numerical studies for two-component aerosols with log-normal, RRSB and Hedrih model size distributions as outlined in [Citation2]. The aerosol particles were assumed to be homogeneously internally mixed, such that only one effective refractive index was retrieved. One component of the simulated aerosols was H2O with volume fractions of 0, 11, 22, 33, 44, 56, 67, 78, 89 and 100%. In the first two studies, we simulated mixtures of H2O and CsI, where we used the original aerosol component refractive indices for the first study. For the second study, we used the average of the 100 regularized solutions from Section 10. We did the same for the third and fourth studies, but here we simulated mixtures of H2O and Ag. In the third study, we utilized the original aerosol component refractive indices and for the fourth the average of the 100 regularized solutions from Section 10.

We applied the same reconstruction methods described in [Citation2] under the same settings, i.e. for each reconstruction we generated 300 artificial noisy measurements for all 48 wavelengths, where the measurement error was simulated as additive zero-mean Gaussian noise. For each wavelength, the standard deviations were taken as 5% of the solutions of the forward problem. In [Citation2], three different regularization methods, namely Tikhonov, minimal first differences and Twomey regularization, were compared and their results turned out to be very similar. Therefore, we only used Tikhonov regularization in the following. The results for the first study were directly adopted from [Citation2].

For every inversion, we computed the L2-error of the obtained reconstruction relative to the original size distribution and measured the total run time needed for the inversion. The computations were performed on a notebook with a 2.27 GHz CPU and 3.87 GB accessible primary memory.

13. Results for mixtures of H2O and CsI

13.1. Noise-free refractive indices

13.2. Noisy refractive indices

13.3. Results for mixtures of H2O and Ag

13.3.1. Noise-free refractive indices

13.3.2. Noisy refractive indices

13.4. Conclusion

The results of the first and second studies differ by ca. 4% at most and behave very similarly. The same holds for the third and fourth studies. These numerical results indicate that 100 FASP measurement sweeps consisting of 300 single measurements with an accuracy as in Section 10 are sufficient to determine aerosol refractive indices in such a quality that they are suitable for particle size distribution reconstructions for two-component, homogeneously internally mixed aerosols using the FASP. The particle radii of the three monodisperse aerosols generated for the refractive index retrieval need to be 0.1 μm, 0.2 μm and 0.3 μm, respectively.

14. Outlook

It is of interest to investigate if the methods derived in this study can be extended to the case of core-plus-shell aerosols.

Acknowledgements

I thank Graham Alldredge, Ph.D. for proofreading the manuscript, useful recommendations and fruitful discussions on this topic.

Additional information

Funding

This work is sponsored by the German Federal Ministry of Education and Research (BMBF) [contract number 02NUK022A0]. Responsibility for the content of this report lies with the authors.

Notes

No potential conflict of interest was reported by the author.

References

  • Riziq AA, Erlick C, Dinar E, et al. Optical properties of absorbing and non-absorbing aerosols retrieved by cavity ring down (CRD) spectroscopy. Atmos Chem Phys. 2007;7:1523–1536.
  • Alldredge G, Kyrion T. Robust inversion methods for aerosol spectroscopy. Inverse Probl Sci Eng. 2016.
  • Fu Q, Sun W. Mie theory for light scattering by a spherical particle in an absorbing medium. Appl Opt. 2001;40(9):1354–1361.
  • Wiscombe W. Improved Mie scattering algorithms. Appl Opt. 1980;19(9):1505–1509.
  • Abramowitz M, Stegun IA. Handbook of mathematical functions with formulas, graphs, and mathematical tables. Washington (DC): Dover; 1972.
  • Deuflhard P, Hohmann A. Numerical mathematics I: an algorithmically oriented introduction. Berlin: de Gruyter; 2002.
  • Erlick C, Haspel M, Rudich Y. Simultaneous retrieval of the complex refractive indices of the core and shell of coated aerosol particles from extinction measurements using simulated annealing. Appl Opt. 2011;50(22):4393–4402.
  • Goody RM, Yung YL. Atmospheric radiation. Theoretical basis. 2nd ed. New York (NY): Oxford University Press; 1989.
  • Engl HW, Kunisch K, Neubauer A. Convergence rates for Tikhonov regularisation of non-linear ill-posed problems. Inverse Probl. 1989;5:523–540.
  • Scherzer O, Engl HW, Kunisch K. Optimal a posteriori parameter choice for Tikhonov regularization for solving nonlinear ill-posed problems. SIAM J Numer Anal. 1993;6:1796–1838.
  • Quinten M. Optical properties of nanoparticle systems. Weinheim: Wiley-VCH Verlag; 2010.
