Full article: Solving a Cauchy problem for the heat equation using cubic smoothing splines

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

The Cauchy problem for the heat equation is a model of situation where one seeks to compute the temperature, or heat-flux, at the surface of a body by using interior measurements. The problem is well-known to be ill-posed, in the sense that measurement errors can be magnified and destroy the solution, and thus regularization is needed. In previous work it has been found that a method based on approximating the time derivative by a Fourier series works well [Berntsson F. A spectral method for solving the sideways heat equation. Inverse Probl. 1999;15:891–906; Eldén L, Berntsson F, Regińska T. Wavelet and Fourier methods for solving the sideways heat equation. SIAM J Sci Comput. 2000;21(6):2187–2205]. However, in our situation it is not resonable to assume that the temperature is periodic which means that additional techniques are needed to reduce the errors introduced by implicitly making the assumption that the solution is periodic in time. Thus, as an alternative approach, we instead approximate the time derivative by using a cubic smoothing spline. This means avoiding a periodicity assumption which leads to slightly smaller errors at the end points of the measurement interval. The spline method is also shown to satisfy similar stability estimates as the Fourier series method. Numerical simulations shows that both methods work well, and provide comparable accuracy, and also that the spline method gives slightly better results at the ends of the measurement interval.

2010 Mathematics Subject Classifications:

1. Introduction

In many industrial applications one wishes to determine the temperature, or heat-flux, on the surface of a body. Often it is the case that the surface itself inaccessible for measurements [Citation1–7]. In such cases one can instead measure the temperature at a location in the interior of the body and compute the surface temperature by solving an ill-posed boundary value problem for the heat equation.

In a one-dimensional setting a mathematical model of the above situation is the following: Determine the temperature $u (x, t)$ , satisfying the heat equation (1) $(k u_{x})_{x} = ρ c_{p} u_{t}, 0 < x < a, t \geq 0,$ (1) where k is the thermal conductivity, ρ is the density, and $c_{p}$ is the specific heat capacity of the material, with the Cauchy data $[u, u_{x}] = [g, h]$ is given along the line x = a. In addition we speficy the initial data $u (x, 0) = 0$ , for $0 \leq x \leq a$ .

Of course, since g and h are measured, there exists measurement errors, and we would actually have functions $g_{m}, h_{m} \in L^{2}$ , for which (2) $‖ g_{m} - g ‖_{L^{2}} \leq ϵ, and ‖ h_{m} - h ‖_{L^{2}} \leq ϵ,$ (2) where $ϵ > 0$ represents a bound on the measurement error.

Although in this paper we mostly discuss the heat equation in its simplest form, $κ u_{x x} = u_{t}$ , our interest is in numerical methods that can be used for more general problems, e.g. non-linear equations (3) $(κ (u) u_{x})_{x} = α (u) u_{t},$ (3) which occur in applications since the thermal properties of most materials are dependent on the temperature. For such problems one cannot use methods based on reformulating the problem as a linear operator equation [Citation4,Citation8,Citation9]. Instead, we solve the problem, essentially, as an initial value problem in the space variable [Citation10–12]. This approach works very well if the time-derivative is approximated by a bounded, discrete operator [Citation2].

In this paper, we study two different methods for approximating the time derivative and their numerical implementation. First, the time derivative is approximated by a matrix representing differentiation of a trigonometric interpolant. This is a method that has proved to work very well in practice, see [Citation7,Citation13–15], but that has the drawback that by using the Fourier transform we implicitly assume that the solution is periodic in the time variable. This is generally not the case and thus the method is affected by additional errors at the ends of the time interval. Implicit assumptions of periodicity also occurs in other popular methods, such as those based on Meyer wavelets [Citation16,Citation17].

In order to avoid problems with periodicity we instead use a cubic splines to construct a discrete approximation of the time derivative. In previous work cubic splines have been shown to work well for this purpose, see [Citation18]. In our work we use cubic smoothing splines [Citation19], which includes a parameter λ that controls the smoothness of the result, and construct a matrix approximation of the time derivative. We also derive stability estimates that suggest that the spline method has similar regularizing properties as the Fourier series approach. In addition numerical experiments shows that the method works well.

2. The Cauchy problem for the heat equation

In this work we are concerned with the following non-standard boundary value problem for the heat-equation: Find the temperature $u (x, t)$ , for $0 \leq x \leq a$ , such that (4) ${\begin{cases} (κ u_{x})_{x} = u_{t}, & 0 \leq x \leq a, t \geq 0, \\ u (x, 0) = 0, & 0 \leq x \leq a, \\ u (a, t) = g (t), & t \geq 0, \\ u_{x} (a, t) = h (t) & t \geq 0, \end{cases}$ (4) where the function κ represents the material properties, and $g (t)$ , $h (t)$ are the data.

The problem (Equation4(4) ${\begin{cases} (κ u_{x})_{x} = u_{t}, & 0 \leq x \leq a, t \geq 0, \\ u (x, 0) = 0, & 0 \leq x \leq a, \\ u (a, t) = g (t), & t \geq 0, \\ u_{x} (a, t) = h (t) & t \geq 0, \end{cases}$ (4) ) is ill–posed in the sense that the solution, if it exists, does not depend continuously on the data. In the case of constant material properties, i.e. κ is a constant, this can be seen by solving the problem in the Fourier domain. In order to simplify the analysis we define all functions to be zero for t<0. Let $\hat{u} (x, ξ) = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} u (x, t) e^{- i ξ t} d t,$ be the Fourier transform of the solution. The heat equation takes the form (5) $κ {\hat{u}}_{x x} (x, ξ) = i ξ \hat{u} (x, ξ), 0 < x < a, ξ \in R,$ (5) and it can be verified that the solution is given by (6) $\hat{u} (x, ξ) = \cosh (σ (ξ) (a - x)) \hat{g} (ξ) - \frac{\sinh (σ (ξ) (a - x))}{σ (ξ)} \hat{h} (ξ),$ (6) where $σ (ξ) = \sqrt{i ξ / κ}$ , and $\sqrt{i ξ}$ denotes the principal value of the square root. Since the real part of $\sqrt{i ξ}$ is positive, and the solution $\hat{u} (x, ξ)$ , is assumed to be in $L^{2}$ , we see that both the data functions $[\hat{g}, \hat{h}]$ , must decay rapidly as $| ξ | \to \infty$ . Also, small perturbations in $\hat{g}$ and $\hat{h}$ , for high frequencies, may blow-up and drastically change the solution. This behavior is typical for ill-posed problems [Citation20].

2.1. Stabilization by discretizing the time variable

In this section we stabilize the heat conduction problem by discretizing the time-variable, see [Citation2,Citation13]. By replacing the time derivative $\partial_{t}$ by a bounded operator, e.g. a matrix, we effectively regularize the problem. For this purpose we rewrite (Equation4(4) ${\begin{cases} (κ u_{x})_{x} = u_{t}, & 0 \leq x \leq a, t \geq 0, \\ u (x, 0) = 0, & 0 \leq x \leq a, \\ u (a, t) = g (t), & t \geq 0, \\ u_{x} (a, t) = h (t) & t \geq 0, \end{cases}$ (4) ) as an initial-value problem (7) ${(\begin{matrix} u \\ κ u_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ \frac{\partial}{\partial t} & 0 \end{array}) (\begin{matrix} u \\ a u_{x} \end{matrix}), 0 \leq x \leq a,$ (7) with initial-boundary values (8) $u (a, t) = g_{m} (t), u_{x} (a, t) = h_{m} (t), for 0 \leq t \leq b,$ (8) and (9) $u (x, 0) = 0, for 0 \leq x \leq a .$ (9) We discretize (Equation7(7) ${(\begin{matrix} u \\ κ u_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ \frac{\partial}{\partial t} & 0 \end{array}) (\begin{matrix} u \\ a u_{x} \end{matrix}), 0 \leq x \leq a,$ (7) ), on a uniform grid $Δ = {t_{j}}_{j = 1}^{n}$ , with $0 = t_{1} < \dots < t_{n} = b$ . For simplicity we also introduce a sampling operator such that $g_{Δ} = (g (t))_{Δ} = (g (t_{1}), \dots, g (t_{n}))^{T},$ i.e. the function is sampled on the time grid. By introducing semi-discrete representations of the solution and its derivative, i.e. $U (x) = (u (x, \cdot))_{Δ} and U_{x} (x) = (u_{x} (x, \cdot))_{Δ},$ we obtain the initial value problem, (10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) with initial data $U (a) = (g_{m})_{Δ}$ and $U_{x} (a) = (h_{m})_{Δ}$ .

In the initial value problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) the matrix D represents a discretization of the time derivative. In [Citation21] it was observed that since, for any matrix, $‖ D ‖_{2}$ is bounded the system of ODEs (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) is well-posed, and the solution depends in a stable way on the used data. By controlling the accuracy of the approximation $D \approx \partial_{t}$ the degree of stability can be adjusted to be suitable for a problem with a specific noise level. For solving the problem numerically, we use a standard ODE solver, and usually it is sufficient to use an explicit method, e.g. a Runge–Kutta code.

In the rest of this paper, we will discuss concrete implementations where the matrix D in (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) is approximated by a Fourier method and also a method where the derivative is computed using a smoothing spline. In both cases we will derive stability estimates and also investigate the properties of the methods using numerical examples.

2.2. Differentiation using the trigonometric interpolant

The unique trigonometric polynomial interpolating a function $g (t)$ , on the uniform grid ${t_{j}}_{j = 1}^{n}$ is [Citation22], (11) $g (t) = \frac{1}{\sqrt{2 π}} \sum_{j = - \frac{n}{2}}^{\frac{n}{2} - 1} {\hat{g}}_{j} e^{i ξ_{j} t}, ξ_{j} = 2 π j / b,$ (11) where the sequence ${{\hat{g}}_{j}}_{j = - \frac{n}{2}}^{\frac{n}{2} - 1}$ are the discrete Fourier coefficients, with the assumption that n is even. The discrete Fourier coefficients are computed by taking the FFT of the vector $g_{Δ}$ .

It is known that very accurate approximations of the derivative, of periodic functions, can be computed by differentiation of the trigonometric polynomial. This leads us to the following definition:

Definition 2.1

The matrix $D_{ξ_{c}} : R^{n} \to R^{n}$ is defined as $D_{ξ_{c}} = F^{H} Λ_{ξ_{c}} F$ , where F is the Fourier matrix, and $Λ_{ξ_{c}}$ is the diagonal matrix (12) $(Λ_{ξ_{c}})_{j, j} = {\begin{cases} i ξ_{j}, & | ξ_{j} | < ξ_{c}, \\ 0, & | ξ_{j} | \geq ξ_{c}, \end{cases}$ (12) and $ξ_{c}$ is the cut-off frequency.

By keeping only frequencies below the cut-off, i.e. $| ξ_{j} | \leq ξ_{c}$ , we effectively remove the high frequency content from the solution (Equation6(6) $\hat{u} (x, ξ) = \cosh (σ (ξ) (a - x)) \hat{g} (ξ) - \frac{\sinh (σ (ξ) (a - x))}{σ (ξ)} \hat{h} (ξ),$ (6) ) and thus stability is restored. The stability of the problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ), will be explored as a sequence of lemmas.

Lemma 2.2.

The matrix $D_{ξ_{c}}$ is bounded and $‖ D_{ξ_{c}} ‖_{2} \leq ξ_{c}$ .

Proof.

The Fourier matrix F is orthogonal and therefore $‖ Λ_{ξ_{c}} ‖_{2} = ‖ D_{ξ_{c}} ‖_{2}$ . Also, since $Λ_{ξ_{c}}$ is diagonal the norm is bounded by its largest diagonal element.

Next, we prove that the problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) is well-posed if the approximation $\partial_{t} \approx D = D_{ξ_{c}}$ is used. In the case of constant coefficients, i.e. constant material propertiens κ, we have the following result;

Lemma 2.3.

Let $D = D_{ξ_{c}}$ , where $ξ_{c} > 0$ is the cut-off frequency, and let $U_{1}$ and $U_{2}$ be two different solutions to (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ), corresponding to Cauchy data $[(g_{1})_{Δ}, h_{Δ}]$ and $[(g_{2})_{Δ}, h_{Δ}]$ , respectively. Then (13) $‖ U_{1} (x) - U_{2} (x) ‖_{2} \leq \cosh (\sqrt{σ} (a - x)) ‖ (g_{1} - g_{2})_{Δ} ‖_{2}, σ = \frac{ξ_{c}}{2 κ} .$ (13)

Proof.

We use the Discrete Fourier Transform and let $V = F (U_{1} - U_{2})$ . Then (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) simplifies to (14) $κ V_{x x} = Λ_{ξ_{c}} V, 0 < x < a,$ (14) with boundary conditions $V (a) = F (g_{1} - g_{2})_{Δ}$ and $V_{x} (a) = F (h - h)_{Δ} = 0$ . Since $Λ_{ξ_{c}}$ is a diagonal matrix, the problem can be solved one frequency $ξ_{j}$ at a time, and (15) $V (x, ξ_{j}) = \cosh (\sqrt{i ξ_{j} / κ} (a - x)) V (a, ξ_{j}), if | ξ_{j} | \leq ξ_{c},$ (15) and $V (x, ξ_{j}) = 0$ othervise. Thus $\begin{aligned} | V (x, ξ_{j}) |^{2} & \leq {| \cosh (\sqrt{i ξ_{j} / κ} (a - x)) |}^{2} {| V (a, ξ_{j}) |}^{2} \\ \leq {(\cosh (\sqrt{(ξ_{c} / 2 κ)} (a - x)))}^{2} {| V (a, ξ_{j}) |}^{2} . \end{aligned}$ Since F is orthogonal $‖ V ‖_{2} = ‖ U_{1} - U_{2} ‖_{2}$ and $‖ V (a, ξ_{j}) ‖_{2} = ‖ (g_{1} - g_{2})_{Δ}) ‖_{2}$ and thus the result follows by summation over all the frequencies $ξ_{j}$ .

It is easy to see that $| \cosh \sqrt{i ξ} | \leq \cosh (\sqrt{ξ_{c} / 2})$ , for $| ξ | \leq ξ_{c}$ , as was used above. In order to treat the second component of (Equation6(6) $\hat{u} (x, ξ) = \cosh (σ (ξ) (a - x)) \hat{g} (ξ) - \frac{\sinh (σ (ξ) (a - x))}{σ (ξ)} \hat{h} (ξ),$ (6) ) we need a similar property for $\sinh (\sqrt{i ξ}) / \sqrt{i ξ}$ . This is demonstrated by the following two lemmas.

Lemma 2.4.

It holds that $| \sinh z |^{2} = \sinh^{2} (R e {z}) + \sin^{2} (I m {z})$ .

Proof.

Let $z = γ + i β$ . Then a direct calculation shows that (16) $| e^{z} - e^{- z} |^{2} = {(e^{γ} - e^{- γ})}^{2} \cos^{2} β + {(e^{γ} + e^{- γ})}^{2} \sin^{2} β .$ (16) Expanding the squares, and using the trigonometric identity $\cos^{2} β + \sin^{2} β = 1$ , yields the desired result.

Lemma 2.5.

The function $g (ξ) = ξ^{- 2} (\sinh^{2} ξ + \sin^{2} ξ)$ , $ξ > 0$ , is monotonically increasing.

Proof.

Let $g_{1} (ξ) = \sinh^{2} ξ + \sin^{2} ξ$ and $g_{2} (ξ) = ξ^{2}$ . According to L'Hôspital's monotone rule, see [Citation23], the function $g (ξ) = g_{1} (ξ) / g_{2} (ξ)$ is monotonically increasing if the same is true for $h (ξ) = g_{1}^{'} (ξ) / g_{2}^{'} (ξ)$ . The results follows by applying the monotone rule twice since a direct calculation shows that $g_{1}^{''} (ξ) = 2 \cosh (2 ξ) + 2 \cos (2 ξ) > 0$ , for $ξ > 0$ .

Finally since $\sqrt{i} = (1 \pm i) / \sqrt{2}$ the two previous lemmas can be combined into the following result.

Corollary 2.6.

The function $| \sinh \sqrt{i ξ} | / | \sqrt{i ξ} |$ is monotonically increasing.

Lemma 2.7.

Let $D = D_{ξ_{c}}$ , where $ξ_{c} > 0$ is the cut-off frequency, and let $U_{1}$ and $U_{2}$ be two different solutions to (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ), corresponding to Cauchy data $[g_{Δ}, (h_{1})_{Δ}]$ and $[g_{Δ}, (h_{2})_{Δ}]$ . Then (17) $‖ U_{1} (x) - U_{2} (x) ‖_{2} \leq \frac{\sinh (\sqrt{ξ_{c} / 2 κ} (a - x)) + 1}{\sqrt{ξ_{c} / 2 κ}} ‖ (h_{1} - h_{2})_{Δ} ‖_{2} .$ (17)

Proof.

The proof is similar to that of Lemma 2.3. Let $V = F (U_{1} - U_{2})$ and note that $V (a) = 0$ and $V_{x} (a) = F (h_{1} - h_{2})_{Δ}$ . It can be verified that the solution, for one frequency $ξ_{j}$ , is (18) $V (x, ξ_{j}) = - \frac{\sinh (\sqrt{(i ξ_{j} / κ)} (a - x))}{\sqrt{i ξ_{j} / κ}} V (a, ξ_{j}), for | ξ_{j} | < ξ_{c},$ (18) and $V (x, ξ_{j}) = 0$ otherwise. Taking absolute values and using the monotononicity of Collorary 2.6 we obtain (19) $| V (x, ξ_{j}) |^{2} \leq {| \frac{\sinh (\sqrt{i ξ_{c} / κ} (a - x))}{\sqrt{i ξ_{c} / κ}} |}^{2} | V (a, ξ_{j}) |^{2} .$ (19) Using Lemma 2.4 leads to (20) $| V (x, ξ_{j}) |^{2} \leq \frac{\sinh^{2} (\sqrt{ξ_{c} / 2 κ} (a - x)) + \sin^{2} (\sqrt{ξ_{c} / 2 κ} (a - x))}{(\sqrt{ξ_{c} / 2 κ})^{2}} | V (a, ξ_{j}) |^{2} .$ (20) Finally, the result follows by using the orhogonality of F, the fact that $\sin^{2} α \leq 1$ , and by summation over all the frequencies $ξ_{j}$ .

Remark 2.8

Note that when deriving the bounds (Equation13(13) $‖ U_{1} (x) - U_{2} (x) ‖_{2} \leq \cosh (\sqrt{σ} (a - x)) ‖ (g_{1} - g_{2})_{Δ} ‖_{2}, σ = \frac{ξ_{c}}{2 κ} .$ (13) ) and (Equation17(17) $‖ U_{1} (x) - U_{2} (x) ‖_{2} \leq \frac{\sinh (\sqrt{ξ_{c} / 2 κ} (a - x)) + 1}{\sqrt{ξ_{c} / 2 κ}} ‖ (h_{1} - h_{2})_{Δ} ‖_{2} .$ (17) ) we used that the eigenvalues of $D_{ξ_{c}}$ , i.e. the frequencies $i ξ_{j}$ , are located on the imaginary axis. Thus we get a slightly better bound compared to just using $‖ D_{ξ_{c}} ‖_{2} \leq ξ_{c}$ .

2.3. Periodization using splines

When using the FFT algorithm we implicitly assume that the vector $f_{Δ}$ represents a periodic function. This is not realistic in our application. In order to avoid wrap-around effects, see [Citation2], we extend the data $f_{Δ}$ , from the interval $[0, 1]$ to $[1, 2]$ . This is done by first computing two first degree polynomials that approximate the first, and last, six points of the vector $f_{Δ}$ in the least squares sense. The two first degree polynomials are then sampled on the original grid creating a total of 12 interpolation points. By moving the six interpolation points located near $t = 0$ to after $t = 2$ we obtain suitable points for creating a cubic spline, defined on the interval $[1, 2]$ , that can be used to extend the vector $f_{Δ}$ to a periodic vector of double length in such a way that the transitions at t = 0 and t = 1 are as smooth as possible.

The procedure is illustrated in Figure where a noisy vector $f_{Δ}$ , representing a function $f (t)$ defined on $[0, 1]$ , has been smoothly extended to double length. By fitting first degree polynomials to six data points, near t = 0 and t = 1, we obtain the interpolation points needed to create the cubic spline defined on the interval $[1, 2]$ . We also display the exact derivative $f^{'} (t)$ and the approximation $D_{ξ_{c}} f_{Δ}$ , where the FFT of the extended data vector is used to approximate the derivative. In this particular experiment the noise level was $ε = 0.5 \cdot 10^{- 1}$ and the cut-off frequency $ξ_{c} = 35$ was used. We manage to filter out most of the noise from the derivative and a sufficient number of terms in (Equation11(11) $g (t) = \frac{1}{\sqrt{2 π}} \sum_{j = - \frac{n}{2}}^{\frac{n}{2} - 1} {\hat{g}}_{j} e^{i ξ_{j} t}, ξ_{j} = 2 π j / b,$ (11) ) is included so that the computed derivative is resonably accurate. Note that Gibbs phenomena near the end points of the interval $[0, 1]$ are avoided and the derivative is reasonably accurate in the whole interval.

Figure 1. We illustrate the periodization process by displaying both a noisy data vector $f_{Δ}$ (left graph) representing a non-periodic function $f (t)$ defined on $[0, 1]$ . The cubic spline, defined on $[1, 2]$ and that matches the slopes of $f (t)$ at t = 0 and t = 1 is also illustrated (left graph). The combined vector represents a periodic function defined on $[0, 2]$ . We also show the exact derivative $f^{'} (t)$ (right graph) and the approximate derivative $D_{ξ_{c}} f_{Δ}$ for $ξ_{c} = 35$ (right graph). The approximation is reasonably good both for t = 0 and t = 1.

2.4. Implementation of the Fourier method

The numerical implementation of the Fourier series-based approach consists of implementing the procedure for computing derivatives described in Section 2.2, together with the periodization presented in Section 2.3, i.e. writing a procedure for computing the product of $D_{ξ_{c}}$ and a vector. Once such a procedure is available we can solve the initial value problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) using a Runge–Kutta method (ode45 in Matlab) with automatic step size control.

In order to create test problems, with a known solution, we select a function $f (t) = u (0, t)$ , and set $u (a, t) = 0$ , for $0 \leq t \leq b$ , and used the initial data $u (x, 0) = 0$ , for 0<x<a. We then discretized the heat equation using the the Crank-Nicholson implicit scheme and computed the corresponding temperature gradient $h (t) = u_{x} (a, t)$ . For our experiments a grid of size n = 200 in space and m = 500 in time was used. By adding normally distributed noise to the computed Cauchy data $[g_{Δ}, h_{Δ}]$ we obtain the noisy data vectors $[(g_{m})_{Δ}, (h_{m})_{Δ}]$ , from which we attempt to reconstruct an approximation of the exact surface temperature $f (t)$ by solving the Cauchy problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ).

The performance of the Fourier method is illustrated by selecting a function $f (t)$ and computing the corresponding Cauchy data $[g_{Δ}, h_{Δ}]$ , as described above. Recall that $g (t) = 0$ is used here. The exact surface temperature $f (t)$ , and also the thermal gradient $h (t)$ , are illustrated in Figure . Note that the function $f (t)$ is not periodic in time and therefore we might expect errors due to wrap-around effects.

Figure 2. We display the functions $f (t)$ and $h (t)$ in the top-left graph. In the top-right graph we show the error $‖ f_{m, ξ_{c}} - f ‖_{2}$ , as a function of $ξ_{c}$ , for the noise level $ϵ = 10^{- 2}$ . Note that there is an optimal value for $ξ_{c}$ . In addition we display the solution $f_{m, ξ_{c}} (t)$ , for $ξ_{c} = 6$ (middle left), which is close to the optimum, and for $ξ_{c} = 18$ (middle right). Also the exact solution $f (t)$ is displayed. In the bottom-left graph we show the optimal $ξ_{c}$ as a function of the noise level ϵ and in the bottom-right graph we show the corresponding error, for the optimal $ξ_{c}$ , as a function of ϵ. Note that for a larger noise level ϵ, we need a smaller value of $ξ_{c}$ , and obtain a larger error in the computed surface temperature $f_{m, ξ_{c}} (t)$ .

First, we set the noise level to $ϵ = 10^{- 2}$ and compute an approximate surface temperature $f_{m, ξ_{c}}$ , for a range of frequencies $1 < ξ_{c} < 20$ , and compute the error $‖ (f_{m, ξ_{c}} - f)_{Δ} ‖_{2}$ . The total error can be divided into two parts. First we have the truncation error $R_{T} = ‖ (f - f_{ξ_{c}})_{Δ} ‖_{2}$ , due to the fact that not all frequencies are included in the numerical solution. Second we have the propagated data error $R_{X F} = ‖ (f_{ξ_{c}} - f_{m, ξ_{c}})_{Δ} ‖_{2}$ . The stability results, see Lemmas 2.3 and 2.7, says that $R_{X F}$ is increasing as a function of $ξ_{c}$ , while the truncation error $R_{T}$ is decreasing as a function of $ξ_{c}$ . Thus the error curve has a clear minimum and an optimal $ξ_{c}$ representing the appropriate trade-off between $R_{X F}$ and $R_{T}$ . For this case we also plot two solutions $f_{m, ξ_{c}} (t)$ , for $ξ_{c} = 6$ and $ξ_{c} = 18$ . We see that in the first case the level of regularization is just right but in the latter case there is too much noise magnification.

Next we pick noise levels in the range $10^{- 5} \leq ϵ \leq 10^{- 1}$ . For each level of noise we find the optimal value of $ξ_{c}$ , which gives the smallest error. We see that a smaller noise ϵ means that we can use a large cut-off frequency $ξ_{c}$ . We also compute the errors, for the optimal value of $ξ_{c}$ , and as we expect a larger noise in the used data, leads to a larger error in the computed approximations $f_{m, ξ_{c}} (t)$ .

3. Regularization by using smoothing splines

In this section we discuss the regularization of the problem (Equation4(4) ${\begin{cases} (κ u_{x})_{x} = u_{t}, & 0 \leq x \leq a, t \geq 0, \\ u (x, 0) = 0, & 0 \leq x \leq a, \\ u (a, t) = g (t), & t \geq 0, \\ u_{x} (a, t) = h (t) & t \geq 0, \end{cases}$ (4) ) using cubic smoothing splines. As previously we will implement a differentiation matrix $D_{λ}$ using cubic splines and solve the discretized initial value problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) numerically. The goal is to reach similar accuracy and stability properties as reported for the Fourier-based derivative $D_{ξ_{c}}$ , while avoiding the practical difficulties that occur when the data vectors do not represent periodic functions.

In the following subsections we introduce a differentiation matrix $D_{λ}$ and also give a bound for its norm. Also we show that the matrix $D_{λ}$ is skew-symmetric. Thus stability results, similar to Lemmas 2.3 and 2.7, can be derived.

3.1. Differentiation using smoothing splines

In this section we present known facts about splines and introduce the notation needed in our work. For simplicity we restrict ourselves to considering the interval $0 \leq t \leq 1$ . Thus we have a uniform grid Δ, such that $0 = t_{1} < \dots < t_{n} = 1$ , with grid parameter h. We work in an $L^{2}$ setting and introduce the Sobolev space (21) $W^{2} [0, 1] = {u : u, u^{'} abs . cont . on [0, 1] and u^{''} \in L^{2} [0, 1]},$ (21) together with the standard norms and semi-norms defined, see [Citation24].

Let $y = (y_{1}, y_{2}, \dots, y_{n})^{T}$ be a vector of data values on the grid. The smoothing cubic spline, that approximates $y_{i}$ on the grid, is defined as follows:

Definition 3.1

Let $y \in R^{n}$ be a vector. The cubic smoothing spline $s_{λ} [y]$ is obtained by solving (22) $min_{u \in W^{2} [0, 1]} h ‖ y - u_{Δ} ‖_{2}^{2} + λ | u |_{2}^{2},$ (22) where $λ > 0$ is the regularization parameter and $| u |_{2}$ denotes a semi-norm.

It is well known that the solution to (Equation22(22) $min_{u \in W^{2} [0, 1]} h ‖ y - u_{Δ} ‖_{2}^{2} + λ | u |_{2}^{2},$ (22) ) is a natural cubic spline. It is also known that the corresponding operator, mapping $y \in R^{n}$ onto $s_{λ} [y] \in W^{2} ([0, 1])$ , is linear, see [Citation24,Citation25]. Thus we can introduce a matrix as follows:

Definition 3.2

The matrix $S_{λ} \in R^{n \times n}$ is defined by (23) $S_{λ} y = (s_{λ} [y])_{Δ} \in R^{n}, for all y \in R^{n} .$ (23)

The matrix $S_{λ}$ has many interesting properties. For instance $‖ S_{λ} ‖_{2} \leq 1$ . Also, the problem (Equation22(22) $min_{u \in W^{2} [0, 1]} h ‖ y - u_{Δ} ‖_{2}^{2} + λ | u |_{2}^{2},$ (22) ) is symmetric in the sense that $s_{λ} [y] (t) = s_{λ} [\bar{y}] (1 - t)$ , where $\bar{y} = (y_{n}, y_{n - 1}, \dots, y_{1})^{T}$ is the vector taken in reverse order. A consequence is the following result:

Lemma 3.3.

The matrix $S_{λ} \in R^{n \times n}$ is symmetric.

Proof.

Let $y \in R^{n}$ . By introducing a basis, e.g. the B-splines ${β_{i} (t)}_{i = 0}^{n + 1}$ , for the space of cubic splines, and writing the minimizer of (Equation22(22) $min_{u \in W^{2} [0, 1]} h ‖ y - u_{Δ} ‖_{2}^{2} + λ | u |_{2}^{2},$ (22) ) as $u (t) = c_{0} β_{0} (t) + \dots + c_{n + 1} β_{n + 1} (t)$ , we can derive an expression for $S_{λ}$ as follows: First $S_{λ} y = u_{Δ} = A c$ , where $(A)_{i j} = β_{j} (t_{i})$ . Second, the seminorm $| u |_{2}$ can be written in the form $| u |_{2}^{2} = c^{T} B c$ , where $(B)_{i j} = (β_{i}, β_{j})_{2}$ . Thus the normal equations are $(h A^{T} A + λ B) c = h A^{T} y$ . Thus $S_{λ} = A (h A^{T} A + λ B)^{- 1} A^{T}$ is symmetric. For the case $λ = 0$ , i.e. interpolating natural cubic splines, this is a known result [Citation25].

Now we are ready to introduce the differentiation matrix $D_{λ}$ . More precisely, we make the following definition:

Definition 3.4

The matrix $D_{λ} \in R^{n \times n}$ is defined by (24) $D_{λ} y = (s_{λ}^{'} [y] (t))_{Δ}, for all y \in R^{n} .$ (24)

The product $D_{λ} y$ is computed by finding the smoothing spline $s_{λ} [y] (t)$ and sampling its derivative on the grid.

Lemma 3.5.

The matrix $D_{λ}$ is skew-symmetric.

Proof.

The matrix $D_{λ}$ can be written in the form $D_{λ} = D W A^{T}$ , where the matrix $W = h (h A^{T} A + λ B)^{- 1}$ is symmetric, $(A)_{i j} = β_{j} (t_{i})$ , and $(D)_{i j} = β_{j}^{'} (t_{i})$ , see the proof of Lemma 3.3. A direct calculation shows that (25) $(D_{λ})_{i, j} = \sum_{l = 0}^{n + 1} \sum_{k = 0}^{n + 1} β_{l}^{'} (t_{i}) w_{l k} β_{k} (t_{j}) .$ (25) Since the B-spline basis function $β_{k} (t)$ is even, with respect to $t = t_{k}$ , and the derivative $β_{l}^{'} (t)$ is odd with respect to $t = t_{l}$ , we see that for the product $β_{l}^{'} (t_{i}) β_{k} (t_{j}) = - β_{l}^{'} (t_{j}) β_{k} (t_{i})$ . Thus we can rearrange the indices to show that $(D_{λ})_{i j} = - (D_{λ})_{j i}$ .

A consequence of the fact that $D_{λ}$ is skew-symmetric is that its eigenvalues are purely imaginary. The fact is used to derive stability estimates when (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) is solved using the smoothing spline method.

3.2. A bound for the matrix $D_{λ}$

In this section we derive a bound for $‖ D_{λ} ‖_{2}$ . Most of the theory presented in this section represent simplifications, with simpler proofs, of more general results found in [Citation24], where an error estimate $| u - s_{λ} [u_{Δ}] |_{1}$ was derived. For our work we instead need a bound for $‖ D_{λ} ‖_{2}$ which can be obtained using similar techniques. We need a series of Lemmas.

Lemma 3.6.

Let $u \in C^{(1)} ([0, 1])$ , $Δ = {t_{i}}_{i = 1}^{n}$ be a uniform grid, and $h = t_{2} - t_{1}$ . Then, if h is chosen so that $h | u |_{1} \leq 0.1 | u |_{0}$ , it holds that (26) $c_{0} | u |_{0} \leq \sqrt{h} ‖ u_{Δ} ‖_{2} \leq c_{1} | u |_{0},$ (26) where $c_{0}$ and $c_{1}$ are constants.

Proof.

We do the proof of one inequality as the other is obtained in a similar way. Let $ν_{i} = (t_{i - 1} + t_{i}) / 2$ , with $ν_{1} = 0$ and $ν_{n + 1} = 1$ . For $t, t_{i} \in [ν_{i}, ν_{i + 1}]$ we have $\begin{aligned} u^{2} (t_{i}) & = u^{2} (t) + \int_{t}^{t_{i}} (u^{2} (τ))^{'} d τ = u^{2} (t) + \int_{t}^{t_{i}} 2 u (τ) u^{'} (τ) d τ \\ \leq u^{2} (t) + 2 {(\int_{ν_{i}}^{ν_{i + 1}} u^{2} (τ) d τ)}^{1 / 2} {(\int_{ν_{i}}^{ν_{i + 1}} (u^{'} (τ))^{2} d τ)}^{1 / 2} \end{aligned}$ Now we can integrate over $[ν_{i}, ν_{i + 1}]$ , sum over all the intervals, and then use the discrete Cauchy-Schwarz inequality, to obtain (27) $h ‖ u_{Δ} ‖_{2}^{2} \leq | u |_{0}^{2} + 2 h | u |_{0} | u |_{1} .$ (27) The result follows by using $h | u |_{1} \leq 0.1 | u |_{0}$ . Also note that $c_{1} = \sqrt{1.2}$ .

The above Lemma can be used to estimate the discrete norm $‖ u_{Δ} ‖_{2}$ in terms of the continuous norm $| u |_{0}$ , and also the other way around.

In order to obtain the desired bound for $‖ D_{λ} ‖_{2}$ we need a result that follows directly from the fact that the cubic smoothing spline is the minimizer of (Equation22(22) $min_{u \in W^{2} [0, 1]} h ‖ y - u_{Δ} ‖_{2}^{2} + λ | u |_{2}^{2},$ (22) ).

Lemma 3.7.

Let $y \in R^{n}$ be a vector and $u \in W^{2} ([0, 1])$ be corresponding smoothing spline. Then (28) $h (y - u_{Δ})^{T} u_{Δ} = λ | u |_{2}^{2} .$ (28)

Proof.

A norm, and scalar product, for $(y, v) \in R^{n} \oplus L^{2} ([0, 1])$ is given by (29) $‖ (y, v) ‖^{2} = h ‖ y ‖_{2}^{2} + λ | v |_{0}^{2} .$ (29) The problem defining the cubic smoothing spline, see Lemma 3.1, can then be written as: Find $u \in W^{2} ([0, 1])$ that minimize $‖ (y, 0) - (u_{Δ}, u^{''}) ‖$ . The solution of the least squares problem is characterized by the residual $(y - u_{Δ}, - u^{''})$ being orthogonal to $(v_{Δ}, v^{''})$ , for all $v \in W^{2} ([0, 1])$ . This means that (30) $h (y - u_{Δ})^{T} v_{Δ} + λ (- u^{''}, v^{''})_{0} = 0.$ (30) Insert v = u and the result follows.

Proposition 3.8.

The matrix $D_{λ}$ , defined by (3.4), satisfies (31) $‖ D_{λ} ‖ \leq c_{3} λ^{- 1 / 4},$ (31) where $c_{3}$ is a constant.

Proof.

From the definition of $D_{λ}$ , and using Lemma 3.6, we obtain (32) $‖ D_{λ} y ‖_{2}^{2} = ‖ (u^{'})_{Δ} ‖_{2}^{2} \leq \frac{c_{1}^{2}}{h} | u^{'} |_{0}^{2} = \frac{c_{1}^{2}}{h} | u |_{1}^{2},$ (32) where $u (t) = s_{λ} [y] (t)$ is the smoothing cubic spline corresponding to the vector $y \in R^{n}$ . Next we use the inequality (33) $τ^{2} | u |_{1}^{2} \leq c_{2} (| u |_{0}^{2} + τ^{4} | u |_{2}^{2}),$ (33) where $c_{2}$ is a constant, which is valid for $0 < τ < 1$ , and all $u \in W^{2} ([0, 1])$ , see [Citation24, Lemma 3.9] or, originally, [Citation26]. We obtain (34) $‖ D_{λ} y ‖_{2}^{2} \leq \frac{c_{1}^{2} c_{2}}{h τ^{2}} (| u |_{0}^{2} + τ^{4} | u |_{2}^{2}) \leq \frac{c_{1}^{2} c_{2}}{h τ^{2}} (\frac{h}{c_{0}^{2}} ‖ u_{Δ} ‖_{2}^{2} + τ^{4} | u |_{2}^{2}),$ (34) where we again used Lemma 3.6. For the final step we write $y = y - u_{Δ} + u_{Δ}$ and expand $‖ y ‖_{2}^{2}$ into (35) $‖ y ‖_{2}^{2} = (y - u_{Δ} + u_{Δ})^{T} (y - u_{Δ} + u_{Δ}) = ‖ y - u_{Δ} ‖_{2}^{2} + 2 (y - u_{Δ})^{T} u_{Δ} + ‖ u_{Δ} ‖_{2}^{2} .$ (35) By inserting Lemma 3.7 into the above expression we obtain (36) $‖ y ‖_{2}^{2} = ‖ y - u_{Δ} ‖_{2}^{2} + \frac{2 λ}{h} | u |_{2}^{2} + ‖ u_{Δ} ‖_{2}^{2} .$ (36) From (Equation36(36) $‖ y ‖_{2}^{2} = ‖ y - u_{Δ} ‖_{2}^{2} + \frac{2 λ}{h} | u |_{2}^{2} + ‖ u_{Δ} ‖_{2}^{2} .$ (36) ) we obtain the two estimates (37) $| u |_{2}^{2} \leq \frac{h}{2 λ} ‖ y ‖_{2}^{2}, and ‖ u_{Δ} ‖_{2}^{2} \leq ‖ y ‖_{2}^{2} .$ (37) Inserting these two inequalities into (Equation34(34) $‖ D_{λ} y ‖_{2}^{2} \leq \frac{c_{1}^{2} c_{2}}{h τ^{2}} (| u |_{0}^{2} + τ^{4} | u |_{2}^{2}) \leq \frac{c_{1}^{2} c_{2}}{h τ^{2}} (\frac{h}{c_{0}^{2}} ‖ u_{Δ} ‖_{2}^{2} + τ^{4} | u |_{2}^{2}),$ (34) ) yields (38) $‖ D_{λ} y ‖_{2}^{2} \leq \frac{c_{1}^{2} c_{2}}{h τ^{2}} (\frac{h}{c_{0}^{2}} + \frac{τ^{4} h}{2 λ}) ‖ y ‖_{2}^{2} .$ (38) The result follows by inserting $τ = λ^{1 / 4}$ into (Equation38(38) $‖ D_{λ} y ‖_{2}^{2} \leq \frac{c_{1}^{2} c_{2}}{h τ^{2}} (\frac{h}{c_{0}^{2}} + \frac{τ^{4} h}{2 λ}) ‖ y ‖_{2}^{2} .$ (38) ).

3.3. Stability analysis for cubic splines

In this section we establish stability results, for the initial value problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ), when the approximation $\partial_{t} \approx D = D_{λ}$ is used. The proofs are based on the idea that since $D_{λ}$ is skew-symmetric we have an eigenvalue decomposition (39) $D_{λ} = X Λ_{λ} X^{H}, (Λ_{λ})_{j, j} = i λ_{j},$ (39) where the eigenvector matrix X is unitary, the $λ_{j}$ are real, and are bounded by $| λ_{j} | \leq c_{3} λ^{- 1 / 4} / b$ . The factor b is due to the fact that the theory in the previous section was developed under the assumption that the time interval was 0<t<1.

Lemma 3.9.

Let $D = D_{λ}$ , where $λ > 0$ is the regularization parameter, and let $U_{1}$ and $U_{2}$ be two different solutions to (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ), corresponding to Cauchy data $[(g_{1})_{Δ}, h_{Δ}]$ and $[(g_{2})_{Δ}, h_{Δ}]$ , respectively. Then (40) $‖ U_{1} (x) - U_{2} (x) ‖_{2} \leq \cosh (\sqrt{σ} (a - x)) ‖ (g_{1} - g_{2})_{Δ} ‖_{2}, σ = \frac{c_{3} λ^{- 1 / 4}}{2 b κ} .$ (40)

Proof.

Let X be as defined in (Equation39(39) $D_{λ} = X Λ_{λ} X^{H}, (Λ_{λ})_{j, j} = i λ_{j},$ (39) ) and introduce a new variable $V = X^{H} (U_{1} - U_{2})$ . The system (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) simplifies to (41) $κ V_{x x} = Λ_{λ} V, 0 < x < a,$ (41) with boundary conditions $V (a) = X^{H} (g_{1} - g_{2})_{Δ}$ and $V_{x} (a) = X^{H} (h - h)_{Δ} = 0$ . This is exactly the same situation as in the proof of Lemma 2.3 and the result follows by repeating the same steps.

Similarily we can repeat the steps for the proof of Lemma 2.7 to obtain the following result:

Lemma 3.10.

Let $D = D_{ξ_{c}}$ , where $λ > 0$ is the regularization parameter, and let $U_{1}$ and $U_{2}$ be two different solutions to (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ), corresponding to Cauchy data $[g_{Δ}, (h_{1})_{Δ}]$ and $[g_{Δ}, (h_{2})_{Δ}]$ . Then (42) $‖ U_{1} (x) - U_{2} (x) ‖_{2} \leq \frac{\sinh (\sqrt{σ} (a - x)) + 1}{\sqrt{σ}} ‖ (h_{1} - h_{2})_{Δ} ‖_{2}, σ = \frac{c_{3} λ^{- 1 / 4}}{2 b κ} .$ (42)

The above results means that the problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) is well-posed when the approximation $\partial_{t} \approx D_{λ}$ is used. The regularization parameter λ fills the same role as the cut-off frequency $ξ_{c}$ for the Fourier method. For this method a small value for λ leads to a more accurate derivative and thus less stability for the inverse problem. If $ξ_{c} \approx λ^{- 1 / 4}$ then both methods satisfy similar stability estimates.

4. Simulated numerical examples

In this section we present numerical examples intended to illustrate the properties of the smoothing spline method. The experiments are constructed essentially the same way as in Section 2.4. There are many codes available for solving the minimization problem (Equation22(22) $min_{u \in W^{2} [0, 1]} h ‖ y - u_{Δ} ‖_{2}^{2} + λ | u |_{2}^{2},$ (22) ) and for our work we use the Matlab function csaps; which can be used to find the cubic smoothing spline $s_{λ} [y] (t)$ for a given vector y, a stepsize h, and regularization parameter λ .Footnote¹

Test 1 As a first experiment we solve the same problem, as was used in Section 2.4. Thus we first set the noise level to be $ϵ = 10^{- 2}$ and solve the inverse problem for a wide range of regularization parameters λ. In Figure we present the results. We note that, as previously, there is an optimal regularization parameter λ, which represents the appropriate trade-off between accuracy and stability. We also display the computed solution for $λ = 2 \cdot 10^{- 8}$ which is close to the optimal value. The accuracy of the numerical solution is comparable to that computed by the Fourier method.

Figure 3. In the top-left graph we show the error $‖ f_{m, λ} - f ‖_{2}$ , as a function of λ, for the noise level $ϵ = 10^{- 2}$ . Note that there is an optimal value for λ. In the top-right graph we display the surface temperature for $λ = 2 \cdot 10^{- 8}$ which is close to the optimal value. In the bottom-left graph we show the optimal λ as a function of the noise level ϵ and in the bottom-right graph we show the corresponding error, for the optimal λ, as a function of ϵ. Note that for a larger noise level ϵ, we need a larger value of λ, and obtain a larger error in the computed surface temperature $f_{m, λ} (t)$ .

Next we illustrate the regularization properties of the method by letting the noise level vary in the range $10^{- 5} \leq ϵ \leq 10^{- 1}$ . For each different noise level we find the optimal λ and compute the corresponding error. smallest error. We see that a smaller noise ϵ means that we can use a smaller λ, and thus compute the derivatives more accurately. We also see that a larger noise in the data leads to a larger error in the computed surface temperatures $f_{m, λ} (t)$ . The results are comparable to those reported in Section 2.4.

Test 2 In order to further investigate the properties of the two numerical methods we select a variable coefficient (43) $κ (u) = 1 + \sin (π u)^{0.8} + 4.2 \exp (- 27.3 (a / 3 - u)^{2}) .$ (43) This choice of $κ (u)$ gives a slightly less ill-posed problem compared to our previous test. Note that since the problem now is non-linear iteration is needed to solve the problem and create numerical test data. The numerical code for the inverse problem (Equation10(10) ${(\begin{matrix} u \\ κ U_{x} \end{matrix})}_{x} = (\begin{array}{cc} 0 & κ^{- 1} I \\ D & 0 \end{array}) (\begin{matrix} U \\ κ U_{x} \end{matrix}), 0 \leq x \leq a,$ (10) ) remains the same. For this experiment we chose a smoothed out stepfunction $f (t)$ as the exact surface temperature and the time interval is 0<t<b, where b = 5. The size of the time grid is again n = 400. Our objective is to try and verify that the Fourier transform method leads to a larger error, at the beginning of the time interval due to the implicit periodicity assumption. This motivates our choice of $f (t)$ as a step function. The idea is that Gibbs phenomena near the discontinuity causes a larger error in the second half of the interval $[0, b]$ , and that this should also cause errors near t = 0 for the Fourier method. We use a smoothed out step function, and not a true discontinuity, to make the test a bit easier for the Fourier method.

First we compute the errors $‖ f - f_{m, λ} ‖_{2}$ and $‖ f - f_{m, ξ_{c}} ‖_{2}$ for a range of regularization parameters. For this test we use the noise level $ϵ = 10^{- 2}$ . The results are displayed in Figure . We see that for both methods too little regularization leads to a large error due to instability. However, also for both methods, the truncation error is fairly large due to Gibbs phenomena regardless of the choice of the regularization parameter. There is no clear optimal choice for the level of regularization. We also display two approximate solutions for $λ = 10^{- 8}$ and $ξ_{c} = 7$ , respectively. Both solutions are about as accurate however the spline method clearly leads to a smaller error initially. This is due to a lack of wrap-around effects.

Figure 4. We present tests where the exact solution $f (t)$ is a smoothed step function. The top graphs show the error $‖ f - f_{m, λ} ‖_{2}$ (left) for the spline method and the error $‖ f - f_{m, ξ_{c}} ‖_{2}$ (right) for the Fourier method. The middle graphs display the numerical solutions $f_{m, λ} (t)$ (left) obtained using the spline method and $λ = 10^{- 8}$ and the solution $f_{m, ξ_{c}} (t)$ computed using the Fourier method and $ξ_{c} = 7$ . The lower graphs show the errors $‖ f - f_{m_{k}, λ} ‖_{2}$ x markers) and $‖ f - f_{m_{k}, ξ_{c}} ‖_{2}$ o markers) for different random noise sequences $ϵ_{k}$ . In the left graph the variance of the noise is $ϵ = 10^{- 2}$ , $λ = 10^{- 8}$ and $ξ_{c} = 7$ . In the right graph instead $ϵ = 10^{- 3}$ , $λ = 2 \cdot 10^{- 9}$ and $ξ_{c} = 8.3$ . In both cases 100 different sets of random noise were generated.

The noise level $ϵ = 10^{- 2}$ represents the variance of the normally distributed random noise added to the data $[g_{Δ}, h_{Δ}]$ at individual grid points. However an observation is that different random sequences can give quite different errors in the computed solutions. Thus we perform 100 different experiments, with the same parameters b = 5, $ϵ = 10^{- 2}$ and $κ (x)$ as specified above. For each experiment we compute the error using the Euclidea norm $‖ \cdot ‖_{2, I}$ , where only grid points $t_{i}$ inside the interval $I = [0, 1]$ counts towards the sum. This means that we only look at the errors at the beginning of the time interval.

To select the appropriate levels of regularization is difficult since the methods behave differently. In this case we did the following: We use the exact solution $f (t) = 0$ , and thus $g_{Δ} = h_{Δ} = 0$ . This means we only have noise. We picked $λ = 10^{- 8}$ and adjusted $ξ_{c}$ until both methods give the same mean error, over 100 tests. In this case that happens for $ξ_{c} \approx 7$ . Thus the parameters are chosen so that random noise is magnified equally by both methods. The results are shown in Figure . We see that for the step function $f (t)$ the error, measured using $‖ \cdot ‖_{2, [0, 1]}$ , is on average significantly larger for the Fourier method which shows that the method suffers from problems with wrap-around effects.

We also tried a lower noise level $ϵ = 10^{- 3}$ . In this case the appropriate regularization parameters are $λ = 2 \cdot 10^{- 9}$ and $ξ_{c} = 8.3$ . Again when we run 100 tests, with different noise sequences, we see that the spline method has a smaller error initially, due to avoidance of wrap-around effects.

5. Concluding remarks

In this paper we have developed a regularization method for solving the inverse heat conduction problem by using smoothing cubic splines. Previously, the same problem has been solved using Fourier transforms [Citation2,Citation7,Citation13]. The Fourier method is working well but makes the implicit assumption that the data vectors represents periodic functions. This is not true for the problem under consideration and this can potentially lead to very large errors in practice. We present one of the standard techniques, called periodization, that is used to deal with non-periodic data vectors and thus avoid wrap-around effects. In our work we show that, while the periodization works reasonably well, there are increased errors in the numerical solution due to the periodicity assumptions needed for the Fourier method. To avoid the need for a periodicity assumption is why we introduce the smoothing spline method as an alternative. Our experiments show that we do indeed avoid an increased error due to wrap-around effects.

The inverse heat conduction problem is ill-posed in the sense that small errors in the data can cause large errors in the numerical solution. Thus regularization is needed. We demonstrate that the Fourier method is a regularization of the problem by providing stability estimates. The stability theory is developed for the discrete problem that is solved numerically. Though a complication is that the periodization procedure is not included in the stability analysis.

For the smoothing spline method we introduce a matrix $D_{λ}$ that represents differentiation of the smoothing spline obtained from a vector y consisting of function values on the grid. Our goal is to develop similar stability estimates as for the Fourier method. Thus we derive a bound on the norm $‖ D_{λ} ‖_{2}$ and also show that the matrix is skew-symmetric. Our theory is based on a previous paper [Citation24] but since our work concerns only a special case we can obtain simpler proofs. Since no periodicity assumption is needed, as for the Fourier method, one advantage is that the stability theory corresponds more directly to the discrete problem that is actually solved by our codes. From our experiments we also deduce that the smoothing splines method can effectively be used as a regularization method for solving the Cauchy problem.

In this paper we only present a stability analysis that bounds the errors in the solution caused by random errors in the data. In [Citation24] an error estimate $‖ u^{'} - s_{λ}^{'} [u_{Δ}] ‖_{L^{2}}$ is derived. In future work we intend to make use of a similar error estimate for our matrix approximation $D_{λ}$ and also limit the truncation error for our method. We also intend to apply our method to similar problems and compare with the work of other authors, e.g. [Citation16,Citation18]. We are also looking to apply the method to industrial problems with measured data.

Acknowledgments

The work of Mary Nanfuka is supported by the SIDA bilateral programme (2015–2020) with Makerere University; Project 316: Capacity building in Mathematics. The authors also thank prof. Matti Heiliö for valuable discussions during the work.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1 The code used for our experiments is made available upon request by the corresponding author

References

Beck JV, Blackwell B, Clair SR. Inverse heat conduction. Ill-posed problems. New York: Wiley; 1985.
Google Scholar
Eldén L, Berntsson F, Regińska T. Wavelet and Fourier methods for solving the sideways heat equation. SIAM J Sci Comput. 2000;21(6):2187–2205.
Web of Science ®Google Scholar
Gardarein J-L, Gaspar J, Corre Y, et al. Inverse heat conduction problem using thermocouple deconvolution application to the heat flux estimation in a tokamak. Inverse Probl Sci Eng. 2013;21(5):854–864.
Web of Science ®Google Scholar
Jahedi M, Berntsson F, Wren J, et al. Transient inverse heat conduction problem of quenching a hollow cylinder by one row of water jets. Int J Heat Mass Transf. 2018;117(Supplement C):748–756.
Google Scholar
Li D, Wells MA. Effect of subsurface thermocouple installation on the discrepancy of the measured thermal history and predicted surface heat flux during a quench operation. Metallurg Mater Trans. 2005;36B:343–354.
Google Scholar
Taler J, Weglowski B, Pilarczyk M. Monitoring of thermal stresses in pressure components using inverse heat conduction methods. Int J Numer Meth Heat Fluid Flow. 2017;27(3):740–756.
Web of Science ®Google Scholar
Wikström P, Blasiak W, Berntsson F. Estimation of the transient surface temperature, heat flux and effective heat transfer coefficient of a slab in an industrial reheating furnace by using an inverse method. Steel Res Int. 2007;78(1):63–70.
Web of Science ®Google Scholar
Berntsson F. An inverse heat conduction problem and improving shielded thermocouple accuracy. Numer Heat Transfer, Part A: Appl. 2012;61(10):754–763.
Web of Science ®Google Scholar
Xiong X-T, Fu C-L. A spectral regularization method for solving surface heat flux on a general sideways parabolic. Appl Math Comput. 2008;197(1):358–365.
Web of Science ®Google Scholar
Carasso AS. Slowly divergent space marching schemes in the inverse heat conduction problem. Numer Heat Transfer, Part B. 1993;23:111–126.
Web of Science ®Google Scholar
Guo L, Murio DA, Roth C. A mollified space marching finite differences algorithm for the inverse heat conduction problem with slab symmetry. Computers Math Appl. 1990;19:75–89.
Web of Science ®Google Scholar
Mejía CE, Murio DA. Numerical solution of generalized IHCP by discrete mollification. Computers Math Appl. 1996;32:33–50.
Web of Science ®Google Scholar
Berntsson F. A spectral method for solving the sideways heat equation. Inverse Probl. 1999;15:891–906.
Web of Science ®Google Scholar
Blevins LG, Pitts WM. Modeling of bare and a spirated thermocouples in compartment fires. Fire Saf J. 1999;33:239–259.
Web of Science ®Google Scholar
Kemp SE, Annaheim S, Rossi RM, et al. Test method for characterising the thermal protective performance of fabrics exposed to flammable liquid fires. Fire Mater. 2016;41:750–767.
Web of Science ®Google Scholar
Karimi M, Rezaee A. Regularization of the Cauchy problem for the Helmholtz equation by using Meyer wavelet. J Comput Appl Math. 2017;320:76–95.
Web of Science ®Google Scholar
Regińska T. Sideways heat equation and wavelets. J Comput Appl Math. 1995;63:209–214.
Web of Science ®Google Scholar
Foadian S, Pourgholi R, Hashem Tabasi S. Cubic b-spline method for the solution of an inverse parabolic system. Appl Anal. 2018;97(3):438–465.
Web of Science ®Google Scholar
Reinsch CH. Smoothing by spline functions. Numer Math. 1971;16(5):451–454.
Web of Science ®Google Scholar
Isakov V. Inverse problems for partial differential equations. New York: Springer; 1998.
Google Scholar
Eldén L. Solving the sideways heat equation by a ‘method of lines’. J Heat Transfer, Trans ASME. 1997;119:406–412.
Web of Science ®Google Scholar
Gustafsson B, Kreiss H-O, Oliger J. Time dependent problems and difference methods. New York: Wiley Interscience; 1995.
Google Scholar
Anderson GD, Vamanamurthy MK, Vuorinen MK. Conformal invariants, inequalities, and quasiconformal maps. New York: John Wiley & Sons, Inc.; 1997.
Google Scholar
Ragozin DL. Error bounds for derivative estimates based on spline smoothing of exact or noisy data. J Approx Theor. 1983;37(4):335–355.
Web of Science ®Google Scholar
Schoenberg IJ. Spline interpolation and the higher derivatives. Proc Nat Acad Sci USA. 1964;51:24–28.
PubMed Web of Science ®Google Scholar
Agmon S. Lectures on elliptic boundary value problems. Princeton (NJ): D. Van Nostrand Co.; 1965.
Google Scholar

Solving a Cauchy problem for the heat equation using cubic smoothing splines

ABSTRACT

1. Introduction

2. The Cauchy problem for the heat equation

2.1. Stabilization by discretizing the time variable