Research Article

A high order PDE-constrained optimization for the image denoising problem

Pages 1821-1863 | Received 21 Jan 2020, Accepted 05 Dec 2020, Published online: 30 Dec 2020

Abstract

In the present work, we investigate the inverse problem of identifying simultaneously the denoised image and the weighting parameter that controls the balance between two diffusion operators for an evolutionary partial differential equation (PDE). The problem is formulated as a non-smooth PDE-constrained optimization model. This PDE is constructed by second- and fourth-order diffusive tensors that combines the benefits from the diffusion model of Perona–Malik in the homogeneous regions, the Weickert model near sharp edges and the fourth-order term in reducing staircasing. The existence and uniqueness of solutions for the proposed PDE-constrained optimization system are provided in a suitable Sobolev space. Also, an optimization problem for the determination of the weighting parameter is introduced based on the Primal–Dual algorithm. Finally, simulation results show that the obtained parameter usually coincides with the better choice related to the best restoration quality of the image.


1. Introduction

Inverse problems are extensively studied in many areas and applications [Citation1], among them image processing and especially the denoising task. The inverse problem discussed in this paper can be viewed as a variational denoising of images with total variation regularization [Citation2–4]. Several combined first- and second-order TV-type regularization methods have been proposed [Citation5,Citation6], and a major difficulty encountered in this kind of problem is how to choose the appropriate weighting parameter. In fact, there exists a vast amount of mathematical work on how to identify this parameter; see for example [Citation7,Citation8] and the references therein. Many of these papers follow iterative and statistical approaches, which are usually based on a statistical estimation of the noise distribution [Citation9,Citation10]. Some successful methods apply to the total variation denoising problem, where the authors incorporate a statistically chosen, spatially variable parameter into the standard model [Citation11,Citation12]. Another choice of the weighting parameter is based on the geometry of the image together with a specified threshold. In [Citation13], the authors propose an algorithm that determines the weighting parameter λ by maximizing the Signal to Noise Ratio (SNR) of the denoised image; similar work is proposed in [Citation14] with satisfactory results. For a more detailed overview of methods for selecting the weighting parameter λ in general inverse problems, see [Citation15] and the references therein. In a recent work, a new choice of the weighting parameter is proposed based on learning the noise model through a nonsmooth PDE-constrained optimization model [Citation16].
This approach has been adopted for several noise distributions and has shown promising results for determining an optimal weighting parameter in the fidelity term; it may also be used for other variational image regularization models. The weighting parameter is computed from a training set of original and noisy images, while no information is required concerning the type or strength of the noise. In the last decade, combined first- and higher-order regularizations have had considerable success in several image processing tasks; see [Citation6,Citation17–19]. These regularizations were introduced to reduce the staircasing effect. However, the selection of the weighting parameter λ that controls the balance between the first- and higher-order terms remains delicate, and an automatic selection of this parameter is therefore desirable. Recently, bilevel techniques have had great success in this direction. Bilevel optimization is well known and well understood in machine learning. It can be considered a semi-supervised learning method because of its ability to adapt to a given training set and produce valuable solutions. For instance, in [Citation20], bilevel optimization was proposed for finite-dimensional Markov field models. Optimal inversion and exploratory acquisition setups have been studied in terms of optimal model design [Citation21]. Lately, parameter learning for variational regularization models has been introduced in image processing by several works [Citation16,Citation22–26]. One very interesting contribution is [Citation27], where a spatially varying regularization parameter in total variation image denoising is computed. Before that, a weighted total variation model was also investigated with a bilevel optimization strategy [Citation28,Citation29].
More recently, and using the same principle, a bilevel minimization framework for the automated selection of the regularization parameter in the total generalized variation (TGV) case was introduced with efficient results. However, to our knowledge, bilevel techniques have not yet been studied for PDE denoising models.

To avoid the blurring effect in homogeneous regions and to preserve the coherence-enhancing property of the image, we propose a new PDE with two diffusion operators. This PDE combines the advantage of the Perona–Malik diffusion behaviour in flat regions [Citation30], the efficiency of the Weickert filter effect [Citation31–33] near sharp edges, and the ability of the fourth-order operator to avoid the staircasing effect. Thus, taking these properties into account, the proposed PDE enhances the local information in images much better. However, the construction of this PDE as a convex combination of two operators requires the introduction of a weighting parameter λ = λ(x). This parameter affects the contrast of the restored image as well as its degree of smoothness. In general, the choice of this parameter is not easy, and it always depends on the image properties and on the type and nature of the noise. In this paper, we propose a PDE-constrained optimization model that computes the weighting parameter λ as well as the denoised image X without any knowledge of the image features or the noise type. In fact, we study the following optimal control problem (where the data-fidelity term is taken in the L^1 norm, since it is robust against impulse noise; see [Citation2] for more details):
\[
\min_{\lambda\in U_{ad}} J(\lambda), \tag{1}
\]
where
\[
J(\lambda)=\int_\Omega |X(T,x)-Y(x)|\,dx+\beta\,TV(\lambda)
\]
and X is a solution of the following problem:
\[
\begin{cases}
\dfrac{\partial X}{\partial t}+\nabla^2\!\cdot\!\big(\lambda\,D(J_{\rho\sigma}(X))\,\nabla^2 X\big)-\nabla\!\cdot\!\big((1-\lambda)\,D(J_{\rho\sigma}(X))\,\nabla X\big)=0 & \text{in } ]0,T[\times\Omega,\\
\langle D(J_{\rho\sigma}(X))\,\nabla X,\nu\rangle=0 & \text{on } ]0,T[\times\partial\Omega,\\
X(t,x)=0 & \text{on } (0,T)\times\partial\Omega,\\
X(0,x)=X_0(x) & \text{in } \Omega,
\end{cases} \tag{2}
\]
where Y is the filtered version of the noisy image X_0 obtained with a bilateral filter [Citation34], ν is the exterior normal and X(T,x) is the restored image at time T, with
\[
TV(\lambda)=\int_\Omega |\nabla\lambda|,
\]
and β the regularization parameter. Here \nabla^2 denotes the Hessian operator, D is an anisotropic diffusion tensor and J_{\rho\sigma} is the structure tensor defined by
\[
J_{\rho\sigma}(X)=K_\rho * \big(\nabla X_\sigma\otimes\nabla X_\sigma\big), \tag{3}
\]
where X_\sigma is obtained by convolving X with a Gaussian kernel.
Here K_\rho and K_\sigma denote two Gaussian convolution kernels,
\[
K_\tau(x)=\frac{1}{2\pi\tau^2}\exp\Big(-\frac{|x|^2}{2\tau^2}\Big).
\]
The tensor D is built from the eigenvalues and eigenvectors of J_{\rho\sigma} as follows:
\[
D:=\varphi_+(\upsilon_+,\upsilon_-)\,\theta_+\theta_+^T+\varphi_-(\upsilon_+,\upsilon_-)\,\theta_-\theta_-^T
=(\theta_+\;\theta_-)\begin{pmatrix}\varphi_+(\upsilon_+,\upsilon_-)&0\\0&\varphi_-(\upsilon_+,\upsilon_-)\end{pmatrix}(\theta_+\;\theta_-)^T, \tag{4}
\]
where \upsilon_{+/-} and \theta_{+/-} are respectively the eigenvalues and the eigenvectors of the structure tensor J_{\rho\sigma}; the eigenvalues \upsilon_{+/-} are given by
\[
\upsilon_{+/-}=\frac{1}{2}\Big(\operatorname{trace}(J_{\rho\sigma})\pm\sqrt{\operatorname{trace}^2(J_{\rho\sigma})-4\det(J_{\rho\sigma})}\Big). \tag{5}
\]
The functions \varphi_+ and \varphi_- control the isotropic or anisotropic diffusion on the image regions. Recently, an efficient choice of these functions was introduced in [Citation35], where the authors consider the behaviour of the Weickert model along the two directions \theta_+ and \theta_-. The proposed coefficients take into account the diffusion in the vicinity of contours and corners, where the eigenvalues \upsilon_+ and \upsilon_- are very high. This choice reads
\[
\varphi_+(\upsilon_+,\upsilon_-)=\exp\Big(-\frac{\upsilon_+}{k_1}\Big),\qquad
\varphi_-(\upsilon_+,\upsilon_-)=\exp\Big(-\frac{\upsilon_-}{k_2}\Big)\Big(1-\exp\Big(-\frac{\upsilon_+}{k_1}\Big)\Big), \tag{6}
\]
where k_1 and k_2 are two thresholds defining the diffusion along the directions \theta_+ and \theta_-, respectively.
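As a concrete illustration of the two formulas above, the eigenvalues in (5) and the diffusion coefficients in (6) can be evaluated pointwise from the entries of the 2×2 structure tensor. The following NumPy sketch is a minimal illustration only; the threshold values k1 and k2 are hypothetical placeholders, not the authors' settings.

```python
import numpy as np

def eigenvalues_2x2(j11, j12, j22):
    """Eigenvalues of a symmetric 2x2 structure tensor, Eq. (5):
    v_{+/-} = (trace +/- sqrt(trace^2 - 4 det)) / 2."""
    trace = j11 + j22
    det = j11 * j22 - j12 ** 2
    disc = np.sqrt(np.maximum(trace ** 2 - 4.0 * det, 0.0))  # clip rounding noise
    return 0.5 * (trace + disc), 0.5 * (trace - disc)

def diffusion_coefficients(v_plus, v_minus, k1=1.0, k2=1.0):
    """Weickert-type diffusion coefficients, Eq. (6); k1, k2 are illustrative."""
    phi_plus = np.exp(-v_plus / k1)
    phi_minus = np.exp(-v_minus / k2) * (1.0 - np.exp(-v_plus / k1))
    return phi_plus, phi_minus

vp, vm = eigenvalues_2x2(0.0, 0.0, 0.0)        # flat region: both eigenvalues 0
phi_p, phi_m = diffusion_coefficients(vp, vm)  # phi_+ = 1 (full diffusion), phi_- = 0
```

Near an edge, υ₊ becomes large, so φ₊ decays and the diffusion along θ₊ is inhibited, which is exactly the behaviour described above.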

We recall that λ is the weighting parameter to be estimated in (Equation1) and U_{ad} is the set of admissible functions defined by
\[
U_{ad}=\big\{\lambda\in L^1(\Omega):\ 0<\underline{\lambda}\le\lambda\le\bar{\lambda}<1 \ \text{a.e. } x\in\Omega \ \text{and}\ TV(\lambda)\le C_0\big\},
\]
where \underline{\lambda}, \bar{\lambda} and C_0 are positive constants.
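Numerically, membership in U_ad reduces to a pointwise box constraint plus a bound on the discrete total variation. A minimal NumPy check, assuming a forward-difference discretization of TV and illustrative bounds (not the paper's implementation), might look as follows:

```python
import numpy as np

def tv_discrete(lam):
    """Isotropic discrete TV of a 2-D array using forward differences
    (zero difference on the last row/column)."""
    dx = np.diff(lam, axis=0, append=lam[-1:, :])
    dy = np.diff(lam, axis=1, append=lam[:, -1:])
    return np.sum(np.sqrt(dx ** 2 + dy ** 2))

def in_admissible_set(lam, lam_lo, lam_hi, c0):
    """Check the two constraints defining U_ad: the box constraint
    lam_lo <= lam <= lam_hi and the TV bound TV(lam) <= c0."""
    box_ok = np.all((lam >= lam_lo) & (lam <= lam_hi))
    return box_ok and tv_discrete(lam) <= c0

lam = np.full((4, 4), 0.5)                     # constant field: TV = 0
print(in_admissible_set(lam, 1e-3, 0.8, 1.0))  # True
```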

Before resolving the control problem (Equation1), we have to check the existence and uniqueness of a solution of the state problem (Equation2). In order to establish the existence of the solution X to the problem (Equation2), we introduce the following set of hypotheses:

H1

– The tensor-valued map D\in\mathcal{C}(\mathbb{R}^{2\times2},\mathbb{R}^{2\times2}) is coercive and positive definite.

H2

– The initial condition satisfies X_0\in L^\infty(\Omega).

The use of the proposed PDE constraint with the right weighting parameter identification can compete with the well-known total generalized variation denoising method [Citation36,Citation37] in image reconstruction quality. Such a problem is considered within an optimization framework, which is challenging and generally difficult to solve, since:

  • The weighting parameter λ depends on the nature of the data, the noise level and type.

  • The observed noisy image depends nonlinearly on the noise.

  • The computational cost is large, due to the simultaneous determination of the denoised image and the weighting parameter in the presence of several first- and higher-order operators.

Our contribution is thus to take these difficulties into consideration and provide a constrained denoising model that increases the quality of the restored image and estimates the weighting parameter λ. Moreover, inspired by the success of the Primal–Dual algorithm in accelerating the resolution of convex optimization problems [Citation38–41], we propose a new Primal–Dual algorithm adapted to PDE-constrained problems.

The rest of this paper is organized as follows: in Section 2, the variational formulation and a priori estimates are given. Then, the existence and uniqueness results for the state problem and the optimal control problem are given in Sections 3 and 4, respectively. Section 5 presents the suggested Primal–Dual algorithm. Finally, in Section 6, we illustrate the benefits of the proposed approach through comparative experiments with some state-of-the-art denoising methods.

2. Variational formulation and a priori estimates

In this section, we give the variational formulation of the problem (Equation2) and a priori estimates of the solution X. The variational formulation of the problem (Equation2) is stated as follows:
\[
\begin{aligned}
&\text{Find } X\in L^2(0,T;H_0^2(\Omega)) \text{ with } \frac{\partial X}{\partial t}\in L^2(0,T;H^{-2}(\Omega)) \text{ such that, for all } \phi\in H_0^2(\Omega),\\
&\Big\langle \frac{\partial X}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}
+\int_\Omega \lambda\,D(J_{\rho\sigma}(X))\,\nabla^2X:\nabla^2\phi\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(X))\,\nabla X\cdot\nabla\phi\,dx=0.
\end{aligned} \tag{7}
\]
For more details about the space H_0^2(\Omega), see Appendix 2.

In order to show the existence of a solution to the problem (Equation7), we use the Schauder fixed point theorem [Citation42]. For this reason, we need a priori estimates of the solution to the problem (Equation7) in L^2(0,T;H_0^2(\Omega)).

Lemma 2.1

Assume that assumptions (H1)–(H2) are satisfied. Then there exist three constants C_i>0, i=1,2,3, such that the weak solution of the problem (Equation7) satisfies the following a priori estimates:
\[
\|X\|_{L^\infty(0,T;L^2(\Omega))}\le C_1, \tag{8}
\]
\[
\|X\|_{L^2(0,T;H_0^2(\Omega))}\le C_2, \tag{9}
\]
\[
\Big\|\frac{\partial X}{\partial t}\Big\|_{L^2(0,T;H^{-2}(\Omega))}\le C_3. \tag{10}
\]

Proof.

Consider \tau\in(0,T]; then for t\in(0,\tau) we take \phi=X(t) in the variational formulation (Equation7) and obtain
\[
\frac{1}{2}\frac{d}{dt}\|X(t)\|_{L^2(\Omega)}^2
+\int_\Omega \lambda\,D(J_{\rho\sigma}(X))|\nabla^2X|^2\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(X))|\nabla X|^2\,dx=0.
\]
Integrating with respect to t over (0,\tau),
\[
\frac{1}{2}\|X(\tau)\|_{L^2(\Omega)}^2
+\int_0^\tau\!\!\int_\Omega \lambda\,D(J_{\rho\sigma}(X))|\nabla^2X|^2\,dx\,dt
+\int_0^\tau\!\!\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(X))|\nabla X|^2\,dx\,dt
=\frac{1}{2}\|X(0)\|_{L^2(\Omega)}^2. \tag{11}
\]
Since \lambda\in U_{ad}, we find
\[
\frac{1}{2}\|X(\tau)\|_{L^2(\Omega)}^2
+\underline{\lambda}\int_0^\tau\!\!\int_\Omega D(J_{\rho\sigma}(X))|\nabla^2X|^2\,dx\,dt
+(1-\bar{\lambda})\int_0^\tau\!\!\int_\Omega D(J_{\rho\sigma}(X))|\nabla X|^2\,dx\,dt
\le\frac{1}{2}\|X(0)\|_{L^2(\Omega)}^2.
\]
On the other hand, D(J_{\rho\sigma}) is coercive with coercivity constant \alpha, so
\[
\frac{1}{2}\|X(\tau)\|_{L^2(\Omega)}^2
+\alpha\underline{\lambda}\int_0^\tau\!\!\int_\Omega |\nabla^2X|^2\,dx\,dt
+\alpha(1-\bar{\lambda})\int_0^\tau\!\!\int_\Omega |\nabla X|^2\,dx\,dt
\le\frac{1}{2}\|X(0)\|_{L^2(\Omega)}^2,
\]
which implies
\[
\|X(\tau)\|_{L^2(\Omega)}^2\le C_1,\qquad \forall\,\tau\in[0,T],
\]
where the constant C_1=\|X(0)\|_{L^2(\Omega)}^2. Consequently, we have
\[
\|X\|_{L^\infty(0,T;L^2(\Omega))}^2\le C_1,\qquad
\|X\|_{L^2(0,T;L^2(\Omega))}^2\le T\,\|X\|_{L^\infty(0,T;L^2(\Omega))}^2. \tag{12}
\]
We also have
\[
\min\Big(\frac{1}{2T},\,\alpha\underline{\lambda},\,(1-\bar{\lambda})\alpha\Big)\|X\|_{L^2(0,T;H^2(\Omega))}^2\le C_1. \tag{13}
\]
Since \|X\|_{L^2(0,T;H_0^2(\Omega))}^2\le\|X\|_{L^2(0,T;H^2(\Omega))}^2, we get \|X\|_{L^2(0,T;H_0^2(\Omega))}^2\le C_2, with
\[
C_2=\frac{C_1}{\min\big(\frac{1}{2T},\,\alpha\underline{\lambda},\,(1-\bar{\lambda})\alpha\big)}.
\]
Let us now prove the estimate on \partial X/\partial t. From the variational formulation (Equation7), we have
\[
\Big|\Big\langle \frac{\partial X}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}\Big|
=\Big|\int_\Omega \lambda\,D(J_{\rho\sigma}(X))\nabla^2X:\nabla^2\phi\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(X))\nabla X\cdot\nabla\phi\,dx\Big|,\qquad \forall\phi\in L^2(0,T;H_0^2(\Omega)). \tag{14}
\]
Using the Hölder inequality and the fact that D(J_{\rho\sigma}(X)) is bounded in L^\infty(0,T;\mathcal{C}(\bar{\Omega})), we find
\[
\Big|\Big\langle \frac{\partial X}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}\Big|
\le \bar{\lambda}\,C\,\|\nabla^2X\|_{L^2(\Omega)}\|\nabla^2\phi\|_{L^2(\Omega)}
+(1-\underline{\lambda})\,C\,\|\nabla X\|_{L^2(\Omega)}\|\nabla\phi\|_{L^2(\Omega)}. \tag{15}
\]
Since \|\nabla^2\phi\|_{L^2(\Omega)}\le\|\phi\|_{H_0^2(\Omega)} and \|\nabla\phi\|_{L^2(\Omega)}\le\|\phi\|_{H_0^2(\Omega)}, we obtain
\[
\sup_{\phi\in H_0^2(\Omega)}\frac{\big|\langle \partial X/\partial t,\phi\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}\big|}{\|\phi\|_{H_0^2(\Omega)}}
\le \bar{\lambda}\,C\,\|\nabla^2X\|_{L^2(\Omega)}+(1-\underline{\lambda})\,C\,\|\nabla X\|_{L^2(\Omega)}. \tag{16}
\]
Using the norm equivalence and the Poincaré inequality, we have \|\nabla^2X\|_{L^2(\Omega)}\le\|X\|_{H^2(\Omega)}\le C\|X\|_{H_0^2(\Omega)} (see Appendix 2) and \|\nabla X\|_{L^2(\Omega)}\le C\|X\|_{H_0^2(\Omega)}; integrating over t\in(0,T], we obtain
\[
\Big\|\frac{\partial X}{\partial t}\Big\|_{L^2(0,T;H^{-2}(\Omega))}\le C\,C_2=C_3. \tag{17}
\]
This completes the proof.

3. Existence and uniqueness of the state problem

In this section, based on the above estimates, we show the existence and uniqueness of the solution of the problem (Equation7) for a fixed \lambda\in U_{ad}. The main difficulty in this study comes from the strong nonlinearity of the conductivity part of Equation (Equation2). To overcome it, we use the Schauder fixed point theorem [Citation42]; the idea of the proof is inspired by the paper [Citation43]. In order to prove the existence of a solution to the problem (Equation7), we first prove some lemmas that will be useful for the existence theorem. Let us define the operator G by
\[
G:\ V\to V,\qquad \bar{X}\mapsto X, \tag{18}
\]
where X is the unique solution of
\[
\begin{aligned}
&X\in L^2(0,T;H_0^2(\Omega)) \text{ with } \frac{\partial X}{\partial t}\in L^2(0,T;H^{-2}(\Omega)) \text{ such that}\\
&\Big\langle \frac{\partial X}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}
+\int_\Omega \lambda\,D(J_{\rho\sigma}(\bar{X}))\nabla^2X:\nabla^2\phi\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(\bar{X}))\nabla X\cdot\nabla\phi\,dx=0,\quad \forall\phi\in H_0^2(\Omega),
\end{aligned} \tag{19}
\]
and
\[
V=\Big\{U\in L^2(0,T;H_0^1(\Omega))\cap L^\infty(0,T;L^2(\Omega)):\ \|U\|_{L^\infty(0,T;L^2(\Omega))}\le C_1,\ \|U\|_{L^2(0,T;H_0^2(\Omega))}\le C_2 \text{ and } \Big\|\frac{\partial U}{\partial t}\Big\|_{L^2(0,T;H^{-2}(\Omega))}\le C_3\Big\}. \tag{20}
\]
The operator G is well defined thanks to two facts. First, the existence of the solution of the problem (Equation19) follows from the parabolic analogue of the Lax–Milgram theorem, known as Lions' theorem; this theorem is stated in the comments of Chapter 10 of [Citation44] (see also pages 509–510 of the book [Citation45]) and in the parabolic part of the book [Citation46]. Second, using the same techniques as in the proof of Lemma 2.1, it is easy to prove that X\in V for all \bar{X}\in V.

To prove that G admits a unique fixed point X, which is the solution of (Equation7), we have to show that G is continuous and compact. The following lemma shows the continuity of the operator G.

Lemma 3.1

The operator G is continuous in V.

Proof.

In order to prove the continuity of the map G, let us consider a sequence (\bar{X}_n)_n in L^2(0,T;H_0^1(\Omega))\cap L^\infty(0,T;L^2(\Omega)) such that
\[
\bar{X}_n\xrightarrow[n\to\infty]{}\bar{X}\quad\text{in } L^2(0,T;H_0^1(\Omega))\cap L^\infty(0,T;L^2(\Omega)); \tag{21}
\]
we have to prove that
\[
X_n=G(\bar{X}_n)\xrightarrow[n\to\infty]{}X=G(\bar{X})\quad\text{in } L^2(0,T;H_0^1(\Omega))\cap L^\infty(0,T;L^2(\Omega)). \tag{22}
\]
Let X_n (respectively X) be the unique solution associated with \bar{X}_n (respectively \bar{X}) in the formulation (Equation19), which means
\[
\Big\langle \frac{\partial X_n}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}
+\int_\Omega \lambda\,D(J_{\rho\sigma}(\bar{X}_n))\nabla^2X_n:\nabla^2\phi\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(\bar{X}_n))\nabla X_n\cdot\nabla\phi\,dx=0, \tag{23}
\]
\[
\Big\langle \frac{\partial X}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}
+\int_\Omega \lambda\,D(J_{\rho\sigma}(\bar{X}))\nabla^2X:\nabla^2\phi\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(\bar{X}))\nabla X\cdot\nabla\phi\,dx=0, \tag{24}
\]
for all \phi\in L^2(0,T;H_0^2(\Omega)). Using Equations (Equation23) and (Equation24), and taking \phi=X_n-X, we find
\[
\begin{aligned}
&\Big\langle \frac{\partial (X_n-X)}{\partial t},X_n-X\Big\rangle
+\int_\Omega \lambda\,D(J_{\rho\sigma}(\bar{X}))\,\nabla^2(X_n-X):\nabla^2(X_n-X)\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(\bar{X}))\,\nabla(X_n-X)\cdot\nabla(X_n-X)\,dx\\
&\quad=\int_\Omega \lambda\,\big(D(J_{\rho\sigma}(\bar{X}))-D(J_{\rho\sigma}(\bar{X}_n))\big)\nabla^2X_n:\nabla^2(X_n-X)\,dx
+\int_\Omega (1-\lambda)\big(D(J_{\rho\sigma}(\bar{X}))-D(J_{\rho\sigma}(\bar{X}_n))\big)\nabla X_n\cdot\nabla(X_n-X)\,dx.
\end{aligned}
\]
Integrating over t\in(0,\tau], \tau\in(0,T], and using the fact that the operator D(J_{\rho\sigma}) is smooth enough, we have
\[
\|D(J_{\rho\sigma}(\bar{X}_n))-D(J_{\rho\sigma}(\bar{X}))\|_{L^\infty(0,\tau;L^\infty(\Omega))}\le C\,\|\bar{X}_n-\bar{X}\|_{L^\infty(0,\tau;L^2(\Omega))}. \tag{25}
\]
Using the coercivity of D and the Hölder inequality, we obtain
\[
\begin{aligned}
&\frac{1}{2}\|X_n(\tau)-X(\tau)\|_{L^2(\Omega)}^2
+\underline{\lambda}\alpha\,\|\nabla^2(X_n-X)\|_{L^2(0,\tau;L^2(\Omega))}^2
+(1-\bar{\lambda})\alpha\,\|\nabla(X_n-X)\|_{L^2(0,\tau;L^2(\Omega))}^2\\
&\quad\le \bar{\lambda}\,C\,\|\bar{X}_n-\bar{X}\|_{L^\infty(0,\tau;L^2(\Omega))}\,\|\nabla^2X_n\|_{L^2(0,T;L^2(\Omega))}\,\|\nabla^2(X_n-X)\|_{L^2(0,\tau;L^2(\Omega))}\\
&\qquad+(1-\underline{\lambda})\,C\,\|\bar{X}_n-\bar{X}\|_{L^\infty(0,\tau;L^2(\Omega))}\,\|\nabla X_n\|_{L^2(0,T;L^2(\Omega))}\,\|\nabla(X_n-X)\|_{L^2(0,\tau;L^2(\Omega))}.
\end{aligned} \tag{26–27}
\]
Using the Young inequality on the right-hand side, we find
\[
\begin{aligned}
&\frac{1}{2}\|X_n(\tau)-X(\tau)\|_{L^2(\Omega)}^2
+\Big(\underline{\lambda}\alpha-\frac{\epsilon_1 C\bar{\lambda}}{2}\Big)\|\nabla^2(X_n-X)\|_{L^2(0,\tau;L^2(\Omega))}^2
+\Big((1-\bar{\lambda})\alpha-\frac{\epsilon_1 C(1-\underline{\lambda})}{2}\Big)\|\nabla(X_n-X)\|_{L^2(0,\tau;L^2(\Omega))}^2\\
&\quad\le \frac{\bar{\lambda}C}{2\epsilon_1}\|\bar{X}_n-\bar{X}\|_{L^\infty(0,\tau;L^2(\Omega))}^2\|\nabla^2X_n\|_{L^2(0,T;L^2(\Omega))}^2
+\frac{(1-\underline{\lambda})C}{2\epsilon_1}\|\bar{X}_n-\bar{X}\|_{L^\infty(0,\tau;L^2(\Omega))}^2\|\nabla X_n\|_{L^2(0,T;L^2(\Omega))}^2,
\end{aligned} \tag{28}
\]
and, using the Poincaré inequality,
\[
\|\nabla^2X_n\|_{L^2(0,T;L^2(\Omega))}^2\le C\|X_n\|_{L^2(0,T;H_0^2(\Omega))}^2,\qquad
\|\nabla X_n\|_{L^2(0,T;L^2(\Omega))}^2\le C\|X_n\|_{L^2(0,T;H_0^2(\Omega))}^2,
\]
we obtain
\[
\begin{aligned}
&\frac{1}{2}\|X_n(\tau)-X(\tau)\|_{L^2(\Omega)}^2
+\Big(\underline{\lambda}\alpha-\frac{\epsilon_1 C\bar{\lambda}}{2}\Big)\|\nabla^2(X_n-X)\|_{L^2(0,\tau;L^2(\Omega))}^2
+\Big((1-\bar{\lambda})\alpha-\frac{\epsilon_1 C(1-\underline{\lambda})}{2}\Big)\|\nabla(X_n-X)\|_{L^2(0,\tau;L^2(\Omega))}^2\\
&\quad\le \Big(\frac{\bar{\lambda}C}{2\epsilon_1}+\frac{(1-\underline{\lambda})C}{2\epsilon_1}\Big)\|\bar{X}_n-\bar{X}\|_{L^\infty(0,\tau;L^2(\Omega))}^2\|X_n\|_{L^2(0,T;H_0^2(\Omega))}^2.
\end{aligned} \tag{29–30}
\]
For \epsilon_1<\min\big(\frac{2(1-\bar{\lambda})\alpha}{C(1-\underline{\lambda})},\frac{2\underline{\lambda}\alpha}{C\bar{\lambda}}\big) and since
\[
\bar{X}_n\to\bar{X}\quad\text{in } L^\infty(0,T;L^2(\Omega)), \tag{31}
\]
we find that
\[
X_n\to X\quad\text{in } L^2(0,T;H_0^2(\Omega))\cap L^\infty(0,T;L^2(\Omega)). \tag{32}
\]
By the equivalence between the norms \|\cdot\|_{H_0^2(\Omega)} and \|\cdot\|_{H^2(\Omega)} (see Appendix 2), we conclude that the operator G is continuous.

Let us now show that G is compact.

Lemma 3.2

The operator G is compact in V.

Proof.

Let (\bar{X}_n)_n be a bounded sequence in L^2(0,T;H_0^1(\Omega))\cap L^\infty(0,T;L^2(\Omega)), and let X_n=G(\bar{X}_n) be the unique solution of (Equation19) associated with \bar{X}_n. Taking \phi=X_n, integrating over t and using the same arguments as in the proof of Lemma 2.1, we have
\[
\frac{1}{2}\|X_n(\tau)\|_{L^2(\Omega)}^2
+\alpha\underline{\lambda}\,\|\nabla^2X_n\|_{L^2(0,\tau;L^2(\Omega))}^2
+(1-\bar{\lambda})\alpha\,\|\nabla X_n\|_{L^2(0,\tau;L^2(\Omega))}^2\le\frac{1}{2}C_1,\qquad \forall\tau\in[0,T]. \tag{33}
\]
We also have
\[
\Big\|\frac{\partial X_n}{\partial t}\Big\|_{L^2(0,\tau;H^{-2}(\Omega))}^2\le C_3. \tag{34}
\]
Now, thanks to the compact embedding of H_0^2(\Omega) in H_0^1(\Omega) and the continuous embedding of H_0^1(\Omega) in H^{-2}(\Omega), the Aubin–Lions lemma [Citation45–47] shows that the space
\[
\Big\{u\in L^2(0,T;H_0^2(\Omega)):\ \frac{\partial u}{\partial t}\in L^2(0,T;H^{-2}(\Omega))\Big\}
\]
is compactly embedded in L^2(0,T;H_0^1(\Omega)). Hence we can extract a subsequence, still denoted (X_n)_n, that converges in L^2(0,T;H_0^1(\Omega)). This implies that the operator G is compact.

The following theorem shows the existence of a weak solution for the problem (Equation7).

Theorem 3.1

The problem (Equation7) admits a unique solution in L2(0,T;H01(Ω)).

Proof.

Existence. The operator G is well defined; from Lemmas 3.1 and 3.2, G is continuous and compact in V, which implies the existence of a fixed point by the Schauder fixed point theorem [Citation42].

Uniqueness. To prove the uniqueness of the solution, we assume that X_1 and X_2 are two distinct solutions of the problem (Equation7). By subtracting the weak formulations satisfied by X_1 and X_2, we obtain
\[
\begin{aligned}
&\Big\langle \frac{\partial X_1}{\partial t}-\frac{\partial X_2}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}
+\int_\Omega \lambda\,\big(D(J_{\rho\sigma}(X_1))\nabla^2X_1-D(J_{\rho\sigma}(X_2))\nabla^2X_2\big):\nabla^2\phi\,dx\\
&\quad+\int_\Omega (1-\lambda)\big(D(J_{\rho\sigma}(X_1))\nabla X_1-D(J_{\rho\sigma}(X_2))\nabla X_2\big)\cdot\nabla\phi\,dx=0. \tag{35}
\end{aligned}
\]
Taking \phi=X_1-X_2, we get
\[
\begin{aligned}
&\Big\langle \frac{\partial X_1}{\partial t}-\frac{\partial X_2}{\partial t},X_1-X_2\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}
+\int_\Omega \lambda\,\big(D(J_{\rho\sigma}(X_1))\nabla^2X_1-D(J_{\rho\sigma}(X_2))\nabla^2X_2\big):\nabla^2(X_1-X_2)\,dx\\
&\quad+\int_\Omega (1-\lambda)\big(D(J_{\rho\sigma}(X_1))\nabla X_1-D(J_{\rho\sigma}(X_2))\nabla X_2\big)\cdot\nabla(X_1-X_2)\,dx=0. \tag{36}
\end{aligned}
\]
Then we have
\[
\begin{aligned}
&\Big\langle \frac{\partial (X_1-X_2)}{\partial t},X_1-X_2\Big\rangle
+\int_\Omega \lambda\,D(J_{\rho\sigma}(X_1))\,|\nabla^2(X_1-X_2)|^2\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(X_1))\,|\nabla(X_1-X_2)|^2\,dx\\
&\quad=\int_\Omega \lambda\,\big(D(J_{\rho\sigma}(X_1))-D(J_{\rho\sigma}(X_2))\big)\nabla^2X_2:\nabla^2(X_2-X_1)\,dx
+\int_\Omega (1-\lambda)\big(D(J_{\rho\sigma}(X_1))-D(J_{\rho\sigma}(X_2))\big)\nabla X_2\cdot\nabla(X_2-X_1)\,dx.
\end{aligned}
\]
By the boundedness of D and the Hölder inequality, we find
\[
\begin{aligned}
&\frac{1}{2}\frac{d}{dt}\|X_1(t)-X_2(t)\|_{L^2(\Omega)}^2
+\alpha\underline{\lambda}\,\|\nabla^2(X_1-X_2)\|_{L^2(\Omega)}^2
+(1-\bar{\lambda})\alpha\,\|\nabla(X_1-X_2)\|_{L^2(\Omega)}^2\\
&\quad\le \bar{\lambda}\,\|D(J_{\rho\sigma}(X_1(t)))-D(J_{\rho\sigma}(X_2(t)))\|_{L^\infty(\Omega)}\,\|\nabla^2X_2(t)\|_{L^2(\Omega)}\,\|\nabla^2(X_1(t)-X_2(t))\|_{L^2(\Omega)}\\
&\qquad+(1-\underline{\lambda})\,\|D(J_{\rho\sigma}(X_1(t)))-D(J_{\rho\sigma}(X_2(t)))\|_{L^\infty(\Omega)}\,\|\nabla X_2(t)\|_{L^2(\Omega)}\,\|\nabla(X_1(t)-X_2(t))\|_{L^2(\Omega)}.
\end{aligned}
\]
Since the operator D(J_\rho) is smooth enough, we have
\[
\|D(J_{\rho\sigma}(X_1(t)))-D(J_{\rho\sigma}(X_2(t)))\|_{L^\infty(\Omega)}\le C\,\|X_1(t)-X_2(t)\|_{L^2(\Omega)},\qquad \forall t\in[0,T]. \tag{37}
\]
Inserting (Equation37) into the previous inequality (this gives (Equation38)) and using the Young inequality, we find
\[
\begin{aligned}
&\frac{1}{2}\frac{d}{dt}\|X_1(t)-X_2(t)\|_{L^2(\Omega)}^2
+\Big(\underline{\lambda}\alpha-\frac{\epsilon_1C\bar{\lambda}}{2}\Big)\|\nabla^2(X_1-X_2)\|_{L^2(\Omega)}^2
+\Big((1-\bar{\lambda})\alpha-\frac{\epsilon_1C(1-\underline{\lambda})}{2}\Big)\|\nabla(X_1-X_2)\|_{L^2(\Omega)}^2\\
&\quad\le \frac{C}{2\epsilon_1}\|X_1(t)-X_2(t)\|_{L^2(\Omega)}^2\big(\bar{\lambda}\,\|\nabla^2X_2(t)\|_{L^2(\Omega)}^2+(1-\underline{\lambda})\,\|\nabla X_2(t)\|_{L^2(\Omega)}^2\big)
\le \frac{C}{2\epsilon_1}\|X_1(t)-X_2(t)\|_{L^2(\Omega)}^2\,\|X_2(t)\|_{H_0^2(\Omega)}^2.
\end{aligned} \tag{39}
\]
By taking \epsilon_1<\min\big(\frac{2(1-\bar{\lambda})\alpha}{C(1-\underline{\lambda})},\frac{2\underline{\lambda}\alpha}{C\bar{\lambda}}\big) and using the Grönwall inequality, we conclude that
\[
\|X_1-X_2\|_{L^\infty(0,T;L^2(\Omega))}^2\le 0. \tag{40}
\]
Finally, we get the uniqueness of the solution.

4. Existence of solution for the optimization problem

To prove the existence of a solution to the optimal control problem (Equation1), we need the following compactness lemma.

Lemma 4.1

Uad is compact for the topology defined by the strong convergence in L1(Ω).

Proof.

Let (\lambda_k)_k be a sequence in U_{ad}. Then we have
\[
\underline{\lambda}\le\lambda_k\le\bar{\lambda}\quad\text{a.e. in }\Omega \tag{41}
\]
and
\[
TV(\lambda_k)\le C_0. \tag{42}
\]
Our aim is to show that we can extract a subsequence, still denoted (\lambda_k)_k, such that \lambda_k\to\lambda in L^1(\Omega) and \lambda\in U_{ad}. From the estimates (Equation41) and (Equation42) we have
\[
\|\lambda_k\|_{BV(\Omega)}\le C_0+\bar{\lambda}\,\mathrm{mes}(\Omega). \tag{43}
\]
Using the compact embedding of BV(\Omega) in L^1(\Omega), we can extract a subsequence, still denoted (\lambda_k)_k, such that \lambda_k\to\lambda in L^1(\Omega), with 0<\underline{\lambda}\le\lambda\le\bar{\lambda}<1 a.e. x\in\Omega. From the estimate (Equation42) we find \nabla\lambda_k\rightharpoonup\nabla\lambda in M_b(\Omega), where M_b(\Omega) is the space of bounded measures; using the Banach–Alaoglu–Bourbaki theorem [Citation44], we find TV(\lambda)\le C_0. This concludes the compactness of U_{ad}.

We now prove the continuity of the map \lambda\mapsto X(\lambda): if (\lambda_k)_k converges to \lambda in U_{ad}, then X_k\equiv X(\lambda_k), the solution of (Equation7) associated with \lambda_k, converges to X\equiv X(\lambda), the solution of (Equation7) associated with \lambda.

Proposition 4.1

Let (\lambda_k)_k be a sequence from U_{ad} that converges in L^1(\Omega) to \lambda\in U_{ad}, and let X_k\equiv X(\lambda_k) be the solution of (Equation7) associated with \lambda_k, k\in\mathbb{N}. Then:

(1)

The sequence (X_k)_k converges weakly in L^2(0,T;H_0^2(\Omega)) to X\equiv X(\lambda), the solution of (Equation7) associated with \lambda.

(2)

The cost functional J satisfies J(\lambda)\le\liminf_{k\to+\infty}J(\lambda_k).

Proof.

  1. Let (\lambda_k)_k be a sequence of U_{ad} and X_k\equiv X(\lambda_k) the solution of (Equation7) associated with \lambda_k, k\in\mathbb{N}, which means
\[
\Big\langle \frac{\partial X_k}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}
+\int_\Omega \lambda_k\,D(J_{\rho\sigma}(X_k))\nabla^2X_k:\nabla^2\phi\,dx
+\int_\Omega (1-\lambda_k)\,D(J_{\rho\sigma}(X_k))\nabla X_k\cdot\nabla\phi\,dx=0,\qquad \forall\phi\in H_0^2(\Omega). \tag{44}
\]
From Lemma 2.1, there exist C_i>0, i\in\{1,2,3\}, independent of k, such that
\[
\|X_k\|_{L^\infty(0,T;L^2(\Omega))}\le C_1, \tag{45}
\]
\[
\|X_k\|_{L^2(0,T;H_0^2(\Omega))}\le C_2, \tag{46}
\]
\[
\Big\|\frac{\partial X_k}{\partial t}\Big\|_{L^2(0,T;H^{-2}(\Omega))}\le C_3. \tag{47}
\]
Since (\lambda_k)_k is a sequence of U_{ad}, using the compactness of U_{ad} obtained in Lemma 4.1, we can extract a subsequence, still denoted (\lambda_k)_k, and a \lambda such that
\[
\lambda_k\xrightarrow[k\to\infty]{}\lambda\quad\text{in } L^1(\Omega). \tag{48}
\]
Also, from the boundedness of (X_k)_k in L^2(0,T;H_0^2(\Omega)) and of (\partial X_k/\partial t)_k in L^2(0,T;H^{-2}(\Omega)), we can extract a subsequence, still denoted (X_k)_k, such that
\[
X_k\rightharpoonup X\quad\text{in } L^2(0,T;H_0^2(\Omega)) \tag{49}
\]
and
\[
\frac{\partial X_k}{\partial t}\rightharpoonup\frac{\partial X}{\partial t}\quad\text{in } L^2(0,T;H^{-2}(\Omega)). \tag{50}
\]
By the Aubin–Lions lemma [Citation45–47], we have
\[
X_k\to X\quad\text{in } L^2(0,T;H^1(\Omega)) \tag{51}
\]
and
\[
X_k\to X\quad\text{in } \mathcal{C}([0,T];H^{-1}(\Omega)). \tag{52}
\]
Thanks to the compact embedding of H_0^1(\Omega) in L^2(\Omega) and the continuous embedding of L^2(\Omega) in H^{-2}(\Omega), and since (X_k)_k is bounded in L^\infty(0,T;H_0^1(\Omega)) and (\partial X_k/\partial t)_k in L^2(0,T;H^{-2}(\Omega)), the Aubin–Simon lemma [Citation48] gives
\[
X_k\to X\quad\text{in } \mathcal{C}([0,T];L^2(\Omega)). \tag{53}
\]
Let us now prove that X is the solution of (Equation7). It suffices to prove that the following convergences hold:
\[
\int_0^T\Big\langle \frac{\partial X_k}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}dt
\xrightarrow[k\to\infty]{}\int_0^T\Big\langle \frac{\partial X}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}dt, \tag{54}
\]
\[
\int_0^T\!\!\int_\Omega \lambda_k\,D(J_{\rho\sigma}(X_k))\nabla^2X_k:\nabla^2\phi\,dx\,dt
\xrightarrow[k\to\infty]{}\int_0^T\!\!\int_\Omega \lambda\,D(J_{\rho\sigma}(X))\nabla^2X:\nabla^2\phi\,dx\,dt \tag{55}
\]
and
\[
\int_0^T\!\!\int_\Omega (1-\lambda_k)\,D(J_{\rho\sigma}(X_k))\nabla X_k\cdot\nabla\phi\,dx\,dt
\xrightarrow[k\to\infty]{}\int_0^T\!\!\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(X))\nabla X\cdot\nabla\phi\,dx\,dt. \tag{56}
\]
The convergence (Equation54) follows directly from (Equation50). The convergences (Equation55) and (Equation56) are proved in the same way, so we show only (Equation55). Consider
\[
I=\int_0^T\!\!\int_\Omega \lambda_k\,D(J_{\rho\sigma}(X_k))\nabla^2X_k:\nabla^2\phi\,dx\,dt
-\int_0^T\!\!\int_\Omega \lambda\,D(J_{\rho\sigma}(X))\nabla^2X:\nabla^2\phi\,dx\,dt,
\]
with the decomposition I=I_1+I_2+I_3, where
\[
I_1=\int_0^T\!\!\int_\Omega (\lambda_k-\lambda)\,D(J_{\rho\sigma}(X_k))\nabla^2X_k:\nabla^2\phi\,dx\,dt,
\]
\[
I_2=\int_0^T\!\!\int_\Omega \lambda\,\big(D(J_{\rho\sigma}(X_k))-D(J_{\rho\sigma}(X))\big)\nabla^2X_k:\nabla^2\phi\,dx\,dt
\]
and
\[
I_3=\int_0^T\!\!\int_\Omega \lambda\,D(J_{\rho\sigma}(X))\,\nabla^2(X_k-X):\nabla^2\phi\,dx\,dt.
\]
Let us show that I_1\to 0 as k\to\infty.

    We have
\[
|I_1|\le\int_0^T\!\!\int_\Omega |\lambda_k-\lambda|\,|D(J_{\rho\sigma}(X_k))|\,|\nabla^2X_k|\,|\nabla^2\phi|\,dx\,dt;
\]
using the Hölder inequality and the properties of D(J_{\rho\sigma}(X_k)), we find
\[
|I_1|^2\le\|\nabla^2X_k\|_{L^2(0,T;L^2(\Omega))}^2\,\|D(J_{\rho\sigma}(X_k))\|_{L^\infty(0,T;L^\infty(\Omega))}^2\int_0^T\!\!\int_\Omega (\lambda_k-\lambda)^2|\nabla^2\phi|^2\,dx\,dt.
\]
Since \|D(J_{\rho\sigma}(X_k))\|_{L^\infty(0,T;L^\infty(\Omega))}^2\le C\,\|X_k\|_{L^\infty(0,T;L^2(\Omega))}^2, the estimates (Equation45) and (Equation46) give
\[
|I_1|^2\le C\int_0^T\!\!\int_\Omega (\lambda_k-\lambda)^2|\nabla^2\phi|^2\,dx\,dt.
\]
On the other hand, we know that
\[
\lambda_k\xrightarrow[k\to\infty]{}\lambda\quad\text{in } L^1(\Omega), \tag{57}
\]
so (\lambda_k-\lambda)^2|\nabla^2\phi|^2\to 0 a.e. in \Omega for \phi\in L^2(0,T;H^2(\Omega)); and, \lambda_k and \lambda being bounded,
\[
(\lambda_k-\lambda)^2|\nabla^2\phi|^2\le 4\bar{\lambda}^2|\nabla^2\phi|^2\in L^1(0,T;L^1(\Omega)).
\]
By Lebesgue's dominated convergence theorem, I_1\to 0. Next, we show that I_2\to 0. Using the Hölder inequality and the boundedness of \lambda, we obtain
\[
|I_2|\le\bar{\lambda}\,\|\nabla^2X_k\|_{L^2(0,T;L^2(\Omega))}\,\|D(J_{\rho\sigma}(X_k))-D(J_{\rho\sigma}(X))\|_{L^\infty(0,T;L^\infty(\Omega))}\,\|\nabla^2\phi\|_{L^2(0,T;L^2(\Omega))}.
\]
Since the operator D(J_\rho) is smooth enough, we have
\[
\|D(J_{\rho\sigma}(X_k))-D(J_{\rho\sigma}(X))\|_{L^\infty(0,T;L^\infty(\Omega))}\le C\,\|X_k-X\|_{L^\infty(0,T;L^2(\Omega))}. \tag{58}
\]
Using the estimate (Equation46), we get |I_2|\le C\,\|X_k-X\|_{L^\infty(0,T;L^2(\Omega))}\,\|\nabla^2\phi\|_{L^2(0,T;L^2(\Omega))}, and by the convergence (Equation53), I_2\to 0. Finally, we show that I_3\to 0. The parameter \lambda is bounded in L^\infty(\Omega) and
\[
\|D(J_{\rho\sigma}(X))\|_{L^\infty(0,T;L^\infty(\Omega))}\le C\,\|X\|_{L^\infty(0,T;L^2(\Omega))}\le C. \tag{59}
\]
On the other hand, X_k\rightharpoonup X in L^2(0,T;H_0^2(\Omega)), so I_3\to 0, which means I\to 0.

  2. Let us show that the cost functional J satisfies J(\lambda)\le\liminf_{k\to+\infty}J(\lambda_k). Indeed, we have
\[
X_k\to X\quad\text{in } \mathcal{C}([0,T];L^2(\Omega)), \tag{60}
\]
and therefore
\[
\int_\Omega |X_k(T,x)-Y(x)|\,dx\to\int_\Omega |X(T,x)-Y(x)|\,dx. \tag{61}
\]
Indeed,
\[
\Big|\int_\Omega |X_k(T,x)-Y(x)|\,dx-\int_\Omega |X(T,x)-Y(x)|\,dx\Big|
\le\int_\Omega |X_k(T,x)-X(T,x)|\,dx
\le \mathrm{mes}(\Omega)^{1/2}\,\|X_k(T,\cdot)-X(T,\cdot)\|_{L^2(\Omega)}
\le C\sup_{t\in[0,T]}\|X_k(t,\cdot)-X(t,\cdot)\|_{L^2(\Omega)}
\le C\,\|X_k-X\|_{\mathcal{C}([0,T];L^2(\Omega))}. \tag{62}
\]
Using the convergence (Equation60), we obtain
\[
\int_\Omega |X(T,x)-Y(x)|\,dx\le\liminf_{k\to+\infty}\int_\Omega |X_k(T,x)-Y(x)|\,dx.
\]
Also, TV(\lambda_k)\le C_0, so we can extract a subsequence, still denoted (\lambda_k), such that \nabla\lambda_k\rightharpoonup\nabla\lambda in M_b(\Omega), which means
\[
TV(\lambda)\le\liminf_{k\to+\infty}TV(\lambda_k).
\]
Finally, we obtain
\[
J(\lambda)=\int_\Omega |X(T,x)-Y(x)|\,dx+\beta\,TV(\lambda)
\le\liminf_{k\to+\infty}\int_\Omega |X_k(T,x)-Y(x)|\,dx+\beta\liminf_{k\to+\infty}TV(\lambda_k)
\le\liminf_{k\to+\infty}\Big(\int_\Omega |X_k(T,x)-Y(x)|\,dx+\beta\,TV(\lambda_k)\Big)
\le\liminf_{k\to+\infty}J(\lambda_k). \tag{63}
\]

The following theorem shows the existence of an optimal solution to the problem (Equation1).

Theorem 4.1

The problem (Equation1) admits at least a solution in Uad.

Proof.

Let (\lambda_k)_k be a minimizing sequence of J in U_{ad}, so that
\[
\lim_{k\to\infty}J(\lambda_k,X_k)=\inf_{\lambda\in U_{ad}}J(\lambda,X(\lambda)).
\]
By Lemma 4.1, U_{ad} is compact, so we can extract a subsequence, still denoted (\lambda_k)_k, that converges to \lambda^* in U_{ad}. On the other hand, from Proposition 4.1, J is lower semicontinuous, so
\[
J(\lambda^*,X^*)\le\liminf_{k\to\infty}J(\lambda_k,X_k)=\lim_{k\to\infty}J(\lambda_k,X_k)=\inf_{\lambda\in U_{ad}}J(\lambda,X(\lambda));
\]
we also have \inf_{\lambda\in U_{ad}}J(\lambda,X(\lambda))\le J(\lambda^*,X^*). Then
\[
\lim_{k\to\infty}J(\lambda_k,X_k)=\inf_{\lambda\in U_{ad}}J(\lambda,X(\lambda))=J(\lambda^*,X^*).
\]
This proves that the problem (Equation1) admits a solution in U_{ad}.

5. The proposed algorithm and numerical approximation

In this section, we give the proposed algorithm based on the Primal–Dual algorithm and the numerical approximation of the state and adjoint problems.

5.1. The proposed algorithm

Recently, an extension of the Chambolle–Pock Primal–Dual algorithm (proposed in [Citation38]) to nonsmooth optimization problems involving complex nonlinear operators coming from PDE-constrained optimization was introduced in [Citation49]. Inspired by this work, we propose an efficient Primal–Dual algorithm, well adapted to the evolutionary PDE-constrained problem in which two diffusion operators are involved (which, to our knowledge, has not been treated before). Indeed, we transform the studied PDE into an optimization problem of the following form:
\[
\hat{\lambda}=\arg\min_{\lambda}\big\{F(K(\lambda))+G(\lambda)\big\}, \tag{64}
\]
where F and G are proper, convex and lower semicontinuous functionals and K is a nonlinear operator involving the solution of the PDE in (Equation2). First, we construct the operator S that maps \lambda to the solution X of the PDE at the final time T:
\[
S:\ U_{ad}\to L^2(\Omega),\qquad \lambda\mapsto X(T,x),
\]
where X is the weak solution of
\[
\Big\langle \frac{\partial X}{\partial t},\phi\Big\rangle_{H^{-2}(\Omega),H_0^2(\Omega)}
+\int_\Omega \lambda\,D(J_{\rho\sigma}(X))\nabla^2X:\nabla^2\phi\,dx
+\int_\Omega (1-\lambda)\,D(J_{\rho\sigma}(X))\nabla X\cdot\nabla\phi\,dx=0,\qquad \forall\phi\in H_0^2(\Omega). \tag{65}
\]
Let us transform the problem (Equation1) into the form (Equation64). The problem (Equation1) reads
\[
\begin{aligned}
\hat{\lambda}&=\arg\min_{\lambda\in U_{ad}}\big\{\|S(\lambda)-Y\|_{L^1(\Omega)}+\beta\,\|\nabla\lambda\|_{L^1(\Omega)}\big\}\\
&=\arg\min_{\lambda\in L^1(\Omega)}\big\{\|S(\lambda)-Y\|_{L^1(\Omega)}+\beta\,\|\nabla\lambda\|_{L^1(\Omega)}+i_{U_{ad}}(\lambda)\big\}\\
&=\arg\min_{\lambda\in L^1(\Omega)}\big\{F(K(\lambda))+G(\lambda)\big\},
\end{aligned} \tag{66}
\]
where the functionals F and G are given by
\[
F:\ (L^1(\Omega))^2\to\mathbb{R}\cup\{\infty\},\qquad
g=\begin{pmatrix}g_1\\ g_2\end{pmatrix}\mapsto\|g\|_{L^1(\Omega)}=\|g_1\|_{L^1(\Omega)}+\|g_2\|_{L^1(\Omega)},
\]
and
\[
G:\ U_{ad}\to\mathbb{R}\cup\{\infty\},\qquad
\lambda\mapsto i_{U_{ad}}(\lambda)=\begin{cases}0,&\lambda\in U_{ad},\\ +\infty,&\lambda\notin U_{ad}.\end{cases}
\]
We also define the operators K_1 and K_2 by K_1(\lambda)=S(\lambda)-Y and K_2(\lambda)=\beta\,\nabla\lambda. The operator K is then defined by
\[
K=\begin{pmatrix}K_1\\ K_2\end{pmatrix}, \tag{67}
\]
where K_1 is a nonlinear operator and K_2 is linear. With these notations, we have
\[
F(K(\lambda))=\|K_1(\lambda)\|_{L^1(\Omega)}+\|K_2(\lambda)\|_{L^1(\Omega)}. \tag{68}
\]
Applying the iterations of the Primal–Dual algorithm [Citation38] to our problem (Equation66), we obtain the following iterative scheme:
\[
\begin{cases}
\lambda^{n+1}=(I+\eta\,\partial G)^{-1}\big(\lambda^n-\eta\,K'(\lambda^n)^*\vartheta^n\big),\\
\hat{\lambda}^{n+1}=\lambda^{n+1}+\theta\,(\lambda^{n+1}-\lambda^n),\\
\vartheta^{n+1}=(I+\delta\,\partial F^*)^{-1}\big(\vartheta^n+\delta\,K(\hat{\lambda}^{n+1})\big),
\end{cases} \tag{69}
\]
where F^*:(L^\infty(\Omega))^2\to\mathbb{R}\cup\{\infty\}, the Legendre–Fenchel conjugate of the functional F, is given by
\[
F^*(\vartheta)=\begin{cases}0,&\vartheta_1\in O \text{ and } \vartheta_2\in O,\\ +\infty,&\text{otherwise},\end{cases} \tag{70}
\]
where O=\{Z:\ \|Z\|_{L^\infty(\Omega)}\le 1\}.
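To make the structure of the iterations (69) concrete, the following sketch applies the same three-step pattern (primal resolvent, extrapolation, dual resolvent) to a toy finite-dimensional problem with a linear operator K. It is not the paper's algorithm, where K involves the nonlinear PDE solution operator S; all names, step sizes and the test problem are illustrative.

```python
import numpy as np

def primal_dual_l1(K, b, lo=0.0, hi=1.0, eta=None, delta=None, theta=1.0, iters=500):
    """Chambolle-Pock-type iteration with the three-step structure of Eq. (69),
    applied to the toy problem  min_x ||K x - b||_1 + i_[lo,hi](x)
    with a LINEAR operator K.  Here G is the box indicator (prox = clipping)
    and F = ||. - b||_1, whose conjugate prox is a shifted clip to [-1, 1]."""
    L = np.linalg.norm(K, 2)                 # operator norm of K
    eta = 0.5 / L if eta is None else eta
    delta = 0.5 / L if delta is None else delta
    x = np.zeros(K.shape[1])
    v = np.zeros(K.shape[0])
    for _ in range(iters):
        x_new = np.clip(x - eta * (K.T @ v), lo, hi)          # primal resolvent
        x_bar = x_new + theta * (x_new - x)                   # extrapolation
        v = np.clip(v + delta * (K @ x_bar - b), -1.0, 1.0)   # dual resolvent
        x = x_new
    return x

K = np.eye(3)
b = np.array([0.2, 0.9, 1.5])
print(primal_dual_l1(K, b))   # approximately [0.2, 0.9, 1.0]
```

In the paper's setting, the linear adjoint `K.T` is replaced by the adjoint derivative K′(λ)*, which requires one adjoint PDE solve per iteration.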

In order to state the Primal–Dual algorithm, we have to express the proximal operators (I+\delta\,\partial F^*)^{-1} and (I+\eta\,\partial G)^{-1} (for more details, see [Citation49]). We have
\[
\vartheta=(I+\delta\,\partial F^*)^{-1}(\hat{\vartheta})
=\begin{pmatrix}\mathrm{proj}_O(\hat{\vartheta}_1)\\ \mathrm{proj}_O(\hat{\vartheta}_2)\end{pmatrix},
\qquad\text{where}\qquad
\mathrm{proj}_O(\hat{\vartheta}_i)=\frac{\hat{\vartheta}_i}{\max\big(\|\hat{\vartheta}_i\|_{L^\infty(\Omega)},1\big)},\quad i=1,2, \tag{71}
\]
and
\[
\lambda=(I+\eta\,\partial G)^{-1}(\hat{\lambda})=\mathrm{proj}_{U_{ad}}(\hat{\lambda}). \tag{72}
\]
We also need to define the operator K'(\lambda)^*, the adjoint of the derivative of K, given by
\[
K'(\lambda)^*=\big(K_1'(\lambda)^*,\ K_2^*\big)=\big(S'(\lambda)^*,\ -\beta\,\mathrm{div}\big), \tag{73}
\]
where S'(\lambda)^*:L^2(\Omega)\to (U_{ad})' denotes the adjoint of the Gâteaux derivative of the operator S; it is given by the following expression:
\[
S'(\lambda)^*W=\int_0^T D(J_{\rho\sigma}(X))\,\nabla^2X:\nabla^2P\,dt+\int_0^T D(J_{\rho\sigma}(X))\,\nabla X\cdot\nabla P\,dt, \tag{74}
\]
where W=\vartheta_1 in Algorithm 1 and P is the solution of the following adjoint problem:
\[
\begin{cases}
-\dfrac{\partial P}{\partial t}+\nabla^2\!\cdot\!\big(\lambda\,D(J_{\rho\sigma}(X))\,\nabla^2P\big)-\nabla\!\cdot\!\big((1-\lambda)\,D(J_{\rho\sigma}(X))\,\nabla P\big)\\
\qquad+\lambda\,D'(J_{\rho\sigma}(X))\,\nabla^2X:\nabla^2P+(1-\lambda)\,D'(J_{\rho\sigma}(X))\,\nabla X\cdot\nabla P=0 & \text{in } ]0,T[\times\Omega,\\
\langle D(J_{\rho\sigma}(X))\,\nabla P,\nu\rangle=0 & \text{on } ]0,T[\times\partial\Omega,\\
P(t,x)=0 & \text{on } (0,T)\times\partial\Omega,\\
P(T,x)=W(x) & \text{in } \Omega,
\end{cases} \tag{75}
\]
where D'(J_{\rho\sigma}(X)) denotes the derivative of D(J_{\rho\sigma}(X)) with respect to the state X, computed by the chain rule through the structure tensor J_{\rho\sigma}.
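The two resolvents (71) and (72) are simple pointwise operations. A NumPy sketch follows, assuming that the projection onto U_ad is realized by clipping to the box [λ̲, λ̄]; the TV bound appearing in the definition of U_ad is not enforced in this simplified illustration.

```python
import numpy as np

def prox_Fstar(theta1, theta2):
    """Resolvent (I + delta dF*)^{-1}, Eq. (71): project each dual
    component onto the ball O = {Z : ||Z||_inf <= 1}."""
    def proj_O(z):
        return z / max(np.max(np.abs(z)), 1.0)
    return proj_O(theta1), proj_O(theta2)

def prox_G(lam_hat, lam_lo=1e-3, lam_hi=0.8):
    """Resolvent (I + eta dG)^{-1}, Eq. (72): projection onto U_ad,
    approximated here by pointwise clipping to the box [lam_lo, lam_hi]."""
    return np.clip(lam_hat, lam_lo, lam_hi)

t1, t2 = prox_Fstar(np.array([2.0, -4.0]), np.array([0.5, 0.25]))
# t1 is rescaled into the unit ball: [0.5, -1.0]; t2 is left unchanged.
lam = prox_G(np.array([-0.2, 0.5, 1.3]))   # clipped to [0.001, 0.5, 0.8]
```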

For the proof of the Gâteaux derivative of the operator S and the calculation of S(λ), see Appendix 1.

After introducing all the operators needed, the proposed algorithm is given as follows:

For more details about the convergence of Algorithm 1, see [Citation49].

5.2. Numerical approximation

This part provides all the ingredients needed to implement the Primal–Dual algorithm adapted to the problem (Equation64). We first discretize the time interval [0,T] into M subintervals,
\[
[0,T]=\bigcup_{m=1}^{M}[t_{m-1},t_m],\qquad \Delta t=t_m-t_{m-1}=\frac{T}{M}.
\]
Using a semi-discretization in time of the problem (Equation2) and of the adjoint problem (Equation75), we have, for m=0,\ldots,M-1,
\[
\begin{cases}
\dfrac{X^{m+1}-X^m}{\Delta t}+\nabla^2\!\cdot\!\big(\lambda\,D(J_{\rho\sigma}(X^m))\,\nabla^2X^m\big)-\nabla\!\cdot\!\big((1-\lambda)\,D(J_{\rho\sigma}(X^m))\,\nabla X^m\big)=0 & \text{in }\Omega,\\
\langle D(J_{\rho\sigma}(X^m))\,\nabla X^m,\nu\rangle=0 & \text{on }\partial\Omega,\\
X^m(x)=0 & \text{on }\partial\Omega,\\
X^0(x)=X_0(x) & \text{in }\Omega,
\end{cases} \tag{76}
\]
and, for m=M,\ldots,1,
\[
\begin{cases}
-\dfrac{P^m-P^{m-1}}{\Delta t}+\nabla^2\!\cdot\!\big(\lambda\,D(J_{\rho\sigma}(X^m))\,\nabla^2P^m\big)-\nabla\!\cdot\!\big((1-\lambda)\,D(J_{\rho\sigma}(X^m))\,\nabla P^m\big)\\
\qquad+\lambda\,D'(J_{\rho\sigma}(X^m))\,\nabla^2X^m:\nabla^2P^m+(1-\lambda)\,D'(J_{\rho\sigma}(X^m))\,\nabla X^m\cdot\nabla P^m=0 & \text{in }\Omega,\\
\langle D(J_{\rho\sigma}(X^m))\,\nabla P^m,\nu\rangle=0 & \text{on }\partial\Omega,\\
P^m(x)=0 & \text{on }\partial\Omega,\\
P^M(x)=W(x) & \text{in }\Omega.
\end{cases} \tag{77}
\]
For the discrete gradient \nabla X=((\nabla X)^1,(\nabla X)^2), where X_{i,j}, i=1,\ldots,M_1, j=1,\ldots,M_2, is the discrete image and \mathbb{R}^{M_1\times M_2} the set of discrete images, we use forward differences:
\[
(\nabla X)^1_{i,j}=\begin{cases}X_{i+1,j}-X_{i,j}&\text{if } i<M_1,\\ 0&\text{if } i=M_1,\end{cases}
\qquad
(\nabla X)^2_{i,j}=\begin{cases}X_{i,j+1}-X_{i,j}&\text{if } j<M_2,\\ 0&\text{if } j=M_2.\end{cases} \tag{78}
\]
We also need the discretization of the adjoint of the gradient, the divergence \mathrm{div}:(\mathbb{R}^{M_1\times M_2})^2\to\mathbb{R}^{M_1\times M_2}, satisfying the duality relation
\[
\langle \mathrm{div}\,Y,\, X\rangle=-\langle Y,\,\nabla X\rangle,\qquad \forall X\in\mathbb{R}^{M_1\times M_2},\ \forall Y\in(\mathbb{R}^{M_1\times M_2})^2, \tag{79}
\]
with (\mathrm{div}(Y^1,Y^2))_{i,j}=(\mathrm{div}(Y^1,Y^2))^1_{i,j}+(\mathrm{div}(Y^1,Y^2))^2_{i,j} and
\[
(\mathrm{div}(Y^1,Y^2))^1_{i,j}=\begin{cases}Y^1_{i,j}-Y^1_{i-1,j}&\text{if } 1<i<M_1,\\ Y^1_{i,j}&\text{if } i=1,\\ -Y^1_{i-1,j}&\text{if } i=M_1,\end{cases}
\qquad
(\mathrm{div}(Y^1,Y^2))^2_{i,j}=\begin{cases}Y^2_{i,j}-Y^2_{i,j-1}&\text{if } 1<j<M_2,\\ Y^2_{i,j}&\text{if } j=1,\\ -Y^2_{i,j-1}&\text{if } j=M_2.\end{cases} \tag{80}
\]
We also define the second-order differential operator
\[
\nabla^2=\begin{pmatrix}\partial_{xx}&\partial_{xy}\\ \partial_{xy}&\partial_{yy}\end{pmatrix},
\]
where \partial_{xx}, \partial_{yy} and \partial_{xy} are given in discrete form by
\[
(\partial_{xx}X)_{i,j}=\begin{cases}X_{i,M_2}-2X_{i,1}+X_{i,2}&\text{if } 1\le i\le M_1,\ j=1,\\ X_{i,j-1}-2X_{i,j}+X_{i,j+1}&\text{if } 1\le i\le M_1,\ 1<j<M_2,\\ X_{i,M_2-1}-2X_{i,M_2}+X_{i,1}&\text{if } 1\le i\le M_1,\ j=M_2,\end{cases} \tag{81}
\]
\[
(\partial_{yy}X)_{i,j}=\begin{cases}X_{M_1,j}-2X_{1,j}+X_{2,j}&\text{if } i=1,\ 1\le j\le M_2,\\ X_{i-1,j}-2X_{i,j}+X_{i+1,j}&\text{if } 1<i<M_1,\ 1\le j\le M_2,\\ X_{M_1-1,j}-2X_{M_1,j}+X_{1,j}&\text{if } i=M_1,\ 1\le j\le M_2,\end{cases} \tag{82}
\]
and
\[
(\partial_{xy}X)_{i,j}=\begin{cases}X_{i,j}-X_{i+1,j}-X_{i,j+1}+X_{i+1,j+1}&\text{if } 1\le i<M_1,\ 1\le j<M_2,\\ X_{i,M_2}-X_{i+1,M_2}-X_{i,1}+X_{i+1,1}&\text{if } 1\le i<M_1,\ j=M_2,\\ X_{M_1,j}-X_{1,j}-X_{M_1,j+1}+X_{1,j+1}&\text{if } i=M_1,\ 1\le j<M_2,\\ X_{M_1,M_2}-X_{1,M_2}-X_{M_1,1}+X_{1,1}&\text{if } i=M_1,\ j=M_2.\end{cases} \tag{83}
\]
For the approximation of D'(J_{\rho\sigma}(X)), we use the difference quotient
\[
\big(J_{\rho\sigma}(X_{i,j})\big)'\approx\frac{J_{\rho\sigma}(X_{i+1,j})-J_{\rho\sigma}(X_{i,j})}{X_{i+1,j}-X_{i,j}}
\quad\text{or}\quad
\big(J_{\rho\sigma}(X_{i,j})\big)'\approx\frac{J_{\rho\sigma}(X_{i,j+1})-J_{\rho\sigma}(X_{i,j})}{X_{i,j+1}-X_{i,j}},
\]
for i=1,\ldots,M_1-1 and j=1,\ldots,M_2-1.
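The discrete gradient (78) and divergence (80) must satisfy the duality relation (79) exactly, and this is easy to verify numerically. The following NumPy sketch (an independent re-implementation for illustration, not the paper's Matlab code) implements both operators and checks the relation on random data:

```python
import numpy as np

def grad(X):
    """Forward-difference gradient, Eq. (78): zero at the last row/column."""
    g1 = np.zeros_like(X)
    g2 = np.zeros_like(X)
    g1[:-1, :] = X[1:, :] - X[:-1, :]
    g2[:, :-1] = X[:, 1:] - X[:, :-1]
    return g1, g2

def div(Y1, Y2):
    """Discrete divergence, Eq. (80): the negative adjoint of grad."""
    d1 = np.zeros_like(Y1)
    d2 = np.zeros_like(Y2)
    d1[0, :] = Y1[0, :]
    d1[1:-1, :] = Y1[1:-1, :] - Y1[:-2, :]
    d1[-1, :] = -Y1[-2, :]
    d2[:, 0] = Y2[:, 0]
    d2[:, 1:-1] = Y2[:, 1:-1] - Y2[:, :-2]
    d2[:, -1] = -Y2[:, -2]
    return d1 + d2

# Verify the duality relation (79): <div Y, X> = -<Y, grad X>.
rng = np.random.default_rng(0)
X = rng.standard_normal((6, 5))
Y1, Y2 = rng.standard_normal((6, 5)), rng.standard_normal((6, 5))
g1, g2 = grad(X)
lhs = np.sum(div(Y1, Y2) * X)
rhs = -(np.sum(Y1 * g1) + np.sum(Y2 * g2))
print(abs(lhs - rhs) < 1e-10)  # True
```

Getting this adjoint pair exactly consistent matters: the dual update in Algorithm 1 uses div through K′(λ)*, and a mismatched discrete adjoint would silently bias the computed gradient.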

6. Numerical results

In this section, we present numerous experimental results to show the contribution of the proposed model. The results are divided into two categories: the first is devoted to the case where λ is a scalar and the second to the case where λ is spatially varying. For a fair comparison, our method and the compared ones are implemented in Matlab 2013a on the same platform: a 3 GHz dual-core central processing unit (CPU) with 8 Gbytes of RAM. The stopping criterion is that the relative residual error becomes less than 10^{-4}:
\[
\mathrm{Err}=\frac{\|\hat{\lambda}^{n+1}-\hat{\lambda}^n\|_2}{\|\hat{\lambda}^{n+1}\|_2}\le 10^{-4}. \tag{84}
\]
In the following numerical experiments, we fix the values of \underline{\lambda} and \bar{\lambda} (the lower and upper bounds of the parameter λ in U_{ad}) for the proposed algorithm to \underline{\lambda}=10^{-3} and \bar{\lambda}=0.8.
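The stopping test (84) is a plain relative-change criterion on the iterates; a minimal sketch (the iterate names are illustrative):

```python
import numpy as np

def converged(lam_new, lam_old, tol=1e-4):
    """Relative residual stopping criterion, Eq. (84)."""
    err = np.linalg.norm(lam_new - lam_old) / np.linalg.norm(lam_new)
    return err <= tol

lam_old = np.full(100, 0.39)
lam_new = lam_old + 1e-6            # nearly stationary iterate
print(converged(lam_new, lam_old))  # True
```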

6.1. The case of scalar λ

In this part, we are interested in the case where the parameter λ is constant. We focus on the image denoising task, with the aim of reducing the staircasing effect and the intensity loss for different noise types. We compare our model to the classical Rudin, Osher and Fatemi (ROF) model [Citation3] solved by the Bregman iteration [Citation50], the nonlocal means (NLM) model [Citation51,Citation52], the combined TV and TV2 regularization (TV+TV2) [Citation19,Citation53] solved by the Bregman iteration, and the TGV model [Citation36,Citation37] solved by the Primal–Dual algorithm. Note that all parameters of the compared models were tuned to achieve the highest Peak Signal-to-Noise Ratio (PSNR) values.

We start with a first test on the Pirate image, of size 510×510 pixels. We construct a corrupted version of the original image by adding Gaussian noise of zero mean and standard deviation σ=0.3. For this example, the parameters (α0,α1) chosen for the TV+TV2 and TGV methods optimize the PSNR value, as can be seen in Figure . Concerning the NLM model, we chose a neighbourhood of size 9×9 and the threshold that determines the isotropic/anisotropic neighbourhood selection λ=22, while we take α=0.1 for the ROF model. These parameters are chosen using the same procedure for all the remaining tests.
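The synthetic degradation used here can be reproduced as follows; this is a sketch under the assumption that intensities are scaled to [0,1], and since the paper does not say whether noisy values are clipped back into range, we do not clip.

```python
import numpy as np

def add_gaussian_noise(img, sigma=0.3, seed=None):
    """Return img corrupted by additive zero-mean Gaussian noise of standard deviation sigma."""
    rng = np.random.default_rng(seed)
    return img + sigma * rng.standard_normal(img.shape)
```

For the impulse-noise tests later in this section, the corresponding salt-and-pepper corruption routine would replace a given fraction of pixels instead of adding a perturbation.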

Figure 1. Plot of the PSNR values of the restored image using the TGV and TV+TV2 approaches with respect to the parameters α0 and α1. We can see that the highest PSNR achieved by the TGV2 approach corresponds to the values α0=0.06 and α1=0.04. For the TV+TV2 model, the best PSNR corresponds to the values α0=0.011 and α1=0.05. Note that the original image is the Pirate one corrupted by Gaussian noise of variance 0.3.


In Figure , we show the respective denoised results of the different methods. Visually, our method achieves a reconstruction close to the one obtained by the TGV method, while outperforming the other models. In the second denoising test, we take the Cameraman image, which is a challenging textural image. We again consider Gaussian noise, but with a higher standard deviation σ=0.5. In order to assess the robustness of the proposed method, we present in Figure  the images restored by the different approaches, including the proposed one. Once again, we can observe that the proposed method visually achieves a better reconstruction than the other methods. To follow the evolution of the obtained weighting parameter λ, we plot in Figure  the values of λ with respect to the iterations for the two above examples. The obtained parameter for the Pirate image is λ=0.39, while for the Cameraman image it is λ=0.68.

Figure 2. The obtained denoised image compared to other classical approaches for the (Pirate image), where the noise is considered to be Gaussian of σ=0.3: (a) Noisy image, (b) ROF model [Citation3], (c) NLM model [Citation51], (d) TV+TV2 [Citation19] (e) TGV model [Citation36] and (f) Our model.


Figure 3. The obtained denoised image compared to other classical approaches for the (Cameraman image), where the noise is considered to be Gaussian of σ=0.5: (a) Noisy image, (b) ROF model [Citation3], (c) NLM model [Citation51], (d) TV+TV2 [Citation19], (e) TGV model [Citation36] and (f) our model.


Figure 4. The computation of the parameter λ with respect to the iteration for the two images Pirate and Cameraman: (a) The Pirate image and (b) The Cameraman image.


For the third and fourth experiments, we change the nature of the noise, which is now an impulse one. Accordingly, we change the fidelity term of the compared methods to the L1 norm, since it succeeds in handling impulse noise (see [Citation2,Citation54,Citation55]). First, we consider the Cameraman image corrupted by impulse noise with parameter 0.3. Figure  shows the denoised images obtained by the different approaches with the L1 norm; we can see once again that the proposed method efficiently reduces the impulse noise without destroying image features, even though the TGV method reduces the noise better than our approach. For the fourth test, we increase the impulse noise level to parameter 0.5. Figure  illustrates the restored images using the different regularization methods; we can observe that the result of the proposed approach is visually much sharper than the TGV one. We also plot in Figure  the evolution of the parameter λ with respect to the iterations for the third and fourth examples. The obtained parameter for the Cameraman image is λ=0.81, while for the Penguin image it is λ=0.90. Furthermore, to quantitatively assess the proposed denoising method against the other methods, we provide two metrics: the PSNR [Citation56] and the mean structural similarity (SSIM) [Citation57]. The PSNR measures signal strength relative to noise in the image, while the SSIM gives an indication of image quality related to known characteristics of the human visual system. Tables  and  report the PSNR and SSIM values for the above simulated tests. We can notice that the proposed model outperforms the compared methods. Moreover, to further compare the quality of the reconstructions, we present two tables with the normalized l2- and l1-distances between the true image and the restored ones.
Tables  and  report these normalized l2- and l1-distance values for the four above simulated tests, respectively. Once again, the proposed approach always attains the lowest values, which confirms its robustness.
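The scalar quality measures used in Tables 1, 3 and 4 can be computed as below (SSIM requires the windowed procedure of [Citation57] and is omitted); the `peak` argument is our assumption about the image dynamic range.

```python
import numpy as np

def psnr(x, x_ref, peak=1.0):
    """Peak signal-to-noise ratio in dB, for images with dynamic range [0, peak]."""
    mse = np.mean((x - x_ref) ** 2)
    return 10.0 * np.log10(peak ** 2 / mse)

def normalized_distance(x, x_ref, p=2):
    """Normalized l^p distance ||x - x_ref||_p / ||x_ref||_p (p=2 and p=1 in Tables 3-4)."""
    d = (x - x_ref).ravel()
    return np.linalg.norm(d, p) / np.linalg.norm(x_ref.ravel(), p)
```

A higher PSNR and a lower normalized distance both indicate a reconstruction closer to the true image.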

Figure 5. The obtained denoised image compared to other classical approaches for the Cameraman image, where the noise is considered to be impulse one of parameter 0.3: (a) Noisy image, (b) ROF [Citation3], (c) NLM [Citation51], (d) L1-TV+TV2 [Citation19], (e) L1-TGV [Citation36] and (f) Our model.


Figure 6. The obtained denoised image compared to other classical approaches for the (Penguin image), where the noise is considered to be impulse one of parameter 0.5: (a) Noisy image, (b) ROF model [Citation3], (c) NLM model [Citation51], (d) L1-TV+TV2 [Citation19], (e) L1-TGV model [Citation36] and (f) Our model.


Figure 7. The computation of the parameter λ with respect to the iteration for the two images Cameraman and Penguin: (a) The Cameraman image and (b) The Penguin image.


Table 1. The PSNR table.

Table 2. The SSIM table.

Table 3. The normalized l2 distance table.

Table 4. The normalized l1 distance table.

6.2. The space variant case

We now present extensive results to evaluate the proposed weighted PDE, where the parameter λ is spatially varying. We are particularly interested in presenting the results in the space
\[
U_{ad}^{2}=\big\{\lambda\in L^{2}(\Omega),\ 0<\underline{\lambda}\le\lambda\le\bar{\lambda}<1\ \text{a.e. }x\in\Omega\ \text{and}\ \|\nabla\lambda\|_{2}\le C_{0}\big\},
\]
which is the most used in the literature (especially the case \(\lambda\in H^{1}(\Omega)\)), see [Citation27–29,Citation58–60], and also in the proposed set of admissible filtering weights \(U_{ad}^{1}\) defined by
\[
U_{ad}^{1}=\big\{\lambda\in L^{1}(\Omega),\ 0<\underline{\lambda}\le\lambda\le\bar{\lambda}<1\ \text{a.e. }x\in\Omega\ \text{and}\ TV(\lambda)\le C_{0}\big\}.
\]
We start with a first experiment where we consider the Lena image contaminated by Gaussian noise with variance σ=0.4. We then use the proposed Algorithm 1 to compute the clean image X and the approximate weighting parameter λ using the two different choices of the space of admissible weights. The restored image and the spatially varying λ (projected on the image grid: λ(X)) obtained with the two sets are displayed in Figure . We can observe that the image restored using Uad1 is slightly sharper than the one restored using Uad2, especially near edges. In particular, for both cases the approximate λ is higher in homogeneous regions and textures, and weaker near edges where the variation is stronger. The next set of experiments examines the behaviour of the proposed weighted PDE with respect to different noise levels using the two admissible spaces. For that, we consider four images which essentially combine large piecewise constant parts and smooth areas. The restored images using the two sets Uad1 and Uad2 are depicted in Figures  and , respectively. The noisy images are obtained by adding Gaussian noise with variances σ=0.1, 0.3, 0.4 and 0.5, respectively, in the same order from the top to the bottom of Figures  and . From the surface representation of λ, we can see that λ is smooth for Uad2 and piecewise constant for Uad1. In both cases the shape of λ is related to that of the obtained image X, and for Uad1 the shape of λ is almost the same as the picture itself.
In addition, the same remark as for the previous test remains valid, since the values of λ are higher in homogeneous areas.
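Both admissible sets share the pointwise box constraint \(\underline{\lambda}\le\lambda(x)\le\bar{\lambda}\); a minimal sketch of the corresponding projection step is given below. The additional \(\|\nabla\lambda\|_{2}\le C_{0}\) or \(TV(\lambda)\le C_{0}\) bound would require an extra proximal step, which is not shown here.

```python
import numpy as np

def project_box(lam, lam_min=1e-3, lam_max=0.8):
    """Pointwise projection of a spatially varying weight onto [lam_min, lam_max].

    Defaults are the bounds fixed for the experiments in this section.
    """
    return np.clip(lam, lam_min, lam_max)
```

Applied after each update of λ, this keeps the iterate inside the box constraint common to Uad1 and Uad2.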

Figure 8. The restored image and the parameter λ using the two admissible sets of the Lena image: (a) Noisy, (b) X using Uad2, (c) λ using Uad2, (d) X using Uad1 and (e) λ using Uad1.


Figure 9. The restored image X and the corresponding weighted parameter λ using the admissible set Uad2. The Baboon image is contaminated by Gaussian noise with σ=0.1, Penguin image is contaminated by σ=0.3, Zebra image is contaminated by σ=0.4, Tiger image is contaminated by σ=0.5. The used parameters for these tests are: (k1,k2)=(35,89), σ=1.3 and ρ=2.7: (a) Noisy, (b) Obtained X, (c) λ and (d) Image of λ.


Figure 10. The restored image X and the corresponding weighted parameter λ using the admissible set Uad1. The Baboon image is contaminated by Gaussian noise with σ=0.1, the Penguin image with σ=0.3, the Zebra image with σ=0.4 and the Tiger image with σ=0.5. The used parameters for these tests are: (k1,k2)=(35,80), σ=1.7 and ρ=2.3: (a) Noisy, (b) Obtained X, (c) Obtained λ and (d) Image of λ.


The following four tests concern the comparison between the proposed weighted PDE and the spatially dependent regularization parameters for TV image denoising (SDRTV) [Citation27]. For a fair comparison, we consider the same conditions for our approach as for the SDRTV one (where \(\lambda\in H^{1}(\Omega)\)); for that, we set \(\lambda\in U_{ad}^{2}\) for our method and consider two comparative results. In the two experiments, we treat the denoising problem for brain scan images. The first set consists of an image of 256×256 pixels and Gaussian noise with zero mean and variance σ=0.3. The second one consists of an image of 512×512 pixels and Gaussian noise with zero mean and variance σ=0.4. The parameters used for the SDRTV method are γ=40, 10^13, μ=10^12, β=10^10 and h=0.1. The semismooth Newton method, computed on the whole domain, converges within kmax = 40 iterations. Note that the compared results are obtained without domain decomposition. For our method, the parameters used are (k1,k2)=(55,40), σ=1.8 and ρ=2.1. The corresponding results are shown in Figures  and , respectively. By visual comparison, we can see that the proposed PDE yields a significant improvement over the SDRTV method. In addition, from the surface representation of λ for the two methods, we can see that λ is continuous and its shape is related to that of the original image (especially in homogeneous areas).

Figure 11. Comparison between the obtained clean image X and the SDRTV method with the respective computation of the spatially dependent parameter λ of the Head image: (a) Original, (b) Noisy, (c) λ initial, (d) SDRTV, (e) Obtained λ, (f) Image of λ, (g) Proposed PDE, (h) Obtained λ and (i) Image of λ.


Figure 12. Comparison between the obtained clean image and the SDRTV method with the respective computation of the spatially dependent parameter λ of the Brain image: (a) Original, (b) Noisy, (c) λ initial, (d) SDRTV, (e) Obtained λ, (f) Image of λ, (g) Proposed PDE, (h) Obtained λ and (i) Image of λ.


We now consider the proposed admissible set, i.e. \(\lambda\in U_{ad}^{1}\) for our method, and present two experiments comparing our PDE with the SDRTV approach (\(\lambda\in H^{1}(\Omega)\)). We keep the same parameters as in the previous tests. The noise is Gaussian with variances σ=0.3 and σ=0.5, respectively. The recovered images are depicted in Figures  and , respectively. Once again, our method outperforms the SDRTV by visual comparison. In addition, the approximate λ tends to capture more information from the image, which can be attributed to the lower regularity of the proposed set of admissible weights.

Figure 13. Comparison between the obtained clean image X and the SDRTV method with the respective computation of the spatially dependent parameter λ of the Dolphins image: (a) Original, (b) Noisy, (c) λ initial, (d) SDRTV, (e) Obtained λ, (f) Image of λ, (g) Obtained X, (h) Obtained λ and (i) Image of λ.


Figure 14. Comparison between the obtained clean image X and the SDRTV method with the respective computation of the spatially-dependent parameter λ of the Plane image: (a) Original, (b) Noisy, (c) λ initial, (d) SDRTV, (e) Obtained λ, (f) Image of λ, (g) Obtained X, (h) Obtained λ and (i) Image of λ.

We end this part with a final example comparing the scalar approximation of λ and the spatially varying approximations of λ in the respective spaces Uad1 and Uad2. We consider the Fishes image, a challenging textured example, and add Gaussian noise with high variance σ=0.5. The images restored using the different λ approximations are presented in Figure . In view of the subplots in Figure , the obtained spatially varying regularization parameter shares similar patterns with the ones obtained in the previous tests. In particular, the image X restored using the admissible set Uad1 is visually even better, especially near sharp edges. In addition, the spatially varying result reduces the staircasing effect better than the scalar one.

6.3. Robustness of the proposed PDE

In this part, we use four simulated images to assess the proposed PDE-constrained model with the spatially varying choice of λ. To show the efficiency of the proposed PDE in recovering essential image features, we compare it to other competitive denoising PDEs: the fourth-order partial differential equations for image enhancement (FOPDE) [Citation61], the adaptive fourth-order partial differential equation filter for image denoising (AEFD) [Citation62] and the adaptive fourth-order partial differential equation for image denoising (AFOD) [Citation63]. The four tests use different levels of Gaussian noise, with σ=30, 40, 50 and 60, respectively. In Figure , we present the restored versions of the four noisy images using the compared enhancement PDEs and ours. The robustness of the proposed PDE in avoiding the different artefacts of the other ones is clearly visible. The parameters are tuned with respect to the best obtained PSNR. For example, the parameters used for the Tiger image are: λ=0.09, Δt=0.07, σ=0.5, K=0.001 and 50 iterations for the AEFD approach; μ=0.09, Δt=0.05, k1=0.05, k2=0.05, γ=10^-6 and 100 iterations for the AFOD approach; β=10^-4, Δt=0.1, σ1=0.001, σ2=0.8, α1=0.01, α2=44 and 120 iterations for the FOPDE method. The parameters of our approach are: (k1,k2)=(33,42), σ=1.54, ρ=3.1, β=1 and 50 iterations.

Figure 15. Comparison between the obtained clean image X and the respective computation of the spatially dependent parameter λ for the scalar case and for the two sets of admissible values Uad1 and Uad2 for the Fishes image: (a) Original, (b) Noisy, (c) scalar λ, (d) weighted λ∈Uad2 and (e) weighted λ∈Uad1.


Figure 16. The obtained denoised image compared to other PDE approaches with respect to both quality measures PSNR and SSIM. First row: noisy images. Second row: FOPDE. Third row: AEFD. Fourth row: AFOD. Fifth row: Our approach. (a) PSNR = 19.17, SSIM = 0.327. (b) PSNR = 15.40, SSIM = 0.377. (c) PSNR = 15.12, SSIM = 0.310. (d) PSNR = 14.08, SSIM = 0.286. (e) PSNR = 23.79, SSIM = 0.539. (f) PSNR = 22.61, SSIM = 0.588. (g) PSNR = 20.66, SSIM = 0.548. (h) PSNR = 20.41, SSIM = 0.422, (i) PSNR = 26.19, SSIM = 0.719, (j) PSNR = 23.95, SSIM = 0.650, (k) PSNR = 23.62, SSIM = 0.647. (l) PSNR = 28.48, SSIM = 0.747. (m) PSNR = 26.94, SSIM = 0.746, (n) PSNR = 25.62, SSIM = 0.741, (o) PSNR = 22.81, SSIM = 0.627, (p) PSNR = 24.96, SSIM = 0.597, (q) PSNR = 28.14, SSIM = 0.826, (r) PSNR = 27.65, SSIM = 0.821, (s) PSNR = 26.03, SSIM = 0.765 and (t) PSNR = 31.91, SSIM = 0.872.


7. Conclusion

In this paper, we have introduced a fourth-order PDE-constrained optimization model to treat the denoising task. This PDE is elaborated using a convex combination of two operators controlled by a weighting parameter, which is the solution of the optimization problem. The PDE takes advantage of the Perona–Malik equation, with additional regularity, and of the efficiency of a nonlinear filter in restoring tiny and sharp edges. The existence and uniqueness of the solution of the proposed PDE-constrained model are proved using the Schauder fixed point theorem, and the well-posedness of the optimization model is established using control theory in a suitable space. The results section demonstrates, visually and quantitatively, the performance and the contribution of this new PDE over the other denoising approaches. In summary, the proposed approach gives a promising analytic strategy for determining an optimal solution for the denoised image together with the parameter λ. As further developments of this method, it may be generalized to other operators and different regularization functionals. Another very important generalization of the proposed model is to move from a simple denoising model to more general inverse problems involving linear and nonlinear operators.

Acknowledgments

We are very grateful to the anonymous referees for their careful corrections and useful suggestions, which have much improved this paper.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  • Kirsch A. An introduction to the mathematical theory of inverse problems. Vol. 120. Germany: Springer Science & Business Media; 2011.
  • Chan TF, Esedoglu S. Aspects of total variation regularized l1 function approximation. SIAM J Appl Math. 2005;65(5):1817–1837.
  • Rudin LI, Osher S, Fatemi E. Nonlinear total variation based noise removal algorithms. Phys D Nonlinear Phenom. 1992;60(1-4):259–268.
  • Hintermüller M, Holler M, Papafitsoros K. A function space framework for structural total variation regularization with applications in inverse problems. Inverse Probl. 2018;34(6):064002.
  • Liu J, Ni G, Yan S. Alternating method based on framelet l0-norm and TV regularization for image restoration. Inverse Probl Sci Eng. 2019;27(6):790–807.
  • Laghrib A, Ezzaki M, El Rhabi M, et al. Simultaneous deconvolution and denoising using a second order variational approach applied to image super resolution. Comput Vis Image Underst. 2018;168:50–63.
  • Chantas G, Galatsanos NP, Molina R, et al. Variational bayesian image restoration with a product of spatially weighted total variation image priors. IEEE Trans Image Process. 2010;19(2):351–362.
  • Rodríguez P, Wohlberg B. Efficient minimization method for a generalized total variation functional. IEEE Trans Image Process. 2009;18(2):322–332.
  • Frick K, Marnitz P, Munk A. Statistical multiresolution estimation for variational imaging: with an application in poisson-biophotonics. J Math Imaging Vis. 2013;46(3):370–387.
  • Dong Y, Hintermüller M, Rincon-Camacho MM. Automated regularization parameter selection in multi-scale total variation models for image restoration. J Math Imaging Vis. 2011;40(1):82–104.
  • Bertalmío M, Caselles V, Rougé B, et al. Tv based image restoration with local constraints. J Sci Comput. 2003;19(1–3):95–122.
  • Almansa A, Ballester C, Caselles V, et al. A tv based restoration model with local constraints. J Sci Comput. 2008;34(3):209–236.
  • Strong DM, Aujol J-F, Chan TF. Scale recognition, regularization parameter selection, and Meyer's g norm in total variation regularization. Multiscale Model Simulat. 2006;5(1):273–303.
  • Gilboa G, Sochen NA, Zeevi YY. Estimation of optimal pde-based denoising in the snr sense. IEEE Trans Image Proces. 2006;15(8):2269–2280.
  • Vogel CR. Computational methods for inverse problems. Vol. 23. US: SIAM; 2002.
  • De los Reyes JC, Schönlieb C-B. Image denoising: learning the noise model via nonsmooth pde-constrained optimization. Inverse Problems Imaging. 2013;7(4):1183–1214.
  • Chan T, Marquina A, Mulet P. High-order total variation-based image restoration. SIAM J Sci Comput. 2000;22(2):503–516.
  • Osher S, Solé A, Vese L. Image decomposition and restoration using total variation minimization and the H−1 norm. Multiscale Model Simul. 2003;1(3):349–370.
  • Papafitsoros K, Schönlieb C-B. A combined first and second order variational approach for image reconstruction. J Math Imaging Vis. 2014;48(2):308–338.
  • Chen Y, Ranftl R, Pock T. Insights into analysis operator learning: from patch-based sparse models to higher order mrfs. IEEE Trans Image Process. 2014;23(3):1060–1072.
  • Haber E, Tenorio L. Learning regularization functionals–a supervised training approach. Inverse Probl. 2003;19(3):611.
  • De los Reyes JC, Schönlieb C-B, Valkonen T. The structure of optimal parameters for image restoration problems. J Math Anal Appl. 2016;434(1):464–500.
  • De los Reyes JC, Schönlieb C-B, Valkonen T. Bilevel parameter learning for higher-order total variation regularisation models. J Math Imaging Vis. 2017;57(1):1–25.
  • Chung C, De los Reyes JC, Schönlieb C-B. Learning optimal spatially-dependent regularization parameters in total variation image restoration. arXiv preprint arXiv:1603.09155.
  • Kunisch K, Pock T. A bilevel optimization approach for parameter learning in variational models. SIAM J Imaging Sci. 2013;6(2):938–983.
  • Calatroni L, Cao C, De Los Reyes JC, et al. Bilevel approaches for learning of variational imaging models. Variat Methods Imaging Geom Control. 2017;18(252):2.
  • Van Chung C, De los Reyes J, Schönlieb C. Learning optimal spatially-dependent regularization parameters in total variation image denoising. Inverse Probl. 2017;33(7):074005.
  • Hintermüller M, Rautenberg CN. Optimal selection of the regularization function in a weighted total variation model. part i: modelling and theory. J Math Imaging Vis. 2017;59(3):498–514.
  • Hintermüller M, Rautenberg CN, Wu T, et al. Optimal selection of the regularization function in a weighted total variation model. part ii: algorithm, its analysis and numerical tests. J Math Imaging Vision. 2017;59(3):515–533.
  • Perona P, Shiota T, Malik J. Anisotropic diffusion. In: Geometry-driven diffusion in computer vision. Springer; 1994. p. 73–92.
  • Weickert J. Coherence-enhancing diffusion filtering. Int J Comput Vis. 1999;31(2-3):111–127.
  • Weickert J, Scharr H. A scheme for coherence-enhancing diffusion filtering with optimized rotation invariance. J Vis Commun Image Represent. 2002;13(1-2):103–118.
  • Burgeth B, Didas S, Weickert J. A general structure tensor concept and coherence-enhancing diffusion filtering for matrix fields. In: Visualization and processing of tensor fields. Springer; 2009. p. 305–323.
  • Elad M. On the origin of the bilateral filter and ways to improve it. IEEE Trans Image Process. 2002;11(10):1141–1151.
  • El Mourabit I, El Rhabi M, Hakim A, et al. A new denoising model for multi-frame super-resolution image reconstruction. Signal Processing. 2017;132:51–65.
  • Bredies K, Kunisch K, Pock T. Total generalized variation. SIAM J Imaging Sci. 2010;3(3):492–526.
  • Valkonen T, Bredies K, Knoll F. Total generalized variation in diffusion tensor imaging. SIAM J Imaging Sci. 2013;6(1):487–525.
  • Chambolle A, Pock T. A first-order primal-dual algorithm for convex problems with applications to imaging. J Math Imaging Vis. 2011;40(1):120–145.
  • Zhang X, Burger M, Osher S. A unified primal-dual algorithm framework based on bregman iteration. J Sci Comput. 2011;46(1):20–46.
  • Laghrib A, Hakim A, Raghay S. An iterative image super-resolution approach based on bregman distance. Signal Proces Image Commun. 2017;58:24–34.
  • Afraites L, Hadri A, Laghrib A. A denoising model adapted for impulse and gaussian noises using a constrained-pde. Inverse Probl. 2020;36(2):025006.
  • Gilbarg D, Trudinger NS. Elliptic partial differential equations of second order. Springer; 2015.
  • Catté F, Lions P-L, Morel J-M, et al. Image selective smoothing and edge detection by nonlinear diffusion. SIAM J Numer Anal. 1992;29(1):182–193.
  • Brezis H. Analyse fonctionnelle. Masson: Paris; 1983.
  • Dautray R, Lions J. Mathematical analysis and numerical methods for science and technology: evolution problems I. US: Springer; 1992.
  • Zeidler E. Nonlinear functional analysis and its applications: III: variational methods and optimization. US: Springer Science & Business Media; 2013.
  • Aubin JP. Un théorème de compacité. Acad Sci Paris. 1963;256:5042–5044.
  • Simon J. Compact sets in the space lp(0,t;b). Ann. Mat. Pura Appl. 1987;146:65–96.
  • Clason C, Valkonen T. Primal-dual extragradient methods for nonlinear nonsmooth pde-constrained optimization. SIAM J Optim. 2017;27(3):1314–1339.
  • Wu C, Tai X-C. Augmented lagrangian method, dual methods, and split Bregman iteration for rof, vectorial tv, and high order models. SIAM J Imaging Sci. 2010;3(3):300–339.
  • Buades A, Coll B, Morel J-M. Non-local means denoising. Image Process On Line. 2011;1:208–212.
  • Maleki A, Narayan M, Baraniuk RG. Anisotropic nonlocal means denoising. Appl Comput Harmon Anal. 2013;35(3):452–482.
  • Papafitsoros K, Schoenlieb CB, Sengul B. Combined first and second order total variation inpainting using split Bregman. Image Process On Line. 2013;3:112–136.
  • Duval V, Aujol J-F, Gousseau Y. The tvl1 model: a geometric point of view. Multiscale Model Simul. 2009;8(1):154–189.
  • Nikolova M. A variational approach to remove outliers and impulse noise. J Math Imaging Vis. 2004;20(1-2):99–120.
  • De Boer JF, Cense B, Park BH, et al. Improved signal-to-noise ratio in spectral-domain compared with time-domain optical coherence tomography. Opt Lett. 2003;28(21):2067–2069.
  • Wang Z, Simoncelli EP, Bovik AC. Multiscale structural similarity for image quality assessment. In: The thirty-seventh asilomar conference on signals, systems & computers. Vol. 2. IEEE; 2003, p. 1398–1402.
  • Bredies K, Dong Y, Hintermüller M. Spatially dependent regularization parameter selection in total generalized variation models for image restoration. Int J Comput Math. 2013;90(1):109–123.
  • Hintermüller M, Papafitsoros K. Generating structured nonsmooth priors and associated primal-dual methods. In: Handbook of numerical analysis. Vol. 20. Elsevier; 2019. p. 437–502.
  • Hintermüller M, Papafitsoros K, Rautenberg CN, et al. Dualization and automatic distributed parameter selection of total generalized variation via bilevel optimization. arXiv preprint arXiv:2002.05614.
  • Yi D, Lee S. Fourth-order partial differential equations for image enhancement. Appl Math Comput. 2006;175(1):430–440.
  • Liu X, Huang L, Guo Z. Adaptive fourth-order partial differential equation filter for image denoising. Appl Math Lett. 2011;24(8):1282–1288.
  • Zhang X, Ye W. An adaptive fourth-order partial differential equation for image denoising. Comput Math Appl. 2017;74(10):2529–2545.
  • Casas E, Fernandez L. Distributed control of systems governed by a general class of quasilinear elliptic equations. J Diff Equ.
  • Ciarlet PG. The finite element method for elliptic problems. Vol. 40. SIAM; 2002.
  • Eymard R, Herbin R, Linke A, et al. Convergence of a finite volume scheme for the biharmonic problem.

Appendices

Appendix 1

The next propositions show the differentiability of the solution operator $S$. This result allows us to obtain an expression for the adjoint derivative operator $S'(\lambda)^{*}$ using the appropriate adjoint state.

Proposition A.1

Let $S:U_{ad}\to L^2(\Omega)$ be the solution operator of Equation (2) at $t=T$ associated with each parameter $\lambda$. The operator $S$ is Gâteaux differentiable and its derivative at $\lambda$ in the direction $h$ is given by $S'(\lambda)h=X_1(T)$, where $X_1$ is the unique solution of the following linearized equation:
\[
\tag{A1}
\left\{
\begin{aligned}
&\frac{\partial X_1}{\partial t}
+\nabla^2\!\cdot\!\big(\lambda D(J_\rho^\sigma(\nabla X))\,\nabla^2 X_1\big)
-\nabla\!\cdot\!\big((1-\lambda)D(J_\rho^\sigma(\nabla X))\,\nabla X_1\big)\\
&\quad+\nabla^2\!\cdot\!\big(\lambda D'(J_\rho^\sigma(\nabla X))\,\nabla X_1\,\nabla^2 X\big)
-\nabla\!\cdot\!\big((1-\lambda)D'(J_\rho^\sigma(\nabla X))\,\nabla X_1\,\nabla X\big)\\
&\qquad=-\nabla^2\!\cdot\!\big(h\,D(J_\rho^\sigma(\nabla X))\,\nabla^2 X\big)
-\nabla\!\cdot\!\big(h\,D(J_\rho^\sigma(\nabla X))\,\nabla X\big)
&&\text{in } ]0,T[\times\Omega,\\
&\big\langle D(J_\rho^\sigma(\nabla X))\,\nabla X_1,\nu\big\rangle=0
&&\text{on } ]0,T[\times\partial\Omega,\\
&X_1(t,x)=0 &&\text{on } ]0,T[\times\partial\Omega,\\
&X_1(0,x)=0 &&\text{in } \Omega,
\end{aligned}
\right.
\]
where $D'(J_\rho^\sigma(\nabla X))=\dfrac{\partial D(J_\rho^\sigma(\nabla X))}{\partial J_\rho^\sigma(\nabla X)}$ is the derivative of $D(J_\rho^\sigma(\nabla X))$ with respect to the state $X$ through its argument.

Proof.

We decompose the proof into three steps:

Step 1. Let $X_\varepsilon$ and $X$ be the unique solutions of (2) corresponding to $\lambda+\varepsilon h$ and $\lambda$, respectively, for $h\in U_{ad}\subset L^\infty(\Omega)\cap BV(\Omega)$ and small $\varepsilon$. We first establish a priori estimates for the difference of the two solutions $X_\varepsilon$ and $X$. Taking the difference between the equations satisfied by $X_\varepsilon$ and $X$, we have, for every test function $\phi$:
\[
\tag{A2}
\begin{aligned}
&\Big\langle\frac{\partial X_\varepsilon}{\partial t}-\frac{\partial X}{\partial t},\phi\Big\rangle
+\int_\Omega\lambda\big(D(J_\rho^\sigma(\nabla X_\varepsilon))\nabla^2X_\varepsilon-D(J_\rho^\sigma(\nabla X))\nabla^2X\big):\nabla^2\phi\,dx\\
&\quad+\int_\Omega(1-\lambda)\big(D(J_\rho^\sigma(\nabla X_\varepsilon))\nabla X_\varepsilon-D(J_\rho^\sigma(\nabla X))\nabla X\big)\cdot\nabla\phi\,dx\\
&\qquad=-\varepsilon\int_\Omega h\,D(J_\rho^\sigma(\nabla X_\varepsilon))\big(\nabla^2X_\varepsilon:\nabla^2\phi-\nabla X_\varepsilon\cdot\nabla\phi\big)\,dx.
\end{aligned}
\]
We can rewrite (A2) as follows:
\[
\tag{A3}
\begin{aligned}
&\Big\langle\frac{\partial(X_\varepsilon-X)}{\partial t},\phi\Big\rangle
+\int_\Omega\lambda D(J_\rho^\sigma(\nabla X_\varepsilon))\,\nabla^2(X_\varepsilon-X):\nabla^2\phi\,dx
+\int_\Omega(1-\lambda)D(J_\rho^\sigma(\nabla X_\varepsilon))\,\nabla(X_\varepsilon-X)\cdot\nabla\phi\,dx\\
&\quad+\int_\Omega\lambda\big(D(J_\rho^\sigma(\nabla X_\varepsilon))-D(J_\rho^\sigma(\nabla X))\big)\nabla^2X:\nabla^2\phi\,dx
+\int_\Omega(1-\lambda)\big(D(J_\rho^\sigma(\nabla X_\varepsilon))-D(J_\rho^\sigma(\nabla X))\big)\nabla X\cdot\nabla\phi\,dx\\
&\qquad=-\varepsilon\int_\Omega h\,D(J_\rho^\sigma(\nabla X_\varepsilon))\big(\nabla^2X_\varepsilon:\nabla^2\phi-\nabla X_\varepsilon\cdot\nabla\phi\big)\,dx.
\end{aligned}
\]
Choosing suitable test functions and using the properties of the diffusion tensor $D$, we deduce the a priori estimates
\[
\tag{A4}
\|\nabla^2(X_\varepsilon-X)\|_{L^2(0,T;L^2(\Omega))}\le C\varepsilon\|h\|_{L^\infty(\Omega)},
\]
\[
\tag{A5}
\|\nabla(X_\varepsilon-X)\|_{L^2(0,T;L^2(\Omega))}\le C\varepsilon\|h\|_{L^\infty(\Omega)},
\]
\[
\tag{A6}
\|X_\varepsilon-X\|_{L^\infty(0,T;L^2(\Omega))}\le C\varepsilon\|h\|_{L^\infty(\Omega)},
\]
and
\[
\tag{A7}
\Big\|\frac{\partial(X_\varepsilon-X)}{\partial t}\Big\|_{L^2(0,T;H^{-2}(\Omega))}\le C\varepsilon\|h\|_{L^\infty(\Omega)}.
\]
Step 2. We consider the sequence $\{X_\varepsilon^1\}_{\varepsilon>0}$, with $X_\varepsilon^1:=\dfrac{X_\varepsilon-X}{\varepsilon}$, and we prove the weak convergence of $X_\varepsilon^1$ to $X_1$, the solution of Equation (A1).

The equation corresponding to (A3) for $X_\varepsilon^1$ reads:
\[
\tag{A8}
\begin{aligned}
&\Big\langle\frac{\partial X_\varepsilon^1}{\partial t},\phi\Big\rangle
+\int_\Omega\lambda D(J_\rho^\sigma(\nabla X_\varepsilon))\nabla^2X_\varepsilon^1:\nabla^2\phi\,dx
+\int_\Omega\lambda\,\frac{D(J_\rho^\sigma(\nabla X_\varepsilon))-D(J_\rho^\sigma(\nabla X))}{\varepsilon}\,\nabla^2X:\nabla^2\phi\,dx\\
&\quad+\int_\Omega(1-\lambda)D(J_\rho^\sigma(\nabla X_\varepsilon))\nabla X_\varepsilon^1\cdot\nabla\phi\,dx
+\int_\Omega(1-\lambda)\,\frac{D(J_\rho^\sigma(\nabla X_\varepsilon))-D(J_\rho^\sigma(\nabla X))}{\varepsilon}\,\nabla X\cdot\nabla\phi\,dx\\
&\qquad=-\int_\Omega h\,D(J_\rho^\sigma(\nabla X_\varepsilon))\big(\nabla^2X_\varepsilon:\nabla^2\phi-\nabla X_\varepsilon\cdot\nabla\phi\big)\,dx.
\end{aligned}
\]
Using the mean value theorem in integral form, we get:
\[
\tag{A9}
\begin{aligned}
&\Big\langle\frac{\partial X_\varepsilon^1}{\partial t},\phi\Big\rangle
+\int_\Omega\lambda D(J_\rho^\sigma(\nabla X_\varepsilon))\nabla^2X_\varepsilon^1:\nabla^2\phi\,dx
+\int_\Omega\lambda D'(J_\rho^\sigma(\nabla W_\varepsilon))\,\nabla X_\varepsilon^1\,\nabla^2X:\nabla^2\phi\,dx\\
&\quad+\int_\Omega(1-\lambda)D(J_\rho^\sigma(\nabla X_\varepsilon))\nabla X_\varepsilon^1\cdot\nabla\phi\,dx
+\int_\Omega(1-\lambda)D'(J_\rho^\sigma(\nabla W_\varepsilon))\,\nabla X_\varepsilon^1\,\nabla X\cdot\nabla\phi\,dx\\
&\qquad=-\int_\Omega h\,D(J_\rho^\sigma(\nabla X_\varepsilon))\big(\nabla^2X_\varepsilon:\nabla^2\phi-\nabla X_\varepsilon\cdot\nabla\phi\big)\,dx,
\end{aligned}
\]
where $W_\varepsilon=X+\rho_\varepsilon(X_\varepsilon-X)$ with $0\le\rho_\varepsilon\le1$.
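For clarity, the mean value theorem in integral form used in this step can be written out as follows (a sketch in the paper's notation, applied componentwise to the tensor $D$):

```latex
% Mean value theorem in integral form: for a, b in the range of J_\rho^\sigma,
\[
  D(b)-D(a)=\Big(\int_0^1 D'\big(a+s\,(b-a)\big)\,\mathrm{d}s\Big)(b-a),
\]
% with a = J_\rho^\sigma(\nabla X) and b = J_\rho^\sigma(\nabla X_\varepsilon);
% the pointwise (Lagrange) version gives the intermediate state used above:
\[
  \frac{D\big(J_\rho^\sigma(\nabla X_\varepsilon)\big)-D\big(J_\rho^\sigma(\nabla X)\big)}{\varepsilon}
  = D'\big(J_\rho^\sigma(\nabla W_\varepsilon)\big)\,
    \frac{J_\rho^\sigma(\nabla X_\varepsilon)-J_\rho^\sigma(\nabla X)}{\varepsilon},
  \qquad W_\varepsilon = X+\rho_\varepsilon\,(X_\varepsilon-X),\quad 0\le\rho_\varepsilon\le 1.
\]
```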

Using the estimates (A4)–(A7) together with the Aubin–Lions and Aubin–Simon lemmas [Citation45–48], we can extract a subsequence, still denoted $(X_\varepsilon^1)$, such that $X_\varepsilon^1\to X_1$ strongly in $L^2(0,T;H_0^1(\Omega))$ and $X_\varepsilon^1\to X_1$ strongly in $L^\infty(0,T;L^2(\Omega))$ as $\varepsilon\to0$. By the same compactness arguments, we can extract a subsequence, still denoted $(X_\varepsilon)$, such that $X_\varepsilon\to X$ strongly in $L^2(0,T;H_0^1(\Omega))$ and in $L^\infty(0,T;L^2(\Omega))$. Thanks to the smoothness of $D$, its derivative $D'$ is continuous, so we can pass to the limit in (A9) to obtain:
\[
\tag{A10}
\begin{aligned}
&\Big\langle\frac{\partial X_1}{\partial t},\phi\Big\rangle
+\int_\Omega\lambda D(J_\rho^\sigma(\nabla X))\nabla^2X_1:\nabla^2\phi\,dx
+\int_\Omega\lambda D'(J_\rho^\sigma(\nabla X))\,\nabla X_1\,\nabla^2X:\nabla^2\phi\,dx\\
&\quad+\int_\Omega(1-\lambda)D(J_\rho^\sigma(\nabla X))\nabla X_1\cdot\nabla\phi\,dx
+\int_\Omega(1-\lambda)D'(J_\rho^\sigma(\nabla X))\,\nabla X_1\,\nabla X\cdot\nabla\phi\,dx\\
&\qquad=-\int_\Omega h\,D(J_\rho^\sigma(\nabla X))\big(\nabla^2X:\nabla^2\phi-\nabla X\cdot\nabla\phi\big)\,dx.
\end{aligned}
\]
Consequently, $X_1$ is the solution of the linearized equation (A1), and it satisfies
\[
\tag{A11}
\|\nabla^2X_1\|_{L^2(0,T;L^2(\Omega))}\le C\|h\|_{L^\infty(\Omega)},
\]
\[
\tag{A12}
\|\nabla X_1\|_{L^2(0,T;L^2(\Omega))}\le C\|h\|_{L^\infty(\Omega)},
\]
\[
\tag{A13}
\|X_1\|_{L^\infty(0,T;L^2(\Omega))}\le C\|h\|_{L^\infty(\Omega)},
\]
and
\[
\tag{A14}
\Big\|\frac{\partial X_1}{\partial t}\Big\|_{L^2(0,T;H^{-2}(\Omega))}\le C\|h\|_{L^\infty(\Omega)}.
\]
Step 3. We now prove that the sequence $X_\varepsilon^1$ converges strongly in $L^2(0,T;H_0^2(\Omega))$ to $X_1$. In view of the previous step, it suffices to prove that
\[
\nabla^2X_\varepsilon^1\to\nabla^2X_1\quad\text{in } L^2\big(0,T;(L^2(\Omega))^{2\times2}\big).
\]
Consider the two matrices
\[
M_\varepsilon=(\lambda+\varepsilon h)\,D(J_\rho^\sigma(\nabla X_\varepsilon))
\quad\text{and}\quad
M=\lambda\,D(J_\rho^\sigma(\nabla X)).
\]
$M_\varepsilon$ and $M$ are symmetric and positive definite. Using the Cholesky decomposition, we obtain lower triangular matrices $L_\varepsilon$ and $L$ such that
\[
M_\varepsilon=L_\varepsilon L_\varepsilon^T\quad\text{and}\quad M=LL^T.
\]
Applying the same convergence arguments as in Step 2 to the weak formulation (A9) with test function $X_\varepsilon^1$, and integrating by parts, we obtain
\[
\tag{A15}
\begin{aligned}
\lim_{\varepsilon\to0}\int_0^T\!\!\int_\Omega M_\varepsilon\nabla^2X_\varepsilon^1:\nabla^2X_\varepsilon^1\,dx\,dt
&=-\tfrac12\|X_1(\cdot,T)\|_{L^2(\Omega)}^2
-\int_0^T\!\!\int_\Omega\lambda D'(J_\rho^\sigma(\nabla X))\,\nabla X_1\,\nabla^2X:\nabla^2X_1\,dx\,dt\\
&\quad-\int_0^T\!\!\int_\Omega(1-\lambda)D(J_\rho^\sigma(\nabla X))\nabla X_1\cdot\nabla X_1\,dx\,dt
-\int_0^T\!\!\int_\Omega(1-\lambda)D'(J_\rho^\sigma(\nabla X))\,\nabla X_1\,\nabla X\cdot\nabla X_1\,dx\,dt\\
&\quad-\int_0^T\!\!\int_\Omega h\,D(J_\rho^\sigma(\nabla X))\big(\nabla^2X:\nabla^2X_1-\nabla X\cdot\nabla X_1\big)\,dx\,dt\\
&=\int_0^T\!\!\int_\Omega M\nabla^2X_1:\nabla^2X_1\,dx\,dt,
\end{aligned}
\]
that is,
\[
\int_0^T\!\!\int_\Omega M_\varepsilon\nabla^2X_\varepsilon^1:\nabla^2X_\varepsilon^1\,dx\,dt
\longrightarrow
\int_0^T\!\!\int_\Omega M\nabla^2X_1:\nabla^2X_1\,dx\,dt
\quad\text{as }\varepsilon\to0.
\]
Using this convergence and (A10), we prove by the same argument as in ([Citation64], Theorem 3.1, Step 3) that $\nabla^2X_\varepsilon^1\to\nabla^2X_1$ in $L^2(0,T;L^2(\Omega))$.
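Step 3 relies on the matrices $M_\varepsilon$ and $M$ being symmetric positive definite, so that they admit Cholesky factors. A minimal numerical sketch (with a hypothetical $2\times2$ diffusion tensor standing in for $\lambda D(J_\rho^\sigma(\nabla X))$ at a single point) illustrates the factorization:

```python
import numpy as np

# Hypothetical symmetric positive definite diffusion tensor, standing in
# for M = lambda * D(J_rho^sigma(grad X)) at a single pixel.
lam = 0.7
D = np.array([[2.0, 0.3],
              [0.3, 1.0]])
M = lam * D

# Cholesky factorization M = L L^T with L lower triangular.
L = np.linalg.cholesky(M)

# The factorization reproduces M up to round-off...
assert np.allclose(L @ L.T, M)
# ...and L is indeed lower triangular.
assert np.allclose(L, np.tril(L))
```

Positive definiteness is exactly the property that makes `np.linalg.cholesky` succeed; it would raise `LinAlgError` on an indefinite matrix.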

By the same techniques, we also find that $\dfrac{\partial X_\varepsilon^1}{\partial t}\to\dfrac{\partial X_1}{\partial t}$ in $L^2(0,T;H^{-2}(\Omega))$. This completes the proof.

Based on the differentiability properties of the solution operator, the next proposition gives the expression of the adjoint derivative operator $S'(\lambda)^{*}(\cdot)$.

Proposition A.2

We have:
\[
\tag{A16}
S'(\lambda)^{*}W=-\int_0^T D(J_\rho^\sigma(\nabla X))\,\nabla^2X:\nabla^2P\,dt
+\int_0^T D(J_\rho^\sigma(\nabla X))\,\nabla X\cdot\nabla P\,dt,
\]
where $P$ is the solution of the following adjoint problem:
\[
\tag{A17}
\left\{
\begin{aligned}
&-\frac{\partial P}{\partial t}
+\nabla^2\!\cdot\!\big(\lambda D(J_\rho^\sigma(\nabla X))\,\nabla^2P\big)
-\nabla\!\cdot\!\big((1-\lambda)D(J_\rho^\sigma(\nabla X))\,\nabla P\big)\\
&\quad-\nabla\!\cdot\!\big(\lambda\,[D'(J_\rho^\sigma(\nabla X))]^{*}\,\nabla^2X:\nabla^2P\big)
-\nabla\!\cdot\!\big((1-\lambda)\,[D'(J_\rho^\sigma(\nabla X))]^{*}\,\nabla X\cdot\nabla P\big)=0
&&\text{in } ]0,T[\times\Omega,\\
&\big\langle D(J_\rho^\sigma(\nabla X))\,\nabla P,\nu\big\rangle=0 &&\text{on } ]0,T[\times\partial\Omega,\\
&P(t,x)=0 &&\text{on } ]0,T[\times\partial\Omega,\\
&P(T,x)=W(x) &&\text{in } \Omega,
\end{aligned}
\right.
\]
where $[D'(J_\rho^\sigma(\nabla X))]^{*}$ denotes the adjoint of the linear map $\nabla X_1\mapsto D'(J_\rho^\sigma(\nabla X))\nabla X_1$.

Proof.

To prove the result (A16), we take $\phi\in L^2(0,T;H_0^2(\Omega))$ as test function in the variational forms of the systems (A1) and (A17), which gives:
\[
\tag{A18}
\begin{aligned}
&\int_0^T\Big\langle\frac{\partial X_1}{\partial t},\phi\Big\rangle
+\int_0^T\!\!\int_\Omega\lambda D(J_\rho^\sigma(\nabla X))\nabla^2X_1:\nabla^2\phi
+\int_0^T\!\!\int_\Omega(1-\lambda)D(J_\rho^\sigma(\nabla X))\nabla X_1\cdot\nabla\phi\\
&\quad+\int_0^T\!\!\int_\Omega\lambda D'(J_\rho^\sigma(\nabla X))\,\nabla X_1\,\nabla^2X:\nabla^2\phi
+\int_0^T\!\!\int_\Omega(1-\lambda)D'(J_\rho^\sigma(\nabla X))\,\nabla X_1\,\nabla X\cdot\nabla\phi\\
&\qquad=-\int_0^T\!\!\int_\Omega h\,D(J_\rho^\sigma(\nabla X))\nabla^2X:\nabla^2\phi
+\int_0^T\!\!\int_\Omega h\,D(J_\rho^\sigma(\nabla X))\nabla X\cdot\nabla\phi
\end{aligned}
\]
and
\[
\tag{A19}
\begin{aligned}
&-\int_0^T\Big\langle\frac{\partial P}{\partial t},\phi\Big\rangle
+\int_0^T\!\!\int_\Omega\lambda D(J_\rho^\sigma(\nabla X))\nabla^2P:\nabla^2\phi
+\int_0^T\!\!\int_\Omega(1-\lambda)D(J_\rho^\sigma(\nabla X))\nabla P\cdot\nabla\phi\\
&\quad+\int_0^T\!\!\int_\Omega\lambda D'(J_\rho^\sigma(\nabla X))\,\nabla\phi\,\nabla^2X:\nabla^2P
+\int_0^T\!\!\int_\Omega(1-\lambda)D'(J_\rho^\sigma(\nabla X))\,\nabla\phi\,\nabla X\cdot\nabla P=0.
\end{aligned}
\]
Taking $\phi=P$ in Equation (A18) and $\phi=X_1$ in Equation (A19), and using the formula
\[
\int_0^T\Big\langle\frac{\partial X_1}{\partial t},P\Big\rangle
=-\int_0^T\Big\langle\frac{\partial P}{\partial t},X_1\Big\rangle+\int_\Omega X_1(x,T)P(x,T)\,dx,
\]
we immediately obtain
\[
\int_\Omega X_1(x,T)P(x,T)\,dx
=-\int_0^T\!\!\int_\Omega h\,D(J_\rho^\sigma(\nabla X))\nabla^2X:\nabla^2P\,dx\,dt
+\int_0^T\!\!\int_\Omega h\,D(J_\rho^\sigma(\nabla X))\nabla X\cdot\nabla P\,dx\,dt.
\]
To finish the proof, we use the notation $X_1(T)=K(\lambda)h=S'(\lambda)h$ and $P(T)=W$; then
\[
\big\langle S'(\lambda)h,W\big\rangle
=\Big\langle\int_0^T\big(-D(J_\rho^\sigma(\nabla X))\nabla^2X:\nabla^2P
+D(J_\rho^\sigma(\nabla X))\nabla X\cdot\nabla P\big)\,dt,\;h\Big\rangle,
\]
which completes the proof.
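The duality argument above (test the linearized equation with $P$ and the adjoint equation with $X_1$) is the standard adjoint-state identity $\langle S'(\lambda)h,W\rangle=\langle h,S'(\lambda)^{*}W\rangle$. As a sanity check, here is a sketch on a scalar toy problem, not the paper's discretization (all names hypothetical): the backward adjoint solve reproduces the exact derivative of the parameter-to-state map.

```python
import numpy as np

# Toy scalar state equation: dX/dt = -lambda * X, X(0) = x0, S(lambda) = X(T).
# Exact derivative: S'(lambda) = -T * x0 * exp(-lambda * T).
x0, lam, T, W = 2.0, 0.8, 1.0, 3.0
n = 50_000
dt = T / n

# Forward solve by explicit Euler: X_k = x0 * (1 - dt*lam)^k.
X = x0 * (1.0 - dt * lam) ** np.arange(n + 1)

# Backward adjoint solve: dP/dt = lam * P, P(T) = W,
# discretized as P_{k-1} = P_k - dt*lam*P_k, i.e. P_k = W*(1 - dt*lam)^(n-k).
P = W * (1.0 - dt * lam) ** np.arange(n, -1, -1)

# Analogue of (A16): the adjoint derivative is a time integral of the
# state against the adjoint state, S'(lambda)^* W = -int_0^T X P dt.
grad_adjoint = -dt * np.sum(X[:-1] * P[:-1])

# Compare with the exact pairing <S'(lambda), W>.
exact = -T * x0 * np.exp(-lam * T) * W
assert abs(grad_adjoint - exact) < 1e-3
```

The point of the adjoint formulation is the same here as in the paper: one backward solve gives the derivative pairing for any direction $h$, instead of one linearized solve per direction.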

Appendix 2

We introduce the functional space $H_0^2(\Omega)$, defined as the closure of $C_c^\infty(\Omega)$ in $H^2(\Omega)$. Thanks to the Lipschitz regularity of the boundary, this space can be characterized as
\[
H_0^2(\Omega)=\Big\{u\in H_0^1(\Omega)\cap H^2(\Omega)\ \text{such that}\ \frac{\partial u}{\partial n}=0\ \text{a.e. on }\partial\Omega\Big\}.
\]
The space $H_0^2(\Omega)$ is equipped with the norm $\|\Delta u\|_{L^2(\Omega)}$, which is equivalent to $\|u\|_{H^2(\Omega)}$ for every $u\in H_0^2(\Omega)$ (see [Citation65,Citation66]). Indeed, the Poincaré inequality
\[
\forall u\in H_0^1(\Omega),\quad \|u\|_{L^2(\Omega)}\le C\|\nabla u\|_{L^2(\Omega)},
\]
together with the identity
\[
\forall u\in H_0^2(\Omega),\quad \int_\Omega u\,\Delta u\,dx=-\int_\Omega \nabla u\cdot\nabla u\,dx,
\]
leads to
\[
\forall u\in H_0^2(\Omega),\quad \|\nabla u\|_{L^2(\Omega)}^2\le\|u\|_{L^2(\Omega)}\|\Delta u\|_{L^2(\Omega)}\le C\|\nabla u\|_{L^2(\Omega)}\|\Delta u\|_{L^2(\Omega)},
\]
hence $\|\nabla u\|_{L^2(\Omega)}\le C\|\Delta u\|_{L^2(\Omega)}$. Besides, the following equality is an immediate consequence of two integrations by parts:
\[
\forall\phi\in C_c^\infty(\Omega),\quad
\int_\Omega(\Delta\phi)^2\,dx
=\sum_{i=1}^2\sum_{j=1}^2\int_\Omega\frac{\partial^2\phi}{\partial x_i^2}\,\frac{\partial^2\phi}{\partial x_j^2}\,dx
=\sum_{i=1}^2\sum_{j=1}^2\int_\Omega\Big(\frac{\partial^2\phi}{\partial x_i\partial x_j}\Big)^2\,dx.
\]
Now we give the definition of some useful operators. We have
\[
\operatorname{div}^2(A)=\frac{\partial^2A_{12}}{\partial x_1\partial x_2}+\frac{\partial^2A_{21}}{\partial x_2\partial x_1}+\sum_{i=1}^2\frac{\partial^2A_{ii}}{\partial x_i^2},
\qquad A\in C_c^\infty(\Omega,\mathbb{R}^{2\times2}),
\]
and
\[
\nabla^2f=\begin{pmatrix}
\dfrac{\partial^2f}{\partial x_1^2}&\dfrac{\partial^2f}{\partial x_1\partial x_2}\\[2mm]
\dfrac{\partial^2f}{\partial x_2\partial x_1}&\dfrac{\partial^2f}{\partial x_2^2}
\end{pmatrix},
\qquad f\in C_c^\infty(\Omega).
\]
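The operators $\operatorname{div}^2$ and $\nabla^2$ above are adjoint to each other under the integration-by-parts identity $\int_\Omega \operatorname{div}^2(A)\,\phi\,dx=\int_\Omega A:\nabla^2\phi\,dx$ used throughout Appendix 1. A finite-difference sketch (hypothetical grid and discretization, with periodic boundary conditions so that the discrete identity holds exactly) illustrates this duality:

```python
import numpy as np

rng = np.random.default_rng(0)
n, hstep = 32, 1.0 / 32

def d(f, axis):
    # Centered difference with periodic boundary (a permutation, hence
    # skew-adjoint for the grid inner product).
    return (np.roll(f, -1, axis) - np.roll(f, 1, axis)) / (2 * hstep)

def hessian(f):
    # Discrete analogue of the Hessian operator written ∇²f in Appendix 2.
    return np.array([[d(d(f, 0), 0), d(d(f, 0), 1)],
                     [d(d(f, 1), 0), d(d(f, 1), 1)]])

def div2(A):
    # Discrete second divergence: div²(A) = sum_ij ∂²A_ij/∂x_i∂x_j.
    return sum(d(d(A[i, j], i), j) for i in range(2) for j in range(2))

phi = rng.standard_normal((n, n))
A = rng.standard_normal((2, 2, n, n))

# Discrete integration by parts: <div²(A), φ> = <A, ∇²φ>.
lhs = np.sum(div2(A) * phi)
rhs = np.sum(A * hessian(phi))
assert abs(lhs - rhs) < 1e-6 * (1.0 + abs(lhs) + abs(rhs))
```

With periodic differences each `d` is skew-adjoint, so the two sign flips across the double difference make $\operatorname{div}^2$ and the Hessian exact discrete adjoints; on a bounded domain the same identity holds for compactly supported test functions, which is how it is used in the proofs above.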
