273
Views
6
CrossRef citations to date
0
Altmetric
Original Articles

A homotopy-projection method for the parameter estimation problems

, &
Pages 141-157 | Received 01 Dec 2005, Accepted 23 Oct 2006, Published online: 17 Mar 2008

Abstract

A homotopy-projection method is proposed for the parameter estimation problems of partial differential equations. To overcome the local convergence of traditional iterative methods, the homotopy method and Tikhonov regularization combine to form a new and widely convergent method. Additionally to that, projection method is introduced to fulfill the constrains described by a closed and convex set. As a practical application, the parameter estimation problem of an elliptic equation is solved. Numerical results show the method's effectiveness.

1. Introduction

The problem of determining the parameter a(x) is reduced to solving the nonlinear ill-posed operator equation (1) where with domain . Here, is not necessarily a maximal domain of definition, but rather a set of interest whose definition might include certain features of the solution one is interested in. Throughout this article we will assume that is closed and convex.

We restrict our attention to Hilbert spaces and with inner products and norms , respectively; they can always be identified from the context in which they appear.

Since is closed and convex, we assume in the following that for all there exists with for all directions where for some t>0. This together with the Lipschitz condition on the Fréchet derivative of F guarantees that where L is the Lipschitz parameter. This means F has to be Fréchet-differentiable on the subset .

Moreover, we assume that we have noisy data with (2) and that for exact data y, a solution of (1.1) exists in .

Usually the problem of equation (1.1) is difficult due to its ill-posed property, and therefore, has to be regularized (see Citation1).

Tikhonov regularization is certainly the most well-known regularization technique Citation2. It's convergence in application to nonlinear ill-posed problem have been studied in Citation3–6. An alternative to use is iterative methods. Iterative methods like regularized Gauss–Newton method Citation7, steepest descent method Citation8, Landweber's method Citation9, and other modifications (see e.g. Citation10–14) are attractive for numerical solution of nonlinear ill-posed operator equations, since these methods can be implemented in a straightforward way.

Indeed, such iterative methods have good convergence properties under some restrictions, and therefore, they have been used widely to deal with inverse problems. However, all these methods are locally convergent; the convergence depends highly on good initial estimates. In most practical applications it is difficult to find good initial values. Therefore, other widely convergent (even global convergent) approaches are needed to find solutions to inverse problems.

The homotopy method is a powerful tool for solving nonlinear problems due to its widely convergent property Citation15. It is an interesting research problem to apply the homotopy method to solve inverse problems for differential equations. Keller and Perozzi's paper Citation16 may be the first work to use the homotopy method to solve the inverse problems. They combined a homotopy path-following formulism with the singularity theory to track multiple solutions of a geophysical inverse problem Citation17; Han et al. developed a widely convergent homotopy method for determining the distributed parameters of an elliptical equation Citation18; Everett transformed a 2–D discrete inverse problem of Helmholtz equation to a corresponding polynomial system and solved it by using the homotopy method and got all the roots of the system Citation19; Jegen applied the classic Euler–Newton numerical continuation scheme to inverse problems Citation20; Bao and Liu suggested a homotopy-regularization method for inverse scattering problems with multi-experimental limited aperture data Citation21; Han designed a homotopy method for the inversion of a two-dimensional acoustic wave equation Citation22.

In this article, we consider a new idea. Firstly, the homotopy method is introduced and combined with Tikhonov regularization to give a widely convergent method. Secondly, we can expand the parameter a(x) and F(a) in some finite element spaces, then the problem of estimating a(x) is reduced to solving a finite dimensional vector formed by the coefficients of the finite expansion and the constraints described by a closed and convex set are guaranteed by projection. We call this method homotopy-projection method. Finally, as a practical application, a parameter estimation problem of elliptical equation is solved. We describe the computational process in detail for this numerical example.

The remainder of the article is organized as follows. Section 2 deals with the principle of homotopy-projection method, and constructs a practical method. In section 3, a model of an elliptical equation is described and some numerical simulations are made. Finally, in section 4, some conclusions are given.

2. The homotopy-projection method

2.1. Homotopy-regularization method

Since the problem (1.1) is ill-posed, regularization method must be employed. In Tikhonov regularization, the problem (1.1) is replaced by the minimization problem (3) where α is the regularization parameter, a0 is an initial guess for a solution of (1.1).

The iteratively regularized Gauss–Newton method (see Citation7) (4) may be used to solve (2.1). To maintain stability, a growth restriction for some r>1 on the regularization parameters is imposed, which, e.g., allows for a geometrically decaying sequence for some t∈(0,1) and α0 is a small constant. It is a locally convergent scheme, and will converge very fast when the initial guess a0 falls within the convergence domain. However it is usually very difficult to provide a good initial guess. Therefore, we consider to introduce the homotopy method.

2.1.1. Homotopy theory

We briefly introduce the principle of homotopy theory for solving nonlinear operator equations like (1.1). For more details we refer to the tutorial given by Watson Citation23. A homotopy function H(a,λ) is constructed by adding a scalar homotopy parameter λ∈[0,1] and a simple function G(a) to the “target function” F(a)−y, such that The “start system” of nonlinear equations G(a)=0 can be chosen arbitrarily, and at least has one known solution a0, which can be chosen as the initial guess for the whole computation. We wish that there may exists a curve a=a(λ), λ ∈[0,1], which satisfies the homotopy equation (5) Then it is obvious that a(0)=a0, and , which is a solution of the operator equation (1.1). Therefore, a(λ) is a curve, connected a0 with . If we use some numerical method to trace this curve from a known point a0, then we can finally reach the solution .

Basically, there are two ways to trace this curve. The first one is based on the assumption that H(a,λ) is differentiable with respect to a and λ. Differentiating each side of (2.3) on λ, we have (6) As a satisfies the initial condition a(0)=a0, the question to determine a(λ) is turned out to be an initial value problem of ordinary differential equation (2.4). By some numerical integration formulae, one can get an approximation of . The second one is to divide the interval [0,1] into , then use some iterative methods to solve the operator equation sequentially (7) Since the solution am−1 of the (m−1)th equation is obtained (note that the solution of the 0-th equation a0 is already known), am−1 can be used as the initial approximation of the m-th equation of (2.5). When λm−λm−1 is sufficiently small, am−1 would be expected as a good approximation to am, then by using a locally convergent iterative method we can achieve its convergent result. In this article, we use this method as our path tracing method.

Actually, there are other methods to trace the curve connected a0 with , when it can not be expressed as the function of λ. But it is out of our theme.

2.1.2. A homotopy method for inversion

Now, we consider the problem (1.1). If we know fairly a prior estimation a0 of , we can use the iteratively-Gauss–Newton method (2.2) to solve (1.1). However, this may be not the practical situation. Usually we can not provide a good estimation a0 for (1.1), and the Levenberg–Marquardt method may be used in trying for lots of initial estimation a0 to give a reasonable approximation of . This is a major problem when carrying out inversions.

In order to solve (2.1), we consider the fixed point homotopy equation (8) where λ ∈[0,1] is the homotopy parameter, α(λ) is the regularized parameter which continuously depends on λ (e.g., and this time, ), a0 is an arbitrary point. Obviously, when λ=0, ; when λ=1, it becomes . Therefore, H(a,λ) may be regarded as a homotopy map between and , and . Clearly, the minimizer of (2.6) satisfies (9)

We propose the minimizer a(λ) is a continuous function for λ∈ [0,1], which is the basic for the following arithmetic. In order to ensure this property, we have to incorporate some smoothness conditions as the majority of research did for nonlinear ill-posed problems. Throughout this article we will assume that the following conditions hold (see Citation25):

a.

F is twice Fréchet-differentiable with continuous second derivative.

b.

The first derivative is Lipschitz-continuous, (10)

c.

There exists ω∈ Y with (11)

Remark We do not intend to palliate the fact that source conditions (2.9) depending on the problem under consideration, might imposes severe restriction on the exact solution and it is difficult to verify. However, in the numerical experiments we can see that this condition is not necessarily satisfied.

In order to prove the continuous theorem, we need the following Lemma 2.1. The proof is similar to the proof of in Citation1. Here, for completeness, we still prove it.

LEMMA 2.1

Suppose the assumptions (a), (b), (c) are satisfied. Then (12) and (13) hold.

Proof

Since a(λ) is the minimizer of the functional H(a,λ), we obtain Therefore Set . From Taylor's Theorem, we know namely and Therefore, and thus (14) and the estimates (2.10) and (2.11) follow immediately by using (2.9). Lemma 2.1 is proved.▪

Now we prove the continuous dependence of a\,(λ) on λ.

THEOREM 1

Let the assumption of lemma 1 hold. And that a sequencem} with is given. If (15) holds, then and the error estimate (16) with (17) (18) yield.

Proof

For the following we set with Taylor's Theorem, we see Moreover, let us define then and Defining we know Since aδ(λ) is a minimizer of the functional H(a,λ), therefore, (19) From (2.17), we know we get Since we know (20) Now by using the estimates (2.10) and (2.11), and setting then we have (21) and (22) By setting we get (23) (24) Inserting (2.19–2.22) in (2.18) yields By setting (25) we find Due to . With (2.13) we obtain we have It is obvious such that The continuity is proved.▪

Due to the continuity of the map λ→ a(λ), we can divide the interval [0, 1] into . For , we use some iterative method to solve (2.7) sequently. As the solution a0 of the 0-th equation of (2.7) is known, it can be used as the initial approximation of the next equation. Assume that the approximate solution aδ,m of the m-th equation has already been found. For computing the solution aδ,m+1 of the (m+1) th equation, we consider a successive linearization method. Suppose aδ,m is known, for computing , we replace F(a) by the linear function , and substitute F(a) by in (2.7) to obtain (26) The solution of (2.24) can be regarded as the next approximation to aδ,m+1: (27) From (2.25), the approximation aM of the M-th equation can be solved, which can be taken as the initial estimate to the method (2.2). Iterate repeatedly by using (2.2) until the regularized approximation of is obtained. Therefore, (2.25) and (2.2) constitutes a stable and widely convergent method for the problem (1.1).

If we choose λm=m/M by an isometric division, mT≡0, and α is constant. Then (2.25) becomes a more simple formula: (28)

2.2. Projected iterative methods

In this article, we consider the situation that closeness of some iterate aδ,m to the exact solution is not sufficient for well-definedness of F(aδ,m), or that we search for a solution satisfying additional constrains. In this situation we eventually have to proj ect our iterates into the domain of F in order to carry out the next step or, more generally, stay in the feasible set. For this purpose, we use the (possibly nonlinear) metric projector onto . Then the regularized Gauss–Newton methods (2.2) arrive at (29) with the regularizing operator The homotopy method (2.25) arrive at (30) with the regularizing operator Note that if lies within , then the projection can be omitted and the respective step is equivalent to (2.2) or (2.26).

Kaltenbacher and Neubauer had proved the feasibility and validity of this projection method in Citation26. Here we briefly introduce this theory.

In regularization theory, α>0 is a small regularization parameter, and the regularizing operator Rα satisfies (31) (32) (33) for some positive constants c1, c2, and cs. Here the superscript † denotes the generalized inverse. With the assumption (34) and the uniformly bounded property of F, as long as the stopping index is chosen such that with a sufficiently small constant ζ, the iteration (2.27) is convergent (cf. [Citation26, Theorem 4]). When we consider the iteration (2.28), the regularizing operator Rα turns to Rα(λ), whose parameter α(λ) continuously depends on λ. However, (2.29) is still satisfied under the condition λ→1 (namely α→ 0) and the fulfillment of (2.30) and (2.31) is obviously obtainable. As a result, with the corresponding assumption like (2.32), the iteration (2.28) is feasible.

When this projection method is applied to regularization methods, where the infinite-dimensional linear operator equation is projected to a finite-dimensional subspace of the data space , we assume that (35) and is solved in a best approximate sense, so that (36) where Qm and Pm are the orthogonal projectors onto and , respectively. An advantage of regularization by discretization is that it enables the use of multi-grid method as optimal preconditions for the fast convergence of iterative solutions of the linear system in each Newton step.

Note that and that as m→∞. Hence, (2.29) and (2.31) hold. Moreover, it holds that (37)

Let us assume that the space have the property that (38) for all , where . Such properties are for instance fulfilled for finite element spaces and Sobolev spaces . The parameter hm usually plays the role of a mesh size of the discretization, with hm→ 0 as m→∞. If K has the smoothing property that (39) then we obtain an estimate (40) On the other hand, inverse inequalities yield an estimate for in terms of the mesh size, i.e., (41) for some and all linear operators K satisfying (42)

With the correspondence (43) for the regularization parameter, (2.35), (2.38), and (2.40) imply that (2.32) holds, where we additionally restrict the operators K to those satisfying (2.40). As a result, as long as (2.40) holds for all operators with . If the additional condition (44) holds, (2.39) and (2.41) imply that Rα defined by (2.34) satisfies (2.30).

As pointed out in Citation26, (2.36), (2.37), (2.39), (2.40), and (2.42) are natural conditions in the context of parameter identification in PDEs and discretization by finite elements of splines.

3. A parameter estimation problem of an elliptical equation

Let us consider the following parameter estimation problem of an elliptical equation, namely the problem of identifying the space dependent diffusivity a=a(x) in (45) from measurements of u in the domain . A solution to this problem exists if with a.e. and if . For this purpose, usually the domain of the forward operator is chosen as the set of functions in the Hilbert space with ϵ>0. Then one can not only verify Lipschitz continuity of the derivative (cf. (2.8) but also more restrictive assumptions on the nonlinearity of the forward operator F. We project onto the set for some constants . This means, we look for approximate solutions in L but consider their convergence to the exact solution in L2.

This example has been described in Citation26, here for completeness we still enumerate some properties of the model.

We will now verify all assumptions on the forward operator F made in the previous section for the one-dimensional case Ω=[0,1]: one can show that F(a) is given by and that F is differentiable in the sense of the last section, namely for all and such that for some t>0. Obviously and is exist. If fL1[0,1], then F satisfies (2.8): for it holds that The adjoint of F is given by where u(a) is the solution of (3.1) and the dot denotes derivative with respect to s. The source condition (2.9) is equivalent to the conditions and ω is given by

To numerically test the approach proposed here, we applied it to the test problem: (46) Note that and a0 satisfy the source conditions with ω=99.9f.

The solution of the forward problem (3.1) was based on linear finite elements on a uniform grid with mesh size h=1/10. The same finite element space was used to approximate a, i.e., where un solves the linear equation The Fréchet-derivative is given by where dun solves the linear equation un is as above, and An(h) is similarly defined as An(a). The diffusivity is approximated by

(2.27) in this setting then reads as follows: (47) and (2.28) arrives at (48)

In order to test the feasibility and to study the general characteristics of the new method, we carry out the numerical simulation procedure for two cases. In the process of computation, we take as error level.

At first we take a0=100 for exact data (δ=0) with . Both iterations (2.27), (2.28) and (2.2) are convergent (). Without projection, we do not get convergence via the regularized Gauss–Newton method (2.2), while the homotopy method still provides good results.

Figure 1. a0=100. The numerical results for the iterations (2.25) (left) and (2.28) and (2.2) (right) without noise. Note: Solid line denotes numerical results am and dashed line denotes exact solution a*.

Figure 1. a0=100. The numerical results for the iterations (2.25) (left) and (2.28) and (2.2) (right) without noise. Note: Solid line denotes numerical results am and dashed line denotes exact solution a*.

Then, with , we choose a0=0.01 so that the source condition is no longer satisfied. The numerical results show that only iteration (2.28) and (2.2) can converge and 5% noise level can be added (). The numerical results show that our method is indeed a widely convergent method and can improve stability of classical methods.

Figure 2. a0=0.01. The numerical results for the iterations (2.28) and (2.2) without noise (left) and 5% noise added (right). Note: Solid line denotes numerical results am and dashed line denotes exact solution a*.

Figure 2. a0=0.01. The numerical results for the iterations (2.28) and (2.2) without noise (left) and 5% noise added (right). Note: Solid line denotes numerical results am and dashed line denotes exact solution a*.

4. Conclusions

For the general parameter estimation problems, we have designed the homotopy-projection method. First, we introduce the homotopy method to overcome the defects of local convergence of conventional iterative methods. Secondly, the discretization by finite elements is not only to make the problem of estimating a function a(x) reduce solving a n-dimensional vector an but also an does not take part in the discretization. This two properties will reduce the cost of computation. Finally, constrains defined by closed and convex sets can be handled by projection. As a practical application, the method has been used successfully to solve the parameter estimation problems of an elliptical equation. It turns out to be stable and widely convergent. We have done other numerical experiments on an inverse problem of wave equation, and on auto-convolution equation, they all show our method's effectiveness (cf. Citation22).

Acknowledgement

This work is supported by the National Science Foundation of China under grant No.40544016, and the Multidiscipline Scientific Research Foundation of Harbin Institute of Technology under No. HIT. MD 2003.19.

References

  • Engl, HW, Hanke, M, and Neubauer, A, 1996. Regularization of Inverse problems. Dordrecht: Kluwer; 1996.
  • Tikhonov, AN, and Arsenin, VY, 1977. Solution of ill-posed problems. New York: John Wiley and Sons; 1977. pp. 20–40.
  • Engl, HW, Hanke, M, and Neubauer, A, 1989. Convergence rates for Tikhonov regularization of nonlinear ill-posed problem, Inverse Problems 5 (1989), pp. 523–540.
  • Hofmann, B, and Scherzer, O, 1994. Factors influnencing the ill-posedness problems, Inverse Problems 10 (1994), pp. 1277–1297.
  • Neubauer, O, 1989. Tikhonov regularization for nonlinear ill-posed problems: optimal convergence rates and finite-dimensional approximation, Inverse Problems 5 (1989), pp. 541–557.
  • Seidman, TI, and Vogel, CR, 1989. Well posedness and convergence of some regularization methods for nonlinear ill-posed problems, Inverse Problems 10 (1989), pp. 227–238.
  • Bakushinskii, AB, 1992. The problem of the convergence of the iteratively regularized Gauss-Newton method, Computational Mathematics and Mathematical Physics 32 (1992), pp. 1353–1359.
  • Scherzer, O, 1996. A convergence analysis of a method of steepest descent and a two-step algorithm for nonlinear ill-posed problems, Numerical Functional Analysis Optimization 17 (1996), pp. 197–214.
  • Hanke, M, Neubauer, A, and Scherzer, O, 1995. A convergence analysis of the Landweber iteration for nonlinear ill-posed problems, Numerical Mathematic 72 (1995), pp. 21–27.
  • Fletcher, R, 1971. A modified Marquardt subroutine for nonlinear least squares: Rep. AERE-R 6799, Atomic energy research establishment. UK: Harwell; 1971. pp. 271–307.
  • Kaltrnbacher, B, 1997. Some Newton-type methods for the regularization of nonlinear ill-posed problems, Inverse Problems 13 (1997), pp. 729–753.
  • Ramlau, R, 1999. A modified Landweber method for inverse problems, Numerical Functional Analysis Optimization 20 (1999), pp. 79–98.
  • Scherzer, O, 1998. A modified Landweber iteration for solving parameter estimation problems, Applied Mathematic Optimisation 38 (1998), pp. 45–68.
  • Song, H, and Liu, JQ, 1990. Newton regualrization method and its application, Mathematics Numerica Sinica 3 (1990), pp. 225–231.
  • Allgower, E, and Georg, K, 1990. Numerical continuation methods. Berlin: Springer-Verlag; 1990.
  • Keller, HB, and Perozzi, DJ, 1983. Fast seismic ray tracing, SIAM Journal on Applied Mathematics 43 (1983), pp. 981–992.
  • Vasco, DW, 1994. Singularity and branching: a path-following formalism for geophysical inverse problems, Geophysical Journal International 119 (1994), pp. 809–830.
  • Han, B, Shao, JQ, and Guo, BQ, 1994. A widely convergent method for determing the distributed parameters of an elliptal equation, Applied Mathematics and Computation 60 (1994), pp. 139–146.
  • Everett, ME, 1996. Homotopy, polynomial equations, gross boundary data,and small Helmhaltz systems, Journal of Computation Physics 124 (1996), pp. 431–441.
  • Jegen, MD, Everett, EM, and Schultz, A, 2001. Using homotopy to invert geophysical data, Geophysical Journal International 66 (2001), pp. 1749–1760.
  • Bao, G, and Liu, J, 2003. Numerical solution of inverse scattering problems with multiexperimental limited aperture data, SIAM Journal on Scientific Computing 25 (2003), pp. 1102–1117.
  • Han, B, Fu, HS, and Li, Z, 2005. A homotopy method for the inversion of a two-dimensional acoustic wave equation, Inverse Problems in Science and Engineering 13 (2005), pp. 411–431.
  • Watson, LT, 1989. Globally convergent homotopy methods, Applied Mathematics and Computation 31 (1989), pp. 369–396.
  • Hanke, M, 1997. A regularization Levenberg-Marquardt scheme, with application to inverse groundwater filtrations problems, Inverse Problems 13 (1997), pp. 79–95.
  • Ramlau, R, 2003. TIGRA Can iterative algorithm for regularizing nonlinear ill Cposed problems, Inverse Problems 19 (2) (2003), pp. 433–467.
  • Kaltenbacher, B, and Neubauer, A, 2006. Convergence of projected iterative regularization methods for nonlinear problems with smooth solutions., Inverse Problems 22 (3) (2006), pp. 1105–1119.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.