Inverse Problems in Science and Engineering Volume 28, 2020 - Issue 8

A generalized Newton iteration for computing the solution of the inverse Henderson problem

Fabrice Delbary, Institut für Mathematik, Johannes Gutenberg – Universität Mainz, Mainz, Germany

Martin Hanke (corresponding author), Institut für Mathematik, Johannes Gutenberg – Universität Mainz, Mainz, Germany

Dmitry Ivanizki, Institut für Mathematik, Johannes Gutenberg – Universität Mainz, Mainz, Germany

Pages 1166-1190 | Received 26 Mar 2019, Accepted 28 Nov 2019, Published online: 16 Jan 2020

https://doi.org/10.1080/17415977.2019.1710504


ABSTRACT

We develop a generalized Newton scheme called IHNC (inverse hypernetted-chain iteration) for the construction of effective pair potentials for systems of interacting point-like particles. The construction is realized in such a way that the distribution of the particles matches a given radial distribution function. The IHNC iteration uses the hypernetted-chain integral equation for an approximate evaluation of the inverse of the Jacobian of the forward operator.

In contrast to the full Newton method realized in the Inverse Monte Carlo (IMC) scheme, the IHNC algorithm requires only a single molecular dynamics computation of the radial distribution function per iteration step and no further expensive cross-correlations. Numerical experiments demonstrate that the method is as efficient as the IMC scheme, and that it readily allows thermodynamical constraints to be incorporated.

Keywords:

Coarse-graining
radial distribution function
effective potential
Iterative Boltzmann Inversion
Inverse Monte Carlo

AMS subject classifications:

65Z05
82B21

1. Introduction

A common problem in materials science is the quantification of interactions between a set of given particles. For example, in computer simulations of complex materials, where numerical multiscale techniques are indispensable for treating the relevant timescales and/or spatial resolutions (cf., e.g. Potestio, Peter, and Kremer [1]), larger atomistic structures are often replaced by artificial particles, so-called beads, and the simulation of these beads requires knowledge of the effective interactions between them and other molecules or atoms.

In the simplest case, one may assume that the beads are point-like particles, whose interactions are governed by a potential $u = u (r)$, which only depends on the distance $r > 0$ of each interacting pair of particles and vanishes in the limit $r \to \infty$. According to Henderson [2] such a pair potential $u = u (r)$ is uniquely determined by the so-called radial distribution function $g = g (r)$, which measures the number of particle pairs with a given distance in a homogeneous fluid in thermal equilibrium. The inverse Henderson problem of computing the pair potential from the given radial distribution is therefore exactly what needs to be solved in order to settle the aforementioned problem in physical chemistry.

One of the difficulties with this problem is the fact that the associated map $$G : u \mapsto g, \tag{1}$$ which takes the pair potential onto the corresponding radial distribution function (for specified values of density and temperature of the fluid), is not given in closed form, but has to be evaluated numerically, using expensive molecular dynamics or Monte Carlo simulations. It goes without saying that the inverse map $G^{- 1}$ is not known, either. Methods for solving the inverse Henderson problem therefore fall into two classes: one class uses closed-form approximations of $G$ or $G^{- 1}$, respectively, most notably the hypernetted-chain or the Percus–Yevick approximations, cf., e.g. Ben-Naim [3] or Hansen and McDonald [4]; the other class uses iterative schemes which start from a certain educated guess $u_{k}$ of $u$, simulate the corresponding radial distribution function $g_{k} = G (u_{k})$ and use this information to determine an improved approximation $u_{k + 1}$ by some sophisticated update rule, proceeding in this manner until convergence. The most prominent representatives of the latter class are the Iterative Boltzmann Inversion (IBI) and Inverse Monte Carlo (IMC); cf., e.g. Mirzoev and Lyubartsev [5], Rühle et al. [6] or Tóth [7].

In this paper, we suggest a new method of the second class, i.e. an iterative method, which combines the advantages of the two aforementioned schemes, namely the simplicity and robustness of IBI, and the rapid convergence of IMC for an appropriate initial guess. Our method is a generalized Newton iteration – as opposed to IMC, which corresponds to the much more expensive full Newton scheme for inverting (1) – and we use the hypernetted-chain approximation to compute a simplified derivative of $G$. We show by numerical examples for simulated and measured radial distribution data that the method outperforms IBI and requires about the same number of iterations as does IMC, even when the density and the temperature of the fluid are near a phase transition. We also demonstrate how to include thermodynamical constraints like a known value for the pressure of the system into our scheme. In this work, we only treat the case of a homogeneous fluid of single particles; we plan to show in a forthcoming paper how to extend the method to binary mixtures.

The outline of this paper is as follows. In the following section, we briefly summarize the necessary ingredients from statistical mechanics which are fundamental for this work. Then, in Section 3, we derive the approximation of the inverse of the Jacobian of $G$ which will be used for our generalized Newton scheme. Section 4 presents the mathematical core of this paper and is concerned with the well-posedness of different variants of our algorithm. Readers who are only interested in the algorithms and in implementation details can skip this part without any loss. In the subsequent two sections, we then discuss the numerical implementation and further extensions of these schemes; in particular, we show in Section 6 how to incorporate pressure constraints. Finally, numerical results for some benchmark systems are presented in Section 7. In the appendix, we include a proof for an extension of the classical Wiener lemma (cf., e.g. Jörgens [8]) to some weighted $L^{\infty}$ space, which is needed for our mathematical analysis.

2. Mathematical setting of the problem

Consider an ensemble of identical classical point-like particles in thermodynamical equilibrium, where the interaction of the particles is given in terms of a pair potential $u : \mathbb{R}^{+} \to \mathbb{R}$ of Lennard–Jones type, i.e. there exist a core radius $r_{0} > 0$ and a parameter $α > 3$ such that $$\begin{aligned} u (r) & \geq a r^{- α}, & r & \leq r_{0}, \\ | u (r) | & \leq b r^{- α}, & r & \geq r_{0}, \end{aligned} \tag{2}$$ for suitable constants $a, b > 0$. We assume that the number of particles and the size of the spatial domain under consideration are so big that one can treat this ensemble in the thermodynamical limit, i.e. as if it fills the full space $\mathbb{R}^{3}$. For our mathematical analysis, we further assume that the counting density $ρ_{0} > 0$ of the ensemble is sufficiently small and the temperature $T > 0$ is sufficiently large, so that the system is in its so-called gas phase, cf., e.g. Ruelle [9, p. 84].

The radial distribution function $g : \mathbb{R}^{+} \to \mathbb{R}^{+}$, referred to in the introduction, measures the number of particle pairs at distance $r > 0$, normalized in such a way that $g (r) \to 1$ as $r \to \infty$; see [4] for the precise definition of this function. Then, as shown in [10], the map $G$ of (1), which takes $u$ onto $g$, is well-defined and differentiable in a certain neighbourhood of $u$ with respect to the Banach space $V$ of perturbations $v$ of $u$, for which the corresponding norm $$‖ v ‖_{V} = \max \{ ‖ v / u ‖_{(0, r_{0}]}, ‖ ϱ v ‖_{[r_{0}, \infty)} \} \tag{3}$$ is sufficiently small; here, $$ϱ (r) = (1 + r^{2})^{α / 2}, \quad r \geq 0, \tag{4}$$ is a weight function associated with the parameter $α$ of (2), and for a real interval $I \subset \mathbb{R}$ the expression $‖ \cdot ‖_{I}$ refers to the supremum norm of real functions defined on this respective interval.

In [11], it has been shown that the so-called pair correlation function $h = g - 1$ for a Lennard–Jones type pair potential given by (2) belongs to the Banach space $L_{ϱ}^{\infty}$ of functions $f \in L^{\infty}$ with finite norm $$‖ f ‖_{L_{ϱ}^{\infty}} = ‖ ϱ f ‖_{(0, \infty)}, \tag{5}$$ where $ϱ$ is defined in (4). Since $α > 3$, the radially symmetric extension of any $f \in L_{ϱ}^{\infty}$ to the full space $\mathbb{R}^{3}$ is absolutely integrable and has a well-defined (three-dimensional) continuous Fourier transform. This is important, because although $u$, $g$, and $h$ are defined as functions of a positive argument $r > 0$, they can be viewed as representations of radial functions of a three-dimensional spatial variable in full space. In particular, the Fourier transform of the corresponding extension of $h$ – which is again radially symmetric and can therefore be represented by a function $\hat{h} : [0, \infty) \to \mathbb{R}$ by some slight abuse of the standard notation – is used to define the structure factor $$S (ω) = 1 + ρ_{0} \hat{h} (ω), \quad ω \geq 0, \tag{6}$$ which is known to be continuous and nonnegative.

Going one step further, if $f_{1}, f_{2} \in L_{ϱ}^{\infty}$, then the three-dimensional convolution integral of their radially symmetric extensions to $\mathbb{R}^{3}$ is again a radial function and – as has also been shown in [11] – its representation (as a function defined in $\mathbb{R}^{+}$) again belongs to $L_{ϱ}^{\infty}$; we adopt the notation $f_{1} * f_{2}$ for the resulting convolution product, which turns $L_{ϱ}^{\infty}$ into a (commutative) Banach algebra.

Proposition 2.1

Let $u$ be a Lennard–Jones type pair potential (2) with parameter $α > 3$, and let the counting density $ρ_{0}$ of the ensemble be sufficiently small. Using the pair correlation function $h$ of this ensemble and the above definition of the convolution product in $L_{ϱ}^{\infty}$, define $$A : L_{ϱ}^{\infty} \to L_{ϱ}^{\infty}, \quad A : f \mapsto ρ_{0} h * f . \tag{7}$$ Then $I + A$ is invertible in $\mathcal{L} (L_{ϱ}^{\infty})$, if the structure factor (6) is strictly positive.

The proof of this result follows from a weighted version of the Wiener lemma, stated and proved in the appendix, cf. Lemma A.1.

Under the assumptions of Proposition 2.1, it follows in particular that the so-called Ornstein–Zernike relation $$c + ρ_{0} h * c = h \tag{8}$$ has a unique solution $c \in L_{ϱ}^{\infty}$, known as the direct correlation function, cf. [4]. Then, with $k_{B}$ the Boltzmann constant and $β = \frac{1}{k_{B} T}$ the inverse temperature, the hypernetted-chain approximation mentioned in the introduction states that $$g \approx e^{- β u + h - c} . \tag{9}$$ Historically, (9) has been used to approximate $g$ without lengthy molecular dynamics simulations, but by solving a (nonlinear) integral equation instead. On the other hand, (9) can be solved for $u$ to provide an explicit approximation $u_{HNC}$ of the true pair potential, namely $$u_{HNC} = U (g) = - \frac{1}{β} \log g + \frac{1}{β} (h - c), \tag{10}$$ which only depends on quantities that are readily available from the given radial distribution function.
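As an added illustration of why the positivity of the structure factor enters here: the three-dimensional Fourier transform turns the convolution product into a pointwise product, so the Ornstein–Zernike relation (8) can formally be solved frequency by frequency. In the short calculation below, $\hat{c}$ denotes the transform of $c$ in the same sense as $\hat{h}$; the computation is formal, and the statement that $c$ again belongs to $L_{ϱ}^{\infty}$ is what requires the weighted Wiener lemma.

```latex
\hat{c}(\omega) + \rho_{0}\,\hat{h}(\omega)\,\hat{c}(\omega) = \hat{h}(\omega)
\quad\Longrightarrow\quad
\hat{c}(\omega) = \frac{\hat{h}(\omega)}{1 + \rho_{0}\,\hat{h}(\omega)}
               = \frac{\hat{h}(\omega)}{S(\omega)}, \qquad \omega \geq 0 .
```

The division is admissible at every frequency precisely when $S (ω) > 0$, which is the assumption of Proposition 2.1.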

We close this section by formally differentiating $U$ of (10) to determine the impact of small perturbations $g^{'}$ of $g$ on $u_{HNC}$, namely $$U^{'} (g) g^{'} = - \frac{1}{β} \frac{g^{'}}{g} + \frac{1}{β} (g^{'} - c^{'}), \tag{11}$$ where $c^{'}$ is the derivative of $c$ with respect to $g$ or $h$, respectively: Using (8) and the fact that $(L_{ϱ}^{\infty}, *)$ is a Banach algebra, we conclude that $$c^{'} + ρ_{0} h * c^{'} + ρ_{0} g^{'} * c = g^{'} . \tag{12}$$ Convolving this equation with $ρ_{0} h$, adding the result to (12) again, and using the associativity and commutativity of the convolution product, we obtain $$c^{'} + 2 ρ_{0} h * c^{'} + ρ_{0}^{2} h * h * c^{'} + ρ_{0} (c + ρ_{0} h * c) * g^{'} = g^{'} + ρ_{0} h * g^{'},$$ and inserting (8), this yields $$c^{'} + 2 ρ_{0} h * c^{'} + ρ_{0}^{2} h * h * c^{'} = g^{'} .$$ With the operator $A$ of Proposition 2.1, the latter can be rewritten as $$(I + A)^{2} c^{'} = g^{'},$$ showing that $c^{'} \in L_{ϱ}^{\infty}$ is well-defined when the structure factor is positive. Inserting this identity into (11), we eventually obtain $$U^{'} (g) g^{'} = - \frac{1}{β} \frac{g^{'}}{g} + \frac{1}{β} φ, \tag{13}$$ where $$φ = (I + A)^{- 2} (2 I + A) A g^{'} . \tag{14}$$
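The last step, from $(I + A)^{2} c^{'} = g^{'}$ to the expression (14) for $φ = g^{'} - c^{'}$, can be spelled out as follows. This is only an expanded version of the algebra above; it uses nothing beyond the invertibility of $I + A$ and the fact that all operators involved commute.

```latex
\begin{aligned}
g' - c' &= \bigl(I - (I + A)^{-2}\bigr)\, g' \\
        &= (I + A)^{-2}\bigl((I + A)^{2} - I\bigr)\, g' \\
        &= (I + A)^{-2}\bigl(2A + A^{2}\bigr)\, g' \\
        &= (I + A)^{-2}(2I + A)\,A\, g' \;=\; \varphi .
\end{aligned}
```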

3. Generalized Newton schemes for the inverse Henderson problem

We now present iterative algorithms for an approximate solution of the inverse Henderson problem, i.e. for determining a pair potential $\tilde{u}$ , for which the associated radial distribution function $G (\tilde{u})$ is close to the given data g for specified values of $ρ_{0}$ and β.

One of the most successful methods of this kind is the Iterative Boltzmann Inversion (IBI) $$u_{k + 1} = u_{k} + \frac{1}{β} \log \frac{g_{k}}{g}, \quad g_{k} = G (u_{k}), \tag{15}$$ $k = 0, 1, 2, \dots$, originally suggested by Schommers [12]. In (15), as in Section 2, $g, g_{k}, u$ and $u_{k}$ are functions of $r \in (0, \infty)$, and we continue to omit the independent variable $r$ as long as there is no danger of confusion. Note that each iteration of IBI requires an expensive evaluation of the forward operator $G$. IBI is widely used, because it has been found to be fairly robust. Soper, who redeveloped this scheme in [13], gave some heuristic arguments to support this observation. However, a rigorous convergence analysis is still lacking; see [11] for some preliminary results in this direction.
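On tabulated data, a single IBI update (15) is one array operation. The following minimal sketch assumes that $g$, $g_{k}$ and $u_{k}$ are sampled on a common radial grid. The function name `ibi_update` and the choice to leave the potential unchanged wherever one of the distributions vanishes are illustrative assumptions, not part of the scheme as published; the expensive forward evaluation $g_{k} = G (u_{k})$ is not shown.

```python
import numpy as np


def ibi_update(u_k, g_k, g_target, beta):
    """One IBI step (15): u_{k+1} = u_k + (1/beta) * log(g_k / g)."""
    u_k = np.asarray(u_k, dtype=float)
    g_k = np.asarray(g_k, dtype=float)
    g_target = np.asarray(g_target, dtype=float)

    u_next = u_k.copy()
    # The logarithm is only defined where both distributions are positive.
    # Elsewhere (typically the core region) this sketch keeps u_k as it is.
    ok = (g_k > 0.0) & (g_target > 0.0)
    u_next[ok] += np.log(g_k[ok] / g_target[ok]) / beta
    return u_next
```

If the simulated and the target distributions coincide, the update returns the potential unchanged, which reflects the fixed-point property of the iteration.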

A certain shortcoming of IBI is that it may require quite a few iterations to determine a sufficiently accurate potential. In [14], Lyubartsev and Laaksonen therefore proposed the Newton method $$u_{k + 1} = u_{k} + G^{'} (u_{k})^{- 1} (g - g_{k}), \quad g_{k} = G (u_{k}), \tag{16}$$ $k = 0, 1, 2, \dots$, as an alternative. In this scheme, now called Inverse Monte Carlo (IMC),¹ the numerical evaluation of the Fréchet derivative of $G$ can be implemented by using higher order statistics of the ensemble corresponding to some integrated 3- and 4-particle distribution functions. As it requires longer forward simulations to achieve sufficiently accurate statistics of these higher order distribution functions, each IMC iteration is much more expensive than one step of IBI. Another shortcoming of IMC is the need to start the iteration with a fairly accurate initial guess. It is therefore sometimes recommended to first run a number of IBI steps before switching to IMC, cf., e.g. Mirzoev and Lyubartsev [5] or Murtola et al. [15].

Here we consider a generalized Newton scheme, where $G^{'} (u_{k})^{- 1}$ in (16) is replaced by some approximation. Note, for example, that the low-density approximation $$G (u) \approx G_{LDL} (u) = e^{- β u},$$ which is correct of order $O (ρ_{0})$ as $ρ_{0} \to 0$, suggests replacing $$G^{'} (u_{k})^{- 1} g^{'} \approx G_{LDL}^{'} (u)^{- 1} g^{'} = - \frac{1}{β} e^{β u} g^{'} \approx - \frac{1}{β} \frac{g^{'}}{g},$$ cf. [16]. When using this approximation in (16) we arrive at the iterative scheme $$u_{k + 1} = u_{k} + \frac{1}{β} \frac{g_{k} - g}{g}, \quad k = 0, 1, 2, \dots . \tag{17}$$ Note that this is reminiscent of the IBI scheme (15), because $$\log \frac{g_{k}}{g} = \log \Bigl( 1 + \frac{g_{k} - g}{g} \Bigr) \approx \frac{g_{k} - g}{g}$$ for $g_{k}$ close to $g$. In fact, in our numerical experiments we did not observe a significant difference between the performance of the two iterative schemes (15) and (17).

We therefore propose a more sophisticated approximation of $G$, namely one that is based on the hypernetted-chain approximation (9), which is correct of order $O (ρ_{0}^{2})$ as $ρ_{0} \to 0$, cf. [4], to obtain a useful compromise between IBI and IMC. To be specific, with $U$ of (10) we approximate $$G^{'} (u_{k})^{- 1} g^{'} \approx U^{'} (g) g^{'} = - \frac{1}{β} \frac{g^{'}}{g} + \frac{1}{β} φ, \tag{18}$$ cf. (13), where $g$ is the measured radial distribution function and $φ$ is given by (14) with $A$ of (7). Inserting (18) into (16) we thus obtain the iteration $$u_{k + 1} = u_{k} + \frac{1}{β} \frac{g_{k} - g}{g} + \frac{1}{β} φ_{k}, \quad k = 0, 1, 2, \dots, \tag{19a}$$ with $$φ_{k} = (I + A)^{- 2} (2 I + A) A (g - g_{k}) . \tag{19b}$$ We call (19) the hypernetted-chain Newton iteration (HNCN). Take note that this approach does not involve a computation of the hypernetted-chain approximation $u_{HNC}$ of (10) itself; the hypernetted-chain approximation is only used formally to determine an approximate Newton inverse. Accordingly, when the iteration (19) converges, i.e. when $u_{k} \to u$ and $g_{k} \to g$ as $k \to \infty$, then the limit $u$ is the true solution of the Henderson problem for the given data.

Note that HNCN coincides with (17) up to an additive correction term. The similarity between (17) and IBI therefore suggests also considering the alternative IBI-type scheme $$u_{k + 1} = u_{k} + \frac{1}{β} \log \frac{g_{k}}{g} + \frac{1}{β} φ_{k}, \quad k = 0, 1, 2, \dots, \tag{20}$$ with $φ_{k}$ of (19b), which we call the inverse hypernetted-chain iteration (IHNC).
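Once the correction term $φ_{k}$ of (19b) is available on the radial grid, the two updates (19a) and (20) differ only in their first-order term. The sketch below makes this explicit; the function names are illustrative, and it is assumed that all arrays are restricted to radii where $g$ and $g_{k}$ are positive and that `phi_k` has been computed beforehand.

```python
import numpy as np


def hncn_update(u_k, g_k, g, phi_k, beta):
    """HNCN step (19a): u_k + (1/beta) * (g_k - g) / g + (1/beta) * phi_k."""
    return u_k + ((g_k - g) / g + phi_k) / beta


def ihnc_update(u_k, g_k, g, phi_k, beta):
    """IHNC step (20): u_k + (1/beta) * log(g_k / g) + (1/beta) * phi_k."""
    return u_k + (np.log(g_k / g) + phi_k) / beta
```

For $g_{k}$ close to $g$ the two results agree up to terms of second order in $(g_{k} - g) / g$, in line with the expansion of the logarithm used to relate (17) to IBI.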

We finally mention that IHNC and HNCN differ from the so-called LWR scheme developed by Levesque, Weis, and Reatto [17] and rediscovered recently by Heinen [18]: in our notation the LWR scheme proceeds by computing $$u_{k + 1} = u_{k} + \frac{1}{β} \log \frac{g_{k}}{g} + \frac{1}{β} (g - g_{k} - c + c_{k}), \quad k = 0, 1, 2, \dots,$$ where $c$ is the direct correlation function (8), and $c_{k}$ is defined accordingly via $c_{k} + ρ_{0} h_{k} * c_{k} = h_{k}$ with $h_{k} = g_{k} - 1$. It is straightforward to verify that the LWR scheme can also be rewritten as $$u_{k + 1} = u_{k} + U (g) - U (g_{k})$$ with $U$ of (10), hence the LWR update of the potential can be seen as the secant approximation of $U^{'} (g) (g - g_{k})$ used by the HNCN scheme. While this may appear at first sight to be only a minor difference between the two schemes, the tangent approximation turns out to be crucial to allow for subsequent extensions of the HNCN scheme described in Section 6.
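The verification that the LWR update equals $U (g) - U (g_{k})$ takes one line once (10) is written out for both $g$ and $g_{k}$; it is added here for convenience and uses only $h = g - 1$ and $h_{k} = g_{k} - 1$.

```latex
\begin{aligned}
U(g) - U(g_{k})
  &= -\frac{1}{\beta}\log g + \frac{1}{\beta}(h - c)
     + \frac{1}{\beta}\log g_{k} - \frac{1}{\beta}(h_{k} - c_{k}) \\
  &= \frac{1}{\beta}\log\frac{g_{k}}{g}
     + \frac{1}{\beta}\bigl(h - h_{k} - c + c_{k}\bigr) \\
  &= \frac{1}{\beta}\log\frac{g_{k}}{g}
     + \frac{1}{\beta}\bigl(g - g_{k} - c + c_{k}\bigr).
\end{aligned}
```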

4. Well-posedness of the IHNC and HNCN schemes

We are now going to analyse the two new iterative schemes (19) and (20) along the lines of the analysis of IBI in [11]. For this we work in the topology of the Banach space $V$ defined in (3).

Proposition 4.1

Let $u$ be a Lennard–Jones type pair potential and $ρ_{0}$ be sufficiently small. Moreover, assume that the structure factor (6) is a strictly positive function. Then the IHNC iteration (20) is well-posed in the following sense: If $‖ u_{0} - u ‖_{V}$ is sufficiently small, then $u_{1}$ is again a Lennard–Jones type pair potential, and there holds $$‖ u_{1} - u ‖_{V} \leq C ‖ u_{0} - u ‖_{V}$$ for some $C > 0$, depending on $u$, $ρ_{0}$, and the inverse temperature $β$.

Proof.

In the analysis of IBI in [11] it has been shown that $$‖ \log (g_{0} / g) ‖_{V} \leq C ‖ u_{0} - u ‖_{V} \tag{21}$$ for some constant $C > 0$, cf. [11, (6.3)]. Furthermore, since $L_{ϱ}^{\infty}$ is continuously embedded in $V$ because of (2), and since $A$ and $(I + A)^{- 1}$ belong to $\mathcal{L} (L_{ϱ}^{\infty})$ by virtue of Proposition 2.1, it follows from (19b) that $$‖ φ_{0} ‖_{V} \leq C ‖ φ_{0} ‖_{L_{ϱ}^{\infty}} \leq C ‖ g_{0} - g ‖_{L_{ϱ}^{\infty}} \leq C ‖ u_{0} - u ‖_{V}$$ for some (other) constants $C > 0$ that may be different in each of the individual terms; here, the last inequality is borrowed from [11, Theorem 5.3]. Together with (20) and (21) we thus obtain the assertion.

Concerning HNCN we have a similar result which is stated next, but this one requires $u_{0}$ to be close to u in the stronger norm of $L_{ϱ}^{\infty}$ .

Theorem 4.2

Under the assumptions of Proposition 4.1, the HNCN iteration (19) is conditionally well-posed in the following sense: If $‖ u_{0} - u ‖_{L_{ϱ}^{\infty}}$ is sufficiently small, then $u_{1}$ is again a Lennard–Jones type pair potential, and there holds $$‖ u_{1} - u ‖_{L_{ϱ}^{\infty}} \leq C ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}}$$ for some $C > 0$, depending on $u$, $ρ_{0}$, and the inverse temperature $β$.

Proof.

According to (19), there holds $$u_{1} - u = u_{0} - u + \frac{1}{β} \frac{g_{0} - g}{g} + \frac{1}{β} φ_{0},$$ where $‖ φ_{0} ‖_{L_{ϱ}^{\infty}} \leq C ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}}$ for some constant $C > 0$ by virtue of (21), because $L_{ϱ}^{\infty}$ is continuously embedded in $V$. It therefore remains to prove that $$\Bigl‖ \frac{g_{0} - g}{g} \Bigr‖_{L_{ϱ}^{\infty}} \leq C ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}} \tag{22}$$ for some (other) suitable $C > 0$.

Consider first a fixed radius $r \geq r_{0}$. We rewrite $g (r) = y (r) e^{- β u (r)}$ in terms of the cavity distribution function $y$, compare [4], which is known to be bounded away from zero for small enough density $ρ_{0}$ according to Proposition 3.1 in [11]. It follows that $g$ is bounded away from zero for $r \geq r_{0}$, and hence, there exist constants $C > 0$ such that $$\begin{aligned} ϱ (r) \Bigl| \frac{g_{0} (r) - g (r)}{g (r)} \Bigr| & \leq C ϱ (r) | g_{0} (r) - g (r) | \leq C ‖ g_{0} - g ‖_{L_{ϱ}^{\infty}} \\ & \leq C ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}}, \quad r \geq r_{0}; \end{aligned} \tag{23}$$ compare (21) again for the final estimate.

For a fixed radius $r$ with $0 < r \leq r_{0}$, on the other hand, we use the cavity distribution functions $y_{0}$ and $y$ corresponding to $u_{0}$ and $u$, respectively, and rewrite $$\frac{g_{0} (r) - g (r)}{g (r)} = \frac{e^{β u (r)} (g_{0} (r) - g (r))}{y (r)} .$$ Since $y$ is bounded away from zero we deduce from the mean value theorem that $$\begin{aligned} ϱ (r) \Bigl| \frac{g_{0} (r) - g (r)}{g (r)} \Bigr| & \leq C e^{β u (r)} | g_{0} (r) - g (r) | \\ & \leq C \bigl( | y_{0} (r) - y (r) | + g_{0} (r) | e^{β u (r)} - e^{β u_{0} (r)} | \bigr) \\ & = C \bigl( | y_{0} (r) - y (r) | + β g_{0} (r) e^{β \tilde{u}} | u (r) - u_{0} (r) | \bigr) \end{aligned}$$ for some $C > 0$ independent of $r$ and some $\tilde{u}$ between $u_{0} (r)$ and $u (r)$. Note that the latter implies that $$\tilde{u} \leq u_{0} (r) + | u_{0} (r) - u (r) | \leq u_{0} (r) + ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}} .$$ Since the cavity distribution function in $L^{\infty} (\mathbb{R}^{+})$ depends locally Lipschitz continuously on the pair potential in $L_{ϱ}^{\infty}$ (see Proposition 3.1 in [11]) it follows that $$\begin{aligned} ϱ (r) \Bigl| \frac{g_{0} (r) - g (r)}{g (r)} \Bigr| & \leq C ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}} \bigl( 1 + β g_{0} (r) e^{β \tilde{u}} \bigr) \\ & \leq C ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}} \bigl( 1 + β y_{0} (r) e^{β ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}}} \bigr) \\ & \leq C ‖ u_{0} - u ‖_{L_{ϱ}^{\infty}}, \quad 0 < r \leq r_{0}, \end{aligned}$$ for some suitable constants $C > 0$, provided that $‖ u_{0} - u ‖_{L_{ϱ}^{\infty}}$ is sufficiently small. Since this bound is independent of $r \in (0, r_{0}]$, we have thus established (23) also for $0 < r \leq r_{0}$, and hence the proof of (22) is complete.

Theorem 4.2 indicates that the HNCN iteration requires a better initial approximation of the true potential within the core region $0 < r \leq r_{0}$ than IHNC. Nevertheless, as shown in [11], if the data $g$ are exact, then the potential of mean force, $$u_{0} = - \frac{1}{β} \log g, \tag{24}$$ which is often taken as initial guess in practice, does satisfy $u_{0} - u \in L_{ϱ}^{\infty}$, which means that the assumptions of Theorem 4.2 are not too far-fetched. We have used the potential of mean force in all our experiments (with the simulated, i.e. noisy radial distribution function $g$ as input), and both HNCN and IHNC performed well with this choice, see Section 7.
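For tabulated data the initial guess (24) is a pointwise logarithm. The sketch below is one possible way to evaluate it; the function name and the convention of marking bins with $g = 0$ by `nan` (so that they can be filled in by a separate extrapolation step) are assumptions made for this illustration.

```python
import numpy as np


def potential_of_mean_force(g, beta):
    """Initial guess (24): u_0 = -(1/beta) * log(g), evaluated where g > 0."""
    g = np.asarray(g, dtype=float)
    u0 = np.full(g.shape, np.nan)  # nan marks bins without a usable value
    positive = g > 0.0
    u0[positive] = -np.log(g[positive]) / beta
    return u0
```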

5. Numerical discretization

Compared with IBI the only additional difficulty in a numerical implementation of HNCN and IHNC consists in computing $φ_{k}$ of (19b). To simplify notation let us denote by $$T = (I + A)^{- 2} (2 I + A) A \tag{25}$$ the operator occurring in (19b). Recall that $A$ corresponds to a three-dimensional convolution integral with $ρ_{0}$ times the radially symmetric extension of the pair correlation function $h = g - 1$ as convolution kernel, cf. (7). The natural framework for discretizing $A$ and $T$ is therefore the Fourier space, using the representation $$\hat{f} (ω) = \frac{2}{ω} \int_{0}^{\infty} r f (r) \sin (2 π r ω) \, d r \tag{26a}$$ for the three-dimensional Fourier transform of the radially symmetric extension of $f \in L_{ϱ}^{\infty}$, where $ω > 0$ is the absolute value of the three-dimensional frequency. Likewise, we can compute $f$ from $\hat{f}$ by using the formula $$f (r) = \frac{2}{r} \int_{0}^{\infty} ω \hat{f} (ω) \sin (2 π r ω) \, d ω . \tag{26b}$$ To implement $φ = T f$ for $f \in L_{ϱ}^{\infty}$ we therefore need to determine $\hat{f}$ and the corresponding representation $\hat{h}$ for $h$, form $$\hat{φ} = \frac{2 + ρ_{0} \hat{h}}{(1 + ρ_{0} \hat{h})^{2}} ρ_{0} \hat{h} \hat{f}, \tag{27}$$ and transform back using (26b) to obtain $φ$.
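The three steps just described (transform, multiply by the symbol in (27), transform back) can be sketched with plain quadrature sums. The code below is an illustration under stated assumptions rather than the authors' implementation: it uses the grid $r_{j} = j Δ r$ and the frequencies $ω_{l} = l / (2 Δ r (m + 1))$ for $j, l = 1, \dots, m$, for which $\sin (2 π r_{j} ω_{l}) = \sin (π j l / (m + 1))$ and the discrete versions of (26a) and (26b) are exact inverses of each other. All function names are hypothetical, and the dense sine matrix is chosen for clarity, not speed.

```python
import numpy as np


def radial_grids(m, dr):
    """Radii r_j, frequencies omega_l and the matrix sin(2*pi*r_j*omega_l)."""
    j = np.arange(1, m + 1)
    r = j * dr
    omega = j / (2.0 * dr * (m + 1))
    sine = np.sin(np.pi * np.outer(j, j) / (m + 1))  # symmetric in (j, l)
    return r, omega, sine


def forward_transform(f, r, omega, sine, dr):
    """Quadrature version of (26a) on the frequency grid omega."""
    return 2.0 * dr / omega * (sine @ (r * f))


def inverse_transform(fhat, r, omega, sine):
    """Quadrature version of (26b); the frequency spacing equals omega[0]."""
    domega = omega[0]
    return 2.0 * domega / r * (sine @ (omega * fhat))


def apply_T(f, h, rho0, dr):
    """Evaluate phi = T f via (27) for samples f and h on r_j = j * dr."""
    f = np.asarray(f, dtype=float)
    h = np.asarray(h, dtype=float)
    r, omega, sine = radial_grids(len(f), dr)
    fhat = forward_transform(f, r, omega, sine, dr)
    a = rho0 * forward_transform(h, r, omega, sine, dr)  # rho0 * hhat
    if np.any(1.0 + a <= 0.0):
        # (27) divides by the squared structure factor S = 1 + rho0 * hhat.
        raise ValueError("structure factor is not strictly positive on the grid")
    phihat = (2.0 + a) * a / (1.0 + a) ** 2 * fhat
    return inverse_transform(phihat, r, omega, sine)
```

In an iteration one would call `apply_T(g - g_k, g - 1.0, rho0, dr)` to obtain the samples of $φ_{k}$ of (19b).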

In order to achieve reasonable accuracy of the low frequencies of the Fourier transform of $h$, the simulation box and the particle count need to be sufficiently large. Generally this implies that the radial distribution function is being sampled on a larger radial interval than is used for tabulating the pair potential. To be specific, we will assume that the radial distribution function $g$ is given on a grid $$Δ = \{ r_{j} = j Δ r : j = 1, \dots, m \} \tag{28}$$ with $m$ grid points and spacing $Δ r > 0$, and that $h (r)$ is negligible for $r > r_{m}$. On the other hand, the potentials $u_{k}$ are being tabulated on the subgrid $$Δ^{'} = \{ r_{i} = i Δ r : i = 1, \dots, n \} \subset Δ \tag{29}$$ with $n \leq m$ grid points and the understanding that $u_{k} (r) = 0$ for $r \geq r_{n}$. Note that from a theoretical point of view $n > m$ would not make much sense, while $n = m$ would be just fine. However, to reduce computational costs of the forward simulation, a choice of $n < m$ is very reasonable and natural; a good value of the associated cut-off parameter $r_{n}$, however, is largely a matter of experience.

For a generic function $f \in L_{ϱ}^{\infty}$ which vanishes for $r > r_{m}$ and which has been sampled on $Δ$, the integral (26a) can be discretized with the trapezoidal quadrature rule. Introducing the odd extension $$ψ (r) = \begin{cases} r f (r), & r \geq 0, \\ r f (- r), & r < 0, \end{cases}$$ of $r \mapsto r f (r)$ to the whole real line (and to the extended grid with nonpositive grid points $r_{j}$ with $j \leq 0$), and taking into account that $ψ (r_{j}) = 0$ for $| j | > m$, the quadrature approximation of (26a) can be written as $$\hat{f} (ω) \approx \frac{i}{ω} \Bigl( Δ r \sum_{j = - m}^{m + 1} ψ (r_{j}) e^{- 2 π i ω r_{j}} \Bigr) . \tag{30}$$ Here the prefactor $i / ω$ compensates the factor $- 2 i$ that arises when the exponential sum over the odd sequence $ψ (r_{j})$ is reduced to the sine sum of (26a). This approximation is in good agreement with the true values of the Fourier transform of $f$ as long as $$0 \leq ω \leq ω_{*} := \frac{1}{2 Δ r},$$ provided that $f$ is negligible for $r > r_{m}$ and $\hat{f}$ is negligible for $ω > ω_{*}$, cf., e.g. Henrici [19, § 13.3]. Note that if the term in brackets in (30) is to be evaluated at the $2 (m + 1)$ frequencies $$ω_{l} = \frac{l}{m + 1} ω_{*}, \quad l = - m, \dots, m + 1,$$ then this can be implemented efficiently with a one-dimensional fast Fourier transform (fft) of length $2 (m + 1)$, simultaneously for all these frequencies.
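The fft evaluation described above can be sketched as follows. The sketch assumes NumPy's convention `np.fft.fft(a)[l] = sum_n a[n] * exp(-2j*pi*l*n/N)`, stores $ψ (r_{j})$ at index $j \bmod 2 (m + 1)$, and returns only the positive frequencies $ω_{l}$, $l = 1, \dots, m$. To stay independent of any sign convention for the prefactor, it reads off the sine sum of (26a) directly from the imaginary part of the spectrum. The function name is hypothetical.

```python
import numpy as np


def radial_transform_fft(f, dr):
    """Approximate (26a) at omega_l = l / (2 (m+1) dr), l = 1..m, using one fft."""
    f = np.asarray(f, dtype=float)
    m = len(f)
    n = 2 * (m + 1)                       # fft length
    j = np.arange(1, m + 1)

    psi = np.zeros(n)                     # psi(0) = psi(r_{m+1}) = 0
    psi[1:m + 1] = j * dr * f             # psi(r_j) = r_j f(r_j), j = 1..m
    psi[n - j] = -psi[j]                  # odd extension, index (-j) mod n

    spectrum = np.fft.fft(psi)            # sum_j psi_j exp(-2 pi i l j / n)
    omega = j / (n * dr)

    # For a real odd sequence the spectrum is purely imaginary and equals
    # -2i * sum_{j>0} psi_j sin(2 pi omega_l r_j); hence the sine sum of (26a).
    fhat = -dr * spectrum.imag[1:m + 1] / omega
    return omega, fhat
```

For a Gaussian $f (r) = e^{- π r^{2}}$, whose three-dimensional Fourier transform is $e^{- π ω^{2}}$, the output can be compared against the exact values as a quick plausibility check.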

Alternatively, a matrix representation $T \in R^{m \times m}$ of the operator T of (25) can be assembled as $T = F^{- 1} H F$ (31), where $F$ corresponds to the Fourier matrix which takes $[f (r_{j})]_{j = 1}^{m}$ onto $[\hat{f} (ω_{l})]_{l = 1}^{m}$ given by (30), and $H \in R^{m \times m}$ is a diagonal matrix with the entries $h_{l l} = \frac{2 + ρ_{0} \hat{h} (ω_{l})}{(1 + ρ_{0} \hat{h} (ω_{l}))^{2}} ρ_{0} \hat{h} (ω_{l}), \; l = 1, \dots, m,$ on its diagonal; compare (27). Note that the multiplication of $T$ with the vector $g - g_{k}$ of samples of $g - g_{k}$ results in an m-dimensional vector with the values of $φ_{k}$ of (19b) over Δ. If $Δ^{'}$ is a proper subset of Δ, then we simply cut off the redundant entries when updating the pair potential $u_{k}$, as is done in IBI.
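As an illustration of (31), the following sketch assembles a dense matrix for $T$ from the sine form of the discretized transform. The helper names `sine_transform_matrix` and `assemble_T` are hypothetical, and using a dense solve instead of an FFT is a simplification chosen for readability, not necessarily what the authors do.

```python
import numpy as np

def sine_transform_matrix(m, dr):
    """Matrix F taking [f(r_j)]_{j=1..m} to the trapezoidal approximation
    of (26a) at omega_l = l / (m + 1) / (2 dr), l = 1, ..., m."""
    j = np.arange(1, m + 1)
    r = dr * j
    omega = j / (m + 1) / (2.0 * dr)
    # F[l, j] = (2 / omega_l) * dr * r_j * sin(2 pi omega_l r_j)
    return (2.0 * dr / omega)[:, None] * np.sin(2.0 * np.pi * np.outer(omega, r)) * r[None, :]

def assemble_T(h, rho0, dr):
    """Dense matrix T = F^{-1} H F of (31) for samples h(r_j) = g(r_j) - 1."""
    h = np.asarray(h, dtype=float)
    F = sine_transform_matrix(h.size, dr)
    h_hat = F @ h
    # Diagonal entries h_ll of H, compare (27)
    diag = (2.0 + rho0 * h_hat) / (1.0 + rho0 * h_hat) ** 2 * rho0 * h_hat
    return np.linalg.solve(F, diag[:, None] * F)
```

With such a matrix, the correction term would be obtained as `phi_k = T @ (g - g_k)`, and only its first n entries would enter the update of the tabulated potential.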

Remark 5.1

We mention that common software like votca² [6] for running IBI typically comes with additional tricks for pre- and postprocessing the relevant quantities, which are not explicit in the recursion (15). The same applies to the new schemes HNCN and IHNC; more precisely the following items have been addressed in our implementation of (19) and (20):

(i) The simulated radial distribution functions will be numerically zero in the core region $0 < r \leq r_{0}$, in which case IBI as well as the new iterative schemes (19) and (20) fail to produce a well-defined potential update for these radii; instead, the potential $u_{k + 1}$ needs to be extrapolated into the core region³ by some ad hoc scheme. In our implementation, we fit and extrapolate the computed values of $u_{k + 1}$ in the core region to a function of the form $a^{'} r^{- α^{'}}$ with appropriate positive parameters $a^{'}$ and $α^{'}$.
(ii) After each iteration the new potential $u_{k + 1}$ is shifted by an additive constant to satisfy $u_{k + 1} (r_{n}) = 0$, so that the extension of $u_{k + 1}$ by zero for arguments $r > r_{n}$ is continuous.
(iii) We have used gromacs, version 2016.3 [20, 21] for the numerical computation of $g_{k} = G (u_{k})$, with interpolated input values of $u_{k}$ on a grid which is 10 times finer than $Δ^{'}$.
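The core extrapolation and the shifting step listed above might look as follows in code. This is only a sketch under assumptions that the text does not specify: the power law $a^{'} r^{- α^{'}}$ is fitted by linear least squares in log-log coordinates to a few grid points just outside the core, the potential is positive there, and the core is a contiguous block at the start of the grid. The function name and the parameter `n_fit` are hypothetical.

```python
import numpy as np

def postprocess_potential(r, u, core, n_fit=5):
    """Extrapolate u into the core region and shift it so that u(r_n) = 0.

    r     : grid points r_1, ..., r_n of the potential grid
    u     : potential values on that grid (entries in the core are ignored)
    core  : boolean mask, True for grid points inside the core region
    n_fit : number of points just outside the core used for the fit (assumed)
    """
    u = np.array(u, dtype=float)
    first = np.flatnonzero(~core)[0]          # first grid point outside the core
    idx = np.arange(first, first + n_fit)
    # log u = log a' - alpha' * log r  (requires u > 0 at the fit points)
    slope, intercept = np.polyfit(np.log(r[idx]), np.log(u[idx]), 1)
    u[core] = np.exp(intercept) * r[core] ** slope   # slope = -alpha'
    return u - u[-1]                          # enforce u(r_n) = 0
```

Fitting in log-log coordinates is one convenient choice; a nonlinear fit of the two parameters would serve the same purpose.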

We finally emphasize that our implementation of HNCN and IHNC uses no postprocessing (e.g. smoothing) of the computed radial distribution functions, nor of the approximate potentials.

6. Extensions of the method

Due to the many simplifying modelling assumptions, and also due to inevitable noise in the given data, the inverse Henderson problem may not have a solution, and even when it does, it may not be appropriate to determine a pair potential u which satisfies $G (u) = g$ exactly. Rather, one should think of the problem as an optimization problem $\text{minimize} \; ‖ g - G (u) ‖$ in some suitable norm, where the goal is to find an approximate minimizer only. In the context of our generalized Newton approach, the obvious way of treating this minimization problem numerically is via a Gauss–Newton type scheme, where each iteration consists of solving the linearized minimization problem $\text{minimize} \; ‖ g - g_{k} - G^{'} (u_{k}) v ‖$ (32) before updating $u_{k + 1} = u_{k} + v$; compare, e.g., Lyubartsev et al. [22] or Murtola et al. [15]. In view of (18) we again propose to replace $G^{'} (u_{k})$ by $U^{'} (g)^{- 1}$. With the same discretization as in Section 5, this leads to the minimization problem $\text{minimize} \; ‖ W (g - g_{k} - U^{- 1} v) ‖_{2}$ (33) over $v \in R^{m}$, where $‖ \cdot ‖_{2}$ denotes the standard Euclidean norm in $R^{m}$, $W \in R^{m \times m}$ is an appropriate nonnegative diagonal weighting matrix, and $U = - \frac{1}{β} D^{- 1} + \frac{1}{β} T$ (34) is the discretized approximation of $U^{'} (g)$, cf. (18); here, $D$ is the diagonal matrix with the samples of the given radial distribution function on its diagonal and $T$ is defined in (31).

In view of Remark 5.1, there are some numerical problems with the definition (34) of $U$. As the samples of the radial distribution function in the core region are numerically zero, the matrix $D$ will fail to be invertible; but since the potential is extended by extrapolation into the core region anyway, we need to keep track neither of the corresponding samples of g nor of the respective function values of $u_{k}$. So, by some abuse of notation, we assume in the sequel that the grid Δ only consists of the grid points $r_{j}$ in the exterior of the core region; we still denote the number of grid points in Δ by m. The resulting restriction of $D$ is invertible and defines a corresponding restriction of $U$ to the exterior of the core region.

As has been mentioned in the previous section, Δ will typically have more grid points than $Δ^{'}$, and similar to above we assume below that $Δ^{'}$ consists of the first n < m grid points $r_{j}$ of Δ outside the core region. If $Δ^{'} ⊊ Δ$, then we only admit vectors $v \in R^{m}$ for updating the pair potential which have zero entries for grid points $r_{j} \in Δ ∖ Δ^{'}$. Moreover, for several reasons we prefer to restrict admissible vectors $v$ for (33) somewhat further by substituting $v = A_{0} w$ with $w \in R^{n - 1}$ and $A_{0} = \begin{bmatrix} A \\ O \end{bmatrix}, \quad \text{where} \quad A = Δ r \begin{bmatrix} 1 & 1 & \dots & 1 \\ 0 & 1 & \dots & 1 \\ ⋮ & 0 & ⋱ & ⋮ \\ ⋮ & ⋮ & ⋱ & 1 \\ 0 & 0 & \dots & 0 \end{bmatrix} \in R^{n \times (n - 1)}$ (35) stands for a discrete (negative) antiderivative operator and $O$ is an $(m - n) \times (n - 1)$ zero block; accordingly, $v$ corresponds to a piecewise linear function v over Δ which vanishes on $Δ ∖ Δ^{'}$ and whose piecewise constant derivative on the grid intervals of $Δ^{'}$ is given by the entries of $- w$.

We thus determine the vector $u_{k + 1}$ with the values of $u_{k + 1}$ over $Δ^{'}$ by considering the weighted linear least squares problem $\text{minimize} \; ‖ W (g - g_{k} - U^{- 1} A_{0} w_{k}) ‖_{2}$ (36a), to be solved for $w_{k} \in R^{n - 1}$, and then update $u_{k + 1} = u_{k} + A w_{k}$ (36b). This we call the hypernetted-chain Gauss–Newton iteration (HNCGN).

One advantage of minimizing (36a) over $w = w_{k}$ rather than $v$ as in (33) is that this adds some correlations to neighbouring function values of the pair potentials; another advantage is that we automatically respect the normalization condition $u_{k + 1} (r_{n}) = 0$, and therefore we avoid the extra shifting step mentioned in Remark 5.1 (ii).
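A single HNCGN update (36a), (36b) can be sketched with dense linear algebra as follows. The function names are hypothetical, the matrix `T` of (31) is taken as given, and the least squares problem is solved with `numpy.linalg.lstsq` rather than by any particular method used by the authors.

```python
import numpy as np

def antiderivative_matrix(n, dr):
    """The n x (n - 1) matrix A of (35): upper triangular block of ones
    scaled by dr, followed by a zero last row."""
    A = np.zeros((n, n - 1))
    A[: n - 1, :] = dr * np.triu(np.ones((n - 1, n - 1)))
    return A

def hncgn_step(g, g_k, T, beta, n, dr, W=None):
    """Return the potential update A w_k of (36b) and the vector w_k.

    g, g_k : target and current radial distribution functions on the
             m grid points outside the core region (g must be nonzero)
    T      : m x m matrix of (31)
    n      : number of grid points of the potential grid, n <= m
    W      : optional m x m weighting matrix (identity if omitted)
    """
    m = g.size
    A = antiderivative_matrix(n, dr)
    A0 = np.vstack([A, np.zeros((m - n, n - 1))])
    U = (-np.diag(1.0 / g) + T) / beta          # discretization (34)
    M = np.linalg.solve(U, A0)                  # U^{-1} A_0
    if W is None:
        W = np.eye(m)
    w, *_ = np.linalg.lstsq(W @ M, W @ (g - g_k), rcond=None)
    return A @ w, w
```

Because the last row of `A` is zero, the returned update vanishes at $r_{n}$, which is the normalization property mentioned above.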

With HNCGN it is easy to impose additional constraints on $u_{k + 1}$. As a simple example, we treat the case that a certain value p for the pressure of the system is being imposed, because this particular constraint has often been addressed in the literature as a possibility for improving the thermodynamical properties of coarse-grained models resulting from IBI or IMC iterations, cf., e.g., [1, 15, 23–26]. In the thermodynamical limit, the pressure of the system is given by the virial integral $p = \frac{ρ_{0}}{β} - \frac{2}{3} π ρ_{0}^{2} \int_{0}^{\infty} u^{'} (r) g (r) r^{3} d r,$ provided that the pair potential is differentiable and that its derivative decays sufficiently rapidly near infinity; compare [4]. One way to enforce (approximately) the same pressure for the ensemble corresponding to the pair potential $u_{k + 1}$ (assuming that the simulated radial distribution function $g_{k + 1}$ is sufficiently close to the true one) is by constraining $u_{k + 1}$ to satisfy $\frac{2}{3} π ρ_{0}^{2} \int_{0}^{\infty} (u_{k}^{'} (r) - u_{k + 1}^{'} (r)) g (r) r^{3} d r \approx p - p_{k},$ where $p_{k}$ is the pressure corresponding to $u_{k}$; the latter can either be evaluated within the simulation run for evaluating $G (u_{k})$ or by numerical quadrature of the corresponding virial integral. Since the entries $w_{i, k}$ of $w_{k}$ approximate the values of $u_{k}^{'} - u_{k + 1}^{'}$ over the interval $(r_{i}, r_{i + 1})$, the left-hand side of the previous equation can be discretized as $\frac{2}{3} π ρ_{0}^{2} \sum_{i = 1}^{n - 1} w_{i, k} \frac{g (r_{i}) + g (r_{i + 1})}{2} \frac{r_{i + 1}^{4} - r_{i}^{4}}{4} =: ℓ^{T} w_{k}$ for a corresponding vector $ℓ \in R^{n - 1}$, and this leads to a discrete constraint of the form $ℓ^{T} w_{k} = p - p_{k}$ (36c) for all $w_{k} \in R^{n - 1}$, over which (36a) is to be minimized.
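The vector $ℓ$ of the discrete pressure constraint (36c) follows directly from the quadrature formula above; a minimal sketch (with a hypothetical function name) reads:

```python
import numpy as np

def pressure_constraint_vector(r, g, rho0):
    """Vector l in R^{n-1} with
    l_i = (2/3) pi rho0^2 * (g(r_i) + g(r_{i+1})) / 2 * (r_{i+1}^4 - r_i^4) / 4,
    where r and g hold the n grid points of the potential grid and the
    corresponding samples of the target radial distribution function."""
    r = np.asarray(r, dtype=float)
    g = np.asarray(g, dtype=float)
    g_mid = 0.5 * (g[:-1] + g[1:])              # average of g over (r_i, r_{i+1})
    r4 = (r[1:] ** 4 - r[:-1] ** 4) / 4.0       # exact integral of r^3 over the interval
    return (2.0 / 3.0) * np.pi * rho0 ** 2 * g_mid * r4
```

For a constant g the entries telescope, so that a constant vector w yields a multiple of $(r_{n}^{4} - r_{1}^{4}) / 4$; this gives a simple consistency check of an implementation.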

The standard recommendation for dealing with the constrained minimization problem (36a), (36c) numerically is to solve (36c) for one of the entries in $w_{k}$, $w_{i_{0}, k}$ say, and to use the resulting expression to eliminate this variable from (36a); cf., e.g., Björck [27]. To achieve maximal stability $i_{0}$ should be the very index for which the corresponding element $ℓ_{i_{0}}$ of $ℓ \in R^{n - 1}$ has maximal modulus. Once $w_{i_{0}, k}$ has been eliminated, (36a) becomes an unconstrained minimization problem over the remaining entries of $w_{k}$, the solution of which is given by the corresponding normal equation system, cf. [27]. The final algorithm is slightly more expensive than IHNC, but the extra cost is negligible compared to the overall costs of an individual iteration of either of the schemes.
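The elimination step can be illustrated for a generic problem of the form: minimize $‖ M w - b ‖_{2}$ subject to $ℓ^{T} w = c$. In the notation of the text, `M` would stand for $W U^{- 1} A_{0}$, `b` for $W (g - g_{k})$ and `c` for $p - p_{k}$; these names, and the use of `numpy.linalg.lstsq` for the reduced problem instead of the normal equations, are choices made for this sketch.

```python
import numpy as np

def constrained_lstsq(M, b, ell, c):
    """Minimize ||M w - b||_2 subject to ell^T w = c by direct elimination
    of the variable w_{i0} for which |ell_{i0}| is maximal."""
    ell = np.asarray(ell, dtype=float)
    i0 = int(np.argmax(np.abs(ell)))
    rest = np.delete(np.arange(ell.size), i0)

    # Substitute w_{i0} = (c - ell_rest . w_rest) / ell_{i0} into M w - b
    M_red = M[:, rest] - np.outer(M[:, i0], ell[rest]) / ell[i0]
    b_red = b - M[:, i0] * c / ell[i0]

    w_rest, *_ = np.linalg.lstsq(M_red, b_red, rcond=None)

    w = np.empty(ell.size)
    w[rest] = w_rest
    w[i0] = (c - ell[rest] @ w_rest) / ell[i0]
    return w
```

The result satisfies the constraint up to rounding errors and can be cross-checked against the solution of the corresponding Lagrange multiplier (KKT) system.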

It remains to discuss the choice of the weighting matrix $W$ in (33). A natural candidate is $W = I$, the $m \times m$ identity matrix. Alternatively, since it is known that $g - g_{k} = h - h_{k} \in L_{ϱ}^{\infty}$ for some exponent $α > 3$, one could also think of using $W$ to enforce that the discrete approximation of $g - g_{k}$ shows a similar qualitative behaviour for larger radii. In this case, the diagonal entries $w_{j j}$ of $W$ should increase with increasing index, e.g., $w_{j j} = (1 + r_{j}^{2})^{γ}, \; 1 \leq j \leq m,$ (37) for some exponent $γ > 0$. However, we found that the choice (37) with $γ > 0$ lent too much flexibility to the values of $u_{k} (r)$ for radii r near the core region, so that the computed potentials became worse eventually. In our numerical results in Section 7.3 we therefore have used $W = I$ throughout.

7. Numerical results

We now present some numerical results to illustrate the performance of the new methods as compared to IBI and IMC. For this we concentrate on the results of IHNC; in all our tests we did not see significant differences between IHNC and HNCN, but the theoretical results of Section 4 indicate that IHNC may be slightly more robust.

Our benchmark problems include simulated data for a truncated and shifted Lennard–Jones potential as well as measured data for liquid argon taken from the literature. We mention that for the latter problem, in particular, our mathematical assumption that the system be in its gas phase is violated. As it turns out, this does not affect the applicability of our algorithms, but we have no theory to explain this observation.

In all our numerical examples, we have used gromacs to evaluate the forward operator G with a molecular dynamics simulation: to be specific, we have used the leap-frog integrator with time step $dt = 0.001$ (in dimensionless units corresponding roughly to $2 fs$ in real units), coupled to the Langevin thermostat with a unit inverse friction constant to simulate ensembles with N = 2000 particles and periodic boundary conditions, i.e. $(N, V, T)$-ensembles. In each iteration, the ensemble has been equilibrated with $10^{6}$ time steps (corresponding to about $2 ns$) starting from a distribution of the particles on a regular lattice, and afterwards the radial distribution function has been determined from 3500 uncorrelated snapshots of the system (the decorrelation time between two snapshots depends on the system and spans approximately 2000 time steps, i.e. $4 ps$). For IMC the same snapshots have also been used to set up the sensitivity matrix corresponding to $G^{'}$. Remark 5.1 applies to our implementations of IBI and IMC in the same way.
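The quoted conversions between reduced and real time units can be checked with a few lines of arithmetic. The sketch below assumes, purely for illustration, that the conversion refers to argon-like Lennard–Jones parameters ($σ = 3.405 Å$, $ε / k_{B} = 119.8$ K, atomic mass 39.948 u), for which the reduced time unit is $τ = σ \sqrt{m / ε}$; the text itself does not state which parameters underlie the conversion.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant in J/K
AMU = 1.66053906660e-27   # atomic mass unit in kg

# Assumed argon-like parameters (an assumption of this sketch)
sigma = 3.405e-10         # m
eps = 119.8 * K_B         # J
mass = 39.948 * AMU       # kg

tau = sigma * math.sqrt(mass / eps)   # reduced Lennard-Jones time unit in s

dt_real = 0.001 * tau                 # one time step
equilibration = 1e6 * dt_real         # 10^6 time steps
decorrelation = 2000 * dt_real        # 2000 time steps

print(f"time step:      {dt_real * 1e15:.2f} fs")
print(f"equilibration:  {equilibration * 1e9:.2f} ns")
print(f"decorrelation:  {decorrelation * 1e12:.2f} ps")
```

Under these assumptions the time step comes out at roughly 2 fs, consistent with the orders of magnitude quoted in the text.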

7.1. Truncated and shifted Lennard–Jones fluids near phase transitions

Let $u_{LJ} (r) = 4 ε ((σ / r)^{12} - (σ / r)^{6}), \; r > 0,$ (38) be the classical Lennard–Jones potential with parameters $ε, σ > 0$. Taking $ε = σ = 1$, i.e. working in reduced (dimensionless) units with Boltzmann constant $k_{B} = 1$, we consider the truncated and shifted Lennard–Jones potential $u (r) = \begin{cases} u_{LJ} (r) - u_{LJ} (2.5), & 0 < r < 2.5, \\ 0, & r \geq 2.5, \end{cases}$ (39) i.e. the Lennard–Jones potential is shifted, so that it becomes zero at r = 2.5, and is extended continuously by zero for $r \geq 2.5$. The corresponding ensemble is studied at two different state points, namely

(a) the critical point with counting density $ρ_{0} = 0.304$ and temperature T = 1.316, cf. Smit [28],
(b) a state point in the liquid phase close to the triple point with counting density $ρ_{0} = 0.8$ and temperature T = 1, cf. Hansen and Verlet [29].

In both cases, the radial distribution function is sampled on an equidistant grid with mesh width $Δ r = 0.02$; for state point (a) we have data for m = 463 grid points covering a radial interval $r \in (0, 9.26]$, for state point (b) we have m = 335 grid points within the interval $(0, 6.7]$. The latter interval is smaller than the former one, because the density of the system is larger, and hence the simulation box is smaller. The given data are displayed as little circles in Figure 1. Note that the pair correlation function h = g − 1 decays much faster at the critical point than near the triple point; as a consequence the inverse problem is much more difficult near the triple point.
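For reference, the truncated and shifted potential (39) and the two data grids can be set up as follows; the function and variable names are illustrative only.

```python
import numpy as np

def u_lj(r, eps=1.0, sigma=1.0):
    """Classical Lennard-Jones potential (38)."""
    sr6 = (sigma / r) ** 6
    return 4.0 * eps * (sr6 ** 2 - sr6)

def u_truncated_shifted(r, r_cut=2.5):
    """Truncated and shifted Lennard-Jones potential (39) in reduced units."""
    r = np.asarray(r, dtype=float)
    return np.where(r < r_cut, u_lj(r) - u_lj(r_cut), 0.0)

dr = 0.02
r_a = dr * np.arange(1, 464)   # state point (a): m = 463, r in (0, 9.26]
r_b = dr * np.arange(1, 336)   # state point (b): m = 335, r in (0, 6.7]
```

The shift by $u_{LJ} (2.5)$ makes the potential continuous at the cut-off radius, which is what the grid-based schemes rely on when the potential is extended by zero.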

Figure 1. Truncated and shifted Lennard–Jones fluids: radial distribution functions versus radius.

To solve the inverse problem, we have tabulated the approximate potentials on the first n = 125 grid points $r_{i} \in (0, 2.5]$ of the same grid. Because of the particular definition of the IBI and IMC schemes, cf. (15) and (16), only those n grid points of the radial distribution function have been used for these two methods; this radial interval is indicated by the dashed lines in Figure 1. For IHNC, we have used the full data displayed in Figure 1. In all three iterative schemes, the same potential (24) of mean force has been used as initial guess.

The approximate radial distribution functions obtained by IBI, IHNC and IMC, respectively, are also shown in Figure 1. Essentially, all three functions are on top of each other in both plots, and they constitute perfect fits of the given data for each of the two state points. But IHNC and IMC require far fewer iterations to achieve this goal: Figure 2 provides the corresponding iteration histories of the data fit, i.e. the graphs of the functions $k \mapsto ‖ G (u_{k}) - g ‖_{\infty} / ‖ G (u_{0}) - g ‖_{\infty}$ for all three individual iteration schemes and for each of the two state points, respectively; here, $‖ G (u_{k}) - g ‖_{\infty}$ measures the maximal absolute error between all given measurement data and the corresponding approximations in the exterior of the core region. (For some obscure reason, this measure of the data fit is slightly increasing for IBI and IMC in the first iteration.) From these plots, it can be seen that IHNC requires about 5 iterations at the critical point and 11 iterations near the triple point to reach the global minimum of the data fit, while IMC requires 9 (or 5) iterations at the critical point and 7 iterations near the triple point; the data fit of the two methods is comparable, eventually. IBI, on the other hand, needs 10 iterations at the critical point and more than 20 near the triple point.

While the data fit is a straightforward indicator of the performance of the iterative schemes, the true error history is the really relevant quality measure. However, the latter is not available in practice. It is the advantage of this particular example that the true solution is known, so that the error history can be computed. For a particular potential $\tilde{u}$ given on the grid $Δ^{'}$, we define the error measure $ϵ (\tilde{u}) = \Bigl( Δ r \sum_{i = 1}^{n} g (r_{i}) (\tilde{u} (r_{i}) - u (r_{i}))^{2} r_{i}^{2} \Bigr)^{1 / 2}$ (40a), which approximates the weighted $L^{2}$ norm $ϵ (\tilde{u}) \approx \Bigl( \int_{0}^{\infty} g (r) (\tilde{u} (r) - u (r))^{2} r^{2} d r \Bigr)^{1 / 2}$ (40b) of the error $\tilde{u} - u$. This norm can be motivated by a more detailed analysis of the operator G, but this is beyond the scope of this paper; here we only emphasize that the factor g in (40b) compensates for the divergence of the potentials as $r \to 0$.
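The error measure (40a) and the relative error built from it translate into a few lines of code; the function names below are hypothetical.

```python
import numpy as np

def potential_error(u_tilde, u_true, g, r, dr):
    """Discrete weighted L2 error (40a) on the potential grid:
    sqrt(dr * sum_i g(r_i) * (u_tilde(r_i) - u_true(r_i))^2 * r_i^2)."""
    diff = np.asarray(u_tilde, dtype=float) - np.asarray(u_true, dtype=float)
    return np.sqrt(dr * np.sum(g * diff ** 2 * r ** 2))

def relative_error(u_k, u_0, u_true, g, r, dr):
    """Error of the k-th iterate relative to the error of the initial guess."""
    return potential_error(u_k, u_true, g, r, dr) / potential_error(u_0, u_true, g, r, dr)
```

Grid points in the core region, where g is numerically zero, do not contribute to the sum, which reflects the role of the weight g noted above.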

Figure 3 shows the relative error $k \mapsto ϵ (u_{k}) / ϵ (u_{0})$ (41) as a function of the iteration count for all iterative schemes and both state points, respectively. The plots confirm that the particular iterates recommended above do indeed provide good approximations of the true truncated and shifted Lennard–Jones potential. Accordingly, IMC and IHNC both converge very rapidly in much the same number of iterations, whereas IBI is doing significantly worse. To illustrate this further, the corresponding reconstructions for the more difficult problem near the triple point are shown in Figure 4. This plot displays the 11th IHNC iterate, the 7th IMC iterate and the 50th (!) IBI iterate, together with the true pair potential as a black solid line (marked ‘LJ’) and the potential $u_{0}$ of mean force as a dotted line. As can be seen, the IHNC and IMC approximations are hardly distinguishable from the true truncated and shifted Lennard–Jones potential, while even after 50 iterations IBI is still relatively far off.

7.2. Liquid argon

As a second example, we have determined approximate pair potentials for argon, using measurements by Schmidt and Tompson [30] for a state point with temperature $T = - 125^{\circ}$ C and mass density $0.982 g / {cm}^{3}$ in the liquid phase near the critical point.⁴ The corresponding data are given on an equidistant grid⁵ with m = 200 grid points and mesh width $Δ r = 0.1 Å$. The approximate pair potentials have been tabulated on the first n = 100 grid points of this grid, and taken to be identically zero for $r > 10 Å$.

Figure 2. Truncated and shifted Lennard–Jones fluids: data fit versus iteration count.

Figure 3. Truncated and shifted Lennard–Jones fluids: error (41) versus iteration count.

Figure 4. Truncated and shifted Lennard–Jones fluid: reconstructed pair potentials versus radius.

The iteration history shown in Figure 5 documents that, again, IHNC and IMC match the data much faster than IBI does: according to this plot 6 IHNC (5 IMC) iterations are sufficient, whereas IBI needs 39 iterations to achieve the same accuracy. Figure 6 presents the corresponding approximations of the radial distribution function and Figure 7 the corresponding potentials, together with the potential of mean force as dotted line. As before, all computed approximations are very close to each other.

Figure 5. Liquid argon: data fit versus iteration count.

Figure 6. Liquid argon: radial distribution functions versus radius (in $Å$ ); detail view on the right.

Figure 7. Liquid argon: approximate potentials (in units of ϵ) versus radius (in $Å$ ).

We have chosen argon as benchmark test case, because the interactions between argon atoms are widely considered to be well described by the Lennard–Jones pair potential (38) with parameters $σ = 3.405 Å \text{ and } ε = 119.8 \, k_{B} K,$ cf., e.g., Tuckerman [31, p. 127]. This Lennard–Jones potential is given by the thin black line in Figure 7, but it differs quite a bit from our computed pair potentials. In fact, the radial distribution function corresponding to this Lennard–Jones approximation, which has also been included in Figure 6, does not fit the measured data well, as can easily be seen in the magnified detail in the right-hand plot. Even the initial approximation from the potential of mean force is doing better than that. So for this real-world example, we cannot trust this Lennard–Jones potential to be the ‘ground truth’ to compare our numerical results to.

7.3. The pressure constrained HNCGN scheme

Finally, we show some numerical results for p-HNCGN, i.e. the pressure constrained hypernetted-chain Gauss–Newton iteration described in Section 6. For this we have used the same data set for liquid argon as in the previous example and have imposed the corresponding value $p = 9918.7 kPa$ of the pressure reported by Mikolaj and Pings [32].

Figures 8 and 9 present the corresponding numerical results. In the left-hand plot of Figure 8, we recollect the data fit history of Figure 5 and also show the corresponding graph for the performance of p-HNCGN: since the latter aims for a best possible fit of all 200 data points $g (r_{j})$, it reaches a smaller value than all other competing methods.

Figure 8. Liquid argon: iteration history.

Figure 9. Liquid argon: approximate potentials (in units of ϵ) versus radius (in $Å$ ).

The right-hand plot of Figure 8, on the other hand, displays the average pressure (as returned by gromacs) of all corresponding ensembles for each individual iterate of the respective methods. The correct value of the pressure is indicated by the dotted horizontal line. As can be seen, except for p-HNCGN all methods fail to reproduce this number by a factor of 3 or more. p-HNCGN, on the other hand, achieves an excellent match of the target pressure after about 12 iterations.

Assessing both plots of Figure 8, we consider the 14th iterate of p-HNCGN to be ‘optimal’, because it corresponds to the first local minimum of the data fit after having reached a fairly accurate value of the pressure. The corresponding pair potential is compared in Figure 9 with the Lennard–Jones reference and the IHNC potential from Figure 7. It can be seen that the match of the pressure has a discernible (positive) impact on the computed pair potential.

Remark 7.1

Since p-HNCGN fits the data points of the measured radial distribution function, it does provide a good fit of the compressibility $κ_{T}$ of the fluid as well, because the compressibility is given by the Kirkwood–Buff integral $\frac{ρ_{0}}{β} κ_{T} = 1 + 4 π ρ_{0} \int_{0}^{\infty} h (r) r^{2} d r,$ which only depends on the pair correlation function h = g − 1. Therefore p-HNCGN is able to fit both the compressibility and the pressure of a fluid to a reasonable accuracy. In the pertinent literature, this has been considered impossible when using isotropic pair potentials, compare, e.g., Wang, Junghans, and Kremer [26].
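On the data grid, the Kirkwood–Buff integral above can be approximated by the same kind of quadrature as used elsewhere in this paper. The following sketch (with a hypothetical function name) returns the dimensionless quantity $\frac{ρ_{0}}{β} κ_{T}$ from samples of h; it assumes that h is negligible beyond the last grid point.

```python
import numpy as np

def reduced_compressibility(h, r, dr, rho0):
    """Approximate (rho0 / beta) * kappa_T = 1 + 4 pi rho0 int_0^inf h(r) r^2 dr
    by the trapezoidal rule on the grid r_j = j * dr, j = 1, ..., m.
    The integrand vanishes at r = 0 and is assumed negligible at r_m, so the
    trapezoidal rule reduces to a plain sum over the grid points."""
    h = np.asarray(h, dtype=float)
    r = np.asarray(r, dtype=float)
    return 1.0 + 4.0 * np.pi * rho0 * dr * np.sum(h * r ** 2)
```

For the artificial test function $h (r) = - e^{- π r^{2}}$ the integral equals $- 1 / (4 π)$, so the sketch should return $1 - ρ_{0}$.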

8. Conclusion

We have determined new generalized Newton schemes for the inverse Henderson problem, where we approximate the inverse of the Jacobian by the functional derivative of the hypernetted-chain approximation of the pair potential. These methods have about the same computational costs per iteration as IBI, but need far fewer iterations near phase transitions. In terms of iteration counts they are competitive with IMC, but the individual iterations are much cheaper than the IMC ones, because no cross-correlations need to be evaluated in the numerical simulation of the corresponding ensemble of particles. While these methods turn out to be similar (but not identical) to the LWR scheme of Levesque, Weis and Reatto, they are more flexible by construction, and can easily be modified, e.g. to also match the true pressure of the target ensemble.

We finally mention that one can also use the Percus–Yevick approximation instead of the hypernetted-chain approximation for the derivation of a corresponding generalized Newton method. The resulting scheme is very similar to (20), the only difference being that $φ_{k}$ is replaced by $φ_{k} / y_{k}$, where $y_{k}$ is the cavity distribution function associated with the kth pair potential $u_{k}$. In our numerical experiments, we found the iteration (20) to perform better near phase transitions of the truncated and shifted Lennard–Jones potentials than the corresponding Percus–Yevick recursion, and therefore we have restricted our attention to the IHNC scheme in this work.

In future work, we plan to extend our methods to binary mixtures of different fluids.

Acknowledgments

We are grateful to Gergely Tóth for pointing out to us references [7, 12, 17].

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project number 233630050 - TRR 146.

Notes

1 We mention that this name may be misleading in that the approximate pair potential computed by IMC is not the result of some sophisticated Monte-Carlo simulation.

2 http://www.votca.org

3 For the core region the radius $r_{0}$ is chosen in each iteration as the smallest grid point of (28), such that g and $g_{k}$ are nonzero for every $r_{j} > r_{0}$.

4 According to the US National Institute of Standards and Technology the critical point of argon is located at about temperature $T = - {122.3}^{\circ}$ C and mass density $0.536 g / {cm}^{3}$ ; see https://webbook.nist.gov/cgi/inchi?ID=C7440371&Mask=4.

5 For $r > 10 Å$ only every second data point is tabulated in [30]; the missing data have been filled in by linear interpolation.

References

Potestio R, Peter C, Kremer K. Computer simulations of soft matter: linking the scales. Entropy. 2014;16:4199–4245. doi: 10.3390/e16084199
Web of Science ®Google Scholar
Henderson RL. A uniqueness theorem for fluid pair correlation functions. Phys Lett A. 1974;49:197–198. doi: 10.1016/0375-9601(74)90847-0
Web of Science ®Google Scholar
Ben-Naim A. Molecular theory of solutions. New York: Oxford University Press; 2006.
Google Scholar
Hansen J-P, McDonald IR. Theory of simple liquids. 4th ed. Oxford: Academic Press; 2013.
Google Scholar
Mirzoev A, Lyubartsev AP. MagiC: software package for multiscale modeling. J Chem Theory Comput. 2013;9:1512–1520. doi: 10.1021/ct301019v
PubMed Web of Science ®Google Scholar
Rühle V, Junghans C, Lukyanov A, et al. Versatile object-oriented toolkit for coarse-graining applications. J Chem Theory Comput. 2009;5:3211–3223. doi: 10.1021/ct900369w
PubMed Web of Science ®Google Scholar
Tóth G. Interactions from diffraction data: historical and comprehensive overview of simulation assisted methods. J Phys Condens Matter. 2007;19:335220.
PubMed Web of Science ®Google Scholar
Jörgens K. Linear integral operators. Boston: Pitman; 1982.
Google Scholar
Ruelle D. Statistical mechanics: rigorous results. New York: W.A. Benjamin Publ.; 1969.
Google Scholar
Hanke M. Fréchet differentiability of molecular distribution functions I. L∞ analysis. Lett Math Phys. 2018;108:285–306. doi: 10.1007/s11005-017-1009-0
Google Scholar
Hanke M. Well-posedness of the iterative Boltzmann inversion. J Stat Phys. 2018;170:536–553. doi: 10.1007/s10955-017-1944-2
Web of Science ®Google Scholar
Schommers W. A pair potential for liquid rubidium from the pair correlation function. Phys Lett. 1973;43A:157–158. doi: 10.1016/0375-9601(73)90591-4
Google Scholar
Soper AK. Empirical potential Monte Carlo simulation of fluid structure. Chem Phys. 1996;202:295–306. doi: 10.1016/0301-0104(95)00357-6
Web of Science ®Google Scholar
Lyubartsev AP, Laaksonen A. Calculation of effective interaction potentials from radial distribution functions: a reverse Monte Carlo approach. Phys Rev E. 1995;52:3730–3737. doi: 10.1103/PhysRevE.52.3730
PubMed Web of Science ®Google Scholar
Murtola T, Falck E, Karttunen M, et al. Coarse-grained model for phospholipid/cholesterol bilayer employing inverse Monte Carlo with thermodynamic constraints. J Chem Phys. 2007;126:075101.
PubMed Web of Science ®Google Scholar
Ivanizki D. Numerical analysis of the relation between interactions and structure in a molecular fluid. PhD Thesis. Johannes Gutenberg-Universität Mainz; 2015.
Google Scholar
Levesque D, Weis JJ, Reatto L. Pair interaction from structural data for dense classical liquids. Phys Rev Lett. 1985;54:451–454. doi: 10.1103/PhysRevLett.54.451
PubMed Web of Science ®Google Scholar
Heinen M. Calculating particle pair potentials from fluid-state pair correlations: iterative Ornstein–Zernike inversion. J Comput Chem. 2018;39:1531–1543. doi: 10.1002/jcc.25225
PubMed Web of Science ®Google Scholar
Henrici P. Applied and computational complex analysis. Vol. 3. New York: John Wiley & Sons; 1986.
Google Scholar
Abraham MJ, van der Spoel D, Lindahl E, et al. the Gromacs development team. Gromacs User Manual version 2016.3, www.gromacs.org (2017).
Google Scholar
Hess B, Kutzner C, van der Spoel D, et al. GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput. 2008;4:435–447. doi: 10.1021/ct700301q
PubMed Web of Science ®Google Scholar
Lyubartsev A, Mirzoev A, Chen L, et al Systematic coarse-graining of molecular models by the Newton inversion method. Faraday Discuss. 2010;144:43–56. doi: 10.1039/B901511F
PubMed Web of Science ®Google Scholar
Fu CC, Kulkarni PM, Shell MS, et al. A test of systematic coarse-graining of molecular dynamics simulations: thermodynamic properties. J Chem Phys. 2012;137:164106.
PubMed Web of Science ®Google Scholar
Jain S, Garde S, Kumar SK. Do inverse Monte Carlo algorithms yield thermodynamically consistent interaction potentials? Ind Eng Chem Res. 2006;45:5614–5618. doi: 10.1021/ie060042h
Reith D, Pütz M, Müller-Plathe F. Deriving effective mesoscale potentials from atomistic simulations. J Comput Chem. 2003;24:1624–1636. doi: 10.1002/jcc.10307
Wang H, Junghans C, Kremer K. Comparative atomistic and coarse-grained study of water: what do we lose by coarse-graining? Eur Phys J E. 2009;28:221–229. doi: 10.1140/epje/i2008-10413-5
Björck Å. Numerical methods for least squares problems. Philadelphia: SIAM; 1996.
Smit B. Phase diagrams of Lennard–Jones fluids. J Chem Phys. 1992;96:8639–8640. doi: 10.1063/1.462271
Hansen J-P, Verlet L. Phase transitions of the Lennard–Jones system. Phys Rev. 1969;184:151–161. doi: 10.1103/PhysRev.184.151
Schmidt PW, Tompson CW. X-ray scattering studies of simple fluids. In: Frisch HL and Salsburg ZW, editors. Simple Dense Fluids, New York: Academic Press; 1968. p. 31–110.
Tuckerman ME. Statistical mechanics: theory and molecular simulation. Oxford: Oxford University Press; 2010.
Mikolaj PG, Pings CJ. Structure of liquids. III. An X-ray diffraction study of fluid argon. J Chem Phys. 1967;46:1401–1411. doi: 10.1063/1.1840864

Appendix

The Wiener lemma

Throughout this appendix we only consider functions of three variables, whether or not they are radial. If $f \in L_{ϱ}^{\infty} (\mathbb{R}^{3})$ is radially symmetric, then its representation defined on $\mathbb{R}^{+}$ belongs to the Banach space $L_{ϱ}^{\infty}$ introduced in (5). The three-dimensional Fourier transform of $f \in L_{ϱ}^{\infty} (\mathbb{R}^{3})$ is denoted by $\hat{f}$.

For the weight $ϱ (r) = (1 + r^{2})^{α / 2}$, $r \geq 0$, defined in (4), we have shown in [11] that the space $L_{ϱ}^{\infty} (\mathbb{R}^{3})$ of all functions $f : \mathbb{R}^{3} \to \mathbb{R}$ for which

$$‖ f ‖_{L_{ϱ}^{\infty} (\mathbb{R}^{3})} = \operatorname*{ess\,sup}_{R \in \mathbb{R}^{3}} ϱ (| R |) \, | f (R) | < \infty$$

constitutes a Banach algebra with respect to convolution. We can extend $L_{ϱ}^{\infty} (\mathbb{R}^{3})$ to a Banach algebra $W_{ϱ}$ with unit element e (given by the delta distribution at the origin), using the canonical norm

$$‖ λ e + f ‖_{W_{ϱ}} = | λ | + γ ‖ f ‖_{L_{ϱ}^{\infty} (\mathbb{R}^{3})}, \qquad λ \in \mathbb{R}, \; f \in L_{ϱ}^{\infty} (\mathbb{R}^{3}),$$

where $γ > 0$ is a small enough constant to make the norm of $W_{ϱ}$ submultiplicative.

The standard Wiener lemma for the Fourier transform starts with a similar construction for the Banach algebra $L^{1} (\mathbb{R}^{3})$ and states that if $f \in L^{1} (\mathbb{R}^{3})$ is such that $1 + \hat{f} \neq 0$, then

$$(1 + \hat{f})^{- 1} = 1 - \hat{c} \tag{A1}$$

for some $c \in L^{1} (\mathbb{R}^{3})$ with Fourier transform $\hat{c}$; cf., e.g., Jörgens [8]. The weighted Wiener lemma, which is required for the proof of Proposition 2.1, reads as follows.

Lemma A.1

Let $f \in L_{ϱ}^{\infty} (\mathbb{R}^{3})$ be such that $1 + \hat{f} \neq 0$. Then the function c of (A1) belongs to $L_{ϱ}^{\infty} (\mathbb{R}^{3})$. If f is a radial function, so is c.
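As a numerical illustration of the lemma for a radial function, the following sketch computes c from (A1) in two independent ways and compares the results. Everything specific in it is an assumption made for the example and not taken from the paper: the Gaussian test function $f (R) = a e^{- | R |^{2} / 2}$, the Fourier convention $\hat{f} (k) = \int f (R) e^{- i k \cdot R} d R$, the value $q = ‖ f ‖_{L^{1}} = 0.5$, and the quadrature parameters. Because $q < 1$, the alternating series $c = f - f * f + f * f * f - \ldots$ converges, and the autoconvolutions of a Gaussian are known in closed form.

```python
import numpy as np

# Assumed test function (not from the paper): f(R) = a * exp(-|R|^2 / 2).
# With f_hat(k) = int f(R) exp(-i k.R) dR one gets f_hat(k) = q * exp(-k^2 / 2),
# where q = a * (2 pi)^(3/2) is the L^1 norm of f.
q = 0.5

def c_hat(k):
    """(A1) solved for c_hat: c_hat = f_hat / (1 + f_hat)."""
    f_hat = q * np.exp(-0.5 * k**2)
    return f_hat / (1.0 + f_hat)

def c_by_fourier_inversion(r, k_max=20.0, n_k=20001):
    """Radial inverse transform c(r) = 1/(2 pi^2 r) * int_0^inf k sin(k r) c_hat(k) dk."""
    k = np.linspace(0.0, k_max, n_k)
    h = k[1] - k[0]
    integrand = k * np.sin(np.outer(r, k)) * c_hat(k)   # shape (len(r), n_k)
    # composite trapezoidal rule along the k axis
    integral = h * (integrand.sum(axis=1)
                    - 0.5 * (integrand[:, 0] + integrand[:, -1]))
    return integral / (2.0 * np.pi**2 * r)

def c_by_neumann_series(r, n_terms=200):
    """c = f - f*f + f*f*f - ..., using the n-fold autoconvolution of the Gaussian:
    q^n * (2 pi n)^(-3/2) * exp(-r^2 / (2 n))."""
    n = np.arange(1, n_terms + 1)[:, None]
    terms = ((-1.0) ** (n + 1) * q**n * (2.0 * np.pi * n) ** (-1.5)
             * np.exp(-r**2 / (2.0 * n)))
    return terms.sum(axis=0)

r = np.linspace(0.5, 8.0, 16)
c_fourier = c_by_fourier_inversion(r)
c_series = c_by_neumann_series(r)
max_diff = np.max(np.abs(c_fourier - c_series))
print("largest deviation between the two evaluations:", max_diff)
```

Both evaluations depend on $R$ only through $r = | R |$, in line with the last statement of the lemma. The series route is only available here because $q < 1$; the lemma itself requires no such smallness.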

Proof.

We choose u (not to be confused with the pair potential in the remainder of this paper) from the standard Schwartz space $\mathcal{S}$, sufficiently close to f in $L^{1} (\mathbb{R}^{3})$ so that $1 + \hat{u} \neq 0$. Then we can apply the classical Wiener lemma to deduce that there exist $c, d \in L^{1} (\mathbb{R}^{3})$ which satisfy (A1) and

$$(1 + \hat{u})^{- 1} = 1 - \hat{d}, \tag{A2}$$

respectively. Moreover, $d \to c$ in $L^{1} (\mathbb{R}^{3})$ as $u \to f$ in $L^{1} (\mathbb{R}^{3})$; see [8]. Evidently,

$$\hat{d} = \frac{\hat{u}}{1 + \hat{u}} \in \mathcal{S},$$

and hence, $d \in \mathcal{S}$, and

$$w = (e - d) * (u - f) \in L_{ϱ}^{\infty} (\mathbb{R}^{3}) \tag{A3}$$

with

$$‖ w ‖_{L^{1} (\mathbb{R}^{3})} \leq (1 + ‖ d ‖_{L^{1} (\mathbb{R}^{3})}) \, ‖ u - f ‖_{L^{1} (\mathbb{R}^{3})} < 1,$$

provided u is sufficiently close to f. In (A3) and below the symbol $*$ refers to the standard three-dimensional convolution, i.e.

$$w (R) = u (R) - f (R) - \int_{\mathbb{R}^{3}} d (R - R^{'}) \, (u (R^{'}) - f (R^{'})) \, d R^{'}, \qquad R \in \mathbb{R}^{3} .$$

Since $‖ w ‖_{L^{1} (\mathbb{R}^{3})}$ has been shown to be less than 1, Corollary 4.3 in [11] allows us to conclude that the series

$$W_{Σ} := \sum_{n = 1}^{\infty} W_{n} \tag{A4}$$

of the n-fold autoconvolutions $W_{n}$ of w converges in $L_{ϱ}^{\infty} (\mathbb{R}^{3})$, and hence,

$$c_{0} := d - W_{Σ} + d * W_{Σ} \in L_{ϱ}^{\infty} (\mathbb{R}^{3}) .$$

It turns out that this very function $c_{0}$ coincides with c, for we have

$${\hat{c}}_{0} = \hat{d} - (1 - \hat{d}) \frac{\hat{w}}{1 - \hat{w}} = \frac{\hat{d} - \hat{w}}{1 - \hat{w}},$$

and when inserting (A3) and (A2) it follows that

$${\hat{c}}_{0} = \frac{\hat{d} - (1 - \hat{d}) (\hat{u} - \hat{f})}{1 - (1 - \hat{d}) (\hat{u} - \hat{f})} = \frac{\hat{u} - (\hat{u} - \hat{f})}{1 + \hat{u} - (\hat{u} - \hat{f})} = \frac{\hat{f}}{1 + \hat{f}} = \hat{c}, \tag{A5}$$

as has been claimed. This shows that $c \in L_{ϱ}^{\infty} (\mathbb{R}^{3})$.

If f is a radial function, so is $\hat{f}$ and also $\hat{c}$ according to (A5). Hence, c is a radial function, too.
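The construction used in the proof can be replayed in a finite-dimensional analogue. The sketch below is not the setting of the lemma: it replaces $\mathbb{R}^{3}$ by a periodic one-dimensional grid, the Fourier transform by the discrete Fourier transform, and the Schwartz approximation u by the hypothetical choice $u = 0.9 f$. The grid size, the Gaussian f and the number of series terms are likewise assumptions. It builds d from (A2), w from (A3), a partial sum of (A4), and checks that $c_{0} = d - W_{Σ} + d * W_{Σ}$ reproduces the solution c of (A1), even though $‖ f ‖_{1}$ is about 2, so that a series in powers of f alone would not converge.

```python
import numpy as np

def conv(a, b):
    """Circular convolution on the periodic grid, computed with the FFT."""
    return np.real(np.fft.ifft(np.fft.fft(a) * np.fft.fft(b)))

n = 128
x = np.arange(n)
dist = np.minimum(x, n - x)          # distance to the origin on the periodic grid

# Hypothetical data: the l^1 norm of f is about 2, but 1 + f_hat > 0 because
# the discrete transform of this sampled Gaussian is positive.
f = 0.8 * np.exp(-0.5 * dist**2)
u = 0.9 * f                          # stand-in for the approximation u of f

f_hat = np.fft.fft(f)
u_hat = np.fft.fft(u)
c = np.real(np.fft.ifft(f_hat / (1.0 + f_hat)))   # solution of (A1)
d = np.real(np.fft.ifft(u_hat / (1.0 + u_hat)))   # solution of (A2)

e = np.zeros(n)
e[0] = 1.0                           # unit element of the convolution algebra
w = conv(e - d, u - f)               # (A3)

# Partial sum of the series (A4) of n-fold autoconvolutions of w.
W_sigma = np.zeros(n)
W_n = w.copy()
for _ in range(60):
    W_sigma += W_n
    W_n = conv(W_n, w)

c0 = d - W_sigma + conv(d, W_sigma)
norm_w = np.sum(np.abs(w))           # discrete counterpart of the L^1 norm of w
err = np.max(np.abs(c0 - c))
print("l1 norm of w:", norm_w, " max |c0 - c|:", err)
```

The point of the detour through u is visible in the two norms: f itself is too large for a Neumann series, whereas w is small once u is close to f.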

Motivated by (A5), we simply write

$$c = f * (e + f)^{- 1} \tag{A6}$$

for the solution c of (A1) in the sequel. For the sake of completeness, we also include the following result on continuous dependence of $c \in L_{ϱ}^{\infty} (\mathbb{R}^{3})$.

Lemma A.2

Let $f \in L_{ϱ}^{\infty} (\mathbb{R}^{3})$ satisfy the assumptions of Lemma A.1, and let c be given by (A6). If $f_{k} \in L_{ϱ}^{\infty} (\mathbb{R}^{3})$ is sufficiently close to f in $L_{ϱ}^{\infty} (\mathbb{R}^{3})$, then the Ornstein–Zernike relation (A1) with f replaced by $f_{k}$ has a well-defined solution $c_{k} \in L_{ϱ}^{\infty} (\mathbb{R}^{3})$, and there holds

$$‖ c_{k} - c ‖_{L_{ϱ}^{\infty} (\mathbb{R}^{3})} \to 0 \quad \text{as} \quad ‖ f_{k} - f ‖_{L_{ϱ}^{\infty} (\mathbb{R}^{3})} \to 0.$$

Proof.

We write $(e + f_{k})^{- 1} = (e + f)^{- 1} * (e + w_{k})^{- 1}$ with

$$w_{k} = (e + f)^{- 1} * (f_{k} - f), \tag{A7}$$

and note that $‖ w_{k} ‖_{L^{1} (\mathbb{R}^{3})} \leq q < 1$ for $‖ f_{k} - f ‖_{L_{ϱ}^{\infty} (\mathbb{R}^{3})}$ sufficiently small. Using (A1) it follows that

$$\begin{aligned} c_{k} & = f_{k} * (e + f_{k})^{- 1} = f_{k} * (e + f)^{- 1} * (e + w_{k})^{- 1} \\ & = f_{k} * (e - c) * (e + w_{k})^{- 1} = f_{k} * (e - c) * (e + W_{Σ, k}) \\ & = f_{k} - f_{k} * c + (f_{k} - f_{k} * c) * W_{Σ, k}, \end{aligned} \tag{A8}$$

where $W_{Σ, k}$ is the series (A4) of the n-fold autoconvolutions of $- w_{k}$, so that $e + W_{Σ, k} = (e + w_{k})^{- 1}$. Note that

$$‖ W_{Σ, k} ‖_{L_{ϱ}^{\infty} (\mathbb{R}^{3})} \leq C ‖ w_{k} ‖_{L_{ϱ}^{\infty} (\mathbb{R}^{3})} \tag{A9}$$

for some $C > 0$ which only depends on the upper bound q of $‖ w_{k} ‖_{L^{1} (\mathbb{R}^{3})}$, cf. [11]. Rewriting c as $f * (e - c)$ by virtue of (A1) and (A6), we conclude from (A8) that

$$c_{k} - c = f_{k} - f - (f_{k} - f) * c + (f_{k} - f_{k} * c) * W_{Σ, k},$$

and hence, the assertion follows from (A9) and (A7).
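The continuity statement and the representation (A8) can both be observed in a discrete analogue. As a hedged sketch only: the periodic one-dimensional grid, the exponent $α = 2$, the Gaussian f, the perturbation direction p and the step sizes are all assumptions of this example. Since $(e + w_{k})^{- 1} = e + \sum_{n \geq 1} (- w_{k})^{* n}$, the code sums autoconvolutions of $- w_{k}$ when it evaluates $W_{Σ, k}$.

```python
import numpy as np

def conv(a, b):
    """Circular convolution on the periodic grid, computed with the FFT."""
    return np.real(np.fft.ifft(np.fft.fft(a) * np.fft.fft(b)))

def solve_a1(f):
    """Return c with c_hat = f_hat / (1 + f_hat), i.e. c = f * (e + f)^{-1}."""
    f_hat = np.fft.fft(f)
    return np.real(np.fft.ifft(f_hat / (1.0 + f_hat)))

n = 128
x = np.arange(n)
dist = np.minimum(x, n - x)
alpha = 2.0                                  # assumed exponent of the weight
rho = (1.0 + dist**2) ** (alpha / 2.0)       # weight of (4) sampled on the grid

def weighted_norm(g):
    """Discrete counterpart of the weighted supremum norm."""
    return np.max(rho * np.abs(g))

f = 0.8 * np.exp(-0.5 * dist**2)             # hypothetical f
p = np.exp(-0.25 * dist**2)                  # hypothetical perturbation direction
c = solve_a1(f)

eps_values = [1e-1, 1e-2, 1e-3, 1e-4]
errors = []
for eps in eps_values:
    f_k = f + eps * p
    errors.append(weighted_norm(solve_a1(f_k) - c))

# Check the representation (A8) for the largest perturbation.
f_k = f + eps_values[0] * p
e = np.zeros(n)
e[0] = 1.0
w_k = conv(e - c, f_k - f)                   # (A7), using (e + f)^{-1} = e - c
W = np.zeros(n)
term = -w_k
for _ in range(80):                          # partial sum of powers of -w_k
    W += term
    term = conv(term, -w_k)
base = f_k - conv(f_k, c)
a8_residual = np.max(np.abs(base + conv(base, W) - solve_a1(f_k)))

for eps, err in zip(eps_values, errors):
    print(f"eps = {eps:.0e}   weighted error = {err:.3e}")
print("residual of (A8):", a8_residual)
```

In this example the weighted error shrinks together with the size of the perturbation, which is the behaviour the lemma asserts.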

We finally mention that if h is the radially symmetric extension of the pair correlation function and if $f = ρ_{0} h$, then $c / ρ_{0}$ coincides with the direct correlation function in the Ornstein–Zernike relation (8); compare (A5).
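This identification can be checked directly in Fourier space. In the short derivation below, $c_{\mathrm{OZ}}$ is a label introduced only for this illustration (it is not notation from the paper) for the direct correlation function of (8), to keep it apart from the function c of (A1). The step assumes $1 + ρ_{0} \hat{h} \neq 0$.

```latex
\begin{align*}
  \hat{c}_{\mathrm{OZ}} + \rho_0\,\hat{h}\,\hat{c}_{\mathrm{OZ}} = \hat{h}
  \quad &\Longrightarrow \quad
  \hat{c}_{\mathrm{OZ}} = \frac{\hat{h}}{1 + \rho_0 \hat{h}}, \\
  f = \rho_0 h
  \quad &\Longrightarrow \quad
  \hat{c} = \frac{\hat{f}}{1 + \hat{f}}
          = \frac{\rho_0 \hat{h}}{1 + \rho_0 \hat{h}}
          = \rho_0\,\hat{c}_{\mathrm{OZ}}.
\end{align*}
```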
