Publication Cover
Molecular Physics
An International Journal at the Interface Between Chemistry and Physics
Volume 118, 2020 - Issue 19-20: Special Issue of Molecular Physics in Honour of Jürgen Gauss
1,137
Views
7
CrossRef citations to date
0
Altmetric
Research Articles

Automated determination of hybrid particle-field parameters by machine learning

ORCID Icon, ORCID Icon & ORCID Icon
Article: e1785571 | Received 04 Apr 2020, Accepted 16 Jun 2020, Published online: 02 Jul 2020

Figures & data

Figure 1. Protocol for optimising hPF force fields.

Figure 1. Protocol for optimising hPF force fields.

Figure 2. Summary of the hPF phospholipid model. Left: CG representation of the DPPC phospholipid and solvent. Right: Outline of the two terms in the hPF Hamiltonian.

Figure 2. Summary of the hPF phospholipid model. Left: CG representation of the DPPC phospholipid and solvent. Right: Outline of the two terms in the hPF Hamiltonian.

Figure 3. Density profiles and representative membrane snapshots from hPF-MD simulations of a DPPC bilayer using χ~FH parameters [Citation21] (left), particle-based simulations using the MARTINI CG force field (centre), and hPF-MD χ~BO parameters (right). The table presents absolute deviations Sk in the density profiles between the F-H and BO parameter simulations, and the reference MARTINI profile. Percentage deviations Sp are given in parenthesis. Sk values are given in el./nm3.

Figure 3. Density profiles and representative membrane snapshots from hPF-MD simulations of a DPPC bilayer using χ~F−H parameters [Citation21] (left), particle-based simulations using the MARTINI CG force field (centre), and hPF-MD χ~BO parameters (right). The table presents absolute deviations Sk in the density profiles between the F-H and BO parameter simulations, and the reference MARTINI profile. Percentage deviations Sp are given in parenthesis. Sk values are given in el./nm3.

Figure 4. Left: Feature importance as ranked by the mutual information measure between the fitness and the individual χ~ij parameters, for hPF-MD simulations of a DPPC bilayer with randomly sampled χ~ matrices. Presented values are normalised relative to the most important parameter (χ~CW) (arbitrary units). Inset details the low relative MI values found for the last six matrix elements (error bars omitted). χ~ parameters are included in order of decreasing feature importance (left). Right: Best fitness achieved (here using the average coefficient of determination, R2, across all bead species) for each dimension of the parameter space subspace used in hPF-MD BO protocol runs on the DPPC bilayer system.

Figure 4. Left: Feature importance as ranked by the mutual information measure between the fitness and the individual χ~ij parameters, for hPF-MD simulations of a DPPC bilayer with randomly sampled χ~ matrices. Presented values are normalised relative to the most important parameter (χ~CW) (arbitrary units). Inset details the low relative MI values found for the last six matrix elements (error bars omitted). χ~ parameters are included in order of decreasing feature importance (left). Right: Best fitness achieved (here using the average coefficient of determination, R2, across all bead species) for each dimension of the parameter space subspace used in hPF-MD BO protocol runs on the DPPC bilayer system.

Figure 5. Top: Density profiles for hPF-MD DPPC bilayer simulations ran with Bayesian optimised parameter sets with four (left) and ten (right) included χ~ parameters. The four-parameter simulation uses χ~FH values for all but the χ~ matrix elements with the highest feature importance, namely χ~NW, χ~CW, χ~GW, and χ~GC, c.f. column three of the table (bottom). Bottom: Resulting χ~ matrices from the BO protocol applied to hPF-MD simulations of a DPPC bilayer. Results reported for selected subspaces of the full 10-dimensional parameter space, with the χ~ijs shown in red being fixed and not part of the optimisation run. All χ~ values given in kJmol1. Mean percentage errors, Sp, associated with each set of optimised parameters is given in the last row.

Figure 5. Top: Density profiles for hPF-MD DPPC bilayer simulations ran with Bayesian optimised parameter sets with four (left) and ten (right) included χ~ parameters. The four-parameter simulation uses χ~F−H values for all but the χ~ matrix elements with the highest feature importance, namely χ~NW, χ~CW, χ~GW, and χ~GC, c.f. column three of the table (bottom). Bottom: Resulting χ~ matrices from the BO protocol applied to hPF-MD simulations of a DPPC bilayer. Results reported for selected subspaces of the full 10-dimensional parameter space, with the χ~ijs shown in red being fixed and not part of the optimisation run. All χ~ values given in kJmol−1. Mean percentage errors, Sp, associated with each set of optimised parameters is given in the last row.

Table 1. Optimised χ~-matrix parameters found by the BO scheme for hPF-MD simulations of DPPC, DMPC, DSPC, and DOPC bilayer systems. The χ~ matrix elements are given in order of decreasing feature importance. Reference parameters are the Flory-Huggins (χ~FH) parameters used in [Citation21]. All values given in units of kJmol1.

Table 2. Mean absolute deviations in electron density, Sk, with respect to the MARTINI reference density for the different lipids simulated with Bayesian optimised parameter sets on the different phospholipids (relative percentage deviations Sp in parenthesis). Comparison with data from De Nicola, using the baseline χ~FH parameter set [Citation21]. All Sk values given el./nm3.

Figure 6. Top left: The surrogate objective fitness surface (here using the average coefficient of determination, R2, across all bead species) in an example DPPC BO run with only the four parameters exhibiting the highest feature importance scores included (χ~CW, χ~GC, χ~NW, and χ~GW). Individual samplings with their associated fitnesses are represented as blue dots. A projection onto the subspace spanned by χ~CW and χ~GC shown. All χ~ matrix elements are given in kJmol1. Top right: Best DPPC simulation membrane fitness (average R2) for BO and random sampling with only the four parameters exhibiting the highest feature importance scores included. Comparison with the fitness achieved by the reference χ~FH parameter set. Inset details when BO and random sampling surpass the χ~FH parameter set in terms of R2 fitness. Bottom: Scatter matrices showing correlations between all pairs of χ~ parameters in a BO run on a DPPC bilayer (left) compared with random sampling (right). Only the four parameters exhibiting the highest feature importance scores are included in the sampling. The matrix diagonal shows the density of sampled points for each individual χ~ij parameter. All χ~ matrix elements are given in kJmol1.

Figure 6. Top left: The surrogate objective fitness surface (here using the average coefficient of determination, R2, across all bead species) in an example DPPC BO run with only the four parameters exhibiting the highest feature importance scores included (χ~CW, χ~GC, χ~NW, and χ~GW). Individual samplings with their associated fitnesses are represented as blue dots. A projection onto the subspace spanned by χ~CW and χ~GC shown. All χ~ matrix elements are given in kJmol−1. Top right: Best DPPC simulation membrane fitness (average R2) for BO and random sampling with only the four parameters exhibiting the highest feature importance scores included. Comparison with the fitness achieved by the reference χ~F−H parameter set. Inset details when BO and random sampling surpass the χ~F−H parameter set in terms of R2 fitness. Bottom: Scatter matrices showing correlations between all pairs of χ~ parameters in a BO run on a DPPC bilayer (left) compared with random sampling (right). Only the four parameters exhibiting the highest feature importance scores are included in the sampling. The matrix diagonal shows the density of sampled points for each individual χ~ij parameter. All χ~ matrix elements are given in kJmol−1.

Data availability statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.