619
Views
1
CrossRef citations to date
0
Altmetric
Materials data analysis and utilization

Experimental exploration of ErB2 and SHAP analysis on a machine-learned model of magnetocaloric materials for materials design

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon show all
Article: 2217474 | Received 17 Dec 2022, Accepted 12 May 2023, Published online: 21 Jun 2023

ABSTRACT

Stimulated by a recent report of a giant magnetocaloric effect in HoB2 found via machine-learning predictions, we have explored the magnetocaloric properties of a related compound ErB2 that has remained the last ferromagnetic material among the rare-earth diboride (REB2) family with unreported magnetic entropy change |ΔSM|. The evaluated |ΔSM| at field change of 5 T in ErB2 turned out to be as high as 26.1 J kg−1 K−1 around the ferromagnetic transition (TC) of 14 K. In this series, HoB2 is found to be the material with the largest |ΔSM| as the model predicted, while the predicted values showed a deviation with a systematic error compared to the experimental values. Through a coalition analysis using SHAP, we explore how this rare-earth dependence and the deviation in the prediction are deduced in the model. We further discuss how SHAP analysis can be useful in clarifying favorable combinations of constituent atoms through the machine-learned model with compositional descriptors. This analysis helps us to perform materials design with aid of machine learning of materials data.

GRAPHICAL ABSTRACT

1. Introduction

The arrangement of spin in solids and molecules provides a source of an entropy change upon application or removal of the applied field, namely the magnetocaloric effect. Such an effect tends to peak around the magnetic ordering temperature of the material; therefore, the magnitude of the effect is known to be highly temperature- and material-dependent [Citation1,Citation2]. One of common measures for this effect is the magnitude of magnetic entropy change caused by a change in the external field to the material (|ΔSM|), and currently, there is an increasing demand for high |ΔSM| materials as it would provide an alternative cooling route other than conventional gas [Citation2]. The refrigeration using the magnetocaloric effect has been argued to be highly suitable for low-temperature applications [Citation3], such as the liquefaction of hydrogen. This, in turn, suggests the particular importance of materials with high |ΔSM| around hydrogen liquefaction temperature (~20.3 K).

Recently, we have developed a machine-learned model [Citation4] that relates compositions of materials to their peak values of |ΔSM| (|ΔSM|peak) with a mean absolute error of 1.8 J kg−1 K−1 by using reported |ΔSM|peak data [Citation2]. In the model, compositional descriptors [Citation5,Citation6] generated by XenonPy package [Citation6] were used to train the model, where corresponding descriptors for each composition are made by applying seven kinds of mathematical operations (arithmetic mean, geometrical mean, harmonic mean, variance, sum, max, min, weighted by compositions) to 58 physical properties of atomic elements, in addition to the counting of constituent atoms and experimental values of appied field change μ0ΔH. We used XGBoost package [Citation7] to build a model with a gradient boosting-based decision tree algorithm. Once a model is built, such a compositional descriptors-based model can be readily applied for the prediction of target materials in the search space as it requires only compositions to predict with fixed value of μ0ΔH (5 T). Therefore, the constructed model was applied to materials with unknown |ΔSM|peak values for the selection of a candidate to be examined, leading to an experimental discovery of a gigantic magnetocaloric effect in HoB2 with |ΔSM|peak reaching 40.1 J kg−1 K−1 around T = 15 K under field change of 5 T [Citation4]. In addition, it was also found that HoB2 undergoes an additional magnetic transition at T = 11 K, which also contributes to |ΔSM|.

HoB2 is one of the series of rare-earth diborides REB2 with hexagonal AlB2 type structure (RE = Tb, Dy, Ho, Er, Tm, Yb, Lu) [Citation8]. Except for RE = Yb [Citation9] and Lu, they are known to exhibit ferromagnetic ordering (TC = 151, 55, 15, 16, 7 K for RE = Tb, Dy, Ho, Er, Tm, respectively). Several of their detailed magnetic properties and magnetocaloric effect have been further reported. Among them, TbB2 [Citation10] and TmB2 [Citation11] show single ferromagnetic transition, while DyB2 [Citation12] and HoB2 [Citation4] show an additional magnetic transition below TC, where for the case of HoB2 it has been clarified to be related with spin reorientation [Citation13]. The magnitude of |ΔSM|peak (J kg−1 K−1) at 5 T has been reported to be 10, 17, 40 for RE = Tb [Citation10], Dy [Citation12], Ho [Citation4], and 24 for Tm [Citation11] (estimated from fitting a power law curve [Citation14] on the data calculated by using the specific heat data reported in [Citation11]). Contrary to the research above, the detail of magnetic and magnetocaloric properties of ErB2 has been left vailed. Therefore, a comparative study with ErB2 would help a comprehensive understanding of REB2 series that hosts a giant magnetocaloric effect.

In this paper, we report the magnetic properties and magnetocaloric effect of ErB2. The evaluated |ΔSM|peak was as high as 26.1 J kg−1 K−1, which is lower than that of HoB2 (40.1 J kg−1 K−1). The RE dependence in the magnitude of the magnetocaloric effect in REB2 has been shown to peak at RE = Ho, similarly to the model. However, a lack of quantitative agreement was also found between the prediction and the experimental results. Through a model analysis where coalitions to the prediction are resolved for each descriptor, we discuss how the machine-learned model tried to address the atomic-species dependence of magnetocaloric effect such as RE atoms. We also discuss that such a coalition analysis for compositional descriptors in machine-learned models would help researchers to identify favorable atoms for enhancing target property of materials with the aid of constructed models.

2. Methods

Polycrystalline samples of ErB2 and LuB2 were obtained by arc melting of Er (99.9%) or Lu (99.9%) and B (99.9%) in an evacuated arc furnace with Ar atmosphere. Samples were melted several times to ensure homogeneity. X-ray diffraction (XRD) patterns of samples were measured using a MiniFlex600 (Rigaku Corporation, Japan) and analyzed by using FullProf package [Citation15]. Magnetization of ErB2 sample was obtained by using MPMS (Quantum Design Japan, Japan), while specific heat of ErB2 and LuB2 were obtained using PPMS (Quantum Design Japan, Japan) apparatus.

3. Results and discussions

3.1. Physical properties of ErB2

shows the XRD patterns of obtained samples. In addition to the target material ErB2 (top of ), we also fabricated a nonmagnetic reference LuB2 (bottom of ). From the Rietveld analyses, the obtained samples have been evaluated to hold 99% main phase of ErB2 and 1% of Er2O3 for ErB2, and 98% main phase of LuB2 and 2% of Lu2O3 for LuB2. The obtained lattice constants were 3.2725(2) Å (a-axis) and 3.7845(3) Å (c-axis) for ErB2 and 3.2387(1) Å (a-axis) and 3.6941(1) Å (c-axis) for LuB2, showing an overall agreement with literature [Citation8].

Figure 1. XRD pattern of ErB2 (top) and LuB2 (bottom) samples. Asterisks in the figure denote Er2O3 or Lu2O3 impurity. Dots correspond to the experimental data, while lines correspond to the calculated pattern by Rietveld analysis. Inset: Crystal structure of REB2, drawn by VESTA software [Citation16].

Figure 1. XRD pattern of ErB2 (top) and LuB2 (bottom) samples. Asterisks in the figure denote Er2O3 or Lu2O3 impurity. Dots correspond to the experimental data, while lines correspond to the calculated pattern by Rietveld analysis. Inset: Crystal structure of REB2, drawn by VESTA software [Citation16].

In , we show the isofield magnetic susceptibility as a function of temperature (χT curve) for both zero field cooling (ZFC) and field cooling (FC) on ErB2, taken under a magnetic field of 0.01 T. The bottom of shows χ/T of field-cooled χT curve, indicating that ErB2 shows a single ferromagnetic transition at Curie temperature (TC)~14 K with an absence of lower transition as observed in DyB2 and HoB2 down to 1.8 K. As in the inset, isothermal magnetization as a function of field (MH curve) at T = 5 K on ErB2 did not show a clear hysteresis. Not only the MH curve, but there is also only a small difference between ZFC and FC χT curves below TC that might be coming from a domain effect. shows 1/χ as a function of temperature up to room temperature, taken at 1 T and shown in a logarithmic scale. The dashed line in the figure corresponds to the result of fitting the data by Curie–Weiss plot between the temperature range from 200 to 300 K. The estimated Weiss temperature Θ was 15.4 K, in good agreement with Curie temperature estimated from the dip in the χ/T curve. The deviation between the fitting curve and the experimental data becomes evident below ~ 80 K, which may be attributed to the effect of the crystal electric field [Citation11]. The estimated effective magnetic moment μeff was 9.57 μB per formula unit, which is close to that of the theoretical value for the free trivalent Er3+ atom (9.59).

Figure 2. (a) Temperature dependence of magnetic susceptibility in ErB2 (top) and its temperature-derivative (bottom), taken at μ 0H = 0.01 T. Inset shows isothermal magnetization curve of ErB2, taken at T = 5 K. (b) Inverse magnetic susceptibility of ErB2 at 1 T. The dashed line in the figure is the fit to Curie–Weiss law in the temperature range of 200–300 K.

Figure 2. (a) Temperature dependence of magnetic susceptibility in ErB2 (top) and its temperature-derivative (bottom), taken at μ 0H = 0.01 T. Inset shows isothermal magnetization curve of ErB2, taken at T = 5 K. (b) Inverse magnetic susceptibility of ErB2 at 1 T. The dashed line in the figure is the fit to Curie–Weiss law in the temperature range of 200–300 K.

Next, we discuss the magnetocaloric effect of ErB2. shows the temperature dependence of specific heat on ErB2 taken at various fields, in the temperature range between 1.8 and 60 K. Under zero field, ErB2 exhibits a peak in specific heat around 14 K that corresponds to the ferromagnetic transition, and the peak gets smooth and its position migrates to higher temperatures as we apply a magnetic field. This behavior is of typical ferromagnets. Entropy S curves estimated from the specific heat results are shown in . In such ST curves, the vertical gap between two curves under different magnetic fields corresponds to |ΔSM|, while the horizontal gap between two curves corresponds to the magnitude of adiabatic temperature change (so-called |ΔTad|), that are shown in , respectively. |ΔSM| can be also estimated from a series of magnetization measurements (see supplemental information S1 for detail), and that for 2 T and 5 T are in as gray open squares, showing that the |ΔSM|s estimated from two experimental methods agree very well. The |ΔSM| peak of this material is estimated to be 26.1 J kg−1 K−1 at T14 K. In volumetric unit, it corresponds to 0.23 J cm−3 K−1 as the density of the sample was evaluated to be 8.937 g/cm3 by the XRD analysis. The overall shape of the temperature dependence of |ΔSM| among different applied fields is almost unchanged, which is a typical feature of second-order ferromagnetic materials [Citation14]. Also, the peak value of ΔTad of ErB2 turned out to be 8.6 K at T ~14 K under field change of 5 T.

Figure 3. (a) Specific heat of ErB2 under various magnetic fields. (b) Entropy curves of ErB2 under magnetic fields deduced from data in (a). (c) |ΔSM| of ErB2, obtained from data in (b). Gray open squares denote |ΔSM| estimated from magnetization curves (supplemental information S1). (d) ΔTad of ErB2, obtained from data in (b).

Figure 3. (a) Specific heat of ErB2 under various magnetic fields. (b) Entropy curves of ErB2 under magnetic fields deduced from data in (a). (c) |ΔSM| of ErB2, obtained from data in (b). Gray open squares denote |ΔSM| estimated from magnetization curves (supplemental information S1). (d) ΔTad of ErB2, obtained from data in (b).

shows the comparison of specific heat between ErB2 and nonmagnetic reference LuB2 in a wider temperature range. Apart from a peak around 14 K due to ferromagnetic transition, the specific heat of ErB2 shows broad bumps or kink structures around 40–60 K. This is the temperature range that corresponds well to where there is a deviation from the Curie–Weiss fit in the 1/χ curve, implying that these are due to the presence of energy levels split caused by crystal field effect. By integrating the difference in specific heat between ErB2 and LuB2 divided by temperature (Cmag/T, shown as the dashed line in over temperature, we have estimated the temperature dependence of magnetic entropy Smag of ErB2 as shown in . The estimated Smag at TC is 7.6 J/mol, that corresponds to 65% of Rln4, implying that most likely two degenerate Kramers doublet levels of Er3+ are involved for the observed magnetic entropy change.

Figure 4. (a) Specific heat of ErB2 and LuB2 (left axis, solid lines) and estimated Cmag/T (right axis, dashed line). (b) Magnetic entropy curve of ErB2 as a function of temperature.

Figure 4. (a) Specific heat of ErB2 and LuB2 (left axis, solid lines) and estimated Cmag/T (right axis, dashed line). (b) Magnetic entropy curve of ErB2 as a function of temperature.

3.2. SHAP analyses of machine-learned model

Here let us compare the rare-earth dependence of |ΔSM|peak in REB2 system at field change of 5 T between experiment and prediction by the machine-learned model [Citation4], where all REB2 are not in the training dataset, i.e. the entire series are unseen to the model. As we have clarified in Section 3.1 and summarized in (black filled circles, left axis), |ΔSM|peak in stoichiometric REB2 is maximized at RE = Ho case. On the other hand, the model (red open squares, right axis) also predicts that RE = Ho would show the highest |ΔSM|peak despite the scale of values being quite different from the experimental values. The same trend is also visible in where we plot |ΔSM|peak of REB2 predicted by the model against the experimental values, with those in the test dataset [Citation4]. In the figure, the difference between the prediction and experimental values tends to be greater as |ΔSM|peak gets higher, indicating that the model has a certain systematic error in the prediction for REB2 system. The lack of quantitative accuracy in the prediction of the model may indicate that there could be a novel mechanism for enhancing the magnetocaloric effect in this series, that was not in the materials of the training dataset of the model. Now it would be intriguing to ask the model how it predicted this RE-dependence, in other words, what compositional descriptors made the model predict that HoB2 is the most hopeful in REB2, but failed to predict with quantitative accuracy. From now on, we will show that in addition to the conventional dependence plot between the target and the compositional descriptors, a dependence plot between the coalition of each descriptor to the individual prediction by the model and a visualization of the elemental property library would help us to understand the underlying insight that the model has captured. We show two use cases as an example for this approach: first, to understand the local explanation of RE-dependence in the prediction of REB2; second, to understand the global tendency to achieve the high target value.

Figure 5. (a) Comparison of |ΔSM|peak values in REB2 (RE = Tb, Dy, Ho, Er, Tm) at μ0ΔH = 5 T between experiment (left axis, black filled circles [Citation4,Citation10–12], and this work) and prediction by the machine-learned model (right axis, red open squares). (b) |ΔSM|peak values predicted by machine-learned model plotted against the experimental values for REB2 (red open squares) and those in the test dataset for the model (gray circles) [Citation4].

Figure 5. (a) Comparison of |ΔSM|peak values in REB2 (RE = Tb, Dy, Ho, Er, Tm) at μ0ΔH = 5 T between experiment (left axis, black filled circles [Citation4,Citation10–12], and this work) and prediction by the machine-learned model (right axis, red open squares). (b) |ΔSM|peak values predicted by machine-learned model plotted against the experimental values for REB2 (red open squares) and those in the test dataset for the model (gray circles) [Citation4].

Such coalition values are accessible by calculating the Shapley values from the constructed model [Citation17]. For this purpose, we applied a SHapley Additive exPlanations (SHAP) analysis [Citation17] to the model by using the TreeSHAP package [Citation18]. The calculated SHAP value Φi,j for ith descriptor of j-th data point represents the amount of the shift in the predicted target value for jth data compared with the mean target value of the entire training data (Φmean, in our case 7.48 J kg−1 K−1), by disclosing ith descriptor value to the model when all other descriptor values has been informed to the model. One of the prominent characteristics of SHAP values is that they are additive, namely the final predicted value Φj for data point j can be derived by simply summing up Φi,j for all descriptors i:

Φj=ΣiΦi,j+Φmean

Therefore, we can resolve what descriptor i  made a significant contribution to the prediction of the model for individual entry j . The analysis can be also applied to the training dataset, to depict how the model understood the data. shows summary plots of SHAP values for the training dataset, where compositional descriptors are sorted by (a) the magnitude of absolute mean and (b) the magnitude of RE-dependence (RE = Tb, Dy, Ho, Er, Tm) in prediction for REB2 series, and shown up to top 15 for each. The latter is obtained by sorting the SHAP values of descriptors by the magnitude of the standard deviation in the prediction of REB2 compounds. In , the biggest contribution to the predicted values came from ‘dh’ which stands for μ0ΔH namely the magnitude of field change. For this ‘dh’, a high descriptor value tends to give high SHAP values, which makes sense as |ΔSM| generally becomes higher by increasing the magnitude of the operating field change, especially in case of second-order magnetic materials [Citation14]. On the other hand, for the rest of the descriptors, SHAP values show complicated dependencies on descriptors as we will discuss some of them in detail later.

Figure 6. Summary plots of SHAP values up to the top 15 compositional descriptors, sorted by (a) magnitude of the absolute mean (i.e. significance), and (b) magnitude of RE-dependence in REB2 prediction. Dots in the figure represents each data point of the training dataset.

Figure 6. Summary plots of SHAP values up to the top 15 compositional descriptors, sorted by (a) magnitude of the absolute mean (i.e. significance), and (b) magnitude of RE-dependence in REB2 prediction. Dots in the figure represents each data point of the training dataset.

Interestingly, descriptors that give high RE-dependence for the prediction in REB2 () do not necessarily possess high SHAP values (). Here we will discuss more details about the origin of RE-dependence in the model. shows the comparison of target values in the train data and SHAP values, together with RE-dependence of prediction in REB2 series for three representative descriptors, namely harmonic mean of boiling point (K), minimum of van der Waals radius (pm), and maximum of molar heat capacity (J K−1 mol−1) of constituent atoms. Those three descriptors were chosen from the list in so that they have different atomic species dependence, to show here examples for various adaptivity of descriptors. Dashed lines in the top panel are the mean value of the target in the training dataset (7.48 J kg−1 K−1). Therefore, the SHAP values in the middle panel correspond to the degree of the shift from this mean value in the model. The vertical dispersion in SHAP plots comes from SHAP interaction between descriptors [Citation18] (A specific example of interaction term is shown in supplemental information S2). The top panel figures can be drawn even before the construction of the machine-learned model, but it is quite difficult to figure out only from such plots how these descriptor values are related to the target property. Predicted values for REB2 are overlaid in the figure, though it is also hard to draw any conclusion regarding how RE-dependence is predicted in these plots.

Figure 7. Top panel: Target values in training data (gray circles) plotted against focused compositional descriptors of (a) harmonic mean of boiling point, (b) minimum of van der Waals radius, and (c) maximum of molar heat capacity. Horizontal dashed lines correspond to the mean target value of training data (7.48). Predicted values for REB2 are also shown as colored open squares. Middle panel: The same for SHAP values for corresponding compositional descriptors. Bottom panel: RE-dependence of SHAP values for corresponding compositional descriptors in the prediction for REB2.

Figure 7. Top panel: Target values in training data (gray circles) plotted against focused compositional descriptors of (a) harmonic mean of boiling point, (b) minimum of van der Waals radius, and (c) maximum of molar heat capacity. Horizontal dashed lines correspond to the mean target value of training data (7.48). Predicted values for REB2 are also shown as colored open squares. Middle panel: The same for SHAP values for corresponding compositional descriptors. Bottom panel: RE-dependence of SHAP values for corresponding compositional descriptors in the prediction for REB2.

On the other hand, as in the middle panel, the relationship between descriptors and the partial predicted value associated with each descriptor becomes clearly visible through SHAP analysis. In the bottom panel, we show the RE-dependence of the SHAP values for each descriptor. Among those descriptors, harmonic mean of boiling point showed the largest RE-dependence. However, the RE-dependence of the SHAP value for boiling point does not coincide with that of the final prediction, and neither do other descriptors as well. shows predicted target values for REB2, resolved in partial sums of the SHAP values for descriptors. As shown in , until we sum up the SHAP values for the top 35 descriptors sorted by RE-dependence in the SHAP values, the predicted value for HoB2 do not become the largest among REB2. Therefore, it can be said that the RE-dependence in the predicted values is based on a number of descriptors, and it is difficult to mention which is a key descriptor that made HoB2 to be the most promising in the prediction, at least in the current model. It is also possible that the suppressed amplitude of the prediction in REB2 series might be a result of the sum of small contributions of SHAP values with different RE-dependence (even though it is a consequence of learning of the training dataset), which ended up in lack of quantitative accuracy for the prediction of |ΔSM| in this series.

Figure 8. RE-dependence of SHAP values. (a) Total sum of SHAP values that is identical to predicted target values. (b) Partial sum of SHAP values. (c) Sum of SHAP values for the rest of descriptors.

Figure 8. RE-dependence of SHAP values. (a) Total sum of SHAP values that is identical to predicted target values. (b) Partial sum of SHAP values. (c) Sum of SHAP values for the rest of descriptors.

shows the relationship among constituent atom X, atomic element property P=f(X), compound {X}, associated compositional descriptor g({f(X)}), and target property/SHAP value of the model. As shown in , preparation of compositional descriptors corresponds to mapping each set of elements in compounds into the basis axis for the target function. Construction of a machine-learned model corresponds to forming of the SHAP values that behave like a function against each compositional descriptor g({f(X)}), constructed through the regression process of training data with regularization equipped in the used algorithm. With SHAP plots like , we can readily expect that when the input constituent atoms or their ratio changes it would trigger a shift in the compositional descriptor value causing a change in the output SHAP values. It can be said that this is how the model can address the target with composition-dependence from a viewpoint of SHAP analysis.

Figure 9. Schematic workflow of materials search using compositional descriptor-based machine-learning model (blue arrows) and proposed possible materials design assisted by SHAP analysis of the model. (a)(b) Focused physical property f(X) for atomic elements (X). (c) Target plotted against compositional descriptor g({f(X)}). (d) Constructed model by regression of data in (c). (e) SHAP value plotted against compositional descriptor g. (f) Sorted physical property for atomic elements (X).

Figure 9. Schematic workflow of materials search using compositional descriptor-based machine-learning model (blue arrows) and proposed possible materials design assisted by SHAP analysis of the model. (a)(b) Focused physical property f(X) for atomic elements (X). (c) Target plotted against compositional descriptor g({f(X)}). (d) Constructed model by regression of data in (c). (e) SHAP value plotted against compositional descriptor g. (f) Sorted physical property for atomic elements (X).

By considering SHAP values as functions for descriptors, we can further exploit the understanding of the model. also shows a schematic of the conventional workflow of data-driven search using a machine-learned model with compositional descriptors (blue arrows), and an additional process that can be used for materials design and extraction of knowledge from the built model (red arrows) proposed here. In a conventional workflow, a dataset is prepared first, that contains compositions of materials consisting of a set of elements {X} and target physical property values. Then, compositional descriptors are generated by combining the physical properties of atomic elements with diverse mathematical operations as previously described (), that are used as training data. Once the model is built () by using a certain algorithm, it can be used to predict target values of unknown compounds after the preparation of corresponding descriptors. In this process, the prediction is solely relying on the model output and researchers have no access to the trend found by the model during its construction. On the other hand, one can also perform SHAP analysis () on the model to visualize what descriptor values tend to have high coalitions to the target value. In the case of compositional descriptors, it can be resolved which atomic element is favorable for high SHAP as well as target values, by comparing these SHAP plots with elemental physical properties (). For instance in , compounds with harmonic mean of boiling point of constituent atom higher than ~3500 K tend to have high SHAP values, hence transition metals can be suitable as a partner element for heavy RE atoms. We note here that changing a constituent atom to modify the value of a specific compositional descriptor can result in a change in other compositional descriptors at the same time, thus one has to be careful during such an approach. Therefore this approach would be more effective when the SHAP values are condensed in a small number of descriptors, or when the number of prepared descriptors is reduced by pre-processing before the construction of the final model. During such pre-processing, one has to be careful since it is important to build an model as accurate as possible before applying SHAP analysis to extract plausible knowledge out of the available dataset.

Finally, we show further a few examples of how the constructed model with compositional descriptors and its SHAP analysis would give us an overall idea to design materials holding high target properties. shows the distribution of the training dataset and associated physical properties of atomic elements (right panel), for the top five compositional descriptors holding high averaged SHAP values shown in . That are, (a) maximum of lattice constant (Å), (b) sum of electron affinity (eV), (c) variance of lattice constant (Å), (d) variance of the number of valence p-electrons, (e) ground state energy per atom (eV) in first principles calculation software VASP [Citation19]. The left panel shows the target plotted against compositional descriptors while the middle panel shows SHAP values for the same. In figures, we also highlight well-known magnetocaloric materials [Citation2] that were in the training data for a field change of 5 T, namely Gd5Si1.5Ge2.5 (GSG) [Citation20], EuTiO3 (ETO) [Citation21], EuS (ES) [Citation22], ErCo2 (EC) [Citation23], La0.8Ce0.2Fe11.7Si1.3 (LFS) [Citation24], whose |ΔSM| are 41.0, 40.4, 37.0, 36.8, 34.9 J kg−1 K−1, respectively. Interestingly, those representative materials tend to lie in the realm of positive high SHAP values in the middle panel, indicating that the regression process of the model is working fine. It is also worth noting that SHAP values for those five compositional descriptors tend to hold clear thresholds for positive and negative values, possibly because the model is based on a tree-based algorithm.

Figure 10. Left panel: Training data distribution of target plotted against focused compositional descriptors. Middle panel: The same as the left panel for SHAP values. Right panel: Heatmap of properties of atomic elements stored in XenonPy [Citation6]. Focused compositional descriptors are the ones that exhibited the top five largest absolute mean SHAP values in the model (shown in Figure 6(a)), namely (a) maximum of lattice constant, (b) sum of electron affinity, (c) variance of lattice constant, (d) variance of the number of valence p-electrons, and (e) minimum of ground state energy in first principles software VASP [Citation19]. Datapoints for several representative magnetocaloric materials are indicated explicitly, Gd5Si1.5Ge2.5 [Citation20], EuTiO3 [Citation21], EuS [Citation22], ErCo2 [Citation23], La0.8Ce0.2Fe11.7Si1.3 [Citation24] and abbreviated as GSG, ETO, ES, EC, LFS, respectively.

Figure 10. Left panel: Training data distribution of target plotted against focused compositional descriptors. Middle panel: The same as the left panel for SHAP values. Right panel: Heatmap of properties of atomic elements stored in XenonPy [Citation6]. Focused compositional descriptors are the ones that exhibited the top five largest absolute mean SHAP values in the model (shown in Figure 6(a)), namely (a) maximum of lattice constant, (b) sum of electron affinity, (c) variance of lattice constant, (d) variance of the number of valence p-electrons, and (e) minimum of ground state energy in first principles software VASP [Citation19]. Datapoints for several representative magnetocaloric materials are indicated explicitly, Gd5Si1.5Ge2.5 [Citation20], EuTiO3 [Citation21], EuS [Citation22], ErCo2 [Citation23], La0.8Ce0.2Fe11.7Si1.3 [Citation24] and abbreviated as GSG, ETO, ES, EC, LFS, respectively.

By combining the SHAP analysis shown in the middle panel with the property value of atomic elements shown in the right panel, we can extract several suggestions from the current model. For instance,

  • From , the model suggests that the inclusion of atomic elements with a ‘lattice constant’ larger than ~7 Å tends to lower the predicted values. Therefore the current model implies to avoid using B, S, Mn, and Sm.

  • From , the model suggests that inclusion of more than four or five atomic species would not do good for obtaining high |ΔSM|. This might make sense as such partially substituted alloys and composites tend to be used for obtaining table-like |ΔSM| as a function of temperature [Citation25–27].

  • From , finite variance in lattice constants of constituent atoms tends to enhance the target. Thus, a combination of anion and cation atoms with close values of elemental lattice constant can be avoided.

  • From , finite variance in the number of p-valence electrons tends to enhance the target. Therefore, the use of anion atoms with p-valence electrons is preferred.

  • From , compositions containing atoms with ground state energy lower than −8.5 eV tend to have lower target values. Thus the use of 5d-transition metal can be avoided.

We stress here that those above are merely an example of what kind of ideas materials researchers can receive through such analysis, and the current model is yet far from complete as it is evident in the quantitative disagreement in the magnitude of the target value in the REB2 system. However, at least even in the framework of the current model, if we could find a compound satisfying those above five suggestions from the model, the expected target value from the sum of those five SHAP values of compositional features can be as high as 15–20 J kg−1 K−1, which is appreciatively large, though SHAP values of rest 403 descriptors would modify the final predicted target value. We also stress that the current model holds 408 descriptors and SHAP values are widely split into small values for each descriptor. Therefore, building a model with a confined number of descriptors and/or applying feature selection would benefit the extraction of ideas from the model through such analysis.

Summary

In summary, we have investigated the magnetocaloric effect in ErB2, which was the last unreported magnetocaloric material in REB2 ferromagnets. The experimentally observed magnetic entropy change |ΔSM|peak were as large as 26.1 J kg−1 K−1 for field change of 5 T at ~ 14 K, hence |ΔSM|peak in HoB2 (40.1 J kg−1 K−1) turned out to be the highest among REB2 series (RE = Tb, Dy, Ho, Er, Tm). The machine-learned model also predicted HoB2 as the most promising material, however, the predicted magnitude of |ΔSM|peak showed a disagreement with experimental values in their quantity. Through SHAP analysis of the model, it turned out that RE-dependence in the model prediction comes from the sum of small contributions from a number of compositional descriptors with a variety of atomic-species dependence in the current model, that might have caused a systematic error in the predicted values in REB2 system. It has been also discussed that such SHAP analysis for the compositional descriptor-based machine-learned model could be helpful for researchers to visualize what trend in the training data has been found by the built model, and to plan the material design by taking advantage of this knowledge.

Supplemental material

Supplemental Material

Download PDF (2.4 MB)

Acknowledgements

This work was supported by the JST-Mirai Program (Grant No. JPMJMI18A3), JSPS Bilateral Program (JPJSBP120214602), and JSPS KAKENHI (Grant Nos. 20K05070, 23K04572, 19H02177). P.B.C. acknowledges the scholarship support from the Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary data

Supplemental data for this article can be accessed online at https://doi.org/10.1080/27660400.2023.2217474.

Additional information

Funding

The work was supported by the Japan Science and Technology Agency [JPMJMI18A3] and Japan Society for the Promotion of Science [19H02177, 20K05070, 23K04572, JPJSBP120214602].

References

  • Gschneidner KA, Pecharsky VK. Magnetocaloric materials. Annu Rev Mater Sci. 2000;30(1):387–10.
  • Franco V, Blázquez J, Ipus J, et al. Magnetocaloric effect: from materials research to refrigeration devices. Pro Mater Sci. 2018;93:112–232.
  • Utaki T, Kamiya K, Nakagawa T, et al. Research on a magnetic refrigeration cycle for hydrogen liquefaction. Vol. 14, New York: Kluwer Academic/Prenum Publishers; 2007. p. 645.
  • de Castro PB, Terashima K, Yamamoto TD, et al. Machine-learning-guided discovery of the gigantic magnetocaloric effect in HoB2 near the hydrogen liquefaction temperature. NPG Asia Mater. 2020;12:35.
  • Ward L, Agrawal A, Choudhary A, et al. A general-purpose machine learning framework for predicting properties of inorganic materials. npj Comput Mater. 2016;2:16028.
  • Yamada H, Liu C, Wu S, et al. Predicting materials properties with little data using shotgun transfer learning. ACS Cent Sci. 2019;5:1717–1730.
  • Chen T, Guestrin C. XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; New York, NY, USA. ACM; 2016. p. 785–794.
  • Buschow KHJ. Magnetic properties of borides. Berlin, Heidelberg: Springer Berlin Heidelberg; 1977. p. 494–515.
  • Avila M, Bud’ko S, Petrovic C, et al. Synthesis and properties of YbB2. J Alloys Compd. 2003;358:56–64.
  • Han Z, Li D, Meng H, et al. Magnetocaloric effect in terbium diboride. J Alloys Compd. 2010;498:118–120.
  • Mori T, Takimoto T, Leithe-Jasper A, et al. Ferromagnetism and electronic structure of TmB2. Phys Rev B. 2009;79:104418.
  • Meng H, Li B, Han Z, et al. Reversible magnetocaloric effect and refrigeration capacity enhanced by two successive magnetic transitions in DyB2. Sci China Technol Sci. 2012;55:501–504.
  • Terada N, Terashima K, de Castro PB, et al. Relationship between magnetic ordering and gigantic magnetocaloric effect in HoB2 studied by neutron diffraction experiment. Phys Rev B. 2020;102:094435.
  • Franco V, Conde A, Romero-Enrique JM, et al. A universal curve for the magnetocaloric effect: an analysis based on scaling relations. J Phys Condens Matter. 2008;20:285207.
  • Rodríguez-Carvajal J. Recent advances in magnetic structure determination by neutron powder diffraction. Phys B Condens Matter. 1993;192:55–69.
  • Momma K, Izumi F. VESTA: a three-dimensional visualization system for electronic and structural analysis. J Appl Crystallogr. 2008;41:653–658.
  • Lundberg SM, Lee SI. A unified approach to interpreting model predictions. In: Guyon I, Luxburg U, and Bengio S, editors. Advances in neural information processing systems 30. New York, US: Curran Associates, Inc.; 2017. p. 4765–4774.
  • Lundberg SM, Erion G, Chen H, et al. From local explanations to global understanding with explainable ai for trees. Nat Mach Intell. 2020;2:56–67.
  • Kresse G, Hafner J. Ab initio molecular dynamics for liquid metals. Phys Rev B. 1993;47:558–561.
  • Pecharsky A, Gschneidner K, Pecharsky V. The giant magnetocaloric effect between 190 and 300 K in the Gd5SixGe4–x alloys for 1.4≤x≤2.2. J Magn Magn Mater. 2003;267:60–68.
  • Mo ZJ, Shen J, Li L, et al. Observation of giant magnetocaloric effect in EuTiO3. Mater Lett. 2015;158:282–284.
  • Li D, Yamamura T, Nimori S, et al. Large reversible magnetocaloric effect in ferromagnetic semiconductor EuS. Solid State Commun. 2014;193:6–10.
  • Wada H, Tomekawa S, Shiga M. Magnetocaloric properties of a first-order magnetic transition system ErCo2. Cryogenics. 1999;39:915–919.
  • Chen X, Chen Y, Tang Y. The effect of different temperature annealing on phase relation of LaFe11.5Si1.5 and the magnetocaloric effects of La0.8Ce0.2Fe11.5–xCoxSi1.5 alloys. J Magn Magn Mater. 2011;323:3177–3183.
  • de Castro PB, Terashima K, Yamamoto TD, et al. Enhancement of giant refrigerant capacity in Ho1–xGdxB2 alloys (0.1 ≤ x ≤ 0.4). J Alloys Compd. 2021;865:158881.
  • Takeya H, Pecharsky VK, Gschneidner KA, et al. New type of magnetocaloric effect: implications on low-temperature magnetic refrigeration using an ericsson cycle. Appl Phys Lett. 1994;64:2739–2741.
  • Li L, Yuan Y, Qi Y, et al. Achievement of a table-like magnetocaloric effect in the dual-phase ErZn2/ErZn composite. Mater Res Lett. 2017;6:67–71.