Full article: Artificial neural network study of the electrical conductivity of mould flux

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

The electrical conductivity of mould flux with chemical constitution of CaO-SiO₂-Al₂O₃-NaO-K₂O-MgO-CaF₂-Cr₂O₃-FeO-MnO has been investigated. The assessed database contains one unitary, five binary, nine ternary, four quaternary, two quinary, two senary and one octonary subsystems. Each constitutional component is in connection to another via some direct or indirect links. A multilayer artificial neural network method was developed and implemented in the database. The work provides a method to calculate the relationships between the composition, temperature and electrical property of the mould flux within the defined parameter ranges. The results have been validated against those experimental data that are not included in the training of the neural networks.

KEYWORDS:

Introduction

Many steelmaking companies are using a mould flux with chemical constitution of CaO-SiO₂-Al₂O₃-NaO-K₂O-MgO-CaF₂-Cr₂O₃-FeO-MnO to cast stainless steels in continuous casting mould. The mould slag contains high concentration of fluorine to reduce viscosity and solidification temperature [Citation1]. Cr₂O₃-FeO-MnO can either pre-exist in mould powder or enter to liquid slag from oxidation of alloying elements in liquid stainless steel at casting mould and tundish [Citation2]. The electrical conductivity of this system has never been assessed systematically although the data for its subsystem are notably rich.

The primary driving force to study the electrical properties of mould powder is for electroslag remelting processing [Citation3,Citation4]. This has, therefore, attracted considerable experimental measurement activities [Citation3–6], theoretical modelling [Citation7,Citation8] and data assessment [Citation9] for several subsystems. The recent environment regulation on carbon neutral steelmaking promotes the electrification of continuous casting. Electric field affects materials segregation [Citation10], viscosity [Citation11], distribution of oxide inclusions [Citation12] and surface roughness in the cast mould [Citation13]. This demands an analytical expression to represent the relationship between the chemical constitution, temperature and electrical conductivity of the system. The aim of the present work was to provide a mean to calculate the constitution and temperature-dependent electrical conductivity of the mould flux materials.

This work uses artificial neural network to approach the target. The method relies on available data to extrapolate values in the unknown parameters’ range [Citation14]. Ideally, a relationship between the electrical conductivity, constitution and temperature should be derived from micro-mechanisms [Citation15]. The previous theoretical modelling for the electrical conductivity of mould slag has been based largely on an assumption that electrical conduction is carried out by the moving ionised atoms. Obviously, viscosity affects the mobility of ionised atoms and hence plays an important role in electrical conduction. The basicity of mould slag affects the amount and length of silicate chains. This lays a foundation for the optical basicity model to calculate the electrical conductivity of mould slag [Citation7]. On the other side, the interactions between constitutional components influence the mobility of atoms. This forms a basis to use particle interaction to calculate electrical conductivity [Citation8]. For the oxides which can conduct electricity by electronic means, such as FeO, the above-mentioned theoretical models stray away [Citation2]. The data-based artificial neural network has proved to be an effective solution to provide alternative solution [Citation9].

The artificial neural network is different from data fitting such as the least square method. The former prevents overfitting but the latter seeks best fitting to data. The recent development of artificial intelligent learning enables the method to reduce its dimension according to the conservative laws that are hidden in the data [Citation16,Citation17]. Artificial neural network has potential to indicate some physics natures buried in the big data. This work intends to provide a method to calculate the constitution and temperature-dependent electrical conductivity of the mould flux.

Artificial neural network

A supervised multilayer artificial neural network with back propagation learning algorithm has been coded for the present purpose. The network has an input layer containing 11 units to record the constitutional compositions and temperature. The output layer has 1 unit to provide the computational result for electrical conductivity. There are T-1 hidden layers each containing numerous units. The architecture of 2-layer network is illustrated schematically in Figure , where the hidden layer has n-units. The mapping function for 2-layer neural network in the present work is defined as (1.1) $\begin{aligned} x_{j} & = \sum_{i = 1}^{10} ω_{j, i}^{(1)} c_{i} + ω_{j, 11}^{(1)} T + b_{j}^{(1)} \end{aligned}$ (1.1) (1.2) $\begin{aligned} u_{j} & = 1 / (1 + e^{- x_{j}}) \end{aligned}$ (1.2) (1.3) $\begin{aligned} y & = \sum_{j = 1}^{n} ω_{1, j}^{(2)} u_{j} + b_{1}^{(2)} \end{aligned}$ (1.3) (1.4) $\begin{aligned} σ & = 1 / (1 + e^{- y}) \end{aligned}$ (1.4)

Figure 1. Schematic diagram illustrates 2-layer neural network.

The mapping function for >2 layers neural network can be manipulated in the same way. A code package to calculate up to 4-layer neural network has been developed by the author. $ω_{j, i}^{(k)}$ is the weight factor between j^th unit in k^th layer and i^th unit in (k−1)^th layer, where 0^th layer is the input layer. $b_{j}^{(k)}$ is the bias for j^th unit in k^th layer. $x_{j}$ is the activation of j^th unit in 1^st layer $u_{j}$ is its activation function. $y$ is the activation of output layer. The electrical conductivity $σ$ is the activation function of $y$ .

The activation functions in Equations (1.2) and (1.4) are nonlinear and with output values between 0 and 1. Many other neural network models choose other functions such as $f (x) = tanh (x)$ [Citation14], which has value between −1 and 1, or Gaussian distribution [Citation16]. It is important to normalise the data according to the format of activation functions. In the present model, all the electrical conductivity data are normalised to a value between 0 and 1. The numerical results are denormalised to get the true value.

Following the standard method in neural network calculation [Citation18], the total difference is defined as (2) $\begin{aligned} E_{D} = \frac{1}{m} \sum_{l = 0}^{m} {(σ_{c}^{(l)} - σ_{e}^{(l)})}^{2} \end{aligned}$ (2) where m is the total number of sets of training data. $σ_{e}^{(l)}$ and $σ_{c}^{(l)}$ are the l^th training value and calculated value, respectively. To prevent neural network optimisation from overfitting the noise in training data, a regularisation is defined to minimise the total value of weight factor square as $E_{W} = \sum_{i, j, k} {(w_{j, i}^{(k)})}^{2}$ . The overall target function is defined as [Citation18] (3) $\begin{aligned} E = \frac{1}{2} β E_{D} + \frac{1}{2} α E_{W} \end{aligned}$ (3) where β and α are coefficients. Their ratio reflects a balance between accuracy and simplicity. The intelligent learning is to perform a gradient descent minimisation of the total difference via [Citation19] (4) $\begin{aligned} Δ w_{j, i}^{(k)} = - η \partial E / \partial w_{j, i}^{(k)} \end{aligned}$ (4) where $η$ is the learning rate. Equation (4) defines one of the learning methods that always works but unnecessarily the most effective method. In the following section, other learning methods will be used and compared with Equation (4). The analytical expression for each weight factor and bias can be obtained by back propagation of the partial difference of target function according to Equation (4).

Data assessment

The composition in the database is defined to be atomic percentage. The weight percentage data from literature has been converted according to their molar weight [Citation3,Citation5,Citation6,Citation21]. Renormalisation has been performed for those data that the original total composition does not come to 100% [Citation22]. This is not rare in steel company become the initial constitutions of C and CO₂ in mould powder are not counted in liquid slag. However, some data in literature has a total composition in excess of 100%. Those data are ignored entirely. For the data contain tiny fraction (3.00 wt-%) of NaO and K₂O but without detailed information about their ratio [Citation21]. An assumption of 50:50 is applied. This assumption does not affect the specification of the individual contribution from NaO and K₂O to the electrical conductivity because a significant amount of NaO and K₂O data from other literature has been applied to justify the assumption [Citation23,Citation24]. For the data that have been assessed previously [Citation8], the assessed data instead of large amount of raw experimental data has been adopted to minimise noise. Some data are reported as scattered points in plotted figure. To get the values from the plotted figure with high accuracy, a java-based code package has been developed by the author to convert the figure to data. Some figures have been plotted artificially with scales in coordinates not proportional [Citation22]. Only those data with clear indication of their values have been adopted. For the plotted continuous curves, only those points accompanied with experimental values are adopted. After the critical assessment, a database contains 752 sets of data with one unitary, five binary, nine ternary, four quaternary, two quinary, two senary and one octonary subsystems has been built up. The detailed subsystems are listed in Table . The parameter ranges are shown in Table . The unit of temperature is Kelvin. The unit of electrical conductivity is Ω⁻¹·cm⁻¹.

Table 1. The subsystems in the electrical conductivity database.

Download CSV Display Table

Table 2. The range of parameters’ values in the database.

Download CSV Display Table

Neural network computation and results

The activation functions, as illustrated in Equations (1.2) and (1.4), have an output value between 0 and 1 for the activation between -∞ and +∞. However, $σ = 0.006693$ at $y = - 5$ and $σ = 0.993307$ at $y = 5$ , which indicate an extremely slow approximation to either 0 or 1. Based on this consideration, the training data for electrical conductivity is not normalised by the true minimum and maximum values in the database but multiplied by 0.7 to the minimum value and 1.3 to the maximum value. The input parameters for composition and temperature are all normalised to a value between 0 and 1 to ensure every input parameter has the same weight of contribution. The denormalisation and normalisation procedure followed the following equations (5.1) $\begin{aligned} \tilde{σ} & = \frac{σ - 0.7 σ^{m i n}}{1.3 σ^{m a x} - 0.7 σ^{m i n}} \end{aligned}$ (5.1) (5.2) $\begin{aligned} {\tilde{c}}_{i} & = \frac{c_{i} - c_{i}^{m i n}}{c_{i}^{m a x} - c_{i}^{m i n}} \end{aligned}$ (5.2) where $\tilde{σ}$ and ${\tilde{c}}_{i}$ are the normalised electrical conductivity and composition for $i < 11$ and temperature when $i = 11$ . The initial values for all the weight factors and biases are assigned to a random float value between −5 and 5. A high-quality random number generator was coded according to a probability theory developed by Marsaglia et al. [Citation25].

In neural network calculation, it has been noted that the artificial learning by means of Equation (4) in every time iteration does not help to find the weight factors and biases to achieve minimum total differences between the target values and calculated value. The weight factors soon adjust their values to minimise the overall target function ( $E$ ) rather than the total difference ( $E_{D}$ ). To overcome this problem, the regularisation term ( $E_{W}$ ) is not included in each time iteration but replaced at the final step assessment. It is also found that the convergence rate is almost doubled by the following artificial learning mechanism, which agrees with the suggestion from Rumelhart et al. [Citation19] (6) $\begin{aligned} Δ w_{j, i}^{(k)} (t + 1) = - η \partial E / \partial w_{j, i}^{(k)} + δ Δ w_{j, i}^{(k)} (t) \end{aligned}$ (6) where δ is a coefficient. For the 2-layer neural network calculation with $n = 16$ and $α = β = η = δ = 0.5$ , the evolution of total difference, regularisation term and target function vs iteration steps are demonstrated in Figure . It shows that that total difference drops sharply in the early stage (labelled by A), followed by slow drops (labelled by B) until a flat stage (labelled by C) to fluctuate around a minimum value. However, the regularisation term was increased slowly but monotonically until stage C. This is due to the early mentioned decision of not to include the minimisation of regularisation term in the time iteration. The target function has been reduced monotonically until the flat stage.

Figure 2. The evolution of total difference ( $E_{D}$ ), regularisation term ( $E_{W}$ ) and target function ( $E$ ) vs. iteration steps for 2-layer neural network with hidden layer containing 16 units.

To determine the optimum number of units in the hidden layer in 2-layer neural network calculation, one has calculated the change of differences until 1.6 × 10⁷ iteration steps for various number of units. The results are plotted in Figure . Although $E_{D}$ decreases when the number of units increases, $E_{W}$ demonstrates some optimised values. $E_{W}$ increases sharply when the number of units is away from the optimised one. The target function, $E$ , reveals an optimised value for the number of units in the hidden layer of the 2-layer neural network. Based on the results, $n = 16$ is chosen. It is worth mentioning that the local minimum at $n = 16$ for the curve of $E_{D}$ is out of expectation. To double check whether it is a numerical coincidence, the code and parameters were run at three different workstations but the results were very similar, given the fact that the initialisation of weight factors involves a random number generator which should be different at different computers. The smallest $E_{D}$ appeared at 9,012,000^th iteration step. The values for the weight factors and biases at this optimised condition are listed in Table . These values can be used to calculate the electrical conductivity of the system at any composition and temperature in the parameters’ range.

Figure 3. The evolution of total difference ( $E_{D}$ ), regulation term ( $E_{W}$ ) and target function ( $E$ ) vs. number of units in hidden layer after 1.6 × 10⁷ iteration steps.

Figure 3. The evolution of total difference (ED), regulation term (EW) and target function (E) vs. number of units in hidden layer after 1.6 × 107 iteration steps.

Table 3. The optimised weight factors and biases obtained by artificial neural network calculations.

Display Table

The optimised weight factors and bias values have been implemented to calculate the electrical conductivity for 752 sets of compositions and temperature. The results are plotted in Figure and compared with the values in the database. Figure (a) shows the comparison at linear scale coordinates. The 45° line indicates the perfect agreement between the artificial neural network computational results and the value in database, where majority data are from experimental measurement and the rest from assessment based on experimental values. The figure shows good agreement. Owing to the wide range distribution of the electrical conductivity values from 0.016 to 23.771, which across three orders of magnitude, the comparison in logarithmic scale is shown in Figure (b). The data shows some almost evenly distribution around the 45° line, majority with absolute error below 5%. The largest absolute discrepancy appears in the lowest electrical conductivity end, as is circled in Figure (b). Those data are found all belong to CaO-SiO₂-Al₂O₃ subsystem at a temperature either in 1623 K or 1673 K, and was reported in one paper.

Figure 4. The electrical conductivity from numerical results by artificial neural network calculation vs. the value in the database: (a) in linear scale plotting, and (b) in logarithmic scale plotting.

To validate the artificial neural network calculations, the optimised weight factors and bias values have been implemented to calculate two binary systems CaF₂-Al₂O₃ and Cao-CaF2 at different temperature. The results have been compared with the experimental results reported in various literature [Citation6,Citation26,Citation27]. Figure presents the results for (a) CaF₂-Al₂O₃ at 1773K, (b) CaF₂-Al₂O₃ at 1973K, (c) Cao-CaF2 at 1773K and (d) Cao-CaF2 at 1873K. These experimental data are not in database during the training of neural network. Figure shows that the electrical conductivity obtained in the neural network calculations are within the fluctuation of various experimental measurements. It proves that the artificial neural network prediction for the electrical conductivity can be used to predict the change of electrical conductivity at various subsystems in different compositions and temperatures.

Figure 5. Validation of neural network calculation by experimental data reported in literature [Citation6,Citation26,Citation27]. (a) CaF₂-Al₂O₃ at 1773K, (b) CaF₂-Al₂O₃ at 1973K, (c) Cao-CaF2 at 1773K, and (d) Cao-CaF2 at 1873K.

Figure 5. Validation of neural network calculation by experimental data reported in literature [Citation6,Citation26,Citation27]. (a) CaF2-Al2O3 at 1773K, (b) CaF2-Al2O3 at 1973K, (c) Cao-CaF2 at 1773K, and (d) Cao-CaF2 at 1873K.

The artificial neural network and machine learning have many potential applications in steel metallurgy [Citation28]. In the future, more works will be done to include other components to the system, such as NiO, TiO₂, MgF₂, BaF₂, BaO, ZrO, CaS. The availability of the new experimental measurement method for electrical conductivity enables to get more accurate data in other systems [Citation29], which will help to build up database for training and validation of the neural networks. The future work can, hopefully, also include the effort to use the data and machine learning method to identify the main oxides that control the electrical conductivity of mould flux and the influence of temperature on the electrical properties, and to compare the results with the theoretical predictions [Citation7,Citation8,Citation15].

Conclusions

An electrical conductivity database for CaO-SiO₂-Al₂O₃-NaO-K₂O-MgO-CaF₂-Cr₂O₃-FeO-MnO liquid mould slag system has been built up.
The database has been implemented to train an artificial neural network. It is found that the two-layer neural network with 16 units in hidden layer provides the minimum difference in target function. The optimised weight factors and bias values can be used to calculate the electrical conductivity of the system in a wide range of compositions and temperature.
The numerical prediction has been validated by the experimental results reported in literature. Excellent performance of artificial neural network derivation has been proved.

Acknowledgements

This work was financially supported by the European Commission RFCS (No. 847269) and Engineering and Physical Sciences Research Council in UK (No. EP/R029598/1).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Engineering and Physical Sciences Research Council [grant number EP/R029598/1]; European Commission RFCS (Research Fund for Coal and Steel) [grant number 847269].

References

Yang J, Zhang JQ, Ostrovski O, et al. Effects of fluorine on solidification, viscosity, structure, and heat transfer of CaO-Al2O3-based mold fluxes. Metall Mater Trans. 2019;50B:1766–1772.
Web of Science ®Google Scholar
Barati M, Coley KS. Electrical and electronic conductivity of CaO-SiO2-FeOx slags at various oxygen potentials: Part I. Experimental results. Metall Mater Trans. 2006;37B:41–49.
Web of Science ®Google Scholar
Winterhager H, Kammel R, Gad A. Electric conductivity, density and surface tension of fluoride type slag for electroslag remelting method. Electric Steelmak. 1974;45:234–252.
Google Scholar
Hara S, Hashimoto H, Kaxumi O. Electrical conductivity of molten slags for electro-slag remelting. Trans Iron Steel Inst Jpn. 1983;23:1053–1058.
Google Scholar
Sarka SB. Electrical conductivity of molten high-alumina blast furnace slags. ISIJ Int. 1989;29:348–351.
Web of Science ®Google Scholar
Ogino K, Hara S. Density, surface tension and electrical conductivity of calcium fluoride based fluxes for electroslag remelting, iron and steel. Tetsu-To-Hagane. 1977;63:2141–2151.
Google Scholar
Zhang GH, Chou KC. Simple method for estimating the electrical conductivity of oxide melts with optical basicity. Metall Mater Trans. 2010;41B:131–136.
Web of Science ®Google Scholar
Shi GY, Ta Z, Dou ZH, et al. Estimation model for electrical conductivity of CaF2-CaO-Al2O3 slags. JOM. 2016;68:2365–2370.
Web of Science ®Google Scholar
Haraguchi Y, Nakamoto M, Suzuki M, et al. Electrical conductivity calculation of molten multicomponent. ISIJ Int. 2018;58:1007–1012.
Web of Science ®Google Scholar
Zhang XF, Qin RS. Segregation of copper in an Fe–Cu alloy under pulsed electric current. Phil Mag Lett. 2015;95:367–375.
Web of Science ®Google Scholar
Halsey TC. Electrorheological fluids. Science. 1992;258:761–766.
PubMed Web of Science ®Google Scholar
Zhang XF, Qin RS. Electric current-driven migration of electrically neutral particles in liquids. Appl Phys Lett. 2014;104:114106.
Web of Science ®Google Scholar
Qin RS. Suppression of the surface roughness and fluctuation frequency by electric method. Mater Today Commun. 2021;28:102512.
Web of Science ®Google Scholar
Bhadeshia HKDH, Dimitriu RC, Forsik S, et al. Performance of neural networks in materials science. Mater Sci Technol. 2009;25:504–510.
Web of Science ®Google Scholar
Bohnenkamp U, Sandström R, Grimvall G. Electrical resistivity of steels and face-centered-cubic iron. J Appl Phys. 2002;92:4402–4407.
Web of Science ®Google Scholar
Liu ZM, Tegmark M. Machine learning conservation laws from trajectories. Phys Rev Lett. 2021;126:180604.
PubMed Web of Science ®Google Scholar
Wetzel SJ, Melko RG, Scott J, et al. Discovering symmetry invariants and conserved quantities by interpreting siamese neural networks. Phys Rev Res. 2020;2:033499.
Google Scholar
MacKay DJC. Information theory, inference, and learning algorithms. 4th ed., Cambridge, Cambridge University Press; 2003.
Google Scholar
Rumelhart DE, Hinton GE, Williams RJ. Learning representations by back-propagating errors. Nature. 1986;323:533–536.
Web of Science ®Google Scholar
Mills KC, Su YC, Li ZS, et al. Equations for the calculation of the thermo-physical properties of stainless steel. ISIJ Int. 2004;44:1661–1668.
Web of Science ®Google Scholar
Deng LB, Wang S, Zhang Z, et al. The viscosity and conductivity of the molten glass and crystallization behavior of the glass ceramics derived from stainless steel slag. Mater Chem Phys. 2020;251:123159.
Web of Science ®Google Scholar
Farahat R, Eissa M, Megahed G, et al. Effect of EAF slag temperature and composition on its electrical conductivity. ISIJ Int. 2019;59:216–220.
Web of Science ®Google Scholar
Zhang GH, Zheng WW, Jiao SQ, et al. Influences of Na2O and K2O additions on electrical conductivity of CaO-SiO2-(Al2O3) melts. ISIJ Int. 2017;57:2091–2096.
Web of Science ®Google Scholar
Mori K, Matsushita Y. On the constitution of molten slags viewing from the standpoint of the electrical conductivity. Tetsu-to-Hagane. 1952;38:444–448.
Google Scholar
Marsaglia G, Zaman A, Tsang W. Toward a universal random number generator. Stat Probab Lett. 1990;9:35–39.
Web of Science ®Google Scholar
Eisenhuttenleute VD. Slag Atlas. 2nd ed Dusseldorf: Verlag-Stahleisen GmbH; 1995.
Google Scholar
Mitchell A, Cameron J. Electrical conductivity of some liquids in system CaF2 + CaO + Al2O3. Mater Trans. 1971;2:3361–3366.
Web of Science ®Google Scholar
Smith JL. Advances in neural networks and potential for their application to steel metallurgy. Mater Sci Technol. 2020;36:1805–1819.
Web of Science ®Google Scholar
Zhang L, Malfliet A, Blanpain B, et al. In situ electrical conductivity measurement by using confocal scanning laser microscopy. Metall Mater Trans. 2021;52B:2563–2572.
Web of Science ®Google Scholar

Artificial neural network study of the electrical conductivity of mould flux

Abstract

Introduction

Artificial neural network

Data assessment

Table 1. The subsystems in the electrical conductivity database.

Table 2. The range of parameters’ values in the database.

Neural network computation and results

Table 3. The optimised weight factors and biases obtained by artificial neural network calculations.

Conclusions

Acknowledgements

Disclosure statement

References

Information for

Open access

Opportunities

Help and information

Artificial neural network study of the electrical conductivity of mould flux

Abstract

Introduction

Artificial neural network

Data assessment

Table 1. The subsystems in the electrical conductivity database.

Table 2. The range of parameters’ values in the database.

Neural network computation and results

Table 3. The optimised weight factors and biases obtained by artificial neural network calculations.

Conclusions

Acknowledgements

Disclosure statement

Additional information

Funding

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date