1,341
Views
3
CrossRef citations to date
0
Altmetric
Research Article

Artificial neural network study of the electrical conductivity of mould flux

ORCID Icon
Pages 1476-1482 | Received 24 Aug 2021, Accepted 05 Dec 2021, Published online: 21 Dec 2021

Abstract

The electrical conductivity of mould flux with chemical constitution of CaO-SiO2-Al2O3-NaO-K2O-MgO-CaF2-Cr2O3-FeO-MnO has been investigated. The assessed database contains one unitary, five binary, nine ternary, four quaternary, two quinary, two senary and one octonary subsystems. Each constitutional component is in connection to another via some direct or indirect links. A multilayer artificial neural network method was developed and implemented in the database. The work provides a method to calculate the relationships between the composition, temperature and electrical property of the mould flux within the defined parameter ranges. The results have been validated against those experimental data that are not included in the training of the neural networks.

Introduction

Many steelmaking companies are using a mould flux with chemical constitution of CaO-SiO2-Al2O3-NaO-K2O-MgO-CaF2-Cr2O3-FeO-MnO to cast stainless steels in continuous casting mould. The mould slag contains high concentration of fluorine to reduce viscosity and solidification temperature [Citation1]. Cr2O3-FeO-MnO can either pre-exist in mould powder or enter to liquid slag from oxidation of alloying elements in liquid stainless steel at casting mould and tundish [Citation2]. The electrical conductivity of this system has never been assessed systematically although the data for its subsystem are notably rich.

The primary driving force to study the electrical properties of mould powder is for electroslag remelting processing [Citation3,Citation4]. This has, therefore, attracted considerable experimental measurement activities [Citation3–6], theoretical modelling [Citation7,Citation8] and data assessment [Citation9] for several subsystems. The recent environment regulation on carbon neutral steelmaking promotes the electrification of continuous casting. Electric field affects materials segregation [Citation10], viscosity [Citation11], distribution of oxide inclusions [Citation12] and surface roughness in the cast mould [Citation13]. This demands an analytical expression to represent the relationship between the chemical constitution, temperature and electrical conductivity of the system. The aim of the present work was to provide a mean to calculate the constitution and temperature-dependent electrical conductivity of the mould flux materials.

This work uses artificial neural network to approach the target. The method relies on available data to extrapolate values in the unknown parameters’ range [Citation14]. Ideally, a relationship between the electrical conductivity, constitution and temperature should be derived from micro-mechanisms [Citation15]. The previous theoretical modelling for the electrical conductivity of mould slag has been based largely on an assumption that electrical conduction is carried out by the moving ionised atoms. Obviously, viscosity affects the mobility of ionised atoms and hence plays an important role in electrical conduction. The basicity of mould slag affects the amount and length of silicate chains. This lays a foundation for the optical basicity model to calculate the electrical conductivity of mould slag [Citation7]. On the other side, the interactions between constitutional components influence the mobility of atoms. This forms a basis to use particle interaction to calculate electrical conductivity [Citation8]. For the oxides which can conduct electricity by electronic means, such as FeO, the above-mentioned theoretical models stray away [Citation2]. The data-based artificial neural network has proved to be an effective solution to provide alternative solution [Citation9].

The artificial neural network is different from data fitting such as the least square method. The former prevents overfitting but the latter seeks best fitting to data. The recent development of artificial intelligent learning enables the method to reduce its dimension according to the conservative laws that are hidden in the data [Citation16,Citation17]. Artificial neural network has potential to indicate some physics natures buried in the big data. This work intends to provide a method to calculate the constitution and temperature-dependent electrical conductivity of the mould flux.

Artificial neural network

A supervised multilayer artificial neural network with back propagation learning algorithm has been coded for the present purpose. The network has an input layer containing 11 units to record the constitutional compositions and temperature. The output layer has 1 unit to provide the computational result for electrical conductivity. There are T-1 hidden layers each containing numerous units. The architecture of 2-layer network is illustrated schematically in Figure , where the hidden layer has n-units. The mapping function for 2-layer neural network in the present work is defined as (1.1) xj=i=110ωj,i(1)ci+ωj,11(1)T+bj(1)(1.1) (1.2) uj=1/(1+exj)(1.2) (1.3) y=j=1nω1,j(2)uj+b1(2)(1.3) (1.4) σ=1/(1+ey)(1.4)

Figure 1. Schematic diagram illustrates 2-layer neural network.

Figure 1. Schematic diagram illustrates 2-layer neural network.

The mapping function for >2 layers neural network can be manipulated in the same way. A code package to calculate up to 4-layer neural network has been developed by the author. ωj,i(k) is the weight factor between jth unit in kth layer and ith unit in (k−1)th layer, where 0th layer is the input layer. bj(k) is the bias for jth unit in kth layer. xj is the activation of jth unit in 1st layer uj is its activation function. y is the activation of output layer. The electrical conductivity σ is the activation function of y.

The activation functions in Equations (1.2) and (1.4) are nonlinear and with output values between 0 and 1. Many other neural network models choose other functions such as f(x)=tanh(x) [Citation14], which has value between −1 and 1, or Gaussian distribution [Citation16]. It is important to normalise the data according to the format of activation functions. In the present model, all the electrical conductivity data are normalised to a value between 0 and 1. The numerical results are denormalised to get the true value.

Following the standard method in neural network calculation [Citation18], the total difference is defined as (2) ED=1ml=0m(σc(l)σe(l))2(2) where m is the total number of sets of training data. σe(l) and σc(l) are the lth training value and calculated value, respectively. To prevent neural network optimisation from overfitting the noise in training data, a regularisation is defined to minimise the total value of weight factor square as EW=i,j,k(wj,i(k))2. The overall target function is defined as [Citation18] (3) E=12βED+12αEW(3) where β and α are coefficients. Their ratio reflects a balance between accuracy and simplicity. The intelligent learning is to perform a gradient descent minimisation of the total difference via [Citation19] (4) Δwj,i(k)=ηE/wj,i(k)(4) where η is the learning rate. Equation (4) defines one of the learning methods that always works but unnecessarily the most effective method. In the following section, other learning methods will be used and compared with Equation (4). The analytical expression for each weight factor and bias can be obtained by back propagation of the partial difference of target function according to Equation (4).

Data assessment

The composition in the database is defined to be atomic percentage. The weight percentage data from literature has been converted according to their molar weight [Citation3,Citation5,Citation6,Citation21]. Renormalisation has been performed for those data that the original total composition does not come to 100% [Citation22]. This is not rare in steel company become the initial constitutions of C and CO2 in mould powder are not counted in liquid slag. However, some data in literature has a total composition in excess of 100%. Those data are ignored entirely. For the data contain tiny fraction (3.00 wt-%) of NaO and K2O but without detailed information about their ratio [Citation21]. An assumption of 50:50 is applied. This assumption does not affect the specification of the individual contribution from NaO and K2O to the electrical conductivity because a significant amount of NaO and K2O data from other literature has been applied to justify the assumption [Citation23,Citation24]. For the data that have been assessed previously [Citation8], the assessed data instead of large amount of raw experimental data has been adopted to minimise noise. Some data are reported as scattered points in plotted figure. To get the values from the plotted figure with high accuracy, a java-based code package has been developed by the author to convert the figure to data. Some figures have been plotted artificially with scales in coordinates not proportional [Citation22]. Only those data with clear indication of their values have been adopted. For the plotted continuous curves, only those points accompanied with experimental values are adopted. After the critical assessment, a database contains 752 sets of data with one unitary, five binary, nine ternary, four quaternary, two quinary, two senary and one octonary subsystems has been built up. The detailed subsystems are listed in Table . The parameter ranges are shown in Table . The unit of temperature is Kelvin. The unit of electrical conductivity is Ω−1·cm−1.

Table 1. The subsystems in the electrical conductivity database.

Table 2. The range of parameters’ values in the database.

Neural network computation and results

The activation functions, as illustrated in Equations (1.2) and (1.4), have an output value between 0 and 1 for the activation between -∞ and +∞. However, σ=0.006693 at y=5 and σ=0.993307 at y=5, which indicate an extremely slow approximation to either 0 or 1. Based on this consideration, the training data for electrical conductivity is not normalised by the true minimum and maximum values in the database but multiplied by 0.7 to the minimum value and 1.3 to the maximum value. The input parameters for composition and temperature are all normalised to a value between 0 and 1 to ensure every input parameter has the same weight of contribution. The denormalisation and normalisation procedure followed the following equations (5.1) σ~=σ0.7σmin1.3σmax0.7σmin(5.1) (5.2) c~i=cicimincimaxcimin(5.2) where σ~ and c~i are the normalised electrical conductivity and composition for i<11 and temperature when i=11. The initial values for all the weight factors and biases are assigned to a random float value between −5 and 5. A high-quality random number generator was coded according to a probability theory developed by Marsaglia et al. [Citation25].

In neural network calculation, it has been noted that the artificial learning by means of Equation (4) in every time iteration does not help to find the weight factors and biases to achieve minimum total differences between the target values and calculated value. The weight factors soon adjust their values to minimise the overall target function (E) rather than the total difference (ED). To overcome this problem, the regularisation term (EW) is not included in each time iteration but replaced at the final step assessment. It is also found that the convergence rate is almost doubled by the following artificial learning mechanism, which agrees with the suggestion from Rumelhart et al. [Citation19] (6) Δwj,i(k)(t+1)=ηE/wj,i(k)+δΔwj,i(k)(t)(6) where δ is a coefficient. For the 2-layer neural network calculation with n=16 and α=β=η=δ=0.5, the evolution of total difference, regularisation term and target function vs iteration steps are demonstrated in Figure . It shows that that total difference drops sharply in the early stage (labelled by A), followed by slow drops (labelled by B) until a flat stage (labelled by C) to fluctuate around a minimum value. However, the regularisation term was increased slowly but monotonically until stage C. This is due to the early mentioned decision of not to include the minimisation of regularisation term in the time iteration. The target function has been reduced monotonically until the flat stage.

Figure 2. The evolution of total difference (ED), regularisation term (EW) and target function (E) vs. iteration steps for 2-layer neural network with hidden layer containing 16 units.

Figure 2. The evolution of total difference (ED), regularisation term (EW) and target function (E) vs. iteration steps for 2-layer neural network with hidden layer containing 16 units.

To determine the optimum number of units in the hidden layer in 2-layer neural network calculation, one has calculated the change of differences until 1.6 × 107 iteration steps for various number of units. The results are plotted in Figure . Although ED decreases when the number of units increases, EW demonstrates some optimised values. EW increases sharply when the number of units is away from the optimised one. The target function, E, reveals an optimised value for the number of units in the hidden layer of the 2-layer neural network. Based on the results, n=16 is chosen. It is worth mentioning that the local minimum at n=16 for the curve of ED is out of expectation. To double check whether it is a numerical coincidence, the code and parameters were run at three different workstations but the results were very similar, given the fact that the initialisation of weight factors involves a random number generator which should be different at different computers. The smallest ED appeared at 9,012,000th iteration step. The values for the weight factors and biases at this optimised condition are listed in Table . These values can be used to calculate the electrical conductivity of the system at any composition and temperature in the parameters’ range.

Figure 3. The evolution of total difference (ED), regulation term (EW) and target function (E) vs. number of units in hidden layer after 1.6 × 107 iteration steps.

Figure 3. The evolution of total difference (ED), regulation term (EW) and target function (E) vs. number of units in hidden layer after 1.6 × 107 iteration steps.

Table 3. The optimised weight factors and biases obtained by artificial neural network calculations.

The optimised weight factors and bias values have been implemented to calculate the electrical conductivity for 752 sets of compositions and temperature. The results are plotted in Figure  and compared with the values in the database. Figure (a) shows the comparison at linear scale coordinates. The 45° line indicates the perfect agreement between the artificial neural network computational results and the value in database, where majority data are from experimental measurement and the rest from assessment based on experimental values. The figure shows good agreement. Owing to the wide range distribution of the electrical conductivity values from 0.016 to 23.771, which across three orders of magnitude, the comparison in logarithmic scale is shown in Figure (b). The data shows some almost evenly distribution around the 45° line, majority with absolute error below 5%. The largest absolute discrepancy appears in the lowest electrical conductivity end, as is circled in Figure (b). Those data are found all belong to CaO-SiO2-Al2O3 subsystem at a temperature either in 1623 K or 1673 K, and was reported in one paper.

Figure 4. The electrical conductivity from numerical results by artificial neural network calculation vs. the value in the database: (a) in linear scale plotting, and (b) in logarithmic scale plotting.

Figure 4. The electrical conductivity from numerical results by artificial neural network calculation vs. the value in the database: (a) in linear scale plotting, and (b) in logarithmic scale plotting.

To validate the artificial neural network calculations, the optimised weight factors and bias values have been implemented to calculate two binary systems CaF2-Al2O3 and Cao-CaF2 at different temperature. The results have been compared with the experimental results reported in various literature [Citation6,Citation26,Citation27]. Figure  presents the results for (a) CaF2-Al2O3 at 1773K, (b) CaF2-Al2O3 at 1973K, (c) Cao-CaF2 at 1773K and (d) Cao-CaF2 at 1873K. These experimental data are not in database during the training of neural network. Figure  shows that the electrical conductivity obtained in the neural network calculations are within the fluctuation of various experimental measurements. It proves that the artificial neural network prediction for the electrical conductivity can be used to predict the change of electrical conductivity at various subsystems in different compositions and temperatures.

Figure 5. Validation of neural network calculation by experimental data reported in literature [Citation6,Citation26,Citation27]. (a) CaF2-Al2O3 at 1773K, (b) CaF2-Al2O3 at 1973K, (c) Cao-CaF2 at 1773K, and (d) Cao-CaF2 at 1873K.

Figure 5. Validation of neural network calculation by experimental data reported in literature [Citation6,Citation26,Citation27]. (a) CaF2-Al2O3 at 1773K, (b) CaF2-Al2O3 at 1973K, (c) Cao-CaF2 at 1773K, and (d) Cao-CaF2 at 1873K.

The artificial neural network and machine learning have many potential applications in steel metallurgy [Citation28]. In the future, more works will be done to include other components to the system, such as NiO, TiO2, MgF2, BaF2, BaO, ZrO, CaS. The availability of the new experimental measurement method for electrical conductivity enables to get more accurate data in other systems [Citation29], which will help to build up database for training and validation of the neural networks. The future work can, hopefully, also include the effort to use the data and machine learning method to identify the main oxides that control the electrical conductivity of mould flux and the influence of temperature on the electrical properties, and to compare the results with the theoretical predictions [Citation7,Citation8,Citation15].

Conclusions

  • An electrical conductivity database for CaO-SiO2-Al2O3-NaO-K2O-MgO-CaF2-Cr2O3-FeO-MnO liquid mould slag system has been built up.

  • The database has been implemented to train an artificial neural network. It is found that the two-layer neural network with 16 units in hidden layer provides the minimum difference in target function. The optimised weight factors and bias values can be used to calculate the electrical conductivity of the system in a wide range of compositions and temperature.

  • The numerical prediction has been validated by the experimental results reported in literature. Excellent performance of artificial neural network derivation has been proved.

Acknowledgements

This work was financially supported by the European Commission RFCS (No. 847269) and Engineering and Physical Sciences Research Council in UK (No. EP/R029598/1).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Engineering and Physical Sciences Research Council [grant number EP/R029598/1]; European Commission RFCS (Research Fund for Coal and Steel) [grant number 847269].

References

  • Yang J, Zhang JQ, Ostrovski O, et al. Effects of fluorine on solidification, viscosity, structure, and heat transfer of CaO-Al2O3-based mold fluxes. Metall Mater Trans. 2019;50B:1766–1772.
  • Barati M, Coley KS. Electrical and electronic conductivity of CaO-SiO2-FeOx slags at various oxygen potentials: Part I. Experimental results. Metall Mater Trans. 2006;37B:41–49.
  • Winterhager H, Kammel R, Gad A. Electric conductivity, density and surface tension of fluoride type slag for electroslag remelting method. Electric Steelmak. 1974;45:234–252.
  • Hara S, Hashimoto H, Kaxumi O. Electrical conductivity of molten slags for electro-slag remelting. Trans Iron Steel Inst Jpn. 1983;23:1053–1058.
  • Sarka SB. Electrical conductivity of molten high-alumina blast furnace slags. ISIJ Int. 1989;29:348–351.
  • Ogino K, Hara S. Density, surface tension and electrical conductivity of calcium fluoride based fluxes for electroslag remelting, iron and steel. Tetsu-To-Hagane. 1977;63:2141–2151.
  • Zhang GH, Chou KC. Simple method for estimating the electrical conductivity of oxide melts with optical basicity. Metall Mater Trans. 2010;41B:131–136.
  • Shi GY, Ta Z, Dou ZH, et al. Estimation model for electrical conductivity of CaF2-CaO-Al2O3 slags. JOM. 2016;68:2365–2370.
  • Haraguchi Y, Nakamoto M, Suzuki M, et al. Electrical conductivity calculation of molten multicomponent. ISIJ Int. 2018;58:1007–1012.
  • Zhang XF, Qin RS. Segregation of copper in an Fe–Cu alloy under pulsed electric current. Phil Mag Lett. 2015;95:367–375.
  • Halsey TC. Electrorheological fluids. Science. 1992;258:761–766.
  • Zhang XF, Qin RS. Electric current-driven migration of electrically neutral particles in liquids. Appl Phys Lett. 2014;104:114106.
  • Qin RS. Suppression of the surface roughness and fluctuation frequency by electric method. Mater Today Commun. 2021;28:102512.
  • Bhadeshia HKDH, Dimitriu RC, Forsik S, et al. Performance of neural networks in materials science. Mater Sci Technol. 2009;25:504–510.
  • Bohnenkamp U, Sandström R, Grimvall G. Electrical resistivity of steels and face-centered-cubic iron. J Appl Phys. 2002;92:4402–4407.
  • Liu ZM, Tegmark M. Machine learning conservation laws from trajectories. Phys Rev Lett. 2021;126:180604.
  • Wetzel SJ, Melko RG, Scott J, et al. Discovering symmetry invariants and conserved quantities by interpreting siamese neural networks. Phys Rev Res. 2020;2:033499.
  • MacKay DJC. Information theory, inference, and learning algorithms. 4th ed., Cambridge, Cambridge University Press; 2003.
  • Rumelhart DE, Hinton GE, Williams RJ. Learning representations by back-propagating errors. Nature. 1986;323:533–536.
  • Mills KC, Su YC, Li ZS, et al. Equations for the calculation of the thermo-physical properties of stainless steel. ISIJ Int. 2004;44:1661–1668.
  • Deng LB, Wang S, Zhang Z, et al. The viscosity and conductivity of the molten glass and crystallization behavior of the glass ceramics derived from stainless steel slag. Mater Chem Phys. 2020;251:123159.
  • Farahat R, Eissa M, Megahed G, et al. Effect of EAF slag temperature and composition on its electrical conductivity. ISIJ Int. 2019;59:216–220.
  • Zhang GH, Zheng WW, Jiao SQ, et al. Influences of Na2O and K2O additions on electrical conductivity of CaO-SiO2-(Al2O3) melts. ISIJ Int. 2017;57:2091–2096.
  • Mori K, Matsushita Y. On the constitution of molten slags viewing from the standpoint of the electrical conductivity. Tetsu-to-Hagane. 1952;38:444–448.
  • Marsaglia G, Zaman A, Tsang W. Toward a universal random number generator. Stat Probab Lett. 1990;9:35–39.
  • Eisenhuttenleute VD. Slag Atlas. 2nd ed Dusseldorf: Verlag-Stahleisen GmbH; 1995.
  • Mitchell A, Cameron J. Electrical conductivity of some liquids in system CaF2 + CaO + Al2O3. Mater Trans. 1971;2:3361–3366.
  • Smith JL. Advances in neural networks and potential for their application to steel metallurgy. Mater Sci Technol. 2020;36:1805–1819.
  • Zhang L, Malfliet A, Blanpain B, et al. In situ electrical conductivity measurement by using confocal scanning laser microscopy. Metall Mater Trans. 2021;52B:2563–2572.