1,575
Views
14
CrossRef citations to date
0
Altmetric
Articles

Estimating CO2-Brine diffusivity using hybrid models of ANFIS and evolutionary algorithms

, , ORCID Icon & ORCID Icon
Pages 818-834 | Received 15 May 2019, Accepted 22 Jan 2020, Published online: 22 Jun 2020

Abstract

One of the important parameters illustrating the mass transfer process is the diffusion coefficient of carbon dioxide which has a great impact on carbon dioxide storage in marine ecosystems, saline aquifers, and depleted reservoirs. Due to the complex interpretation approaches and special laboratory equipment for measurement of carbon dioxide-brine system diffusivity, the computational and mathematical methods are preferred. In this paper, the adaptive neuro-fuzzy inference system (ANFIS) is coupled with five different evolutionary algorithms for predicting the diffusivity coefficient of carbon dioxide. The R2 values forthe testing phase are 0.9978, 0.9932, 0.9854, 0.9738 and 0.9514 for ANFIS optimized by particle swarm optimization (PSO), genetic algorithms (GA), ant colony optimization (ACO), backpropagation (BP), and differential evolution (DE), respectively. The hybrid machine learning model of ANFIS-PSO outperforms the other models.

Notations

ANFIS=

adaptive neuro-fuzzy inference system

ACO=

ant colony optimization

CF=

crossover factor

DE=

differential evolution

GA=

genetic algorithms

PSO=

particle swarm optimization

STD=

Standard deviations

R2=

coefficient of determination

BP=

backpropagation

FL=

fuzzy logic

RMSEs=

root mean squared errors

MFs=

membership functions

TSK=

Takagi–Sugeno-Kang

MF=

mutation factor

MAREs=

Mean absolute relative errors

1. Introduction

Recently, the utilization of environmentally friendly and green energy has been accelerated because of the pollution crisis and increasing global energy demand. The carbon dioxide shows the valuable and wide application in this issue (Chang et al., Citation2012; Hemmati-Sarapardeh et al., Citation2013; Li & Gu, Citation2014). One of the common methods for extracting methane from its hydrates without damaging to the marine is replacing carbon dioxide instead of methane hydrate(Bai et al., Citation2012; Ota et al., Citation2007). The utilization of carbon dioxide for heat transmission is more applicable for extraction heat from hot fractured rock respect to water (Cui et al., Citation2016; Pruess, Citation2006; Zhang et al., Citation2016). Also in the other viewpoint the carbon dioxide injection to the geothermal reservoirs and seabed environment has straight effect on reduction of carbon dioxide emission (Agartan et al., Citation2015; Eccles et al., Citation2009; Javadpour, Citation2009; Rau & Caldeira, Citation1999; Ren et al., Citation2015; Ren et al., Citation2014; Trevisan et al., Citation2014a). When carbon dioxide has contact with water interface it can diffuse through the water so the diffusion coefficient is known as a major parameter which effects fluid diffusivity(Farajzadeh et al., Citation2009; Mutoru et al., Citation2011). This factor has a dominant effect on chemical reactions and mass transfer in porous media and solutions(S. P. Cadogan et al., Citation2014a; Trevisan et al., Citation2014b).

Numerous studies have been done about the experimental estimation of carbon dioxide diffusion coefficients (Bodnar & Himmelblau, Citation1962; Brignole & Echarte, Citation1981; Cadogan et al., Citation2014a; Choudhari & Doraiswamy, Citation1972; Maharajh, Citation1973; Frank et al., Citation1996; Guzmán & Garrido, Citation2012; Himmelblau, Citation1964; Hirai et al., Citation1997; Jähne et al., Citation1987; Liger-Belair et al., Citation2003; Lu et al., Citation2013; Cadogan et al., Citation2014b; Maharajh & Walkley, Citation1973; Mazarei & Sandall, Citation1980; Versteeg & Van Swaaij, Citation1988; Vivian & Peaceman, Citation1956; Vivian & King, Citation1964). According to the published experimental investigations, the laboratory measurement of CO2 diffusion coefficients can be classified to the two main categories, the direct methods such as Taylor-Aris dispersion approach which is based on the experimental determination of CO2 concentration in solvent and the indirect approaches which used the variance of volume or pressure depended on gas diffusivity (Bodnar & Himmelblau, Citation1962; Brignole & Echarte, Citation1981; S. P. Cadogan et al., Citation2014a; S. P. Cadogan et al., Citation2014b; Choudhari & Doraiswamy, Citation1972; Maharajh, Citation1973; Frank et al., Citation1996; Guzmán & Garrido, Citation2012; Himmelblau, Citation1964; Hirai et al., Citation1997; Jähne et al., Citation1987; Liger-Belair et al., Citation2003; Lu et al., Citation2013; D. M. Maharajh & Walkley, Citation1973; Mazarei & Sandall, Citation1980; Ng & Walkley, Citation1969; Nijsing et al., Citation1959; Pratt et al., Citation1973; Reddy & doraiswamy, Citation1967; Tamimi et al., Citation1994; Tan & Thorpe, Citation1992; Tham et al., Citation1967; Thomas & Adams, Citation1965; Versteeg & Van Swaaij, Citation1988; J. Vivian & Peaceman, Citation1956; J. E. Vivian & King, Citation1964). The prediction of carbon dioxide diffusion coefficients can be straightly affected by the convection in the view of accuracy at the elevated pressure (Farajzadeh et al., Citation2009) due to the effect of convection on estimation and its difficulties of diffusivity coefficients there are just a few experimental studies in literature for the elevated pressures (S. P. Cadogan et al., Citation2014b; Lu et al., Citation2013; D. M. Maharajh & Walkley, Citation1973). The experimental measurement of CO2 diffusion coefficients are complicated, time and cost consuming because of special procedures and complex measurement equipment. To overcome these problems some empirical relations have been proposed as reported in Table (S. P. Cadogan et al., Citation2014b; Lu et al., Citation2013; Moultos et al., Citation2016; Othmer & Thakar, Citation1953; Wilke & Chang, Citation1955). DCO2, µ, and Vm is the carbon dioxide diffusion coefficient, the viscosity of solvent and molar volume. Φ, M, nSE and denote the association parameter, Molecular weight of solvent, stokes–Einstein number and hydrodynamic radius of solute.

Table 1. The brief summary of developed correlations for diffusion coefficient of carbon dioxide.

The dominant operational conditions such as temperature and pressure have strong effects on CO2 diffusion during mass transfer so there are significant differences between diffusivity of carbon dioxide under high pressure and temperature and normal conditions. The salinity of the water is known as another critical parameter that effects CO2 molecular diffusion (Cadogan, Citation2015). Wilke et al. suggested a predicting model for the estimation of diffusion coefficients of carbon dioxide in different viscosity and temperature (Wilke & Chang, Citation1955). Lu and coworkers developed a relation for the diffusion of carbon dioxide into pure water in the range of 268–473 K (Lu et al., Citation2013) for temperature. Moultos et al. utilized dynamics simulations to modify the Lu's equation and proposed a relation to forecast the diffusion coefficient of CO2 and water system in the elevated temperatures and pressures (Moultos et al., Citation2016). In order to accurate determination of carbon dioxide diffusivity in the brine system, the viscosity of saline solution must be considered so Lu and Moultos correlations are not applicable directly for the CO2-brine system. Cadogan et al. made a modification on Stokes–Einstein equation to predict diffusivity of CO2-brine system but it is not reliable for high temperatures (S. P. Cadogan et al., Citation2014b).

The previous works have some limitations in the accuracy of the range of their applications and also the experimental works on this issue require a considerable amount of time and cost. Due to these facts, the importance of the development of an accurate and comprehensive method that has minimum cost has been highlighted. In order to accurate estimation of carbon dioxide diffusivity in solution with various salinities, the importance of proposing a general and accurate predictive algorithm becomes highlighted. On the other hand, the wide applications of artificial intelligence approaches in different parts of engineering processes and proposing the solutions for difficult issues (Baghban et al., Citation2016; Baghban et al., Citation2017a; Baghban et al., Citation2017b; Baghban et al., Citation2017b; Chau, Citation2017; Chuntian & Chau, Citation2002; Fotovatikhah et al., Citation2018; Moazenzadeh et al., Citation2018; Taherei Ghazvinei et al., Citation2018; Yaseen et al., Citation2018), cause artificial intelligence methods can be considered as an approach for estimation of CO2 diffusivity in different conditions. First, a collection of carbon dioxide diffusivity coefficients was gathered, then, we utilized ANFIS to build a model for the prediction of carbon dioxide diffusivity coefficients in the brine system. The predicting algorithm is optimized by the utilization of five evolutionary algorithms which are DE, ACO, GA, PSO, and BP. The ability of each developed algorithm is assessed by utilization of graphical and statistical analyses and also compared with the available correlations in the literature. In order to clarify the present work, different steps are shown in Figure .

Figure 1. A brief summary of present work.

Figure 1. A brief summary of present work.

2. Methodology

2.1. ANFIS

Zadeh introduced FL. The main characteristic of FL is known for the advancement of the linguistic parameters into the forms of mathematical equations. The regulations of FL include a number of if–then operators to change the quantitative parameters into qualitative. Due to a lack of accuracy in the modeling process, this approach could not satisfy scholars for prediction so the coupling FL and ANN were suggested as an efficient and accurate algorithm. The proposed tool which has potentials of FL such as MFs and if–then regulations called ANFIS. ANN is implemented to tune membership functions (Shamshirbandet al., Citation2019; Karkevandi-Talkhooncheh et al., Citation2017). Takagi- TSK as one of the FIs was applied in this study. TSK used to input and output patterns for if–then rules (Najafi-Marghmaleki et al., Citation2017; Nikravesh et al., Citation2003). Such ANFIS models have been widely used in building advanced prediction models in the relevant applications (Dehghani et al., Citation2019; Mosavi & Edalatifar, Citation2018; Mosavi et al., Citation2018; Rezakazemi et al., Citation2019).

Figure illustrates the structure of ANFIS network which has n input data. Xi and Yi represent the input and output of this network. The below rules are used to construct a new model in TSK FIS:

  • Rule one: IF < X1, is A11, and X2, is A21, and … and X5, is A51 > THEN < y1 = a1X1 + b1X2 + c1X3 + d1X4 + e1X5 + f1>

  • Rule two: IF < X1, is A12, and X2, is A22, and … and X5, is A52 > THEN < y2 = a2X1 + b2X2 + c2X3 + d2X4 + e2X5 + f2>

  • Rule three: IF < X1, is A13, and X2, is A23, and … and X5, is A53 > THEN < y3 = a3X1 + b3X2 + c3X3 + d3X4 + e3X5 + f3>

  • Rule four: IF < X1, is A14, and X2, is A24, and … and X5, is A54 > THEN < y4 = a4X1 + b4X2 + c4X3 + d4X4 + e4X5 + f4>

  • Rule five: IF < X1, is A15, and X2, is A25, and … and X5, is A55 > THEN < y5 = a5X1 + b5X2 + c5X3 + d5X4 + e5X5 + f5>

  • Rule fix: IF < X1, is A16, and X2, is A26, and … and X5, is A56 > THEN < y6 = a6X1 + b6X2 + c6X3 + d6X4 + e6X5 + f6>

  • Rule seven: IF < X1, is A17, and X2, is A27, and … and X5, is A57 > THEN < y7 = a7X1 + b7X2 + c7X3 + d7X4+e7X5+f7>

  • Rule eight: IF < X1, is A18, and X2, is A28, and … and X5, is A58 > THEN < y8 = a8X1 + b8X2 + c8X3 + d8X4 + e8X5 + f8>

Figure 2. Different layers of ANFIS algorithm.

Figure 2. Different layers of ANFIS algorithm.

The phrases after IF and THEN represent antecedent and consequences respectively. A and B denote the fuzzy model sets of inputs.

Figure is carried out to show ANFIS model has five different layers which can be described in details as below (Dadkhah et al., Citation2017; Safari et al., Citation2014; Tatar et al., Citation2016):

Layer 1:

The changing of input data can be used for the creation of linguistic terms. The n nodes have been created to relate the linguistic attributes of input data. As the Gaussian functions are efficient with higher performance in prediction for algorithm so it is implemented to organize these linguistic attributes. The Gaussian functions are presented as below: (1) Oi1=β(X)=exp12(XZ)2σ2(1) Here, σ, Z and O parameters and denote to the variance, center for Gaussian MF, and output, respectively. They can be determined during the model training.

Layer 2:

The controlling of the accuracy and performance of qualifications is located in the second layer, the firing strength layer can be mentioned as another name for this layer, the following formulation explains the determination level: (2) Oi2=Wi=βAi(X).βBi(X)(2)

Layer 3:

The normalization of firing strength which is placed in third layer can be formulated as below: (3) Oi3=WiWi(3)

Layer 4:

The creation of linguistic terms is conducted within this layer. Furthermore, the effect of rules on outputs is presented as follow: (4) Oi4=Wi¯fi=Wi¯(miX1+niX2+ri)(4) In the above expression, ni, mi, and ri relate to linear parameters. In addition, the model has been developed for the purpose of reduction of the difference between the measured data and the forecasted values to optimize the parameters.

Layer 5:

The alteration of rules to qualitative form by weighted average summation takes place in this layer as follows: (5) Oi5=Y=iWi¯fi=W1¯f1+W2¯f2=WifiWi(5) To clarify the proposing of the algorithm, the schematic diagram of ANFIS coupled with the evolutionary algorithm is shown in Figure .

Figure 3. The schematic diagram of proposing predictive algorithm.

Figure 3. The schematic diagram of proposing predictive algorithm.

2.2. BP

BP minimizes the error by reducing the difference between estimated values and experimental data. to this end, the error is propagated back by optimization of variables in the network and biases and weights are adjusted to minimize the objective function (Afshar et al., Citation2014).

2.3. ACO

Dorigo introduced the ACO algorithm as an approach that is based on a population algorithm (Dorigo et al., Citation1996). ACO method is developed based on the natural behavior of ants to determine the shortest ways from their storage to food resources (Stützle & Hoos, Citation2000). Ants release a pheromone in their ways so the population can find probable paths or solutions. This approach can be applied for discrete space, not for the continuous domain. Due to this fact, a Gaussian model was proposed by the construction of a list of probable solutions to generalize this approach in continuous space. Some special solutions exist in solution archives all over time (Heris & Khaloozadeh, Citation2014). These solutions can be obtained by optimization algorithm.the minimization of cost function as the first step of the ACO approach starts with finding the vector xXRnx by the below steps (Lozano et al., Citation2006):

  1. Initialization: generation of random N definitions in X and evaluation of cost function.

  2. Solution archive: the best of the solution is described by x1 and the worst of them is represented by xN.

  3. Weight definition: the weights for members of solution archive are determined as follows: (6) ui12παNexp(12(i1αN)2(6)

Based on the below condition: (7) i=1Nui=1(7)
  1. Generating the probabilistic model: the Gaussian probabilistic model is constructed by the below formulations: (8) Gj(X[j])=i=1NuiN(X[j];μi[j];σi[j])(8) (9) N(X;μ;σ)=12πσexp12xμσ2(9) where j and x[j] denote the decision variable and the jth part in x.

    The standard deviation and mean of Gaussian mixture are expressed in the following: (10) μi[j]=Xi[j](10) (11) σi[j]=ξN1i=1N|Xi[j]Xi[j]|(11) where ξ>0 denotes the exploration/exploitation balance.

  2. New samples are generated by using the model G=(G1;G2;Gnx) as new members of the solution archive and their objective functions are evaluated in this part.

  3. The solution for the optimization problem is chosen by selecting the best solution archive.

  4. The Criterions are checked to end the optimization or restart from part 3.

2.4. DE

Differential evolution as one of the major metaheuristics methods utilizes fundamental genetic operators to investigate a population which does not require the optimization problem to be differentiable. The differential evolution method is described below (Das et al., Citation2011):

  1. The first iteration starts as mutation factor F and crossover rate CR are selected and fitness of elements Fik=f(Xik) for each I are determined.

  2. The mutation operator is used to generating a trial solution as below: (12) TXi,j=Xa1,jk+F(Xa2jk+Xa3jk)j and i(12)

  3. The crossover operator is employed on solutions as below: (13) Ui,j=TXij if rand()CR or j=randbXijk if rand()>CR and jrandb(13)

    r and b represents the random index of best element b which is obtained from randb=rand(D){1,2,,D4

  4. Trial fitness TFi=f(ui,1,ui,2,,Ui,D) is calculated and the selection operator is applied on trial solution as below: (14) XijK+1=UijjifTFi<FikXijKjifTFiFik(14)

  5. Fitness estimation Fik+1=f(Xik+1) is done and the best index is distinguished.

  6. The end criterion is checked and the explained loop restarts from part 3 if it is necessary.

2.5. GA

One of the optimization approaches which has great potential in optimizing various objective function is GA. The initial solutions of this approach which are called chromosomes are generated randomly and some actions are done on chromosomes by means of operators such as mutation, crossover, and reproduction. In this algorithm, to determine the probability of offsprings production, the CF and MF can be used (Alam et al., Citation2015; Bedekar & Bhide, Citation2011).

  1. An initial solution is created randomly and CF and MF factors are generated.

  2. In the second step the Chromosomes fitness, FiK=f(Xik) i is determined and the best solution index b{1, is distinguished.

  3. the various GA operators produce the new Chromosomes Xik+1,i

  4. the best one b1 is determined by a fitness assessment FiK+1=f(Xik+1)

  5. In the fifth step, the old chromosome is replaced by the new best one.

  6. The algorithm continues the loop to meet the best conditions.

2.6. PSO

PSO is considered as a stochastic optimization which mimics the populations in nature such as fish, birds and insects (Kennedy, Citation2011). In the PSO algorithm, promoting the initial populations is known as an optimization problem and particles represent the solutions (Castillo, Citation2012). A group of particles denotes swarm so the particle and swarm terms represent individual and population has extensive application in this algorithm and genetic algorithm(Onwunalu & Durlofsky, Citation2010; Sharma & Onwubolu, Citation2009).

In PSO algorithm, Xi(t) and Vi(t) denote position vector and velocity vector for particle i. The following equation is used for updating the particle velocity (Chen, Citation2013; Lin & Hong, Citation2007): (15) vid(t+1)=wvid(t)+c1r1(pbest,id(t)Xiid(t))+c2r2(gbest,d(t)Xid(t)),d=1,2,,D(15)

In Equation (16), pbest,id and gbest,id represent the best position of particle and global position r, c and w are the random number, learning rate and inertia weight. The below expression shows the next particle position (Chiou et al., Citation2012; Eberhart & Kennedy, Citation1995; Shi & Eberhart, Citation1998): (16) Xid(t+1)=Xid(t)+vid(t+1)d=1,2,,D(16)

2.7. Data gathering

Efficiency and accuracy of any estimating algorithm are highly dependent on the accuracy of experimental data used for the development of the predictive model (Das et al., Citation2011; Hajirezaie et al., Citation2017; Karkevandi-Talkhooncheh et al., Citation2017). So we gathered 86 actual data for the carbon dioxide diffusion coefficient in the brine system from the reliable papers (S. P. Cadogan et al., Citation2014b; Choudhari & Doraiswamy, Citation1972; Maharajh, Citation1973; Frank et al., Citation1996; Himmelblau, Citation1964; Hirai et al., Citation1997; Lu et al., Citation2013; D. M. Maharajh & Walkley, Citation1973; Nijsing et al., Citation1959; Reddy & doraiswamy, Citation1967; Tamimi et al., Citation1994; Tan & Thorpe, Citation1992; Thomas & Adams, Citation1965; Versteeg & Van Swaaij, Citation1988; J. Vivian & Peaceman, Citation1956). The experimental data consist of diffusion coefficients of carbon dioxide in the brine system as a function of viscosity, pressure, and temperature. The collected experimental data includes the diffusion coefficients for a pressure range of 0.1–49.3 MPa, a temperature range of 273–473.15 K and a viscosity range of 0.1388–1.95 mPa.s. In order to enhance the performance of training, the experimental data are normalized such as below: (17) xN=2xxminxmaxxmin1(17) The degree of membership functions for different inputs of algorithms are shown in Figure .

Figure 4. Degree of membership function for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

3. Results and discussion

In this paper, the ANFIS algorithm was combined with GA, ACO, PSO, DE, and BP to estimate the diffusivity coefficient in terms of temperature, pressure, and viscosity. In order to investigate the performance of the developed models, different statistical parameters such as the R2, MAREs, RMSEs and STD were calculated and reported in Table . The aforementioned indexes were determined such as following formulations: (18) R2=1i=1N(XiactualXipredicted)2i=1N(XiactualXactual¯)2(18) (19) MARE=1Ni=1N|XipredictedXiactual|Xiactual(19) (20) RMSE=1Ni=1N(XiactualXipredicted)2(20) (21) STDerror=1N1i=1N(errorerror¯)0.5(21)

Table 2. The determined statistical parameters for proposed different algorithms.

The determined R2 values in the testing phase were 0.997753, 0.993202, 0.985409, 0.973849 and 0.951371 for ANFIS optimized by PSO, GA, ACO, BP, and DE respectively. Furthermore, RMSE values were 0.112953, 0.197641, 0.316064, 0.398 and 0.533 for them. Also for four different empirical correlations, these statistical parameters were determined and shown in Table . The comparisons of these parameters exhibit that the proposed ANFIS algorithm combined with an appropriate evolutionary algorithm can be better predictive machine respect to the other published correlations.

The experimental and estimated diffusion coefficients are demonstrated simultaneously in Figure for different evolutionary algorithms. This comparison expresses good consistency between actual and predicted diffusivity coefficients for proposed algorithms. In order to illustrate the consistency, the experimental diffusivity is depicted against predicted diffusivity in Figure for ANFIS with different evolutionary algorithms. The resulted points for total data lie near to the bisector of the first quadrant which means the high capacity and great performance of algorithms. Furthermore, the determined fitting lines have formulations near the y = x line. One of the great approaches in the graphical analysis is the depiction of relative deviations for total datasets. So the relative deviations of predicted and actual diffusivity coefficients of different algorithms are shown in Figure . This analysis shows the low errors in the prediction of diffusivity for ANFIS combined with different algorithms. The ranges of relative error for different optimization processes are very low and also the PSO algorithm has better performance in comparison with others.

Figure 5. The comparison of experimental and predicted diffusivity coefficient for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

Figure 5. The comparison of experimental and predicted diffusivity coefficient for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

Figure 6. The experimental diffusivity coefficient versus predicted diffusivity coefficient for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

Figure 6. The experimental diffusivity coefficient versus predicted diffusivity coefficient for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

Figure 7. The relative deviation of experimental and estimated diffusivity for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

Figure 7. The relative deviation of experimental and estimated diffusivity for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

Table 3. The determined statistical indexes for the empirical correlations.

As mentioned before one of the important factors which have great impact on accuracy and performance of the model is accuracy and error of measured data points (Rousseeuw & Leroy, Citation2005), however the measured data utilized in the present paper were gathered from the reliable literature, due to experimental equipment and conditions they might contain some errors. Thus, here, the Leverage method has been employed to determine the inaccurate measured data. Referring to this computational method, the residuals are calculated and then inputs are utilized to organize Hat matrix by the below expression (Mohammadi et al., Citation2012b): (22) H=X(XTX)1XT(22)

In the above expression, X is known as the m×n matrix that n and m are the model parameters number and number of samples respectively. In order to the identification of inaccurate data and outliers of the dataset, William's diagram was plotted for outlier realization. As shown in Figure , the used data has some inaccurate and suspected data. These data are distinguished through standard residual indexes and limitations of the Leverage which are between −3 and 3. The leverage limit presented by H* can be determined by the below expression(Mohammadi et al., Citation2012a): (23) H=3(n+1)/m(23)

Figure 8. William's plots depicted for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

Figure 8. William's plots depicted for (a) PSO, (b) GA, (c) ACO, (d) BP, (e) DE.

The validity and generality of the proposed model for the diffusivity coefficient of carbon dioxide prediction at different conditions are illustrated in this study. The Relevancy factor is utilized to study the impact of input variables on diffusivity. The Relevancy factor can be formulated as following (Bemani et al., Citation2019; Razavi et al., Citation2019b; Razavi et al., Citation2019a): (24) r=i=1n(Xk,iXk¯)(YiY¯)i=1n(Xk,iXk¯)2i=1n(YiY)¯2(24) where Yi,Y¯,Xk,i and Xk¯ denote the ‘i’ th output, output average, kth of input, and average of input. The range of absolute value of r is lower than one and as the absolute value of r is near to one the effectiveness of the input parameter on the diffusion coefficient is higher. Figure shows the Relevancy factor of the diffusivity coefficient in terms of different parameters for ANFIS combined with five different evolutionary algorithms. This analysis shows that the temperature is the most effective input parameter on the diffusivity coefficient of carbon dioxide. Furthermore, temperature and pressure have a straight relationship with the diffusivity coefficient of carbon dioxide and also it can be seen that as viscosity increases, the coefficient decreases. According to the above discussions, a present study is a helpful tool for engineers and scientists (Figure ).

Figure 9. The calculated Relevancy factor of diffusivity coefficient in terms of different parameters.

Figure 9. The calculated Relevancy factor of diffusivity coefficient in terms of different parameters.

4. Conclusions

We proposed the ANFIS coupled with five different evolutionary algorithms to estimate CO2 diffusivity in the brine system. In order to train and test these models, a total number of 86 measured data were collected from previous works. The different statistical indicators and graphical analysis demonstrated the high efficiency and accurate ability of proposing models. The determined R2 values in the testing phase were 0.9978, 0.9932, 0.9854, 0.9738 and 0.9514 for ANFIS optimized by PSO, GA, ACO, BP, and DE respectively. Also for general evaluation of model four empirical correlations from previous papers were utilized and compared with the proposed algorithms. According to the aforementioned results, it can be concluded that proposed algorithms have better performance in the prediction of carbon dioxide diffusion coefficients. Moreover, sensitivity analysis showed that temperature and pressure are the most and least effective parameters on the carbon dioxide diffusion coefficient. As a recommendation for future works, it can be suggested that other machine methods can be used for the prediction of the carbon dioxide diffusivity coefficient. However, it is worthy to mention that these methods have a limitation or drawback in comparison with experimental works which is a dependency of the accuracy of the proposed models on the accuracy and amount of databanks.

Supplemental material

Supplemental Material

Download MS Excel (11.8 KB)

Acknowledgements

We acknowledge the “Open Access Funding by the Publication Fund of the TU Dresden”. We also acknowledge the financial support of this work by the Hungarian State and the European Union under the EFOP-3.6.1-16-2016-00010 project and the 2017-1.3.1-VKE-2017-00025 project. This research has been additionally supported by the Project: “Support of research and development activities of the J. Selye University in the field of Digital Slovakia and creative industry” of the Research and Innovation Operational Programme (ITMS code: NFP313010T504) co-funded by the European Regional Development Fund.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Correction Statement

This article has been republished with minor changes. These changes do not impact the academic content of the article.

References