Search in:

Engineering Applications of Computational Fluid Mechanics Volume 14, 2020 - Issue 1

Submit an article Journal homepage

Open access

2,884

Views

CrossRef citations to date

Altmetric

Listen

Articles

Modeling monthly pan evaporation process over the Indian central Himalayas: application of multiple learning artificial intelligence model

Anurag Malika Department of Soil and Water Conservation Engineering, College of Technology, G.B. Pant University of Agriculture and Technology, Pantnagar263145, Uttarakhand, India

https://orcid.org/0000-0002-0298-5777

Anil Kumara Department of Soil and Water Conservation Engineering, College of Technology, G.B. Pant University of Agriculture and Technology, Pantnagar263145, Uttarakhand, India

Sungwon Kimb Department of Railroad Construction and Safety Engineering, Dongyang University, Yeongju36040, Republic of Korea

Mahsa H. Kashanic Department of Water Engineering, Faculty of Agriculture and Natural Resources, University of Mohaghegh Ardabili, Ardabil, Iran

Vahid Karimid Department of Water Engineering, Faculty of Agriculture, University of Tabriz, Tabriz, Iran

Ahmad Sharafatie Department of Civil Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran

https://orcid.org/0000-0003-0448-2871

Mohammad Ali Ghorbanid Department of Water Engineering, Faculty of Agriculture, University of Tabriz, Tabriz, Iran

Nadhir Al-Ansarif Civil, Environmental and Natural Resources Engineering, Lulea University of Technology, 97187Lulea, Sweden

Sinan Q. Salihg Institute of Research and Development, Duy Tan University, Da Nang550000, Vietnam

Zaher Mundher Yaseenh Faculty of Civil Engineering, Ton Duc Thang University, Ho Chi Minh City, VietnamCorrespondence[email protected]

https://orcid.org/0000-0003-3647-7137

Kwok-Wing Chaui Department of Civil and Environmental Engineering, Hong Kong Polytechnic University, Hong Kong, People’s Republic of China

show all

Pages 323-338 | Received 17 Oct 2019, Accepted 06 Jan 2020, Published online: 27 Jan 2020

Cite this article
https://doi.org/10.1080/19942060.2020.1715845
CrossMark

In this article

1. Introduction
2. Material and methods
3. Results and discussion
4. Conclusion
Disclosure statement
References

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

The potential of several predictive models including multiple model-artificial neural network (MM-ANN), multivariate adaptive regression spline (MARS), support vector machine (SVM), multi-gene genetic programming (MGGP), and ‘M5Tree’ were assessed to simulate the pan evaporation in monthly scale (EP_m) at two stations (e.g. Ranichauri and Pantnagar) in India. Monthly climatological information were used for simulating the pan evaporation. The utmost effective input-variables for the MM-ANN, MGGP, MARS, SVM, and M5Tree were determined using the Gamma test (GT). The predictive models were compared to each other using several statistical criteria (e.g. mean absolute percentage error (MAPE), Willmott's Index of agreement (WI), root mean squared error (RMSE), Nash-Sutcliffe efficiency (NSE), and Legate and McCabe’s Index (LM)) and visual inspection. The results showed that the MM-ANN-1 and MGGP-1 models (NSE, WI, LM, RMSE, MAPE are 0.954, 0.988, 0.801, 0.536 mm/month, 9.988% at Pantnagar station, and 0.911, 0.975, 0.724, and 0.364 mm/month, 12.297% at Ranichauri station, respectively) with input variables equal to six were more successful than the other techniques during testing period to simulate the monthly pan evaporation at both Ranichauri and Pantnagar stations. Thus, the results of proposed MM-ANN-1 and MGGP-1 models will help to the local stakeholders in terms of water resources management.

KEYWORDS:

Pan evaporation
multiple model strategy
gamma test
Indian central Himalayas
meteorological variables

1. Introduction

Evaporation plays the main role in environmental studies and water resources management. Therefore, the precise simulation of EP_m is necessary for hydrological simulating, river flow forecasting, forestry, lake-ecosystems, agronomy and irrigation sciences (Burt, Mutziger, Allen, & Howell, Citation2005). The most effective meteorological factors on evaporation rate are relative humidity, air temperature, vapor pressure deﬁcit, atmospheric pressure, vapor pressure, wind speed, and sunshine hours (Yaseen et al., Citation2019). Evaporation is generally measured using two methods: (i) direct methods such as Class A pan-evaporimeter and (ii) indirect methods include empirical equations (Ghorbani, Deo, Yaseen, Kashani, & Mohammadi, Citation2017). Doorenbos and Pruitt (Citation1977) said that the Class A pan-evaporimeter performance was affected by the turbidity of the water, watering of birds or other animals, human and instrumentation errors, and maintenance problems. Numerous studies reported the determination of the pan evaporation applying some empirical and semi-empirical formulations based on various meteorological information (Griffiths, Citation1966; Penman, Citation1948; Priestley & Taylor, Citation1972). However, there are limitations to the applications of these methods because of using different climatic factors. Therefore, alternative methods, which require less meteorological data are needed to predict the evaporation (Kisi, Citation2015; Kisi, Genc, Dinc, & Zounemat-Kermani, Citation2016).

AI models has shown promising progress on the evaporation estimation (Tao et al., Citation2018). Several versions of AI models are explored for pan evaporation simulation including; ANN based models, Fuzzy based models, support vector machine (SVM), evolutionary computing, data mining and complementary wavelet-AI models (Adnan, Malik, Kumar, Parmar, & Kisi, Citation2019; Guven & Kisi, Citation2013; Kisi & Heddam, Citation2019; Qasem et al., Citation2019; Qutbudin et al., Citation2019; Rezaie-Balf, Kisi, & Chua, Citation2019; Sebbar, Heddam, & Djemili, Citation2019; Yaseen, El-shafie, Jaafar, Afan, & Sayl, Citation2015). Tabari, Talaee, and Abghari (Citation2012) predicted daily pan-evaporation using CANFIS and MLPNN techniques in Iran. The research findings demonstrated that the MLPNN model provided a better accuracy than the CANFIS model. Kisi (Citation2015) used least square SVM (LSSVM), M5Tree, and MARS for estimating pan evaporation in Turkey. They found that the MARS provided a significant superiority compared to other models. Deo, Samui, and Kim (Citation2016) developed the ELM, MARS, and relevance vector machine (RVM) to predict evaporative using maximum and minimum temperatures, atmospheric vapor pressure, precipitation and solar radiation in Australia. The results indicated that the RVM model has high ability compared to the other techniques. Malik and Kumar (Citation2015) hired the CANFIS, MLR, and ANN models for predicting the pan evaporation over different stations in India and the ANN models are more successful compared to other ones. Keshtegar, Piri, and Kisi (Citation2016) used M5Tree, conjugate gradient (CG), and ANFIS models to simulate of evaporation. Results produced that the CG model is more satisfactory than the M5Tree and ANFIS techniques. Wang, Kisi, Zounemat-Kermani, and Li (Citation2017) estimated pan evaporation using the FG, ANFIS-GP, and M5Tree models in China. Results proved the high ability of the FG model in pan evaporation estimation. Wang, Kisi, Hu, et al. (Citation2017) predicted pan evaporation using MLPNN, GRNN, ANFIS with grid partition (ANFIS-GP), MARS, MLR, FG, LSSVM, and SS models in China. The results showed that the AI models provided the most accuracy than the SS and MLR techniques. Wang, Niu, Kisi, Li, and Yu (Citation2017) evaluated the LSSVR, MARS, MLR, FG, and M5Tree models for simulating EP_m in daily scale in China. Results demonstrated that the LSSVR and FG models show the highest accuracy to the other applied models in pan evaporation estimation.

Furthermore, the current research the correlated input variables are identified using a nonlinear method called Gama Test (GT). In recent decade, various researches about applying Gamma test have been done in numerous cases (Malik & Kumar, Citation2018; Remesan, Shamim, & Han, Citation2008; Singh, Malik, Kumar, & Kisi, Citation2018). Moghaddamnia, Ghafari Gousheh, Piri, Amin, and Han (Citation2009) simulated evaporation using the ANN and ANFIS models while applying GT to select suitable input variables in different area of Iran. Results showed the ANFIS and ANN models have high capability in the estimation of EP_m values. Goyal, Bharti, Quilty, Adamowski, and Pandey (Citation2014) identified suitable input variables using GT for FL, ANN, ANFIS, and least square-SVM techniques to estimate EP_m in daily scale in India, and the SS and Hargreaves-Samani (HGS) equations were compared with the results obtained by AI models. The study showed the superior accuracy of the SCT over SS and HGS techniques. Malik, Kumar, and Kisi (Citation2018) simulated daily EP_m using RBNN, MLR, Griffiths, Stephens-Stewart, Priestley-Taylor, Christiansen, Penman, SOMNN, and Jensen-Burman-Allen models in India. The input combinations of MLR, RBNN, and SOMNN were determined using the GT. The results of the research indicated that the most accuracy was observed in the RBNN model.

The current research is conducted to accomplish the following aims:

An appropriate input combination is identified using GT before the construction of the proposed and comparable predictive models.
Potential of the proposed model is investigated using climate information of Ranichauri and Pantnagar stations located in Indian central Himalayas.
A comprehensive evaluation is performed on the obtained performance of the proposed predictive models.

2. Material and methods

2.1. Description of case study

The sources of the weather information used in this study are two stations (i.e. Ranichauri and Pantnagar) in India. The coordinates of the Pantnagar and Ranichauri stations are (79° 38′ 0′′ E, 29° 0′ 0′′ N) and (78° 24′ 35′′ E, 30° 18′ 40′′ N), respectively. These coordinates are depicted in Figure . The Pantnagar and Ranichauri stations are located at altitude of 243.8 and 2000 m above sea level, respectively. The recorded meteorological information was collected from the Crop Research Centres in Uttarakhand, India are the monthly minimum (T_min) and maximum temperatures (T_max), the wind speed (S_w), the hours of sunshine (H_ss), the monthly pan evaporation (EP_m), and the morning and afternoon relative humidity (RH₁ and RH₂). The EP_m in monthly scale were obtained from Crop Research Centre (CRC) observatory. Figure (a and b) depict the box–whisker plots of the meteorological information collected over 27 years at Pantnagar station (January 1990 to December 2016), and at Ranichauri station for over 13 years (January 2000 to December 2012), respectively. These plots provide information such as the minimum, quartile-wise, and maximum values of meteorological data. The collected data was partitioned into two different sets of varying strengths, (i) the training dataset, which was selected from January to December years 1990–2010 at the Pantnagar station, and years 2000–2009 at the Ranichauri station; and (ii) testing dataset, which was selected from January to December years 2011–2016 at the Pantnagar station and years 2010–2012 at the Ranichauri station. The correlation statistic between the predictive and target variables for both stations are presented in Table . This table shows that the input variables (i.e. RH₁, RH_2, S_w, T_min, T_max, and H_ss) have better cross-correlations (significant in statistics) with the output variable EP_m at 5% confidence level for Pantnagar station. Besides, the input variables (i.e. RH₁, H_ss, T_min, and T_max) have better cross-correlation (significant in statistics) with the output variable EP_m at 95% confidence level for Ranichauri station. It was also observed from Table , that there was no significant cross-correlation between T_min and H_ss variables for Pantnagar station, and T_min, S_w; T_min, H_ss; T_max, RH₁; T_max, RH_2; T_max, S_w; RH_2, EP_m; and S_w and EP_m variables for Ranichauri stations.

Figure 1. Location of the studied meteorological stations.

Figure 2. Box-Whisker plot of monthly meteorological data at (a) Pantnagar station, and (b) Ranichauri station.

Table 1. Inter-correlation between climatic variables at Pantnagar and Ranichauri stations.

Download CSV Display Table

2.2. Gamma test (GT)

Gamma test which includes continuous nonlinear models calculates the minimum standard error (SE) for a dataset. A data sample would be characterized as following formulas (Tsui, Jones, & De Oliveira, Citation2002); (1) ${x_{1} (i), \dots, x_{m} (i), y_{i}} = {(x_{i}, y_{i}) | 1 \leq i \leq M}$ (1) where, the x = ( $x_{1}$ , … , $x_{m}$ ) is the vector of inputs; $y_{i}$ is the target; M and m are the number patterns and inputs, respectively. The GT extracts the k^t^h (k = 1 … p) nearest neighbor lists (p is a user defined parameters) $x_{N [i, k]}$ (i = 1 … M) of the input vector ( $x_{i}$ ). Finally, the GT calculates the following four statistics (Remesan, Shamim, Han, & Mathew, Citation2009): (2) $Δ_{M} (k) = \frac{1}{M} \sum_{i = 1}^{M} | x_{N [i, k]} - x_{i} |^{2} (k = 1 \dots p)$ (2) where, | … | indicates the Euclidean distance. Equation (3) indicates the gamma function of the output as (Moghaddamnia et al., Citation2009): (3) $γ_{M} (k) = \frac{1}{2 M} \sum_{i = 1}^{M} | y_{N [i, k]} - y_{i} |^{2} (k = 1 \dots p)$ (3) where, $y_{N [i, k]}$ is the output of the k^th nearest neighbor of x_i. To calculate the gamma (Γ), a linear function is obtained using the p points { $Δ_{M} (k), γ_{M} (k)$ } as follows (Piri et al., Citation2009): (4) $y = A δ + Γ$ (4) where, A = gradient, Γ = intercept ( $δ$ = 0), and y = output vector. Small values of the Γ (near to 0) shows that the input parameter has been selected more logically. The standard error (SE) of Γ is computed to measure the gamma value reliability. Small values of the SE shows that the gamma value is more reliable. V_ratio reveals the predictability of model, and defined as (Malik et al., Citation2018): (5) $V_{ratio} = \frac{Γ}{σ^{2} (y)}$ (5) where Γ is the gamma function and $σ^{2} (y)$ shows the output variance. The smaller V _ratio indicates the predictability of the model output.

It will be possible to create a high-quality mathematical model (e.g. MM-ANN, MARS, MGGP, SVM, and M5Tree) if the mentioned four factors i.e. A, SE, Γ, and V _ratio are small. In this research, the best predictive variables were picked using the obtained lowest values of Γ and V ratio (Malik, Kumar, Ghorbani, et al., Citation2019; Piri et al., Citation2009) for both stations.

2.3. Multiple model-artificial neural networks (MM-ANN)

In this research, the multiple model-artificial neural networks (MM-ANN), which is a hybrid model was used as to simulate the evaporation process. This hybrid model includes two levels of modeling learning. Level 1 manages the primary learning process, and the main inputs (meteorological variables) and output (pan evaporation) variables are used for training the candidate AI models, this is in one land. On the other hand, the Level 2 modeling process is initiated based on the Level one result. In this manner, a kind of binary learning process of machine learning modeling strategy is initiated. The outputs of level 1 learning process are utilized as input attributes while the original output (i.e. pan evaporation) is used as outputs in Level 2 (See Figure ). The established modeling strategy in the current research evidenced its potential in multiple hydrological engineering application such as soil cation exchange capacity, streamflow prediction, inflow detection (Ghorbani, Khatibi, Karimi, Yaseen, & Zounemat-Kermani, Citation2018; Kashani, Ghorbani, Shahabi, Naganna, & Diop, Citation2020; Khatibi, Ghorbani, Jani, & Servati, Citation2018; Khatibi, Ghorbani, & Pourhosseini, Citation2017). Yet, for the evaporation process to be investigated. This is due to the fact; the evaporation process is highly stochastic and non-linear characterized and thus such a binary ‘multiple model learning’ is needed here.

Figure 3. The structure of the MM-ANN model.

2.4. Multivariate adaptive regression spline (MARS)

The MARS model represents a linear model that can auto-simulate parametric interactions and nonlinearities (Friedman, Citation1991). This model has a backward and forward stepwise plan, where the plan that will ensure an over-trained and complex model after series of splits but with a lower accuracy is chosen at the forward stepwise plan (De Andrés, Lorca, de Cos Juez, & Sánchez-Lasheras, Citation2011). However, the previously chosen non-obligatory variables are removed at the backward stepwise plan. This function uses two functions (e.g. a knot – a value of a variable) that represents the meeting point within range of inputs to map X to Y (Adamowski, Chan, Prasher, & Sharda, Citation2012). Two functions [(x-t)₊ and (t-x)₊] are deployed in MARS. Here, ‘+’ represents only the positive as follows (Deo, Kisi, & Singh, Citation2017; Kisi, Citation2015): (6) $(x - t)_{+} = \{\begin{matrix} x - t, if x > t \\ 0, otherwise \end{matrix}$ (6) (7) $(t - x)_{+} = \{\begin{matrix} t - x, if x < t \\ 0, otherwise \end{matrix}$ (7)

The MARS formula is written as following equation: (8) $\hat{f} (x) = \sum_{i}^{k} c_{i} B_{i} (x)$ (8) where c_i is the constant coefficients, and B_i(x) indicates a weighted sum of function. To run the MARS model, the Salford Predictive Modeler 8 software was utilized.

2.5. Multi-gene genetic programming (MGGP)

The MGGP is the advanced stage of classical genetic programming (GP), which improves the fitness of classical GP by linearly combining low depth GP trees (Searson, Citation2015). Several types of research about different genotype GP have been reported in various fields by (Danandeh Mehr & Nourani, Citation2017; Guven, Citation2009; Shoaib, Shamseldin, Melville, & Khan, Citation2015; Zorn & Shamseldin, Citation2015). Figure represents the tree structure while the function is as follows (Danandeh Mehr, Kahya, & Olyaie, Citation2013): (9) $F (X_{1}, X_{2}) = ((3.5 X_{1} \div C o s (X_{2}) + exp (C o s (X_{1}))$ (9) The major inputs of GP include (i) training/validation pattern; (ii) function for fitness (iii) leaves and inner nodes for identifying the structure; and (iv) the syntax tree formation parameters (Danandeh Mehr & Nourani, Citation2017). For simple problems, the arithmetic parameters including plus (+), minus (-), divide (÷), and multiply (x) have been used whereas for more complex problems, other operators including exp, cos, sin, and tan are applied (Searson, Citation2015). For example, a pseudo-linear MGGP chromosome predicts the predictand $\hat{y}$ using several input variables as shown in Figure . The mathematical expression can be expressed as (Danandeh Mehr & Kahya, Citation2017; Danandeh Mehr, Kahya, & Yerdelen, Citation2014): (10) $\hat{y} = b_{0} + b_{1} T_{1} + b_{2} T_{2}, \dots \dots, + b_{G} T_{G}$ (10) where $\hat{y}$ is the estimated value; $T_{i}$ is the i^th gene output; $b_{0}$ is the noise; and $b_{1}$ , $b_{2}$ , … , $b_{G}$ are the gene weights. The more detailed information about MGGP is given by (Danandeh Mehr & Kahya, Citation2017; Danandeh Mehr & Nourani, Citation2017).

Figure 4. GP tree structure of expression $((3.5 X_{1} \div C o s (X_{2}) + exp (C o s (X_{1}))$ .

Figure 4. GP tree structure of expression ((3.5X1÷Cos(X2)+exp(Cos(X1)).

Figure 5. Multi-gene genetic programming (symbolic regression).

In this study MGGP framework topology was developed using 500 population size, 300 generations, mutation probability rate of 0.5, crossover probability rate of 0.4, reproduction probability rate of 0.1, 5 genes (trees), 4 depth of tree, and function set (divide, minus, plus, sin, cos, sqrt. log, exp.) in MATLAB software.

2.6. Support vector machine (SVM)

The SVM is mainly useful in regression and classification tasks. The model performs using the concepts of structural risk minimization (SRM) and traditional empirical risk minimization (ERM). Both concepts are normally applied by conventional neural networks (CNN). All the input space operations in the potentially low-dimensional feature space are performed by the kernel function in SVM. The more information about the SVM and its applications can be found in some studies such as reference evapotranspiration modeling (Kişi & Cimen, Citation2009), pan evaporation simulation (Goyal et al., Citation2014), and drought modeling (Deo et al., Citation2017). In this research, the SVM model was conducted by considering a gamma regularization parameter value of 0.0064 and polynomial kernel function with 3rd degree.

2.7. M5tree model

The M5Tree determines a relationship between predictive and output variables by obtaining the parameter subspaces based on the decision tree (Sharafati, Khosravi, et al., Citation2019). There are two stages in the structure of the M5Tree model (Salih et al., Citation2020): (i) dividing the data into the subsets and form decision trees by reducing standard deviation (SDR); and (ii) fitting a linear function to the actual dataset. The SDR values are obtained by Equation (11) (Rahimikhoob, Citation2014): (11) $S D R = s d (T) - \sum \frac{| T_{i} |}{| T |} s d (T_{i})$ (11) where, $T_{i}$ is the subset of dataset having the i^th outcome of the potential set; T is a dataset that reach the node; and sd shows the standard deviation.

The more information about the M5Tree model can be found in different studies such as reference evapotranspiration modeling (Rahimikhoob, Citation2014), pan evaporation modeling (Wang, Kisi, Zounemat-Kermani, et al., Citation2017), drought forecasting (Deo et al., Citation2017), and sediment modeling (Goyal, Citation2014). In this research, the M5Tree was run using the Weka 3.9.2 software.

2.8. Model performance evaluation indicators

The potential of the MM-ANN, MARS, MGGP, M5Tree, and SVM techniques for simulating the EP were assessed using Nash-Sutcliffe Efficiency (NSE), Willmott's Index of agreement (WI), Legate and McCabe’s Index (LM), Root Mean Squared Error (RMSE), (Sharafati, Tafarojnoruz, Shourian, & Yaseen, Citation2019; Sharafati, Yasa, & Azamathulla, Citation2018; Tikhamarine, Malik, Kumar, Souag-Gamane, & Kisi, Citation2019), and Mean Absolute Percent Error (MAPE), Standard Deviation (SD) and Correlation Coefficient (CC) (Malik, Kumar, & Singh, Citation2019; Singh, Pal, & Singh, Citation2010). The mentioned performance criteria can be expressed as: (12) $\begin{aligned} NSE & = [1 - \frac{\sum_{i = 1}^{N} {(E P_{m}^{o b s, i} - E P_{m}^{s i m, i})}^{2}}{\sum_{i = 1}^{N} {(E P_{m}^{o b s, i} - \bar{E P_{m}^{o b s, i}})}^{2}}] \\ (- \infty < NSE \leq 1) \end{aligned}$ (12) (13) $WI = [1 - \frac{\sum_{i = 1}^{N} {(E P_{m}^{o b s, i} - E P_{m}^{s i m, i})}^{2}}{\begin{matrix} \sum_{i = 1}^{N} (| E P_{m}^{s i m, i} - \bar{E P_{m}^{o b s, i}} | \\ + | E P_{m}^{o b s, i} - \bar{E P_{m}^{o b s, i}} |)^{2} \end{matrix}}] (0 < WI \leq 1)$ (13) (14) $LM = [1 - \frac{\sum_{i = 1}^{N} | E P_{m}^{o b s, i} - E P_{m}^{s i m, i} |}{\sum_{i = 1}^{N} | E P_{m}^{o b s, i} - \bar{E P_{m}^{o b s, i}} |}] (0 < LM \leq 1)$ (14) (15) $\begin{aligned} RMSE & = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(E P {_{m}^{o b s, i}}_{} - E P_{m}^{s i m, i})}^{2}} \\ (0 \leq RMSE < \infty) \end{aligned}$ (15) (16) $\begin{aligned} MAPE & = \frac{\sum_{i = 1}^{N} |\frac{E P_{m}^{o b s, i} - E P_{m}^{s i m, i}}{E P_{m}^{o b s, i}}|}{N} \times 100 \\ (0 \leq MAPE < \infty) \end{aligned}$ (16) (17) $CC = \frac{\begin{matrix} \sum_{i = 1}^{N} (E P_{m}^{o b s, i} - \bar{E P_{m}^{o b s, i}}) \\ (E P_{m}^{s i m, i} - \bar{E P_{m}^{s i m, i}}) \end{matrix}}{\sqrt{\begin{matrix} \sum_{i = 1}^{N} {(E P_{m}^{o b s, i} - \bar{E P_{m}^{o b s, i}})}^{2} \\ \sum_{i = 1}^{N} {(E P_{m}^{s i m, i} - \bar{E P_{m}^{s i m, i}})}^{2} \end{matrix}}} (- 1 \leq CC \leq 1)$ (17) (18) $SD = \sqrt{\frac{\sum_{i = 1}^{N} {(E P_{m}^{s i m, i} - \bar{E P_{m}^{s i m, i}})}^{2}}{N - 1}} (0 \leq CC \leq \infty)$ (18) where, N is the number of samples, $E P_{m}^{s i m, i}$ and $E P_{m}^{o b s, i}$ are the simulated and observed EP_m in i^th dataset, and $\bar{E P_{m}^{s i m, i}}$ and $\bar{E P_{m}^{o b s, i}}$ are mean of simulated and observed EP_m values, respectively. The model with larger value of WI, NSE, LM, and CC, and a smaller value of RMSE, MAPE, and SD during testing period was selected better model for monthly pan evaporation simulation understudy location (Malik, Kumar, & Kisi, Citation2017).

3. Results and discussion

3.1. Selecting the GT based effective input variables

Among several learning process components, input variables selection is an essential one for modeling AI models. This is owing to the fact AI models behave differently in different cases based on the attributes of the simulated matrix. Hence, in the current study different input combinations were constructed incorporating various climatological variables and their positive influence on the pan evaporation at the inspected Ranichauri and Pantnagar meteorological stations (See Table ). The input combinations were configured using the gamma test where the highly influential variables are determined. The statistical results of GT were reported in Table for both stations. According to the results tabulated in Table and with a fixed mask example (111111), it appears that the minimum value of Γ = 0.070 and V_ratio = 0.062 at Ranichauri station. Whereas, the minimum value of Γ = 0.515 and V_ratio = 0.068 at Pantnagar station. The mask example demonstrated the incorporation of the six input variables to predict the monthly scale pan evaporation. Hence, the following input variables were utilized for the prediction process (i.e. T_max, T_min, RH₁, RH₂, H_ss, and S_w).

Table 2. Final structure of MM-ANN, MARS, MGGP, SVM and M5Tree models at Pantnagar and Ranichauri stations.

Download CSV Display Table

Table 3. GT statistics of monthly EP_m datasets for Pantnagar and Ranichauri stations.

Download CSV Display Table

3.2. Simulation of EP_m at Pantnagar station

The monthly pan evaporation was estimated using MM-ANN-1, MARS-1, MGGP-1, SVM-1 and M5Tree-1 models based on NSE, WI, LM, RMSE and MAPE for both training and testing stages at Pantnagar station. The values of NSE, WI, LM, RMSE and MAPE criteria during the training and testing periods for MM-ANN-1, MARS-1, MGGP-1, SVM-1 and M5Tree-1 models are presented in Table . As evaluated for Pantnagar station from Table , the MM-ANN-1, MARS-1, MGGP-1, SVM-1 and M5Tree-1 models provided NSE = 0.939, 0.931, 0.922, 0.944 and 0.924, WI = 0.984, 0.982, 0.979, 0.985 and 0.980, LM = 0.771, 0.757, 0.741, 0.784 and 0.738, RMSE = 0.693, 0.738, 0.787, 0.666 and 0.775 mm/ month, and MAPE = 12.399%, 13.486%, 14.340%, 11.848%, and 14.777% during training period. In addition, the MM-ANN-1, MARS-1, MGGP-1, SVM-1 and M5Tree-1 models provided NSE = 0.954, 0.938, 0.932, 0.924 and 0.923, WI = 0.988, 0.985, 0.984, 0.982 and 0.981, LM = 0.801, 0.754, 0.744, 0.718 and 0.707, RMSE = 0.536, 0.621, 0.651, 0.687 and 0.691 mm/ month, and MAPE = 9.988%, 12.637%, 13.211%, 14.074%, and 16.389% during testing period, respectively. Table exposed the MM-ANN-1 model performed the best simulation during the testing periods. Therefore, MM-ANN-1 model followed the best statistical criteria (i.e. minimum RMSE and MAPE values, and maximum NSE, WI and LM values) in testing stage and selected best model among other models. MARS-1 model followed the MM-ANN-1 as the second rank closely.

Table 4. Values of the NSE, WI, LM, RMSE and MAPE criteria for the developed models during training and testing periods at Pantnagar and Ranichauri stations.

Download CSV Display Table

The temporal variation between simulated and observed monthly EP_m data, and their scatter plots (right side) for the MM-ANN-1, MARS-1, MGGP-1, SVM-1, and M5Tree-1 models was plotted in Figure (a through e) for the testing period. In scatter plots, the regression line provided the coefficient of determination (R²) as 0.955 for the MM-ANN-1 model, 0.959 for the MARS-1 model, 0.956 for MGGP-1 model, 0.946 for SVM-1 model, and 0.962 for M5Tree-1 model during the testing stage, respectively. The fitted regression line (RL) and the perfect line of fit (1:1) were close to each other for all techniques. The RL was located above the best fit (1:1) for MARS-1, MGGP-1, SVM-1, and M5Tree-1 models. This means that at Pantnagar station, the five models over-predict the monthly pan evaporation values. Also, the regression line was located under the 1:1 line for MM-ANN-1 models, which means the model under-predict the pan evaporation values at Pantnagar station.

Figure 6. Line (left) and scatter (right) plots between observed and simulated monthly pan evaporation values by (a) MM-ANN-1, (b) MARS-1, (c) MGGP-1, (d) SVM-1, and (e) M5Tree-1 models during testing period at Pantnagar station.

According to Figure , all model symbols were very close to each other. In detail, SVM-1 was located as the furthest from the observed point. This introduces SVM-1 as the worst model. MM-ANN-1 was the closest model to the observed point based on the standard deviation, correlation, and RMSE. The MM-ANN-1 model presented identical peaks and resembled the observed values (Figure (a)). This proves the superiority of the MM-ANN-1 model over the others. The density plots of the simulated EP_m by AI models against the observed (OBS) values are presented in Figure . However, the density plot suggested a slightly different distribution of the simulated values.

Figure 7. Taylor diagrams comparing the models’ fit statistics at Pantnagar station.

Figure 8. Density plots of MM-ANN, MARS-1, MGGP-1, SVM-1 and M5Tree-1 models at Pantnagar station.

3.3. Simulation of EP_m at Ranichauri station

The MM-ANN-1, MARS-1, MGGP-1, SVM-1 and M5Tree-1 models were trained using (January, 2000 to December, 2009) and tested using (2010 to December, 2012) datasets. The adequacy of these models was assessed using the NSE, WI, LM, RMSE and MAPE values in the both training and testing phases. Table represents the NSE, WI, LM, RMSE and MAPE values for MM-ANN-1, MARS-1, MGGP-1, SVM-1 and M5Tree-1 models. As assessing Ranichauri station, NSE = 0.914, WI = 0.978, LM = 0.730, RMSE = 0.296 mm/ month and MAPE = 9.647% for MM-ANN-1 model; NSE = 0.927, WI = 0.981, LM = 0.766, RMSE = 0.272 mm/ month and MAPE = 8.028% for MARS-1 model; NSE = 0.887, WI = 0.969, LM = 0.677, RMSE = 0.339 mm/ month and MAPE = 12.631% for MGGP-1 model; NSE = 0.913, WI = 0.977, LM = 0.738, RMSE = 0.297 mm/ month and MAPE = 9.146% for SVM-1 model, and NSE = 0.905, WI = 0.974, LM = 0.724, RMSE = 0.310 mm/ month and MAPE = 9.595% for M5Tree-1 model were obtained during the training period. In addition, NSE = 0.909, WI = 0.975, LM = 0.736, RMSE = 0.368 mm/ month and MAPE = 10.633% for MM-ANN-1 model; NSE = 0.881, WI = 0.963, LM = 0.678, RMSE =0.421 mm/ month and MAPE=12.757% for MARS-1 model; NSE = 0.911, WI = 0.975, LM = 0.724, RMSE =0.364 mm/ month and MAPE=12.297% for MGGP-1 model; NSE = 0.811, WI = 0.955, LM = 0.598, RMSE =0.529 mm/ month and MAPE=18.096% for SVM-1 model, and NSE = 0.693, WI = 0.929, LM = 0.483, RMSE = 0.675 mm/ month and MAPE = 23.727% for M5Tree-1 model were obtained during the testing stage. Table revealed that the MARS-1 model produced the best performances and results in the training stage while failed in the testing stage. The MGGP-1 model tracked the best statistical criteria (i.e. minimum RMSE and MAPE values, and maximum NSE, WI and LM values) over the testing stage and indicated the best performance compared to other models. MM-ANN-1 model follows the MGGP-1 as the second-best model.

The scatter plots and temporal variations graphs of the observed against the simulated EP_m of the MM-ANN-1, MARS-1, MGGP-1, SVM-1, and M5Tree-1 models over the testing phase were presented in Figure (a through e). The RL in scatter plots marked R² as 0.910 for the MM-ANN-1, 0.910 for the MARS-1, 0.916 for the MGGP-1 model, 0.850 for SVM-1 model, and 0.852 for M5Tree-1 model over the testing period. The different conditions have been yielded between RL and 1:1 line for all the models. For the SVM-1 model (Figure (d)), the RL was exactly over the 1:1 line, it over-predicted the EP_m values. For the M5Tree-1 model (Figure (e)), the regression line was exactly below the best fit line (1:1), it under-predicted the EP_m values. In the case of MM-ANN-1, MARS-1 and MGGP-1 models (Figure (a–c)), the regression line divided the best fit line (1:1). It can be explained that the models over-predicted the smaller values (<2.5 mm/month for MM-ANN-1, <3.0 mm/month for MGGP-1, <2.5 mm/month for MARS-1) and under-predict the higher EP_m values at Ranichauri station.

Figure 9. Line (left) and scatter (right) plots between observed and simulated monthly pan evaporation values by (a) MM-ANN-1, (b) MARS-1, (c) MGGP-1, (d) SVM-1, and (e) M5Tree-1 models during testing period at Ranichauri station.

According to Figure , the highest correlation coefficient (CC) was observed in the MARS-1. But the standard deviation showed much less than the observed values. The SVM-1 and M5Tree-1 model is far from being competing on the best performance, MGGP-1 can be considered as a high performed model with lowest RMSE and high correlation although it has a standard deviation lower than the observed values. Finally, MGGP-1 was the closest model to the observed values based on the standard deviation, correlation, and RMSE. This was giving superiority to MGGP-1 over the others. Considering the density plot (Figure ), the MGGP-1 model, which identified as the closest model in the Taylor diagram yielded a similar box from the observed values.

Figure 10. Taylor diagrams comparing the models’ fit statistics at Ranichauri station.

Figure 11. Density plots of MM-ANN, MARS-1, MGGP-1, SVM-1 and M5Tree-1 models at Ranichauri station.

The findings of the current research demonstrated that the MM-ANN at Pantnagar station and MGGP model at Ranichauri station achieved better performances in simulating EP_m values. The MM-ANN-1 and MGGP-1 models were selected as the best ones based on minimum RMSE and MAPE values, and maximum NSE, WI, and LM values at the testing period. Various researchers reported that the soft computing models provide better performances during the testing period, and the best performing procedure was selected (Abdullah, Malek, Abdullah, Kisi, & Yap, Citation2015; Guven & Kisi, Citation2013). Malik, Kumar, and Kisi (Citation2017) estimated EP_m using SOMNN, CANFIS, MLPNN, RBNN, Griffith’s (G), and Stephens-Stewart (SS) models at Ranichauri and Pantnagar stations. They published that the specific heuristic approaches (i.e. MLPNN and CANFIS models) outperformed the other models. It can be clear that the previous investigations confirmed the multiple model strategies for modeling nonlinear time series phenomena. The findings of this research were also in close agreement with the study done by (Malik, Kumar, & Kisi, Citation2017). Future research can be devoted to the incorporation of other casual related climate variables such as vapor pressure deficit, atmospheric pressure or others.

4. Conclusion

The evaporation process of the hydrological cycle is characterized by highly complex and stochastic phenomena in nature. In the current research, the ability of multiple model strategies (MM-ANN, MARS, MGGP, SVM, and M5Tree) was evaluated in simulating monthly pan evaporation at two meteorological stations, Ranichauri and Pantnagar in India. The best input variables combination was selected using GT. The results of applied models were examined using some performance criteria and visual presentations. In general, we derived five conclusions from the current research as follows:

Six appropriate variables (RH₁, RH₂, S_w, H_ss, and T_min, T_max,) were selected by GT as the optimistic input combination for simulating the pan evaporation in monthly scale at both studied stations. The nature of the EP_m process in this region is highly stochastic and more climatic information is needed for building the prediction matrix of the proposed AI models.
At the Pantnagar station, the MM-ANN-1 model indicated the better performance compared to the MARS-1, MGGP-1, SVM-1 and M5Tree-1 models. However, the feasibility of the MGGP-1 model demonstrated batter results at Ranichauri station. This is normal for the case where AI models behave differently in different cases based on the actual internal mechanism between the predictand and predictors.
MARS model performed as the second accurate model following the MM-ANN at Pantnagar station, while the MM-ANN was following the MGGP model capacity in mimicking the EP_m trend at Ranichauri station.
Overall, the attained results of the proposed MM-ANN and MGGP models demonstrated an optimistic intelligence approaches for this particular region where they would contribute to the water resources engineering practice, and management.

Disclosure statement

No potential conflict of interest was reported by the authors.

Related Research Data

Pan evaporation modeling by three different neuro-fuzzy intelligent systems using climatic inputs

Source: Springer Science and Business Media LLC

Streamflow prediction using linear genetic programming in comparison with a neuro-wavelet technique

Source: Elsevier BV

Natural evaporation from open water, bare soil and grass

Source: Royal Society Publishing

Evaporation modelling by heuristic regression approaches using only temperature data

Source: Informa UK Limited

A nonlinear mathematical modeling of daily pan evaporation based on conjugate gradient method

Source: Elsevier BV

River suspended sediment load prediction based on river discharge information: application of newly developed data mining models

Source: Informa UK Limited

Multiple AI model integration strategy—Application to saturated hydraulic conductivity prediction from easily available soil properties

Source: Elsevier BV

A Pareto-optimal moving average-multigene genetic programming model for rainfall-runoff modelling

Source: Elsevier BV

EVAPORATION MODELLING IN YANGTZE RIVER BASIN

Source: Wiley

Pan evaporation prediction using a hybrid multilayer perceptron-firefly algorithm (MLP-FFA) model: case study in North Iran

Source: Springer Science and Business Media LLC

The potential of novel data mining models for global solar radiation prediction

Source: Springer Science and Business Media LLC

Evaporation Research: Review and Interpretation

Source: American Society of Civil Engineers (ASCE)

Multivariate Adaptive Regression Splines

Source: Institute of Mathematical Statistics

A Pareto-optimal moving average multigene genetic programming model for daily streamflow prediction

Source: Elsevier BV

Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and M5Tree model

Source: Elsevier BV

Modeling monthly pan evaporation using wavelet support vector regression and wavelet artificial neural networks in arid and humid climates

Source: Taylor & Francis Group

Comparison of soft-computing and statistical techniques in simulating daily river flow: a case study in India

Source: Diva Enterprises Private Limited

Estimation of monthly evaporative loss using relevance vector machine, extreme learning machine and multivariate adaptive regression spline models

Source: Springer Science and Business Media LLC

Application of Heuristic Approaches for Prediction of Hydrological Drought Using Multi-scalar Streamflow Drought Index

Source: Springer Science and Business Media LLC

Peak flood estimation using gene expression programming

Source: Elsevier BV

Pan evaporation modeling using six different heuristic computing methods in different climates of China

Source: Elsevier BV

Pan evaporation modeling using four different heuristic approaches

Source: Elsevier BV

GPTIPS 2: an open-source software platform for symbolic data mining

Source: Springer International Publishing

Application of ensemble empirical mode decomposition based on machine learning methodologies in forecasting monthly pan evaporation

Source: IWA Publishing

Daily Pan Evaporation Estimation Using Heuristic Methods with Gamma Test

Source: American Society of Civil Engineers (ASCE)

The viability of co-active fuzzy inference system model for monthly reference evapotranspiration estimation: case study of Uttarakhand State

Source: IWA Publishing

Another evaporation formula

Source: Elsevier BV

Application of linear sliding discharges for flow control: study of the energy coupling mechanisms

Source: HAL CCSD

Pan evaporation modeling using least square support vector machine, multivariate adaptive regression splines and M5 model tree

Source: Elsevier BV

Assessment of Stochastic Approaches in Prediction of Wave-Induced Pipeline Scour Depth

Source: American Society of Civil Engineers (ASCE)

Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: Regional case study in Burkina Faso

Source: Elsevier BV

Estimation of monthly reference evapotranspiration using novel hybrid machine learning approaches

Source: Informa UK Limited

Daily pan evaporation modeling using chi-squared automatic interaction detector, neural networks, classification and regression tree

Source: Elsevier BV

Rainfall-runoff modeling in hilly watershed using heuristic approaches with gamma test

Source: Springer Science and Business Media LLC

Predicting Daily Pan Evaporation (E pan ) from Dam Reservoirs in the Mediterranean Regions of Algeria: OPELM vs OSELM

Source: Springer Science and Business Media LLC

Linear genetic programming for time-series modelling of daily flow rate

Source: Springer Science and Business Media LLC

Runoff forecasting using hybrid Wavelet Gene Expression Programming (WGEP) approach

Source: Elsevier BV

Evaporation Estimation Using Artificial Neural Networks and Adaptive Neuro-Fuzzy Inference System Techniques

Source: Elsevier BV

Artificial intelligence based models for stream-flow forecasting: 2000–2015

Source: Elsevier BV

Learning from Multiple Models Using Artificial Intelligence to Improve Model Prediction Accuracies: Application to River Flows

Source: Springer Science and Business Media LLC

Monthly pan-evaporation estimation in Indian central Himalayas using different heuristic approaches and climate based models

Source: Elsevier BV

On the Assessment of Surface Heat Flux and Evaporation Using Large-Scale Parameters

Source: American Meteorological Society

Bankruptcy forecasting: A hybrid approach using Fuzzy c-means clustering and Multivariate Adaptive Regression Splines (MARS)

Source: Elsevier BV

Comparison between M5 Model Tree and Neural Networks for Estimating Reference Evapotranspiration in an Arid Environment

Source: Springer Science and Business Media LLC

Linear genetic programming application for successive-station monthly streamflow prediction

Source: Elsevier BV

The Construction of Smooth Models using Irregular Embeddings Determined by a Gamma Test Analysis

Source: Springer Science and Business Media LLC

Daily Pan Evaporation Modeling in a Hot and Dry Climate

Source: American Society of Civil Engineers (ASCE)

Simulation of the depth scouring downstream sluice gate: The validation of newly developed data-intelligent models

Source: Elsevier BV

Monthly pan evaporation modeling using linear genetic programming

Source: Elsevier BV

Modeling of daily pan evaporation in sub tropical climates using ANN, LS-SVR, Fuzzy Logic, and ANFIS

Source: Elsevier BV

Utility of coactive neuro-fuzzy inference system for pan evaporation modeling in comparison with multilayer perceptron

Source: Springer Science and Business Media LLC

Extreme Learning Machines: A new approach for prediction of reference evapotranspiration

Source: Elsevier BV

Evapotranspiration modelling using support vector machines

Source: Informa UK Limited

Model data selection using gamma test for daily solar radiation estimation

Source: Wiley

Forming Multiple Models Run by SVM to Learn From ANN and Its Hybrid With Firefly Algorithm

Source: IGI Global

Runoff prediction using an integrated hybrid modelling scheme

Source: Elsevier BV

Comparison of multivariate adaptive regression splines with coupled wavelet transform artificial neural networks for runoff forecasting in Himalayan micro-watersheds with limited data

Source: IWA Publishing

Stream flow predictions using nature-inspired Firefly Algorithms and a Multiple Model strategy Directions of innovation towards next generation practices

Source: Elsevier BV

Linking provided by

References

Abdullah, S. S., Malek, M. a., Abdullah, N. S., Kisi, O., & Yap, K. S. (2015). Extreme learning machines: A new approach for prediction of reference evapotranspiration. Journal of Hydrology, 527, 184–195. doi: 10.1016/j.jhydrol.2015.04.073
Web of Science ®Google Scholar
Adamowski, J., Chan, H. F., Prasher, S. O., & Sharda, V. N. (2012). Comparison of multivariate adaptive regression splines with coupled wavelet transform artificial neural networks for runoff forecasting in Himalayan micro-watersheds with limited data. Journal of Hydroinformatics, 14(3), 731–744. doi: 10.2166/hydro.2011.044
Web of Science ®Google Scholar
Adnan, R. M., Malik, A., Kumar, A., Parmar, K. S., & Kisi, O. (2019). Pan evaporation modeling by three different neuro-fuzzy intelligent systems using climatic inputs. Arabian Journal of Geosciences, 12(20), 606. doi: 10.1007/s12517-019-4781-6
Web of Science ®Google Scholar
Burt, C. M., Mutziger, A. J., Allen, R. G., & Howell, T. A. (2005). Evaporation research: Review and interpretation. Journal of Irrigation and Drainage Engineering, 131(1), 37–58. doi: 10.1061/(ASCE)0733-9437(2005)131:1(37)
Web of Science ®Google Scholar
Danandeh Mehr, A., & Kahya, E. (2017). A Pareto-optimal moving average multigene genetic programming model for daily streamflow prediction. Journal of Hydrology, 549, 603–615. doi: 10.1016/j.jhydrol.2017.04.045
Web of Science ®Google Scholar
Danandeh Mehr, A., Kahya, E., & Olyaie, E. (2013). Streamflow prediction using linear genetic programming in comparison with a neuro-wavelet technique. Journal of Hydrology, 505, 240–249. doi: 10.1016/j.jhydrol.2013.10.003
Web of Science ®Google Scholar
Danandeh Mehr, A., Kahya, E., & Yerdelen, C. (2014). Linear genetic programming application for successive-station monthly streamflow prediction. Computers and Geosciences, 70, 63–72. doi: 10.1016/j.cageo.2014.04.015
Web of Science ®Google Scholar
Danandeh Mehr, A., & Nourani, V. (2017). A Pareto-optimal moving average-multigene genetic programming model for rainfall-runoff modelling. Environmental Modelling and Software, 92, 239–251. doi: 10.1016/j.envsoft.2017.03.004
Web of Science ®Google Scholar
De Andrés, J., Lorca, P., de Cos Juez, F. J., & Sánchez-Lasheras, F. (2011). Bankruptcy forecasting: A hybrid approach using Fuzzy c-means clustering and multivariate adaptive regression Splines (MARS). Expert Systems with Applications, 38(3), 1866–1875. doi: 10.1016/j.eswa.2010.07.117
Web of Science ®Google Scholar
Deo, R. C., Kisi, O., & Singh, V. P. (2017). Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and M5Tree model. Atmospheric Research, 184, 149–175. doi: 10.1016/j.atmosres.2016.10.004
Web of Science ®Google Scholar
Deo, R. C., Samui, P., & Kim, D. (2016). Estimation of monthly evaporative loss using relevance vector machine, extreme learning machine and multivariate adaptive regression spline models. Stochastic Environmental Research and Risk Assessment, 30(6), 1769–1784. doi: 10.1007/s00477-015-1153-y
Web of Science ®Google Scholar
Doorenbos, J., & Pruitt, W. O. (1977). Guidelines for predicting crop water requirements (Irrigation and Drainage Paper No. 24, FAO). doi: 10.2514/6.2014-2117
Google Scholar
Friedman, J. H. (1991). Multivariate adaptive regression Splines. The Annals of Statistics, 19, 1–67. doi: 10.1214/aos/1176347963
Web of Science ®Google Scholar
Ghorbani, M. A., Deo, R. C., Yaseen, Z. M., Kashani, M. H., & Mohammadi, B. (2017). Pan evaporation prediction using a hybrid multilayer perceptron-firefly algorithm (MLP-FFA) model: Case study in North Iran. Theoretical and Applied Climatology, 133, 1119–1131. doi: 10.1007/s00704-017-2244-0
Web of Science ®Google Scholar
Ghorbani, M. A., Khatibi, R., Karimi, V., Yaseen, Z. M., & Zounemat-Kermani, M. (2018). Learning from multiple models using artificial intelligence to Improve model prediction accuracies: Application to river flows. Water Resources Management, 32, 4201–4215. doi: 10.1007/s11269-018-2038-x
Web of Science ®Google Scholar
Goyal, M. K. (2014). Modeling of sediment yield prediction using M5 model tree algorithm and wavelet regression. Water Resources Management 28, 1991–2003. doi: 10.1007/s11269-014-0590-6
Web of Science ®Google Scholar
Goyal, M. K., Bharti, B., Quilty, J., Adamowski, J., & Pandey, A. (2014). Modeling of daily pan evaporation in sub tropical climates using ANN, LS-SVR, Fuzzy Logic, and ANFIS. Expert Systems with Applications, 41(11), 5267–5276. doi: 10.1016/j.eswa.2014.02.047
Web of Science ®Google Scholar
Griffiths, J. F. (1966). Another evaporation formula. Agricultural Meteorology, 3(3–4), 257–261. doi: 10.1016/0002-1571(66)90033-1
Google Scholar
Guven, A. (2009). Linear genetic programming for time-series modelling of daily flow rate. Journal of Earth System Science, 118(2), 137–146. doi: 10.1007/s12040-009-0022-9
Web of Science ®Google Scholar
Guven, A., & Kisi, O. (2013). Monthly pan evaporation modeling using linear genetic programming. Journal of Hydrology, 503, 178–185. doi: 10.1016/j.jhydrol.2013.08.043
Web of Science ®Google Scholar
Kashani, M. H., Ghorbani, M. A., Shahabi, M., Naganna, S. R., & Diop, L. (2020). Multiple AI model integration strategy—application to saturated hydraulic conductivity prediction from easily available soil properties. Soil and Tillage Research, 196, 104449. doi: 10.1016/j.still.2019.104449
Web of Science ®Google Scholar
Keshtegar, B., Piri, J., & Kisi, O. (2016). A nonlinear mathematical modeling of daily pan evaporation based on conjugate gradient method. Computers and Electronics in Agriculture, 127, 120–130. doi: 10.1016/j.compag.2016.05.018
Web of Science ®Google Scholar
Khatibi, R., Ghorbani, M. A., Jani, R., & Servati, M. (2018). Soil cation exchange capacity predicted by learning from multiple modelling: Forming multiple models run by SVM to Learn from ANN and its hybrid with firefly algorithm. In D. Kim, R. Deo, & P. Samui (Eds.), Handbook of research on predictive modeling and optimization methods in science and engineering (pp. 465–480). Hershey, PA: IGI Global.
Google Scholar
Khatibi, R., Ghorbani, M. A., & Pourhosseini, F. A. (2017). Stream flow predictions using nature-inspired firefly algorithms and a multiple model strategy–directions of innovation towards next generation practices. Advanced Engineering Informatics, 34, 80–89. doi: 10.1016/j.aei.2017.10.002
Web of Science ®Google Scholar
Kisi, O. (2015). Pan evaporation modeling using least square support vector machine, multivariate adaptive regression splines and M5 model tree. Journal of Hydrology, 528, 312–320. doi: 10.1016/j.jhydrol.2015.06.052
Web of Science ®Google Scholar
Kişi, O., & Cimen, M. (2009). Evapotranspiration modelling using support vector machines/Modélisation de l’évapotranspiration à l’aide de ‘support vector machines’. Hydrological Sciences Journal, 54(5), 918–928. doi: 10.1623/hysj.54.5.918
Web of Science ®Google Scholar
Kisi, O., Genc, O., Dinc, S., & Zounemat-Kermani, M. (2016). Daily pan evaporation modeling using chi-squared automatic interaction detector, neural networks, classification and regression tree. Computers and Electronics in Agriculture, 122, 112–117. doi: 10.1016/j.compag.2016.01.026
Web of Science ®Google Scholar
Kisi, O., & Heddam, S. (2019). Evaporation modelling by heuristic regression approaches using only temperature data. Hydrological Sciences Journal, 64(6), 653–672. doi: 10.1080/02626667.2019.1599487
Web of Science ®Google Scholar
Malik, A., & Kumar, A. (2015). Pan evaporation simulation based on daily meteorological data using soft computing techniques and multiple linear regression. Water Resources Management, 29(6), 1859–1872. doi: 10.1007/s11269-015-0915-0
Web of Science ®Google Scholar
Malik, A., & Kumar, A. (2018). Comparison of soft-computing and statistical techniques in simulating daily river flow: A case study in India. Journal of Soil and Water Conservation, 17(2), 192. doi: 10.5958/2455-7145.2018.00029.2
Google Scholar
Malik, A., Kumar, A., Ghorbani, M. A., Kashani, M. H., Kisi, O., & Kim, S. (2019). The viability of co-active fuzzy inference system model for monthly reference evapotranspiration estimation: Case study of Uttarakhand State. Hydrology Research, 50, 1623–1644. doi: 10.2166/nh.2019.059
Web of Science ®Google Scholar
Malik, A., Kumar, A., & Kisi, O. (2017). Monthly pan-evaporation estimation in Indian central Himalayas using different heuristic approaches and climate based models. Computers and Electronics in Agriculture, 143, 302–313. doi: 10.1016/j.compag.2017.11.008
Web of Science ®Google Scholar
Malik, A., Kumar, A., & Kisi, O. (2018). Daily Pan evaporation estimation using heuristic methods with gamma test. Journal of Irrigation and Drainage Engineering, 144(9), 04018023. doi: 10.1061/(asce)ir.1943-4774.0001336
Web of Science ®Google Scholar
Malik, A., Kumar, A., & Singh, R. P. (2019). Application of heuristic approaches for prediction of hydrological drought using multi-scalar streamflow drought index. Water Resources Management, 33, 3985–4006. doi: 10.1007/s11269-019-02350-4
Web of Science ®Google Scholar
Moghaddamnia, a., Ghafari Gousheh, M., Piri, J., Amin, S., & Han, D. (2009). Evaporation estimation using artificial neural networks and adaptive neuro-fuzzy inference system techniques. Advances in Water Resources, 32(1), 88–97. doi: 10.1016/j.advwatres.2008.10.005
Web of Science ®Google Scholar
Penman, H. L. (1948). Natural evaporation from open water, bare soil and grass. Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences, 193(1032), 120–145. doi: 10.1098/rspa.1948.0037
PubMed Web of Science ®Google Scholar
Piri, J., Amin, S., Moghaddamnia, A., Keshavarz, A., Han, D., & Remesan, R. (2009). Daily pan evaporation modeling in a hot and dry climate. Journal of Hydrologic Engineering, 14(8), 803–811. doi: 10.1061/(ASCE)HE.1943-5584.0000056
Web of Science ®Google Scholar
Priestley, C. H. B., & Taylor, R. J. (1972). On the assessment of the surface heat flux and evaporation using large-scale parameters. Monthly Weather Review, 100, 81–92. doi: 10.1175/1520-0493(1972)100<0081:OTAOSH>2.3.CO;2
Web of Science ®Google Scholar
Qasem, S. N., Samadianfard, S., Kheshtgar, S., Jarhan, S., Kisi, O., Shamshirband, S., & Chau, K.-W. (2019). Modeling monthly pan evaporation using wavelet support vector regression and wavelet artificial neural networks in arid and humid climates. Engineering Applications of Computational Fluid Mechanics, 13(1), 177–187. doi: 10.1080/19942060.2018.1564702
Web of Science ®Google Scholar
Qutbudin, I., Shiru, M. S., Sharafati, A., Ahmed, K., Al-Ansari, N., Yaseen, Z. M., … Wang, X. (2019). Seasonal drought pattern changes Due to climate variability: Case study in Afghanistan. Water, 11(5), 1096. doi: 10.3390/w11051096
Web of Science ®Google Scholar
Rahimikhoob, A. (2014). Comparison between M5 model tree and neural networks for estimating reference evapotranspiration in an arid environment. Water Resources Management, 28(3), 657–669. doi: 10.1007/s11269-013-0506-x
Web of Science ®Google Scholar
Remesan, R., Shamim, M. A., & Han, D. (2008). Model data selection using gamma test for daily solar radiation estimation. Hydrological Processes, 22(21), 4301–4309. doi: 10.1002/hyp.7044
Web of Science ®Google Scholar
Remesan, R., Shamim, M. A., Han, D., & Mathew, J. (2009). Runoff prediction using an integrated hybrid modelling scheme. Journal of Hydrology, 372(1–4), 48–60. doi: 10.1016/j.jhydrol.2009.03.034
Web of Science ®Google Scholar
Rezaie-Balf, M., Kisi, O., & Chua, L. H. C. (2019). Application of ensemble empirical mode decomposition based on machine learning methodologies in forecasting monthly pan evaporation. Hydrology Research, 50(2), 498–516. doi: 10.2166/nh.2018.050
Web of Science ®Google Scholar
Salih, S. Q., sharafati, A., Khosravi, K., Faris, H., Kisi, O., Tao, H., … Yaseen, Z. M. (2020). River suspended sediment load prediction based on river discharge information: Application of newly developed data mining models. Hydrological Sciences Journal. doi: 10.1080/02626667.2019.1703186
Web of Science ®Google Scholar
Searson, D. P. (2015). GPTIPS 2: An open-source software platform for symbolic data mining. In Handbook of genetic programming applications (pp. 551–573). doi: 10.1007/978-3-319-20883-1_22
Google Scholar
Sebbar, A., Heddam, S., & Djemili, L. (2019). Predicting daily Pan evaporation (Epan) from Dam reservoirs in the mediterranean regions of Algeria: OPELM vs OSELM. Environmental Processes, 6, 309–319. doi: 10.1007/s40710-019-00353-2
Google Scholar
Sharafati, A., Khosravi, K., Khosravinia, P., Ahmed, K., Salman, S. A., Mundher, Z., & Shamsuddin, Y. (2019). The potential of novel data mining models for global solar radiation prediction. International Journal of Environmental Science and Technology, 0123456789. doi: 10.1007/s13762-019-02344-0
Web of Science ®Google Scholar
Sharafati, A., Tafarojnoruz, A., Shourian, M., & Yaseen, Z. M. (2019). Simulation of the depth scouring downstream sluice gate: The validation of newly developed data-intelligent models. Journal of Hydro-Environment Research. doi: 10.1016/j.jher.2019.11.002
Web of Science ®Google Scholar
Sharafati, A., Yasa, R., & Azamathulla, H. M. (2018). Assessment of stochastic approaches in prediction of wave-induced pipeline scour depth. Journal of Pipeline Systems Engineering and Practice, 9(4), 04018024. doi: 10.1061/(ASCE)PS.1949-1204.0000347
Web of Science ®Google Scholar
Shoaib, M., Shamseldin, A. Y., Melville, B. W., & Khan, M. M. (2015). Runoff forecasting using hybrid wavelet gene expression programming (WGEP) approach. Journal of Hydrology, 527, 326–344. doi: 10.1016/j.jhydrol.2015.04.072
Web of Science ®Google Scholar
Singh, A., Malik, A., Kumar, A., & Kisi, O. (2018). Rainfall-runoff modeling in hilly watershed using heuristic approaches with gamma test. Arabian Journal of Geosciences, 11(11). doi: 10.1007/s12517-018-3614-3
Web of Science ®Google Scholar
Singh, K. K., Pal, M., & Singh, V. P. (2010). Estimation of mean annual flood in Indian catchments using backpropagation neural network and M5 model tree. Water Resources Management, 24(10), 2007–2019. doi: 10.1007/s11269-009-9535-x
Web of Science ®Google Scholar
Tabari, H., Talaee, P. H., & Abghari, H. (2012). Utility of coactive neuro-fuzzy inference system for pan evaporation modeling in comparison with multilayer perceptron. Meteorology and Atmospheric Physics, 116(3–4), 147–154. doi: 10.1007/s00703-012-0184-x
Web of Science ®Google Scholar
Tao, H., Diop, L., Bodian, A., Djaman, K., Ndiaye, P. M., & Yaseen, Z. M. (2018). Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: Regional case study in Burkina Faso. Agricultural Water Management, 208, 140–151. doi: 10.1016/j.agwat.2018.06.018
Web of Science ®Google Scholar
Tikhamarine, Y., Malik, A., Kumar, A., Souag-Gamane, D., & Kisi, O. (2019). Estimation of monthly reference evapotranspiration using novel hybrid machine learning approaches. Hydrological Sciences Journal, 64(15), 1824–1842. doi: 10.1080/02626667.2019.1678750
Web of Science ®Google Scholar
Tsui, A. P. M., Jones, A. J., & De Oliveira, A. G. (2002). The construction of smooth models using irregular embeddings determined by a gamma test analysis. Neural Computing & Applications, 10(4), 318–329. doi: 10.1007/s005210200004
Web of Science ®Google Scholar
Wang, L., Kisi, O., Hu, B., Bilal, M., Zounemat-Kermani, M., & Li, H. (2017). Evaporation modelling using different machine learning techniques. International Journal of Climatology, 37(S1), 1076–1092. doi: 10.1002/joc.5064
Web of Science ®Google Scholar
Wang, L., Kisi, O., Zounemat-Kermani, M., & Li, H. (2017). Pan evaporation modeling using six different heuristic computing methods in different climates of China. Journal of Hydrology, 544, 407–427. doi: 10.1016/j.jhydrol.2016.11.059
Web of Science ®Google Scholar
Wang, L., Niu, Z., Kisi, O., Li, C., & Yu, D. (2017). Pan evaporation modeling using four different heuristic approaches. Computers and Electronics in Agriculture, 140, 203–213. doi: 10.1016/j.compag.2017.05.036
Web of Science ®Google Scholar
Yaseen, Z. M., Al-Juboori, A. M., Beyaztas, U., Al-Ansari, N., Chau, K.-W., Qi, C., … Shahid, S. (2019). Prediction of evaporation in arid and semi-arid regions: A comparative study using different machine learning models. Engineering Applications of Computational Fluid Mechanics, 14(1), 70–89. doi: 10.1080/19942060.2019.1680576
Web of Science ®Google Scholar
Yaseen, Z. M., El-shafie, A., Jaafar, O., Afan, H. A., & Sayl, K. N. (2015). Artificial intelligence based models for stream-flow forecasting: 2000–2015. Journal of Hydrology, 530, 829–844. doi: 10.1016/j.jhydrol.2015.10.038
Web of Science ®Google Scholar
Zorn, C. R., & Shamseldin, A. Y. (2015). Peak flood estimation using gene expression programming. Journal of Hydrology, 531, 1122–1128. doi: 10.1016/j.jhydrol.2015.11.018
Web of Science ®Google Scholar

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Modeling monthly pan evaporation process over the Indian central Himalayas: application of multiple learning artificial intelligence model

Abstract

1. Introduction

2. Material and methods

2.1. Description of case study

Table 1. Inter-correlation between climatic variables at Pantnagar and Ranichauri stations.

2.2. Gamma test (GT)

2.3. Multiple model-artificial neural networks (MM-ANN)

2.4. Multivariate adaptive regression spline (MARS)

2.5. Multi-gene genetic programming (MGGP)

2.6. Support vector machine (SVM)

2.7. M5tree model

2.8. Model performance evaluation indicators

3. Results and discussion

3.1. Selecting the GT based effective input variables

Table 2. Final structure of MM-ANN, MARS, MGGP, SVM and M5Tree models at Pantnagar and Ranichauri stations.

Table 3. GT statistics of monthly EP_m datasets for Pantnagar and Ranichauri stations.

3.2. Simulation of EP_m at Pantnagar station

Table 4. Values of the NSE, WI, LM, RMSE and MAPE criteria for the developed models during training and testing periods at Pantnagar and Ranichauri stations.

3.3. Simulation of EP_m at Ranichauri station

4. Conclusion

Disclosure statement

Related Research Data

References

Information for

Open access

Opportunities

Help and information

Modeling monthly pan evaporation process over the Indian central Himalayas: application of multiple learning artificial intelligence model

Abstract

1. Introduction

2. Material and methods

2.1. Description of case study

Table 1. Inter-correlation between climatic variables at Pantnagar and Ranichauri stations.

2.2. Gamma test (GT)

2.3. Multiple model-artificial neural networks (MM-ANN)

2.4. Multivariate adaptive regression spline (MARS)

2.5. Multi-gene genetic programming (MGGP)

2.6. Support vector machine (SVM)

2.7. M5tree model

2.8. Model performance evaluation indicators

3. Results and discussion

3.1. Selecting the GT based effective input variables

Table 2. Final structure of MM-ANN, MARS, MGGP, SVM and M5Tree models at Pantnagar and Ranichauri stations.

Table 3. GT statistics of monthly EPm datasets for Pantnagar and Ranichauri stations.

3.2. Simulation of EPm at Pantnagar station

Table 4. Values of the NSE, WI, LM, RMSE and MAPE criteria for the developed models during training and testing periods at Pantnagar and Ranichauri stations.

3.3. Simulation of EPm at Ranichauri station

4. Conclusion

Disclosure statement

Related Research Data

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 3. GT statistics of monthly EP_m datasets for Pantnagar and Ranichauri stations.

3.2. Simulation of EP_m at Pantnagar station

3.3. Simulation of EP_m at Ranichauri station