604
Views
16
CrossRef citations to date
0
Altmetric
Articles

Polynomial chaos expansion and response surface method for nonlinear modelling of reference evapotranspiration

ORCID Icon, &
Pages 720-730 | Received 08 May 2018, Accepted 19 Jan 2019, Published online: 26 Apr 2019

ABSTRACT

The feasibility of polynomial chaos expansion (PCE) and response surface method (RSM) models is investigated for modelling reference evapotranspiration (ET0). The modelling results of the proposed models are validated against the M5 model tree and multi-layer perceptron neural network (MLPNN) methods. Two meteorological stations, Isparta and Antalya, in the Mediterranean region of Turkey, are inspected. Various input combinations of daily air temperature, solar radiation, wind speed and relative humidity are constructed as input attributes for the ET0. Generally, the modelling accuracy is increased by increasing the number of inputs. Including wind speed in the model inputs considerably increases their accuracy in modelling ET0. Mean absolute error (MAE), root mean square error (RMSE), agreement index (d) and Nash-Sutcliffe efficiency (NSE) are used as comparison criteria. The PCE is the most accurate model in estimating daily ET0, giving the lowest MAE (0.036 and 0.037 mm) and RMSE (0.047 and 0.050 mm) and the highest d (0.9998 and 0.9999) and NSE (0.9992 and 0.9996) with the four-input PCE models for Isparta and Antalya, respectively.

Editor R. Woods Associate editor F.-J. Chang

1 Introduction

Evapotranspiration (ET) is an important part of the hydrological budget, which is associated with the energy and mass exchange between the water–soil system and the environment. A precise determination of the amount of water consumed by ET is a basic factor in increasing crop yield. Moreover, the estimation of ET has a decisive role in designing and determining the capacity of irrigation and drainage networks. In other words, ET is a necessary parameter for computing soil water balance and suitable irrigation scheduling, frequency of irrigation and even choosing a proper irrigation system. Considering the direct linkage between the irrigation strategy and soil water availability, the simulation/prediction of ET has an indirect effect on plant/crop productivity. Indeed, in the case of prediction of higher levels of evapotranspiration and, consequently, more water for irrigation use, other irrigation strategies such as deficit irrigation might be considered in arid and semi-arid areas (Beeson Citation2006).

In most of the methods used for estimating ET, the amount of reference evapotranspiration (ET0) is first estimated, based on which the ET of the studied plant is calculated. Based on FAO (Food and Agriculture Organization) standards, ET0 (e.g. lawn) is defined as the amount of water consumed by a farm covered with the reference plant in a specified period such that the plants in this farm never face water shortage during the growth period (Allen Citation2000, Kisi et al. Citation2016, Wang et al. Citation2017c, Citation2017b).

The most precise method of measuring ET0 is to use a lysimeter. Lysimeters work by directly measuring the elements related to water balance in a controlled cultivated land area. Nevertheless, this method is time consuming and expensive. Various methods exist for estimating ET/ET0, each yielding different results based on various meteorological data and assumptions. The level of ET0 is affected by different atmospheric factors, e.g. relative humidity, air temperature, speed of wind, solar radiation and sunshine hours. Therefore, it is impossible to come up with a simple equation for determining the level of ET0 because of the complex and nonlinear nature of this phenomenon (Kisi et al. Citation2015, Wang et al. Citation2017a). Thus, the high cross-correlation between the input dataset may improve the accuracy predictions of the complex hydrological problem based on a nonlinear model of ET0. Modelling approaches that are structured using atmospheric factors as the input data may be successfully and simply applied to predict ET0, but the modelling relationship and input dataset in the modelling processes are the effective issues for the accurate prediction of the ET0.

Soft computing techniques comprising data-knowledge techniques, such as fuzzy logic and probabilistic reasoning, and data-driven techniques, such as machine learning, artificial neural networks (ANN) and evolutionary computation, can be deployed as individual models or be embedded in unified and hybrid architectures for simulating and predicting complex phenomena and problems (Wang et al. Citation2017d, Yaseen et al. Citation2018). Contrary to hard computing techniques (e.g. numerical modelling) they have the advantages of being easier to set up, faster, robust in modelling complex problems (Keshtegar et al. Citation2019), and tolerant of imprecision and uncertainty. However, as they are approximate models, this is a drawback of soft computing techniques (Sanikhani et al. Citation2018).

In recent years, applications of soft computing methods have been increased for estimating ET/ET0 because of their success in simulating nonlinear phenomena (George et al. Citation2002, Kumar et al. Citation2002, Citation2011, Kisi Citation2006, Citation2013, Zanetti et al. Citation2007, Landeras et al. Citation2008, Doğan Citation2009, Kim et al. Citation2014, Kisi and Zounemat-Kermani Citation2014, Kisi et al. Citation2015).

Among the most recent studies, Gavili et al. (Citation2018) evaluated three different soft computing methods – ANN, adaptive neuro‐fuzzy inference system (ANFIS) and gene expression programming (GEP) – for modelling ET0 in five meteorological stations located in Iran. In general, all of the soft computing models applied were superior to the empirical models, such as the FAO56 Penman-Monteith model, in modelling ET0. Among the soft computing models, the ANN was reported as a better simulating model than the ANFIS and GEP (Gavili et al. Citation2018). Petković et al. (Citation2016) estimated ET0 in a study area in Serbia using a radial basis function neural network with particle swarm optimization (RBFN-PSO) and with back-propagation (RBFN-BP). It was found that the RBFN-PSO produced better predictions than the RBFN-BP. The abilities of two hybrid neural networks – ANFIS and ANN – combined with a cuckoo search algorithm (CS) were explored for monthly ET0 in 12 stations in Serbia, and the soft computing models performed better than the empirical models (Shamshirband et al. Citation2015).

Kisi (Citation2016) investigated the application of three different heuristic modelling methods – least square support vector regression (LSSVR), M5 model tree (M5Tree) and multivariate adaptive regression splines (MARS) – for estimating ET0. He reported that the accuracy of the MARS method was greater than that of the other methods when the climatic data of nearby stations were considered as model inputs. The accuracy of the support vector machine (SVM) (Wen et al. Citation2015) and SVM with genetic algorithm (Yin et al. Citation2017) was compared with ANN in the prediction of ET0 and the SVM showed more accurate results than the ANN. Kisi and Kilic (Citation2016) investigated the general ability of ANN and M5Tree for simulating ET0 using daily data of mean temperature, wind speed, relative humidity and solar radiation from six stations. Their analysis showed that the ANN and M5Tree models are superior to the empirical models. In two other studies, ET0 was modelled using the extreme learning machine (ELM) and generalized regression neural network (GRNN) based on temperature data in Sichuan basin (China) (Feng et al. Citation2017) and the GRNN and RBFN were compared for prediction of ET0 in the north of Algeria with various daily climate data (Ladlani et al. Citation2012). In comparison to the empirical method (FAO-56 Penman-Monteith), good performance of ELM and GRNN models was extracted (Ladlani et al. Citation2012, Feng et al. Citation2017). The ELM was applied for modelling monthly ET0 of two stations in Serbia by Gocic et al. (Citation2016) and it gave greater accuracy than the empirical models. The monthly ET0 for Belgrade and Nis stations in Serbia was predicted by Misaghian et al. (Citation2017) using the empirical relationships of Blaney-Criddle, FAO-56 Penman-Monteith, Priestley-Taylor, adjusted Hargreaves, and Jensen-Haise models with three-order tensor to give the monthly correlations. Khoshravesh et al. (Citation2017) applied a multivariate fractional polynomial model, robust regression and Bayesian regression for estimating ET0 in three arid areas in Iran. They claimed that the multivariate fractional polynomial model showed better accuracy than the other two models for approximating ET0. Petković et al. (Citation2015), Cobaner (Citation2011) and Doğan (Citation2009) investigated the effects of climatic data on monthly ET0 using ANFIS. Keshtegar et al. (Citation2018a) applied a subset-based input dataset to build the ANFIS for modelling ET0 and compared the accuracy of the subset-based ANFIS with M5Tree, ANFIS and ANN for three stations in the Anatolian climate. The subset-based ANFIS improved the accuracy of ET0 prediction compared to other nonlinear data-driven models.

As seen from the literature, input data and modelling approaches are two important issues in presenting accurate predictions of ET0. Although the nonparametric models such as M5Tree and MARS, nonlinear data-driven artificial intelligence models such as ANN, ANFIS, SVR and ELM, and empirical relationships have been developed to give accurate predictions of ET0, there is a need for a novel heuristic model and suitable strategy to obtain effective input data in the calibration process.

It is worth noting that all the reported soft computing techniques are data-driven models, even the ANFIS model which is the combination of fuzzy logic (as a knowledge-based model) and an ANN. Even though the M5Tree and SVR are categorized as machine learning models, and they both use regression techniques for mapping the relationships between the input and output vectors, their dividing process solution spaces are totally different.

Although several researchers have deployed different types of soft computing techniques for modelling ET0, comparing the performance of novel and commonly used data-driven models on the one hand, and evaluating their efficiency for different application areas on the other, it is clear that the use of soft computing will continue to expand.

In this study, attempts are made to apply several soft computing techniques – the response surface method (RSM), the polynomial chaos expansion (PCE) method, the M5 model tree (M5Tree) and the multi-layer perceptron neural network (MLPNN) – to estimate ET0 values for Isparta and Antalya stations (Turkey). The main novelty in this paper is the contribution of the less well-known methods of RSM and PCE, and the comparison of their results with those of the more common soft computing methods such as MLPNN and M5Tree models.

2 Methods and performance indicators

2.1 Response surface method (RSM)

The RSM is an efficient and simple tool for approximating nonlinear phenomena using a second-order polynomial function as follows (Keshtegar et al. Citation2018b):

(1) ET0(X)=a0+i=1naixi+i=1nj=inaijxixj(1)

where ET0(X) is the predicted reference evapotranspiration; n is the number of input variables, including the mean daily temperature (T), solar radiation (SR), mean relative humidity (RH) and wind speed (W); and a0, ai and aij are unknown coefficients. The total number of unknown coefficients is NC = (n + 1)(n + 2)/2 for n input variables. Keshtegar et al. (Citation2016) applied the RSM to predict hydrological problems and improved the RSM to obtain an accurate approximation for streamflow forecasting with a high-order RSM (Keshtegar et al. Citation2016). A nonlinear model of daily dissolved oxygen was proposed by Keshtegar and Heddam (Citation2017) using a modified RSM and hybrid response surface function (HRSF) based on exponential and second-order polynomial estimations, and this was applied for modelling monthly pan evaporation (Keshtegar and Kisi Citation2017, Yang et al. Citation2017b). The unknown coefficients of Equation (1) can be calibrated using the least-squares method by following the direct relationship given by Equation (2) (Keshtegar and Kisi Citation2017):

(2) a=P(X1),P(X2),P(X3),...,P(XN)P(X1)P(X2)P(X3)P(XN)1P(X1),P(X2),P(X3),...,P(XN)ET01ET02ET03ET0N(2)

In Equation (2), a represents the unknown coefficients vector with components of NC × 1; N denotes the number of data points in the training phase; and P(XN) is the polynomial basis value for input data XN, which is obtained using second-order polynomial basic functions at point X as given by Equation (3):

(3) P(X)=1,x1,x2,...,xn,x12,x1x2,x1x3,...,x1xn,x22,x2x3,...,x2xn,x32,x3x4,...,x3xn,...,xn2N×NC(3)

The polynomial basic functions are simply used to predict the ET0 in RSM.

2.2 Polynomial chaos expansion (PCE)

The PCE (Xiu and Karniadakis Citation2002) is widely used for considering uncertainties based on a mathematical meta-model to predict the statistical properties of real events. The PCE was originally introduced by Wiener (Citation1938) based on homogeneous and orthogonal Hermite polynomial functions for stochastic phenomena with normally distributed variables. The PCE method is structured using series expansions of orthogonal polynomials which are given by the following equation for ET0 (Wang et al. Citation2015):

(4) ET0(ξ)=i=1Paiα=1MHα(ξi)(4)

in which ai represent the unknown coefficients, Ψ(ξ)=α=1MHα(ξi) are multivariate orthogonal polynomial basis functions for input variables, which can be computed using the product of M univariate orthogonal one-dimensional polynomials. In Equation (4), ξiare the normal standard input variables, which are given for the ith data point (Xi) as ξi=Xiμσ in which μ and σ are, respectively, the mean and standard deviation of training data for the input variables of T, SR, RH and W; M is given less than the order of the problem; and Hα is the orthogonal polynomial function based on orderα. Hermite polynomials are generally selected for normal variables. The Hermite polynomials for normal standard variable ξ can be given as (Lasota et al. Citation2015):

(5) H0(ξ)=1,H1(ξ)=ξHk+1(ξ)=ξHk(ξ)kHk1(ξ)(5)

The univariate orthogonal polynomials hold the orthogonality conditions. A finite total number of PCE terms P is used to evaluate the ET0 using Equation (4). The multivariate orthogonal polynomials are given by a homogeneous chaos function with dimension M and order p.

For a problem with three dimensions (M = 3) and third-order orthogonal functionsH0,H1,H2,H3) of two input data (ξ1,ξ2), the multivariate orthogonal polynomial basis function is given as Equation (6):

(6) Ψ(ξ)=[1,H1(ξ2),H2(ξ2),H3(ξ2),H1(ξ1),H1(ξ1)H1(ξ2),H1(ξ1)H2(ξ2),H2(ξ1),H2(ξ1)H1(ξ2),H3(ξ1)](6)

Based on orthogonal polynomials, the high cross-correlation input datasets are given using the high-order p in PCE-based modelling of ET0. Thus, it may improve the prediction accuracy of nonlinear complex problems. The PCE is commonly used as an efficient and accurate tool for representation of general second-order random variables (Blatman and Sudret Citation2010). A PCE-based model was implemented for assessment of uncertainties in destitution function parameters in a hydrological model under randomness (Fajraoui et al. Citation2011, Laloy et al. Citation2013). A PCE-based surrogate model was used to represent uncertainties of input parameters for simulating subsurface flows in porous media based on a second-order PCE (Sochala and Le Maître Citation2013). In this study, the order of chaos polynomials to generate multi-dimensional Hermite polynomials is considered as 3, i.e. p = 3.

The regression method is a useful technique for determining the unknown coefficients ai, and it represents the relationship between the observed ET0 and one or more input variables. Generally, the least-squares approach can be used to calibrate the unknown coefficients, which are determined by minimizing a norm of residuals between the observed and predicted ET0 as follows:

(7) a=[ΨTΨ]1ΨTET0(7)

where a is the vector of unknown coefficients, a=a1,a2,...,aP. The matrix ΨN×P is determined using multivariate orthogonal polynomial basis functions, as follows:

(8) Ψ=Ψ1(ξ1)Ψ2(ξ1)ΨP(ξ1)Ψ1(ξ2)Ψ2(ξ2)ΨP(ξ2)Ψ1(ξN)Ψ2(ξN)ΨP(ξN)(8)

The number of data points selected should be more than P to compute the multivariate orthogonal polynomial basis functionΨ(ξ)=α=1MHα(ξi). This modelling approach can be used for a computer program by the following steps:

  • Step 1. Load input database and relative ET0 in the training phase.

  • Step 2. Set the input parameters of the PCE model as the order of orthogonal functions (p) and dimension of the problem, which is relative to the number of input data.

  • Step 3. Compute the basic p-order Hermite polynomial basis function for each input data.

  • Step 4. Determine the multivariate orthogonal polynomial basis function Ψ(ξ)=α=1MHα(ξi) using the data from Step 3 for all training data.

  • Step 5. Create the multivariate orthogonal polynomial matrixΨN×P.

  • Step 6. Compute the unknown coefficients using Equation (7).

  • Step 7. Load the input data in the test phase.

  • Step 8. Repeat Steps 3 and 4 for the test database.

  • Step 9. Predict ET0 in test phase using unknown coefficients in Step 6.

2.3 M5 model tree (M5Tree)

The M5Tree, which was introduced by Quinlan (Citation1992), belongs to the category of machine learning and data-mining approaches. This model is based on a binary decision tree, which has a set of linear regression functions in its terminal nodes (leaves). The major aim of this model is to identify the relationship between independent and dependent variables. The M5Tree can be used for quantitative data as well; this feature distinguishes it from the decision trees, which can only be used for categorical data. The main advantage of the M5Tree over regression trees is that the former is much smaller and regression functions naturally exclude many variables. A decision tree usually consists of four sections: root, branches, nodes and leaves. Each node denotes one specified feature, while branches represent an interval of values. These intervals yield the values of different sections of the set of known values for features. Splitting is performed by one of the predictor variables. The splitting criterion shows the error in that node. The process of splitting is reiterated a number of times in each node until reaching the final node (leaf). This process makes the standard deviation of data in child nodes (lower nodes) smaller than that of the parent node (Sattari et al. Citation2013, Kisi Citation2015).

The process of producing the tree structure in this model comprises the following steps: (1) error estimates, (2) linear models, (3) simplification of linear models, (4) pruning and (5) smoothing. Assuming T as a collection of training cases (a set of samples), either it is associated with a leaf (node) or it should be split into some sub-categories (subsets). Hence, the first step is to develop a decision tree by dividing data into sub-categories with respect to the standard deviation of the target values of cases in T (see Fig. 1(a)). In this step, the division criterion is determined by the assumption of standard deviation of the class values which get the node to measure the error in that node. The standard deviation reduction (SDR) in error as a result of this case is defined as:

(9) SDR=sd(T)TiTsdTi(9)

where Ti denotes a sub-category of samples that includes the ith output of the potential set; and sd stands for standard deviation. After examining all the available tests, the M5 selects one that maximizes the SDR. To estimate the error of a derived model from sets of training cases, the average residual of these cases is determined.

In the second step, a multivariate linear model is allocated for the cases at the leaves (nodes) by the aid of a regression technique. Afterwards, the linear model is simplified by eliminating some regression parameters.

The above process develops a large tree with numerous branches and nodes, complicating its use. Thus, excess branches must be trimmed to reach an optimal and efficient tree structure in the pruning phase. There are two general approaches for this: (a) trimming before the formation of the overgrown tree, and (b) trimming after the formation of the overgrown tree.

In the first approach, the trimming process does not allow for the formation of excess branches. In the second approach, however, the overgrown trees are first formed, and then trimming is done (similar to the present study). Finally, the smoothing phase can be used for improving the results by adjusting the value at the leaves along the path from the root to the leaves (Quinlan Citation1992). More information about the M5Tree modelling process is given by Pal and Deswal (Citation2009) and Sattari et al. (Citation2013).

2.4 Multi-layer perceptron artificial neural network (MLPNN)

Artificial neural networks (ANNs) are templates for data processing, and are built based on the biological neural network in the human brain. The key element in this template is the novel structure of its data processing system, which comprises numerous elements (neurons) with strong internal associations working together to solve special problems. With their considerable potential in deducing results from complicated and vague data, ANNs can be used for extracting patterns and identifying trends that are very difficult to detect by humans and computers. The MLPNN are among the most common ANNs (Nguyen et al. Citation2017). Because of their considerable potential in learning system behaviour, MLPNNs can be employed for modelling nonlinear systems. The pre-requisite for the optimal performance of MLPNNs is the optimal determination of internal parameters, e.g. initial weights and network structure. The network structure of MLPNNs usually consists of three or more layers: input, hidden and output layers. The number of neurons in the input and output layers is equal to the number of input and output variables, respectively. However, the number of neurons in the hidden layer(s) is not pre-determined and the optimal number can be obtained from a trial-and-error procedure. In fact, the major objective is to determine the number of neurons in the hidden layer and the activation function. Although there are basic rules for determining the topology of network structure, this is, in practice, experimental and done by developing many structures and finding the best one.

Neurons in the hidden layer(s) and the output layer are connected with neurons of the previous layer (or the input of the following hidden layer) by synaptic weights and bias (see )). After summing up their inputs, neurons produce their output through activation functions. The relationship that describes the input/output of a time series set of data for the MLP network is as follows:

(10) yi=βi+j=1qβijFik=1pwjkxk+bk;k=1,2,,p,j=1,2,,q(10)

Figure 1. Schematic view of (a) an M5 model tree (M5Tree) with six linear regression models, and (b) a multi-layer perceptron neural network (MLPNN).

Figure 1. Schematic view of (a) an M5 model tree (M5Tree) with six linear regression models, and (b) a multi-layer perceptron neural network (MLPNN).

where xk stands for the input value; yi is the MLPNN output (i = 1, 2, …, number of targets); F is the transfer function (usually sigmoidal for the hidden layer(s) and linear for the output layer); p and q are the number of elements for the input vector and the number of neurons in the hidden layer(s), respectively; w and β are the synaptic weights; and b is the bias.

After establishing an ANN, the network must be trained. To this end, many optimization algorithms can be utilized. The present study employed the Levenberg-Marquardt (LM) algorithm to update network weights. The LM algorithm (which is also known as the damped least-squares method) is an iterative technique which can be applied for searching local minima of complex functions. Detailed information about the LM algorithm can be found in several references (e.g. Kumar et alCitation2002).

2.5 Performance indicators

Following the literature on the validation of predictive models, the models in this research are validated using various statistical indicators (Sanikhani et al. Citation2018, Kisi and Yaseen Citation2019). The models are evaluated with respect to mean absolute error (MAE), root mean square error (RMSE), agreement index (d) and Nash-Sutcliffe efficiency (NSE), which may be calculated using the following equations:

(11) MAE=1Ni=1N(ET0)i(ETp)i(11)
(12) RMSE=1Ni=1N[(ET0)i(ETp)i]2(12)
(13) d=1i=1N[(ET0)i(ETp)i]2i=1N[(ET0)iET0+(ETp)iET0]2, 0<d1(13)
(14) NSE=1i=1N[(ET0)i(ETp)i]2i=1N[(ET0)iET0]2, <NSE1(14)

where N is the number of data; and ET0, ETp and ET0 are, respectively, the observed, predicted and mean of observed reference evapotranspiration. When MAE and RMSE values tend to zero, the predicted data obtained using the model indicate perfect performance. A perfect fitness of the predicted data with observed ET0 is given when d = 1 or NSE = 1. These evaluation metrics were selected in this study because they are commonly used in the related literature (Shamshirband et al. Citation2015, Petković et al. Citation2016, Khoshravesh et al. Citation2017, Keshtegar et al. Citation2018a).

3 Case study

Daily climatic data of air temperature (T), solar radiation (SR), relative humidity (RH) and wind speed (W) were obtained from the stations Isparta (37°47′N, 30°34′E; altitude: 997 m a.m.s.l.) and Antalya (36°42′N, 30°44′E; altitude: 47 m a.m.s.l.), located in the Mediterranean region of Turkey. The locations of the stations are shown in . The data used were obtained from the Turkish Meteorological Organization for the data periods 1973–2002 and 1978–2002 for Antalya and Isparta, respectively, of which 60% was used for training of the applied models and the remaining 40% was used for testing the obtained models. A summary of the statistical characteristics of the datasets is provided in . The T and SR show low skewed distribution and have high correlations with ET0, while W has significantly highly skewed distribution and low correlation, especially for Isparta station. This low correlation may be due to the high altitude of this station. It should be noted that the mean relative humidity is more than 60% for both stations. According to the correlations given in , T and SR seem to be the most effective parameters for ET0 estimation.

Figure 2. Location of the stations in the Mediterranean region of Turkey.

Figure 2. Location of the stations in the Mediterranean region of Turkey.

Table 1. Daily statistical parameters of each dataset. xmin, xmax, xmean, Sx and Csx denote minimum, maximum, mean, standard deviation and skewness, respectively.

4 Application and results

In this study, PCE, M5Tree, MLPNN and RSM were applied for estimation of ET0. First, an effective variable was used as the only input to the models; then, other climatic variables were added one by one considering the correlation values between each of them and ET0. Thus, the following input combinations were tried: (i) T, (ii) T and SR, (iii) T, SR and RH, and (iv) T, SR, RH and W. This strategy was also used in previous research (e.g. Doğan Citation2009, Tabari et al. Citation2010, Cobaner Citation2011, Ladlani et al. Citation2012, Citakoglu et al. Citation2014, Yang et al. Citation2017a).

The models for the simulation of ET0 in the training period for Isparta and Antalya stations are compared in and , respectively. It is apparent from and that each method shows different training performance for different input combinations. At Isparta station, with input combinations (i)–(iii), the M5Tree has the best approximation, while the PCE has the best accuracy with input combination (iv). The PCE has the second-best approximation with input combination (iii), while the MLPNN is the second most accurate with the other three input combinations. The RSM generally has the worst performance in approximation of ET0 for Isparta station. For Antalya station, the M5Tree has better approximation than the other methods for input combinations (i)–(iii), while the PCE has the best accuracy with combination (iv). The performance of the MLPNN is in the second rank for three combinations, while the RSM generally has the worst approximation of ET0 estimation at Antalya station. It is clear from and that the accuracy of the models in training is increased by adding a new variable as input, and the four-input models gave the best approximation. The PCE had the smallest MAE (0.038 and 0.033 mm) and RMSE (0.053 and 0.046 mm), and the highest d (0.9997 and 0.9999) and NSE (0.9990 and 0.9996) in the training period for Isparta and Antalya stations, respectively.

Table 2. Comparison of statistical errors for PCE, RSM, M5Tree and MLPNN models for estimation of ET0 in the training stage – Isparta station.

Table 3. Comparison of statistical errors for PCE, RSM, M5Tree and MLPNN models for estimation of ET0 in the training stage – Antalya station.

The performance statistics of the four models applied in the estimation of ET0 for Isparta station in the test stage are presented in . As seen from , the PCE has a better accuracy than the other three models for input combinations (iii) and (iv), while the MLPNN and RSM have the best accuracy with combinations (i) and (ii), respectively. In fact, there is a slight difference between the MLPNN and PCE for the first input combination. In the test period, the M5Treee generally has the worst accuracy in estimating ET0 at Isparta station. Similar to the training period, the performance of the models increased by including more inputs; the four-input models have the best accuracy. The PCE model has the lowest MAE (0.036 mm) and RMSE (0.047), and the highest d (0.9998) and NSE (0.9992) in estimating ET0 for Isparta station.

Table 4. Comparison of statistical errors for PCE, RSM, M5Tree and MLPNN models for estimation of ET0 in the test stage – Isparta station.

The performance of the applied methods in estimation of ET0 on the test data of Antalya station is presented in . From , it is apparent that the PCE has better estimation accuracy than the other methods for two of the four combinations, (i) and (iv), while the MLPNN has the best accuracy for input combination (ii), followed by the PCE model. With input combination (iii), both PCE and MLPNN have almost the same accuracy in estimation of ET0 for Antalya station. Similar to the Isparta station, here also the M5Tree generally has the worst estimates. An increment in the accuracy of the models is clearly seen when the number of inputs is increased. For Antalya station also, the PCE has the lowest MAE (0.037 mm) and RMSE (0.050) and the highest d (0.9999) and NSE (0.9996) in estimating ET0.

Table 5. Comparison of statistical errors for PCE, RSM, M5Tree and MLPNN models for estimation of ET0 in the test stage – Antalya station.

The observed and estimated ET0 values are compared for each method and each combination in Figures 3–. It is clear from these scatter plots that including more variables in the model inputs results in less scattered estimates for both stations. It may be seen from that the estimates of the four-input models are closer to the observed ET0 than those of the other models that use different input combinations; the four-input PCE has less scattered estimates compared to the corresponding M5Tree, MLPNN and RSM models. Comparison of and indicates that adding wind speed input considerably improved the accuracy of the models in estimation of ET0 at both stations, even though this parameter has the lowest correlation with ET0 (see ). This implies that there is a strong nonlinear relationship between the W and ET0. The comparison of the two stations suggests that the models are better in estimating ET0 for Antalya compared to Isparta. The main reason for this might be the difference in altitude and climatic characteristics of these locations. This can be clearly seen from the temperature ranges shown in : the temperature at Antalya is in the range [1.7, 37.5], while that at Isparta is [−10.5, 29.5]. Similarly, the ET0 ranges are [0, 15] and [0, 8.19] for Antalya and Isparta, respectively.

Figure 3. Scatter plots of observed and different estimated ET0 models for Isparta station (top row) and Antalya station (bottom row) in the test period based on the input variable mean air temperature.

Figure 3. Scatter plots of observed and different estimated ET0 models for Isparta station (top row) and Antalya station (bottom row) in the test period based on the input variable mean air temperature.

Figure 4. Scatter plots of observed and different estimated ET0 models for Isparta station (top row) and Antalya station (bottom row) in the test period based on the input variables mean air temperature and solar radiation.

Figure 4. Scatter plots of observed and different estimated ET0 models for Isparta station (top row) and Antalya station (bottom row) in the test period based on the input variables mean air temperature and solar radiation.

Figure 5. Scatter plots of observed and different estimated ET0 models for Isparta station (top row) and Antalya station (bottom row) in the test period based on the input variables mean air temperature, solar radiation and mean relative humidity.

Figure 5. Scatter plots of observed and different estimated ET0 models for Isparta station (top row) and Antalya station (bottom row) in the test period based on the input variables mean air temperature, solar radiation and mean relative humidity.

Figure 6. Scatter plots of observed and different estimated ET0 models for Isparta station (top row) and Antalya station (bottom row) in the test period based on the input variables mean air temperature, solar radiation, mean relative humidity and wind speed.

Figure 6. Scatter plots of observed and different estimated ET0 models for Isparta station (top row) and Antalya station (bottom row) in the test period based on the input variables mean air temperature, solar radiation, mean relative humidity and wind speed.

5 Conclusion

This study investigated the ability of a new method, polynomial chaos expansion (PCE), to model reference evapotranspiration. The results of the PCE were compared with those of the M5 model tree (M5Tree), the multi-layer perceptron neural network (MLPNN), and the response surface method (RSM) with respect to various evaluation criteria. Daily climatic data from two stations situated in the Mediterranean region of Turkey were used as case studies. Application of various combinations of input data indicated that the four-input models, comprising temperature, solar radiation, relative humidity and wind speed as inputs, have the best accuracy in predicting ET0. From the comparison of the four methods, the PCE provided the best accuracy in modelling ET0 followed by the MLPNN, RSM and M5Tree, respectively. The addition of wind speed as input in the models increased the models’ accuracy considerably, even though wind speed has low correlation with ET0. This indicated the importance of including wind speed for the accurate estimation of ET0. The applied models gave better estimates for Antalya station compared to Isparta. The possible reasons for this were thought to be the differences in altitude and climatic characteristics of these two stations. The ability of PCE to predict complex engineering problems such as ET0 should be evaluated for other stations with different climates in future studies.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  • Allen, R.G., 2000. Using the FAO-56 dual crop coefficient method over an irrigated region as part of an evapotranspiration intercomparison study. Journal of Hydrology, 229 (1–2), 27–41. doi:10.1016/S0022-1694(99)00194-8.
  • Beeson, R., 2006. Relationship of plant growth and actual evapotranspiration to irrigation frequency based on management allowed deficits for container nursery stock. Journal of the American Society for Horticultural Science, 131 (1), 140–148. doi:10.21273/JASHS.131.1.140.
  • Blatman, G. and Sudret, B., 2010. An adaptive algorithm to build up sparse polynomial chaos expansions for stochastic finite element analysis. Probabilistic Engineering Mechanics, 25 (2), 183–197. doi:10.1016/j.probengmech.2009.10.003.
  • Citakoglu, H., et al., 2014. Estimation of monthly mean reference evapotranspiration in Turkey. Water Resources Management, 28 (1), 99–113. doi:10.1007/s11269-013-0474-1.
  • Cobaner, M., 2011. Evapotranspiration estimation by two different neuro-fuzzy inference systems. Journal of Hydrology, 398 (3–4), 292–302. doi:10.1016/j.jhydrol.2010.12.030.
  • Doğan, E., 2009. Reference evapotranspiration estimation using adaptive neuro‐fuzzy inference systems. Irrigation and Drainage: the Journal of the International Commission on Irrigation and Drainage, 58 (5), 617–628. doi:10.1002/ird.445.
  • Fajraoui, N., et al., 2011. Use of global sensitivity analysis and polynomial chaos expansion for interpretation of nonreactive transport experiments in laboratory‐scale porous media. Water Resources Research, 47 (2). doi:10.1029/2010WR009639.
  • Feng, Y., et al., 2017. Modeling reference evapotranspiration using extreme learning machine and generalized regression neural network only with temperature data. Computers and Electronics in Agriculture, 136, 71–78. doi:10.1016/j.compag.2017.01.027
  • Gavili, S., et al., 2018. Evaluation of several soft computing methods in monthly evapotranspiration modelling. Meteorological Applications, 25 (1), 128–138. doi:10.1002/met.2018.25.issue-1.
  • George, B.A., et al., 2002. Decision support system for estimating reference evapotranspiration. Journal of Irrigation and Drainage Engineering, 128 (1), 1–10. doi:10.1061/(ASCE)0733-9437(2002)128:1(1).
  • Gocic, M., et al., 2016. Comparative analysis of reference evapotranspiration equations modelling by extreme learning machine. Computers and Electronics in Agriculture, 127, 56–63. doi:10.1016/j.compag.2016.05.017
  • Keshtegar, B., et al., 2016. Optimized river stream-flow forecasting model utilizing high-order response surface method. Water Resources Management, 30 (11), 3899–3914. doi:10.1007/s11269-016-1397-4.
  • Keshtegar, B., et al., 2018a. Subset modeling basis ANFIS for prediction of the reference evapotranspiration. Water Resources Management, 32 (3), 1101–1116. doi:10.1007/s11269-017-1857-5.
  • Keshtegar, B., et al., 2019. A novel nonlinear modeling for the prediction of blast-induced airblast using a modified conjugate FR method. Measurement, 131, 35–41. doi:10.1016/j.measurement.2018.08.052
  • Keshtegar, B. and Heddam, S., 2017. Modeling daily dissolved oxygen concentration using modified response surface method and artificial neural network: a comparative study. Neural Computing and Applications. doi:10.1007/s00521-017-2917-8.
  • Keshtegar, B. and Kisi, O., 2017. Modified response-surface method: new approach for modeling pan evaporation. Journal of Hydrologic Engineering, 22 (10), 04017045. doi:10.1061/(ASCE)HE.1943-5584.0001541.
  • Keshtegar, B., Mert, C., and Kisi, O., 2018b. Comparison of four heuristic regression techniques in solar radiation modeling: kriging method vs RSM, MARS and M5 model tree. Renewable and Sustainable Energy Reviews, 81, 330–341. doi:10.1016/j.rser.2017.07.054
  • Khoshravesh, M., Sefidkouhi, M.A.G., and Valipour, M., 2017. Estimation of reference evapotranspiration using multivariate fractional polynomial, Bayesian regression, and robust regression models in three arid environments. Applied Water Science, 7 (4), 1911–1922. doi:10.1007/s13201-015-0368-x.
  • Kim, S., et al., 2014. Modeling nonlinear monthly evapotranspiration using soft computing and data reconstruction techniques. Water Resources Management, 28 (1), 185–206. doi:10.1007/s11269-013-0479-9.
  • Kisi, Ö., 2006. Generalized regression neural networks for evapotranspiration modelling. Hydrological Sciences Journal, 51 (6), 1092–1105. doi:10.1623/hysj.51.6.1092.
  • Kisi, O., 2013. Least squares support vector machine for modeling daily reference evapotranspiration. Irrigation Science, 31 (4), 611–619. doi:10.1007/s00271-012-0336-2.
  • Kisi, O., et al., 2015. Long-term monthly evapotranspiration modeling by several data-driven methods without climatic data. Computers and Electronics in Agriculture, 115, 66–77. doi:10.1016/j.compag.2015.04.015
  • Kisi, O., 2015. Pan evaporation modeling using least square support vector machine, multivariate adaptive regression splines and M5 model tree. Journal of Hydrology, 528, 312–320. doi:10.1016/j.jhydrol.2015.06.052
  • Kisi, O., et al., 2016. Daily pan evaporation modeling using chi-squared automatic interaction detector, neural networks, classification and regression tree. Computers and Electronics in Agriculture, 122, 112–117. doi:10.1016/j.compag.2016.01.026
  • Kisi, O., 2016. Modeling reference evapotranspiration using three different heuristic regression approaches. Agricultural Water Management, 169, 162–172. doi:10.1016/j.agwat.2016.02.026
  • Kisi, O. and Kilic, Y., 2016. An investigation on generalization ability of artificial neural networks and M5 model tree in modeling reference evapotranspiration. Theoretical and Applied Climatology, 126 (3–4), 413–425. doi:10.1007/s00704-015-1582-z.
  • Kisi, O. and Yaseen, Z.M., 2019. The potential of hybrid evolutionary fuzzy intelligence model for suspended sediment concentration prediction. CATENA, 174, 11–23. doi:10.1016/j.catena.2018.10.047
  • Kisi, O. and Zounemat-Kermani, M., 2014. Comparison of two different adaptive neuro-fuzzy inference systems in modelling daily reference evapotranspiration. Water Resources Management, 28 (9), 2655–2675. doi:10.1007/s11269-014-0632-0.
  • Kumar, M., et al., 2002. Estimating evapotranspiration using artificial neural network. Journal of Irrigation and Drainage Engineering, 128 (4), 224–233. doi:10.1061/(ASCE)0733-9437(2002)128:4(224).
  • Kumar, M., Raghuwanshi, N., and Singh, R., 2011. Artificial neural networks approach in evapotranspiration modeling: a review. Irrigation Science, 29 (1), 11–25. doi:10.1007/s00271-010-0230-8.
  • Ladlani, I., et al., 2012. Modeling daily reference evapotranspiration (ET0) in the north of Algeria using generalized regression neural networks (GRNN) and radial basis function neural networks (RBFNN): a comparative study. Meteorology and Atmospheric Physics, 118 (3–4), 163–178. doi:10.1007/s00703-012-0205-9.
  • Laloy, E., et al., 2013. Efficient posterior exploration of a high‐dimensional groundwater model from two‐stage Markov chain Monte Carlo simulation and polynomial chaos expansion. Water Resources Research, 49 (5), 2664–2682. doi:10.1002/wrcr.20226.
  • Landeras, G., Ortiz-Barredo, A., and López, J.J., 2008. Comparison of artificial neural network models and empirical and semi-empirical equations for daily reference evapotranspiration estimation in the Basque Country (Northern Spain). Agricultural Water Management, 95 (5), 553–565. doi:10.1016/j.agwat.2007.12.011.
  • Lasota, R., et al., 2015. Polynomial chaos expansion method in estimating probability distribution of rotor-shaft dynamic responses. Bulletin of the Polish Academy of Sciences Technical Sciences, 63 (2), 413–422. doi:10.1515/bpasts-2015-0047.
  • Misaghian, N., et al., 2017. Predicting the reference evapotranspiration based on tensor decomposition. Theoretical and Applied Climatology, 130 (3–4), 1099–1109. doi:10.1007/s00704-016-1943-2.
  • Nguyen, K.Q., et al., 2017. Artificial lights improve the catchability of snow crab (Chionoecetes opilio) traps. Aquaculture and Fisheries, 2 (3), 124–133. doi:10.1016/j.aaf.2017.05.001.
  • Pal, M. and Deswal, S., 2009. M5 model tree based modelling of reference evapotranspiration. Hydrological Processes, 23 (10), 1437–1443. doi:10.1002/hyp.v23:10.
  • Petković, D., et al., 2015. Determination of the most influential weather parameters on reference evapotranspiration by adaptive neuro-fuzzy methodology. Computers and Electronics in Agriculture, 114, 277–284. doi:10.1016/j.compag.2015.04.012.
  • Petković, D., et al., 2016. Particle swarm optimization-based radial basis function network for estimation of reference evapotranspiration. Theoretical and Applied Climatology, 125 (3–4), 555–563. doi:10.1007/s00704-015-1522-y.
  • Quinlan, J.R., 1992. Learning with continuous classes. In: 5th Australian joint conference on artificial intelligence, Singapore, 343–348.
  • Sanikhani, H., et al., 2018. Temperature-based modeling of reference evapotranspiration using several artificial intelligence models: application of different modeling scenarios. Theoretical and Applied Climatology. doi:10.1007/s00704-018-2390-z.
  • Sattari, M.T., et al., 2013. M5 model tree application in daily river flow forecasting in Sohu Stream, Turkey. Water Resources, 40 (3), 233–242. doi:10.1134/S0097807813030123.
  • Shamshirband, S., et al., 2015. Estimation of reference evapotranspiration using neural networks and cuckoo search algorithm. Journal of Irrigation and Drainage Engineering, 142 (2), 04015044. doi:10.1061/(ASCE)IR.1943-4774.0000949.
  • Sochala, P. and Le Maître, O., 2013. Polynomial Chaos expansion for subsurface flows with uncertain soil parameters. Advances in Water Resources, 62, 139–154. doi:10.1016/j.advwatres.2013.10.003
  • Tabari, H., Marofi, S., and Sabziparvar, -A.-A., 2010. Estimation of daily pan evaporation using artificial neural network and multivariate non-linear regression. Irrigation Science, 28 (5), 399–406. doi:10.1007/s00271-009-0201-0.
  • Wang, L., et al., 2017a. Evaporation modelling using different machine learning techniques. International Journal of Climatology, 37, 1076–1092. doi:10.1002/joc.2017.37.issue-S1.
  • Wang, L., et al., 2017b. Influence of main-panel angle on the hydrodynamic performance of a single-slotted cambered otter-board. Aquaculture and Fisheries, 2 (5), 234–240. doi:10.1016/j.aaf.2017.09.001.
  • Wang, L., et al., 2017c. Pan evaporation modeling using six different heuristic computing methods in different climates of China. Journal of Hydrology, 544, 407–427. doi:10.1016/j.jhydrol.2016.11.059
  • Wang, L., et al., 2017d. Pan evaporation modeling using four different heuristic approaches. Computers and Electronics in Agriculture, 140, 203–213. doi:10.1016/j.compag.2017.05.036
  • Wang, S., et al., 2015. A polynomial chaos ensemble hydrologic prediction system for efficient parameter inference and robust uncertainty assessment. Journal of Hydrology, 530, 716–733. doi:10.1016/j.jhydrol.2015.10.021
  • Wen, X., et al., 2015. Support-vector-machine-based models for modeling daily reference evapotranspiration with limited climatic data in extreme arid regions. Water Resources Management, 29 (9), 3195–3209. doi:10.1007/s11269-015-0990-2.
  • Wiener, N., 1938. The homogeneous chaos. American Journal of Mathematics, 60 (4), 897–936. doi:10.2307/2371268.
  • Xiu, D. and Karniadakis, G.E., 2002. The Wiener–askey polynomial chaos for stochastic differential equations. SIAM Journal on Scientific Computing, 24 (2), 619–644. doi:10.1137/S1064827501387826.
  • Yang, G., et al., 2017a. Effect of the exposure to suspended solids on the enzymatic activity in the bivalve Sinonovacula constricta. Aquaculture and Fisheries, 2 (1), 10–17. doi:10.1016/j.aaf.2017.01.001.
  • Yang, Y., Liu, Z., and Liu, Y., 2017b. Prediction of lute acoustic quality based on soundboard vibration performance using multiple choice model. Journal of Forestry Research, 28 (4), 855–861. doi:10.1007/s11676-016-0365-4.
  • Yaseen, Z.M., et al., 2018. Application of the hybrid artificial neural network coupled with rolling mechanism and grey model algorithms for streamflow forecasting over multiple time horizons. Water Resources Management, 32 (5), 1883–1899. doi:10.1007/s11269-018-1909-5.
  • Yin, Z., et al., 2017. Integrating genetic algorithm and support vector machine for modeling daily reference evapotranspiration in a semi-arid mountain area. Hydrology Research, 48 (5), 1177–1191. doi:10.2166/nh.2016.205.
  • Zanetti, S., et al., 2007. Estimating evapotranspiration using artificial neural network and minimum climatological data. Journal of Irrigation and Drainage Engineering, 133 (2), 83–89. doi:10.1061/(ASCE)0733-9437(2007)133:2(83).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.