Search in:

Engineering Applications of Computational Fluid Mechanics Volume 15, 2021 - Issue 1

Submit an article Journal homepage

Open access

2,106

Views

CrossRef citations to date

Altmetric

Listen

Research Article

Prediction of daily water level using new hybridized GS-GMDH and ANFIS-FCM models

Isa Ebtehaja Department of Soils and Agri-Food Engineering, Universite Laval, Quebec, Canada

Saad Sh. Sammenb Department of Civil Engineering, College of Engineering, University of Diyala, Diyala Governorate, IraqCorrespondence[email protected]

https://orcid.org/0000-0001-7014-0285

Lariyah Mohd Sidekc Institute of Energy Infrastructure, Universiti Tenaga Nasional, Jalam IKRAM-UINITEN, Selangor, Malaysia

Anurag Malikd Punjab Agricultural University, Regional Research Station, Bathinda, Punjab, India

https://orcid.org/0000-0002-0298-5777

Parveen Sihage Department of civil engineering, Shoolini University, Solan, India

https://orcid.org/0000-0002-7761-0603

Ahmed Mohammed Sami Al-Janabif Department of Civil Engineering, Tishk International University-Erbil, Kurdistan Region, Iraq

https://orcid.org/0000-0003-2150-6396

Kwok-Wing Chaug Department of Civil and Environmental Engineering, Hong Kong Polytechnic University, Hong Kong, People’s Republic of China

Hossein Bonakdarih Department of Soils and Agro-Food Engineering, Laval University, Quebec, Canada

https://orcid.org/0000-0001-6169-3654

show all

Pages 1343-1361 | Received 13 Jul 2020, Accepted 15 Jul 2021, Published online: 20 Sep 2021

Cite this article
https://doi.org/10.1080/19942060.2021.1966837
CrossMark

In this article

1. Introduction
2. Methodology and dataset
3. Results and discussion
4. Conclusions
Acknowledgements
Disclosure statement
References

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Accurate prediction of water level (WL) is essential for the optimal management of different water resource projects. The development of a reliable model for WL prediction remains a challenging task in water resources management. In this study, novel hybrid models, namely, Generalized Structure-Group Method of Data Handling (GS-GMDH) and Adaptive Neuro-Fuzzy Inference System with Fuzzy C-Means (ANFIS-FCM) were proposed to predict the daily WL at Telom and Bertam stations located in Cameron Highlands of Malaysia. Different percentage ratio for data division i.e. 50%–50% (scenario-1), 60%–40% (scenario-2), and 70%–30% (scenario-3) were adopted for training and testing of these models. To show the efficiency of the proposed hybrid models, their results were compared with the standalone models that include the Gene Expression Programming (GEP) and Group Method of Data Handling (GMDH). The results of the investigation revealed that the hybrid GS-GMDH and ANFIS-FCM models outperformed the standalone GEP and GMDH models for the prediction of daily WL at both study sites. In addition, the results indicate the best performance for WL prediction was obtained in scenario-3 (70%–30%). In summary, the results highlight the better suitability and supremacy of the proposed hybrid GS-GMDH and ANFIS-FCM models in daily WL prediction, and can, serve as robust and reliable predictive tools for the study region.

KEYWORDS:

Water level prediction
hybrid models
GEP
GMDH
Cameron highland

1. Introduction

Prediction of river water level is a critical process in river discharge estimation and it is required for better water resources management (Dingman & Bjerklie, Citation2005; Tsujikura et al., Citation2016; Vachtman & Laronne, Citation2014). The accurate prediction of a river water level improves flood prediction systems and can act as a warning alarm for early decision-making and planning to reduce the effect of flood events which is considered as one of the most damaging natural hazards on life and property (Hettiarachchi & Thilakumara, Citation2014; Morales-Pinzón et al., Citation2015; Tsujikura et al., Citation2016; Xu et al., Citation2019). In Malaysia, floods and flash floods are often happened due to prolonged heavy rainfall; however, the possibility of floods may increase as a result of climate change and global warming (Arbain & Wibowo, Citation2012; Buslima et al., Citation2018; Suri et al., Citation2014). To deal with the flood phenomena, three categories of critical river water levels have been introduced by the Department of Irrigation and Drainage (DID) Malaysia, namely, normal, alert, and danger levels (Gasim et al., Citation2007). The three categories have been identified by analyzing the characteristics of floods in Malaysia, such as water level, peak discharge, inundated area, the volume of flow, and flood duration, for many years.

Water level prediction in rivers is usually conducted using empirical models. These empirical models are developed based on accumulating long-time-series data using in situ sensors that are expensive, hard to maintain, and available in specific areas (Rigos et al., Citation2020). However, as per (Hettiarachchi & Thilakumara, Citation2014) prediction of river water level using non-linear models that includes many environmental parameters (e.g. catchment area and flow rates) imperfectly agreed with the realistic observation data; this may due to the complex nature of the dynamic and rapidly water level fluctuations or due to ignoring some important parameters in the theory (See & Openshaw, Citation1999). Moreover, modeling these complex processes by differential equations has little use in practice as it results in complex, time-consuming, and mathematically intractable non-linear models.

Soft computing techniques have successfully used in the last three decades to solve different complex hydrological problems (Daliakopoulos et al., Citation2005; Ehteram et al., Citation2021; Hadi et al., Citation2019; Kaloop et al., Citation2017; Malik et al., Citation2020b; Parsaie et al., Citation2015; Sammen et al., Citation2017; Singh et al., Citation2018; Tikhamarine et al., Citation2020c; Yaseen et al., Citation2020b; Young et al., Citation2015). For water level prediction, several techniques have been used such as Artificial Neural Networks (ANN) (Alvisi et al., Citation2006), Autoregressive Integrated Moving Average (ARIMA) (Reza et al., Citation2018; Sihag et al., Citation2020; Xu et al., Citation2019), and Support Vector Machine (SVM) (Khan & Coulibaly, Citation2006; Liong & Sivapragasam, Citation2002). These models have been applied for flow and water level prediction of some rivers in Malaysia, such as Muda River, Kedah (Khairuddin et al., Citation2019), Dungun River, Terengganu (Gasim et al., Citation2007), Langat River, Selangor (Toriman et al., Citation2009). Evaluation of soft computing performance conducted by Firat (Citation2008), Toriman et al. (Citation2009), and Khairuddin et al. (Citation2019) acknowledged superiority over statistical and time series methods for flood forecasting. Besides, the soft computing/ machine learning (ML) models received several practical applications in diverse fields like the prediction of solar radiation (Qin et al., Citation2018; Wang et al., Citation2016, Citation2017b), evaporation modeling (Adnan et al., Citation2019; Ashrafzadeh et al., Citation2020; Malik et al., Citation2017, Citation2018; Wang et al., Citation2017a, Citation2017c), rainfall-runoff forecasting (Malik et al., Citation2020b; Singh et al., Citation2018; Tikhamarine et al., Citation2020c), reference evapotranspiration estimation (Malik et al., Citation2019a; Mohamadi et al., Citation2020; Tikhamarine et al., Citation2019, Citation2020a, Citation2020b), meteorological and hydrological drought prediction (Malik et al., Citation2019c, Citation2020a, Citation2021a, Citation2021b, Citation2021c; Malik & Kumar, Citation2020), and simulation of seepage flow through embankment dam (Rehamnia et al., Citation2021).

Moreover, a comparison among ANN, ARMA (Auto-regressive Moving Average), and SVM models that were conducted by Lin et al. (Citation2006) revealed that the SVM model can give a more accurate prediction of long-term flow discharges than the others. A comprehensive review of the applications of genetic programming (GP) in the analysis of water resources systems was conducted by Mohammad-Azari et al. (Citation2020). The review indicates the capability and superiority of the model for solving a wide variety of water-related problems such as modeling rainfall-runoff, streamflow, sedimentation, flood, evaporation, water quality, water demand, and water distribution systems. Moosavi et al. (Citation2017) evaluated the performance of GMDH and wavelet-GMDH models for daily runoff forecasting from Darian-Chay, Ghale-Chay, and Lilan-Chay Rivers in East Azerbaijan (Iran). The evaluation indicates that the performance of the GMDH model was efficiently enhanced when the wavelet-based analyzed data was added to the model to deal with the non- stationarities in the data.

In this study, two hybrid models, namely, Generalized Structure-Group Method of Data Handling (GS-GMDH) and Adaptive Neuro-Fuzzy Inference System with Fuzzy C-Means (ANFIS-FCM) were developed by using data obtained from two water level stations located in Perak River, Malaysia. The study also compared the efficiency and performance of the hybrid models (i.e. GS-GMDH and ANFIS-FCM) with two standalone models, namely, the Gene Expression Programming (GEP) and Group Method of Data Handling (GMDH) through statistical indicators and graphical interpretation. The results of this study promise better accuracy of the hybrid GS-GMDH and ANFIS-FCM models in river water level prediction.

2. Methodology and dataset

2.1. Study area

Cameron Highlands is the smallest region in the province of Pahang Darul Makmur and offers its fringes with the territory of Kelantan and Perak, in the north and west, respectively. It is situated in the Main Range (Banjaran Titiwangsa) between 4° 27′ 53^″ N – 4° 32′ 39^″ N and 101° 23′ 10^″ E – 101° 25′ 25^″ E. The region of Cameron Highlands with an expected region of 71,218 hectares is hilly, extending from 300 m at the stream valleys on the eastern limit to 210 m (Gunung Irau) on the western boundary. The most elevated point open by street in Peninsular Malaysia, Gunung Brinchang (2031 m), is one of the significant tops in Cameron Highlands, side from Gunung Swettenham (1961 m), Gunung Siku (1916 m), Gunung Berembun (1840 m), Gunung Cantik (1802 m) and Gunung Jasar (1704 m). About 75% of the area of the provenance is situated above 1000 m heights. The examination zone falls within Cameron Highlands Districts arranged at Pahang Darul Makmur, which the region assessed to be 712 km². Its temperature falls not more than 25°C and is broadly known as an uneven region with horticultural practices (Eisakhani & Malakahmad, Citation2009). Cameron Highlands is comprised of three significant catchments of Bertam, Telom, and Lemoi as shown in Figure . Bertam comprises five main sub-catchments which are Habu, Ringlet, Lembah Bertam, Tanah Rata, and Brinchang. While, Tringkap, Kampung Raja, and Kuala Terla are the sub-catchments in Telom. Cameron Highlands gets normal yearly precipitation of 2800 mm and normally 2 out of 3 days is raining (Tan & Beh, Citation2015). Therefore, almost every day precipitation could be felt in Cameron Highlands. The details of the station are organized in Table . For modeling the water level, daily data from January 2009 to August 2014 was used for Telom at Batu station, and data from January 2009 to March 20016 was used for Bertam at Rabinson Falls Intake station. Table summarizes statistical parameters i.e. Max. = maximum, Min. = Minimum, SD = standard deviation, Skew = Skewness, Q1, Q2, and Q3 = first, second and third quartiles of WL at both stations.

Figure 1. Location map of study basin (Nasidi et al., Citation2021).

Table 1. List of water level stations with their geographical coordinates.

Download CSV Display Table

Table 2. The statistical characteristics of two stations.

Download CSV Display Table

2.2. Gene expression programming (GEP)

GEP was initially introduced by Ferreira (Citation2002). It is a generated technique with the base of genetic algorithms (GA) and has been broadly adopted in recent investigations (Ebtehaj et al., Citation2015a; Ferreira, Citation2002). The PC program of GEP is encoded in linear chromosomes, which are then explained into trees term (Shabani et al., Citation2018). A systematic diagram of GEP appears in Figure . The initial step is to create the underlying population, which occurs with subjective births of chromosomes. Then the chromosomes are converted to expression trees (ETs) that are analyzed by performance measures to shows the solubility of delivered ETs. If the outcomes convince the performance measures criteria, population producing stops, and if the outcomes are not agreeable, the system redeveloped with some improvement to make generation with improved value, and this procedure happens until the best outcomes are accomplished. For additional clarification about GEP, readers and researchers are referred to (Ferreira, Citation2006; Kiafar et al., Citation2017). According to that, there is no certain method to find the optimum values of the GEP parameters, the optimum values of the GEP parameters for each station were found through a trial-and-error process (Azimi et al., Citation2017). The most optimum values of the GEP parameters for each station are provided in Table .

Figure 2. Description of GEP model.

Table 3. The most optimum values of the GEP parameters.

Download CSV Display Table

2.3. Group method of data handling (GMDH)

Ivakhnenko (Citation1971) initially proposed the GMDH method. It’s practical in different sections for deep learning and science detection and is applied in several fields as forecasting, pattern recognition, and optimization. Analogical GMDH algorithms present the feasibility to discover automatically interrelations in data, to obtain the best structure of model or network, and to enhance the accuracy of existing algorithms. GMDH is containing numerous algorithms for the solution of various types of problems including clusterization, parametric, and probability algorithms. This method is relying on the sorting-out of gradually complex models and chooses the superlative solution via the lowest of outside criterion features. Generally, this method has numerous inputs and one output, which is a subset of elements of the base function (Madala & Ivakhnenko, Citation2019). To obtain the superior solution this model considers a variety of elements subsets of the initial function (Madala & Ivakhnenko, Citation2019) known as partial models. Least-squares techniques are used to find the coefficients of these models. GMDH algorithms gently enhance the number of incomplete model elements and discover a model structure with optimal complications represented via the lowest value of an outside criterion. This method is known as the self-organization of models (Schmidhuber, Citation2015): (1) $\begin{aligned} Y (x_{1}, \dots x_{n}) & = α_{0} + \sum_{i = 1}^{m} α_{i} f_{i} \end{aligned}$ (1) (2) $\begin{aligned} Y (x_{1}, \dots x_{n}) & = α_{0} + \sum_{i = 1}^{n} α_{i} x_{i} + \sum_{i = 1}^{n} \sum_{j = i}^{n} α_{i j} x_{i x_{j}} \\ + \sum_{i = 1}^{n} \sum_{j = i}^{n} \sum_{k = j}^{n} α_{i j k} x_{i} x_{j} x_{k} + \dots \end{aligned}$ (2) where $Y (x_{1}, \dots x_{n})$ represent the input content and $n$ the number of input variables. Also, $α (α_{1}, \dots α_{n})$ coefficients are acquired via regression techniques for each couple of $x_{i} and x_{j}$ input variables (Farlow, Citation1981). Hence, the GMDH algorithm uses various second-degree polynomials.

2.4. Generalized structure-group method of data handling (GS-GMDH)

The standard GMDH model has few drawbacks that the low performance of this model in complex and nonlinear problems. In the present investigation, a novel encoding of GMDH has developed to increases the accuracy of the standard GMDH model. The main drawback of this model is the utilize of only two parameters as inputs for every neuron. In standard GMDH models, the input variables of every neuron are chosen from neighboring neurons. In the present study, a new generalized structure of the GMDH algorithm (GS-GMDH) model was developed to decrease the drawbacks of the standard GMDH model. The introduced new method decreases the limitations available in the standard GMDH algorithm. In GS-GMDH, the proposed neurons can be consisting of 2 or 3 input variables. Moreover, the polynomials are considered as second and third order. Besides, the input of every neuron can be chosen from both neighboring and non-neighboring layers. The most favorable structure of GS-GMDH (Figure ) is obtained based on the Akaike Information Criterion (AIC) as follows (Ebtehaj et al., Citation2015b): (3) $AIC = n * \log (MSE) + 2 (N + 1)$ (3) where N is the number of neurons in the model, n is the number of samples, and MSE is the mean square error.

Figure 3. The flowchart of the proposed GS-GMDH model.

2.5. Adaptive neuro-fuzzy inference system (ANFIS)

The adaptive neuro-fuzzy inference system (ANFIS) is an amalgamation of ANN and fuzzy logic (FL) for developing non-linear problems and initially, it was developed by Jang et al. IF-THEN fuzzy rules are utilized in model development by ANFIS (Sobhani et al., Citation2010; Yuan et al., Citation2014). It draws advantages of both ANN and FL. It could successfully be used where ordinary traditional techniques fail or too weighty (Vakhshouri & Nejadi, Citation2018). Shape and number of membership functions (MFs) are significant parameters in ANFIS to generate a model with the least error zone. Figure display the structure of an ANFIS model having two input variables. For simplicity of illustration only two inputs p, q, and single target, y is considered in this figure.

Figure 4. The flowchart of the proposed ANFIS-FCM model.

2.6. Fuzzy C-means method (FCM)

The algorithm k-mean is one of the grouping algorithms, which is utilized broadly. This algorithm with unsupervised in large data sets is faced with limitations in preparing. To deal with the shortcoming, distinctive grouping algorithms are given. Fuzzy C-means clustering as an alternative technique is utilized (Kisi & Zounemat-Kermani, Citation2016). Fuzzy c-means (FCM) were presented by Bezdek et al. (Citation1981), and improved by variables and dependent variables (target) specified in this stage are: (4) $\begin{aligned} S^{1} M_{a} & = μ M_{a} (p), a = 1, 2 \end{aligned}$ (4) (5) $\begin{aligned} S^{1} N_{a} & = μ N_{a} (q), a = 1, 2 \end{aligned}$ (5) where $p$ and $q$ are crisp inputs, and $M_{a}$ and $N_{a}$ are fuzzy set, low, medium, high-class size membership functions are applied, which could any shape such as triangular, trapezoidal, bell-shaped, Gaussian function, etc. (Cai et al., Citation2007). In Fuzzy clustering, designs in clusters with common are classified, and a pattern can appertain multiple clusters with an alternate proportion. In the FCM algorithm, designs are blocked to the C cluster, truth be told, the quantity of clusters (C) is indicated prior, yet the focal point of the cluster is chosen haphazardly. The level of membership for each example as indicated by the membership function is determined by the focal point of each cluster. The goal of the FCM algorithm is to discover a group that the likeness between designs inside various clusters is minimized. One of the primary benefits of the FCM technique is that in this approach, every data point is related to at least two clusters. The FCM cluster center utilizes the minimization of the objective function, which is considered as the squared separation between each group center and information point and is weighted by its memberships.

2.7. Performance indicators

The accuracy of the hybrid (i.e. GS-GMDH and ANFIS-FCM) and standalone (i.e. GMDH and GEP) models developed for water level prediction at both study stations were evaluated by using four performance or statistical indicators i.e. Root Mean Square Error (RMSE) (Malik et al., Citation2019b; Pham et al., Citation2021; Sammen et al., Citation2020), Nash-Sutcliffe efficiency (NSE) (Nash & Sutcliffe, Citation1970), Pearson Correlation Coefficient (PCC) (Adnan et al., Citation2019; Malik & Kumar, Citation2020), and Willmott Index (WI) (Willmott, Citation1981), and through graphical inspection (time-variation plot, scatter plot, box–whisker plot, and Taylor diagram). The RMSE, NSE, PCC, and WI are stated as (6) $\begin{aligned} RMSE & = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(W L_{obs, i} - W L_{pre, i})}^{2}} \end{aligned}$ (6) (7) $\begin{aligned} NSE & = 1 - [\frac{\sum_{i = 1}^{N} {(W L_{obs, i} - W L_{pre, i})}^{2}}{\sum_{i = 1}^{N} {(W L_{obs, i} - \bar{W L_{obs}})}^{2}}] \end{aligned}$ (7) (8) $\begin{aligned} PCC & = \frac{\begin{matrix} \sum_{i = 1}^{N} (W L_{obs, i} - \bar{W L_{obs}}) \\ (W L_{pre, i} - \bar{W L_{pre}}) \end{matrix}}{\sqrt{\begin{matrix} \sum_{i = 1}^{N} {(W L_{obs, i} - \bar{W L_{obs}})}^{2} \\ \sum_{i = 1}^{N} {(W L_{pre, i} - \bar{W L_{pre}})}^{2} \end{matrix}}} \end{aligned}$ (8) (9) $\begin{aligned} WI & = 1 - [\frac{\sum_{i = 1}^{N} {(W L_{pre, i} - W L_{obs, i})}^{2}}{\begin{matrix} \sum_{i = 1}^{N} (| W L_{pre, i} - \bar{W L_{obs}} | \\ + | W L_{obs, i} - \bar{W L_{obs}} |)^{2} \end{matrix}}] \end{aligned}$ (9) where $N, W L_{obs}, W L_{pre}, \bar{W L_{obs}}$ and $\bar{W L_{pre}}$ are the data points, observed and predicted water level (WL) values for the ith observations, and mean of observed and predicted WL values, respectively. In general, if the applied models follow the criteria of higher values of NSE, PCC, and WI, and the lower value of RMSE designated a relatively better model for WL prediction at study stations. These four statistical indicators are commonly used performance indicators in assessing model performance, which has proven their values in previous studies. They are used together in this study because each of them has both advantages and disadvantages. The use of all four indicators will ensure that an all-around assessment can be made of the model performance.

3. Results and discussion

3.1. Performance assessment using statistical metrics

Four different machine learning techniques, namely, GEP, GMDH, GS-GMDH, and ANFIS-FCM were employed to predict the daily WL for two stations of Cameron Highlands in Malaysia. Table present the results of the performance indices (i.e. RMSE, NSE, PCC, and WI) of the GEP, GMDH, GS-GMDH, and ANFIS-FCM models at Telom station during the validation period under three different scenarios. It is clear from Table that the values of RMSE, NSE, PCC and WI found in the range of 87.340–88.552 m, 0.755–0.761, 0.869–0.873 and 0.923–0.927 for scenario-1, 84.203–84.704 m, 0.769–0.772, 0.878–0.879 and 0.929–0.931 for scenario-2, and 82.987–83.616 m, 0.763–0.767, 0.874–0.876 and 0.927–0.929 for scenario-3, respectively. According to the Table , the GS-GMDH models had better performance for all three scenarios, but optimal results yielded under scenario-3 with RMSE = 82.987 m, NSE = 0.767, PCC = 0.876 and WI = 0.929. Likewise, the GS-GMDH model follows the criteria of lower values of RMSE, and higher values of NSE, PCC, and WI for all three scenarios and designated the first (or highest) rank for water level prediction. Similarly, the ANFIS-FCM model closely follows the GS-GMDH model, while the GMDH and GEP models had similar performance for water level prediction for the Telom station.

Table 4. Performance indicators of hybrid and simple ML models at Telom station during the validation phase.

Download CSV Display Table

Similarly, Table summaries the results of GEP, GMDH, GS-GMDH, and ANFIS-FCM models at Bertam station during validation phase. It was noted from Table that the values of RMSE, NSE, PCC and WI found in the range 48.148–49.86 m, 0.646–0.656, 0.804–0.810 and 0.882–0.887 for scenario-1, 49.047–49.766 m, 0.639–0.649, 0.799–0.806 and 0.882–0.884 for scenario-2, and 48.143–49.041 m, 0.680–0.691, 0.825–0.832 and 0.895–0.900 for scenario-3, respectively. Besides, the GS-GMDH models had better performance for all three scenarios, but improved results produced under scenario-3 (RMSE = 48.143 m, NSE = 0.691, PCC = 0.832 and WI = 0.900).

Table 5. Performance indicators of hybrid and simple ML models at Bertam station during the validation phase.

Download CSV Display Table

Furthermore, the prediction accuracy of the GS-GMDH model improved by 0.81%, 1.37%, 1.00% in scenario-1; 0.44%, 0.23%, 0.59% in scenario-2, and 0.17%, 0.97%, 0.75% in scenario-3 with respect to RMSE over GMDH, ANFIS-FCM and GEP models at Telom station. Likewise, the prediction accuracy of the GS-GMDH model enhanced by 1.43%, 0.47%, 1.17% in scenario-1, 1.44%, 0.30%, 1.26% in scenario-2, and 1.83%, 0.49%, 0.73% in scenario-3 regarding the RMSE over GMDH, ANFIS-FCM and GEP models at Bertam station. Therefore, for the Telom and Bertam stations, the obtained results indicate that the best performance was attained under scenario-3 where the data was divided by 70% for the calibration and the remaining 30% for validating the models.

3.2. Performance assessment using graphical interpretation

Besides the statistical assessment of the results, graphical methods have been widely used for model assessment. Accordingly, three different graphical methods namely temporal and scatter plots, Box–Whisker plot, and Taylor diagram were adopted in the study to assess the model performance graphically. Figures and illustrate the temporal and scatter plots of GS-GMDH, GMDH, ANFIS-FCM, and GEP models under scenario-1, scenario-2, and scenario-3 during the validation period at Telom and Bertam stations, respectively. As can see from these two figures that the GS-GMDH model had a higher value of the coefficient of determination: R² = 0.7619, 0.7724, and 0.7679 for scenario-1, scenario-2, and scenario-3 respectively at Telom station. Similarly, the high value of R² = 0.6562, 0.6494, and 0.6916 for scenario-1, scenario-2, and scenario-3, respectively was obtained when the GS-GMDH model was applied at Bertam station.

Figure 5. Comparison of observed and predicted WL values by GS-GMDH, GMDH, ANFIS-FCM, and GEP models under (a) scenario-1, (b) scenario-2, and (c) scenario-3 during validation period at Telom station.

Figure 6. Comparison of observed and predicted WL values by GSGMDH, GMDH, ANFIS-FCM, and GEP models over (a) scenario-1, (b) scenario-2, and (c) scenario-3 during validation period at Bertam station.

Furthermore, the performance of the GS-GMDH, GMDH, ANFIS-FCM, and GEP models in this study was evaluated by using the Box–Whisker plot. According to this diagram, it is easy to explain if there is any skew in the distribution of the data or there are any outliers. Figures and display the Box–Whisker plots for Telom and Bertam, respectively. In these figures, the distribution of the predicted values over the observed values during the validation period was explained. It was seen from the figures the distributional variation among predicted vs observed water level values were relatively minor. Therefore, the verdict based on performance measures (RMSE, NSE, PCC, and WI) and graphical inspection (coefficient of determination of regression line in scatter plots) showed the better water level prediction accuracy of the hybrid GS-GMDH model than the GMDH, ANFIS-FCM, and GEP models.

Figure 7. Box-Whisker plot of observed and predicted WL by GS-GMDH, GMDH, ANFIS-FCM, and GEP models in (a) scenario-1, (b) scenario-2, and (c) scenario-3 during validation period at Telom station.

Figure 8. Box-Whisker plot of observed and predicted WL by GS-GMDH, GMDH, ANFIS-FCM, and GEP models in (a) scenario-1, (b) scenario-2, and (c) scenario-3 during validation period at Bertam station.

Likewise, the Taylor diagram (Taylor, Citation2001), an association of standard deviation, RMSE, and the correlation coefficient was employed to display the spatial variation of predicted water level using all four models in three different scenarios over the observed one in a single topology. Figures and demonstrate the Taylor diagram for the relative performance at Telom and Bertam sites, respectively. These diagrams clearly show the better performance of the GS-GMDH model for both stations. It is clear from Figures and that the obtained results by the GS-GMDH models are closer to the observed values of water level prediction and it has the superior performance as discussed before in the previous section.

Figure 9. Taylor diagram of GS-GMDH, GMDH, ANFIS-FCM, and GEP models under (a) scenario-1, (b) scenario-2, and (c) scenario-3 during validation phase at Telom station.

Figure 10. Taylor diagram of GSGMDH, GMDH, ANFIS-FCM, and GEP models under (a) scenario-1, (b) scenario-2, and (c) scenario-3 during validation phase at Bertam station.

The final equation of the GEP for both stations are provided as follow: (10) $\begin{aligned} W L (t)_T e l o m & = ((W L (t - 3) - W L (t - 1)) \\ / (- 4.87 / (((W L (t - 3) - W L (t - 1)) \\ - W L (t - 1)) / (W L (t - 3) + 9.44)))) \\ + ((((((W L (t - 2) + W L (t - 6)) \\ * - 5.32) / W L (t - 4)) * W L (t - 2)) \\ / W L (t - 5)) - 5.32) \\ + (s q r t (W L (t - 6)) + W L (t - 1)) \\ + (((8.82 - (W L (t - 3)^{(1 / 3)})) \\ * (s q r t (W L (t - 1)) * W L (t - 1))) \\ / W L (t - 3)) + ((- 1.61)^{3}) \end{aligned}$ (10) (11) $\begin{aligned} WL (t)_Bertam & = 2 + ((((WL (t - 5) - WL (t - 1)) \\ - (- 0.56 + WL (t - 1))) \\ + (WL (t - 3) - WL (t - 1))) / 8.88) \\ + ((((7.49 + WL (t - 3)) \\ + (WL (t - 2) + 7.49)) / ((WL (t - 2) \\ - WL (t - 3)) - WL (t - 5))) \\ + WL (t - 1)) \end{aligned}$ (11)

The results of the current research were compared with existing studies on water level prediction by employing machine learning techniques. Altunkaynak (Citation2019) predicted monthly WL in Lake Van, Tukey by employing the multilayer perceptron (MLP), wavelet-MLP (W-MLP), and MLP-ASA (additive season algorithm). Their prediction performance was evaluated using RMSE and NSE criteria. They found that the MLP-ASA model (RMSE = 3.550 cm, NSE = 0.992) outperformed the other models. Alizamir et al. (Citation2020) employed a deep echo state network (DESN) to predict the monthly WL of lake Van (Turkey), and its outcomes were compared against the ANN, extreme learning machine (ELM), and regression tree (RT) based on RMSE, R², and NSE performance indicators. The investigation shows better performance of the DESN model with RMSE = 0.025 m, NSE = 0.998, and R² = 0.998 than the ANN, ELM, and RT models. Nhu et al. (Citation2020) applied four decision tree-based algorithms i.e. M5 pruned (M5P), random forest (RF), RT, and reduced error pruning tree (REPT) for predicting the daily WL in Zrebar Lake, Iran during 2011–2017. These models were optimized with 70% data for training and 30% data for testing. Their performance was evaluated using RMSE, MAE, (mean absolute error), R², PBIAS (percent bias), and RSR (ratio of RMSE to standard deviation of observed data) indicators and graphical interpretation (Taylor and Box plots). They reported the M5P model produced better estimates (R² = 0.99, RMSE = 0.05 m, MAE = 0.01 m, NSE = 0.98, PBIAS = 0.00, RSR = 0.11) than the other models. Yaseen et al. (Citation2020a) examined the comparative potential of the hybrid MLP-WOA (Whale Optimization Algorithm) model against the CCNN (Cascade Correlation Neural Network), SOM (Self Organizing Map), DTR (Decision Tree Regression), RFR (Random Forest Regression), and Classical MLP to predict the monthly WL of Van Lake, Turkey. The comparison demonstrates that the hybrid MLP-WOA models produced superior prediction (RMSE = 0.047 m, MAE = 0.035 m, NSE = 0.969, WI = 0.992, Legate McCabe’s Index: LMI = 0.836, and R²= 0.970) over other models during validation phase. Zhu et al. (Citation2020) forecasted the monthly WL of 69 temperate lakes in Poland by utilizing Feed Forward Neural Network (FFNN), and Long Short-Term Memory (LSTM) models. Their finding reveals the better feasibility of FFNN and LSTM models in forecasting the monthly WL of the 69 lakes. Overall, the findings of the listed studies and current research confirm the superiority of ML models in predicting the daily/ monthly Lake water levels.

4. Conclusions

The present study has presented two new hybrid machine learning models, namely, generalized structure with GMDH algorithm (GS-GMDH) and adaptive neuro-fuzzy inference system with fuzzy C-means (ANFIS-FCM) for daily water level prediction at Telom, and Bertam stations positioned on Perak River in Malaysia. The performance of the hybrid models was compared with standalone models i.e. GEP and GMDH. To meet the objectives, the daily water level data of two stations for the Cameron Highlands in Malaysia were used. In addition, three different percentage ratios were used to divide the data for calibration and validation sets which include 50%–50%, 60%–40%, and 70%–30%, respectively. The results of the analysis reveal that the performance of the GMDH model could be enhanced with a new general structure (GS-GMDH) model. According to the best performance, the models were ordered as GS-GMDH > ANFIS-FCM > GEP > GMDH for both study locations. In addition, the results of the hybrid GS-GMDH model can be utilized to formulate the smart and truthful intelligent system for managing the water-related operations over the study sites.

Acknowledgements

The authors are grateful to the Tenaga Nasional Berhad (TNB) (Malaysia) for providing the original dataset and Universiti Tenaga Nasional (UNITEN) for providing the financial support for this study.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

Adnan, R. M., Malik, A., Kumar, A, Parmar, K. S., Kisi, O. (2019). Pan evaporation modeling by three different neuro-fuzzy intelligent systems using climatic inputs. Arabian Journal of Geosciences, 12(19), 606. https://doi.org/https://doi.org/10.1007/s12517-019-4781-6
Web of Science ®Google Scholar
Alizamir, M., Kisi, O., Kim, S., & Heddam, S. (2020). A novel method for lake level prediction: Deep echo state network. Arabian Journal of Geosciences, 13(18). https://doi.org/https://doi.org/10.1007/s12517-020-05965-9
Google Scholar
Altunkaynak, A. (2019). Predicting water level fluctuations in Lake Van using hybrid season-neuro approach. Journal of Hydrologic Engineering, 24(8), 04019021. https://doi.org/https://doi.org/10.1061/(asce)he.1943-5584.0001804
Web of Science ®Google Scholar
Alvisi, S., Mascellani, G., Franchini, M., & Bárdossy, A. (2006). Water level forecasting through fuzzy logic and artificial neural network approaches. Hydrology and Earth System Sciences, 10 (1), 1–17. https://doi.org/https://doi.org/10.5194/hess-10-1-2006
Web of Science ®Google Scholar
Arbain, S. H., & Wibowo, A. (2012). Time Series methods for water level forecasting of Dungun River in Terengganu Malaysia. Int J Eng Sci Technol, 4, 1803–1811.
Google Scholar
Ashrafzadeh, A., Malik, A., Jothiprakash, V., et al. (2020). Estimation of daily pan evaporation using neural networks and meta-heuristic approaches. ISH Journal of Hydraulic Engineering, 26(4), 421–429. https://doi.org/https://doi.org/10.1080/09715010.2018.1498754
Google Scholar
Azimi, H., Bonakdari, H., & Ebtehaj, I. (2017). A highly efficient gene expression programming model for predicting the discharge coefficient in a side weir along a trapezoidal canal. Irrigation and Drainage, 66(4), 655–666. https://doi.org/https://doi.org/10.1002/ird.2127
Web of Science ®Google Scholar
Bezdek, J. C., Coray, C., Gunderson, R., & Watson, J. (1981). Detection and characterization of cluster substructure I. Linear structure: Fuzzy c -lines. SIAM Journal on Applied Mathematics, 40(2), 339–357. https://doi.org/https://doi.org/10.1137/0140029
Web of Science ®Google Scholar
Buslima, F. S., Omar, R. C., Jamaluddin, T. A., & Taha, H. (2018). Flood and flash flood geo-hazards in Malaysia. International Journal of Engineering & Technology, 7(4.35), 760–764. https://doi.org/https://doi.org/10.14419/ijet.v7i4.35.23103
Google Scholar
Cai, W., Chen, S., & Zhang, D. (2007). Fast and robust fuzzy c-means clustering algorithms incorporating local information for image segmentation. Pattern Recognition, 40(3), 825–838. https://doi.org/https://doi.org/10.1016/j.patcog.2006.07.011
Web of Science ®Google Scholar
Daliakopoulos, I. N., Coulibaly, P., & Tsanis, I. K. (2005). Groundwater level forecasting using artificial neural networks. Journal of Hydrology, 309(1–4), 229–240. https://doi.org/https://doi.org/10.1016/j.jhydrol.2004.12.001
Web of Science ®Google Scholar
Dingman, S. L., & Bjerklie, D. M. (2005). Estimation of river discharge. Encycl Hydrol Sci John Wiley Sons.
Google Scholar
Ebtehaj, I., Bonakdari, H., Zaji, A. H., et al. (2015a). Gene expression programming to predict the discharge coefficient in rectangular side weirs. Applied Soft Computing, 35, 618–628. https://doi.org/https://doi.org/10.1016/j.asoc.2015.07.003
Web of Science ®Google Scholar
Ebtehaj, I., Bonakdari, H., Zaji, A. H., et al. (2015b). GMDH-type neural network approach for modeling the discharge coefficient of rectangular sharp-crested side weirs. Engineering Science and Technology, an International Journal, 18(4), 746–757. https://doi.org/https://doi.org/10.1016/j.jestch.2015.04.012
Google Scholar
Ehteram, M., Ferdowsi, A., Faramarzpour, M., et al. (2021). Hybridization of artificial intelligence models with nature inspired optimization algorithms for lake water level prediction and uncertainty analysis. Alexandria Engineering Journal, 60(2), 2193–2208. https://doi.org/https://doi.org/10.1016/j.aej.2020.12.034
Web of Science ®Google Scholar
Eisakhani, M., & Malakahmad, A. (2009). Water quality assessment of Bertam River and its tributaries in Cameron highlands. Malaysia. World Appl Sci J, 7, 769–776.
Google Scholar
Farlow, S. J. (1981). The GMDH algorithm of Ivakhnenko. The American Statistician, 35(4), 210. https://doi.org/https://doi.org/10.2307/2683292
Web of Science ®Google Scholar
Ferreira, C. (2002). Gene expression programming in problem solving. Soft Computing and Industry.
Google Scholar
Ferreira, C. (2006). Gene Expression Programming Mathematical Modeling by an Artificial Intelligence.
Google Scholar
Firat, M. (2008). Comparison of artificial intelligence techniques for river flow forecasting. Hydrology and Earth System Sciences, 12(1), 123–139 https://doi.org/https://doi.org/10.5194/hess-12-123-2008
Web of Science ®Google Scholar
Gasim, M. B., Adam, J. H., Toriman, M. E. H., et al. (2007). Coastal flood phenomenon in terengganu. Malaysia: Special Reference to Dungun. Res J Environ Sci, 1, 102–109. https://doi.org/https://doi.org/10.3923/rjes.2007.102.109
Google Scholar
Hadi, S. J., Abba, S. I., Sammen, S. S. H., et al. (2019). Non-linear input variable selection approach integrated with non-tuned data intelligence model for streamflow pattern simulation. IEEE Access, 7, 141533–141548. https://doi.org/https://doi.org/10.1109/ACCESS.2019.2943515
Web of Science ®Google Scholar
Hettiarachchi, N., & Thilakumara, R. P. (2014). Water Level Forecasting and Flood Warning System A Neuro-Fuzzy Approach. International Conference in Video Image and Signal Processing, at India.
Google Scholar
Ivakhnenko, A. G. (1971). Polynomial theory of complex systems. IEEE Transactions on Systems, Man, and Cybernetics, SMC-1(4), 364–378. https://doi.org/https://doi.org/10.1109/TSMC.1971.4308320
Google Scholar
Kaloop, M. R., El-Diasty, M., & Hu, J. W. (2017). Real-time prediction of water level change using adaptive neuro-fuzzy inference system. Geomatics, Natural Hazards and Risk, 8(2), 1320–1332. https://doi.org/https://doi.org/10.1080/19475705.2017.1327464
Web of Science ®Google Scholar
Khairuddin, N., Aris, A. Z., Elshafie, A., et al. (2019). Efficient forecasting model technique for river stream flow in tropical environment. Urban Water Journal, 16(3), 183–192. https://doi.org/https://doi.org/10.1080/1573062X.2019.1637906
Web of Science ®Google Scholar
Khan, M. S., & Coulibaly, P. (2006). Application of support vector machine in lake water level prediction. Journal of Hydrologic Engineering, 11(3), 199–205. https://doi.org/https://doi.org/10.1061/(ASCE)1084-0699(2006)11:3(199)
Web of Science ®Google Scholar
Kiafar, H., Babazadeh, H., Marti, P., et al. (2017). Evaluating the generalizability of GEP models for estimating reference evapotranspiration in distant humid and arid locations. Theoretical and Applied Climatology, 130, 377–389. https://doi.org/https://doi.org/10.1007/s00704-016-1888-5
Web of Science ®Google Scholar
Kisi, O., & Zounemat-Kermani, M. (2016). Suspended sediment modeling using neuro-fuzzy embedded fuzzy c-means clustering technique. Water Resources Management, 30(11), 3979–3994 https://doi.org/https://doi.org/10.1007/s11269-016-1405-8
Web of Science ®Google Scholar
Lin, J.-Y., Cheng, C.-T., & Chau, K.-W. (2006). Using support vector machines for long-term discharge prediction. Hydrological Sciences Journal, 51(4), 599–612. https://doi.org/https://doi.org/10.1623/hysj.51.4.599
Web of Science ®Google Scholar
Liong, S. Y., & Sivapragasam, C. (2002). Flood stage forecasting with support vector machines. Journal of the American Water Resources Association, 38(1), 173–186. https://doi.org/https://doi.org/10.1111/j.1752-1688.2002.tb01544.x
Web of Science ®Google Scholar
Madala, H. R., & Ivakhnenko, A. G. (2019). Inductive learning algorithms for complex systems modeling.
Google Scholar
Malik, A., & Kumar, A. (2020). Meteorological drought prediction using heuristic approaches based on effective drought index: A case study in uttarakhand. Arabian Journal of Geosciences, 13(1), 1–17. https://doi.org/https://doi.org/10.1007/s12517-020-5239-6
Google Scholar
Malik, A., Kumar, A., Ghorbani, M. A., et al. (2019a). The viability of co-active fuzzy inference system model for monthly reference evapotranspiration estimation: Case study of Uttarakhand State. Hydrology Research, 50(6), 1623–1644. https://doi.org/https://doi.org/10.2166/nh.2019.059
Web of Science ®Google Scholar
Malik, A., Kumar, A., & Kisi, O. (2017). Monthly pan-evaporation estimation in Indian central Himalayas using different heuristic approaches and climate based models. Computers and Electronics in Agriculture, 143, 302–313. https://doi.org/https://doi.org/10.1016/j.compag.2017.11.008
Web of Science ®Google Scholar
Malik, A., Kumar, A., & Kisi, O. (2018). Daily pan evaporation estimation using heuristic methods with Gamma test. Journal of Irrigation and Drainage Engineering, 144(9), 04018023. https://doi.org/https://doi.org/10.1061/(ASCE)IR.1943-4774.0001336
Web of Science ®Google Scholar
Malik, A., Kumar, A., Kisi, O., & Shiri, J. (2019b). Evaluating the performance of four different heuristic approaches with Gamma test for daily suspended sediment concentration modeling. Environmental Science and Pollution Research, 26(22), 22670–22687. https://doi.org/https://doi.org/10.1007/s11356-019-05553-9
PubMed Web of Science ®Google Scholar
Malik, A., Kumar, A., Rai, P., & Kuriqi, A. (2021a). Prediction of multi-scalar standardized precipitation index by using artificial intelligence and regression models. Climate, 9(2), 28. https://doi.org/https://doi.org/10.3390/cli9020028
Web of Science ®Google Scholar
Malik, A., Kumar, A., Salih, S. Q., et al. (2020a). Drought index prediction using advanced fuzzy logic model: Regional case study over kumaon in India. PLoS One, 15(5), e0233280. https://doi.org/https://doi.org/10.1371/journal.pone.0233280
PubMed Web of Science ®Google Scholar
Malik, A., Kumar, A., & Singh, R. P. (2019c). Application of heuristic approaches for prediction of hydrological drought using multi-scalar Streamflow drought index. Water Resources Management, 33(11), 3985–4006. https://doi.org/https://doi.org/10.1007/s11269-019-02350-4
Web of Science ®Google Scholar
Malik, A., Tikhamarine, Y., Sammen, S. S., et al. (2021b). Prediction of meteorological drought by using hybrid support vector regression optimized with HHO versus PSO algorithms. Environmental Science and Pollution Research, 28(29), 39139–39158. https://doi.org/https://doi.org/10.1007/s11356-021-13445-0
PubMed Web of Science ®Google Scholar
Malik, A., Tikhamarine, Y., Souag-Gamane, D., et al. (2020b). Support vector regression optimized by meta-heuristic algorithms for daily streamflow prediction. Stochastic Environmental Research and Risk Assessment, 34(11), 1755–1773. https://doi.org/https://doi.org/10.1007/s00477-020-01874-1
Web of Science ®Google Scholar
Malik, A., Tikhamarine, Y., Souag-Gamane, D., et al. (2021c). Support vector regression integrated with novel meta-heuristic algorithms for meteorological drought prediction. Meteorology and Atmospheric Physics, 133(3), 891–909. https://doi.org/https://doi.org/10.1007/s00703-021-00787-0
Web of Science ®Google Scholar
Mohamadi, S., Sammen, S. S., Panahi, F., et al. (2020). Zoning map for drought prediction using integrated machine learning models with a nomadic people optimization algorithm. Natural Hazards, 104(1), 537–579. https://doi.org/https://doi.org/10.1007/s11069-020-04180-9
Web of Science ®Google Scholar
Mohammad-Azari, S., Bozorg-Haddad, O., & Loáiciga, H. A. (2020). State-of-art of genetic programming applications in water-resources systems analysis. Environ. Monit. Assess, 192, 73 (2020). https://doi.org/https://doi.org/10.1007/s10661-019-8040-9.
Google Scholar
Moosavi, V., Talebi, A., & Hadian, M. R. (2017). Development of a hybrid wavelet packet-group method of data handling (WPGMDH). Model for Runoff Forecasting. Water Resour Manag, 31(1), 43–59. https://doi.org/https://doi.org/10.1007/s11269-016-1507-3
Google Scholar
Morales-Pinzón, T., Céspedes-Restrepo, J. D., & Flórez-Calderón, M. T. (2015). Daily river level forecast based on the development of an artificial neural network case study in La Virginia-Risaralda. Revista Facultad de Ingeniería Universidad de Antioquia, 2015(76), 46–57. https://doi.org/https://doi.org/10.17533/udea.redin.n76a06
Google Scholar
Nash, J. E., & Sutcliffe, J. V. (1970). River flow forecasting through conceptual models part I – a discussion of principles. Journal of Hydrology, 10(3), 282–290. https://doi.org/https://doi.org/10.1016/0022-1694(70)90255-6
Google Scholar
Nasidi, N. M., Wayayok, A., Abdullah, A. F., et al. (2021). Dynamics of potential precipitation under climate change scenarios at Cameron highlands. SN Applied Sciences, 3(3), 334. https://doi.org/https://doi.org/10.1007/s42452-021-04332-x
Web of Science ®Google Scholar
Nhu, V. H., Shahabi, H., Nohani, E., et al. (2020). Daily water level prediction of zrebar lake (Iran): A comparison between m5p, random forest, random tree and reduced error pruning trees algorithms. ISPRS International Journal of Geo-Information, 9(8), 479. https://doi.org/https://doi.org/10.3390/ijgi9080479
Web of Science ®Google Scholar
Parsaie, A., Yonesi, H. A., & Najafian, S. (2015). Predictive modeling of discharge in compound open channel by support vector machine technique. Modeling Earth Systems and Environment, 1(1–2). https://doi.org/https://doi.org/10.1007/s40808-015-0002-9
Google Scholar
Pham, Q. B., Mohammadpour, R., Linh, N. T. T., et al. (2021). Application of soft computing to predict water quality in wetland. Environmental Science and Pollution Research, 28(1), 185–200. https://doi.org/https://doi.org/10.1007/s11356-020-10344-8
PubMed Web of Science ®Google Scholar
Qin, W., Wang, L., Lin, A., et al. (2018). Comparison of deterministic and data-driven models for solar radiation estimation in China. Renew. Sustain. Energy Rev, 81(1), 579–594. https://doi.org/https://doi.org/10.1016/j.rser.2017.08.037
Google Scholar
Rehamnia, I., Benlaoukli, B., Jamei, M., et al. (2021). Simulation of seepage flow through embankment dam by using a novel extended Kalman filter based neural network paradigm: Case study of Fontaine Gazelles Dam, Algeria. Measurement, 176, 109219. https://doi.org/https://doi.org/10.1016/j.measurement.2021.109219
Web of Science ®Google Scholar
Reza, M., Harun, S., & Askari, M. (2018). Streamflow forecasting in Bukit Merah Watershed by using Arima and Ann. Portal: Jurnal Teknik Sipil, 9(1), 18–26. https://doi.org/https://doi.org/10.30811/portal.v9i1.612.
Google Scholar
Rigos, A., Krommyda, M., Tsertou, A., & Amditis, A. (2020). A polynomial neural network for river’s water-level prediction. SN Applied Sciences, 2(4. https://doi.org/https://doi.org/10.1007/s42452-020-2328-9.
Google Scholar
Sammen, S. S., Ghorbani, M. A., Malik, A., Tikhamarine, Y., AmirRahmani, M., Al-Ansari, N., Chau, K. W., (2020). Enhanced artificial neural network with Harris hawks optimization for predicting scour depth downstream of ski-jump spillway. Applied Sciences, 10(15), 5160. https://doi.org/https://doi.org/10.3390/app10155160
Google Scholar
Sammen, S. S., Mohamed, T. A., Ghazali, A. H., El-Shafie, A. H., Sidek, L. M., (2017). Generalized regression neural network for prediction of peak outflow from dam breach. Water Resources Management, 31(1), 549–562. https://doi.org/https://doi.org/10.1007/s11269-016-1547-8
Web of Science ®Google Scholar
Schmidhuber, J. (2015). Deep learning in neural networks: An overview. Neural Networks, 61, 85–117. https://doi.org/https://doi.org/10.1016/j.neunet.2014.09.003
PubMed Web of Science ®Google Scholar
See, L., & Openshaw, S. (1999). Applying soft computing approaches to river level forecasting. Hydrological Sciences Journal, 44(5), 763–778. https://doi.org/https://doi.org/10.1080/02626669909492272
Web of Science ®Google Scholar
Shabani, S., Candelieri, A., Archetti, F., & Naser, G. (2018). Gene expression programming coupled with unsupervised learning: A two-stage learning process in multi-scale, short-termwater demand forecasts. Water, 10(2), 142. https://doi.org/https://doi.org/10.3390/w10020142
Web of Science ®Google Scholar
Sihag, P., Al-Janabi, A. M. S., Alomari, N. K., & Ghani, A. A. (2020). Tree regression for river discharge estimation: Application to the Baitarani River Basin. Modeling Earth Systems and Environment. https://doi.org/https://doi.org/10.1007/s40808-020-01045-9
Google Scholar
Singh, A., Malik, A., Kumar, A., & Kisi, O. (2018). Rainfall-runoff modeling in hilly watershed using heuristic approaches with gamma test. Arabian Journal of Geosciences, 11(1), 1–12. https://doi.org/https://doi.org/10.1007/s12517-018-3614-3
Google Scholar
Sobhani, J., Najimi, M., Pourkhorshidi, A. R., & Parhizkar, T. (2010). Prediction of the compressive strength of no-slump concrete: A comparative study of regression, neural network and ANFIS models. Construction and Building Materials, 24(5), 709–718. https://doi.org/https://doi.org/10.1016/j.conbuildmat.2009.10.037
Web of Science ®Google Scholar
Suri, S., Ahmad, F., Yahaya, A. S., Mokhtar, Z. A., Halim, M. H., et al. (2014). Climate change impact on water level in Peninsular Malaysia. 4, 228–232. https://doi.org/https://doi.org/10.5923/c.jce.201402.39
Google Scholar
Tan, K. W., & Beh, W. C. (2015). Water quality monitoring using biological indicators in Cameron Highlands Malaysia. Journal of Sustainable Development, 8(3). https://doi.org/https://doi.org/10.5539/jsd.v8n3p28
Google Scholar
Taylor, K. E. (2001). Summarizing multiple aspects of model performance in a single diagram. Journal of Geophysical Research: Atmospheres, 106(D7), 7183–7192. https://doi.org/https://doi.org/10.1029/2000JD900719
Web of Science ®Google Scholar
Tikhamarine, Y., Malik, A., Kumar, A., Gamane, D. S., Kisi, O, et al. (2019). Estimation of monthly reference evapotranspiration using novel hybrid machine learning approaches. Hydrological Sciences Journal, 64(15), 1824–1842. https://doi.org/https://doi.org/10.1080/02626667.2019.1678750
Web of Science ®Google Scholar
Tikhamarine, Y., Malik, A., Pandey, K., Sammen, S. , Gamane, D. , Heddam, S., Kisi, O. , (2020a). Monthly evapotranspiration estimation using optimal climatic parameters: Efficacy of hybrid support vector regression integrated with whale optimization algorithm. Environmental Monitoring and Assessment, 192(11). https://doi.org/https://doi.org/10.1007/s10661-020-08659-7
Google Scholar
Tikhamarine, Y., Malik, A., Souag-Gamane, D., & Kisi, O. (2020b). Artificial intelligence models versus empirical equations for modeling monthly reference evapotranspiration. Environmental Science and Pollution Research, 27(24), 30001–30019. https://doi.org/https://doi.org/10.1007/s11356-020-08792-3
PubMed Web of Science ®Google Scholar
Tikhamarine, Y., Souag-Gamane, D., Ahmed, A. N., et al. (2020c). Rainfall-runoff modelling using improved machine learning methods: Harris hawks optimizer vs. particle swarm optimization. Journal of Hydrology, 589, 125133. https://doi.org/https://doi.org/10.1016/j.jhydrol.2020.125133
Web of Science ®Google Scholar
Toriman, M. E., Juahir, H., Mokhtar, M., et al. (2009). Predicting for discharge characteristics in Langat River, Malaysia using neural network Application model. Res J Earth Sci.
Google Scholar
Tsujikura, H., Tanaka, K., & Tachikawa, Y. (2016). Development of a water surface level prediction method affected by river mouth sandbar collapse. Procedia Engineering, 154, 1349–1358. https://doi.org/https://doi.org/10.1016/j.proeng.2016.07.491
Google Scholar
Vachtman, D., & Laronne, J. B. (2014). Remotely sensed estimation of water discharge into the rapidly dwindling dead sea. Hydrological Sciences Journal, 59(8), 1593–1605. https://doi.org/https://doi.org/10.1080/02626667.2013.852278
Web of Science ®Google Scholar
Vakhshouri, B., & Nejadi, S. (2018). Prediction of compressive strength of self-compacting concrete by ANFIS models. Neurocomputing, 280, 13–22 https://doi.org/https://doi.org/10.1016/j.neucom.2017.09.099
Web of Science ®Google Scholar
Wang, L., Kisi, O., Hu, B., et al. (2017a). Evaporation modelling using different machine learning techniques. International Journal of Climatology, 37, 1076–1092. https://doi.org/https://doi.org/10.1002/joc.5064
Web of Science ®Google Scholar
Wang, L., Kisi, O., Zounemat-Kermani, M., et al. (2016). Solar radiation prediction using different techniques: Model evaluation and comparison. Renewable and Sustainable Energy Reviews, 61, 384–397. https://doi.org/https://doi.org/10.1016/j.rser.2016.04.024
Web of Science ®Google Scholar
Wang, L., Kisi, O., Zounemat-Kermani, M., et al. (2017b). Prediction of solar radiation in China using different adaptive neuro-fuzzy methods and M5 model tree. International Journal of Climatology, 37(3), 1141–1155. https://doi.org/https://doi.org/10.1002/joc.4762
Web of Science ®Google Scholar
Wang, L., Kisi, O., Zounemat-Kermani, M., & Li, H. (2017c). Pan evaporation modeling using six different heuristic computing methods in different climates of China. Journal of Hydrology, 544, 407–427. https://doi.org/https://doi.org/10.1016/j.jhydrol.2016.11.059
Web of Science ®Google Scholar
Willmott, C. J. (1981). On the validation of models. Physical Geography, 2(2), 184–194. https://doi.org/https://doi.org/10.1080/02723646.1981.10642213
Google Scholar
Xu, G., Cheng, Y., Liu, F., et al. (2019). A water level prediction model based on ARIMA-RNN. 2019 IEEE Fifth International Conference on Big Data Computing Service and Applications (BigDataService) (pp. 221–226).
Google Scholar
Yaseen, Z. M., Naghshara, S., Salih, S. Q., et al. (2020a). Lake water level modeling using newly developed hybrid data intelligence model. Theoretical and Applied Climatology, 141(3–4), 1285–1300. https://doi.org/https://doi.org/10.1007/s00704-020-03263-8
Web of Science ®Google Scholar
Yaseen, Z. M., Sihag, P., Yusuf, B., & Al-Janabi, A. M. S. (2020b). Modelling infiltration rates in permeable stormwater channels using soft computing techniques*. Irrigation and Drainage, 70(1), 117–130. https://doi.org/https://doi.org/10.1002/ird.2530
Web of Science ®Google Scholar
Young, C. C., Liu, W. C., & Hsieh, W. L. (2015). Predicting the water level fluctuation in an alpine Lake using physically based, artificial neural network, and time series forecasting models. Mathematical Problems in Engineering, 2015, 1–11. https://doi.org/https://doi.org/10.1155/2015/708204
Web of Science ®Google Scholar
Yuan, Z., Wang, L. N., & Ji, X. (2014). Prediction of concrete compressive strength: Research on hybrid models genetic based algorithms and ANFIS. Advances in Engineering Software, 67, 156–163. https://doi.org/https://doi.org/10.1016/j.advengsoft.2013.09.004
Web of Science ®Google Scholar
Zhu, S., Hrnjica, B., Ptak, M., et al. (2020). Forecasting of water level in multiple temperate lakes using machine learning models. Journal of Hydrology, 585, 124819. https://doi.org/https://doi.org/10.1016/j.jhydrol.2020.124819
Web of Science ®Google Scholar

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Prediction of daily water level using new hybridized GS-GMDH and ANFIS-FCM models

Abstract

1. Introduction