834
Views
14
CrossRef citations to date
0
Altmetric
Original Articles

New fuzzy neural network–Markov model and application in mid- to long-term runoff forecast

, , &
Pages 1157-1169 | Received 03 Jan 2014, Accepted 18 Aug 2014, Published online: 08 Mar 2016

ABSTRACT

In this paper, a mid- to long-term runoff forecast model is developed using an ideal point fuzzy neural network–Markov (NFNN-MKV) hybrid algorithm to improve the forecasting precision. Combining the advantages of the new fuzzy neural network and the Markov prediction model, this model can solve the problem of stationary or volatile strong random processes. Defined error statistics algorithms are used to evaluate the performance of models. A runoff prediction for the Si Quan Reservoir is made by utilizing the modelling method and the historical runoff data, with a comprehensive consideration of various runoff-impacting factors such as rainfall. Compared with the traditional fuzzy neural networks and Markov prediction models, the results show that the NFNN-MKV hybrid algorithm has good performance in faster convergence, better forecasting accuracy and significant improvement of neural network generalization. The absolute percentage error of the NFNN-MKV hybrid algorithm is less than 7.0%, MSE is less than 3.9, and qualification rate reaches 100%. For further comparison of the proposed model, the NFNN-MKV model is employed to estimate (training and testing for 120-month-ahead prediction) and predict river discharge for 156 months at Weijiabao on the Weihe River in China. Comparisons among the results of the NFNN-MKV model, the WNN model and the SVR model indicate that the NFNN-MKV model is able to significantly increase prediction accuracy.

Editor D. Koutsoyiannis; Associate editor Y. Gyasi-Agyei

1 Introduction

Mid- to long-term runoff forecasting is important for the study and guidance of water resources management. However, accurate forecasts are difficult because hydrological signals are nonlinear, highly complex and have severe variations in time and space (Yang et al. Citation2013). At present, the research in mid- to long-term runoff prediction is still in the development stage. A variety of mid- to long-term runoff forecasting models have been developed based on physical considerations or other numerical theories, such as Markov prediction models (Wang et al. Citation2005) based on Markov chain theory; linear regressive analysis methods and nonlinear regressive analysis methods (Shamseldin et al. Citation1997) based on stochastic theory; SVM (support vector machine) models (Lin et al. Citation2006) based on nonlinear time series analysis; fuzzy prediction models (Nayak et al. Citation2004) based on fuzzy theory; and ANN (artificial neural network) models (Dibike and Solomatine Citation2001) based on black-box theory. However, because of some inaccurate initial conditions and limited parameterization schemes, those models are generally insufficient for making accurate predictions (Gardner et al. Citation2009, Shi Citation2010b).

It is a hot issue for scholars all over the world to improve the accuracy of mid- to long-term runoff prediction. A variety of methods have been developed to improve the accuracy of the predictions, including physical models, time series methods, conceptual models, ANN models, and many hybrid models. Now, models integrating two or more of these approaches have been proposed as predictors to improve the accuracy of mid- to long-term runoff forecasts, such as multiple regression–ANN models (Elshorbagy et al. Citation2000), fuzzy pattern recognition models (Xiong et al. Citation2001), the wavelet–ANN model (Anctil and Tape Citation2004, Adamowski et al. Citation2012, Wei et al. Citation2012), the wavelet–ANFIS hybrid model (Moosavi et al. Citation2013, Nourani et al. Citation2013), the wavelet–neuron fuzzy model (Partal and Kişi Citation2007, Shiri and Kisi Citation2010), the fuzzy–SVM model (Guo et al. Citation2010, Hu et al. Citation2012), the support vector regression (SVR) model (Lin et al. Citation2006, Hong and Pai Citation2007, Wu et al. Citation2008, Behzad et al. Citation2009) and the wavelet regression model (Kişi Citation2009, Citation2010, Citation2011). These hybrid models have shown different advantages for accurate predictions due to their capabilities in utilizing present information effectively. Among these hybrid models, FNN (fuzzy neural network) and ANN models are the most popularly utilized sub-models for signal forecasting due to their capability of effectively learning complex and nonlinear relationships (Deka and Chandramouli Citation2003). The wavelet–ANN (WNN) model (Wei et al. Citation2012, Citation2013) has been popularly used in hydrological multiple forecasts in recent years by a number of researchers (Adamowski and Sun Citation2010, Yang et al. Citation2013). The SVR model has been successfully used in the hydrological sciences in recent years (Behzad et al. Citation2009). However, high nonlinearity and nonstationarity are properties of runoff time series, and it is still difficult for these hybrid models to replicate these properties.

As mentioned above, these methods have both advantages and disadvantages. Regression analysis can solve the transient nonequilibrium relationship between climate and runoff. The pattern recognition method is suitable for small regional runoff forecasts, but it is unable to deal with runoff forecasts in large regions. A fuzzy system can deal with the nonlinear characteristics, but it lacks the qualification of self-learning. At the same time, how to generate and adjust the membership functions and fuzzy rules (Kwan and Cai Citation1994, Wang and Hung Citation2013, Shi Citation2010a) automatically is still an open problem. The back propagation (BP) neural network has nonlinear recognition and self-learning ability, and is a common intelligence method in mid- to long-term runoff forecasting. But the BP neural network has its inherent defects, such as being prone to oscillation, easily falling into local minima, slow convergence, and the initial value, the threshold value and the number of neurons in the hidden layer are difficult to determine (Shi Citation2009, Maier et al. Citation2010).

The fuzzy neural network forecasting method is based on fuzzy set theory, artificial intelligence and many historical data. Historical data are properly segregated and given appropriate membership functions. Fuzzy neural network forecasting is good at solving the problem of stationary random processes, but poor in the forecast of volatile random questions (Chang et al. Citation2002, Jain and Kumar 2007). Markov models are stochastic models that capture variability in a process through time. Markov chain and hidden Markov models are probably the simplest models that can be used to model sequential data, i.e. data samples that are not independent of each other (Bolch et al. Citation2006). Markov chain models have been successfully applied to model daily rainfall processes (Marshall et al. Citation2004). Because of the “no backward effect” property of the Markov chain (Jothityangkoon et al. Citation2000, Kottegoda et al. Citation2000, Hlynka Citation2007, Jackson Citation2009, Li et al. Citation2010, Pachet et al. Citation2011), it is good at solving prediction problems with volatile strong random processes. Hydrological systems have the characteristics of a stationary random change tendency and nonstationary random processes, so we can combine the advantages of the fuzzy neural network forecasting method and the Markov forecasting model, which can form a fuzzy neural network–Markov prediction method, to deal with the forecasting of hydrological systems.

As mentioned above, the crucial and most difficult point is the development of effective models to reduce error accumulation and increase the accuracy of mid- to long-term runoff forecasts (Yang et al. Citation2013). In view of this, the main purpose of this paper is to develop an ideal point fuzzy neural network (NFNN)–Markov hybrid algorithm (NFNN-MKV) to estimate and predict mid- to long-term (annual and monthly) runoff using the Si Quan Reservoir on the Han River in China as one case study, and Weijiabao (Wei et al. Citation2012, Citation2013) on the Weihe River in China as one comparison case study. The main objectives include:

  1. developing an NFNN-MKV hybrid model that can predict mid- to long-term runoff using the annual data and monthly data of the previous year and month to improve the accuracy of mid- to long-term serial prediction;

  2. applying this model to simulate annual runoff and monthly runoff of the Si Quan Reservoir, and simulate instream flow at Weijiabao on the Weihe River in China;

  3. comparing the fitting and prediction performances of the NFNN-MKV hybrid model with the traditional fuzzy neural network model, the Markov prediction model, the wavelet–ANN model (Wei et al. Citation2012, Citation2013) and the SVR model (Behzad et al. Citation2009);

  4. investigating network generalization improvement of the ideal point fuzzy neural network.

This paper is organized as follows. Section 2 presents the proposed approach for mid- to long-term runoff forecasting, and provides the different criteria used to evaluate the forecasting accuracy. Section 3 provides the simulation results from a case study based on the Si Quan Reservoir on the Han River in China. Section 4 provides the simulation results from one comparison case study based on Weijiabao on the Weihe River in China. Section 5 gives the conclusions and discussion.

2 Materials and methods

The proposed approach is based on the combination of an NFNN and a Markov hybrid algorithm. The NFNN is used to carry out suppositional prediction, and the Markov algorithm is employed to improve the prediction accuracy.

According to the clustering principle, the percentage error sequence of the suppositional prediction is divided into a number of state intervals, then the transition probability matrix and the frequency probability matrix are identified and calculated, and the state transition probability of the Markov chain can be obtained. Finally, selecting the following state point with the maximum probability and the change rate of the following state, according to the change rate and data of the previous state, data for the following state can be calculated.

2.1 New fuzzy neural network algorithm and structure

Since the runoff forecasting procedure follows the dynamics of a nonlinear system, a common approach is to configure and train a neural network to represent the nonlinear autoregressive model structure. It is a natural perception to reflect the dynamic nature of the problem by sequential information processing. A major factor affecting the neural model prediction accuracy is the data coding method. The literature (Andrews et al. Citation2001, Aqil et al. Citation2007) has pointed out that the acquisition of expert knowledge and the parameter initialization capability constitute the two major advantages of FNN over ANN. In this paper, we develop the ideal point fuzzy neural network with Markov algorithm technique to improve the accuracy of the mid- to long-term serial prediction. This fuzzy system, called NFNN, combines the fuzzy inference principles with the neural network structure and learning abilities into an integrated neural-network-based ideal point fuzzy decision system. Its data coding method is called data fuzzification encoding (FE). According to the FE technique, each data value is represented as the mean value of a fuzzy pattern of excitation over several nodes at the network input and output. The reverse procedure is applied at the network output to decode the values back into the original variable range. Also, decoding of the network output using the FE method is performed by computing a weighted summation of the node excitations.

NFNN is a class of adaptive multilayer feedforward networks, which can be applied to nonlinear forecasting, whose past samples are used to forecast the sample ahead. NFNN incorporates the self-learning ability of NN with the linguistic expression function of fuzzy inference (Kasabov and Song Citation2002, Rong et al. Citation2006).

The NFNN architecture comprises an input layer, a hidden (interface) layer and an output layer. The NFNN network is composed of three layers. Each layer contains several nodes described by the node function. Its details are given as follows.

Assume there are n training sample sets, and m characteristic values of each sample. The predictors’ eigenvalue matrix is as follows:

(1)

aij is the ith predictor’s characteristic value of the training sample j.

The sample set is made up of n prediction objects. Its feature vector is:

(2)

The prediction object’s membership formula is:

(3)

where min bj and max bj are the minimum and maximum characteristic values of the prediction object.

The positively correlated predictors’ relative membership formula is:

(4)

The negatively correlated predictors’ relative membership formula is:

(5)

where min aij and max aij are the ith predictor’s minimum and maximum characteristic values of the training sample j.

The generalized delta rule is applied to adjust the weights of the feedforward networks, thus minimizing a predetermined cost error function. The weight adjustment rule is deduced by using a Sugeno-type function, and its expression is as follows:

(6)

The weights of the output layer are:

(7)

The weights of the hidden layer are:

(8)

The fuzzy optimization model is used as the activation function (Shi Citation2010a). The weights are as follows:

The weight of the hidden layer k is:

(9)

where ; η is the learning efficiency; α is the momentum operator; n is the number of the iteration; δk j is the error signal of the hidden layer.

The weight of the output layer is:

(10)

where ; δhj is the error signal of the output layer.

Thus, the output formula of the hidden layer k can be introduced by the model of the fuzzy ideal point, and its expression is as follows:

(11)

where j is sequence number of the sample; rij is the input layer’s input elements; ωik are the connection weights between the layer i and the layer k; dkj are the output values of the layer k.

The output formula of the output layer h can be introduced by the model of the fuzzy ideal point, and its expression is as follows:

(12)

where dkj are the input values of the hidden layer; ωkh are the connection weights between the layer k and the layer h; dhj are the output values of the layer h.

Thus, an ideal point fuzzy neural network is functionally equivalent to an adaptive Sugeno-type fuzzy inference system network.

2.2 Markov prediction algorithms

A Markov process is a special, perhaps the most important, subclass of a stochastic process (Bolch et al. Citation2006). In particular, a stochastic process provides a relation between the elements of a possibly infinite family of random variables. A series of random experiments can thus be taken into consideration and analysed as an ensemble. shows a schematic of the Markov prediction method.

The core of the Markov prediction method is “no backward effect”; the object must pass the change of state process from one random state to another state. The previous state, following state and state transition probability have the following relation:

Figure 1. Explanatory diagram of the Markov prediction algorithm, where Sn1 is the previous state, Sn is the following state, P is the state transition probability matrix, and Sn = Sn1P.

Figure 1. Explanatory diagram of the Markov prediction algorithm, where Sn−1 is the previous state, Sn is the following state, P is the state transition probability matrix, and Sn = Sn−1P.
(13)

where Sn is the following state, Sn1 is the previous state, P is the state transition probability.

According to expression (13), two results can be derived: (a) if the state transition probability is known, it can predict the following state result according to the information of the previous state; (b) it can predict the change result of the next or the next several periods according to the information of the current state and the state transition probability.

Starting with state i, the Markov chains will go to some state j (including the possibility of i), the one-step transition probabilities pij are usually summarized in a non-negative number. Usually, the stochastic transition matrix P is as follows:

(14)

where j and i are the states of state space, = 1, 2, …, n; = 1, 2, …, n; pij is the state transition probability, and , where .

Assuming the number of state variables is n, state variables are denoted as S1S2, …, Sn. The state transition probability is that the state Si goes to the state Sj in the step r. The transition probability matrix of r is as follows:

(15)

Repeatedly applying one-step transitions generalizes immediately to r-step transition probabilities. The r-step transition probability matrix can be computed by the (r − 1)-fold multiplication of the one-step transition matrix by itself. So, the transition probability matrix of step r can be concluded:

(16)

where is the transition probability matrix of the step r. is the transition probability matrix of the step r − 1. is the first transition probability matrix, which is usually denoted by P.

2.3 Hybrid approach

In this section, the algorithm used to implement the proposed approach is described step-by-step.

Step 1: Suppositional prediction. According to historical data, carry out the object’s suppositional prediction with the fuzzy neural network forecast method; find the percentage error sequence of the object.

Step 2: The percentage errors are classified according to the classification of percentage errors; the percentage error sequence of the object is divided into a number of state intervals.

Step 3: Form the state transition probability matrix based on Step 2.

Step 4: According to the previous state of history point based on Step 3, predict the most likely state and solve the change rate of the following state (the next forecast point).

Step 5: Calculate the prediction value of (the following state) the next point, according to the change rate of Step 4 and the actual value of the previous state.

2.4 Model evaluation

Since the developed models are applied for management and planning, appropriate model evaluation methods are essential. The accuracy of developed models can be assessed using many measures. In this study, the following error statistics are used to evaluate the models. The criteria for comparing the performance are the percentage error (PE), mean absolute percentage error (MAPE), absolute percentage error (APE), mean squared error (MSE), the maximum APE (Max-APE), the minimum APE (Min-APE) and forecast accuracy rate (FAR) in this paper, which indicate the accuracy of recall.

These criteria are defined as follows:

(17)
(18)
(19)
(20)
(21)

where is the actual value, is the forecast value, and n is the total number of values predicted.

3 Application of hybrid algorithm to the Si Quan Reservoir

3.1 Results of annual runoff

3.1.1 Data and parameters

The Si Quan Reservoir is not entirely an annual regulating reservoir. The annual distribution of runoff of the Si Quan Reservoir is as much as 75–80% in summer, about 10–15% in spring, and only about 5% in winter. The main flood season is July to September and the flood water is abundant. The annual maximum flow appears in July and September; the flood runoff of the Han River has obvious characteristics of bimodal type. The river discharge also has clear seasonal character, and the highest discharge occurs from July to October and the lowest discharge occurs from December to March. So, the neural network model’s input values are the previous history data, such as rainfall, annual inflow and average inflow for 5–10 months, and the output values are predicted annual inflow.

According to the characteristics of runoff in the Han River, setting the training sample sequence, the training samples include rainfall, annual inflow and the average 5–10 monthly inflow for years 1954 to 1987 for the Si Quan Reservoir, and the test samples have data from years 1988 to 2008. Applying the fuzzy neural network, we forecast the annual inflow of the Si Quan Reservoir from years 2009 to 2013.

In this context, we used the previous annual data (rainfall, the average 5–10 monthly inflow and annual inflow) as inputs and the following annual data (annual inflow) as the targets for network training ().

Table 1. Data division for model training, testing and prediction. P: input vectors; T: target vectors; Ri: signal, i.e. observed rainfall dataset; AMi: signal, i.e. observed average 5–10 monthly inflow dataset; Ai: signal, i.e. observed annual inflow dataset.

The structure of the fuzzy neural network is as follows: there are three input layer nodes; hidden layer nodes are determined by the test method; and the output layer has only one node representing the following annual inflow.

3.1.2 Results of NFNN

The prediction test results are shown in . The prediction results are shown in . Evaluation of the NFNN model performance is shown in .

Table 2. Actual annual inflow and prediction testing annual inflow of the Si Quan Reservoir and APE of NFNN.

Table 3. Actual annual inflow and predicted annual inflow for the Si Quan Reservoir and APE of NFNN.

Table 4. Evaluation of the NFNN model performance.

shows that the APE of NFNN in the prediction test results is uneven, with the Max-APE being 43.16% and the Min-APE, 3.23%. This reveals that the prediction test period was appropriate.

shows that APE of NFNN from year 2009 to 2012 is uneven, with the Max-APE being 32.99% and the Min-APE, 9.12%.

, and show that the predicted test annual inflow and predicted annual inflow of the Si Quan Reservoir using the NFNN show some discrepancies with the actual inflow. However, the error indexes show that the errors on the training set, testing and prediction have almost no difference. This reveals that the network is able to memorize the training examples, and learn how to generalize the new situations. The range of the APE is uneven, at 2.84–43.16%. According to regulation SL250-2000 “Standard of Hydrology Information Forecast”, the qualified rate is 64%. This indicates that a fuzzy neural network forecasting model’s prediction accuracy needs to be improved with respect to its fitting and prediction ability.

3.1.3 Prediction process of NFNN-MKV

Step 1: Division state of the absolute percentage errors (APE). The APE is divided into nine state intervals according to the APE of the fuzzy neural network model, and the division result is shown in .

Step 2: Calculating the one-step transition probability matrix. According to the division state of the percentage errors of the NFNN model and the frequency probability matrix, we can obtain the state transition probability R. The state transition probability matrix R is shown in .

Step 3: Calculating the initial state vector. According to the one-step transition probability matrix, calculating the initial state vector, the initial state vector can be known from the reference P(0) = (0,0,0,0,0,0,0,0,1).

Step 4: We can predict the inflows of the Si Quan Reservoir from 1988 to 2013. The formula is as follows:

(22)

We calculate the state probability of the inflows from year 1988 to year 2013, and select the maximum annual probability and the inflow’s change rate for the predicted year. Combining the inflow of the last history point, we can deduct the inflow of the next forecast point.

Table 5. Division of the status space.

Table 6. State transition probability matrix R.

3.1.4 Results of NFNN-MKV

Prediction results for the Si Quan Reservoir with the NFNN-MKV model are shown in and .

Table 7. Actual annual inflows and predicted annual inflows for the Si Quan Reservoir and percentage errors (PE) of the NFNN-MKV model from 1988 to 2013.

Table 8. Evaluation results of the NFNN-MKV, Markov and FNN model performances from 1988 to 2012.

shows that the predicted annual inflows of the fuzzy neural network– Markov forecasting model are close to the actual values. The range of the APE is relatively uniform, and the floating range of the APE is 0–7.0%, and the qualified rate is 100%.

shows the prediction results of three forecasting models for 25 years. It also shows evaluation results of the NFNN-MKV, Markov and FNN model performances for years 1988 to 2012.

and show that the mean absolute percentage errors (MAPE) of the NFNN-MKV model are less than those of the other forecasting models in all periods: the APE do not exceed 7% in each period, the MAPE is less than 3.385%, and the MSE is 3.806 with the NFNN-MKV model. While the MAPE is more than 12% and the MSE is more than 14.0 with the Markov model, the MAPE is more than 17% and MSE is more than 20.0 with FNN model. and show that the MAPE and MSE of the NFNN-MKV model are lower than those of the other forecasting models. Further, they show that the prediction accuracy of the NFNN-MKV model is higher than that of the Markov or FNN models in terms of MAPE and MSE. They indicate that the NFNN-MKV model can improve prediction accuracy in the Si Quan Reservoir.

3.2 Results of monthly runoff

3.2.1 Monthly data and parameters

The data used are the natural time series of monthly inflow and rainfall for the Si Quan Reservoir between January 1954 and October 2013 (718 months). The training samples include monthly rainfall and monthly inflow from January 1954 to December 1998, and the testing samples have monthly data from January 1999 to December 2008. Applying the fuzzy neural network–Markov hybrid algorithm (NFNN-MKV), the Markov forecasting model (Markov) and the new fuzzy neural network (NFNN), we can forecast the average monthly inflow for the Si Quan Reservoir between January 2009 and November 2013. We count 179 monthly inflows for each forecasting model.

In this context, we used the previous 12-month data (the average monthly rainfall, and the average monthly inflow) as inputs and the following 12-month data (monthly inflow) as the targets for fuzzy neural network training ().

Table 9. Data division for model training, testing and prediction. P: input vectors; T: target vectors; Ri: signal, i.e. observed rainfall dataset; Ai: signal, i.e. observed monthly inflow dataset.

The structure of the fuzzy neural network is as follows: there are 24 input layer nodes (the average monthly rainfall, the average monthly inflow); hidden layer nodes are determined by the test method; and the output layer has 12 nodes representing the prediction 12-monthly inflow.

3.2.2 Monthly runoff prediction process of NFNN-MKV

First, the NFNN is used to carry out suppositional prediction. According to the absolute percentage errors (APE) of the prediction testing and prediction using the NFNN, the APE is divided into nine parts, which are nine state intervals of the Markov chain algorithm. Then, the state transition probability R and the initial state vector are calculated. Finally, we can predict the monthly inflows for the Si Quan Reservoir from January 2009 to November 2013. The formula is as follows:

(23)

where P(0) = (0,0,0,0,0,0,0,0,1).

We calculate the state probability of the monthly inflows from January 2009 to November 2013, and select the maximum probability and the inflow’s change rate for the predicted month. Combining monthly inflow for the previous history point, we can deduct monthly inflow for the following forecast point.

3.2.3 Prediction results of NFNN-MKV

Prediction results for the Si Quan Reservoir are shown in and , and . contrasts values of actual monthly inflows and the predicted inflows of the NFNN-MKV, Markov model and fuzzy neural network (FNN) from January 2012 to November 2013. also contrasts values of prediction model percentage errors (PE) from January 2012 to November 2013.

Table 10. Actual monthly inflows and predicted monthly inflows for the Si Quan Reservoir and percentage errors (PE) of three forecasting models from January 2012 to November 2013.

Table 11. Evaluation results of the NFNN-MKV, Markov and FNN model performances from January 2012 to November 2013.

Figure 2. Actual monthly inflows and forecast monthly inflows of three forecasting models from January 2009 to December 2011.

Figure 2. Actual monthly inflows and forecast monthly inflows of three forecasting models from January 2009 to December 2011.

Figure 3. Percentage errors of three forecasting models from January 2009 to December 2011.

Figure 3. Percentage errors of three forecasting models from January 2009 to December 2011.

Figure 4. Actual monthly inflows and forecast monthly inflows of the NFNN-MKV model from January 1999 to November 2013.

Figure 4. Actual monthly inflows and forecast monthly inflows of the NFNN-MKV model from January 1999 to November 2013.

Figure 5. Percentage errors of three forecasting models from January 1999 to November 2013.

Figure 5. Percentage errors of three forecasting models from January 1999 to November 2013.

shows that the predicted monthly inflows for the NFNN-MKV model are better than those for the Markov model and the FNN model. The predicted monthly inflow for the NFNN-MKV model is closer to the actual value in each month. The APE for the NFNN-MKV model is smaller than that for the Markov model and the FNN model in each month. The prediction precision of the NFNN-MKV model is higher than that of the other forecasting models in terms of PE.

gives the reservoir’s prediction results for three forecasting models in 23 months, which are also the evaluated results of the NFNN-MKV, Markov and FNN model performances from January 2012 to November 2013.

and show that the PE and APE of the NFNN-MKV model are lower than those of the other forecasting models in each month. The APE of the NFNN-MKV model does not exceed 7% in each period, the MAPE is less than 3.4%, and the MSE is 3.706. But the MAPE of the Markov model is higher than11%, the MSE is higher than 13.0, and the qualified rate is 82.6%. The MAPE of the FNN model is higher than 17%, the MSE is higher than 19.0, and the qualified rate is 60.9%. The MAPE and MSE of the NFNN-MKV model are lower than those of the other forecasting models. The PE of the NFNN-MKV model is relatively uniform, the floating range of its APE is 0–7.0%, and the qualified rate is 100%. This indicates that the NFNN-MKV model has much better prediction results than the Markov model and the FNN model in terms of MSE, MAPE, PE and APE. It shows that prediction accuracy of the NFNN-MKV model is higher than that of the Markov and FNN models. For further comparison of these better models, the maximum APE (Max-APE) and minimum APE (Min-APE) of the forecasting models, called the extreme error indices, were considered. The Max-APE (33.283) and Min-APE (4.998) of the FNN proved that this model had weaker prediction ability than the NFNN-MKV model ().

shows the contrast curve of actual monthly inflows and predicted monthly inflows for the NFNN-MKV model, the Markov model and the FNN model from January 2009 to December 2011.

shows the contrast curve of percentage errors for the NFNN-MKV, Markov and FNN forecasting models from January 2009 to December 2011.

shows the contrast curve of actual monthly inflows and predicted monthly inflows of the NFNN-MKV model from January 1999 to November 2013.

shows the contrast curve of percentage errors for the NFNN-MKV, Markov and FNN forecasting models from January 1999 to November 2013.

The prediction ability of the NFNN-MKV model is superior to that of the Markov and FNN models, as indicated by direct comparison of prediction results and observed data (). and and show that the predicted monthly inflows of the NFNN-MKV model are closer to the actual monthly inflows in all periods. The prediction precision of the NFNN-MKV model is higher than that of the Markov and FNN models. and show that the percentage errors of the NFNN-MKV are smaller than those of the Markov and FNN models. The percentage error of the NFNN-MKV model is relatively uniform, and the floating range of its APE is 0–7.0%. This shows that the NFNN-MKV model is stable, and has higher prediction accuracy. Comparison of extreme error (Max-APE = 6.949, Min-APE = 0.679) indices indicates that the NFNN-MKV model is able to accurately forecast mid- to long-term runoff of rivers and reservoirs. Comparing the evaluation indices of the NFNN-MKV, Markov and FNN models, in terms of the smallest errors (MSE = 3.706, MAPE = 3.317, Max-APE = 6.949, Min-APE = 0.679), reveals that the NFNN-MKV model has the best performance to estimate general river and peak discharge. The NFNN-MKV model is suitable for mid- to long-term runoff prediction for rivers and reservoirs with respect to targets.

shows forecast monthly inflow accuracy rates (FAR) for the NFNN-MKV model from January 2010 to November 2013.

Table 12. Forecast monthly inflow accuracy rate (FAR) of NFNN-MKV model for the Si Quan Reservoir.

shows that the accuracy rate for the NFNN-MKV model is about 96% from the continuous 47 months, the lowest accuracy rate is 94.2%, and the highest accuracy rate is 99.3%. The prediction accuracy has been significantly improved. This proves that the NFNN-MKV model has good prediction performance.

4 Application comparison of hybrid algorithm

4.1 Data and parameters

The datasets used for this comparison study include monthly rainfall and monthly river discharge time series for Weijiabao, referred to hereafter as the WJ series. The WJ series includes 696 months (58 years) of data from January 1956 to December 2013. The dataset was divided into three parts for network training, testing and prediction. We reserved the first 420 months (January 1956 to December 1990) of the data series for network training, used 120 months (January 1991 to December 2000) of the data series for network prediction testing, and used the rest (156 months: January 2001 to December 2013) of the data for prediction.

We compare the model fitting and prediction performance of the ideal point fuzzy neural network–Markov hybrid algorithm (NFNN-MKV) through selecting the wavelet–ANN (WNN) model (Wei et al. Citation2012, Citation2013) and the support vector regression (SVR) model (Behzad et al. Citation2009) with good performances. The performance of the method is validated via out-of-sample comparisons using real data from the runoff systems of Weijiabao. The performance of the NFNN-MKV method is tested by comparing the modelling results of the training, testing and prediction with the WNN model and the SVR model.

In this context, we used the previous 12-month data (average monthly rainfall and average monthly discharge) as inputs and the following 12-month data (monthly inflow) as the targets for neural network training ().

The structure of the neural network is as follows: there are 24 input layer nodes (average monthly rainfall, average monthly discharge); hidden layer nodes are determined by the test method; and the output layer has 12 nodes representing the targets for the following 12-month data (monthly inflow).

4.2 Simulation results

In the experiment, we adopt the monthly flow time series for Weijiabao to verify the effectiveness of the learning and forecasting for the NFNN-MKV method, the WNN model and the SVR model. The criteria used to compare the performance are the mean absolute percentage error (MAPE), the percentage error (PE), the absolute percentage error (APE) and the mean square error (MSE) in this paper, which indicate the accuracy of recall. In addition, for comparative study, numerical simulations comparing with the WNN and SVR methods are also conducted. First, according to the training, testing and prediction stages, we calculate the PE, MAPE, APE and MSE for three forecasting models. At the same time, to verify the effectiveness of the NFNN-MKV method, the performance of the proposed approach is studied.

Numerical results from the NFNN-MKV, WNN and SVR models are shown in and and .

Table 13. Evaluation of the NFNN-MKV, WNN and SVR model performances for the whole training stage from January 1956 to December 1990.

Table 14. Evaluation results of the NFNN-MKV, WNN and SVR model performances in prediction testing stage from January 1991 to December 2000.

Table 15. Evaluation results of the NFNN-MKV, WNN and SVR model performances for the forecasting stage from January 2001 to December 2013.

Figure 6. Comparison of simulated results and observed actual data of the NFNN-MKV, WNN and SVR models.

Figure 6. Comparison of simulated results and observed actual data of the NFNN-MKV, WNN and SVR models.

Figure 7. Comparison of simulated results and the percentage errors of the NFNN-MKV, WNN and SVR models.

Figure 7. Comparison of simulated results and the percentage errors of the NFNN-MKV, WNN and SVR models.

The visualized comparison of the simulated and observed discharge series between the NFNN-MKV, WNN and SVR models is illustrated in . This figure clearly illustrates that, comparing the NFNN-MKV, WNN and SVR models, each model had very good performance for fitting historical data during training, but the simulated results of the NFNN-MKV model are closer to the actual monthly discharges in the training, testing and forecasting periods, so are better than those of the WNN and SVR models.

The general training, testing and forecasting evaluation results of the NFNN-MKV, WNN and SVR models using the observed discharge data and the percentage errors are showed in and . Comparing the results shown in and indicates that the performance of the NFNN-MKV model is better than the WNN and SVR models. The percentage errors of the NFNN-MKV model are smaller in all periods, so better than those of the WNN and SVR models. The percentage errors for the NFNN-MKV model are relatively uniform, and the floating range of its APE is 0–6.0%, which indicates that the stability and robustness of the NFNN-MKV model are better than those of the WNN and SVR models.

The MAPE, Max-APE, Min-APE and MSE of the three models for performance comparison over the whole training stage from January 1956 to December 1990 are summarized in .

In general, the results show that the NFNN-MKV, WNN and SVR models meet the performance goal quickly. Comparison of the training results of the NFNN-MKV, WNN and SVR models suggests that they have very good simulated performances in terms of MAPE, Max-APE, Min-APE as well as MSE. However, the training results of the NFNN-MKV model with smaller errors (MAPE = 3.037%, Max-APE = 5.97%, Min-APE = 0.053% and MSE = 3.383) show that this model has the best performance for fitting historical data, better than that of the WNN and SVR models.

The general testing evaluation results for model prediction ability of the NFNN-MKV, WNN and SVR models using the observed discharge data from January 1991 to December 2000 are summarized in . In terms of MAPE, Max-APE, Min-APE and MSE, the NFNN-MKV, WNN and SVR models exhibit good correlation between observed data and simulated results. However, the NFNN-MKV model has much better prediction results than the WNN and SVR models in terms of MAPE, Max-APE, Min-APE, MSE as well as average prediction testing monthly discharge.

shows a comparison between the NFNN-MKV approach and two other approaches (WNN and SVR models) regarding the MAPE, Max-APE, Min-APE and MSE criteria. The NFNN-MKV approach presents better forecasting accuracy: the MAPE, Max-APE, Min-APE and MSE over the forecasting months (156 months: from January 2001 to December 2013) for Weijiabao on the Weihe River have values of 2.768%, 5.904%, 0.0447% and 3.256(m3/s)2, respectively. Improvement in the MAPE of the NFNN-MKV approach with respect to the WNN and SVR models is 66.1% and 83.6%, respectively. Improvement in the Max-APE of the NFNN-MKV approach with respect to the WNN and SVR models is 72.9% and 83.5%, respectively. Improvement in the MSE of the NFNN-MKV approach with respect to the WNN and SVR models is 67.3% and 82.3%, respectively. From the three tables and two figures, clearly the NFNN-MKV outperforms the WNN and SVR models in almost all situations. It can be seen that the NFNN-MKV approach is much better than the WNN and SVR models and can be considered high in terms of accuracy.

and and show that the NFNN-MKV, WNN and SVR models have very good performance for fitting historical data during training, testing and prediction, but the NFNN-MKV model values are closer to the actual monthly discharges in all periods, better than those of the WNN and SVR models. In terms of MAPE, Max-APE, Min-APE and MSE, the stability and robustness of the NFNN-MKV model are better than those of the WNN and SVR models; the former has the higher prediction accuracy.

5 Conclusions

In this paper, a mid- to long-term runoff forecasting model has been developed using an ideal point fuzzy neural network–Markov (NFNN-MKV) hybrid algorithm to improve the forecasting precision. Prediction application results for the Si Quan Reservoir show that the NFNN-MKV hybrid algorithm has good performance in terms of faster convergence, better forecasting accuracy and significant improvement of neural network generalization in comparison with the traditional fuzzy neural network or Markov models. The absolute percentage error of the NFNN-MKV hybrid algorithm is less than 7.0%, MSE is less than 3.9, and qualification rate reaches 100%.

Another comparison case study illustrates the performance of the NFNN-MKV hybrid algorithm to improve forecasting accuracy. The NFNN-MKV, WNN and SVR models have been employed to estimate (training (420-month) and testing for 120-month-ahead prediction) and predict river discharge (156 months). Numerical comparison results confirm the efficiency of the NFNN-MKV method, which indicates that the NFNN-MKV model is able to significantly increase prediction accuracy. The NFNN-MKV approach presents better forecasting accuracy: the MAPE, Max-APE, Min-APE and MSE over the forecasting months (156 months: from January 2001 to December 2013) for Weijiabao on the Weihe River have values of 2.768%, 5.904%, 0.0447% and 3.256 (m3/s)2, respectively. In terms of MAPE, Max-APE, Min-APE and MSE, the stability and robustness of the NFNN-MKV model are better than those of the WNN and SVR models; the former has the higher prediction accuracy. The NFNN-MKV model takes the advantages of the ideal point fuzzy neural network and the Markov prediction model, which can solve the problem of a stationary or volatile strong random process. The NFNN-MKV hybrid algorithm enhances the efficiency of mid- to long-term runoff forecasting for a reservoir system, and it is a scientific and effective forecasting model. The mid- to long-term runoff forecasting method based on the NFNN-MKV hybrid algorithm has promotional values.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was financially supported by the China National Funds for Distinguished Young Scientists (61025014), National Natural Science Foundation (61304001), the China Ministry of Science Technology Innovation Funds (12C26213714122), the China Ministry of Industry and Information Technology Special Funds ([2011] 353), the Ning Xia Natural Science Funds (NZ12112), the Ning Xia Department of Education Reform Project Funds ([2012] 348), the China Postdoctoral Science Foundation (2012M512147), the China Postdoctoral Science Special Foundation (2013T60929) and Innovation team project of Chang Jiang Scholars and Innovative Research Team of Ministry of Education (IRT1067).

References

  • Adamowski, J., et al., 2012. Comparison of multivariate adaptive regression splines with coupled wavelet transform artificial neural networks for runoff forecasting in Himalayan micro-watersheds with limited data. Journal of Hydroinformatics, 14, 731. doi:10.2166/hydro.2011.044
  • Adamowski, J. and Sun, K., 2010. Development of a coupled wavelet transform and neural network method for flow forecasting of non-perennial rivers in semi-arid watersheds. Journal of Hydrology, 390 (1–2), 85–91. doi:10.1016/j.jhydrol.2010.06.033
  • Anctil, F. and Tape, D.G., 2004. An exploration of artificial neural network rainfall-runoff forecasting combined with wavelet decomposition. Journal of Environmental Engineering and Science, 3 (S1), S121–S128. doi:10.1139/s03-071
  • Andrews, R., et al., 2001. A review of techniques for extracting rules from trained artificial neural networks. Clinical Applications of Artificial Neural Networks, 256–297. doi:10.1017/CBO9780511543494.012
  • Aqil, M., et al., 2007. A comparative study of artificial neural networks and neuro-fuzzy in continuous modeling of the daily and hourly behaviour of runoff. Journal of Hydrology, 337 (1–2), 22–34. doi:10.1016/j.jhydrol.2007.01.013
  • Behzad, M., et al., 2009. Generalization performance of support vector machines and neural networks in runoff modeling. Expert Systems with Applications, 36 (4), 7624–7629. doi:10.1016/j.eswa.2008.09.053
  • Bolch, G., et al., 2006. Queueing networks and Markov chains: modeling and performance evaluation with computer science applications. Hoboken, NJ: John Wiley & Sons.
  • Chang, F., Chang, L.-C., and Huang, H.-L., 2002. Real-time recurrent learning neural network for stream flow forecasting. Hydrological Processes, 16 (13), 2577–2588. doi:10.1002/hyp.1015
  • Deka, P. and Chandramouli, V., 2003. A fuzzy neural network model for deriving the river stage-discharge relationship. Hydrological Sciences Journal, 48 (2), 197–209. doi:10.1623/hysj.48.2.197.44697
  • Dibike, Y.B. and Solomatine, D.P., 2001. River flow forecasting using artificial neural networks. Physics and Chemistry of the Earth, Part B: Hydrology, Oceans and Atmosphere, 26 (1), 1–7. doi:10.1016/S1464-1909(01)85005-X
  • Elshorbagy, A., Simonovic, S.P., and Panu, U.S., 2000. Performance evaluation of artificial neural networks for runoff prediction. Journal of Hydrologic Engineering, 5 (4), 424–427. doi:10.1061/(ASCE)1084-0699(2000)5:4(424)
  • Gardner, L.R., 2009. Assessing the effect of climate change on mean annual runoff. Journal of Hydrology, 379 (3–4), 351–359. doi:10.1016/j.jhydrol.2009.10.021
  • Guo, X., et al., 2010. Study on prediction model of the chaotic runoff time series using fuzzy support vector machines and a case study. Shuili Fadian Xuebao (Journal of Hydroelectric Engineering), 29 (3), 51–55.
  • Hlynka, M., 2007. Queueing networks and markov chains (modeling and performance evaluation with computer science applications). Technometrics, 49 (1), 104–105. doi:10.1198/tech.2007.s457
  • Hong, W.-C. and Pai, P.-F., 2007. Potential assessment of the support vector regression technique in rainfall forecasting. Water Resources Management, 21 (2), 495–513. doi:10.1007/s11269-006-9026-2
  • Hu, X.J., et al., 2012. Index of multiple factors and expected height of fully mechanized water flowing fractured zone. Journal of China Coal Society, 4, 016.
  • Jackson, A., 2009. Markov Analysis with Non-constant Hazard Rates. Thesis (Master). California State University.
  • Jain, A. and Kumar, A.M., 2007. Hybrid neural network models for hydrologic time series forecasting. Applied Software Computing, 7 (2), 585–592. doi:10.1016/j.asoc.2006.03.002
  • Jothityangkoon, C., Sivapalan, M., and Viney, N.R., 2000. Tests of a space-time model of daily rainfall in southwestern Australia based on nonhomogeneous random cascades. Water Resources Research, 36 (1), 267–284. doi:10.1029/1999WR900253
  • Kasabov, N.K. and Song, Q., 2002. DENFIS: dynamic evolving neural-fuzzy inference system and its application for time-series prediction. IEEE Transactions on Fuzzy Systems, 10 (2), 144–154. doi:10.1109/91.995117
  • Kisi, O., 2010. Wavelet regression model for short-term stream flow forecasting. Journal of Hydrology, 389 (3–4), 344–353. doi:10.1016/j.jhydrol.2010.06.013
  • Kisi, O., 2011. Wavelet regression model as an alternative to neural networks for river stage forecasting. Water Resources Management, 25 (2), 579–600. doi:10.1007/s11269-010-9715-8
  • Kişi, Ö., 2009. Wavelet regression model as an alternative to neural networks for monthly stream flow forecasting. Hydrological Processes, 23 (25), 3583–3597. doi:10.1002/hyp.7461
  • Kottegoda, N.T., Natale, L., and Raiteri, E., 2000. Statistical modeling of daily stream flows using rainfall input and curve number technique. Journal of Hydrology, 234 (3–4), 170–186. doi:10.1016/S0022-1694(00)00252-3
  • Kwan, H.K. and Cai, Y., 1994. A fuzzy neural network and its application to pattern recognition. IEEE Transactions on Fuzzy Systems, 2 (3), 185–193. doi:10.1109/91.298447
  • Li, Z., et al., 2010. Analysis of parameter uncertainty in semi-distributed hydrological models using bootstrap method: A case study of SWAT model applied to Yingluoxia watershed in northwest China. Journal of Hydrology, 385 (1–4), 76–83. doi:10.1016/j.jhydrol.2010.01.025
  • Lin, J.-Y., Cheng, C.-T., and Chau, K.-W., 2006. Using support vector machines for long-term discharge prediction. Hydrological Sciences Journal, 51 (4), 599–612. doi:10.1623/hysj.51.4.599
  • Maier, H.R., et al., 2010. Methods used for the development of neural networks for the prediction of water resource variables in river systems: current status and future directions. Environmental Modelling & Software, 25 (8), 891–909. doi:10.1016/j.envsoft.2010.02.003
  • Marshall, L., Nott, D., and Sharma, A., 2004. A comparative study of Markov chain Monte Carlo methods for conceptual rainfall-runoff modeling. Water Resources Research, 40, 2. doi:10.1029/2003WR002378
  • Moosavi, V., et al., 2013. A wavelet-ANFIS hybrid model for groundwater level forecasting for different prediction periods. Water Resources Management, 27 (5), 1301–1321. doi:10.1007/s11269-012-0239-2
  • Nayak, P.C., et al., 2004. A neuro-fuzzy computing technique for modeling hydrological time series. Journal of Hydrology, 291 (1–2), 52–66. doi:10.1016/j.jhydrol.2003.12.010
  • Nourani, V., et al., 2013. Using self-organizing maps and wavelet transforms for space-time pre-processing of satellite precipitation and runoff data in neural network based rainfall-runoff modeling. Journal of Hydrology, 476, 228–243. doi:10.1016/j.jhydrol.2012.10.054
  • Pachet, F., Roy, P., and Barbieri, G., 2011. Finite-length Markov processes with constraints. In: Proceedings of the 22nd International Joint Conference on Artificial Intelligence, IJCAI. Menlo Park, CA: AAAI Press, 635–642.
  • Partal, T. and Kişi, Ö., 2007. Wavelet and neuro-fuzzy conjunction model for precipitation forecasting. Journal of Hydrology, 342 (1–2), 199–212. doi:10.1016/j.jhydrol.2007.05.026
  • Rong, H.-J., et al., 2006. Sequential adaptive fuzzy inference system (SAFIS) for nonlinear system identification and prediction. Fuzzy Sets and Systems, 157 (9), 1260–1275. doi:10.1016/j.fss.2005.12.011
  • Shamseldin, A.Y., O’Connor, K.M., and Liang, G.C., 1997. Methods for combining the outputs of different rainfall-runoff models. Journal of Hydrology, 197 (1–4), 203–229. doi:10.1016/S0022-1694(96)03259-3
  • Shi, B., 2009. Short-term load forecast based on modified particle swarm optimizer and back propagation neural network model. Journal of Computer Applications, 29 (4), 1036–1039. doi:10.3724/SP.J.1087.2009.01036
  • Shi, B., 2010a. Short-term load forecasting based on modified particle swarm optimizer and fuzzy neural network model. System Engineering-Theory & Practice, 30 (1), 157–166.
  • Shi, B., 2010b. Long-term runoff forecast method based on dynamic adjustment particle swarm optimizer algorithm and Holt-Winters linear seasonal model. Transactions of the CSAE, 26 (7), 8–13.
  • Shiri, J. and Kisi, O., 2010. Short-term and long-term stream flow forecasting using a wavelet and neuro-fuzzy conjunction model. Journal of Hydrology, 394 (3–4), 486–493. doi:10.1016/j.jhydrol.2010.10.008
  • Wang, C.-H. and Hung, K.-N., 2013. Intelligent adaptive law for missile guidance using fuzzy neural networks. International Journal of Fuzzy Systems, 15 (2), 182–191.
  • Wang, Z., Liang, H., and Liu, G., 2005. Mid and long term hydrology forecast model and its application. Journal of Hydroelectric Engineering, 2, 008.
  • Wei, S., Song, J., and Khan, N.I., 2012. Simulating and predicting river discharge time series using a wavelet-neural network hybrid modelling approach. Hydrological Processes, 26 (2), 281–296. doi:10.1002/hyp.8227
  • Wei, S., et al., 2013. A wavelet-neural network hybrid modelling approach for estimating and predicting river monthly flows. Hydrological Sciences Journal, 58 (2), 374–389. doi:10.1080/02626667.2012.754102
  • Wu, C.L., Chau, K.W., and Li, Y.S., 2008. River stage prediction based on a distributed support vector regression. Journal of Hydrology, 358 (1–2), 96–111. doi:10.1016/j.jhydrol.2008.05.028
  • Xiong, L., Shamseldin, A.Y., and O’connor, K.M., 2001. A non-linear combination of the forecasts of rainfall-runoff models by the first-order Takagi–Sugeno fuzzy system. Journal of Hydrology, 245 (1–4), 196–217. doi:10.1016/S0022-1694(01)00349-3
  • Yang, J.-S., Yu, S.-P., and Liu, G.-M., 2013. Multi-step-ahead predictor design for effective long-term forecast of hydrological signals using a novel wavelet-NN hybrid model. Hydrology & Earth System Sciences Discussions, 10, 9239–9269. doi:10.5194/hessd-10-9239-2013

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.