Search in:

Hydrological Sciences Journal Volume 61, 2016 - Issue 6

Submit an article Journal homepage

Free access

834

Views

CrossRef citations to date

Altmetric

Listen

Original Articles

New fuzzy neural network–Markov model and application in mid- to long-term runoff forecast

Biao Shi Room 302, Xi’an Research Institute of High Technology, China;Civil and Hydraulic Engineering, Ning Xia University, Yin Chuan750021, Ning Xia, China;Engineering Research Centre, Ministry of Education on Water Resources Efficient Use in Arid Modern Agriculture, Yin Chuan750021, ChinaCorrespondence[email protected]
View further author information

Chang Hua Hu Room 302, Xi’an Research Institute of High Technology, ChinaView further author information

Xin Hua YuCivil and Hydraulic Engineering, Ning Xia University, Yin Chuan750021, Ning Xia, China;Engineering Research Centre, Ministry of Education on Water Resources Efficient Use in Arid Modern Agriculture, Yin Chuan750021, ChinaView further author information

Xiao Xiang Hu Room 302, Xi’an Research Institute of High Technology, ChinaView further author information

Pages 1157-1169 | Received 03 Jan 2014, Accepted 18 Aug 2014, Published online: 08 Mar 2016

Cite this article
https://doi.org/10.1080/02626667.2014.986486
CrossMark

In this article

ABSTRACT
1 Introduction
2 Materials and methods
3 Application of hybrid algorithm to the Si Quan Reservoir
4 Application comparison of hybrid algorithm
5 Conclusions
Disclosure statement
Additional information
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF

ABSTRACT

In this paper, a mid- to long-term runoff forecast model is developed using an ideal point fuzzy neural network–Markov (NFNN-MKV) hybrid algorithm to improve the forecasting precision. Combining the advantages of the new fuzzy neural network and the Markov prediction model, this model can solve the problem of stationary or volatile strong random processes. Defined error statistics algorithms are used to evaluate the performance of models. A runoff prediction for the Si Quan Reservoir is made by utilizing the modelling method and the historical runoff data, with a comprehensive consideration of various runoff-impacting factors such as rainfall. Compared with the traditional fuzzy neural networks and Markov prediction models, the results show that the NFNN-MKV hybrid algorithm has good performance in faster convergence, better forecasting accuracy and significant improvement of neural network generalization. The absolute percentage error of the NFNN-MKV hybrid algorithm is less than 7.0%, MSE is less than 3.9, and qualification rate reaches 100%. For further comparison of the proposed model, the NFNN-MKV model is employed to estimate (training and testing for 120-month-ahead prediction) and predict river discharge for 156 months at Weijiabao on the Weihe River in China. Comparisons among the results of the NFNN-MKV model, the WNN model and the SVR model indicate that the NFNN-MKV model is able to significantly increase prediction accuracy.

Editor D. Koutsoyiannis; Associate editor Y. Gyasi-Agyei

KEYWORDS:

mid- to long-term runoff
NFNN-MKV hybrid algorithm
Si Quan Reservoir
Weijiabao

1 Introduction

Mid- to long-term runoff forecasting is important for the study and guidance of water resources management. However, accurate forecasts are difficult because hydrological signals are nonlinear, highly complex and have severe variations in time and space (Yang et al. Citation2013). At present, the research in mid- to long-term runoff prediction is still in the development stage. A variety of mid- to long-term runoff forecasting models have been developed based on physical considerations or other numerical theories, such as Markov prediction models (Wang et al. Citation2005) based on Markov chain theory; linear regressive analysis methods and nonlinear regressive analysis methods (Shamseldin et al. Citation1997) based on stochastic theory; SVM (support vector machine) models (Lin et al. Citation2006) based on nonlinear time series analysis; fuzzy prediction models (Nayak et al. Citation2004) based on fuzzy theory; and ANN (artificial neural network) models (Dibike and Solomatine Citation2001) based on black-box theory. However, because of some inaccurate initial conditions and limited parameterization schemes, those models are generally insufficient for making accurate predictions (Gardner et al. Citation2009, Shi Citation2010b).

It is a hot issue for scholars all over the world to improve the accuracy of mid- to long-term runoff prediction. A variety of methods have been developed to improve the accuracy of the predictions, including physical models, time series methods, conceptual models, ANN models, and many hybrid models. Now, models integrating two or more of these approaches have been proposed as predictors to improve the accuracy of mid- to long-term runoff forecasts, such as multiple regression–ANN models (Elshorbagy et al. Citation2000), fuzzy pattern recognition models (Xiong et al. Citation2001), the wavelet–ANN model (Anctil and Tape Citation2004, Adamowski et al. Citation2012, Wei et al. Citation2012), the wavelet–ANFIS hybrid model (Moosavi et al. Citation2013, Nourani et al. Citation2013), the wavelet–neuron fuzzy model (Partal and Kişi Citation2007, Shiri and Kisi Citation2010), the fuzzy–SVM model (Guo et al. Citation2010, Hu et al. Citation2012), the support vector regression (SVR) model (Lin et al. Citation2006, Hong and Pai Citation2007, Wu et al. Citation2008, Behzad et al. Citation2009) and the wavelet regression model (Kişi Citation2009, Citation2010, Citation2011). These hybrid models have shown different advantages for accurate predictions due to their capabilities in utilizing present information effectively. Among these hybrid models, FNN (fuzzy neural network) and ANN models are the most popularly utilized sub-models for signal forecasting due to their capability of effectively learning complex and nonlinear relationships (Deka and Chandramouli Citation2003). The wavelet–ANN (WNN) model (Wei et al. Citation2012, Citation2013) has been popularly used in hydrological multiple forecasts in recent years by a number of researchers (Adamowski and Sun Citation2010, Yang et al. Citation2013). The SVR model has been successfully used in the hydrological sciences in recent years (Behzad et al. Citation2009). However, high nonlinearity and nonstationarity are properties of runoff time series, and it is still difficult for these hybrid models to replicate these properties.

As mentioned above, these methods have both advantages and disadvantages. Regression analysis can solve the transient nonequilibrium relationship between climate and runoff. The pattern recognition method is suitable for small regional runoff forecasts, but it is unable to deal with runoff forecasts in large regions. A fuzzy system can deal with the nonlinear characteristics, but it lacks the qualification of self-learning. At the same time, how to generate and adjust the membership functions and fuzzy rules (Kwan and Cai Citation1994, Wang and Hung Citation2013, Shi Citation2010a) automatically is still an open problem. The back propagation (BP) neural network has nonlinear recognition and self-learning ability, and is a common intelligence method in mid- to long-term runoff forecasting. But the BP neural network has its inherent defects, such as being prone to oscillation, easily falling into local minima, slow convergence, and the initial value, the threshold value and the number of neurons in the hidden layer are difficult to determine (Shi Citation2009, Maier et al. Citation2010).

The fuzzy neural network forecasting method is based on fuzzy set theory, artificial intelligence and many historical data. Historical data are properly segregated and given appropriate membership functions. Fuzzy neural network forecasting is good at solving the problem of stationary random processes, but poor in the forecast of volatile random questions (Chang et al. Citation2002, Jain and Kumar 2007). Markov models are stochastic models that capture variability in a process through time. Markov chain and hidden Markov models are probably the simplest models that can be used to model sequential data, i.e. data samples that are not independent of each other (Bolch et al. Citation2006). Markov chain models have been successfully applied to model daily rainfall processes (Marshall et al. Citation2004). Because of the “no backward effect” property of the Markov chain (Jothityangkoon et al. Citation2000, Kottegoda et al. Citation2000, Hlynka Citation2007, Jackson Citation2009, Li et al. Citation2010, Pachet et al. Citation2011), it is good at solving prediction problems with volatile strong random processes. Hydrological systems have the characteristics of a stationary random change tendency and nonstationary random processes, so we can combine the advantages of the fuzzy neural network forecasting method and the Markov forecasting model, which can form a fuzzy neural network–Markov prediction method, to deal with the forecasting of hydrological systems.

As mentioned above, the crucial and most difficult point is the development of effective models to reduce error accumulation and increase the accuracy of mid- to long-term runoff forecasts (Yang et al. Citation2013). In view of this, the main purpose of this paper is to develop an ideal point fuzzy neural network (NFNN)–Markov hybrid algorithm (NFNN-MKV) to estimate and predict mid- to long-term (annual and monthly) runoff using the Si Quan Reservoir on the Han River in China as one case study, and Weijiabao (Wei et al. Citation2012, Citation2013) on the Weihe River in China as one comparison case study. The main objectives include:

developing an NFNN-MKV hybrid model that can predict mid- to long-term runoff using the annual data and monthly data of the previous year and month to improve the accuracy of mid- to long-term serial prediction;
applying this model to simulate annual runoff and monthly runoff of the Si Quan Reservoir, and simulate instream flow at Weijiabao on the Weihe River in China;
comparing the fitting and prediction performances of the NFNN-MKV hybrid model with the traditional fuzzy neural network model, the Markov prediction model, the wavelet–ANN model (Wei et al. Citation2012, Citation2013) and the SVR model (Behzad et al. Citation2009);
investigating network generalization improvement of the ideal point fuzzy neural network.

This paper is organized as follows. Section 2 presents the proposed approach for mid- to long-term runoff forecasting, and provides the different criteria used to evaluate the forecasting accuracy. Section 3 provides the simulation results from a case study based on the Si Quan Reservoir on the Han River in China. Section 4 provides the simulation results from one comparison case study based on Weijiabao on the Weihe River in China. Section 5 gives the conclusions and discussion.

2 Materials and methods

The proposed approach is based on the combination of an NFNN and a Markov hybrid algorithm. The NFNN is used to carry out suppositional prediction, and the Markov algorithm is employed to improve the prediction accuracy.

According to the clustering principle, the percentage error sequence of the suppositional prediction is divided into a number of state intervals, then the transition probability matrix and the frequency probability matrix are identified and calculated, and the state transition probability of the Markov chain can be obtained. Finally, selecting the following state point with the maximum probability and the change rate of the following state, according to the change rate and data of the previous state, data for the following state can be calculated.

2.1 New fuzzy neural network algorithm and structure

Since the runoff forecasting procedure follows the dynamics of a nonlinear system, a common approach is to configure and train a neural network to represent the nonlinear autoregressive model structure. It is a natural perception to reflect the dynamic nature of the problem by sequential information processing. A major factor affecting the neural model prediction accuracy is the data coding method. The literature (Andrews et al. Citation2001, Aqil et al. Citation2007) has pointed out that the acquisition of expert knowledge and the parameter initialization capability constitute the two major advantages of FNN over ANN. In this paper, we develop the ideal point fuzzy neural network with Markov algorithm technique to improve the accuracy of the mid- to long-term serial prediction. This fuzzy system, called NFNN, combines the fuzzy inference principles with the neural network structure and learning abilities into an integrated neural-network-based ideal point fuzzy decision system. Its data coding method is called data fuzzification encoding (FE). According to the FE technique, each data value is represented as the mean value of a fuzzy pattern of excitation over several nodes at the network input and output. The reverse procedure is applied at the network output to decode the values back into the original variable range. Also, decoding of the network output using the FE method is performed by computing a weighted summation of the node excitations.

NFNN is a class of adaptive multilayer feedforward networks, which can be applied to nonlinear forecasting, whose past samples are used to forecast the sample ahead. NFNN incorporates the self-learning ability of NN with the linguistic expression function of fuzzy inference (Kasabov and Song Citation2002, Rong et al. Citation2006).

The NFNN architecture comprises an input layer, a hidden (interface) layer and an output layer. The NFNN network is composed of three layers. Each layer contains several nodes described by the node function. Its details are given as follows.

Assume there are n training sample sets, and m characteristic values of each sample. The predictors’ eigenvalue matrix is as follows:

(1)

a_ij is the ith predictor’s characteristic value of the training sample j.

The sample set is made up of n prediction objects. Its feature vector is:

(2)

The prediction object’s membership formula is:

(3)

where min b_j and max b_j are the minimum and maximum characteristic values of the prediction object.

The positively correlated predictors’ relative membership formula is:

(4)

The negatively correlated predictors’ relative membership formula is:

(5)

where min a_ij and max a_ij are the ith predictor’s minimum and maximum characteristic values of the training sample j.

The generalized delta rule is applied to adjust the weights of the feedforward networks, thus minimizing a predetermined cost error function. The weight adjustment rule is deduced by using a Sugeno-type function, and its expression is as follows:

(6)

The weights of the output layer are:

(7)

The weights of the hidden layer are:

(8)

The fuzzy optimization model is used as the activation function (Shi Citation2010a). The weights are as follows:

The weight of the hidden layer k is:

(9)

where ; η is the learning efficiency; α is the momentum operator; n is the number of the iteration; δ_{k j} is the error signal of the hidden layer.

The weight of the output layer is:

(10)

where ; δ_hj is the error signal of the output layer.

Thus, the output formula of the hidden layer k can be introduced by the model of the fuzzy ideal point, and its expression is as follows:

(11)

where j is sequence number of the sample; r_ij is the input layer’s input elements; ω_ik are the connection weights between the layer i and the layer k; d_kj are the output values of the layer k.

The output formula of the output layer h can be introduced by the model of the fuzzy ideal point, and its expression is as follows:

(12)

where d_kj are the input values of the hidden layer; ω_kh are the connection weights between the layer k and the layer h; d_hj are the output values of the layer h.

Thus, an ideal point fuzzy neural network is functionally equivalent to an adaptive Sugeno-type fuzzy inference system network.

2.2 Markov prediction algorithms

A Markov process is a special, perhaps the most important, subclass of a stochastic process (Bolch et al. Citation2006). In particular, a stochastic process provides a relation between the elements of a possibly infinite family of random variables. A series of random experiments can thus be taken into consideration and analysed as an ensemble. shows a schematic of the Markov prediction method.

The core of the Markov prediction method is “no backward effect”; the object must pass the change of state process from one random state to another state. The previous state, following state and state transition probability have the following relation:

Figure 1. Explanatory diagram of the Markov prediction algorithm, where S_n₋₁ is the previous state, S_n is the following state, P is the state transition probability matrix, and S_n = S_n₋₁P.

Figure 1. Explanatory diagram of the Markov prediction algorithm, where Sn−1 is the previous state, Sn is the following state, P is the state transition probability matrix, and Sn = Sn−1P.

(13)

where S_n is the following state, S_n₋₁ is the previous state, P is the state transition probability.

According to expression (13), two results can be derived: (a) if the state transition probability is known, it can predict the following state result according to the information of the previous state; (b) it can predict the change result of the next or the next several periods according to the information of the current state and the state transition probability.

Starting with state i, the Markov chains will go to some state j (including the possibility of j = i), the one-step transition probabilities p_ij are usually summarized in a non-negative number. Usually, the stochastic transition matrix P is as follows:

(14)

where j and i are the states of state space, i = 1, 2, …, n; j = 1, 2, …, n; p_ij is the state transition probability, and , where .

Assuming the number of state variables is n, state variables are denoted as S₁, S₂, …, S_n. The state transition probability is that the state S_i goes to the state S_j in the step r. The transition probability matrix of r is as follows:

(15)

Repeatedly applying one-step transitions generalizes immediately to r-step transition probabilities. The r-step transition probability matrix can be computed by the (r − 1)-fold multiplication of the one-step transition matrix by itself. So, the transition probability matrix of step r can be concluded:

(16)

where is the transition probability matrix of the step r. is the transition probability matrix of the step r − 1. is the first transition probability matrix, which is usually denoted by P.

2.3 Hybrid approach

In this section, the algorithm used to implement the proposed approach is described step-by-step.

Step 1: Suppositional prediction. According to historical data, carry out the object’s suppositional prediction with the fuzzy neural network forecast method; find the percentage error sequence of the object.

Step 2: The percentage errors are classified according to the classification of percentage errors; the percentage error sequence of the object is divided into a number of state intervals.

Step 3: Form the state transition probability matrix based on Step 2.

Step 4: According to the previous state of history point based on Step 3, predict the most likely state and solve the change rate of the following state (the next forecast point).

Step 5: Calculate the prediction value of (the following state) the next point, according to the change rate of Step 4 and the actual value of the previous state.

2.4 Model evaluation

Since the developed models are applied for management and planning, appropriate model evaluation methods are essential. The accuracy of developed models can be assessed using many measures. In this study, the following error statistics are used to evaluate the models. The criteria for comparing the performance are the percentage error (PE), mean absolute percentage error (MAPE), absolute percentage error (APE), mean squared error (MSE), the maximum APE (Max-APE), the minimum APE (Min-APE) and forecast accuracy rate (FAR) in this paper, which indicate the accuracy of recall.

These criteria are defined as follows:

(17)

(18)

(19)

(20)

(21)

where is the actual value, is the forecast value, and n is the total number of values predicted.

3 Application of hybrid algorithm to the Si Quan Reservoir

3.1 Results of annual runoff

3.1.1 Data and parameters

The Si Quan Reservoir is not entirely an annual regulating reservoir. The annual distribution of runoff of the Si Quan Reservoir is as much as 75–80% in summer, about 10–15% in spring, and only about 5% in winter. The main flood season is July to September and the flood water is abundant. The annual maximum flow appears in July and September; the flood runoff of the Han River has obvious characteristics of bimodal type. The river discharge also has clear seasonal character, and the highest discharge occurs from July to October and the lowest discharge occurs from December to March. So, the neural network model’s input values are the previous history data, such as rainfall, annual inflow and average inflow for 5–10 months, and the output values are predicted annual inflow.

According to the characteristics of runoff in the Han River, setting the training sample sequence, the training samples include rainfall, annual inflow and the average 5–10 monthly inflow for years 1954 to 1987 for the Si Quan Reservoir, and the test samples have data from years 1988 to 2008. Applying the fuzzy neural network, we forecast the annual inflow of the Si Quan Reservoir from years 2009 to 2013.

In this context, we used the previous annual data (rainfall, the average 5–10 monthly inflow and annual inflow) as inputs and the following annual data (annual inflow) as the targets for network training ().

New fuzzy neural network–Markov model and application in mid- to long-term runoff forecast

ABSTRACT

1 Introduction

2 Materials and methods

2.1 New fuzzy neural network algorithm and structure

2.2 Markov prediction algorithms

2.3 Hybrid approach

2.4 Model evaluation

3 Application of hybrid algorithm to the Si Quan Reservoir

3.1 Results of annual runoff

3.1.1 Data and parameters

Table 1. Data division for model training, testing and prediction. P: input vectors; T: target vectors; Ri: signal, i.e. observed rainfall dataset; AMi: signal, i.e. observed average 5–10 monthly inflow dataset; Ai: signal, i.e. observed annual inflow dataset.

3.1.2 Results of NFNN

Table 2. Actual annual inflow and prediction testing annual inflow of the Si Quan Reservoir and APE of NFNN.

Table 3. Actual annual inflow and predicted annual inflow for the Si Quan Reservoir and APE of NFNN.

Table 4. Evaluation of the NFNN model performance.

3.1.3 Prediction process of NFNN-MKV

Table 5. Division of the status space.

Table 6. State transition probability matrix R.

3.1.4 Results of NFNN-MKV

Table 7. Actual annual inflows and predicted annual inflows for the Si Quan Reservoir and percentage errors (PE) of the NFNN-MKV model from 1988 to 2013.

Table 8. Evaluation results of the NFNN-MKV, Markov and FNN model performances from 1988 to 2012.

3.2 Results of monthly runoff

3.2.1 Monthly data and parameters

Table 9. Data division for model training, testing and prediction. P: input vectors; T: target vectors; Ri: signal, i.e. observed rainfall dataset; Ai: signal, i.e. observed monthly inflow dataset.

3.2.2 Monthly runoff prediction process of NFNN-MKV

3.2.3 Prediction results of NFNN-MKV

Table 10. Actual monthly inflows and predicted monthly inflows for the Si Quan Reservoir and percentage errors (PE) of three forecasting models from January 2012 to November 2013.

Table 11. Evaluation results of the NFNN-MKV, Markov and FNN model performances from January 2012 to November 2013.

Table 12. Forecast monthly inflow accuracy rate (FAR) of NFNN-MKV model for the Si Quan Reservoir.

4 Application comparison of hybrid algorithm

4.1 Data and parameters

4.2 Simulation results

Table 13. Evaluation of the NFNN-MKV, WNN and SVR model performances for the whole training stage from January 1956 to December 1990.

Table 14. Evaluation results of the NFNN-MKV, WNN and SVR model performances in prediction testing stage from January 1991 to December 2000.

Table 15. Evaluation results of the NFNN-MKV, WNN and SVR model performances for the forecasting stage from January 2001 to December 2013.

5 Conclusions

Disclosure statement

Additional information

Funding

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date