592
Views
0
CrossRef citations to date
0
Altmetric
Original Articles

A Modular Real-time Tidal Prediction Model based on Grey-GMDH Neural Network

, &

ABSTRACT

Real-time prediction of tidal level is of great significance for activities of human beings in the fields of marine and coastal engineering. However, the disturbance factors of tidal level are very intricate, which deteriorate the tidal prediction accuracy. To improve the accuracy of real-time tidal-level prediction, a modular real-time tidal-level prediction approach is proposed based on the grey group method of data handling (Grey-GMDH) neural network. The modular model is composed of astronomical tide parts caused by celestial bodies’ movement and the nonastronomical tide parts caused by various meteorological and other environmental factors. The GMDH is a polynomial network that is commonly used in prediction and pattern recognition. However, GMDH is sensitive to nondeterministic time series, which would result in low accuracy of prediction. In this study, the grey prediction theory is introduced into the GMDH prediction model to alleviate the unfavorable effects of uncertainty caused by various environmental factors and the adverse effects caused thereby on the prediction accuracy. In this study of tidal prediction, the Grey-GMDH model is used to predict the nonastronomical tide parts, whereas the conventional harmonic analysis model is used to predict the astronomical tide parts. The final prediction result is achieved by combining the estimation outputs of the harmonious analysis model and the Grey-GMDH model. Measured tidal-level data of San Diego tidal station is selected as the testing database. Simulation and experimental results confirm that the proposed approach can achieve real-time predictions for tidal level with high accuracy, satisfactory convergence and stability.

Introduction

Tidal-level prediction is an indispensable activity for the design of coastal constructions such as wharves and harbors. Real-time tidal prediction also has a great influence on the decision-making procedures of vessels or drilling platforms in offshore areas. Accurate real-time tidal prediction is particularly crucial in operation scheduling, such as making navigating plan of ships through shallow water or bridge. Under these conditions, decision, taking into account the under-keel clearance or the water depth under bridge, can be adopted based on the hourly tidal-level forecast. Furthermore, tidal information is a significant influential factor for navigation scheduling of a platform’s operation planning. Consequently, the accurate real-time forecasting of tidal level in the fields of coastal and ocean engineering is an issue of concern.

In recent years, the most popular technique in tidal research is the harmonic analysis method, in which the components of tides can be expressed as the superposition of several sinusoidal constituents determined by the harmonic analysis method. In addition, the harmonic analysis method remains the basis for long-term tidal-level prediction (Lee Citation2006; Lee and Jeng Citation2002). Nevertheless, there are some drawbacks in practical applications of the harmonic analysis method. First of all, the necessary components of harmonic analysis need to be determined by a considerably long period of tidal-level records. However, long-term tidal-level records may not be obtained owing to the high cost of the in situ monitoring data. Second, the only consideration of this prediction model is the effects of celestial bodies and coastal topography like the framework of the coastline and the profile of the seabed. Consequently, it cannot express the complicated time-varying meteorological impacts on tide, which are produced by meteorological elements like sea wind, atmospheric temperature and pressure, sea ice, sea rainfall, etc. In addition, there are some other time-varying factors such as river discharge and fluctuation, which will also have an impact on tidal level at the estuary region. Consequently, the prediction accuracy of the harmonic analysis model is quite low under some circumstances.

Nowadays, with the rapid development of artificial intelligence technology such as neural network and fuzzy logic, the artificial intelligence technology has been widely used in the fields of engineering computation and simulation (Yin and Wang Citation2013; Yin et al. Citation2014) due to its strong nonlinear mapping and self-learning ability. Artificial neural networks (ANNs) have proved to be of versatile utility in engineering optimization and computation fields (Haykin Citation1999), owing to their excellent abilities of nonlinear mapping, generalization and self-learning. The neural network based on error back propagation (BP) is the most popular and practical neural network, and it is widely used in tide forecasting (Lee Citation2004, Citation2008). Variable-structure radial basis function neural network constructed by sequential learning is proposed for tidal prediction (Yin, Zou, and Xu Citation2013). One-day-ahead tide prediction of the west coast of India New Mangalore tide station was carried out using neural networks (Jain and Deo Citation2007); the estimation of monthly mean significant wave heights using ANN and regression methods (Günaydın Citation2008) and the development of a regional neural network were proposed for coastal water-level predictions (Huang et al. Citation2003).

The group method of data handling (GMDH) neural network (Farlow Citation1981, Citation1984) is also referred to as polynomial neural network. GMDH network is a kind of learning model based on heuristic self-organizing theory, which was proposed by AG Ivakhnenko in 1967. It is a kind of local feedforward network commonly used in prediction (Kordnaeij et al. Citation2015; Najafzadeh, Barani, and Azamathulla Citation2014, Citation2013). Unlike the fixed structure of traditional error BP neural network, the GMDH network structure is variable, which is constantly changing in the process of simulation training. The structure of the GMDH prediction model is determined according to the input and output information. The relationship between the dependent variable and the independent variable is obtained based on the regression method. The main mechanism of GMDH is to build an analytical function in a feedforward network based on a quadratic node transfer function, whose coefficients are obtained using the regression method combined with emulation of the self-organizing approach (Farlow Citation1984). Actually, the model parameters of the GMDH network are estimated utilizing the least square algorithm. Hwang (Citation2006) used a fuzzy GMDH-type neural network model for the prediction of mobile communication, and it was proved that the proposed method was suitable for the prediction of complex systems. Srinivasan (Citation2008) used the GMDH network for energy demand prediction, and it was significantly more precise and less labor-intensive than conventional time-series and regression-based models. The GMDH model is the optimal simplified model that possesses higher accuracy and simpler structure than conventional neural network models (Ketabchi et al. Citation2010). However, the conventional GMDH network is sensitive to the nondeterministic time series, which deteriorate its prediction accuracy.

To overcome these drawbacks and facilitate its practical applications, a modular Grey-GMDH prediction model is proposed to improve the performance of the conventional GMDH network. Modularization is the process of dividing the system into several modules with different attributes when analyzing and solving a specific problem (Knoernschild Citation2012). The tidal component is divided into astronomical tide parts and nonastronomical tide parts. The harmonic analysis method (Vladimir Citation2004) is used to predict the astronomical tidal parts, whereas the Grey-GMDH forecasting method is used to predict the nonastronomical tidal parts with strong nonlinearity. The proposed method combines the advantages of the two methods: the harmonic analysis model can achieve long-term, stable astronomical tide forecast and the Grey-GMDH model can complete the nonlinear fitting and prediction of tides with high accuracy. In this study, the grey predicting model is introduced in the GMDH tidal prediction model, which will weaken the impacts of various uncertain environmental factors. In addition, the correlation analysis is utilized to analyze the time series of the tidal-level data for determining the input dimensions of the Grey-GMDH prediction model; the final prediction result is achieved by combining the estimation outputs of the harmonious analysis model and the Grey-GMDH model. The observed tidal-level data of San Diego tidal station is selected as the testing database. The results of the simulation have confirmed that the proposed novel method can provide predictions for real-time tidal level with high accuracy, excellent convergence and satisfactory stability.

The rest of the paper is organized as follows. A brief description about harmonic analysis is given in section 2; the basic principle and structure of the Grey-GMDH model is represented in section 3; the modular prediction model is provided in section 4; simulation results and detailed discussions are displayed in section 5; and conclusions are provided in the last section.

Harmonic analysis

Theoretically explaining, tides are periodic vertical movements of the sea level. The origin of tide is the tide-generating force of celestial bodies in space, which is a combination of the centrifugal and gravitational forces between the earth and the celestial bodies.

The most commonly and widely utilized approach is the harmonic analysis method in practice. The harmonic analysis approach decomposes the complicated tides into a couple of periodic constituents, each of which is generated by a hypothetical celestial body. At moment t, for a definite tide station with the height of tide h(t), it can be expressed as

(1)

where H0 is the mean sea level, fk is the node factor, σk is the angular velocity of tidal components, (v0 + u)k is the initial phase of the tidal components, Hk is the amplitude of tidal components, gk is the time interval from the moments the celestial body passes the upper transit meridian to the moments when high water occurs and Hk and gk are the harmonic analysis constants.

In practical theory, the quantity of tidal components may be quite huge; fortunately, most of the tidal components can be ignored because of their smaller amplitude (Hk) and longer period (gk). In actual calculation, Eq. (1) can be rewritten as

(2)

where H0 is the mean sea level, n is the number of components, hk is the amplitudes of tidal components, ωk is the angular velocity of tidal components and ϕk is the initial phase of the tidal components.

The harmonic analysis method is widely used in tide prediction because of its stable prediction performance and simple calculation process. Although the harmonic analysis method is extensively used for tidal-level prediction, the drawback of the conventional harmonic analysis method is quite apparent: the harmonic constant of this approach requires a large number of measured tidal data, which is a laborious and time-consuming process.

GMDH model

GMDH neural network

GMDH is a technology based on evolutionary computation, and it is based on the dimension of the input variables, the structure of the network model and the network parameters to carry out a series of evolution, heredity, and variation and selection operations (Ivakhnenko Citation1971, Ivakhnenko and Ivakhnenko Citation2000). The typical training GMDH model is shown in .

Figure 1. Structure of the GMDH neural network.

Figure 1. Structure of the GMDH neural network.

When establishing the GMDH network model, the sample data is divided into two parts: training data set and checking data set. For the establishment of the predictive control model, the sample data is divided into training sets, checking sets and predicting sets. One of the remarkable features of the GMDH model is the utilization of external information. The training data set is used for modeling (parameter estimation and structure synthesis), whereas the information of the checking data set is used only when selecting the optimal complexity model. The basic theory of the GMDH model is that the sample data that influences the system are used to generate a number of candidate models, and an optimal complexity model is selected from the candidate model sets based on the external criterion. The multilayer algorithm of the GMDH algorithm is introduced in this study, and the main implementation step of the multilayer algorithm includes two important links: the utilization of internal criteria for establishing a competitive model (intermediate candidate model) and the utilization of the external criteria (selection criterion) for selecting the optimal complexity model. The external criterion of the GMDH algorithm is a quite significant section; taking into account the particularity of the prediction and control (Mozaffari et al. Citation2016; Najafzadeh, Barani, and Hessami-Kermani Citation2015) of the tidal level, the root mean square error (RMSE) is chosen as the standard of the external criterion.

According to the theory of GMDH network, a GMDH model is represented as a large number of neurons in which different pairs of neurons in each layer are joined by a quadratic polynomial and thus generates new neurons in the new layer. For the sake of generating output value y for a certain input vector X = (x1, x2, x3 … xn) as close as possible to its real output value y, the basic definition of the identification problem of the model is to figure out a function f’’ that can be substituted for the realistic function f approximately. Consequently, for a given set of measured data in a multi-input and single-output model:

(3)

The GMDH network can be trained to predict output value y for any certain input vector X = (xj1, xj2, xj3… xjn), that is,

(4)

The simulation purpose of the whole network model is to ensure that the predictive value y can be more close to the measured value y; the square of the differences between the measured data and the predicted data is utilized to determine the GMDH neural network, that is,

(5)

Generally, the relationship between the input and output variables of the GMDH network can be represented by a complex discrete form of the Volterra functional series (Ivakhnenko Citation1971):

(6)

where (6) is the Kolmogorov–Gabor polynomial (Ivakhnenko Citation1971). = (x1, x2, x3 … xM) represents the input vector, y represents the output variable and the number of input variable m is set to 4, which is based on the correlation analysis of the tidal-level data in this study. The transfer functions of the network, which connects neurons in different layers, can be represented in the form of

(7)

The coefficients ai, which are initialized to random number at the beginning of the program in (7), are calculated by the least squares regression means (Amanifard et al. Citation2008). In the fundamental formation of the GMDH algorithm, the linear regression polynomial that optimal fits the measurements (yi) in the form of (6) is established by utilizing all the possibilities of two input variables in the total of n input variables. Several optimal neurons that meet the chosen criterion (external criterion) between the outputs of the first layer are selected as the input into the second layer with all combinations of different pairs of them. The new generation (layer) of the network is obtained by the combination of the lower-order polynomials at each generation (layer) by employing the GMDH algorithm. The basic steps of the self-organizing algorithm are briefly summarized as follows: establishing the input and associated output sample data for the GMDH prediction model and dividing them into learning and checking data sets, determining the dimension of input data and computing coefficients of the regression polynomial for each pair of input variables (xi, xj) together with the corresponding output variables in the learning data sets by using the least square method. Calculating the m(− 1)/2 high-order variables of the initial input vector = (x1, x2, x3 … xM) to evaluate the output y, m is the dimension of the input variables. The learning algorithm produced a large number of new variables (neurons): , , …, (ml ml−1(ml−1 − 1)/2) in the previous generation (layer), where ml denotes the dimension of input variables for the l generation (layer). Each of these new generated variables is evaluated by judging which variable is the optimal estimation of the dependent variable. The columns of the new generated variable are sorted. A random critical value R is determined by model analysis. Each column of with RMSEj > R is eliminated, where j is the number of the network layer. The remaining variables are selected as the input variables for the next new generation (layer). This procedure is repeated until the introduction of new neurons cannot induce an obvious enhancement in the approximation capabilities of the model or the stop criterion is set in advance (Chao, Ferreira, and Liu Citation1988; Widrow and Lehr Citation1990).

Grey-GMDH neural network

The theory of the grey system was proposed by professor Deng (Deng Citation1986). The grey system model is based on the principle of “Grey Box” with small samples and incomplete information, which implies that the available information is not sufficient. The grey model can reveal the inherent regularity of a raw data sequence by using the data-mining and analysis methods. Practical results have proved the effectiveness of the Grey prediction model (GPM) in processing poor information with limited amount of data (Ma and Liu Citation2015; Tien Citation2009). Generally speaking, GPM consists of three essential processes: (1) accumulated generation operation (AGO) by accumulating the raw time-series data; (2) grey prediction based on the accumulated newfound time-series data in GPM, where the GPM is replaced by the GMDH model in this study, respectively; and (3) inverse accumulated generation operation (IAGO), where the time series in GPM, which is substituted for the Grey-GMDH model, is converted back to its original form to realize the predicted result. The GPM (1, 1), with a single variable, is normally the most commonly used forecasting model. The brief description of the prototype of GPM (1, 1) is as follows:

(7)

where x(0)(i) represents the time-series database in primary time series x(0) at time step of i, and the bracketed number in the superscript stands for the order of the GPM.

A newfound time series x(1) is generated by the AGO process:

(8)

where x(1)(k) =(0)(i).

By means of the AGO procedure, the preceding asymmetric and disordered database could become exponentially preformed so that the system performance can be characterized by the implementation of the differential equation. The time-response solution for forecast is subsequently completed by solving the differential equation:

(9)
(10)

Eq. (10) is the solution of Eq. (9).

Finally, the prediction result is achieved by the IAGO process:

(11)

where a and b are completed by means of the linear least squares (LLS) method. (1)(+ 1) and (1)(k) are the prediction results of (1)(+ 1) and (1)(k); and (0)(+ 1) is the prediction result of (0)(+ 1).

The schematic of the GMDH model based on the grey prediction system is shown in .

Figure 2. Schematic of Grey-GMDH prediction systems.

Figure 2. Schematic of Grey-GMDH prediction systems.

Modular tide prediction model

Depending on the origin of the tide, tidal-level prediction can be divided into two parts: astronomical tide parts caused by celestial bodies’ movement and nonastronomical tide parts caused by various meteorological factors such as sea wind and sea wave. Astronomical tide, which is mainly caused by the tidal force of celestial bodies, has a notable time-varying characteristic, whereas nonastronomical tide shows strong nonlinearity. In this study, the conventional harmonic analysis method is implemented to predict the astronomical tide section, and the residual tide section is predicted by the Grey-GMDH prediction model. The residual section is a complicated association caused by the topography as well as hydrological and meteorological factors such as sea wind, sea wave and seawater temperature. The Grey-GMDH prediction model could fit the residual tide section, which shows obvious nonlinear performances accurately. The final prediction model is composed of the harmonic prediction model and the Grey-GMDH prediction model. The composition and the training process of the final modular prediction model are shown in .

Figure 3. Flow chart of the modular tidal prediction model.

Figure 3. Flow chart of the modular tidal prediction model.

In the modular prediction model, the observed tidal data y(t), y(t–1),… y(t–q) is set as the input of the model. Here yH, yH(t–1), …, yH(t–q) is the tidal prediction value obtained by the harmonic prediction method and yR, yR(t-1), …, yR(t-q) is the difference between y and yH. Finally, the ultimate modular prediction model is the combination of the harmonic prediction model and the Grey-GMDH prediction model. Eventually, the predictive tidal level y(+ 1) is the association of yH(t + 1) and yR(+ 1).

Simulation of the tidal-level prediction

Tidal data analysis

The concept of correlation analysis (Young and Shellswell Citation1972) originates from signal processing and analysis; correlation analysis reflects how the correlation between any two values in a time series is changed over time. In addition, autocorrelation analysis depicts the correlation between neighboring variables of the time series. Autocorrelation function and partial autocorrelation function are an effective way of analyzing and dealing with complex time series (Nezli and Li Citation2003; Zhao et al. Citation2014): the autocorrelation function describes the relationship between the adjacent variables of time series, whereas the partial correlation function eliminates the influence of other intermediate variables in time series. In tidal data analysis, the time series of the tidal level is affected by many variable factors that are difficult to be measured by nautical equipment, thereby making it difficult to calculate the contribution to tidal level. Consequently, the correlation analysis method is utilized to analyze the correlation between the time series of tidal level and then to determine the input dimension of the Grey-GMDH prediction model. In this study, a correlation value of 0.5 is selected as a reference standard to determine the input dimension, and the analysis results are shown in and . shows that the autocorrelation coefficient of the tidal-level time-series data is tailing. Meanwhile, the autocorrelation coefficient reaches gradually closer to zero and tends to be stable. Furthermore, shows that the partial autocorrelation coefficient is four-order truncation. To summarize, the correlation analysis demonstrates that the time-series value from t-1 to t-4 moment has a significant relevance with the time-series value of moment t, which can be selected as the input of the modular prediction model.

Figure 4. Autocorrelation analysis of tidal data.

Figure 4. Autocorrelation analysis of tidal data.

Figure 5. Partial autocorrelation analysis of tidal data.

Figure 5. Partial autocorrelation analysis of tidal data.

Experimental measurements of San Diego tidal station over the duration from 1 January 00:00 GMT, 2015, to 7 September 23:00 GMT, 2015, with a time resolution of 1 h, with a total of 6000 pairs of observed tidal-level data, are chosen as the testing and training database. The modular model accomplishes initialization training stage by employing the advanced 4200 samples of measured tidal data, which includes 70% of the training data and 30% of the checking data, and the residual sections are supplied to the modular prediction model for a one-step-ahead prediction. The linear least square algorithm is chosen as the internal criterion of the prediction model, which is the generation principle of the intermediate candidate model. Moreover, the RMSE is chosen as the standard of the external criterion. All the measured tidal-level data applied in this study can be obtained from the website http://co-ops.nos.noaa.gov/.

In this study, the structure of the prediction model, which includes one input layer of four-input-nodes, four hidden layers and one output layer of one-output-nodes, is set to total six layers; the maximum number of neurons in each intermediate layer is limited to 25 in order to prevent the GMDH from tending to produce an exceedingly complicated network when it comes to highly nonlinear models owning to its limited general structure (quadratic two-variable polynomial) (Park et al. Citation2004).

For a further comprehensive comparison of the simulation result, the RMSE, the mean square error (MSE), the mean absolute error (MAE), the standard deviation (SD) and the mean error (ME) are introduced as an evaluation indicator to evaluate the performance of the proposed prediction model. The concepts of RMSE, MSE, MAE, SD and ME are described in Eqs. (12)–(16):

(12)
(13)
(14)
(15)
(16)

where n is the number of sample data, μ represents the arithmetic mean value of the observed data, yk represents the value of the observation and k denotes the value of prediction.

Tidal-level prediction using the harmonic analysis model

The tidal-level prediction of San Diego tidal station using the harmonic analysis prediction method is illustrated in together with the observed ones, which are exhibited on the website of NOAA, with a time resolution of 1 hour.

Figure 6. Location map of the tidal-level station on the California coast of the United States.

(Source: http://co-ops.nos.noaa.gov/)

Figure 6. Location map of the tidal-level station on the California coast of the United States.(Source: http://co-ops.nos.noaa.gov/)

Figure 7. Simulation results using the harmonic analysis method.

(one-step-ahead prediction of San Diego tidal station)

Figure 7. Simulation results using the harmonic analysis method.(one-step-ahead prediction of San Diego tidal station)

It can be clearly seen from that the predicted value of the tidal level falls below the observation value from the beginning to the end, which explains that the tidal level could be affected by unexpected factors of environmental changes, such as the rough sea or heavy sea-surface wind. In addition, there exists a comparatively large sustained deviation with the correlation coefficient (CC) of 0.9973, which is illustrated in . The sustained offset between the predicted and observed tidal levels reaches up to 1 foot, and the prediction error of the harmonic analysis model is in the range of [–0.1,1], which can be seen from the error diagram in . In addition, the distribution center of the prediction error of the prediction model is centered on about 0.4 feet, instead of tending to zero, and the harmonic prediction model also has a larger prediction error, which can be seen from the error histogram analysis in .

Figure 8. Scatter diagram of the measured and predicted results using the harmonic method.

Figure 8. Scatter diagram of the measured and predicted results using the harmonic method.

is the scatter diagram of the measured and predicted results by using the harmonic analysis method. The red line in denotes the measured value of the tidal level and the green mark in represents the predicted value of the tidal level using the harmonic analysis method.

Tidal-level prediction using the GMDH model

is the simulation diagram of the GMDH model. It can be analyzed from that the fitting consistency between the GMDH prediction value and the measured value is obviously better than the harmonic analysis model; the prediction error of the GMDH model is in the range of [–0.4,0.4], which can be seen from the error diagram in . Furthermore, the distribution center of the prediction error of the GMDH model is basically concentrated in about 0. Nevertheless, the error distribution of the GMDH model is not uniform, and the error fluctuation range is relatively larger, which can be seen from the error histogram analysis in .

Figure 9. Simulation results using the GMDH model.

(one-step-ahead prediction of San Diego tidal station)

Figure 9. Simulation results using the GMDH model.(one-step-ahead prediction of San Diego tidal station)

Tidal-level prediction using the modular GMDH model

It can be apparently recognized from that the prediction values produced by the modular GMDH prediction method coincide well with the observed tidal data than the harmonic method or the GMDH method. The modular prediction model consists of two sections: the harmonic analysis prediction model, which takes into consideration the celestial bodies’ movement, and the GMDH prediction model, which considers the residual sections. Consequently, the prediction results of the tidal level are more accurate than merely utilizing the harmonic analysis method or the GMDH method. Furthermore, the prediction error of the modular GMDH prediction model is in the range of [–0.3, 0.3], which can be seen from the error diagram in . It illustrates that the prediction accuracy is significantly improved compared with the direct prediction model of GMDH and the harmonic prediction model. Moreover, the distribution of the error of the modular model gradually tends to be uniform and smooth, the error distribution center is almost 0 and the error distribution is symmetrical with respect to the distribution center.

Figure 10. Simulation results using the modular GMDH model.

(one-step-ahead prediction of San Diego tidal station)

Figure 10. Simulation results using the modular GMDH model.(one-step-ahead prediction of San Diego tidal station)

Tidal-level prediction using the Modular Grey-GMDH model

The 1-h-head tidal-level forecasting results using the Modular Grey-GMDH method are represented in , together with the real observations. It can be noticed from the error diagram in that there is no large continuous offset between the forecasted values and the measured ones. In addition, the forecasted results locate around the actual ones conformably, which can also be obtained from . The closeness between the observed tidal values and the forecasted ones can be further validated by the corresponding higher CC of 0.9993, as indicated in . In addition, the prediction error of the Modular Grey-GMDH model is in the range of [–0.2, 0.2], which can be seen from the error diagram in . Moreover, the error distribution center almost completely tends to 0, the error distribution is more uniform, the center is almost completely symmetrical with respect to the error distribution center and the error fluctuation range is significantly reduced compared to the previous three models, which is illustrated from the error histogram analysis in . In conclusion, this proposed model significantly improves the prediction accuracy of tidal level.

Figure 11. Simulation results using the modular Grey-GMDH model.

(one-step-ahead prediction of San Diego tidal station)

Figure 11. Simulation results using the modular Grey-GMDH model.(one-step-ahead prediction of San Diego tidal station)

Figure 12. Scatter diagram of the measured and predicted results by the modular Grey-GMDH model.

Figure 12. Scatter diagram of the measured and predicted results by the modular Grey-GMDH model.

is the scatter diagram of the measured and predicted results by using the modular Grey-GMDH method, in which the red line denotes the measured value of tidal level and the green mark represents the predicted value of tidal level by using the modular Grey-GMDH method.

To further quantitatively validate the predictive performance of the modular Grey-GMDH prediction model proposed in this study, comparative simulations are conducted by using methods of harmonic analysis, GMDH and modular GMDH. Detailed simulation results based on the same simulation environment, the same simulation parameters and the same measured tidal data of San Diego tidal station are shown in .

Table 1. Tidal-level simulation results of San Diego tidal station.

It is clearly seen from that the corresponding error value of the modular Grey-GMDH prediction model is the smallest one of the four prediction models. As a consequence, the experimental results demonstrate that the proposed modular GMDH model has a higher forecasting accuracy for tidal-level prediction.

Different time spans of time-series prediction will affect the magnitude of prediction error. To further compare the accuracy of the conventional GMDH model and the modular Grey-GMDH model in the prediction of tidal level, simulation experiments of different time spans are carried out based on the same observed tidal-level data. Simulation results are represented in and .

Table 2. Tidal prediction results for San Diego using the conventional GMDH model.

Table 3. Tidal prediction results for San Diego using the modular prediction model.

It can be noticed from that the RMSEs of tidal-level prediction using the conventional GMDH method are 0.2793 and 0.4508 for 1-hour-ahead and 2-hours-ahead forecasting, respectively. In addition, the values of RMSE are in the range of 0.2793 and 1.0370 for 3 or more hours-ahead predictions. Furthermore, the RMSE values of tidal-level prediction by using the modular Grey-GMDH method for 1-hour-ahead and 2-hours-ahead predictions are 0.0926 and 0.1019, respectively, which can be obtained from , and the values of RMSE are in the range of 0.0926 and 0.1209 for 3 or more hours-ahead forecasting, which illustrates that the proposed novel modular prediction method is able to provide stable and reliable prediction results for tidal-level forecast with satisfactory prediction accuracy. In addition, with the increase of the forecasting time interval, the proposed novel method also exhibits higher forecasting accuracy compared with the conventional GMDH method, which can also be obtained from the corresponding error value illustrated in and . Moreover, the time consumption of the whole simulation process has no obvious increase, which validates the high efficiency of the proposed approach.

Performance of tidal-level prediction

To validate the prediction accuracy, universality and generalization ability for longer time scope tidal-level prediction, simulation experiments were carried out based on the observed tidal-level data of six different tidal stations located on the west coast of California in America. Simulation results of multistep tidal prediction based on the same simulation environment and the same simulation parameters are shown in . The time duration of the observed tidal-level data of the six tidal stations is the same as that of the San Diego tidal station. It is revealed in that the proposed approach can achieve stable and accurate tidal predictions for the observed tidal data of the six different tidal stations, which are shown in . Furthermore, the simulation results also illustrate that the proposed modular prediction method is able to provide stable and reliable prediction results for tidal-level forecast with satisfactory prediction accuracy. It is also noticed that the number of network layers of the GMDH optimal complex model is set in advance and the same network parameters are used for tidal prediction of different tidal stations in this study. As a consequence, an optimal method for self-selection of the network layers and further adjustment of the network parameters should enhance the performance of the proposed approach.

Table 4. Tidal prediction results using the modular prediction model based on the Grey-GMDH model.

Conclusion

A modular tidal-level prediction model is proposed based on the combination of the harmonic analysis method and the Grey-GMDH method, which takes advantages of both of them. The harmonic analysis model and the Grey-GMDH model denote the effects of celestial bodies and meteorological elements, respectively. The effectiveness and feasibility of the proposed model are demonstrated by the experimental results of real-time tidal-level prediction. Furthermore, the experimental results also confirm that the proposed model can produce accurate predictions with high operating speed. In this study, simulation and experiment results indicate that the proposed modular Grey-GMDH model could be a suitable tool that can be utilized for real-time tidal-level prediction.

Additional information

Funding

This research was funded by the National Natural Science Foundation of P. R. China [Grant No. 51379002], the Fundamental Research Funds for the Central Universities of P. R. China [Grant Nos. 3132016116 and 3132016314] and Applied Basic Research Project Fund of the Chinese Ministry of Transport [Grant No. 2014329225010].

References

  • Amanifard, N., N. Nariman-Zadeh, M. H. Farahani, and A. Khalkhali. 2008. Modeling of multiple short-length-scale stall cells in an axial compressor using evolved GMDH neural networks. Energy Conversion and Management 49 (10):2588–94. doi:10.1016/j.enconman.2008.05.025.
  • Chao, P. Y., P. M. Ferreira, and C. R. Liu. 1988. Applications of GMDH-type modeling in manufacturing. Manufacturing Systems 7 (3):241–53. doi:10.1016/0278-6125(88)90008-8.
  • Deng, J. L. 1986. Grey forecasting and decision making. Wuhan, China: Central China University of science and Technology Press.
  • Farlow, S. J. 1981. The GMDH algorithm of Ivakhnenko. American Statistician 35 (4):210–15.
  • Farlow, S. J. 1984. Self-organizing methods in modeling: GMDH type algorithms. New York, USA: Statistics Textbooks & Monographs.
  • Günaydın, K. 2008. The estimation of monthly mean significant wave heights by using artificial neural network and regression methods. Ocean Engineering 35 (14–15):1406–15. doi:10.1016/j.oceaneng.2008.07.008.
  • Haykin, S. 1999. Neural networks: A comprehensive foundation. New Jersey, USA: Prentice Hall.
  • Huang, W., C. Murray, N. Kraus, and J. Rosati. 2003. Development of a regional neural network for coastal water level predictions. Ocean Engineering 30 (17):2275–95. doi:10.1016/S0029-8018(03)00083-0.
  • Hwang, H. S. 2006. Fuzzy GMDH-type neural network model and its application to forecasting of mobile communication. Computers & Industrial Engineering 50 (4):450–57. doi:10.1016/j.cie.2005.08.005.
  • Ivakhnenko, A. G. 1971. Polynomial theory of complex systems. IEEE Transactions on Systems Man & Cybernetics 1 (4):364–78. doi:10.1109/TSMC.1971.4308320.
  • Ivakhnenko, A. G., and G. A. Ivakhnenko. 2000. Problems of further development of the group method of data handling algorithms. Part I. Pattern Recognition & Image Analysis 10 (2):187–94.
  • Jain, P., and M. C. Deo. 2007. Real-time wave forecasts off the western Indian coast. Applied Ocean Research 29 (1–2):72–79. doi:10.1016/j.apor.2007.05.003.
  • Ketabchi, S., H. Ghanadzadeh, A. Ghanadzadeh, S. Fallahi, and M. Ganji. 2010. Estimation of VLE of binary systems (tert-butanol+2-ethyl-1-hexanol) and (n-butanol+2ethyl-1-hexanol) using GMDH-type neural network. Chemical Thermodynamics 42 (11):1352–55. doi:10.1016/j.jct.2010.05.018.
  • Knoernschild, K. 2012. Java application architecture: Modularity patterns with examples using OSGi. New Jersey, Upper Saddle River, USA: Prentice Hall Press.
  • Kordnaeij, A., F. Kalantary, B. Kordtabar, and H. Mola-Abasi. 2015. Prediction of recompression index using GMDH-type neural network based on geotechnical soil properties. Soils and Foundations 55 (6):1335–45. doi:10.1016/j.sandf.2015.10.001.
  • Lee, T. L. 2004. Back-propagation neural network for long-term tidal predictions. Ocean Engineering 31 (2):225–38. doi:10.1016/S0029-8018(03)00115-X.
  • Lee, T. L. 2006. Neural network prediction of a storm surge. Ocean Engineering 33 (3):483–94. doi:10.1016/j.oceaneng.2005.04.012.
  • Lee, T. L. 2008. Back-propagation neural network for the prediction of the short-term storm surge in Taichung harbor, Taiwan. Engineering Application of Artificial Intelligence 21 (1):63–72. doi:10.1016/j.engappai.2007.03.002.
  • Lee, T. L., and D. S. Jeng. 2002. Application of artificial neural networks in tide-forecasting. Ocean Engineering 29 (9):1003–22. doi:10.1016/S0029-8018(01)00068-3.
  • Ma, X., and Z. B. Liu. 2015. Research on the novel recursive discrete multivariate grey prediction model and its applications. Applied Mathematical Modelling 40 (7–8):4876–90. doi:10.1016/j.apm.2015.12.021.
  • Mozaffari, A., N. L. Azad, J. K. Hedrick, and A. Taghavipour. 2016. A hybrid switching predictive controller with proportional integral derivative gains and GMDH neural representation of automotive engines for coldstart emission reductions. Engineering Applications of Artificial Intelligence 48 (3):72–94. doi:10.1016/j.engappai.2015.10.013.
  • Najafzadeh, M., G. A. Barani, and H. M. Azamathulla. 2013. GMDH to predict scour depth around a pier in cohesive soils. Applied Ocean Research 40 (2):35–41. doi:10.1016/j.apor.2012.12.004.
  • Najafzadeh, M., G. A. Barani, and H. M. Azamathulla. 2014. Prediction of pipeline scour depth in clear-water and live-bed conditions using group method of data handling. Neural Computing & Applications 24 (3):629–35. doi:10.1007/s00521-012-1258-x.
  • Najafzadeh, M., G. A. Barani, and M. R. Hessami-Kermani. 2015. Evaluation of GMDH networks for prediction of local scour depth at bridge abutments in coarse sediments with thinly armored beds. Ocean Engineering 104:387–96. doi:10.1016/j.oceaneng.2015.05.016.
  • Nezli, N. P., and B. L. Li. 2003. Time-series analysis of remote-sensed chlorophyll and environmental factors in the santa monica–san pedro basin off Southern California. Marine Systems 39 (39):185–202. doi:10.1016/S0924-7963(03)00030-7.
  • Park, H. S., B. J. Park, H. K. Kim, and S. K. Oh. 2004. Self-organizing polynomial neural networks based on genetically optimized multi-layer perceptron architecture. Control Automation & Systems 2 (4):423–34.
  • Srinivasan, D. 2008. Energy demand prediction using GMDH networks. Neurocomputing 72 (1–3):625–29. doi:10.1016/j.neucom.2008.08.006.
  • Tien, T. L. 2009. A new grey prediction model FGM(1, 1). Mathematical & Computer Modelling 49 (7–8):1416–26. doi:10.1016/j.mcm.2008.11.015.
  • Vladimir, C. 2004. Harmonic Analysis. In the proceedings of the IEEE International Conference on Electro/Information Technology 53–58. Milwaukee, USA: IEEE Press.
  • Widrow, B., and M. A. Lehr. 1990. 30 years of adaptive neural networks: Perceptron, madaline, and backpropagation. Proceedings of the IEEE 78 (9):1415–42. doi:10.1109/5.58323.
  • Yin, J. C., and N. N. Wang. 2013. Online grey prediction of ship roll motion using variable RBFN. Applied Artificial Intelligence 27 (27):941–60. doi:10.1080/08839514.2013.848753.
  • Yin, J. C., Z. J. Zou, and F. Xu. 2013. Sequential learning radial basis function network for real-time tidal level predictions. Ocean Engineering 57 (2):49–55. doi:10.1016/j.oceaneng.2012.08.012.
  • Yin, J. C., Z. J. Zou, F. Xu, and N. N. Wang. 2014. Online ship roll motion prediction based on grey sequential extreme learning machine. Neurocomputing 129 (4):168–74. doi:10.1016/j.neucom.2013.09.043.
  • Young, P., and S. Shellswell. 1972. Time series analysis forecasting and control. IEEE Transactions on Automatic Control 17 (2):281–83. doi:10.1109/TAC.1972.1099963.
  • Zhao, Y. J., M. C. Li, G. L. Li, and Y. Y. Cao. 2014. Time series correlation analysis of pollution in marine environment waters. Applied Mechanics and Materials 522-524:52–55. doi:10.4028/www.scientific.net/AMM.522-524.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.