Original Articles

HYBRID GREY RELATIONAL ARTIFICIAL NEURAL NETWORK AND AUTO REGRESSIVE INTEGRATED MOVING AVERAGE MODEL FOR FORECASTING TIME-SERIES DATA

Pages 443-486 | Published online: 28 Apr 2009

Abstract

The aim of this study is to develop a new hybrid model that combines a linear and a nonlinear model for forecasting time-series data. The proposed model (GRANN_ARIMA) integrates a nonlinear grey relational artificial neural network (GRANN) and a linear autoregressive integrated moving average (ARIMA) model, adding new features: grey relational analysis to select the appropriate inputs, and an altered hybridization sequence. To validate the performance of the proposed model, small- and large-scale data sets are used. The forecasting performance is compared with several models: individual models (ARIMA, multiple regression, GRANN), several hybrid models (MARMA, MR_ANN, ARIMA_ANN), and an artificial neural network (ANN) trained using a Levenberg-Marquardt algorithm. The experiments show that the proposed model outperforms the other models, with 99.5% forecasting accuracy for small-scale data and 99.84% for large-scale data. The empirical results demonstrate that the GRANN_ARIMA model provides a better alternative for time-series forecasting due to its promising performance and its capability to handle both small- and large-scale time-series data.

Time-series forecasting is an important area of forecasting. It involves analyzing past observations of the same variable to examine the underlying relationships and to develop an appropriate model to represent them. This modeling approach is recommended only when little knowledge is available or when there is no satisfactory explanatory model that relates the prediction variable to other explanatory variables (Zou, Xia, Yang, and Wang 2007).

Time-series forecasting has been applied widely in many different fields such as economics, sociology, and science. Forecasting methods can be broadly divided into two categories: statistical and artificial intelligence (AI)-based techniques. Box-Jenkins or autoregressive integrated moving average (ARIMA), multiple regression, and exponential smoothing are examples of statistical methods, while AI paradigms include fuzzy inference systems, genetic algorithms, neural networks, and machine learning (Zhang, Patuwo, and Hu 2001). Statistical methods are usually associated with linear data, while neural networks are usually associated with nonlinear data. Statistical methods have been used successfully in time-series forecasting for several decades. As well as being simple and easy to interpret, statistical methods also have several limitations. A major limitation is that they are purely linear, model-driven approaches: the model form must be assumed before fitting the available data, and prior knowledge about the relationships between inputs and outputs is highly desirable.

With the aim of improving the forecasting performance for nonlinear systems, nonlinear statistical time-series models have been proposed, such as the bilinear model, the threshold autoregressive model (TAR), the smooth transition autoregressive model (STAR), the autoregressive conditional heteroscedasticity model (ARCH), and the generalized autoregressive conditional heteroscedasticity model (GARCH). These models are known as the second generation of time-series models. However, nonlinear models have found only limited success over the last two decades, since most of them were developed for particular problems without broad-spectrum applicability to other situations. In addition, the formulations of these models are more complex and difficult to develop compared to linear models.

Hence, a different approach has been proposed and applied successfully in time-series forecasting. The artificial neural network (ANN) has been used to solve numerous time-series forecasting problems involving, for example, stocks, electricity prices, breast cancer, and rainfall-runoff (Abraham and Nath 2001; Delen, Waller, and Kadam 2005; Ganeta, Romeo, and Gill 2006; Hamid and Iqbal 2004; Srinivasulu and Jain 2006). One of the main reasons that ANN performs better than statistical methods is its strength in handling nonlinear time-series data. ANN has also been shown to be effective in modeling and forecasting nonlinear time-series with or without noise (Zhang et al. 2001). Artificial neural networks also do not require any knowledge or prior information about the system of interest. Previous researchers (Ma and Khorasani 2004; Hippert, Pedreira, and Souza 2001; Zhang 2004) have claimed that forecasting is a major application area of ANN.

Zhang, Patuwo, and Hu (1998) compiled substantial results achieved by previous researchers. Even though most published research indicates the superiority of the ANN model over simpler linear models, quite a few studies report conflicting findings on ANN performance. Gorr, Nagin, and Szcypula (1994) and Denton (1995) showed that ANN performed about the same as the linear model. Several other researchers (Brace, Schmidt, and Hadlin 1991; Caire, Hatabian, and Muller 1992; Heravi, Osborn, and Birchenhallc 2004; Taskaya-Termizel and Ahmad 2005) also reported pessimistic findings about ANN in forecasting daily electric load one step ahead. In their studies, ANN was not as effective as the linear time-series model even when the data were nonlinear. However, Kang (1991) showed that ANN consistently performed well compared to ARIMA, and even better as the forecasting horizon increased.

Some researchers (Box and Jenkins 1982; Makridakis et al. 1982; Chatfield 2001; Zhang et al. 2001) have reported that no single forecasting method gives appropriate results in all situations. This is due to the characteristics of the models themselves: the statistical model is usually linear, while ANN is nonlinear, and each performs well on linear and nonlinear data, respectively. Moreover, it is hard to determine whether the time-series problem under study is linear or nonlinear, particularly when dealing with real-world time-series data.

Frequently, real-world time-series problems are not absolutely linear or nonlinear; they often contain both linear and nonlinear patterns. Furthermore, real time-series problems are often affected by irregular and infrequent events that make time-series forecasting more difficult and complicated. Thus, a single model is rarely the best choice for forecasting. Although both ANN and ARIMA models have succeeded in their nonlinear and linear domains, neither can adequately model and predict all time-series: the linear model is unable to deal with nonlinear relationships, while the ANN model alone is not able to handle linear and nonlinear patterns equally well.

With the intention of improving forecasting accuracy, combinations of forecasting approaches have been proposed by many researchers (Bates and Granger 1969; Newbold and Granger 1974; Besseler and Brandt 1981; Chi 1998). Their studies indicate that integrated forecasting techniques surpass individual forecasts.

RELATED STUDIES ON HYBRID MODELS

Hybrid models were introduced by Reid (1968) and Bates and Granger (1969) to overcome the deficiencies of individual models such as statistical methods and AI. Hybrid models merge two or more different methods to improve prediction accuracy; they are also referred to as combined models or ensemble models. Combining models is thought to improve forecasting accuracy because the constituent forecasting methods draw on different information to produce their forecasts. Methods that are based on the same information, or that suffer from the same biases, usually gain only slight improvements in forecasting accuracy compared to methods based on different information (Nikolopoulos, Goodwin, Patelis, and Assimakopoulos 2007). Hybrid methods can be implemented in three different ways: combining linear models, combining nonlinear models, or combining linear and nonlinear models.

In linear hybridization, two or more linear models are combined, using the same or different datasets, to obtain the final forecast value. Shamsuddin and Arshad (1990) used a multivariate autoregressive moving average (MARMA) model to predict natural rubber prices for the Malaysian domestic market. Their work differs from Shamsuddin (1992) in technique, being implemented using different models based on different data sets. The authors combined an autoregressive moving average (ARMA) model and an econometric model (multiple regression), where the ARMA model is used to explain the residual yielded by the multiple regression model. The findings show that the forecasting errors produced by the MARMA model are reduced by 4.5% compared to the individual econometric model. This result indicates that the hybrid model has the potential to improve forecasting accuracy.

Hybrid forecasting has also been implemented using nonlinear models, by hybridizing ANN with a genetic algorithm (GA), fuzzy logic (FL), and rough sets (RS) (Dorganis, Alexandidi, Patrinos, and Sarinevers 2006; Hou, Lian, Yao, and Yuan 2006; Yang, Ye, Wang, Khan, and Hu 2006; Abraham and Nath 2001). The authors found that hybridizing ANN with these methods could improve forecasting accuracy. For example, one study (2007) hybridized ANN and rough sets for short-term load forecasting, using the rough set to reduce the number of attributes prior to ANN learning; in other words, the rough set was employed as a feature selection tool. The outcome showed that the time taken to train the ANN decreased and the forecasting accuracy improved. Another study combined ANN and rough sets to predict air-conditioning load using both univariate and multivariate time-series data, and its empirical results were better than those given by the ANN model alone. The results also indicate that if more relevant data are used, forecasting accuracy can be better. In this form of hybridization, GA, RS, or FL is embedded in ANN as a preprocessing tool to improve ANN forecasting performance by extracting important and significant features from time-series data.

However, most of the hybridization methods proposed in the previous literature (Chi 1998; Shamsuddin and Arshad 1990; Dorganis et al. 2006) have a major drawback: most of them combine similar methods, linear model with linear model and nonlinear model with nonlinear model. In reality, time-series data typically contain both linear and nonlinear patterns. Therefore, neither linear nor nonlinear models alone are sufficient for modeling time-series data, since linear models cannot deal with nonlinear relationships and nonlinear models cannot handle both linear and nonlinear patterns equally well.

To overcome this shortcoming, Zhang (2003) suggested combining linear and nonlinear models, since combining different but relevant methods can improve forecasting accuracy. This structure can help researchers model complex autocorrelation structures in time-series data more efficiently. Furthermore, by using different models, or models that contradict each other significantly, lower generalization variances or errors can be obtained (Zhang 2003). In addition, Goodwin (2000) showed that the forecasting accuracy of combination models based on a simple average depends on the correlation of the forecast errors of the constituent methods: the lower the correlation, the higher the expected accuracy.

Following the encouraging results of Zhang (2003), combining linear and nonlinear models has recently become a popular topic for improving forecasting accuracy. Several studies have been conducted, and their results clearly suggest that the hybrid model is able to outperform each component model used in isolation. For example, Pai and Lim (2005) used a hybrid of a support vector machine and an ARIMA model to forecast daily stock prices. Lu, Niu, and Jia (2004) used a hybrid model to forecast daily load data. Tseng and Tzeng (2002) combined a seasonal autoregressive integrated moving average (SARIMA) model and a backpropagation model to forecast seasonal time-series data; their results showed that the hybrid model produced better forecasts than the SARIMA model or the ANN model alone. Meanwhile, Jain and Kumar (2007) found that more accurate results could be obtained by hybridizing ARIMA models and ANN in forecasting hydrologic time-series. Recently, Díaz-Robles et al. (2008) combined an ARIMA model and ANN for predicting particulate matter in urban areas, specifically in Temuco, Chile. Their findings showed that the performance of the hybrid model (ARIMA and ANN) is better than that of the individual models.

However, several researchers have argued that predictive performance does not always improve when using hybrid models (Armstrong 2001; Taskaya-Termizel and Ahmad 2005; Taskaya-Termizel and Casey 2005; Zou et al. 2007). For example, Taskaya-Termizel and Casey (2005) showed that the individual model outperformed the hybrid on five of the nine datasets used. Recently, Zou et al. (2007) investigated the performance of an individual ANN, an individual ARIMA, and a hybrid ARIMA_ANN in forecasting Chinese food grain prices; their results showed that the individual ANN outperformed both the hybrid ARIMA_ANN and the individual ARIMA. These inconsistent results indicate the need for further research on how to obtain good forecasting results from hybrid linear and nonlinear models. Three weaknesses are observed in the previous studies: the type of data used, redundancy factors, and the implementation of the hybridization sequence. Each of these weaknesses is described as follows.

  1. Most of the studies on time-series forecasting used univariate time-series data, relying solely on one historical series such as previous sales or previous income. However, Hou et al. (2006) showed that considering more significant inputs can improve forecasting accuracy. Furthermore, previous studies have also illustrated that the accuracy of time-series methods can be improved by incorporating multivariate information that affects the future behavior of the series, hence improving the prediction (Makridakis et al. 1982; Makridakis and Wheelwright 1989).

  2. Most of the works utilizing ANN for prediction did not consider the possibility of input redundancy. To an ordinary user, ANN appears to be a black-box processor that has no capability to recognize insignificant inputs. Improper selection and redundancy of inputs can lead to instability that affects prediction accuracy (Li, Li, Li, Wei, and Qin 2003). Several methods have been introduced to eliminate redundant inputs, such as grey relational analysis (Zhang and He 2005), the Markov blanket model, decision trees (Chebrolu, Abraham, and Thomas 2005), genetic programming (Abraham, Grosan, and Martin-Vide 2007), and the adaptive genetic algorithm (Chaivivatrakul and Somhom 2004).

  3. The hybrid sequence in conventional hybrids normally starts with a linear model, followed by a nonlinear model fitted to the residual. This is because ANN tends to overfit when it models linear data. However, Heravi et al. (2004) showed that the linear ARIMA model outperformed ANN in forecasting nonlinear stock data, indicating that the overfitting problem can occur in both the linear and the nonlinear model. The issue, then, is to determine which model suffers from overfitting more acutely.

Hence, this study proposes a new hybrid approach that combines a nonlinear model and a linear model to overcome the drawbacks of previous studies by including additional features: multivariate time-series, feature selection for removing redundant inputs and selecting significant input data, and an altered sequence of combination execution. In this study, grey relational analysis (GRA) is integrated with ANN (GRANN) to remove redundant inputs. Grey relational analysis is employed due to its adaptability in dealing with small or large data sets (Zhang and He 2005; Sallehuddin, Shamsuddin, and Yusof 2008a).

THE PROPOSED METHOD

The aim of this study is to combine a nonlinear model and a linear model with various feature enhancements. In practice, multiple regression (MR) is usually used for modeling multivariate time-series data due to its simplicity (Chu and Zhang 2003; Mentzer and Bienstock 1998; Uysal and Roubi 1999). In this study, we propose GRANN instead of MR. Two experiments are conducted to validate the effectiveness of the proposed methods.

The first experiment compares the performance of a multiple regression (MR) model and GRANN in handling multivariate time-series analysis, and the second examines the accuracy of combining linear and nonlinear time-series forecasting models in predicting multivariate time-series. Although several studies have shown that combinations of a linear and a nonlinear model can improve accuracy, most of them employed univariate time-series data (Lu et al. 2004; Pai and Lim 2005; Voort, Dougherty, and Watson 2005). Therefore, in this study, the experiments are conducted to see whether the same result holds when multivariate time-series data are employed. Furthermore, to investigate the effects of changing the hybrid sequence, two types of hybrid models (Hybrid I and Hybrid II) are developed. Hybrid I consists of MR and ANN using a conventional hybrid sequence, and Hybrid II integrates GRANN and ARIMA with an altered hybrid sequence. Hybrid I is used as a comparative model to evaluate the performance of the proposed hybrid model. As a benchmark, a conventional hybrid method proposed in a previous study (ARIMA_ANN) for handling univariate time-series data is also developed. To find out whether GRANN_ARIMA is the best model for forecasting multivariate time-series data, several comparisons are conducted, as given in the next section.

A Framework of the Proposed Hybrid Methods

Figures 1 and 2 illustrate the frameworks of the conventional hybrid model (Lu et al. 2004; Voort et al. 2005; Zhang 2003), Hybrid I, and the proposed Hybrid II. The conventional hybrid model and Hybrid I use the same hybridization sequence, in which a linear model is first applied to find the linear relationships in the data. Subsequently, ANN is utilized to model the residual derived from the linear model. In this case, we assume that the linear components have been fully identified by the linear model.

FIGURE 1 A conventional hybrid model.


FIGURE 2 A proposed hybrid model.


Consequently, the residual left in the data represents the nonlinear component. To confirm that this assumption holds, the McLeod and Li test is used to verify the nonlinearity of the residual data before the modeling process using ANN is carried out by Hybrid I.
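The conventional sequence can be sketched roughly as follows. This is an illustrative simplification, not the model used in this article: a least-squares AR(1) fit stands in for the full ARIMA stage, and a simple average of recent residuals stands in for the ANN residual model.

```python
# Sketch of the conventional hybrid sequence: fit a linear model first,
# then model its residual with a (placeholder) nonlinear component.
# y_t = L_t + N_t  ->  forecast = linear forecast + residual forecast.

def fit_ar1(series):
    """Least-squares AR(1) fit: y_t ~ c + phi * y_{t-1}."""
    x, y = series[:-1], series[1:]
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var = sum((a - mx) ** 2 for a in x)
    phi = cov / var
    return my - phi * mx, phi           # intercept c, slope phi

def hybrid_forecast(series, window=3):
    c, phi = fit_ar1(series)
    # Residuals left after the linear stage (assumed nonlinear component).
    residuals = [series[t] - (c + phi * series[t - 1])
                 for t in range(1, len(series))]
    # Placeholder for the nonlinear residual model (an ANN in the article).
    nonlinear_part = sum(residuals[-window:]) / window
    linear_part = c + phi * series[-1]  # one-step-ahead linear forecast
    return linear_part + nonlinear_part

series = [10.0, 10.5, 11.2, 11.0, 11.8, 12.1, 12.6, 12.4, 13.0]
print(round(hybrid_forecast(series), 3))
```

Hybrid II reverses this order: the nonlinear stage runs first and the linear model is fitted to its residual.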

Meanwhile, the proposed Hybrid II model and the conventional hybrid model are reversed from each other in terms of the models used and the hybridization sequence. In Hybrid II, GRANN is applied first, followed by the linear model, ARIMA. In this model, GRA is used to select the significant inputs before forecasting is carried out with ANN, and the McLeod and Li test is conducted to verify the linearity of the residual data; neither of these steps is included in conventional methods.

Table 1 summarizes the similarities and differences of each model. Hybrid I is a hybrid model using a conventional approach with multivariate time-series data, while Hybrid II (GRANN_ARIMA) is the proposed approach for forecasting multivariate time-series data. Grey relational analysis is used as a feature selection tool to extract the significant factors that affect the Chinese crop yield and the daily Kuala Lumpur Stock Exchange (KLSE) close price.

TABLE 1 Similarities and Differences Between a Conventional Model and a Proposed Hybrid Model

The following section describes the experimental data set-up, including the data sources, data features, and data splitting.

Experimental Data Set-up

To support the benchmarking analysis of the proposed hybrid model, two different datasets are used. The first sample contains 13 observations and represents small-scale data. These data are obtained from Zhang and He (2005), who specified annual Chinese gross grain crop yields together with their affecting factors. In that previous study, two methods, a neural network and a rough set, were combined to predict the national gross grain crop yield from 1990 to 2003.

Table 2 shows the yearly data for gross grain crop yields and their affecting factors in China from 1990 to 2003. There are 10 factors that affect the production of gross grain crops in China: (a) total agricultural power, (b) electricity consumed in rural areas, (c) irrigation area, (e) consumption of chemical fertilizer, (f) areas affected by natural disaster, (g) budgetary expenditure for agriculture, (h) sown area of grain crops, (l) consumption of pesticide, (m) consumption of agricultural film, and (n) agricultural laborers. The total production of grain crop yield is denoted by (d).

TABLE 2 Grain Crop Yield and Its Affecting Factors

The Kuala Lumpur Stock Exchange dataset contains 200 observations of the daily KLSE close price from 4 January 2005 to 21 October 2005 and represents large-scale data. Table 3 illustrates the fraction of the stock market dataset used in this research. The series are the daily close price for KLSE (Close_KLSE), consumer index (CI), construction index (CoI), gold index (GI), finance index (FI), product index (PI), Mesdaq index (MI), mining index (MinI), plantation index (PlI), property index (ProI), Syariah index (SI), technology index (TI), trading/service index (TSI), composite index (CptI), and industrial index (II).

TABLE 3 Fraction of Daily KLSE Closing Price with 14 Affecting Factors

Essentially, each dataset is divided into two parts: in-sample and out-of-sample data. In-sample data refers to the training dataset and is used exclusively for model development, while out-of-sample data refers to the test data and is used for evaluation on unseen data. In other words, the test data provides an independent measure of how the model might be expected to perform on untrained data. The test data should not be used in model estimation or model selection, to ensure that the real forecasting performance is evaluated.

In ANN, however, the training data are usually divided further into a training and a validation set, where the validation set is used to monitor network performance during training so that early stopping criteria are met if the network attempts to overfit the training data. There is no specific rule in the literature governing how to split the data; it is generally agreed, however, that more data should be allocated to model building and selection. Most previous studies split the data for training and testing using ratios such as 70%:30%, 80%:20%, or 90%:10%. In this study, the data are split 90%:10%.
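As a concrete illustration of the 90%:10% split, with a further validation carve-out for ANN training, consider the following sketch; the 10% validation fraction inside the training set is an assumption for illustration, not a ratio stated in the article.

```python
def split_series(data, train_ratio=0.9, val_ratio=0.1):
    """Chronological split: in-sample (train) vs. out-of-sample (test),
    then a validation tail carved out of the training portion for
    early stopping during ANN training."""
    cut = int(len(data) * train_ratio)
    train_all, test = data[:cut], data[cut:]
    val_cut = int(len(train_all) * (1 - val_ratio))
    return train_all[:val_cut], train_all[val_cut:], test

data = list(range(200))                  # e.g., 200 daily KLSE observations
train, val, test = split_series(data)
print(len(train), len(val), len(test))   # 162 18 20
```

The split is chronological, not random, so that the test set always lies strictly after the training period.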

Grey Relational Analysis (GRA)

Grey relational analysis is a method of analysis proposed in grey system theory, which was founded by Professor Deng (1982, 1989). Grey relational analysis is suitable for solving complicated interrelationships between multiple factors and variables and has been successfully applied to cluster analysis, robot path planning, project selection, prediction analysis, performance evaluation, factor-effect evaluation, and multiple-criteria decisions. The mathematics of GRA is derived from space theory (Deng 1989). The purpose of GRA is to measure the relative influence of a compared series on the reference series. In other words, the calculation of GRA reveals the relationship between two discrete series in a grey space. According to the definition of grey theory, the grey relational grade (GRG) must satisfy four axioms: norm interval, duality symmetric, wholeness, and approachability (Deng 1989; Lin, Lu, and Lewis 2007). The four axioms are described below.

Let X be the grey relational set, x_0 ∈ X represent the reference series, and x_i ∈ X the compared series; x_0(k) and x_i(k) are their values at time k:

  1. Norm interval

  2. Duality symmetric

  3. Wholeness

  4. Approachability: γ(x_0(k), x_i(k)) will increase with the decrease of |x_0(k) - x_i(k)|.

The basic idea of GRA is to judge the relational degree of sequences according to the similarity between the geometric shapes of their curves. Therefore, in this study, GRA compares the geometric relationships between time-series data in the relational space by evaluating the grey relational grade (GRG). In other words, the GRG represents the relative variation between one major factor and all other factors in a given system. If the relative variations between two factors are basically consistent during their development process, then the GRG is large, and vice versa. The GRG is used to show the overall relationship of the system (Lin et al. 2007; Deng 1982).

Basic Algorithm of GRA

There are three main steps in GRA (Figure 3). The first step is data preprocessing, which involves two processes: data representation and data normalization. Initially, X represents the original data series, with x_0 as the reference series and x_i as the parameter series; data normalization then follows. Data normalization is commonly required since the range and units of one data sequence may differ from others. Therefore, the data must be normalized, scaled, and polarized into comparable sequences before proceeding to the other steps. This process is called the generation of grey relations, or standard processing. There are a few equations for data preprocessing in grey relational analysis (Equations (1a)-(1c)).

FIGURE 3 Basic steps in GRA.


If the expectancy is the higher-the-better, then it can be expressed by

If the expectancy is the lower-the-better, then it can be expressed by

However, if there is a definite target value (desired ideal value) to be achieved, then it can be expressed by

where,
In this study, Equation (1a) is employed (Tosun 2005; Sallehuddin et al. 2008b), since the aim of GRA here is to find which factors have the greatest influence on the expected output. For example, in the crop yield data, among the 10 factors initially used, which factors have the highest impact on crop production? The range of the data is adjusted so as to fall within the [0, 1] range.
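Since Equation (1a) is not reproduced above, the sketch below uses the standard "higher-the-better" grey relational generation formula, x*(k) = (x(k) - min)/(max - min), which matches the [0, 1] scaling described in the text; the factor values are hypothetical.

```python
def normalize_higher_better(seq):
    """Standard 'higher-the-better' grey relational generation:
    x*(k) = (x(k) - min) / (max - min), scaling the series into [0, 1]."""
    lo, hi = min(seq), max(seq)
    return [(v - lo) / (hi - lo) for v in seq]

crop_factor = [200.0, 400.0, 300.0, 500.0]   # hypothetical raw factor values
print(normalize_higher_better(crop_factor))
```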

The second step is to compute the grey relational coefficient using Equation (2) (Tosun 2005):

where,

ζ is known as the identification coefficient, with ζ ∈ [0, 1]. Normally ζ = 0.5 is used because it offers a moderate distinguishing effect and good stability (Lin et al. 2007; Sallehuddin et al. 2008b). Furthermore, it has been proven mathematically that changing the value of ζ only changes the magnitude of the relational coefficient; it does not change the rank of the GRG (Chiang, Tsai, and Wang 2000).

Finally, to obtain the GRG, the average value of a grey relational coefficient is computed and is defined as (Tosun Citation2005):

where n is the number of elements in the reference sequence.

The GRG γ_i represents the level of correlation between the reference sequence and the comparability sequence. Based on the calculated GRG values, the grey relational order is constructed according to the size of γ_i. This order then gives the priority list for choosing the series most closely related to the reference series x_0. For example, if γ(x_0, x_i) > γ(x_0, x_j), then the element x_i is closer to the reference element x_0 than the element x_j. Generally, γ_i > 0.9 indicates a marked influence, γ_i > 0.8 a relatively marked influence, γ_i > 0.7 a noticeable influence, and γ_i < 0.6 a negligible influence (Fu, Zheng, Zhao, and Xu 2001).
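Steps two and three can be sketched together as follows, using the standard grey relational coefficient xi(k) = (Δmin + ζΔmax)/(Δ_i(k) + ζΔmax) with ζ = 0.5 and averaging to obtain the GRG. The factor names and values are hypothetical, and Δmin/Δmax are taken per compared series for simplicity (they are often taken over all compared series at once).

```python
def grey_relational_grade(reference, compared, zeta=0.5):
    """Grey relational coefficient per point, averaged into the GRG.
    Both series are assumed already normalized into [0, 1]."""
    deltas = [abs(r - c) for r, c in zip(reference, compared)]
    d_min, d_max = min(deltas), max(deltas)
    if d_max == 0:                      # identical series: perfect grade
        return 1.0
    coeffs = [(d_min + zeta * d_max) / (d + zeta * d_max) for d in deltas]
    return sum(coeffs) / len(coeffs)

reference = [0.0, 0.3, 0.6, 1.0]            # normalized reference series x_0
factors = {                                  # hypothetical compared series x_i
    "factor_a": [0.05, 0.35, 0.55, 0.95],    # tracks the reference closely
    "factor_b": [1.0, 0.2, 0.9, 0.0],        # weakly related
}
grades = {name: grey_relational_grade(reference, xs)
          for name, xs in factors.items()}
ranking = sorted(grades, key=grades.get, reverse=True)
selected = [n for n in ranking if grades[n] >= 0.6]  # drop GRG < 0.6
print(ranking, selected)
```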

Application of GRA

To illustrate, the KLSE closing price dataset is chosen. First, the original data series are represented as a reference series and comparative series. For example, in dataset 2, only one sequence, x_0 (the KLSE close price), is employed as the reference series, and all other sequences serve as comparison or parameter series, x_i; this is called a "local grey relation" measurement. A reference series is sometimes also called an objective series.

Since the comparative series and the reference series differ in units and magnitudes, the grey relational coefficient cannot be computed directly from the original data. First, the original data are normalized using Equation (1a). Then the grey relational coefficient is computed for each data point, and the GRG is calculated for each affecting factor with respect to the selected output. Each affecting factor is then ranked by its GRG value in descending order; the higher the GRG value, the more important the affecting factor. Figure 4 summarizes the processes involved in GRA for finding the relationship between the KLSE closing price and its affecting factors.

FIGURE 4 Best input feature selected by GRA.


Figure 4 shows that the raw data, which contain the unordered affecting factors for the KLSE, are analyzed by GRA to produce a ranking scheme for the KLSE affecting factors in descending order of GRG values. The highest GRG value is ranked first (the composite index), followed by the smaller GRG values; the smallest GRG value belongs to the least important affecting factor, in this case the property index. Based on the recommendation of previous research (Fu et al. 2001), affecting factors with a GRG of less than 0.6 are deleted from the ranking scheme. The figure illustrates that four affecting factors (composite, trading/service, Syariah, and industrial indexes, or SI, TSI, CmpI, and II) are sufficient for predicting the next-day KLSE closing price.

Consequently, Tables 4 and 5 illustrate the affecting factors, yielded by applying GRA, that have the greatest influence on the annual grain crop yield and the daily close price for the KLSE. Based on the calculated GRG values, only six factors (a, b, c, e, h, and l) are selected as inputs to the ANN to predict the grain crop yield. This result is similar to the previous study by Zhang and He (2005).

TABLE 4 Affecting Factors for Grain Crop Yield Selected by GRA

TABLE 5 Affecting Factors for KLSE Close Price Selected by GRA

For the KLSE daily close price, out of the 14 affecting factors observed, only four were identified as the most influential: SI, TSI, CmpI, and II. Therefore, these four factors are used as inputs to the ANN to predict the next-day close price for the KLSE. The application of GRA has reduced the number of inputs that need to be trained in the ANN learning phase. With fewer input nodes, the complexity of the ANN structure is reduced, which speeds up the ANN convergence rate.

In multiple regression analysis, on the other hand, a goodness-of-fit test is used to identify the proper inputs. The significant inputs are identified based on t-values and p-values: if a variable's t-value is less than 1 and its p-value is above some accepted level, such as 0.05, the variable is excluded from the list.

Nonlinearity Test: McLeod and Li Test

The nonlinearity test is implemented to examine the degree of linearity of the time-series data in our study. The McLeod and Li test (2004) is based on the autocorrelations of the squared residuals produced by the ARIMA model using n observations, as shown below:

Q* = n(n + 2) Σ_{k=1}^{q} r̂²_k(ε̂²)/(n − k)

where r̂_k(ε̂²) is the lag-k autocorrelation of the squared residuals ε̂².
Under the null hypothesis of linearity, the above statistic is asymptotically χ²(q) distributed, where q is the number of autocorrelations (see McLeod and Li (2004) for details).
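Assuming the standard Ljung-Box style form of the statistic given above, the test can be sketched as follows; the white-noise residuals are synthetic, so the test should usually not reject linearity.

```python
import numpy as np
from scipy import stats

def mcleod_li(residuals, q=10):
    """Portmanteau statistic on the autocorrelations of the squared residuals."""
    e2 = residuals ** 2
    e2 = e2 - e2.mean()                 # center the squared residuals
    n = len(residuals)
    denom = np.sum(e2 ** 2)
    Q = 0.0
    for k in range(1, q + 1):
        r_k = np.sum(e2[k:] * e2[:-k]) / denom   # lag-k autocorrelation
        Q += r_k ** 2 / (n - k)
    Q *= n * (n + 2)                    # Ljung-Box style scaling
    p_value = stats.chi2.sf(Q, df=q)    # asymptotically chi-square with q d.o.f.
    return Q, p_value

# White-noise (linear) residuals from a hypothetical ARIMA fit
rng = np.random.default_rng(2)
Q, p = mcleod_li(rng.standard_normal(500), q=10)
```

A small p-value would indicate remaining nonlinear structure in the residuals; a large one, as expected here, is consistent with linearity.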

Performance Measurement

To evaluate the performance of the proposed hybrid model, GRANN_ARIMA, four statistical tests are carried out: root mean square error (RMSE), mean square error (MSE), mean absolute percentage error (MAPE), and mean absolute deviation (MAD):

MSE = (1/n) Σ (observed_t − predicted_t)²
RMSE = sqrt((1/n) Σ (observed_t − predicted_t)²)
MAPE = (100/n) Σ |(observed_t − predicted_t)/observed_t|
MAD = (1/n) Σ |observed_t − predicted_t|

where n is the number of forecasting periods, observed_t is the actual time-series value, and predicted_t is the forecast time-series value.
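The four measures can be computed directly; the sketch below assumes MAPE is expressed in percent, and the sample values are illustrative.

```python
import numpy as np

def forecast_errors(observed, predicted):
    """RMSE, MSE, MAD, and MAPE (in percent) over n forecasting periods."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    e = observed - predicted
    mse = np.mean(e ** 2)                          # mean square error
    rmse = np.sqrt(mse)                            # root mean square error
    mad = np.mean(np.abs(e))                       # mean absolute deviation
    mape = 100.0 * np.mean(np.abs(e / observed))   # mean absolute percentage error
    return {"RMSE": rmse, "MSE": mse, "MAD": mad, "MAPE": mape}

errs = forecast_errors([100.0, 200.0, 400.0], [110.0, 190.0, 400.0])
```

Lower values of all four measures indicate better forecasting precision, matching the selection rule used in this study.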

Different statistical tests are used in this study since each method measures error from a different angle. For instance, MSE is the simplest method to implement, but it is scale-dependent and is not appropriate for comparisons between series (De Gooijer and Hyndman 2006). The MAD is a frequently used and intuitive measure of forecast accuracy that measures the magnitude of the deviation between the observed values of a given time series (Jones et al. 2008; Bowermann and O'Connel 1993). The RMSE is usually used as an absolute performance measure (Zhang and Qi 2005). In addition, MAPE is favored and has gained popularity in the forecasting literature because it is not sensitive to the magnitude of the time-series data (Young 1993). These measurements are employed as performance indicators and are calculated on the out-of-sample data. GRANN_ARIMA is the best alternative model for forecasting multivariate time-series data if it gives the lowest values of RMSE, MSE, MAD, and MAPE.

Forecasting precision is better when the value is smaller or approaches zero. If the results are not consistent across these four criteria, we choose MAPE as the benchmark (Makridakis 1993).

PROPOSED HYBRID METHOD

Most real-world problems consist of linear and nonlinear patterns. Even though many methods can be applied to solve time-series forecasting problems, none of them can handle both patterns simultaneously. To tackle these two patterns uniformly well, hybridizing a linear and a nonlinear model is proposed in order to improve the forecasting accuracy.

Conventional Hybrid Method for Univariate Time-Series Data

There are several works related to hybridizing nonlinear and linear models (an ARIMA model as the linear model and an ANN as the nonlinear model) to forecast univariate time-series data (Zhang 2003; Lu et al. 2004; Shouyang, Lean, and Lai 2005; Voort et al. 2005). These authors applied an ARIMA model to the data first, followed by an ANN for the residuals. This hybrid model (Y_t) is illustrated as

Y_t = L_t + N_t

where L_t and N_t are the linear and nonlinear components of the hybrid model Y_t, which uses univariate time-series data. Here, we use subscripts t and tm to represent univariate and multivariate time series, respectively.

Hybrid Model for Multivariate Time Series Data Using Conventional Approach

In this hybridization, MR replaces ARIMA as the linear model, L_tm, and an ANN represents the nonlinear model, N_tm, in handling multivariate time-series data. The proposed Hybrid I model is given as

Y_tm = L_tm + N_tm

Let L̂_tm be the forecast value of the MR model at time t, and let e_tm represent the residual at time t as obtained from the MR model; then

e_tm = Y_tm − L̂_tm

The residual e_tm represents the nonlinear component of the multivariate time-series data. Therefore, an ANN is used to model e_tm, which can be represented as

e_tm = f(e_tm−1, e_tm−2, …, e_tm−n) + Δ_tm     (12)

where f is a nonlinear function determined by the ANN structure along with its connection weights, and Δ_tm is the random error.

Hence, the hybridized forecast given by the Hybrid I model is

Ŷ_tm = L̂_tm + N̂_tm

where N̂_tm is the forecast value of Equation (12).

The Hybrid I model is determined in two steps: first, the MR model is used to analyze the linear part of the multivariate time-series problem; second, the ANN model is built to model the residuals produced by the linear MR model.
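The two-step procedure can be sketched with scikit-learn stand-ins; LinearRegression plays the role of MR and MLPRegressor the role of the ANN, with synthetic data and an illustrative network size.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(3)
X = rng.standard_normal((300, 4))                    # multivariate affecting factors
y = X @ np.array([1.0, -2.0, 0.5, 0.0]) + np.sin(3 * X[:, 0])  # linear + nonlinear parts

# Step 1: MR captures the linear component L_tm
mr = LinearRegression().fit(X, y)
residuals = y - mr.predict(X)                        # nonlinear component e_tm

# Step 2: an ANN models the residuals N_tm
ann = MLPRegressor(hidden_layer_sizes=(8,), max_iter=2000, random_state=0)
ann.fit(X, residuals)

# Hybrid I forecast = linear forecast + predicted residual
hybrid = mr.predict(X) + ann.predict(X)
```

The final forecast is simply the sum of the linear forecast and the ANN's residual forecast, matching the formulation above.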

Hybrid Model for Multivariate Time-Series Data Using the Proposed Hybrid Model (Hybrid II)

In the proposed method, ARIMA is used as the linear model, L_tm, and GRANN is used as the nonlinear model, N_tm. Both models are hybridized as shown by Equation (14):

Y_tm = N_tm + L_tm     (14)

where Y_tm denotes the Hybrid II model, which is composed of a nonlinear GRANN model and a linear ARIMA model. Assume that N̂_tm is the forecast value of the GRANN model at time t, and let e_tm represent the residual at time t as obtained from the GRANN model; then

e_tm = Y_tm − N̂_tm

These residuals represent the linear part of the data, which enables us to employ ARIMA to model them:

e_tm = f(e_tm−1, e_tm−2, …, e_tm−n) + δ_tm     (16)

where f is a linear function modeled by an ARIMA model, and δ_tm is the random error.

Therefore, the hybridized forecast obtained from the Hybrid II model can be written as

Ŷ_tm = N̂_tm + L̂_tm

where L̂_tm is the forecast value obtained from Equation (16) above.

Similar to the Hybrid I model, the Hybrid II model is determined in two steps. The two hybrid models differ, however, in the order of hybridization: in the Hybrid II model, the nonlinear model is implemented first, followed by the linear model.

VARIABLES SELECTION AND MODEL CONSTRUCTION

Each dataset was partitioned into two sets, 90% for training and 10% for validation, based on the ANN model's training requirements. This section explains the design and development of each model used in this study.

Experimental Design of MR Model

The MR methodology is a strategy for identifying the relationship between several independent (predictor) variables and a dependent variable. It is used to analyze the affecting factors in forecasting both the China crop yield and the daily KLSE closing price. The MR methodology consists of five iterative phases:

  1. data collection and preparation

  2. reduction of explanatory and predictor variables

  3. model refinement and selection

  4. model validation

  5. forecast future outcomes based on the known data.

The multiple regression methodology was applied to the training data set to estimate the significant regression coefficients b_0, b_1, b_2, …, b_q of the linear regression:

Y = b_0 + b_1 X_1 + b_2 X_2 + ⋯ + b_q X_q + ε

where the regression coefficients b_0, b_1, b_2, …, b_q represent the independent contributions of each independent variable X_1, X_2, …, X_q to the prediction of the dependent variable Y, and ε is the random error. The selection of the significant independent variables that form the MR equation is based on the t-value and p-value (STATISTICA 6 is used for MR and ARIMA model development).

Development of ARIMA Model

Since its introduction in the 1970s, the Box-Jenkins approach has become one of the most popular methods for time-series forecasting. In an ARIMA model, the future value of a variable is assumed to be a linear combination of past values and past errors. The ARIMA model can be represented as follows:

Y_t = φ_0 + φ_1 Y_{t−1} + φ_2 Y_{t−2} + ⋯ + φ_p Y_{t−p} + ε_t − θ_1 ε_{t−1} − θ_2 ε_{t−2} − ⋯ − θ_q ε_{t−q}

where Y_t is the actual value and ε_t is the random error at time t, φ_i and θ_j are the coefficients, and p and q are integers often referred to as the autoregressive (AR) and moving average (MA) orders, respectively.

The AR part of the model indicates that the future value of Y_t is a weighted average of current and past realizations, while the MA part shows how current and past random shocks affect the future values of Y_t. The ARIMA model is usually written as ARIMA(p, d, q), where p and q represent the AR and MA orders, respectively, and d represents the number of differences.

Based on the procedure proposed by Box and Jenkins (1982), building the ARIMA model involves the following steps:

  1. stationarity test of the time series

  2. identification of the ARIMA (p,d,q) structure

  3. estimation of the unknown parameters

  4. goodness of fit tests on the estimated residual

  5. forecast future outcomes based on the known data.

This five-step model-building process is repeated until a satisfactory model is selected. The final model can then be used for forecasting. Forecasting with the estimated model assumes that the model will continue to hold over the horizon for which the forecast is made. The ARIMA model is basically a model-driven approach, where the model is fitted to the available data.

Development of ANN Model

In this study, we use a multilayered feedforward network with backpropagation (BP) learning. The BP algorithm is an effective and the most popular supervised learning algorithm used in time-series forecasting. Backpropagation networks are well known for their capability to learn complex relationships between sets of variables. Often, however, the phenomenon we are trying to model is very complex and no prior knowledge is available; the network can then learn which input variables are relevant to the input-output relationship. Basically, there are eight steps involved in designing the ANN structure, as presented in Table 6 (Sallehuddin et al. 2008a).

TABLE 6 Eight Steps in Developing ANN Model

To determine the best ANN structure for each dataset, we use the common practice of cross-validation in ANN modeling. The data are divided into two portions, training and testing, and the training sample is further divided into training and validation samples. The ANN model with BP learning was implemented in the C programming language, while the ANN with the LVM algorithm was built using MATLAB.

Design and Development of Hybrid Model: Hybrid ARIMA_ANN Model

The ARIMA and ANN models were combined to use each model's capability to capture different patterns in the time-series data. The methodology consists of two steps: (1) development of an ARIMA model to forecast the time-series data, for example, the KLSE closing price; and (2) development of an ANN model to describe the residuals obtained from the ARIMA model. In this study, MLP architectures with a BP algorithm and a sigmoid activation function are used, and the learning parameters are set to various values with different numbers of hidden nodes.

Design of Hybrid MR and ARIMA (MARIMA)

The combination of the MR and ARIMA model was performed to complement each other's limitations in order to capture different patterns in time-series data. The methodology consists of two steps: (1) the development of an MR model to forecast time-series data, for example, KLSE closing price; and (2) the development of an ARIMA model to describe the residuals obtained from the MR model.

Development of GRANN_ARIMA

The GRANN and ARIMA models were combined to capture different patterns in the time-series data. Here, GRA is employed first to find the significant input factors before the hybrid GRANN_ARIMA is developed. These significant inputs are fed into an ANN model to forecast the time-series data; the cooperation of GRA and ANN is called GRANN. Subsequently, the ARIMA model is developed to describe the residuals obtained from the GRANN model. In this study, an MLP architecture with a BP algorithm and a sigmoid activation function is used, and various values of the learning parameters and different numbers of hidden nodes were tried to obtain better structures.

Design of MR and ANN

The combination of the MR and ANN model was performed in two different steps. In the first step, an MR model was developed using STATISTICA software to forecast time-series data, and in the second step, an ANN model was developed to describe the residuals obtained from the MR model.

EXPERIMENTAL RESULTS

This section will explain the results of the study. Part 1 describes the results produced by Experiment I, and Part 2 discusses the results from Experiment II.

Results from Experiment I

As mentioned earlier, the aim of this experiment is to investigate the capability of the GRANN model in analyzing multivariate time series by searching for the relationships between the independent and dependent variables. Before the modeling process with the ANN, GRA is employed to obtain the significant affecting factors for the production of crop yield in China and the KLSE close price.

Table 7 depicts the structure and learning parameters used in developing GRANN and the errors produced in the training and testing phases for both data samples. For example, based on the calculated GRG values, only six factors (a, b, c, e, h, and l) are selected as inputs to GRANN to predict the grain crop yield. Thus, a three-layer feedforward neural network with a single output unit, 12 hidden units, and 6 input units is used, with the learning rate (α) and momentum (β) set to 0.5 and 0.9, respectively. The network structures and learning parameters are determined by trial and error. In this study, we only consider one-step-ahead forecasting; therefore, only one output node is employed. The RMSE of the best GRANN model is 232 in the training phase and 417 in the testing phase.

TABLE 7 GRANN Structure

Ten independent variables are used to build a multiple regression model for grain crop yield. Several models are built and evaluated based on statistical goodness of fit. However, the final model uses only four independent variables: a, c, h, and n.

The equation for grain crop yield is given as

where

For the KLSE close price, 14 variables are used initially, but only three variables are retained in the final model: the finance index (FI), trading/service index (TSI), and composite index (CptI). The equation for the KLSE close price is as follows:

where

Various statistical tests can be used to validate the models. In this study, R², adjusted R², the standard error S_e, the F-test, and the p-value are used. R² and adjusted R² are the square of the correlation between the observed values of the response variable and the fitted values from the regression equation; they indicate the robustness of the model in explaining the actual behavior of the data. Both models are acceptable since R² and adjusted R² are high with a small standard error S_e (refer to Table 8). Meanwhile, the F-test and p-values are used to determine the significance of the model. From Table 8, the F-test is considered significant since the p-value approaches zero. Therefore, Equations (20) and (21) are considered reasonable for predicting the grain crop yield and the KLSE close price.

TABLE 8 Statistical Test for Grain Crop and KLSE Close Price

Comparing the input parameters used by GRANN and MR in forecasting crop yield, three variables selected by both models coincide, indicating the significance of these variables {a, c, and h}. The result shows that the total power of agriculture (a), irrigation area (c), and sown area of grain crops (h) are the factors most affecting grain crop yield in China, since they are preferred by both MR and GRANN, while the composite index (CptI) and trading/service index (TSI) are the most influential factors affecting the movement of the KLSE close price.

Comparison Between GRANN Model and MR Model

In order to examine the performance of GRANN in forecasting multivariate time series, the results of the MR and GRANN models are compared using RMSE, MSE, MAD, and MAPE. Table 9 gives the performance measures of the GRANN and MR models, and the prediction outputs of each model are shown in Figures 5 and 6. Table 9 shows the predicted error values of crop grain yield for the next 2 years (2002 and 2003) given by the GRANN and MR models. The GRANN model gives the smallest errors (RMSE, MAPE, MSE, MAD) compared to the MR model.

TABLE 9 Forecasting Values of GRANN Versus MR

Figure 5 shows an increasing production of grain crop in 2003. Output from the GRANN model shows a slight increment (0.1%) in grain crop yield in 2003, whereas the MR model predicts that grain crop production will decrease by about 4.8%. The actual values show about a 0.9% increment in crop grain yield in 2003. Hence, we can conclude that the GRANN forecasting result is more reliable than that of MR.

FIGURE 5 Forecasting values for each model (crop yield).


The forecasting errors generated by GRANN and MR in forecasting the KLSE daily close price for the next 14 days are also reported. The values of RMSE, MAPE, MSE, and MAD given by GRANN are smaller than those of MR, and the forecast values given by GRANN are closer to the actual values, as illustrated in Figure 6. This indicates that the forecasting results yielded by GRANN are more accurate than those of MR.

FIGURE 6 Forecasting values for each model (KLSE close price).


From the study, we found that GRANN performs better than MR because the ANN provides a superior methodology to multivariate analytical techniques (Parzen and Winkler 1982; Rao and Ali 2002; Ali and Al-Mahmeed 2002). This conforms with the results of a previous study showing that ANNs are suitable for multivariate data analysis (Perez, Ortega, Gonzalez, and Boger 2004). This result also strengthens our justification for implementing an ANN as the multivariate model in our study.

Results of Experiment II

The second experiment has two main objectives. The first is to compare the performance of the individual and proposed hybrid models in forecasting time series. To achieve this objective, the proposed hybrid model, which consists of a nonlinear and a linear model, is developed. Shamsuddin (1992) proposed a hybrid linear model known as MARMA to predict natural rubber prices, applying MR and ARMA methods to model the residual. In our study, however, we developed two hybrid models, Hybrid I and Hybrid II, both of which pool linear and nonlinear models using multivariate time-series data. In Hybrid I, MR is used as a multivariate tool in conjunction with an ANN to model the residual; in Hybrid II, GRANN is employed as a multivariate tool in cooperation with ARIMA to model the residual. The MR and GRANN models are used as yardsticks to measure the performance of the respective hybrid models.

The second objective is to investigate how the order of implementation affects the performance of the proposed model in forecasting multivariate time-series data. Previous studies by Zhang (2003), Lu et al. (2004), Shouyang et al. (2005), and Voort et al. (2005) applied a linear model to the data first, followed by a nonlinear model (ANN) to predict the residual. In this study, the sequence is reversed: the nonlinear model is applied first, followed by a linear model for the residual. The time-series forecast is obtained by integrating the values from the linear and nonlinear models.

Experimental Results for the Hybrid I Model (MR_ANN)

The results obtained from the nonlinearity test verify that the data are nonlinear, so an ANN is suitable for modelling the residual. Table 10 shows the predicted residual values from the ANN model. The network structure for crop yield is 2-4-2: two input units, four hidden units, and two output units. A 5-11-4 network structure is used for the KLSE close price. The learning and momentum rates are set within [0.5, 0.9].

TABLE 10 Results for the Hybrid I Model

The predicted values from the MR model (from Experiment I) are integrated with the predicted values from the ANN model to obtain the final forecast values for China's total grain crop yield in 2002 and 2003, and 14 days ahead for the KLSE close price. Table 10 shows that the RMSE, MSE, MAPE, and MAD of the Hybrid I model, which combines the MR and ANN models, are lower than those of the MR model alone.

Based on RMSE, the prediction accuracy of the Hybrid I model increases by about 28% for crop yield and 30% for the KLSE close price. As a result, combining two models with dissimilar characteristics (linear and nonlinear) can improve the forecasting accuracy in multivariate time-series analysis.

Result from the Proposed Hybrid II

In this experiment, we alter the sequence of implementation proposed by previous researchers: the nonlinear model is employed first, followed by a linear model for the residual. GRANN is used as the nonlinear model and ARIMA as the linear model. The McLeod and Li test is employed to check the degree of nonlinearity of the residuals. The results of this test verify that the residual data are linear; hence, ARIMA is suitable for modelling the residuals. ARIMA(1, 0, 1) is used to model the crop yield residuals and predict the residuals for the next 2 years, and ARIMA(0, 1, 3) is used to model the residuals of the daily KLSE close price (Table 11).

TABLE 11 Results for the Proposed Hybrid II

The predicted values from the GRANN model (from Experiment I) are integrated with the predicted values from the ARIMA model to obtain the final forecast values for both samples. Table 11 shows the results for the proposed Hybrid II model (GRANN_ARIMA), which gives a smaller prediction error. This indicates that the forecasting of multivariate time-series data improves even further when the GRANN_ARIMA model is adopted. The forecasting accuracy of GRANN_ARIMA in the two samples increases by about 33% and 73%, respectively, compared to the predictions given by the GRANN model.

These experiments prove that the proposed model performs better than the individual models in analyzing and forecasting multivariate time-series data, confirming the previous results obtained from Hybrid I.

A Comparison of Hybrid I, Hybrid II, and Conventional Hybrid Models

The experimental results reveal that both blended methods, Hybrid I and Hybrid II, can improve the forecasting accuracy. The question that arises is which one is better. Consequently, this section compares the forecasting performance of the two hybrid models to examine the effect of varying the order of implementation on multivariate time-series data.

As depicted in Table 12, the RMSE, MSE, MAPE, and MAD given by the Hybrid II model are lower than those of Hybrid I, with the forecasting accuracy increasing by 78% and 86% in the two sample datasets, respectively. These results show that Hybrid II is the better way of combining the models and indicate that altering the sequence of hybridization affects the accuracy. For illustration, the error given by the proposed GRANN_ARIMA approach is lower than that of MR_ANN, which follows the conventional hybridization approach.

TABLE 12 Comparison Performance of Hybrid I, Hybrid II, and the Conventional Hybrid Model

To further evaluate the performance of the GRANN_ARIMA model, a comparison with the conventional hybrid model (ARIMA_ANN) proposed in previous studies is carried out. The ARIMA_ANN model cannot be developed for the crop yield data since the sample size is too small for developing the ARIMA model. From the experiment, we found that the forecasting performance improves when Hybrid II is used instead of the conventional hybrid. This shows that the type of data being used also influences the forecasting performance: the more relevant the data considered in the experiment, the better the performance of the forecasting model. GRANN_ARIMA uses multivariate time series instead of the univariate time series used in ARIMA_ANN.

Figures 7 and 8 summarize the results of examining the effect of altering the sequence of hybridization. It is clear that GRANN_ARIMA performs better than the other two hybrid models: its forecast values are more accurate than those of MR_ANN and ARIMA_ANN, since they approach the actual values. The values of R² and adjusted R² in MR and of the Ljung-Box test in ARIMA are high because they are based on in-sample data; therefore, there is no guarantee that these models will also perform well on out-of-sample data.

FIGURE 7 Comparison of Hybrid I and Hybrid II (crop yield).


FIGURE 8 Comparison of Hybrid I and Hybrid II (KLSE close price).


Table 12 shows that the RMSE, MAD, MAPE, and MSE, calculated on out-of-sample data, for MR_ANN and ARIMA_ANN are worse than those of GRANN_ARIMA in both sample datasets. The varying performance between in-sample and out-of-sample data indicates that overfitting exists, for example, when determining the appropriate parameters to be included in the MR model.

The goodness-of-fit test, which is used to check the adequacy of the MR model, has probably excluded significant factors that should have been considered. According to Zhang (2003), a linear model should be applied first to avoid overfitting in the ANN. However, the results of this study show that overfitting also occurs in linear models, supporting the findings of Heravi et al. (2004). Based on the MR and ARIMA performance, the probability of overfitting in a linear model is higher than in a nonlinear model. This supports our decision to change the sequence of hybridization for better forecasting.

To obtain more accurate forecasting results, the Hybrid II method using the GRANN_ARIMA model is suggested because it works well on both small-scale and large-scale sample data. This may be explained by the facts that: (1) the ANN is capable of dealing equally well with linear and nonlinear data; (2) the ANN is accepted as a universal approximator; and (3) past studies have shown that overfitting in ANNs can be avoided by using cross-validation and optimal learning parameters (Lachtermacher and Fuller 1995).

Comparison of the GRANN_ARIMA (Hybrid II) Model with the Benchmark Model

From the previous experiments, we found that GRANN_ARIMA is the best model for forecasting multivariate time-series data. To further validate our findings, a performance assessment against benchmark univariate and multivariate models is conducted. In this study, the MARMA and ARIMA models are selected because they are linear statistical models. Furthermore, ARIMA, also known as the Box-Jenkins model, has dominated time-series forecasting for more than half a century, and ARIMA modeling has been used in a univariate framework as a sophisticated benchmark for evaluating alternative proposals (Abraham and Nath 2001; Taylor 2003; Zhang 2003). MARMA is used because it served as the statistical modeling technique for a hybrid model in a previous study (Shamsuddin and Arshad 1990; Shamsuddin 1992). Table 13 shows the performance of each model. We found that the accuracy of GRANN_ARIMA is always better than that of both the MARMA and ARIMA models. This result is not surprising: ARIMA is a linear model and MARMA is a combination of two linear models, so they cannot handle nonlinear data as well as a nonlinear model.

TABLE 13 Comparison of GRANN_ARIMA with MARMA and ARIMA

Table 13 shows that the comparison with the individual linear ARIMA model cannot be carried out for the crop yield data. Because the sample is small and the time series is nonstationary, the ARIMA model cannot be developed. Theoretically, the minimum amount of data needed to apply the ARIMA model is about 40 to 50 observations; the model needs more data for tracking the patterns or components in the time series prior to modeling. Before model estimation, the time-series data must be in stationary form; otherwise, differencing must be applied, which further reduces the size of the data. In this study, the data are annual, with approximately 11 periods, and nonstationary. The results show that the proposed GRANN_ARIMA model can also perform well on nonstationary and small time-series datasets.

Comparison of the GRANN_ARIMA Model with ANN Using Second-Order Error (Levenberg-Marquardt)

Table 14 presents the results given by the GRANN_ARIMA model and the ANN model trained with a second-order error algorithm, Levenberg-Marquardt (LVM). For the first dataset (crop yield, small-scale data), the performance of LVM is slightly better than that of GRANN_ARIMA, with a 5% improvement. On the other hand, GRANN_ARIMA performs much better than LVM on the second dataset (the KLSE close price, representing large-scale data), with more than an 80% improvement.

TABLE 14 Comparison of GRANN_ARIMA with Levenberg-Marquardt (LVM)

The experimental results show that the proposed hybrid models always give better results than the individual models, regardless of the hybridization sequence. For instance, the error produced by an individual model is greater than that produced by the corresponding hybrid model (MR versus MR_ANN; GRANN versus GRANN_ARIMA), except for the ANN trained with second-order error (LVM) on the crop yield data (refer to Table 14). However, the differences in RMSE, MSE, MAPE, and MAD between the proposed Hybrid II model (GRANN_ARIMA) and LVM are small, with an accuracy difference of 0.02 percentage points, almost zero. Consequently, we can conclude that the two models are comparable.

Significance Test of the Proposed Model (GRANN_ARIMA)

The experimental results show that the proposed hybrid model, GRANN_ARIMA, outperforms the individual and hybrid models in terms of forecasting accuracy. However, this result alone cannot assure that the GRANN_ARIMA model is significantly better than the comparative models. Therefore, a significance test is conducted between the proposed hybrid model and the comparative models, which include the individual models (ANN, ARIMA, and MR) and the hybrid model proposed by Zhang (2003), ARIMA_ANN. Further testing is carried out between the actual data and the GRANN_ARIMA forecasts to verify the significance of the proposed model. Two hypothesis tests are carried out: a one-sample t-test and a paired t-test.

One Sample t-Test

The basic idea of this test is to compare the mean of the forecasted value obtained from GRANN_ARIMA and the mean of the actual value. The question posed here is whether the proposed hybrid model can represent the underlying relationship in the actual time-series and then be employed as a forecasting model to predict the KLSE closing price and China crop yield. If so, we expect that there is no difference between the mean of the actual data and the forecasted values. Therefore, the following hypotheses are proposed:

  • H 01: There is no difference between the mean of the actual data and GRANN_ARIMA.

  • H 11: There is a difference between the mean of the actual data and GRANN_ARIMA.

The experiments were run using SPSS 10. In this case, the mean of the actual data is taken as the test value: 45484 for the crop yield and 924 for the KLSE closing price. The forecasted mean is compared against these test values to investigate whether GRANN_ARIMA can represent the underlying pattern in the crop yield and the KLSE closing price. If the mean obtained from GRANN_ARIMA is less than the test value, the model underfits the real situation; if it is greater, the model overfits it. If the two are equal, the model is appropriate as a forecasting model since it represents the real situation.
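The one-sample procedure above can be sketched in a few lines. This is a minimal illustration only: the paper used SPSS 10, and the forecast series below is synthetic, not the paper's data; only the crop-yield test value (45484) comes from the text.

```python
# A minimal sketch of the one-sample t-test described above. The paper used
# SPSS 10; here scipy.stats.ttest_1samp plays the same role. The forecast
# series below is synthetic and purely illustrative, not the paper's data.
import numpy as np
from scipy import stats

test_value = 45484  # mean of the actual crop-yield data, used as the test value

rng = np.random.default_rng(0)
# Stand-in for the GRANN_ARIMA forecasts over the test period.
forecasts = test_value + rng.normal(loc=-200.0, scale=1200.0, size=30)

t_stat, p_value = stats.ttest_1samp(forecasts, popmean=test_value)

# H01 (no difference between the forecast mean and the actual mean) is
# accepted when p > 0.05, equivalently when the 95% confidence interval of
# the mean difference includes zero.
decision = "accept H01" if p_value > 0.05 else "reject H01"
print(f"t = {t_stat:.3f}, p = {p_value:.3f} -> {decision}")
```

The decision rule mirrors the paper's: a p value above 0.05, or a 95% confidence interval that includes zero, means the forecast mean is not significantly different from the actual mean.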

For the crop yield, the mean difference between the actual data and the forecasted values is −211.75 (refer to Table ). The p value is greater than 0.05 (refer to Sig. (2-tailed)); therefore, we accept the null hypothesis H01. The 95% confidence interval ranges from −739.2431 to 315.7531 and includes zero. Therefore, the two means are not significantly different from each other. Consequently, it can be assumed that GRANN_ARIMA can be used to forecast the crop yield since it represents the underlying pattern in the actual data.

TABLE 15 One Sample Test for Crop Yield

Table shows that the mean difference between the actual data and the forecasted values is 1.8250. The 95% confidence interval ranges between a negative and a positive value (−0.7373 and 4.3873) and thus includes zero. The p value is also greater than 0.05 (P = 0.148), so H01 is accepted. Hence, the two means are not significantly different from each other. Therefore, it can be assumed that GRANN_ARIMA can be used to forecast the KLSE closing price since the underlying pattern in the actual data is well represented.

TABLE 16 One Sample Test for the KLSE Closing Price

Paired t-Test

Since both models produce predictions from the same data, the paired t-test (two samples for means) is carried out on prediction accuracy to test the hypotheses. The aim of this test is to check whether the means of the predicted values obtained from GRANN_ARIMA differ from those of the other models employed in this study. Therefore, the following hypotheses are proposed:

  • H01: There is no difference between the mean of the other model and GRANN_ARIMA (μ1 = μ2).

  • H02: There is a difference between the mean of the other model and GRANN_ARIMA (μ1 > μ2 or μ1 < μ2).

Here, the other models are ARIMA, ANN, MR, and ARIMA_ANN.
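The pairwise comparison can be sketched as follows. This is an illustrative example only: the series are synthetic stand-ins, and only the general scale of the KLSE closing price (around 924) is taken from the text.

```python
# An illustrative paired t-test in the spirit of the comparison above, using
# scipy.stats.ttest_rel. The test is paired because every model forecasts the
# same observations. All series below are synthetic stand-ins.
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
actual = rng.normal(loc=924.0, scale=25.0, size=40)    # stand-in for KLSE closing prices
grann_arima = actual + rng.normal(0.0, 2.0, size=40)   # proposed hybrid's forecasts
arima_ann = actual + rng.normal(5.0, 6.0, size=40)     # comparative hybrid's forecasts

# H01: mu1 = mu2 (the two models' forecast means do not differ).
t_stat, p_value = stats.ttest_rel(arima_ann, grann_arima)
print(f"t = {t_stat:.3f}, p = {p_value:.3f}")  # reject H01 when p < 0.05
```

A paired test is appropriate here because both forecast series are aligned observation by observation; an unpaired test would discard that pairing and lose power.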

Table shows the statistics and correlations for the crop yield, and Table illustrates the results of the paired t-test. From Table , it is clear that the mean of GRANN_ARIMA is higher than those of the other models. Even though the standard error of the ANN model is the lowest and that of GRANN_ARIMA the second lowest, the mean of GRANN_ARIMA is higher and closer to the mean of the actual data, and the two models are highly correlated. GRANN_ARIMA is, however, negatively correlated with the MR model, whose standard error is the highest.

TABLE 17 Statistics and Correlations for Crop Yield

TABLE 18 Paired t-Test for Crop Yield

The results of the paired t-test for the crop yield are shown in Table . Comparing the actual data and the other models against GRANN_ARIMA, we found that P > 0.05 in each case (0.447, 0.059, 0.543). These results indicate that H01 should be accepted, since the 95% confidence interval for each pair ranges between negative and positive values and therefore includes zero. This implies that there is no statistical difference between the mean of the actual data, or of the other models, and the mean of the GRANN_ARIMA predictions. Nevertheless, the mean of GRANN_ARIMA is higher than those of the other models. Therefore, it can be assumed that GRANN_ARIMA is capable of giving a better prediction than the other models.

Table depicts the statistics and correlations obtained from the KLSE closing price. It can be clearly seen that all the models are positively correlated with GRANN_ARIMA except for ARIMA_ANN and ARIMA.

TABLE 19 Statistics and Correlations for KLSE Closing Price

These two models are, respectively, uncorrelated and weakly negatively correlated with GRANN_ARIMA. The ARIMA model gives the smallest standard error of all the models.

Table reports the paired t-test results for the KLSE closing price. Since P < 0.05 for Pair 1, Pair 2, Pair 3, and Pair 4 (refer to Sig. (2-tailed)), H01 is rejected. This indicates a significant difference between the means produced by these models and the mean produced by GRANN_ARIMA.

TABLE 20 Paired t-Test for the KLSE Closing Price

There are, however, two different scenarios. First, when the 95% confidence interval is bounded by negative values, μ1 − μ2 < 0, i.e., μ1 < μ2. In this case, the mean of GRANN_ARIMA is higher than those of the actual data, MR, and ANN. Therefore, it can be assumed that the predictive capability of GRANN_ARIMA is better than that of MR and ANN, and that the underlying pattern in the actual data is well represented by GRANN_ARIMA. Second, when the 95% confidence interval lies in positive values, μ1 − μ2 > 0, i.e., μ1 > μ2 (refer to Pair 4, ARIMA_ANN vs. GRANN_ARIMA), the result is not acceptable since the two models are weakly correlated.

For Pair 5, P > 0.05 and therefore H01 is accepted; the 95% confidence interval is bounded between negative and positive values. This indicates that there is no difference between the means produced by ARIMA and GRANN_ARIMA. However, this result is not meaningful since the two models are weakly negatively correlated. Based on the results from both tables, it can be concluded that GRANN_ARIMA gives a better prediction than the other models.

CONCLUSION

In this study, GRANN_ARIMA is proposed as a new approach for hybridizing linear and nonlinear models. Unlike conventional hybrid models, the proposed model integrates several features: it handles multivariate time-series data, uses GRA as a feature selection technique to remove irrelevant input data, and alters the sequence of hybridization. To verify the effectiveness of the proposed hybrid model, several comparisons have been conducted through experiments on two different types of time-series data at different scales. Our research findings demonstrate several advantages over the previous hybrid model and the well-accepted benchmark individual models, and these can be summarized below:

  1. GRANN performs better than multiple regression in handling multivariate time-series data, owing to the ANN's effectiveness in modeling and forecasting nonlinear time series with or without noise (Chi, Citation1998; Uysal and Roubi Citation1999; Prerez et al. Citation2004). Hence, an ANN is used as the multivariate modeler in our proposed model.

  2. The forecasting accuracy of the proposed hybrid model, GRANN_ARIMA, is better than that of the individual models: GRANN, ARIMA, MR, and the second-order-error model, LVM. This result conforms to previous findings that hybridizing two dissimilar models reduces forecasting error (Zhang, Citation2003; Zhang and He Citation2005).

  3. Altering the sequence of hybridization improves forecasting accuracy: the hybrid model with a changed hybridization sequence (Hybrid II) outperforms Hybrid I and the conventional hybrid.

  4. Altering the sequence of hybridization also decreases the probability of the overfitting problems that occur in the conventional hybrid model, ARIMA_ANN.

  5. The forecast values of GRANN_ARIMA are more accurate than those of the hybrid linear MARMA model, since GRANN_ARIMA can handle both linear and nonlinear patterns in time-series data equally well.

  6. The forecasting error produced by GRANN_ARIMA is the smallest compared to other models that are tested in this study.

  7. Overall, GRANN_ARIMA outperformed the individual and conventional hybrid model in both small-scale data and large-scale data. Therefore, it could be assumed that GRANN_ARIMA is more robust.

  8. GRANN_ARIMA can also predict the behavior of the data better than MR, MARMA, LVM, and the Hybrid I model.

In this study, complex time-series data are explored; the approach can, however, be extended to simple and seasonal time-series data. Although the method has proven effective in practical application through case studies on agricultural and financial data, we believe that its procedure and mechanism have universal significance and can be extended to other application problems in science, engineering, and the social fields. In contrast to individual models, the hybrid modeling process is somewhat complicated and time-consuming. However, this limitation does not seem to be significant given the rapid improvement of computer technologies. In conclusion, the proposed approach for hybridizing linear and nonlinear models, GRANN_ARIMA, can be used as an alternative tool for forecasting time-series data with better forecasting accuracy.

This work is supported by Universiti Teknologi Malaysia, Johor Bahru, Malaysia. The authors would like to thank the Soft Computing Research Group (SCRG) for their continuous support and devotion in making this study a success. Many thanks go to the reviewers for their insightful comments and for suggesting many helpful ways to improve this article.

Notes

na, not applicable; model cannot be developed.

Legend: GRANN_AR: GRANN_ARIMA.

REFERENCES

  • Abraham , A. 2002 . Intelligent systems: Architectures and perspectives, recent advances in intelligent paradigms and applications . A. Abraham , L. Jain , and J. Kacprzyk . eds., Studies in Fuzziness and Soft Computing , pp. 1 – 35 . Germany : Springer Verlag .
  • Abraham , A. , C. Grosan , and C. Martin-Vide . 2007 . Evolutionary design of intrusion detection programs . International Journal of Network Security 4 ( 3 ): 328 – 339 .
  • Abraham , A. and B. Nath . 2001 . A neourofuzzy approach for modeling electricity demand in victoria . Applied Soft Computing Journal 1 : 127 – 138 .
  • Ali , J. and M. Al-Mahmeed . 2002 . Neural networks: Predicting Kuwaiti KD currency exchange rates versus U.S. dollar . Arab Journal of Administrative Sciences 6 : 17 – 35 .
  • Armstrong , J. S. 2001 . Long Range Forecasting: From Crystal Ball to Computer. (2nd ed.) . New York : Wiley .
  • Bates , J. M. and C. W. J. Granger . 1969 . The combination of forecasts . Operational Research Quarterly 20 : 451 – 468 .
  • Besseler , D. A. and J. A. Brandt . 1981 . Forecasting livestocks prices with individual and composite methods . Applied Economics 13 : 513 – 522 .
  • Bowermann , B. L. and R. T. O'Connel . 1993 . Forecasting and Time Series: An Applied Approach . Duxbury Pacific Grove , CA , Boston , Duxbury Press .
  • Box , G. E. P. and G. Jenkins . 1982 . Some practical aspects of forecasting in organization . Journal of Forecasting I pp. 3 – 21 .
  • Brace , M. C. , J. Schmidt , and M. Hadlin . 1991. Comparison of the forecasting accuracy of neural networks with other established techniques. In: Proceedings of the First Forum on Application of Neural Network to Power Systems . pp. 31–35. Seattle , WA,
  • Caire , P. , G. Hatabian , and C. Muller . 1992 . Progress in forecasting by neural networks . In: Proceedings of the International Joint Conference on Neural Networks 2 : 540 – 545 .
  • Chaivivatrakul , S. and S. Somhom . 2004 . Vegetable product forecasting system by adaptive genetic algorithm . TENCON 2004. 2004 IEEE Region 10 Conference , Vol. B , pp. 211 – 214 .
  • Chatfield , C. 2001 . Time Series Forecasting . Boca-raton , FL : Chapman & Hall/CRC Press .
  • Chebrolu , S. , A. Abraham , and J. Thomas . 2005 . Feature deduction and ensemble design of intrusion detection systems. Computers and Security . Elsevier Science 24 ( 4 ): 295 – 307 .
  • Chi , F.-L. 1998 . Forecasting tourism: A combined approach . Tourism Management 19 ( 6 ): 515 – 520 .
  • Chiang , B.-C. , S.-L. Tsai , and C.-C. Wang . 2000 . Machine Vision-Based gray relational theory applied to IC marking inspection . IEEE Transactions on Semiconductor Manufacturing 15 ( 4 ).
  • Chu , C. W. and G. P. Zhang . 2003 . A comparative study of linear and nonlinear models for aggregates retail sales forecasting . International Journal of Production Economics 86 ( 3 ): 217 – 231 .
  • De Gooijer , J. G. and R. J. Hyndman . 2006 . 25 Years of time series forecasting . International Journal of Forecasting 22 : 443 – 473 .
  • Delen , D. , G. Waller , and A. Kadam . 2005 . Predicting breast cancer survivability: A comparison of three data mining methods . Artificial Intelligence in Medicine 34 : 113 – 127 .
  • Deng , J. L. 1989 . Introduction to grey system theory . The Journal of Grey System 1 : 1 – 24 .
  • Deng , J. L. 1982 . Control problems of grey systems . Systems and Control Letters 5 : 288 – 294 .
  • Denton , J. W. 1995 . How good are neural networks for causal forecasting? The Journal of Business Forecasting 14 ( 2 ): 17 – 20 .
  • Díaz-Robles , L. A. , J. C. Ortega , J. S. Fu , G. D. Reed , J. C. Chow , J. G. Watson , and J. A. Moncada-Herrera . 2008 . A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas . The Case of Temuco, Chile, Atmospheric Environment 42 ( 35 ): 8331 – 8340 .
  • Dorganis , P. , A. Alexandidi , P. Patrinos , and H. Sarinevers . 2006 . Time series sales forecasting for short shelf-life food products based on artificial neural networks and evolutionary computing . Journal of Food Engineering 75 : 196 – 204 .
  • Fu , C. , J. Zheng , J. Zhao , and W. Xu . 2001 . Application of grey relational analysis for corrosion failure of oil tubes . Corrosion Science 43 : 881 – 888 .
  • Ganeta , R. , L. M. Romeo , and A. Gill . 2006 . Forecasting of Electricity prices with neural networks . Energy Conversion and Management 47 : 1770 – 1778 .
  • Goodwin , P. 2000 . Correct or combine: Mechanically integrating judgmental forecasts with statistical methods . International Journal of Forecasting 16 : 261 – 275 .
  • Gorr , W. L. , D. Nagin , and J. Szcypula . 1994 . Comparative study of artificial neural network and statistical models for predicting student grade point averages . International Journal of Forecasting 10 : 17 – 34 .
  • Granger , C. W. J. 1993 . Strategies for modeling nonlinear time series relationships . The Economic Record 69 ( 206 ): 233 – 238 .
  • Hamid , S. A. and Z. Iqbal . 2004 . Using neural networks for forecasting volatility of S& P 500 index futures prices . Journal of Business Research 57 ( 10 ): 1116 – 1123 .
  • Heravi , S. , D. R. Osborn , and R. Birchenhallc . 2004 . Linear versus neural network forecasting for European industrial production series . International Journal of Forecasting 20 : 435 – 446 .
  • Hippert , H. S. , C. E. Pedreira , and R. C. Souza . 2001 . Neural networks for short-term load forecasting: A review and evaluation . IEEE Transactions on Power Systems 16 ( 1 ): 44 – 45 .
  • Hou , Z. , Z. Lian , Y. Yao , and X. Yuan . 2006 . Cooling-load prediction by the combination of rough set theory & artificial neural network based on data fusion technique . Applied Energy 83 : 1033 – 1046 .
  • Jain , A. and A. M. Kumar . 2007 . Hybrid neural network models for hydrologic time series forecasting . Applied Soft Computing 7 ( 2 ): 585 – 592 .
  • Jones , S. S. , R. S. Evans , J. L. Allen , A. Thomas , P. J. Haung , S. J. Welch , and G. L. Snow . 2008 . A Multivariate time series approach to modeling and forecasting demand in emergency department . Journal of Biomedical Informatics 4 ( 1 ): 103 – 139 .
  • Kang , S. 1991 . An investigation of the used of feedforward neural networks for forecasting . PhD thesis . Ohio : Kent State University , Kent, OH .
  • Lachtermacher , G. and J. D. Fuller . 1995 . Backpropagation in time series forecasting . Journal of Forecasting 14 : 381 – 393 .
  • Li , Y. , A. Li , R. Li , L. Wei , and S. Qin . 2003. Study on characteristics of biomass pyrolysis gas in view of grey relation analysis and BP neural network. ACTA Energies Solaris Sinica 24(6):776–780.
  • Lin , S. J. , I. J. Lu , and C. Lewis . 2007 . Grey relation performance correlations among economics, energy use and carbon dioxide emission in Taiwan . Energy Policy 35 : 1948 – 1955 .
  • Lu , J.-C. , D.-X. Niu , and Z.-L. Jia . 2004 . A study of short term load forecasting based on ARIMA-ANN. Proceedings of the Third International Conference on Machine Learning and Cybernetics. August, pp. 26–29. Shanghai. .
  • Ma , L. and K. Khorasani . 2004 . New training strategies for constructive neural networks with application to regression problem . Neural Network 17 : 589 – 609 .
  • Makridakis , S. and S. C. Wheelwright . 1989 . Forecasting Methods for Management. , 5th ed. , New York : John Wiley and Sons .
  • Makridakis , S. 1993 . Accuracy measures: Theoretical and practical concerns . International Journal of Forecasting 9 : 527 – 529 .
  • Makridakis , S. , A. Anderson , R. Carbone , R. Fildes , M. Hibdon , R. Lewandowski , J. Newton , E. Parzen , and R. Winkler . 1982 . The accuracy of extrapolation (time series) methods: Results of a forecasting competition . Journal of Forecasting 1 : 111 – 153 .
  • McLeod , A. I. and W. K. Li . 2004 . Diagnostic checking ARMA time series models using squared residual autocorrelations . Journal of Time Series Analysis 4 : 269 – 273 .
  • Mentzer , J. T. and C. C. Bienstock . 1998 . Sales Forecasting Management . Thousand Oaks , CA : Sage Publications .
  • Newbold , P. and C. W. J. Granger . 1974 . Experience with forecasting univariate time series and combination of forecast . Journal of the Royal Statistical Society Series A 137 : 131 – 149 .
  • Nikolopoulos , K. , P. Goodwin , A. Patelis , and V. Assimakopoulos . 2007 . Forecasting with cue information: A comparison of multiple regression with alternative forecasting approaches . European Journal of Operational Research 180 : 354 – 368 .
  • Pai , P. F. and C.-S. Lim . 2005 . A hybrid ARIMA and support vector machines model in stock price forecasting . Omega 33 : 497 – 505 .
  • Parzen , E. and R. Winkler . 1982 . The accuracy of extrapolation (time series) methods: Results of a forecasting competition . Journal of Forecasting 1 : 111 – 153 .
  • Prerez , S. M. , M. H. Ortega , M. L. Gonzalez , and Z. Boger . 2004 . Comparative study of artificial neural network and multivariate methods to classify spanish do rose wines . Talanta 62 : 983 – 990 .
  • Rao , C. P. and J. Ali . 2002 . Neural network model for database marketing in the new global economy . Marketing Intelligence & Planning 20 ( 1 ): 35 – 43 .
  • Reid , D. J. 1968 . Combining three estimates of gross domestic product . Economica 35 : 431 – 444 .
  • Sallehuddin , R. , M. Shamsuddin , and N. M-Yusof . 2008a . Artificial neural network time series modeling for revenue forecasting . Chiang Mai Journal of Science 35 ( 1 ): 1 – 16 .
  • Sallehuddin , R. , S. M. Shamsuddin , and S. Z. M. Hashim . 2008b . Application of grey relational analysis for multivariate time series . Eight International Conference on Int. System Design and Application pp. 432–437. .
  • Shamsuddin , M. N. 1992 . A short note on forecasting natural rubber prices using a MARMA . The Malaysian Journal of Agricultural Economic 9 : 59 – 68 .
  • Shamsuddin , M. N. and F. M. Arshad . 1990 . Composite model for short term forecasting for natural rubber prices . Pertanika 12 ( 2 ): 283 – 288 .
  • Shouyang , W. , W. Y. Lean , and K. K. Lai . 2005 . Crude oil price forecasting with TEI@Imethodology . Journal of Systems Science and Complexity 18 ( 2 ): 145 – 166 .
  • Srinivasulu , S. and A. Jain . 2006 . A comparative analysis of training methods for ANN rainfall-runoff models . Applied Soft Computing 6 : 295 – 306 .
  • Taskaya-Termizel , T. and M. C. Casey . 2005 . A comparative study of autoregressive neural network hybrids . Neural Networks 18 ( 5–6 ): 781 – 789 .
  • Taskaya-Termizel , T. and K. Ahmad 2005 . Are ARIMA neural network hybrids better than single models? . In: Proceeding of International Joint Conference On NNs (IJCN 2005) . Montreal , Canada .
  • Taylor , J. W. 2003 . Short term electricity demand forecasting using double seasonal exponential smoothing . Journal of The Operational Research Society 54 : 799 – 805 .
  • Tosun , N. 2005 . Determination of optimum parameters for multi-performance characteristics in drilling by using grey relational analysis . International Journal of Advance Manufacturing Technology 28 ( 5–6 ): 400 – 455 .
  • Tseng , F.-M. , H.-C. Yu , and G.-H. Tzeng . 2002. Combining neural network model with seasonal time series ARIMA model. Technological Forecasting and Social Change 69:71–87.
  • Uysal , M. and M. Roubi . 1999 . Artificial neural network versus multiple regression in tourism demand analysis . Journal of Travel Research 38 : 111 – 118 .
  • Voort , V. D. , M. Dougherty , and M. Watson . 1996 . Combining Kohonen maps with ARIMA time series models to forecast traffic flow . Transportation Research Part C: Emerging Technologies 4 ( 5 ): 307 – 318 .
  • Xiao , Z. , S.-J. Ye , B. Zhang , and C.-S. Sun . 2007 . BP neural network with rough set for short term load forecasting . Expert Systems with Applications 36 ( 1 ): 273 – 279 .
  • Yang , H. Y. , H. Ye , G. Wang , J. Khan , and T. Hu . 2006 . Fuzzy neural very short term load forecasting based on chaotic dynamics reconstruction . Chaos, Solitons & Fractals 29 : 462 – 469 .
  • Young , P. 1993 . Technological growth curves – A comparison of forecasting model . Technological Forecasting and Social Change 44 : 375 – 389 .
  • Zhang , G. P. 2003 . Time series forecasting using a hybrid ARIMA and neural network model . Neurocomputing 50 : 159 – 175 .
  • Zhang , G. P. and M. Qi . 2005 . Neural network forecasting for seasonal and trend time series . European Journal of Operation Research 160 : 501 – 504 .
  • Zhang , G. P. 2004 . Neural Networks in Business Forecasting . London , UK , Idea Group Publishing Press .
  • Zhang , G. P. , B. E. Patuwo , and M. Y. Hu . 1998 . Forecasting with artificial neural networks. The state of the art . International Journal of Forecasting 14 ( 1 ): 35 – 62 .
  • Zhang , G. P. , B. E. Patuwo , and M. Y. Hu . 2001 . A simulation study of artificial neural networks for nonlinear time-series forecasting . Computers & Operations Research 28 : 381 – 396 .
  • Zhang , Y. and Y. He . 2005 . Study of prediction model on grey relational BP neural network based on rough set . In: Proceedings of The Fourth International Conference on Machine Learning and Cybernetics . Guangzhou , China , pp. 4764 – 4769 .
  • Zou , H. F. , G. P. Xia , F. T. Yang , and H. Y. Wang . 2007 . An investigation and comparison of artificial neural network and time series models for Chinese food grain price forecasting . Neurocomputing , 70 ( 16–18 ): 2913 – 2923 . doi:10.1016/j.neucom.2007.01.009. .
