1,720
Views
7
CrossRef citations to date
0
Altmetric
Articles

Forecasting Croatian inbound tourism demand

&
Pages 1046-1062 | Received 21 Jun 2014, Accepted 14 Sep 2015, Published online: 04 Nov 2015

Abstract

The aim of this paper is to present a forecasting model for the overnight stays of foreign tourists in Croatia. Tourism is one of the most important parts of the Croatian economy. It is particularly important in the context of the services sector. Regular and significant surpluses and the consumption of foreign guests are an important element of budget revenues, especially VAT. The ability to forecast the development of inbound tourism demand in a timely manner is crucial for both business decisions and policy-making. We combine the Granger causality test for identifying leading indicators with a grid search of the weights used to construct a composite indicator. An endogenous grid search for data driven weights was employed to minimise the mean absolute percentage error (MAPE) of the out-of-sample forecast. In total, we carried out 7.7 billion out-of-sample regressions in order to find the optimal combination of leading indicator weights. Results indicate that only four out of the 12 identified leading indicators are relevant in explaining variations in inbound tourism demand. The most important leading indicators are: real GDP and imports in Poland and gross wages in the Czech Republic and Slovakia.

JEL classification:

1. Introduction

The aim of this paper is to present a forecasting model for the number of overnight foreign tourist stays in Croatia. Having timely forecasts of inbound tourism demand in Croatia is very important. According to recent research, tourism accounts for 14.7% of the total gross value added of the Croatian economy (Šutalo, Ivandić, & Marušić, Citation2011). In addition, tourism has an impact on almost all segments of Croatian society and makes a huge contribution to overall national identity (Čavlek, Bartoluci, Prebežec, Kesar, & co-authors, Citation2011). Hence, any attempt to research the quantitative dimension of tourism in Croatia is welcome and useful.

In order to assess the model, we have employed the NBER methodology to identify the leading indicators combined with an endogenous weight selection process that minimises the mean absolute percentage error (MAPE) of the out-of-sample forecast of the leading composite index.

Forecasting tourism demand is a very progressive field of tourism research. Modern econometric models such as vector autoregression (VAR), the error correction model (ECM) and the time varying parameter model (TVP) are frequently used in forecasting tourism demand. However, the general impression of all these attempts is that they are only experiments with the main aim of answering the question of whether or not these methods can be useful in forecasting tourism demand.

Attempts at comprehensive research with the aim of modelling an accurate analytical tool for forecasting tourist arrivals, overnight stays or expenditure are very rare in the literature. One of the major reasons for this is the complexity of the phenomenon of tourism demand and the very large number of factors that should be taken into account when forecasting its dynamics.

Kožić’s (Citation2013) dissertation is one of the rare attempts to tackle the issue of forecasting overnight tourist stays in Croatia. The composite leading indicator methodology was used in order to forecast turning points in inbound tourism demand in Croatia. In order to build a leading composite indicator for forecasting turning points of foreign tourist overnight stays in Croatia, Kožić (Citation2013) used economic indicators for the ten most important countries of origin of foreign tourists in Croatia (Germany, Slovenia, Austria, Italy, the Czech Republic, Slovakia, Hungary, the Netherlands, Poland and France), Croatia’s five most important competitors (Greece, Spain, France, Italy and Turkey), and the dynamics of indicators related to extraordinary events such as terrorist attacks or geopolitical crises.

In this paper, we follow Kožić’s (Citation2013) approach in identifying potential leading indicators. However, we have built upon his work and developed an improved methodology for the purpose of minimising statistical errors in forecasting foreign tourist overnight stays in Croatia.

The paper is organised as follows. Section 2 provides a brief overview of the field of forecasting tourism demand with an accent on recent practice in empirical studies and the usage of leading indicators in forecasting tourism demand. Section 3 briefly describes the leading composite indicators and the methodology for their construction. In addition, Section 3 also describes the weights optimisation methodology used in this paper. Finally, the results and implications of this paper are briefly discussed in Section 4.

2. Literature review of forecasting tourism demand

Over the last 50 years, tourism demand has grown rapidly in many countries in the world. Along with the physical growth of tourism demand, a growing interest in tourism research has been recorded. The field of forecasting tourism demand is an especially progressive field in tourism research. Since the pioneering work by Guthrie (Citation1961), followed by Gerakis (Citation1965) and Gray (Citation1966), more than 500 studies on forecasting tourism demand have been published. The general impression of these studies is that tourism demand forecasting practices have followed practices in macroeconomics and finance, which can be considered as paragons of economic forecasting. Novel forecasting techniques from macroeconomics and finance have been applied in studies that deal with tourism demand. This has been especially true in recent years. Comprehensive reviews of tourism demand forecasting have been made by Lim (Citation1997a, Citation1997b, Citation1999), Li, Song, and Witt (Citation2005), Song and Li (Citation2008) and Kožić (Citation2013). The conclusions drawn by these studies can be described as follows:

The analytical skills of authors have grown through the years along with the contents of the reports that they have provided in their papers. For example, simple econometric techniques such as multiple regression and simple time series models were used in earlier studies, whereas more sophisticated techniques such as VAR, ECM and TVP have been used in more recent papers. In addition, in the past authors were not always aware that it is very important to provide diagnostic tests for the models in their studies. Recent studies usually contain all the information on parameters and diagnostic tests.

The aim of earlier papers was mainly to identify the determinants of tourism demand by means of which its dynamics could be forecast. Recent papers have been more focused on the accuracy of forecasting models. Their major aim has mainly been to compare several competing models and to find the one with the best forecasting performance.

In terms of forecasting accuracy, the general rule is the more complex and sophisticated the forecasting model, the better will be its forecasting performance. Hence, it is crucial to pay attention to many different characteristics of tourism demand such as seasonality and non-stationarity. It is also recommended to combine the outputs of several different models to obtain a better forecasting performance (Wong, Song, Witt, & Wu, Citation2007).

The dependent variables in forecasting models are usually tourist arrivals or tourist overnight stays, while explanatory variables are income and prices, which are often combined with other economic and social factors influencing tourism demand.

The methods frequently used in studies of tourism demand forecasting are presented in Table . Although some examples can be found of the application of qualitative methods in forecasting tourism demand, quantitative methods predominate, especially in recent years. Among the quantitative methods, the leading indicators approach can be considered a novel approach in the field of forecasting tourism demand. It is a rare example of a methodology used heavily in macroeconomic forecasting but which has not yet been extensively used in forecasting tourism demand. Papers that apply leading indicators in forecasting tourism demand have only emerged over the past ten years.

Table 1. Frequently used methods in studies on forecasting tourism demand.

The most cited example of the application of leading indicators in forecasting tourism demand is the study by Rossello-Nadal (Citation2001) in which the author applied leading indicators to forecast international visitor arrivals in the Balearic Islands in Spain. The author selected a group of macroeconomic indicators from the tourists’ countries of origin and applied the method of cross-correlation coefficients to calculate their possible correlation with the dependent variable, i.e. international tourist arrivals in the Balearic Islands. After selecting the macroeconomic indicators which show an ability to influence the dynamics of the dependent variable, the author composed them into a linear regression model, which he tested and then used to successfully forecast turning points in inbound tourism demand in the Balearic Islands.

Cho (Citation2001) compares the forecasting performance of several models. Using the sample of the top five countries of origin of tourists visiting Hong Kong, the author compares the forecasting accuracy of the exponential smoothing model, ARIMA, and the adapted ARIMA model, supplemented with leading indicators. In this manner, the author transforms the classic univariate ARIMA model into the so-called multivariate ARIMAX model with exogenous variables. The author concludes that the multivariate ARIMAX model with leading indicators is the most accurate one, especially in the case of geographically closer countries of origin of tourists. It should be mentioned that the ARIMAX model is actually an autoregressive distributed lag model (ARDL) frequently applied in macroeconomic forecasting.

The ARDL model is also used by Kulendran and Witt (Citation2003). The authors create an ARDL model using the leading indicators of six countries, which represent the top tourist destinations for British tourists. Among all the potential leading indicators, the authors take into consideration those variables that are most frequently used in the econometric modelling of tourism demand. The authors conclude that the forecasting ability of the ARDL model is not stronger than the control model (ARIMA). However, the former is more precise in short-term forecasting compared with another control model (ECM).

Kulendran and Wong (Citation2009) use cross-correlation coefficients in order to identify leading indicators among selected economic variables, which are frequently used in econometric modelling of tourism demand. They arrange the leading indicators into a composite leading indicator, which is then used for the construction of an ARDL model. The authors conclude that the ARDL model with a composite leading indicator enhances the forecasting of tourism demand.

Composite leading indicators, i.e. synthetic indices, are frequently used in forecasting tourism demand. For example, Choi (Citation2003) develops a leading indicator system for the US hotel industry to predict the industry’s growth. The author applies the method of cross-correlation coefficients to identify the leading, coincident and lagging indicators of US hotel industry growth. He then uses the NBER methodology in order to form three composite indices: leading, coincident and lagging. The author identifies forecasting accuracy at a satisfactory level for all of three composite indices and concludes that together they can be a valuable forecasting tool in the hotel industry.

Fernando (Citation2010) criticises the frequent application of linear models in forecasting turning points in tourism demand, and proposes the use of non-linear models such as composite indicators, and LOGIT and PROBIT models. Using the case of Australian inbound tourism demand, the author confirms that LOGIT and PROBIT econometric models are the most accurate. The author argues that composite leading indicators also represent a good option in forecasting turning points in tourism demand.

Kulendran and Wong (Citation2011) apply LOGIT and PROBIT models in order to forecast turning points in international tourism demand in Hong Kong. The authors use these models in a comparative analysis of the forecasting ability of basic determinants of tourism demand, such as disposable income, tourism product prices, substitute prices and transportation prices, in relation to ready-made composite leading indicators such as the national composite leading indicator and the OECD composite leading indicator. The authors conclude that the LOGIT and PROBIT models can forecast equally well. However, these models prove to be more superior in forecasting if they incorporate ready-made composite leading indicators rather than individual determinants of tourism demand.

Tourism demand forecasting articles are usually published in the most prominent journals promoting tourism research, such as Annals of Tourism Research, Tourism Economics, Tourism Management and Journal of Travel Research. However, some papers have recently been published in economic and management journals (Song & Li, Citation2008). On average, between one and three articles on tourism demand forecasting are published every year in the aforementioned journals.

3. Leading composite indicator and methodology

We used the number of tourist overnight stays as a dependent variable in the analysis. Such approximation is common in the literature (Song, Li, Witt, & Fei, Citation2010) and very reasonable in a situation where the data series for total tourism revenue is not available for the entire analysis.

We used the Granger causality test in order to investigate statistical significance and the leading time of 118 potential leading indicators. The variables used as regressors were selected based on microeconomic theory: (i) variables used as proxies for income effects in the countries of origin of tourists; and (ii) variables used as proxies for substitution effects in countries that are competitors of Croatia. The list of all variables and sources of data are presented in Table , while Figures present the original data series used in the analysis.

Table 2. List of variables used in the analysis together with abbreviations and data sources.

Figure 1. Original data series used in the analysis.

Source: Authors’ calculation.
Figure 1. Original data series used in the analysis.

Figure 2. Original data series used in the analysis – continued.

Source: Authors’ calculation.
Figure 2. Original data series used in the analysis – continued.

The Granger causality test was employed on log-difference regressors. The results indicate that there are 12 statistically significant variables that Granger-cause Croatian inbound tourism demand.

Table shows results for 12 significant variables. Five variables are negatively correlated with Croatian inbound tourism demand, and the time lags of leading indicators are mostly 1 year, with the exception of real exchange rates and the Slovenian gross monthly wage with 2-year lags.

Table 3. Leading indicators of Croatian inbound tourism demand.

In the second step, 12 variables that were significant in the Granger causality test (also called leading indicators) were used to construct the leading composite indicator of Croatian inbound tourism demand. The leading composite indicator was constructed according to the OECD methodology (Citation2012) combined with the NBER methodology clearly presented by Bačić and Vizek (Citation2008).

A simplified overview of the composite indicator construction procedure is as follows:

(1)

standardisation of amplitudes of leading indicators;

(2)

inversion of leading indicators;

(3)

lag-shifting (synchronisation of leading indicators);

(4)

weighting of leading indicators;

(5)

aggregation of leading indicators into a leading composite indicator;

(6)

amplitude adjustment of the leading composite indicator with regard to matching the amplitude of the reference series, i.e. Croatian inbound tourism demand.

Figure shows the data for the 12 leading variables, which were standardised, inverted and lag-shifted in line with NBER methodology. In the construction of the weighted average of the 12 leading indicators, Kožić (Citation2013) used the shares of countries of origin of tourists in foreign overnight stays in Croatia as weights. Although such an approach yielded satisfactory results in the case of forecasting turning points in Croatian inbound tourism demand, the major problem with such an exogenous selection of weights is the fact that it is completely theoretical without empirical corroboration.

Figure 3. Leading indicators after standardisation of amplitudes, inversion and lag-shifting (Step 3 of the methodology).

Source: Authors’ calculation.
Figure 3. Leading indicators after standardisation of amplitudes, inversion and lag-shifting (Step 3 of the methodology).

Therefore, in this paper we employ endogenous weights selection. Instead of selecting 12 weights based on theory, we are looking for the 12 weights that will minimise the mean absolute percentage error (MAPE) of the out-of-sample forecast of magnitude of Croatian inbound tourism demand, i.e. the rate of change of overnight stays of foreign tourists in Croatia.

The leading composite indicator (Ω) is defined as the weighted average of the 12 leading indicators:

with the constraint that the sum of all weights ωi for i going from 1 to 12 should be equal to 1:

The mean absolute percentage error (MAPE) is defined as:

where Yf is the out-of-sample forecast of Croatian inbound tourism demand, Y is the actual Croatian inbound tourism demand and n is the number of observations in the out-of-sample forecast. In other words, Yf is the fitted value of Y for the out-of-sample period 2010–2012 forecast using the coefficient vector estimated in the in-sample data for 1955–2009 using:

In order to find the combination of weights ωi that minimise the MAPE statistics, we employed a grid search and estimated the coefficient vector for all possible combinations of weights ωi that satisfy the condition that the sum of all weights is equal to 1. We tried all the combinations between 0.01 and 1.00 for all weights, bearing in mind the constraint. In order to save computer time, we used steps of 0.03, meaning that we tried values 0.01, 0.04, 0.07, etc. for each weight.

4. Results

The weights optimisation procedure was conducted in several steps. It aimed to find the lowest possible rate of change of weights, i.e. the parameters of the regression model, which is feasible in the context of constraints regarding the computer hardware. Since the running of the weights optimisation procedure on PCs is extremely CPU-consuming, as has already been pointed out, the lowest reasonable rate of change of weights was 0.03. In total, 7.7 billion regression models were estimated. A single PC was used to run the computer algorithm for the weights optimisation procedure and the whole calculation process lasted continuously for two months.

The final results are shown in Table . The lowest MAPE is 5.1%, which can be considered as highly accurate forecasting (Frechtling, Citation2001, p. 26). Only four of the 12 initial leading indicators are included in the final leading composite indicator of Croatian inbound tourism demand. Two of them are related to wages (in the Czech Republic and Slovakia) and two (real GDP and the import of goods) are related to the macroeconomic situation in Poland.

Table 4. Results of weights optimisation procedure (grid search).

Figure shows the comparison of the out-of-sample forecast and the actual data during the out-of-sample period (the shaded area in Figure ). The implications of the final results are briefly discussed in the conclusion.

Figure 4. Out of sample forecast of composite leading indicator.

Note: Shaded area represents period of the out of sample forecast.

Source: Authors’ calculation.

Figure 4. Out of sample forecast of composite leading indicator.Note: Shaded area represents period of the out of sample forecast.Source: Authors’ calculation.

5. Conclusion

The weights determination process is highly important in modelling composite indicators. It is not only important for the proper setting of the parameter values in the model but also because the weights reflect the relative importance of each variable in the forecasting model. The method of weights optimisation should be considered in accordance with the main purpose of the forecasting model.

Unlike the example presented in Kožić (Citation2013), where the main purpose of the forecasting model was to predict tourism demand cycle stages and turning points, the main purpose of the analysis presented in this paper was to improve the model developed by Kožić (Citation2013) for accurate forecasting of the magnitude of the annual rate of change of Croatian inbound tourism demand.

Therefore, a more sophisticated and mathematically more rigid method was needed to accomplish the task. Thus, the grid search was used as a data driven (endogenous) weights determination technique. Using a grid search, we managed to minimise the MAPE of forecasting to 5.1%, which can be considered as highly accurate forecasting. In addition, the list of relevant leading indicators of the annual rate of change of overnight stays of foreign tourists in Croatia was reduced. We found that only four out of the 12 identified leading indicators of Croatian inbound tourism demand are relevant in forecasting the magnitude of its annual rate of change. They are ‘Czech gross monthly wages’, ‘Slovakian gross monthly wages’, ‘Polish real GDP’, and ‘Imports of goods in Poland’.

It is not surprising that the four relevant leading indicators are related to Eastern European countries. It is very likely that, on the basis of standard of living and travel habits, tourists from these countries are representative foreign visitors to Croatia. Moreover, it is also very likely that even those tourists in Croatia from Western European countries, Germany for example, have a standard of living and travel habits similar to those tourists from Eastern European countries.

Such a conclusion is supported by the TOMAS survey on the attitudes and expenditure of tourists in Croatia that is frequently conducted by the Croatian Institute for Tourism. According to the latest available data, the average daily expenditure of tourists in Croatia is €58, and nearly half (47%) of tourists come from households with a monthly income of €2000 or less (Marušić, Čorak, Sever, & Ivandić, Citation2011). Likewise, it is not surprising that two of the four relevant leading indicators for forecasting the magnitude of Croatian inbound tourism demand are related to Poland. Poland is a relatively large European economy. It has very close trade links with the rest of Europe, especially with the rest of Eastern Europe, and thus it could be expected that its economic indicators reflect economic developments in other countries.

Such practical implications of the study suggest that Croatian inbound tourism demand is mainly driven and can be explained by economic developments in Eastern Europe. This could serve as very practical input for tourism policymakers, especially those that make decisions on tourism marketing strategies. Business cycle marketing measures should depend in the first place on the Eastern European economic outlook. The measures should be designed to have an impact on middle class tourists. On the other hand, although the current long-term national strategy of attracting tourists with higher incomes might be considered as welcome, based on the current state of overall Croatian inbound tourism demand, such a goal could be very hard to achieve in the short-run.

Finally, future work on this topic should include the frequent revision of leading indicators and consequently the revision of their weights. The major limitation of this study is the fact that the leading indicators are not stable over a longer period of time. Prognostic models that are based on leading indicators tend to lose their prognostic qualities. An economic indicator that is a highly important leading indicator can lose its leading character very fast. On the other hand, certain other economic indicators can increase in importance as leading indicators. Therefore, it is very important to repeat the whole procedure of identifying leading indicators and determining their relative importance, reflected by their weights, every couple of years.

Another limitation of the study can be found in the relatively short data series for the out-of-sample forecasting. Since it was determined by the overall length of the data series, it can be expected that this problem will be less significant in the future, as the number of observations will naturally rise. It is also worth mentioning that the whole procedure should be conducted with tourism revenues as a dependent variable. A deeper insight could be also provided by the construction of a single model for every country of origin of tourists.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  • Bačić, K., & Vizek, M. (2008). Forecasting business and growth cycles in Croatia. Ekonomski pregled, 59, 646–668.
  • Čavlek, N., Bartoluci, M., Prebežec, D., & Kesar, O. (2011). Tourism – economic fundamental and organizational system. Zagreb: Školska knjiga.
  • Cho, V. (2001). Tourism forecasting and its relationship with leading economic indicators. Journal of Hospitality and Tourism Research, 25, 399–420.10.1177/109634800102500404
  • Choi, J. G. (2003). Developing an economic indicator system (a forecasting technique) for the hotel industry. International Journal of Hospitality Management, 22, 147–159.10.1016/S0278-4319(03)00015-X
  • Fernando, D. E. (2010). Identifying and predicting turning points in Australian inbound tourism demand growth ( Doctoral thesis). Victoria University, Melbourne.
  • Frechtling, C. D. (2001). Forecasting tourism demand – methods and strategies. Oxford: Butterworth-Heinemann.
  • Gerakis, A. S. (1965). Effects of exchange-rate devaluations and revaluations on receipts from tourism. Staff Papers - International Monetary Fund, 12, 365–384.10.2307/3866335
  • Gray, H. P. (1966). The demand for international travel by the United States and Canada. International Economic Review, 7, 83–92.10.2307/2525372
  • Guthrie, H. W. (1961). Demand for tourists’ goods and services in a world market. Papers of the Regional Science Association, 7, 159–175.10.1007/BF01969078
  • Kožić, I. (2013). Forecasting turning points in Croatian inbound tourism demand. Zagreb: Faculty of Economics and Business of the University of Zagreb.
  • Kulendran, N., & Witt, S. F. (2003). Leading indicator tourism forecasts. Tourism Management, 24, 503–510.10.1016/S0261-5177(03)00010-4
  • Kulendran, N., & Wong, K. K. F. (2009). Predicting quarterly Hong Kong tourism demand growth rates, directional changes and turning points with composite leading indicators. Tourism Economics, 15, 307–322.10.5367/000000009788254340
  • Kulendran, N., & Wong, K. K. F. (2011). Determinants versus composite leading indicators in predicting turning points in growth cycle. Journal of Travel Research, 50, 417–430.10.1177/0047287510373280
  • Li, G., Song, H., & Witt, S. F. (2005). Recent developments in econometric modeling and forecasting. Journal of Travel Research, 44, 82–99.10.1177/0047287505276594
  • Lim, C. (1997a). Review of international tourism demand models. Annals of Tourism Research, 24, 835–849.10.1016/S0160-7383(97)00049-2
  • Lim, C. (1997b). An econometric classification and review of international tourism demand models. Tourism Economics, 3, 69–81.
  • Lim, C. (1999). A meta-analytic review of international tourism demand. Journal of Travel Research, 37, 273–284.10.1177/004728759903700309
  • Marušić, Z., Čorak, S., Sever, I., & Ivandić, N. (2011). TOMAS summer 2010. Zagreb: Institute for Tourism.
  • OECD (2012). OECD system of composite leading indicators. Paris: OECD.
  • Rossello-Nadal, J. (2001). Forecasting turning points in international visitor arrivals in the Balearic Islands. Tourism Economics, 7, 365–380.10.5367/000000001101297928
  • Song, H., & Li, G. (2008). Tourism demand modelling and forecasting—A review of recent research. Tourism Management, 29, 203–220.10.1016/j.tourman.2007.07.016
  • Song, H., Li, G., Witt, S. F., & Fei, B. (2010). Tourism demand modeling and forecasting: How should demand be measured? Tourism Economics, 16, 63–81.10.5367/000000010790872213
  • Šutalo, I., Ivandić, N., & Marušić, Z. (2011). Total contribution of tourism to Croatian economy: Input-output model and tourism satellite account. Ekonomski pregled, 62, 267–285.
  • Wong, K. K. F., Song, H., Witt, S. F., & Wu, D. C. (2007). Tourism forecasting: To combine or not to combine? Tourism Management, 28, 1068–1078.10.1016/j.tourman.2006.08.003