Original Articles

Time series analysis and prediction using gated experts with application to energy demand forecasts

Pages 583-624 | Published online: 26 Nov 2010
 

In the analysis and prediction of real-world systems, two of the key problems are nonstationarity (often in the form of switching between regimes) and overfitting (particularly serious for noisy processes). This article addresses these problems using gated experts, consisting of a (nonlinear) gating network and several (also nonlinear) competing experts. Each expert learns to predict the conditional mean, and each expert adapts its width to match the noise level in its regime. The gating network learns to predict the probability of each expert, given the input. This article focuses on the case where the gating network bases its decision on information from the inputs. This can be contrasted to hidden Markov models, where the decision is based on the previous state(s) (i.e., on the output of the gating network at the previous time step), as well as to averaging over several predictors. In contrast, gated experts soft-partition the input space. This article discusses the underlying statistical assumptions, derives the weight update rules, and compares the performance of gated experts to standard methods on three time series: (1) a computer-generated series, obtained by randomly switching between two nonlinear processes; (2) a time series from the Santa Fe Time Series Competition (the light intensity of a laser in chaotic state); and (3) the daily electricity demand of France, a real-world multivariate problem with structure on several timescales. The main results are: (1) the gating network correctly discovers the different regimes of the process; (2) the widths associated with each expert are important for the segmentation task, and they can be used to characterize the subprocesses; and (3) there is less overfitting compared to single networks (homogeneous multilayer perceptrons), since the experts learn to match their variances to the local noise levels. This can be viewed as matching the local complexity of the model to the local complexity of the data.
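
The interplay between the gating probabilities, the expert predictions and the expert widths can be made concrete with a small sketch. It assumes a standard mixture-of-experts formulation with one Gaussian per expert; the abstract does not give the article's update rules, so the function names, the closed-form width update and the toy numbers below are illustrative assumptions rather than the authors' implementation. In a full model the gate probabilities and expert means would come from trained networks; here they are supplied by hand.

```python
import math

def softmax(logits):
    """Turn gating-network outputs into probabilities that sum to one."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def gaussian_density(y, mu, sigma):
    """Density of target y under an expert with mean mu and width sigma."""
    norm = math.sqrt(2.0 * math.pi) * sigma
    return math.exp(-0.5 * ((y - mu) / sigma) ** 2) / norm

def posteriors(gate_probs, y, means, sigmas):
    """Posterior probability of each expert for one pattern.

    Returns (posterior list, mixture likelihood of the pattern).
    """
    joint = [g * gaussian_density(y, m, s)
             for g, m, s in zip(gate_probs, means, sigmas)]
    total = sum(joint)
    return [j / total for j in joint], total

def update_widths(all_post, ys, all_means):
    """Assumed width update: posterior-weighted RMS error of each expert."""
    n_experts = len(all_post[0])
    widths = []
    for k in range(n_experts):
        num = sum(h[k] * (y - mu[k]) ** 2
                  for h, y, mu in zip(all_post, ys, all_means))
        den = sum(h[k] for h in all_post)
        widths.append(math.sqrt(num / den))
    return widths

# Toy data (hypothetical): two experts, three patterns.
ys = [0.1, 0.9, 5.2]                            # observed targets
all_means = [[0.0, 5.0], [1.0, 5.0], [0.0, 5.0]]  # expert predictions per pattern
gate = softmax([0.0, 0.0])                      # uninformative gate
sigmas = [1.0, 1.0]                             # initial widths

all_post = [posteriors(gate, y, mu, sigmas)[0] for y, mu in zip(ys, all_means)]
new_sigmas = update_widths(all_post, ys, all_means)
```

In this sketch an expert whose predictions fit its patterns closely ends up with a small width, while an expert responsible for noisier patterns ends up with a larger one, which is the sense in which each expert matches its variance to the local noise level.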
