675
Views
20
CrossRef citations to date
0
Altmetric
Articles

Analysis of classical and machine learning based short-term and mid-term load forecasting for smart grid

ORCID Icon & ORCID Icon
Pages 821-839 | Received 06 Sep 2020, Accepted 31 Dec 2020, Published online: 21 Jan 2021

ABSTRACT

The evolution of advanced metering infrastructure (AMI) has increased the electricity consumption data in real-time manifolds. Using this massive data, the load forecasting methods have undergone numerous transformations. In this paper, short-term and mid-term load forecasting (STLF and MTLF) is proposed using smart-metered data acquired from a real-life distribution grid at the NIT Patna campus with different classical and machine learning methods. Data pre-processing is done to transform the raw data into an appropriate format by removing the outliers present in the datasets. The influential meteorological variables obtained by correlation analysis along with the past load are used to train the load forecasting model. The proposed support vector regression (SVR) produces the best forecasting performance for the test system with a minimum mean absolute percentage error (MAPE) and root mean square error (RMSE). The proposed method outperforms the existing approaches for STLF and MTLF by an average MAPE of 3.60.

This article is part of the following collections:
Energy Innovation and Security: From Microgrids to Global Markets

Nomenclature

α=

Level coefficient

β=

Trend coefficient

AMI=

Advanced metering infrastructure

ANN=

Artificial neural network

ARIMA=

Autoregressive integrated moving average

BR=

Bayesian regularisation

DSM=

Demand-side management

GH=

Girls hostel

GRG=

Generalised reduced gradient

IT=

Incomer transformer

LM=

Levenberg Marquardt

LTLF=

Long-term load forecasting

MAPE=

Mean absolute percentage error

MLR=

Multiple linear regression

MTLF=

Mid-term load forecasting

R=

Regression coefficient

RBF=

Radial basis function

RNN=

Recurrent neural network

STLF=

Short-term load forecasting

SVC=

Support vector classification

SVR=

Support vector regression

SVM=

Support vector machine

THI=

Temperature humidity index

VSTLF=

Very short term load forecasting

1. Introduction

Due to the emergence of smart electrical grids for information sensing, processing, and transmission, advanced metering infrastructure (AMI) has evolved as a key unit in different energy sectors. This facilitated handling a large volume of metered data and relevant information. Further, accessibility of these huge smart meter data in real-time opens up the opportunities for multiple operations like energy management (Aghajani and Ghadimi Citation2018), load forecasting (Liu, Wang, and Ghadimi Citation2017; Kleebrang, Bunditsakulchai, and Wangjiraniran Citation2017), power system planning and operation, maintenance scheduling, error mitigation.

Load forecasting acts as an indispensable tool for optimum planning and operation in different energy sectors (such as residential, commercial, and industrial sectors) (Zhang, Wu, and Wang Citation2015). It plays a vital role in decision making, maintaining efficient economic operations in power system and demand-side management (DSM) by motivating customers to modify their electricity demand and the utilities to generate energy as per the requirement. Since electricity demand changes rapidly, it is very difficult for energy sectors to estimate the amount of electricity needed to balance supply and demand in the future. An accurate forecast of electricity pricing is also required for both the producers and the consumers of electric power for taking the energy trading decisions and to avail the minimum possible price of the electricity simultaneously (Akbary et al. Citation2019; Hamian et al. Citation2018).

Load forecasting methods can be broadly classified into three categories depending on the methodology adopted: (i) based on utilisation side (ii) based on weather information (iii) based on the time horizon. Depending upon the utilisation side, load forecasting can be classified into two categories – (1) Utility-based forecasting, which is used in power generation planning and management, and (2) Consumer-based forecasting, which is applied for the energy optimisation procedures supported by the receiving end-users (Taylor and McSharry Citation2007). The factors affecting load forecasting are the time of the day, day of the week, previous day load, seasonal variations, holidays, weather factors, etc. (Moghram and Rahman Citation1989). Depending on the utilisation of weather forecast information, load forecasting can be grouped into two groups namely – (1) Univariate methods, which do not require weather information for forecasting and (2) Multivariate methods, which require weather information for forecasting loads. Depending upon the length of the forecast interval, load forecasting is again classified into four groups – (1) Very short-term load forecasting (VSTLF) – over periods from several minutes to an hour ahead (2) Short-term load forecasting (STLF) – over several hours to a week ahead (3) Mid-term load forecasting (MTLF)-over several weeks to a month ahead, and (4) Long-term load forecasting (LTLF) – over several months to a year ahead (Wang et al. Citation2018).

STLF and MTLF have always been a matter of great interest in different energy sectors over the years (Zhang, Yu, and Xu Citation2020). STLF is required for an hour ahead bidding, load flow analysis, DSM (Tian et al. Citation2018), contingency analysis, stability studies, ancillary services management, day-ahead reserve setting, reliability analysis, power system operation and scheduling, electricity price calculation, etc. which requires higher accuracy than LTLF. MTLF is generally used for fuel reserve planning, unit commitment, and maintenance scheduling operations. Load forecasting is also very important for the sustainable development of a smart grid system.

This work focuses on the STLF and MTLF of a practical system set up at the NIT Patna campus (which is an academic-cum-residential grid). Smart meters are connected at different nodes at the campus and real-time data is stored online. Unlike other similar works, which generally consider only one or two types of load (Tian et al. Citation2018) (i.e. industrial or commercial or commercial with residential type, etc.), the NIT Patna campus system includes all these types of loads together. It consists of academic loads including lecture halls and different laboratories on campus. The campus grid also consists of residential (viz. hostels and staff residences) and commercial (viz. office buildings, cafeterias) loads. The suitability of the proposed method for the campus grid signifies that the method will be applicable to any type of smart distribution system for forecasting hour ahead, a day ahead, and week ahead loads. This paper reports various load forecasting methodologies to predict a load of this campus grid system at multiple nodes. In this direction, the contribution of this work may be summarised as follows:

  1. Load Forecasting at multiple nodes of a campus grid consisting of a variety of loads including academic, residential, commercial, and mixed loads, etc.

  2. Application of Gaussian filter for data preprocessing as a novel contribution in this work to eliminate the outliers present in the dataset.

  3. Implementation of various classical and modern regression methods for load forecasting to select the best candidate model.

The rest of the paper is organised as follows: Section 2 presents the literature review followed by the complete forecasting methodology in Section 3. Section 4 describes the case study on the smart meter based distribution system which includes a dataset description followed by correlation analysis and data pre-processing. The results and discussion are shown in Section 5. Finally, Section 6 concludes the paper.

2. Literature review

With the increased availability of load data owing to the installation of smart meters at various nodes of distribution systems, different research works have been carried out to measure future consumption by forecasting loads using several load forecasting techniques (Hernandez et al. Citation2014). Different approaches, which have been proposed by the researchers for forecasting load can be broadly classified into two categories: Traditional or Statistical methods including regression analysis (Amara et al. Citation2019; Alex and Timothy Citation1990; Moghram and Rahman Citation1989), time-series methods e.g. Kalman filter method (Rahman and Drezga Citation1992), ARIMA methods (Moshkbar-Bakhshayesh and Ghofrani Citation2014) exponential smoothing methods (Hagan and Behr CitationAugust 1987; Ji et al. Citation2012; Gob, Lurz, and Pievatolo Citation2013) and Modern or Intelligent methods including artificial neural networks (ANN) (Xiao et al. Citation2009; Taylor and Buizza Citation2002; Filho, Lotufo, and Minussi Citation2011; Hippert, Pereira, and Souza Citation2001; Brka, Al-Abdeli, and Kothapalli Citation2016), fuzzy theory (Liao and Tsao Citation2006; Salim et al. Citation2014), support vector machines (Zhang, Wang, and Zhang Citation2017), and wavelet analysis (Deihimi, Orang, and Showkati Citation2013). In recent years, many hybrid approaches which include neural networks along with different intelligent algorithms have been proposed for forecasting other entities such as wind power (Leng et al. Citation2018; Haque et al. Citation2015), solar energy (Kazantzidis et al. Citation2018), and wind-solar output power (Mirzapour et al. Citation2019) in a hybrid power system. The electrical load of a system mainly depends on the past load along with the exogenous factors like weather variables (THI), day, time, etc. For handling such loads, a recurrent neural network (RNN) is used which has a memory capability to remember the previously predicted load and use it as an input training feature for load forecasting (Zhang, Yu, and Xu Citation2020; Wen, Zhou, and Yang Citation2020; de Andrade et al. Citation2014). In addition to these, a relatively complex technique called ‘ensemble methods’ is very popular these days (Aly Hamed Citation2020; Nazar et al. Citation2018). The ensemble methods are mainly the amalgamation of classical and intelligent methods which introduces computational complexity in load forecasting. Further, efforts are being made towards the application of various regression methods (both classical and intelligent) on the different types of load datasets to identify the most suitable load forecasting model (Alex and Timothy Citation1990; Moghram and Rahman Citation1989; Taylor and Buizza Citation2002; Haque et al. Citation2015; Aly Hamed Citation2020). In this direction, this paper reports various techniques including classical as well as intelligent methods of load forecasting at multiple nodes with different load characteristics located in the campus grid.

3. Methodology

In this section, the proposed forecasting methodology is presented for STLF and MTLF. The detailed steps of the forecasting procedure are presented in . The load data is collected using smart meters installed at various nodes inside the campus. The raw data is then normalised in the range of [0 1] to reduce wide variation in the range of load data over different seasons and the weather variables and hence, the load forecasting model can be trained perfectly. The normalization is done using the min–max normalization technique as per EquationEquation (1): (1) xnew=xxminxmaxxmin(1) where x is the actual load of the test node at a given instant; xmin and xmax are the minimum and maximum values of a load of that node and xnew is the normalized load of the test node at that particular instant. The normalized data is then pre-processed using a Gaussian filter (GF) to remove the outliers (noise, data communication errors, error due to smart meter failure, etc.) without altering the original characteristics of the load.

Figure 1. Pictorial representation of the forecasting methodology.

Figure 1. Pictorial representation of the forecasting methodology.

To compute the relationship between the dependent variable (load) and the independent variables (i.e. weather factors, time, etc.) and to select the variables that affect the load significantly, correlation analysis is performed. Load at any node is highly dependent on weather parameters like THI, time, and day, as a load of weekends is less as compared to the weekday loads. Therefore, these features are considered to be important attributes for training the load forecasting model. The complete dataset is divided into two parts: training and testing datasets. Subsequently, different classical and modern load forecasting techniques are used to predict the load at different nodes inside the campus. Finally, the MAPE and RMSE values are calculated by comparing the forecasted load and the actual load from each method to select the best model with minimum error values.

3.1. Load forecasting methods

Different statistical/conventional techniques are applied along with the modern/ intelligent techniques to forecast the loads at two different nodes as described below.

3.1.1. Multiple linear regression (MLR)

Multiple linear regression describes the relationship between the independent variable such as weather components like temperature, humidity, etc., and the dependent variable i.e. load. The relation between the independent variable and the dependent variable can be represented as: (2) y~=β0+β1x1+β2x2+..+βnxn+ε(2) Where;y~is the dependent variable, x1,x2,.. and xn are the independent variables or input variables used to predict the dependent variable i.e. load, β is regression coefficients and ε is represented as an error term (Alex and Timothy Citation1990). For multiple observations, we can write: y~1=β0+β1x11+β2x12+..+βkx1k+ε y~2=β0+β1x21+β2x22+..+βkx2k+ε y~n=β0+β1xn1+β2xn2+..+βkxnk+ε

In general form, the equation can be written as: (3) y~i=β0+i=1nβixi+ε(3)

The main goal of MLR is to develop a function that closely relates these variables so that the dependent variable can be forecasted from the values of independent variables. The basic problem is that using this method gives a linear forecasting model but the load pattern is generally a nonlinear function.

3.1.2. Artificial neural network (ANN)

ANN is a modern computational machine learning technique often used for forecasting loads that reveals certain performance characteristics resembling biological NN with the capability of parallel processing like a human brain. One major advantage of ANN is that it can be used to model any type of nonlinear multivariate function (Taylor and Buizza Citation2002). This method is suitable for the problems where it is difficult to specify the entire knowledge required for the solution but the number of observations is quite high. In this paper, a sigmoid activation function is used for artificial neurons. The neural network is trained using the Levenberg- Marquardt (LM) and Bayesian Regularisation (BR) algorithm by varying the number of neurons from 1 to 10 in one hidden layer. The comparative study is prepared between LM and BR algorithms to develop the ANN architecture that results in a minimum mean absolute percentage error in forecasting the load. One major drawback is that it is very tough to decide whether the derived NN is the best possible solution since it is a tiresome hit and trial process and the obtained model may get over-fitted which results in less forecasting accuracy (Hippert, Pereira, and Souza Citation2001).

3.1.3. Holt’s exponential smoothening method

Holt’s double parameter smoothening is a univariate method which deals with the data series having a trend. The method is based upon two smoothening equations, one for the level component and the other for the trend component which can be given as (Gob, Lurz, and Pievatolo Citation2013): (4) Level:At=α×yt+(1α)×(At1+Tt1)(4) (5) Trend:Tt=β×(AtAt1)+(1β)×Tt1(5)

The forecast equation can be written as: (6) Forecast:Ft+m=At+Tt×m(6)

The initial value of the level and trend component can be found out by the following equation: (7) A1=y1,T1=((y2y1)+(y4y3))2(7) Where α and β are smoothening constants; At is the level component; Tt is the trend component; yt is the actual load; t is the time period; Ft+m is the forecasted load for m periods ahead.

α is the smoothening parameter that represents weighting and the range of α is [0, 1]. β is the smoothening coefficient that represents the trend component of the dataset.β also lies in the range [0, 1]. Value of β closer to 1 signifies that more weight is given to recent observations.

3.1.4. Support vector regression (SVR)

Support vector machine (SVM) is a powerful machine learning tool based on supervised learning for classification and regression analysis which is named SVC and SVR. SVR is a non-parametric technique as it relies on the kernel function. The kernel is a way of calculating the dot product of two vectors in a very high dimensional feature space, and the kernel function is a function used to map lower-dimensional data into higher dimensional data (Zhang, Wang, and Zhang Citation2017). The kernel can be linear, polynomial, and radial basis function (RBF) or Gaussian kernel. The choice of kernel function depends on the type of dataset used for modeling. In this work, a polynomial kernel is used to train the model, since the data used is highly nonlinear. The polynomial kernel is defined by Equation (8) as: (8) K(x,y)=(c+xTy)n(8) Where; n is the degree of the polynomial and c is the constant term (Zhang, Wang, and Zhang Citation2017).

3.2. Forecasting performance

The forecast accuracy is determined by using MAPE which is a common metric used in the field of load forecasting. It is calculated as: (9) MAPE=1N|i=1N(yiy~i)yi|×100(9) Where; yi is the actual load, and y~i is the forecasted load, N is the number of samples used for testing.

The root mean square error (RMSE) between actual and predicted load is calculated as: (10) RMSE=i=1N(yiy~i)2N(10)

4. Case study – NIT Patna campus

Smart meters are installed at 15 different nodes inside the NIT Patna campus which gives real-time data via the cloud. At any node, all the different parameters of the load data e.g. active power, reactive power, line and phase voltages, phase angle, frequency, power factor, etc. can be accessed at the desired time interval varying from 5 minutes to several hours as required.

4.1. Dataset description

For load forecasting, various combinations of load datasets used are: (a) daily average load at two different nodes for six months period ranging from 15th July 2018 to 15th January 2019, (b) hourly load at a node for six months, (c) daily load at a particular hour (10 am) for three months, (d) daily average load of the weekend for six months. These load datasets of different time horizons are used for forecasting the load at two different nodes namely Girl’s Hostel (GH) node consisting of mixed (residential+ academic+ health center) load and Incomer transformer (IT) load. The corresponding weather variables such as temperature and humidity which have a considerable effect on the load have been taken from the meteorological department. A sample dataset of daily average weekend load for both the mentioned nodes has been shown in . From the table, it can be seen that a load of September is more as compared to the load in December and January for both the nodes. The 0 value of the load at any instant may be due to the communication link failure of the smart metering device. The actual load curve of the two nodes for six months period along with the corresponding temperature and humidity is shown in . The influence of THI on the load is analysed by the correlation analysis done in the next section.

Figure 2. A plot of (a) humidity, (b) temperature, (c) daily load of GH node, (d) daily load of IT node for six months period.

Figure 2. A plot of (a) humidity, (b) temperature, (c) daily load of GH node, (d) daily load of IT node for six months period.

Table 1. Weekend dataset from August 2018 to January 2019

4.2. Correlation analysis

Before performing regression, correlation analysis is performed to check the relation between the dependent variable (load) and the independent variables (weather variables). The correlation coefficients obtained are tabulated in . Column 1, 2, 3, and 4 represents temperature, humidity, load (girls hostel) and load (incomer transformer) respectively. From the obtained Pearson correlation coefficients, it is observed that both the temperature and humidity are closely correlated with the load as the correlation coefficients for both temperature and humidity are very high for both the nodes. It can be also said that both the loads are closely correlated to each other.

Table 2. Correlation of weather data with load

From , it can be observed that the load at both the nodes strongly follow the temperature and humidity pattern.

4.3. Data pre-processing

The data obtained directly from smart meters contain errors such as noise, communication link errors, wrong or duplicate data, etc, and this data used for load forecasting may contain some abnormality. The abnormal data should be filtered out from the entire dataset to get accurate and reliable forecasts. The abnormal data is first normalized using the min–max normalization technique (as given in EquationEquation (1)) and then filtered using a Gaussian filter. Gaussian filter is a low pass filter whose effectiveness can be controlled by adjusting the width of the filter known as preset window length. The window calculates the mean of the data points and then move one step down and again calculates the mean. This procedure is repeated until the complete data have been covered. The window length is selected after the iterative evaluation to get the best forecasting performance, and here, the selected length is 2. The processed data is then used for building the load forecasting model using different techniques. Gaussian filter is isotropic, i.e. the standard deviation is the same along both the dimensions and the Fourier transform is a Gaussian distribution centered around zero frequency. It is mainly used to reduce noise from the time series data. and present the hourly load curve of the GH node before and after data pre-processing. These figures show the pre-processing has maintained the trend in data while enclosing variation in its magnitude within a tight range that enables improved forecasting.

Figure 3. Actual load curve of GH node

Figure 3. Actual load curve of GH node

Figure 4. Load curve of GH node after pre-processing

Figure 4. Load curve of GH node after pre-processing

5. Results and discussion

The smart metered data obtained from the NIT Patna campus is used for building the forecast models. Here, the daily average data for six months, hourly data for six months, and weekend and weekdays data for six months at two different nodes have been considered for analysis. 80% of the total dataset is used for training and the remaining 20% is used for testing and validating the model.

5.1. Load forecasting using different classical and machine learning techniques

presents the MAPE and RMSE obtained for different input datasets by using the MLR technique. The feed-forward neural network is used for load forecasting in all the different testing scenarios. The algorithms that are used for training the neural network are Levenberg-Marquardt (LM) and Bayesian Regularisation (BR) algorithms. One hidden layer is used for the study and the number of neurons is varied from 5 to 100. Here, 80% of the total dataset is used for training the neural network, 10% is used for testing and the remaining 10% is used for validation of the model.

Table 3. Results of load forecast using MLR

shows the performance of ANN for six months’ daily average load at two nodes (girls hostel and incomer transformer) and three months load at a particular hour for one node (girls hostel) respectively. The regression plot for six months’ daily average load and weekend load as input dataset for girls hostel node is shown in and . From both the plots it can be seen that the regression coefficient (R-value) is very close to 1 for both the training and testing datasets. This means that the model is well trained and the actual and predicted values are highly correlated to each other, resulting in good forecasting performance.

Figure 5. Regression plot for six months daily average load of GH node

Figure 5. Regression plot for six months daily average load of GH node

Figure 6. Regression plot for six months weekend load of GH node.

Figure 6. Regression plot for six months weekend load of GH node.

Table 4. Performance of ANN for six months daily average load

Holt’s double exponential smoothening method is used for load forecasting. The related equations are presented in (3) and (4). The input dataset used for STLF is daily average load data of six months for two nodes, an hourly load of six months for one node, and a weekend load for six months at two nodes. In this method, the forecasting contains two components namely level and trend components as given in (3) and (4). The total forecasted load is the sum of the level and trend components. The values of level and trend constants, known as α and β, are initially taken as 0.5 each for forecasting and the corresponding MAPE and RMSE values are calculated. To improve the forecasting error the values of α and β are optimised (between 0 to1) to get the minimum MAPE error. The objective function is given below: MinimizeMAPE(α,β)=1N|i=1N(yiy~i)yi|×100Where y values can be found using (5) and (6). Generalised Reduced Gradient (GRG) nonlinear solver is used in this work for optimisation. The range of the coefficients alpha and beta is given 0 and 1. Initially, the solver calculates the MAPE value by taking alpha and beta as 0.5 (which is the initial set value by default). The GRG nonlinear solver calculates the slope of the objective function as the input values change and determine that it has reached the optimum solution when the partial derivatives equal to 0. Finally, it calculates the optimum value of coefficients α and β for which the MAPE is minimised. In the optimised values of α and β and the corresponding minimum MAPE and RMSE for different input datasets are given. Non-linear SVR with the polynomial kernel is used to forecast the load at two given nodes under different training and testing datasets. The kernel function transforms the data into a higher dimensional feature space so that the linear separation can be done.

Table 5. Optimised values of α and β for different cases with respective MAPE and RMSE values.

The polynomial kernel not only considers the given attributes of input samples to discover the similarities but also their combinations known as interactive features. Here, the order of the polynomial is varied from 1 to 5 and the corresponding MAPE and RMSE values are calculated as given in .

Table 6. Results of load forecast using SVR

5.2. Comparison of different load forecasting methods

The comparison of the results of forecasting by the above-mentioned methods is presented in . From the table, it can be seen that the MAPE calculated by the MLR method for different input datasets is in the range of 0.5% to 7%. By the ANN method, the MAPE is obtained from 0.18% to 5%. Holt’s double exponential smoothening method with the optimised value of α and β yields the MAPE around 3% to 9% for three cases and has a high MAPE value for rest as given in . SVR yields a low forecasting error in all the cases as the range of error lies from 1% to 5%. From the above discussion, it can be concluded that the SVR method gives the minimum MAPE and RMSE for the residential academic load of the NIT Patna campus system with smart metering data compared to the other classical and machine learning methods described above. However, RNN is an effective method for time series forecasting in recent years as it uses its own predicted output as a training input for future prediction. Therefore, to test the superiority of the proposed SVR method, the MAPE obtained by the proposed method is compared with the MAPE obtained by RNN for all the six cases as given in . The forecasting results show that in this work the SVR method yielded a better forecast than RNN. This might be on account of the erroneous data obtained from the smart meters used for data collection in real-time. The study also suggests these regression methods heavily depend upon the nature of recorded data.

Table 7. MAPE for different forecast methods applied for various datasets

The comparison plot between actual and predicted load for different datasets using various techniques are shown in .

Figure 7. Comparison plot for six months daily average load of Girls hostel.

Figure 7. Comparison plot for six months daily average load of Girls hostel.

Figure 8. Comparison plot for six months daily average load of incomer transformer.

Figure 8. Comparison plot for six months daily average load of incomer transformer.

Figure 9. Comparison plot for a weekend load of Girls hostel.

Figure 9. Comparison plot for a weekend load of Girls hostel.

Figure 10. Comparison plot for a weekend load of incomer transformer.

Figure 10. Comparison plot for a weekend load of incomer transformer.

Figure 11. Comparison plot for hourly load at 10 am of Girls hostel.

Figure 11. Comparison plot for hourly load at 10 am of Girls hostel.

Figure 12. Comparison plot for six months hourly load of Girls hostel.

Figure 12. Comparison plot for six months hourly load of Girls hostel.

Also, the proposed result is compared with the other similar studies and the results are tabulated in . This comparative study shows that the proposed SVR method outperforms the recent works (Yildiz, Bilbao, and Sproul Citation2017; Solyali Citation2020). In addition to this, the other forecasting methods presented in this work (MLR, ANN, and Holt’s) are either better or comparative as far as the MAPE is concerned. On the contrary, it is observed that the MAPE obtained in (Moon et al. Citation2018) is relatively less than that of the proposed work. This may be on account of double-stage analysis in the existing work (Moon et al. Citation2018). Therefore, further research may be carried out in a direction in which multistage forecasting methods are implemented for better results.

Table 8. MAPE obtained with the proposed method compared with other similar studies.

6. Conclusion

This paper presents multiple STLF and MTLF methods and their performance analysis using smart metered data. The forecasting is done using statistical techniques like MLR and Holt’s exponential smoothening method along with the machine learning-based approaches like ANN and SVR. Data pre-processing using Gaussian filter is used to get rid of the erroneous data or missing data which is a very common phenomenon in the case of a smart metering system due to communication failure. The suitability of the proposed method is validated through the broad comparison studies on the dataset available at the NIT Patna campus. It was found that the SVR method provides the best load forecasting for both STLF and MTLF which can be verified from the least values of MAPE and RMSE for all the testing scenarios. The average MAPE and RMSE obtained by the SVR model for all test datasets are 3.06% and 0.025 respectively, which shows that the proposed model gives very accurate and consistent load forecasting performance. Comparison with other similar studies shows that the proposed work yields better forecasting performance. The same work can be extended for a long term load forecasting which has not been done due to the unavailability of long-term training data for the used system. Additionally, the analysis with a large scale dataset for long term forecasting would attract big data analytics and introduce more complexities. Nevertheless, the methodology adopted in this work has given satisfactory load forecasting performance for short-term and mid-term analysis.

Acknowledgements

This work is supported by the Science & Engineering Research Board, a statutory body of the Department of Science and Technology (DST), Government of India, under grants number ECR/2017/001027.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by Science and Engineering Research Board [grant number ECR/2017/001027].

References

  • Aghajani, G., and N. Ghadimi. 2018. “Multi-Objective Energy Management in a Micro-Grid.” Energy Reports 4: 218–225. doi:https://doi.org/10.1016/j.egyr.2017.10.002.
  • Akbary, P., M. Ghiasi, M. R. R. Pourkheranjani, H. Alipour, and N. Ghadimi. 2019. “Extracting Appropriate Nodal Marginal Prices for All Types of Committed Reserve.” Computational Economics 53 (1): 1–26. doi:https://doi.org/10.1007/s10614-017-9716-2.
  • Alex, D., and C. Timothy. 1990. “A Regression-Based Approach to Short Term System Load Forecasting.” IEEE Transactions on Power Systems. 5 (4): 1535–1550.
  • Aly Hamed, H. H. 2020. “A Proposed Intelligent Short-Term Load Forecasting Hybrid Models of ANN, WNN and KF Based on Clustering Techniques for Smart Grid.” Electric Power Systems Research 182.
  • Amara, F., K. Agbossou, Y. Dube, S. Kelouwani, A. Cardenas, and S. S. Hosseini. 2019. “A Residual Load Modeling Approach for Household Short-Term Load Forecasting Application.” Energy and Buildings 187: 132–143.
  • Brka, A., Y. M. Al-Abdeli, and G. Kothapalli. 2016. “Influence of Neural Network Training Parameters on Short-Term Wind Forecasting.” International Journal of Sustainable Energy 35 (2): 115–131. doi:https://doi.org/10.1080/14786451.2013.873437.
  • de Andrade, L. C. M., M. Oleskovicz, A. Q. Santos, D. V. Coury, and R. A. S. Fernandes. 2014. “Very Short-Term Load Forecasting Based on NARX Recurrent Neural Networks.” In 2014 IEEE PES General Meeting Conference Exposition, 1–5. doi: https://doi.org/10.1109/PESGM.2014.6939012.
  • Deihimi, A., O. Orang, and H. Showkati. 2013. “Short-Term Electric Load and Temperature Forecasting Using Wavelet Echo State Networks with Neural Reconstruction.” Energy 57 (4): 382–401.
  • Filho, K. N., A. D. P. Lotufo, and C. R. Minussi. Oct. 2011. “Multinodal Load Forecasting Using a General Regression Neural Network.” IEEE Transactions on Power Delivery 26 (4): 2862–2869.
  • Gob, R., K. Lurz, and A. Pievatolo. 2013. “Electrical Load Forecasting by Exponential Smoothing with Covariates.” Applied Stochastic Models in Business and Industry 29 (6): 629–645.
  • Hagan, M. T., and S. M. Behr. August 1987. “The Time Series Approach to Short Term Load Forecasting.” IEEE Transactions of Power Systems. 2 (3). https://doi.org/10.1109/TPWRS.1987.4335210.
  • Hamian, M., A. Darvishan, M. Hosseinzadeh, M. J. Lariche, N. Ghadimi, and A. Nouri. 2018. “A Framework to Expedite Joint Energy-Reserve Payment Cost Minimization Using a Custom-Designed Method Based on Mixed Integer Genetic Algorithm.” Engineering Applications of Artificial Intelligence 72: 203–212. doi:https://doi.org/10.1016/j.engappai.2018.03.022.
  • Haque, A. U., P. Mandal, J. Meng, and M. Negnevitsky. 2015. “Wind Speed Forecast Model for Wind Farm Based on a Hybrid Machine Learning Algorithm.” International Journal of Sustainable Energy 34 (1): 38–51.
  • Hernandez, L., C. Baladron, J. M. AguiarCarro, B. Sanchez-Esguevillas, A. J. Lloret, and J. Massana. 2014. “A Survey on Electric Power Demand Forecasting: Future Trends in Smart Grids, Microgrids, and Smart Buildings.” IEEE Communications Surveys and Tutorial 16: 1460–1495.
  • Hippert, H. S., C. E. Pereira, and R. C. Souza. 2001. “Neural Network for Short Term Load Forecasting: A Review and Evaluation.” IEEE Trans. on Power Syst 16 (1): 44–55.
  • Ji, P. R., D. Xiong, P. Wang, and J. Chen. 2012. “A Study on Exponential Smoothing Model for Load Forecasting.” In Proceedings of 2012 Power and Energy Eng. Conf. (APPEEC), Shanghai, China, 1–4.
  • Kazantzidis, A., E. Nikitidou, V. Salamalikis, P. Tzoumanikas, and A. Zagouras. 2018. “New Challenges in Solar Energy Resource and Forecasting in Greece.” International Journal of Sustainable Energy 37 (5): 428–435.
  • Kleebrang, W., P. Bunditsakulchai, and W. Wangjiraniran. 2017. “Household Electricity Demand Forecast and Energy Savings Potential for Vientiane, Lao PDR.” International Journal of Sustainable Energy 36 (4): 344–367. doi:https://doi.org/10.1080/14786451.2015.1017501.
  • Leng, H., X. Li, J. Zhu, H. Tang, Z. Zhang, and N. Ghadimi. 2018. “A New Wind Power Prediction Method Based on Ridgelet Transforms, Hybrid Feature Selection and Closed-Loop Forecasting.” Advanced Engineering Informatics 36: 20–30.
  • Liao, G. C., and T. P. Tsao. 2006. “Application of a Fuzzy Neural Network Combined with a Chaos Genetic Algorithm and Simulated Annealing to Short-Term Load Forecasting.” IEEE Transactions on Evolutionary Computation 10 (3): 330–340.
  • Liu, Y., W. Wang, and N. Ghadimi. 2017. “Electricity Load Forecasting by an Improved Forecast Engine for Building Level Consumers.” Energy 139: 18–30. doi:https://doi.org/10.1016/j.energy.2017.07.150.
  • Mirzapour, F., M. Lakzaei, G. Varamini, M. Teimourian, and N. Ghadimi. 2019. “A New Prediction Model of Battery and Wind-Solar Output in Hybrid Power System.” Journal of Ambient Intelligence and Humanized Computing 10 (1): 77–87.
  • Moghram, I., and S. Rahman. 1989. “Analysis and Evaluation of Five Short-Term Load Forecasting Techniques.” IEEE Trans. Power Systems 4 (4): 1484–1491.
  • Moon, J., K. Kim, Y. Kim, and E. Hwang. 2018. “A Short-Term Electric Load Forecasting Scheme Using 2-Stage Predictive Analytics.” In 2018 IEEE International Conference on Big Data and Smart Computing (BigComp), 219–226.
  • Moshkbar-Bakhshayesh, K., and M. B. Ghofrani. 2014. “Development of a Robust Identifier for NPPs Transients Combining ARIMA Model and EBP Algorithm.” IEEE Transactions on Nuclear Science 61 (4): 2383–2391.
  • Nazar, M. S., A. E. Fard, A. Heidari, M. S. Khah, and J. P. S. Catalao. 2018. “Hybrid Model Using Three-Stage Algorithm for Simultaneous Load and Price Forecasting.” Electrical Power System Research 165: 214–228.
  • Rahman, S., and I. Drezga. 1992. “Identification of a Standard for Comparing Short-Term Load Forecasting Techniques.” Electric Power Systems Research 25 (3): 149–158.
  • Salim, O. M., M. A. Zohdy, H. T. Dorrah, and A. M. Kamel. 2014. “Adaptive Neuro-Fuzzy Short-Term Wind-Speed Forecasting for Egypt’s East-Coast.” International Journal of Sustainable Energy 33 (1): 16–34. doi:https://doi.org/10.1080/14786451.2011.630468.
  • Shukur, O., N. Fadhil, M. H. Lee, and M. Ahmad. 2014. “Electricity Load Forecasting Using Hybrid of Multiplicative Double Seasonal Exponential Smoothing Model with Artificial Neural Network.” Jurnal Teknologi 69, doi:https://doi.org/10.11113/jt.v69.3109.
  • Solyali, D. 2020. “A Comparative Analysis of Machine Learning Approaches for Short-/Long-Term Electricity Load Forecasting in Cyprus.” Sustainability 12 (9), doi:https://doi.org/10.3390/su12093612.
  • Taylor, J. W., and R. Buizza. 2002. “Neural Network Load Forecasting with Weather Ensemble Predictions.” IEEE Transactions on Power Systems 17 (3): 626–632.
  • Taylor, J. W., and P. E. McSharry. 2007. “Short-Term Load Forecasting Methods: Evaluation Based on European Data.” IEEE Transactions on Power Systems 22 (4): 2213–2219.
  • Tian, C., J. Ma, C. Zhang, and P. Zhan. 2018. “A Deep Neural Network for Short-Term Load Forecast Based on LSTM and Conv. Neural Network.” Energies 11: 3493.
  • Wang, Y., Q. Chen, T. Hong, and C. Kang. 2018. “Review of Smart Meter Data Analytics: Applications, Methodologies, and Challenges.” IEEE Transactions on Smart Grid.
  • Wen, L., K. Zhou, and S. Yang. 2020. “Load Demand Forecasting of Residential Buildings Using a Deep Learning Model.” Electric Power Systems Research 179: p. 106073. doi:10.1016/j.epsr.2019.106073.
  • Xiao, Z., S. Ye, Bo Zhong, and C. Sun. 2009. “BP Neural Network with Rough Set for Short Term Load Forecasting.” Expert Systems With Applications 36 (1): 273–279.
  • Yildiz, B., J. I. Bilbao, and A. B. Sproul. 2017. “A Review and Analysis of Regression and Machine Learning Models on Commercial Building Electricity Load Forecasting.” Renewable and Sustainable Energy Reviews 73: 1104–1122. doi:https://doi.org/10.1016/j.rser.2017.02.023.
  • Zhang, X., J. Wang, and K. Zhang. 2017. “Short-Term Electric Load Forecasting Based on Singular Spectrum Analysis and Support Vector Machine Optimized by Cuckoo Search Algorithm.” Electric Power Systems and Research 146: 270–285.
  • Zhang, P., X. Wu, and X. Wang. September 2015. “Short-Term Load Forecasting Based on Big Data Technologies.” CSEE Journal of Power and Energy Systems 1 (3): 59–67.
  • Zhang, M., Z. Yu, and Z. Xu. 2020. “Short-Term Load Forecasting Using Recurrent Neural Networks With Input Attention Mechanism and Hidden Connection Mechanism.” IEEE Access 8: 186514–186529. doi:https://doi.org/10.1109/ACCESS.2020.3029224.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.