ABSTRACT
The grid operators face challenges due to the fluctuations in the energy production from Photovoltaic (PV) power plants. Therefore, reliable prediction models are critical as they aid grid operators in optimizing operational planning and lowering operating costs. One of the most challenging components of production prediction accuracy is determining a proper set of features that affect the models’ prediction accuracy. In this work, an accurate data-driven prediction model has been developed based on new timestamps and weather condition features that affect the PV system. Specifically, standard PV physics-based models have been employed for feature transformation and extraction from the traditional available historical features. The new features are then utilized to build an Artificial Neural Network (ANN), whose production predictions are compared to those obtained using the old features and other features extracted using the principal component analysis (PCA). The suggested approach’s efficacy is demonstrated using a 264 kWp PV system in Jordan. The proposed approach has shown superior with performance gain in accuracy reaches up to 21% and 64% compared to the models that employ old and PCA features, respectively. Additionally, the proposed approach reduces the computational cost by 20% and 7% compared to old and PCA models, respectively.
List of acronyms and notations
CSP | = | Concentrated Solar Power |
PV | = | Photovoltaic |
ASU | = | Applied Science Private University |
NWPs | = | Numerical Weather Predictions |
ML | = | Machine Learning |
FS | = | Feature Selection |
PCA | = | Principal Component Analysis |
RF | = | Random Forest |
YJT | = | Yeo-Johnson Transformation |
MLR | = | Multiple Linear Regression |
ESN | = | Echo State Network |
CNN | = | Convolutional Neural Network |
GRU | = | Gated Recurrent Unit |
DL | = | Deep Learning |
KM | = | K-Means |
EEMD | = | Ensemble Empirical Mode Decomposition |
MFA | = | Modified Firefly Algorithm |
NN | = | Neural Network |
SVR | = | Support Vector Regression |
WPD | = | Wavelet Packet Decomposition |
LSTM | = | Long Short-Term Memory |
ANN | = | Artificial Neural Network |
WT | = | Wavelet Transform |
BC | = | Bias Compensation |
GBM | = | Gradient Boosting Machine |
PC | = | Physics-Constrained |
SVM | = | Support Vector Machine |
BP | = | Back-Propagation |
= | Historical timestamps and weather data | |
= | New timestamps and weather data | |
= | Tranformed | |
= | Historical PV production data | |
= | Overall available input-output dataset | |
= | New overall input-output dataset | |
= | PCA overall input-output dataset | |
= | Number of years | |
= | Timestep of measurements | |
= | Hour counter | |
= | Hour counter index | |
= | Day counter | |
= | Day counter index | |
= | Global solar irradiance on a horizontal surface | |
= | Ambient temperature | |
= | Wind speed | |
= | Solar time | |
= | Solar angel | |
= | Global solar irradiance on a tilted surface | |
= | Hour Angle | |
= | Local Solar Time | |
= | Declination angle | |
= | Elevation or altitude angle | |
= | Latitude of the location | |
= | Zenith angle | |
= | Solar azimuth | |
= | Incidence angle | |
= | Surface azimuth angle | |
= | Beam irradiance tilt factor | |
= | Beam irradiance on a horizontal surface | |
= | Diffuse irradiance on a horizontal surface | |
= | Tilt angle of the surface | |
= | Ground albedo | |
= | Number of overall features/ANN nodes | |
= | Number of selected PCA features/ANN nodes | |
= | Number of ANN hidden neurons | |
= | Index of the hidden neuron, | |
= | Generic training pattern | |
= | Generic training pattern index | |
= | The | |
= | The | |
= | Average actual production | |
= | Internal ANN parameters (weights) | |
= | Internal ANN parameters (biases) | |
= | PV plant capacity | |
= | Hidden neuron activation function | |
= | Output neuron activation function | |
Rprop | = | Resilient Backpropagation |
OSS | = | One Step Secant |
LM | = | Levenberg-Marquardt |
= | Mean Square Error | |
= | Optimum number of hidden nodes | |
= | Principal Component | |
= | Old training input-output dataset | |
= | New training input-output dataset | |
= | Old validation input-output dataset | |
= | New validation input-output dataset | |
= | Old test input-output dataset | |
= | New test input-output dataset | |
= | Number of training patterns | |
= | Number of validation patterns | |
= | Number of test patterns | |
= | Root Mean Square Error | |
= | Mean Absolute Error | |
= | Weighted | |
= | Normalized | |
= | Coefficient of Determination | |
= | Overall combined performance metric | |
= | A performance metric | |
= | Performance gain of a metric | |
= | Benchmark’s performance metric | |
= | Proposed’s performance metric |
Acknowledgments
The authors would like to acknowledge the Renewable Energy Center at the ASU for sharing with us the solar PV system data.
Disclosure statement
No potential conflict of interest was reported by the author(s).
Data Availability
The data that support the findings of this study are available from the authors upon reasonable request and with permission of the Renewable Energy Center at ASU.