94
Views
0
CrossRef citations to date
0
Altmetric
Research Article

A feature transformation and extraction approach-based artificial neural network for an improved production prediction of grid-connected solar photovoltaic systems

ORCID Icon, , ORCID Icon, ORCID Icon & ORCID Icon
Pages 9232-9254 | Received 20 Jun 2022, Accepted 17 Sep 2022, Published online: 06 Oct 2022
 

ABSTRACT

The grid operators face challenges due to the fluctuations in the energy production from Photovoltaic (PV) power plants. Therefore, reliable prediction models are critical as they aid grid operators in optimizing operational planning and lowering operating costs. One of the most challenging components of production prediction accuracy is determining a proper set of features that affect the models’ prediction accuracy. In this work, an accurate data-driven prediction model has been developed based on new timestamps and weather condition features that affect the PV system. Specifically, standard PV physics-based models have been employed for feature transformation and extraction from the traditional available historical features. The new features are then utilized to build an Artificial Neural Network (ANN), whose production predictions are compared to those obtained using the old features and other features extracted using the principal component analysis (PCA). The suggested approach’s efficacy is demonstrated using a 264 kWp PV system in Jordan. The proposed approach has shown superior with performance gain in accuracy reaches up to 21% and 64% compared to the models that employ old and PCA features, respectively. Additionally, the proposed approach reduces the computational cost by 20% and 7% compared to old and PCA models, respectively.

List of acronyms and notations

CSP=

Concentrated Solar Power

PV=

Photovoltaic

ASU=

Applied Science Private University

NWPs=

Numerical Weather Predictions

ML=

Machine Learning

FS=

Feature Selection

PCA=

Principal Component Analysis

RF=

Random Forest

YJT=

Yeo-Johnson Transformation

MLR=

Multiple Linear Regression

ESN=

Echo State Network

CNN=

Convolutional Neural Network

GRU=

Gated Recurrent Unit

DL=

Deep Learning

KM=

K-Means

EEMD=

Ensemble Empirical Mode Decomposition

MFA=

Modified Firefly Algorithm

NN=

Neural Network

SVR=

Support Vector Regression

WPD=

Wavelet Packet Decomposition

LSTM=

Long Short-Term Memory

ANN=

Artificial Neural Network

WT=

Wavelet Transform

BC=

Bias Compensation

GBM=

Gradient Boosting Machine

PC=

Physics-Constrained

SVM=

Support Vector Machine

BP=

Back-Propagation

W=

Historical timestamps and weather data

W=

New timestamps and weather data

WPCA=

Tranformed W data

P=

Historical PV production data

X=

Overall available input-output dataset

X=

New overall input-output dataset

XPCA=

PCA overall input-output dataset

Y=

Number of years

Δt=

Timestep of measurements

hr=

Hour counter

hr=

Hour counter index

d=

Day counter

d=

Day counter index

GHI=

Global solar irradiance on a horizontal surface

T=

Ambient temperature

WS=

Wind speed

S=

Solar time

Sa=

Solar angel

Gt=

Global solar irradiance on a tilted surface

HRA=

Hour Angle

LST=

Local Solar Time

δ=

Declination angle

α=

Elevation or altitude angle

φ=

Latitude of the location

ζ=

Zenith angle

γ=

Solar azimuth

θ=

Incidence angle

ZS=

Surface azimuth angle

RB=

Beam irradiance tilt factor

GB=

Beam irradiance on a horizontal surface

GD=

Diffuse irradiance on a horizontal surface

β=

Tilt angle of the surface

ρG=

Ground albedo

F=

Number of overall features/ANN nodes

F=

Number of selected PCA features/ANN nodes

H=

Number of ANN hidden neurons

h=

Index of the hidden neuron, h=1,…,H

Xj=

Generic training pattern

j=

Generic training pattern index

Pj=

The j-th actual production

Pˆj=

The j-th predicted production

Pˉ=

Average actual production

Wh,βh=

Internal ANN parameters (weights)

bh,bo=

Internal ANN parameters (biases)

C=

PV plant capacity

f1()=

Hidden neuron activation function

f2()=

Output neuron activation function

Rprop=

Resilient Backpropagation

OSS=

One Step Secant

LM=

Levenberg-Marquardt

MSE=

Mean Square Error

hopt=

Optimum number of hidden nodes

PC=

Principal Component

Xtrain=

Old training input-output dataset

Xtrain=

New training input-output dataset

Xvalid=

Old validation input-output dataset

Xvalid=

New validation input-output dataset

Xtest=

Old test input-output dataset

Xtest=

New test input-output dataset

Ntrain=

Number of training patterns

Nvalid=

Number of validation patterns

Ntest=

Number of test patterns

RMSE=

Root Mean Square Error

MAE=

Mean Absolute Error

WMAE=

Weighted MAE

NMAE=

Normalized MAE

R2=

Coefficient of Determination

Combined=

Overall combined performance metric

Metric=

A performance metric

PGMetric=

Performance gain of a metric metric

MetricBenchmark=

Benchmark’s performance metric

MetricProposed=

Proposed’s performance metric

Acknowledgments

The authors would like to acknowledge the Renewable Energy Center at the ASU for sharing with us the solar PV system data.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data Availability

The data that support the findings of this study are available from the authors upon reasonable request and with permission of the Renewable Energy Center at ASU.

Additional information

Funding

This research received no external funding.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.