343
Views
12
CrossRef citations to date
0
Altmetric
Articles

A comparative study of prediction and classification models on NCDC weather data

&
Pages 414-425 | Received 27 Apr 2019, Accepted 25 Apr 2020, Published online: 20 May 2020
 

Abstract

Weather forecasting plays a significant role in different aspects of life such as in the operation of hydro-power plants, renewable energy, flood management, and agriculture. Recently, machine learning techniques have been used for weather forecasting for large periods of time, as it is more accurate than models based on physical principles. To address various problems, varieties of machine learning algorithms are applied in different fields. In our work, to examine whether these models are robust to predict National Climatic Data Center (NCDC) weather conditions, we carried out to compare newly emerging models with traditional meteorological models. In this paper, a set of the most common machine learning techniques are explored to generate robust weather forecasting model for long periods of time. Moreover, the combinations of all the model parameters are considered for simulations and the performance results of each method using a 10-fold cross-validation procedure are presented. The experimental results of the classifiers show that the decision tree CART, XGBoost and AdaBoost models exhibit better classification accuracy when compared with the other methods and for regression task, the linear regression method performs better in terms of R2 metric.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Notes on contributors

Ibrahim Gad

Ibrahim Gad graduated from Tanta University in 2006 with a B.S. in Computer Science. From 2006-2014, he worked as an instructor at the Department of Computer Science, Faculty of Science, Tanta University. He received his Master's degree in Computer Science in 2014, from the Department of Computer Science, Faculty of Science, Ain Shams University, Cairo, Egypt. From 2014 to- present date, he is a Teaching Assistant at the Department of Computer Science, Faculty of Science, Tanta University. From 2016 to- present date, he is a research scholar at the Department of Computer Science, Faculty of Science, Mangalore University. His research interests are in Data Mining, Big Data, Machine Learning, Deep Learning, and Natural Language Processing.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.