ABSTRACT
The Moderate Resolution Imaging Spectroradiometer (MODIS) level-3 chlorophyll-a (Chl-a) product is one of the widely used ocean colour products that is often used for water quality monitoring of marine ecosystems. However, this product includes a large amount of missing data due to high surface reflectance and cloudy conditions that inevitably affect its suitability for spatiotemporal analysis of water quality. The objective of this study was to compare four Machine Learning (ML) techniques including K-nearest neighbour (KNN), Support Vector Regression (SVR), Random Forest Regression (RFR), and Artificial Neural Network (ANN) with well-known Data Interpolation Empirical Orthogonal Function (DINEOF) method for spatiotemporal missing imputation of MODIS Chl-a. The Southern Caspian Sea, which has a high Chl-a concentration, was selected as the case study. A cross-validation approach ranging missing data ratio from 0.1 to 0.8 was implemented to investigate the optimal parameters of the models and compare their performance for missing imputation. The results indicated that all ML models, except KNN, outperformed the DINEOF method for missing imputation of MODIS Chl-a. The SVR with the highest accuracy and the lowest variability of errors had the best performance among the five competing models, while the KNN showed the worst performance. The main reason for the better accuracy of the SVR than the other models is its structural risk minimization procedure that leads to the better generalization of the SVR model. The current results showed that the ML techniques used in the current study, the SVR in particular, are able to produce reliable imputations of the MODIS Chl-a missing data and can be a useful tool in water quality monitoring of marine ecosystems.
Acknowledgements
The authors appreciate NASA for providing the MODIS level-3 chlorophyll-a data. The authors appreciate the reviewers for their constructive comments which improved the results noticeably.
Disclosure statement
No potential conflict of interest was reported by the authors.
Supplementary material
Supplemental data for this article can be accessed here.
Correction Statement
This article has been republished with minor changes. These changes do not impact the academic content of the article.