1,536
Views
44
CrossRef citations to date
0
Altmetric
Original Articles

Comparative analysis of gradient boosting algorithms for landslide susceptibility mapping

ORCID Icon
Pages 2441-2465 | Received 20 Apr 2020, Accepted 18 Sep 2020, Published online: 16 Oct 2020
 

Abstract

The aim of the study is to compare four recent gradient boosting algorithms named as Gradient Boosting Machine (GBM), Categorical Boosting (CatBoost), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM) for modelling landslide susceptibility (LS). In the first step of the study, the geodatabase including landslide inventory map and landslide conditioning factors was constructed. In the second step, chi-square (CHI) statistic-based feature selection (FS) technique was utilized to compute the importance of the landslide causative factors. In the third step, tree-based ensemble learning algorithms were applied to predict the potential distribution of landslide susceptibility. Also, the prediction performance of ensemble methods was compared to that of Random Forest (RF) ensemble method. Finally, the prediction capabilities of the methods were assessed using overall accuracy (Acc), area under the receiver operating characteristic curve (AUC), kappa index, root mean square error (RMSE), and F score measures. In order to further evaluation, the McNemar's test was utilized to assess statistical significance in the differences between the four gradient boosting models. The accuracy results indicated that the CatBoost model had the highest prediction capability (Acc= 0.8503 and AUC= 0.8975), followed by the XGBoost (Acc= 0.8336 and AUC= 0.8860), the LightGBM (Acc= 0.8244 and AUC= 0.8796) and the GBM (Acc= 0.8080 and AUC= 0.8685). On the other hand, the estimated accuracy measures considered in this study showed that the RF method had the lowest prediction capability of compared the others. Although the individual performances of the methods were found to be acceptable level, the CatBoost method showed the superior performance compared to others with respect to the AUC and Acc values estimated in this study. The results of the study confirmed that the relatively new ensemble learning techniques were efficient and robust for producing LS maps and furthermore, it is probably that these algorithms will be preferred more often in the future studies due to their robustness.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.