Research Articles

Improvement of the predictive performance of landslide mapping models in mountainous terrains using cluster sampling

Pages 12294-12337 | Received 27 Sep 2021, Accepted 10 Apr 2022, Published online: 08 May 2022
 

Abstract

Landslide predictive performance is expected to vary with the sampling technique used, such as random or cluster sampling of landslides. Recent advances in remote sensing technologies and machine learning (ML) have enhanced landslide prediction performance. Landslides in the Himalayan mountain range in Pakistan pose a serious threat to the ecosystem and the valley population. The present study explores and tests an alternative sampling technique based on spatial pattern characterization, with the aim of increasing landslide prediction efficacy, in place of the widely used random technique for selecting training and testing samples. To this end, landslide inventory data were compiled together with 17 geo-environmental factors (i.e. topographic, hydrological and seismic factors). Landslide cluster patterns were confirmed by the Nearest Neighbor Index (NNI) method; once the cluster patterns were established, the predictive performance of each landslide sampling technique was tested using ML and statistical methods. Advanced ML algorithms, including Random Forest (RF), Extreme Gradient Boosting (XGBoost), Naive Bayes (NB) and K-nearest Neighbors (KNN), and statistical methods, including Weight-of-Evidence (WofE) and Logistic Regression (LR), were used and validated. The landslide-prone district of Azad Jammu and Kashmir (Neelum Valley), Kashmir Himalayas, Pakistan, was selected as a case study. Prediction performance was higher with the cluster sampling technique, with area under the curve (AUC) ranging from 0.802 to 0.912, accuracy (ACC) from 0.78 to 0.89 and kappa from 0.50 to 0.68, than with the random sampling technique, with AUC ranging from 0.768 to 0.895, ACC from 0.74 to 0.86 and kappa from 0.48 to 0.64. In descending order of accuracy, the six algorithms were XGBoost, RF, KNN, NB, LR and WofE.
Our results confirmed that landslides in the study area follow cluster patterns, and that training ML algorithms on cluster samples improved landslide susceptibility prediction with a statistically significant difference. The outcomes support the hypothesis that using the natural spatial distribution of landslides as training samples, instead of random selection, improves prediction ability, and they highlight that an alternative landslide partitioning technique could be a practicable and robust choice for landslide prediction modelling.
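
For readers unfamiliar with the Nearest Neighbor Index mentioned above, the following is a minimal sketch of the standard Clark and Evans formulation, not the authors' implementation. The function name `nearest_neighbor_index`, the planar coordinates and the `area` argument are illustrative assumptions; the abstract does not state the software, study-area extent or edge correction used. An NNI below 1 suggests clustering, a value near 1 a random pattern, and a value above 1 dispersion.

```python
import math


def nearest_neighbor_index(points, area):
    """Return (NNI, z-score) for planar points within a region of given area.

    Hypothetical helper for illustration only: assumes projected (x, y)
    coordinates in the same linear unit as sqrt(area), and applies no
    edge correction.
    """
    n = len(points)
    if n < 2 or area <= 0:
        raise ValueError("need at least two points and a positive area")

    # Observed mean distance from each point to its nearest neighbour
    # (brute force, O(n^2); adequate for small inventories).
    total = 0.0
    for i, (xi, yi) in enumerate(points):
        total += min(
            math.hypot(xi - xj, yi - yj)
            for j, (xj, yj) in enumerate(points)
            if j != i
        )
    observed = total / n

    # Expected mean distance under complete spatial randomness.
    expected = 0.5 / math.sqrt(n / area)

    # Standard error of the mean nearest-neighbour distance.
    std_error = 0.26136 / math.sqrt(n * n / area)

    return observed / expected, (observed - expected) / std_error


# Two tight pairs of points in a 100 x 100 region: a clustered pattern.
clustered = [(0.0, 0.0), (0.0, 1.0), (50.0, 50.0), (50.0, 51.0)]
nni_clustered, z_clustered = nearest_neighbor_index(clustered, area=100.0 * 100.0)

# A regular 10 x 10 grid with spacing 10 in the same region: a dispersed pattern.
grid = [(10.0 * i, 10.0 * j) for i in range(10) for j in range(10)]
nni_grid, z_grid = nearest_neighbor_index(grid, area=100.0 * 100.0)
```

In this sketch a strongly negative z-score accompanies an NNI well below 1, which is the kind of evidence that would indicate a clustered landslide inventory.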

Acknowledgements

We are grateful to the Director of the Institute of Geology, University of Azad Jammu and Kashmir, for providing transportation during field surveys, which was a critical component of our work.

Disclosure statement

This manuscript has not been published or presented elsewhere in part or in entirety and is not under consideration by another journal. There are no conflicts of interest to declare.

Ethical approval

Not applicable.

Consent to participate

Not applicable.

Consent to publish

Not applicable.
