Abstract
The study focuses on the preparation of landslide susceptibility maps in the Kali River valley, Kumaun Himalaya using three machine learning algorithms, namely K-nearest neighbour (KNN), random forest (RF) and extreme gradient boosting (XGB). Fifteen landslide conditioning factors (LCFs) were selected and an inventory of 368 landslides was used for the analysis. Multicollinearity analysis using the variation inflation factor, tolerance and Pearson correlation coefficient (PCC) depicted less to no similarity between all factors. Evaluation of variable importance suggests LCFs such as slope, elevation and distance to thrust contributed significantly and consistently for all three models. Model accuracy was determined and compared using the area under the receiver operating characteristic curve and other statistical signifiers like accuracy, sensitivity, F-measure, accuracy, specificity and recall. The results show that the ensemble algorithms, XGB and RF, yield higher accuracy of approximately 85% compared to the KNN model with 81% accuracy.
Acknowledgements
The authors are thankful to the Director, Wadia Institute of Himalayan Geology, Dehradun for providing all necessary facilities and encouragement to carry out the present research work. This work is a part of Ambar Solanki’s doctoral thesis on the landslides in the Kumaun Himalaya.
Disclosure statement
No potential conflict of interest was reported by the authors.