1,579
Views
4
CrossRef citations to date
0
Altmetric
Article

Spatial prediction of the geological hazard vulnerability of mountain road network using machine learning algorithms

, , , &
Article: 2170832 | Received 05 Dec 2022, Accepted 17 Jan 2023, Published online: 25 Jan 2023

Abstract

The current assessment index of the geological hazard vulnerability assessment for mountain road network is relatively simple, and the assessment methods used are subjective, complex, and inefficient. This study proposes a prediction model for geological hazard vulnerability assessment of mountain road network incorporating machine learning algorithms. First, based on the quantification of the characteristics of the mountain road network and the local rescue forces, an objective and reasonable index-based system of vulnerability assessment of the mountain road network was constructed by combining the population, economic, and material factors. Second, the FAHP and AHP-TOPSIS were applied for the development of the vulnerability assessment models to carry out the preliminary vulnerability assessment for different road types. Third, the results of the preliminary vulnerability assessment were used as the sample set to build a road vulnerability prediction model using SVM, RF, and BPNN algorithms. Finally, the five-fold cross-validation and statistical parameter accuracy analysis were conducted to determine the most reasonable model with the highest prediction accuracy for geological hazard vulnerability mapping of the mountain road network. The results indicated that the vulnerability prediction model based on the FAHP sample set using the RF algorithm demonstrated the highest accuracy and robustness.

1. Introduction

Due to the complex geological environment, human activities, and extreme climatic conditions, geological hazards such as landslides and debris flows frequent occur around mountain roads, seriously threatening road safety and human life. Therefore, it is important to conduct a risk assessment of geological hazards for mountain roads.

At present, several studies on vulnerability and risk assessment for roads have been conducted globally. Alsabhan et al. (Citation2022) performed a landslide susceptibility mapping of roads within the Himalayas based on the comparison of the accuracy of the three methods, namely, weights of evidence, information value, and frequency ratio. Kadi et al. (Citation2021) conducted the analysis and discussion of the best routes for forest roads in the Macka region of Turkey based on the landslide susceptibility mapping. Saravanan et al. (Citation2021) extracted eight factors influencing landslides, and then performed landslide susceptibility mapping for the NH67 road in India using the frequency ratio method (Zayed et al. Citation2008).

However, the vulnerability assessment of roads involves a broad range of complex influencing factors and cross-cutting disciplines. Therefore, there are only a few research studies on the geological hazards vulnerability assessment of road, which can be mainly divided into the analysis of population vulnerability (Maletta and Mendicino Citation2022; Zhang et al. 2014) and economic and physical vulnerability (Eidsvig et al. Citation2014; Hearn and Pongpanya Citation2021). Moreover, most of these studies only use the possible casualties and direct losses resulting from geological hazards as the vulnerability assessment index. On the whole, the assessment factors used were oversimplified is one of the main problems that need to be solved in geological hazard vulnerability assessment of road.

The construction of the index-based system for assessing vulnerability and the selection of the vulnerability assessment model are the two most important steps in the vulnerability assessment for mountain roads. Among them, the selection of assessment indices has a great influence on the reliability of geological hazards vulnerability assessment of roads. As an important component of the transportation system, the major function of the road is to ensure the normal passage of vehicles. Therefore, the process of selection of factors influencing the vulnerability assessment for the road should initially focus on the characteristics of the road itself and the local disaster relief capacity and then combine them with the local population, economy, material, and other quantitative assessment indices to construct the most suitable index-based assessment system for the vulnerability assessment of mountain road network (Ahmad et al. Citation2022; Saleh et al. Citation2022; Sitti et al. Citation2022).

The assessment methods in the recent road vulnerability studies are mostly based on expert qualitative methods. For example, Zhang et al. (Citation2020) established the Analytical Hierarchy Process (AHP) based on certain assessment factors to analyze the vulnerability of the road network in Oahu, Hawaii. Some researchers (MartínezCarvajal et al. Citation2018; Petrucci and Gullà Citation2010) applied mathematical methods in combination with the assessment factors to define landslide susceptibility in a relevant manner and performed the susceptibility zoning calculations for the study area. Although these studies have promoted the development of road vulnerability research to a certain extent, the assessment results of the above studies have the limitation of being strongly subjective. Among them, the selection of evaluation factors and the determination of weights rely on expert rating or empirical formulas, which not only amplify the uncertainty of the assessment, but also tend to be less efficient when dealing with complex data. It has been demonstrated that using algorithm design, and machine learning techniques can be trained to learn from sample data for the assessment of target regions without expert-developed weights, thus, helping in overcoming this limitation.

Over the past 20 years, machine learning techniques have developed rapidly. As an efficient prediction method, machine learning techniques have evolved from initial research in the laboratory to its use in a wide range of commercial applications (Jordan and Mitchell Citation2015). Especially in the field of susceptibility assessment of geological hazards, machine learning algorithms have gradually become widely accepted assessment models. These mainly include support vector machine models (Chen et al. Citation2017; Marjanovi et al. Citation2011; Zhou et al. Citation2018), logistic regression models (Ayalew and Yamagishi Citation2005; Bai et al. Citation2010; Umar et al. Citation2014), random forest models (Chen et al. Citation2018; Hong et al. Citation2016; Kim et al. Citation2018), artificial neural networks (Pradhan and Lee Citation2010a; Pradhan and Lee Citation2010b; Pradhan et al. Citation2010; Xu et al. Citation2015), and deep learning algorithms such as multilayer perception models, deep neural networks, and recurrent neural network models (Dong et al. Citation2020; Lv et al. Citation2022; Mandal et al. Citation2021). In addition, many researchers have made successful attempts toward developing machine learning with combinatorial optimization. For example, combined model based on fuzzy theory (Chen et al. Citation2017), combined model based on random forests (Qiu et al. Citation2017), and combined model based on heuristic algorithms. The fact that such a large number of studies on geological hazards have been conducted using machine learning models confirms the advantages of machine learning models. However, challenges still exist regarding how machine learning models can be better applied to the vulnerability assessment of geological hazards.

In this context, this study proposed to develop an efficient, objective, and reasonable method for predicting the geological hazards vulnerability of mountain roads. Firstly, based on the targeted construction of the index-based system of vulnerability assessment of the mountain road network, the preliminary vulnerability assessment of the mountain roads was completed. Then, the preliminary results were used as the sample set to construct a road vulnerability prediction model using several machine learning algorithms. Finally, the model with the highest prediction accuracy was selected to complete the geological hazards vulnerability of mountain road network.

2. Study area

Three distinct road levels were chosen in this study, which included two national roads, three provincial roads, and three county roads, with a total length of 432.479 km, as shown in .

Table 1. Research road network basic information.

The road network selected for this study is located in Youxi County, Central Fujian Province, east of Sanming City, China (). The geomorphology of the study area is complicated and diverse due to neotectonic uplift, intense erosion, and the cutting effect of flowing water. The study area is characterized by several mountains, hills, basins, valleys, and narrow plains, with the mountains and hills making up around 93% of the entire area. The study area comprises many rivers, with a total watershed area of over 5,000 km2 and a total river length of 202 km. The majority of the rock layers in the study area are covered by Quaternary residual soils, resulting in a wide distribution of soil-rock dual structure slope. Youxi County is located in the subtropics, and its climate is affected by alternating continental and oceanic climates, resulting in high precipitation, especially during the spring and summer seasons.

Figure 1. Location map of the study area.

Figure 1. Location map of the study area.

Since the main object of this study is a road, a typical linear project with huge length-to-width ratio, the road is divided into an evaluation unit according to 0.5 km and a total of 758 units, of which the number of units occupied by Chinese roads, provincial roads and county roads are 304, 286 and 168 respectively.

3. Establishment of the index-based vulnerability assessment system

Several objects are threatened by geological hazards, and some of these objects change dynamically with time, such as the buildings in the study area. Meanwhile, different types of geological hazards affect some things differently, for example, a creeping landslide may cause a little damage to moving objects, such as humans, but sudden-onset landslides can be fatal to humans under certain circumstances. Therefore, it is not realistic to include all the objects that may be affected by the geological hazards in the index-based system of vulnerability assessment. When the assessment indices are selected, the objects in the study area that are most likely to be affected by the damage due to geological hazards should be chosen. Moreover, the problem of acquisition is also one of the factors to be considered, and assessment indices that are easy to develop and convenient to quantify should be selected. According to the obtained data and considering the situation in the study area, a hierarchical structure of vulnerability assessment of export network is constructed (). Firstly, the vulnerability evaluation of road network was used as the target layer. Then the four major categories of road network characteristics, disaster relief capacity, population, economy and physical characteristics were identified as the criterion layer. Finally, three index factors were used to quantitatively evaluate each major category of the criterion layer, and thus an evaluation system based on 12 indexes was established.

Considering the obtained data and the situation in the study area, this research quantitatively evaluated the characteristics of the road network, disaster relief capacity, population, and economic and material characteristics of the study area and then used them as indices for the vulnerability assessment of the road network.

3.1. Road network characteristics

  1. Road grade: The relative grade of the road was determined by the actual local situation based on the Design Specification for Highway Alignment.

  2. Road width: Road width is one of the important indices of road capacity, and high-precision remote sensing image identification was used as the method in this study to obtain the road width.

  3. Frequency of hidden danger exclusion: Hidden danger exclusion means arranging professional staff to conduct a comprehensive and detailed mapping and survey of the roads as well as the surrounding affiliated facilities, considering the characteristics of geological hazards and the actual situation of road traffic in the study area. This helps to accurately identify the road areas prone to landslides, mudslides, and other geological hazards so that government personnel can take practical and reliable measures for the management of hidden areas before the onset of geological hazards to ensure road safety.

3.2. Disaster relief capability

  1. Detour Index: The detour index represents the difference between the detour path and the original travel path when a geological hazard occurs, and the vehicle can choose to bypass the road section affected by the geological hazard before the road is opened for traffic. In this study, the intersection of the boundary of the study area and the road network was defined as the ‘exit’. The road intersection was defined as the ‘node’. The section of road between two adjacent nodes or the road intersection near the exit was known as ‘road section’. As shown in , the exits, nodes, and road sections together form the internal road network of the study area. The roads of the study area were divided into different assessment units based on certain road mileage, and the shortest path from the assessment unit to a certain exit was used as the travel path of the valuation unit. If the geological hazards damage the highway, the vehicle can bypass the section affected by the geological hazards to complete the passage of other paths called detour paths.

    In this study, the detour index for each assessment cell is defined as the ratio of the detour path of the road unit at each exit to the original travel path, as given by EquationEquation (1).

(1) C=i=1mCx=i=1mDxDy(1)

where Cx represents the detour index of the road unit at exit x; Dy is the length of the original travel path; and Dx is the length of the detour path.

  • Rescue index: If the geological hazards occur somewhere in the region and block or damage the highway, the rescue team starts from the nearest county and moves to the place affected by the geological hazards. If the affected place is far away from the nearest county, it takes a longer time for the rescue team to reach the disaster site. In this study, the rescue index is defined as the ratio of the rescue distance to the design speed of the rescue road.

(2) T=dv(2)

where T is the rescue index, d is the rescue distance, and v is the design speed of the rescue road.

  • Disaster relief equipment supplies: For geological disaster-prone areas, not only a good emergency plan is needed, but also special equipment to deal with geological disasters is essential. In this study, this index is defined as the density of disaster relief equipment. The following is the specific method of obtaining this index, based on the survey of the number of disaster relief equipped facilities in each township, the density of disaster relief equipped facilities in the study area was calculated by the density analysis tool in the ARCGIS, and the corresponding values were extracted to road units.

Figure 2. Original travel path and post-disaster detour path.

Figure 2. Original travel path and post-disaster detour path.

Figure 3. A hierarchical structure of vulnerability assessment of road network.

Figure 3. A hierarchical structure of vulnerability assessment of road network.

3.3. Population characteristics

This index usually represents the possibility of human casualties in the study area affected by the geological hazards and mainly reflects the resilience of humans to geological hazards in the study area. Based on the characteristics of the study area, three factors, namely, population density, education level, and age structure, were selected as the assessment indices of the population characteristics in the study area.

3.4. Economic and material characteristics

The economic losses caused by the geological hazards are huge. The economic losses can be divided into direct economic losses and indirect economic losses. The direct economic losses include damage to buildings, different equipment, and various production materials. In this study, the two indicators of per capita income and road maintenance cost were selected for economic vulnerability assessment. Meanwhile, material vulnerability leads to losses from damages to fixed structures, such as buildings and roads, during geological hazards. According to the collected information, the building density was selected as the indices of material vulnerability assessment.

Finally, according to the index-based system of road vulnerability assessment, a normalized decision matrix D, as shown below, was constructed. (3) D=[a11a12a1na21a22a2nam1am2amn](3) where amn represents the normalized value of the nth assessment index of the mth road unit.

4. Preliminary assessment of road vulnerability

4.1. Determination of weights using analytical hierarchy process

In this study, Analytical Hierarchy Process (AHP) was used to calculate the weights of each level of the index in the index-based system of vulnerability assessment. This process uses the 9-point scale proposed by Saaty () to construct the judgment matrices for the criterion layer and the index layer and then obtains the comprehensive weights of each assessment index by the product method.

Table 2. Takes values using the 9-point scale.

As the AHP method is subjective, its judgment matrix was derived based on the data-fitting results of the index-based system of vulnerability assessment of roads, in combination with the opinions of relevant road engineering experts. All the elements in the pairwise comparison matrix passed the consistency test with a certain degree of rationality. The pairwise comparison matrix, the largest eigenvalue, and consistency ratio of layers are shown in , on this basis we can calculate the comprehensive weight ().

Table 3. The pairwise comparison matrix, factor weights.

Table 4. Comprehensive weight acquire via the AHP.

At present, the AHP is widely used in risk assessment (Aminbakhsh et al. Citation2013), geological and environmental assessment (Cengiz and Ercanoglu Citation2022; Das et al. Citation2022), and group decision-making because of its simplicity of usage, computational efficiency, and generalizability to multilevel qualitative and quantitative data (Kahraman et al. Citation2004). However, the decision-makers simplify this strategy to reduce the complexity of the problem or based on the need, resulting in biased judgments. This can lead to a certain degree of inaccuracy and subjectivity in the assessment results, which is the major limitation of this method (Kahraman et al. Citation2003).

4.2. Determination of vulnerability assessment using FAHP

To overcome the shortcomings of the very subjective traditional hierarchical analysis method, the fuzzy mathematical theory was introduced in combination with the hierarchical analysis to develop Fuzzy-Analytical Hierarchy Process (FAHP). On this basis, the uncertainty in the preferences of decision-makers is quantified to achieve more flexibility in judgment and decision-making. FAHP retains the advantages of simplicity and efficiency of AHP, while the hierarchical structure obtained by this method is easy to decompose and perform a pairwise comparison (Feizizadeh et al. Citation2014). Moreover, it can reduce inconsistency and generate priority vectors, and reflects the human reasoning of applying approximate information and uncertainty for decision making.

In general, the vulnerability results are taken in the range of [0, 1], and the road network vulnerability assessment results were classified into five levels using the equivalence method (), i.e. very low, low, medium, high, and very high (Hoque et al. Citation2019; Tehrany et al. Citation2014).

Table 5. Vulnerability assessment of earthen sites of road.

In this study, the vulnerability assessment was determined using FAHP in three main steps, as given below:

  • Step 1: Determination of the relative membership using the fuzzy set theory

    In the fuzzy set theory, the affiliation degree is used for expressing the fuzzy relation of elements belonging to a certain set, and the affiliation degree quantifies the uncertainty of the affiliation information by some specific functions. Therefore, a fuzzy set can be described as a set containing elements with varying degrees of membership in the set.

    The fuzzy relationship matrix R is represented as, (4) R=[R1R2Rm]=[r11r12r1nr21r22r2nrm1rm2rmn](4)

    where R represents the membership of the ith index belonging to the jth rank. The membership function is established according to the characteristics of the index-based system. When the membership function is comparable to the assessment result, the triangular membership function can be selected, which has been successfully applied by Guo et al. (Citation2017). The triangular membership function () was applied to structure the fuzzy set based on the criteria in , as represented by EquationEquation (5): (5) ri1={1a0.70.20a0.90.7<a<0.9a<0.7ri2={00.9a0.2a0.50.20a0.90.7a<0.90.5a<0.7a<0.5ri3={00.7a0.2a0.30.20a0.70.5a<0.70.3a<0.5a<0.3ri4={00.5a0.2a0.10.20a0.50.3a<0.50.1a<0.3a<0.1ri5={10.3a0.20a0.10.1a<0.3a0.3}(5)

    where a is the element in the normalized decision matrix D of the index-based system of vulnerability assessment, i1,2,,m.

  • Step 2: The integrated weights calculated by AHP were inserted into the fuzzy assessment system, and the comprehensive assessment vector of the goal layer G was calculated, as provided by EquationEquation (6). (6) G=w×R=(w1,w2,,wm)×[r11r12r1nr21r22r2nrm1rm2rmn](6) where wm is the comprehensive weight calculated by AHP ().

  • Step 3: The assessment results of the road network vulnerability V were calculated from the weighted average method given by EquationEquation (7). (7) V=G×KT(7) where K is the mid-score of each level and K=(0.9, 0.7, 0.5, 0.3, 0.1)

4.3. Determination of vulnerability assessment using AHP-TOPSIS

Analytical Hierarchy Process-Technique for Order Preference by Similarity to Ideal Solution (AHP-TOPSIS) computes the weights of each influencing factor, constructs the judgment matrix based on these weights, and determines the posting schedule and the ranking of the judging objects, such that the negotiation result is close to the optimal result (Lixin et al. Citation2017). In this study, the vulnerability assessment using FAHP was determined by the three main steps as given below:

Figure 4. Triangular membership function.

Figure 4. Triangular membership function.
  • Step 1: The first step in the AHP-TOPSIS method includes the decision matrix S given by EquationEquation (8). (8) S=(sij)m×n=[w1a11w2a12wna1nw1a21w2a22wna2nw1am1w2am2wnamn](8) where a is the element in the normalized decision matrix D of the index-based system of vulnerability evaluation, and wm is the integrated weight calculated by AHP ().

  • Step 2: Determination of relative closeness

    This step produces the positive ideal solution and the negative ideal solution, as given in EquationEquation (9). (9) {S+={(maxsij1im|jJ1),(minsij1im|jJ2)}S={(minsij1im|jJ1),(maxsij1im|jJ2)}(9) where S+ represents the positive ideal solution, and S represents the negative ideal solution. J1 and J2 are associated with positive and negative attributes, respectively. In this study, the positive attributes include maintenance intervals, detour index, rescue distances, population density, age structure, road maintenance cost, and building density, while the negative attributes include road grade, road width, disaster relief equipment supplies, education level, and per capita income.

  • Step 3: The Euclidean separation distance between each road unit and the ideal solution is calculated as: (10) {Di+=j=1n(sijsj+)2Di=j=1n(sijsj)2(10) where Di+ and Di are associated with the positive and negative-ideal solutions, respectively. Sij is each element of the decision matrix S, sj+ is the positive ideal related to each column, and sj is the negative ideal related to each column.

  • Step 4: The relative closeness (Bi) value of each alternative of the criterion layer for the ideal solution is determined using EquationEquation (11): (11) Bi=DiDi++Di(11)

  • Step 5: Finally, the comprehensive assessment score V can be calculated according to EquationEquation (12). (12) V=w1i×B(12) where w1i is the comprehensive weight of the criterion layer calculated by AHP (), and B is the assessment matrix formed by the relative closeness value of each alternative for the ideal solution.

4.4. Comparison between the vulnerability assessment results using FAHP and AHP-TOPSIS models

The results of the geological hazards vulnerability assessment of road network calculated using the FAHP and AHP-TOPSIS methods are shown in and b. The results indicated that the road vulnerability values calculated by the FAHP method ranged from 0.1746 to 0.6679, while the vulnerability values calculated by the AHP-TOPSIS method ranged from 0.1734 to 0.6608. Thus, the ranges of vulnerability results obtained by the two methods were relatively consistent, which confirmed the accuracy of the vulnerability results to some extent.

Figure 5. Vulnerability assessment results using FAHP and AHP-TOPSIS models: (a) FAHP, (b) AHP-TOPSIS.

Figure 5. Vulnerability assessment results using FAHP and AHP-TOPSIS models: (a) FAHP, (b) AHP-TOPSIS.

The results obtained by the two methods were processed according to the grading criteria in for a comprehensive analysis of the vulnerability assessment results using FAHP and AHP-TOPSIS models and represented in .

Figure 6. Vulnerability grading results.

Figure 6. Vulnerability grading results.

As shown in , the vulnerability assessment results of the roads in the study area calculated by FAHP and AHP-TOPSIS methods do not indicate ‘very high’ vulnerability levels. According to the grading standard of road network vulnerability (), a very high road vulnerability grade indicates that the road is under serious threat from geological hazards and the risk to life and property in the area is serious. However, this is contrary to the principles of road design and construction in China. Hence, such a result is in agreement with the actual situation. At the same time, the results of the vulnerability assessment of the road unit indicated that more than 90% of the data were at ‘low’ and ‘medium’ vulnerability levels. Thus, it is evident that the onset of geological hazards can lead to different degrees of damage to most mountain roads. Therefore, urgent steps should be undertaken by the road management departments to formulate and take appropriate protection measures for the safety of roads and surrounding facilities in the study area.

Further, a comparison of the results of the two assessment methods indicated certain differences in the vulnerability results calculated by FAHP and AHP-TOPSIS methods. These differences were mainly in terms of the degree of distribution of the vulnerability levels. For example, the vulnerability assessment data values obtained by the FAHP method were mainly at ‘low’ (40.4%) and ‘medium’ (57.7%) vulnerability levels, while the assessment data values obtained by AHP-TOPSIS were very concentrated, with more than 90% of the road units at ‘low’ vulnerability levels. The major reason for the difference in the results may be due to the different judgment matrices applied by the two methods. FAHP mainly applies the fuzzy triangular mathematical function for the construction of the affiliation matrix and then performs a comprehensive assessment of the site vulnerability, while AHP-TOPSIS mainly performs a comprehensive assessment based on the judgment matrix constructed by the relative closeness of each evaluation object to the positive and negative ideal solutions.

Further analysis of the vulnerability character of mountain roads was conducted by dividing the vulnerability results calculated by FAHP and AHP-TOPSIS methods based on the road type (). As shown in , firstly, the vulnerability assessment results for national roads and county roads obtained by FAHP and AHP-TOPSIS methods were different, with those obtained by FAHP found to be more conservative. However, the results of the two methods for the provincial roads were found to be relatively in agreement. Secondly, the vulnerability results obtained by the FAHP method indicated that the vulnerability severity levels of national roads and county roads were significantly lower than those of provincial roads. However, the assessment results of AHP-TOPSIS demonstrated the vulnerability trend, i.e. national roads > provincial roads > county roads. This phenomenon is observed due to various reasons, one of which may be the influence of AHP assessment weights or the selection of assessment indices. However, in general, the results of the two methods were found to be relatively in agreement, especially for the lower-ranked mountain roads.

Figure 7. Classification of road type vulnerability.

Figure 7. Classification of road type vulnerability.

Based on the above analysis and discussion of the FAHP and AHP-TOPSIS results of road vulnerability assessment in the study area, some of the characteristic features of these two methods for vulnerability assessment were identified, and these characteristic features can provide some theoretical basis for the selection of vulnerability assessment methods in other areas. For example, the AHP-TOPSIS method can be used for the evaluation of the vulnerability of some areas or facilities with high-security level requirements, while the FAHP method can be used for calculating the vulnerability assessment in some less important areas.

5. Vulnerability assessment-based machine learning

In the previous section, the results of the vulnerability assessment of geological hazards in combination with the actual situation were analyzed, which validated the rationality of FAHP and AHP-TOPSIS methods to some extent at the subjective level. However, these methods still have some limitations; for example, the calculation of the initial weights is somewhat subjective and involves a high probability of miscalculations in the complex process. Therefore, the study proposed a machine learning algorithm-based geological hazards vulnerability assessment model for mountain road network, which was more rapid, objective, and reliable. The process involved in the model is shown in .

Figure 8. Machine learning-based road vulnerability modeling process.

Figure 8. Machine learning-based road vulnerability modeling process.

As shown in , the machine learning-based vulnerability assessment of the road was divided into four steps. In step 1, the five-fold cross-validation method was used to divide the vulnerability assessment results obtained from the two types of assessment methods into a training set (80%) and a test set (20%). In step 2, the support vector machine (SVM), random forest (RF), and back propagation neural network (BPNN) algorithms were used to conduct the training of the validation model based on the training set and complete the operation of the test set results. In step 3, four statistical metrics, including mean square error (MSE), absolute variance (R2), root mean square error (RMSE) and mean absolute percentage error (MAPE), was introduced for the verification of the accuracy of the test set results. In step 4, the accuracy verification parameters of the results obtained from different vulnerability assessment methods were compared, and the most reasonable method with the highest accuracy was selected and used for carrying out geological hazards vulnerability mapping of mountain road network.

5.1. Models and validation methods

5.1.1. Support vector machine model

Support Vector Machine (SVM) was proposed by Vapnik in 1963 and are novel and promising classification techniques for overcoming the limitations of machine learning methods. This model demonstrated efficient overfitting, under-learning, and local optimality while dealing with insufficient nonlinear and high-dimensional data. It is especially advantageous for less number of samples. Currently, SVM models have become a new research focus for machine learning algorithms following the neural network model and have been successfully applied in several fields, such as image classification, text, handwriting recognition (Campbell et al. Citation2006; Li et al. Citation2012; Fanos and Pradhan Citation2019), etc.

5.1.2. Random forest model

The Random forest (RF) was developed by improving the classification tree model proposed by Breiman in 1984. When compared with the traditional classification tree model, the RF model has the following three advantages. Firstly, it uses random resampling of the original data to effectively restrict over-fitted simulation results. Secondly, the RF model has a high tolerance for data types yielding good classification results even if the input data has anomalies or missing cases. Thirdly, the model can not only solve the classification and regression problems but also has high accuracy in dealing with regression problems.

5.1.3. Back propagation neural network

Back propagation neural network (BPNN) is one of the most widely used artificial neural networks and is a multilayer feedforward neural network trained according to the error backpropagation algorithm. BPNN continuously adjusts the network weights and thresholds by the gradient descent method for error minimization. Nowadays, BPNNs have been widely used because of their advantages, such as simple modeling, efficient extraction of useful information from samples, and high accuracy of output. In addition, BPNN can be trained for automatically acquiring knowledge without assuming functional forms related to input and output variables and effectively model nonlinear problems. Therefore, they have been successfully applied to many fields, such as prediction of mechanics (Wang et al. Citation2021), road traffic flow prediction (Cui Citation2010), weather prediction (Kakar et al. Citation2018), and geological hazard assessment and prediction (Xiong et al. Citation2019; Rajabi et al. Citation2022). In this study, the 12–20–1 structure was used as the BP neural network model, i.e. the input layer contained the above 12 assessment metrics, the hidden layer contained 20 neurons, and the output layer contained 1 neuron for the vulnerability score, as shown in .

Figure 9. Structure of back propagation neural network model.

Figure 9. Structure of back propagation neural network model.

5.1.4. Precision assessment indices

Mean Square Error (MSE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE), Absolute Variance (R2) were the four precision assessment indices introduced to monitor the accuracy of the prediction results of the machine learning model. These indices can be calculated by EquationEquations (13) to Equation(16). (13) MSE=1Ni=1N(OiTi)2(13) (14) RMSE=i=1N(OiTi)2N(14) (15) MAPE=1Ni=1N100×|OiTiOi|(15) (16) R2=1(i=1N(OiTi)2i=1NTi2)(16) where O represents the actual vulnerability result calculated by the two assessment methods in the previous section, T represents the vulnerability value of each road unit calculated by the machine learning model, and N is the number of input samples in the training or testing phase. The model accuracy was higher at lower values of MSE, RMSE, and MAPE, and R2 closer to 1.

5.2. Results and discussion

5.2.1. Comparative analysis

Taking the preliminary evaluation data as the training set, conduct predictive training based on the machine learning model, and complete the calculation of the test set results, so as to obtain the results of the vulnerability assessment of the road network in the study area. f show the vulnerability zones obtained from six different machine learning models.

Figure 10. Results of vulnerability obtained based on machine learning models: (a) FAHP-SVM, (b) FAHP-RF, (c) FAHP-BPNN, (d) AHP-TOPSIS-SVM, (e) AHP-TOPSIS-RF, (f) AHP-TOPSIS-BPNN.

Figure 10. Results of vulnerability obtained based on machine learning models: (a) FAHP-SVM, (b) FAHP-RF, (c) FAHP-BPNN, (d) AHP-TOPSIS-SVM, (e) AHP-TOPSIS-RF, (f) AHP-TOPSIS-BPNN.
Figure 10. Results of vulnerability obtained based on machine learning models: (a) FAHP-SVM, (b) FAHP-RF, (c) FAHP-BPNN, (d) AHP-TOPSIS-SVM, (e) AHP-TOPSIS-RF, (f) AHP-TOPSIS-BPNN.
Figure 10. Results of vulnerability obtained based on machine learning models: (a) FAHP-SVM, (b) FAHP-RF, (c) FAHP-BPNN, (d) AHP-TOPSIS-SVM, (e) AHP-TOPSIS-RF, (f) AHP-TOPSIS-BPNN.

In order to intuitively compare the differences between the results of vulnerability zoning based on different preliminary evaluation data and machine learning algorithms, the results of the six road network vulnerability prediction models are presented in the form of box plots in .

Figure 11. Box plot of vulnerability results.

Figure 11. Box plot of vulnerability results.

The following points can be draw from the analysis of : (I) the prediction results based on AHP-TOPSIS preliminary evaluation data training all have outliers, while those based on FAHP preliminary evaluation data training have no outliers. (II) the boxes of the evaluation results obtained from training based on the preliminary evaluation data of AHP-TOPSIS are flatter, which indicates the presence of exceptionally large or small outliers. (III) the means of the evaluation results obtained from the FAHP preliminary evaluation data training are smaller than the median, indicating that all three machine learning models obey the left-skewed distribution and the obtained prediction results are relatively conservative. (IV) comparing the evaluation results obtained from the FAHP-based preliminary evaluation data training, it can be found that the evaluation results using the SVM prediction model have a smaller value domain compared to the other two machine learning models. This indicates to a certain extent that the model constructed using the SVM algorithm is less capable of digging into the differences existing between complex data.

The corresponding vulnerability prediction results were obtained by training and testing the machine learning models established in the previous section, and the statistical parameter index of each model in the five-fold cross-validation process was calculated. Based on these indices, the rationality and accuracy of the three types of machine learning models were analyzed in detail.

The statistical parameters of the accuracy of the vulnerability assessment results of the three types of machine learning models are presented in . First, the MSE values of the models constructed using the RF and BPNN algorithms were found to be less than 1 × 103, and the values of R2 ranged from 0.94 to 1 in either stage, indicating that the models using both machine learning algorithms exhibited high prediction accuracy. In contrast, the model constructed using the SVM algorithm demonstrated poor accuracy, which may be because the SVM model was less suitable for situations with a large number of factors and samples. Second, a comparison of the statistical parameters of the three types of machine learning models in the training and testing phases indicated that all the values of the statistical parameters in the testing phase decreased when compared to those in the training phase. Third, the prediction accuracy of the RF model was higher than that of the BPNN model in both FAHP or AHP-TOPSIS sample sets, which may be due to a large number of parameters and threshold settings in the BPNN model affecting the subsequent model fitting and generalization of the prediction results. Fourth, the combined sample set, as well as the prediction model, indicated that the model based on the FAHP sample set using the RF algorithm had the highest prediction accuracy, while the model based on the AHP-TOPSIS sample set using the SVM algorithm showed the lowest prediction accuracy.

Table 6. Prediction accuracy of three types of machine learning models.

The rationality of the selected samples and methods was further verified by processing the FAHP and AHP-TOPSIS sample sets using five-fold cross-validation, and the corresponding machine-learning models were constructed. The MSE values of these models were calculated for the model training as well as prediction results and are provided in .

Table 7. MSE result values for 5-fold cross-validation (×10−5).

indicates that the average MSE of the vulnerability prediction model based on the FAHP sample set using the RF algorithm was 3.87 × 10−5 in the five-fold cross-validation process, which was higher than that of the other three types of models, indicating differences in the accuracy levels. The standard deviation of the MSE of the model was 0.52 × 10−5, which was much lower than 0.05. This indicated that the model also has good robustness and can provide strong technical support for the vulnerability assessment of other roads in the future.

Although the above multi-level assessment process has greatly reduced the subjective error in the vulnerability assessment of mountain road networks, it is worth mentioning that there is still no way to verify the results of the vulnerability zoning in the current studies(Hossein, et,al 2021). This is because vulnerability is the social attribute of natural disasters, not the natural attribute, so it is inappropriate to use historical disaster data to verify the vulnerability assessment results. But this does not mean that vulnerability research is meaningless and unreasonable. On the contrary, it is very important to carry out spatio-temporal prediction of the vulnerability of hazard-affected bodies to mitigate the impact of natural disasters.

5.2.2. Road vulnerability mapping

Based on the machine learning model established in the previous section to carry out the study area road network geological hazards vulnerability mapping work, the mapping work was completed using the FAHP-RF model with smaller MSE and MAPE values in the five-fold cross-validation process, and the final geo-hazard vulnerability map of the mountain road network obtained is provided in .

Figure 12. Geological hazard vulnerability map of the mountain road network.

Figure 12. Geological hazard vulnerability map of the mountain road network.

As shown in , the results of the geological hazard vulnerability assessment of the mountain road network indicated that the vulnerability of the road network was governed by a combination of factors, and the following are the major characteristics of its distribution. First, the areas with higher road vulnerability are located at the junction of the study area and other counties because these areas are farther away from the central urban area and need to pay more for road management as well as maintenance and repair. Second, the road units located in remote mountainous areas are generally at a higher level of threat from geological hazards than those in the central urban areas. Third, road units in the vicinity of emergency management points are significantly less vulnerable than those away from emergency management points.

Based on the above analysis, the actual situation of the road network in the study area, and expert opinions, some targeted disaster prevention, and mitigation suggestions can be put forward. (1) The road sections with higher transport frequency have high traffic flow, greater population density, and high vulnerability to road disasters. Therefore, the road slopes in these areas should be inspected in a focused manner to exclude dangerous rock bodies and eliminate the hidden dangers of rock fall and collapse. (2) Fujian area is affected by frequent typhoons and rainstorms, which are highly likely to cause rainfall-type landslides. If conditions permit, the highway department should set up several disaster monitoring stations in disaster-prone areas to issue safety notices and evacuate residents before the onset of disasters, as well as carry out the corresponding de-risking strategy. (3) According to the actual situation of the road network in the study area, the highway department, together with the disaster management department, should draw up a targeted disaster prevention and management plan. Local government should invest more human and material resources for carrying out the maintenance and repair of the road network, improving the protection facilities of the roadbed and road surface, improving the disaster resistance and prevention capacity of the highway, and reducing the vulnerability of the highway to disasters.

6. Conclusions

  1. Considering the oversimplification of assessment index for the geological hazards vulnerability of mountain road network, this study takes into account the quantitative indices of road characteristics and rescue capacity and combines them with the vulnerability of the population, economic and material characteristics to objectively and reasonably construct an index-based system of vulnerability assessment for the mountain road.

  2. Two combined models (FAHP and AHP-TOPSIS) were applied to perform a preliminary assessment of the geological hazards vulnerability of mountain roads in the study area, and to a certain extent, the negative influence of the subjective factors on the assessment results was reduced. Moreover, the applicability of the two combined models was compared and analyzed based on the vulnerability results.

  3. The preliminary assessment results of the geological hazards vulnerability of mountain road network derived from the FAHP and AHP-TOPSIS models were used as the sample sets, and machine learning algorithms (SVM, RF, and BPNN) were used to build a geological hazards vulnerability prediction model for mountain road network. The results indicated that the model based on the FAHP sample set and using the RF model demonstrated the highest prediction accuracy, and its robustness was significantly better than that of other models.

Acknowledgements

The authors would like to acknowledge the financial support from the National Natural Science Foundation of China (No. U2005205, No. 42007235), Natural Science Foundation of Shandong Province (ZR2021QE259) and the Science and Technology Innovation Platform Project of Fuzhou Science and Technology Bureau (No. 2021-P-032).

Disclosure statement

All authors disclosed no relevant relationships, and authors have no conflict of interest to declare.

Data availability

Data will be made available on request.

References

  • Ahmad ME, Sotirios AA, Moustafa MK, Lee VL, Fadzli MN. 2022. An integrated framework for the quantification of road network seismic vulnerability and accessibility to critical services. Sustainability. 14(12474):12474.
  • Alsabhan AH, Singh K, Sharma A, Alam S, Pandey DD, Rahman SAS, Khursheed A, Munshi FM. 2022. Landslide susceptibility assessment in the Himalayan range based along Kasauli–Parwanoo road corridor using weight of evidence, information value, and frequency ratio. J King Saud Univ-Sci. 34(2):101759.
  • Aminbakhsh S, Gunduz M, Sonmez R. 2013. Safety risk assessment using analytic hierarchy process (AHP) during planning and budgeting of construction projects. J Safety Res. 46:99–105.
  • Ayalew L, Yamagishi H. 2005. The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan. Geomorphology. 65(1–2):15–31.
  • Bai S, Wang J, Lü G, Zhou P, Hou S, Xu S. 2010. GIS-based logistic regression for landslide susceptibility mapping of the Zhongxian segment in the Three Gorges area, China. Geomorphology. 115(1–2):23–31.
  • Campbell WM, Campbell JP, Reynolds DA, Singer E, Torres-Carrasquillo PA. 2006. 2006. Support vector machines for speaker and language recognition. Comput Speech Language. 20(2–3):210–229.
  • Cengiz LD, Ercanoglu M. 2022. A novel data-driven approach to pairwise comparisons in AHP using fuzzy relations and matrices for landslide susceptibility assessments. Environ Earth Sci. 81(7):1–23.
  • Chen W, Pourghasemi HR, Panahi M, Kornejady A, Wang J, Xie X, Cao S. 2017. Spatial prediction of landslide susceptibility using an adaptive neuro-fuzzy inference system combined with frequency ratio, generalized additive model, and support vector machine techniques. Geomorphology. 297:69–85.
  • Chen W, Xie X, Peng J, Shahabi H, Hong H, Bui DT, Duan Z, Li S, Zhu AX. 2018. GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method. Catena. 164:135–149. https://doi.org/10.1016/j.catena.2018.01.012.
  • Cui F. 2010. Study of traffic flow prediction based on BP neural network. In 2010 2nd International Workshop on Intelligent Systems and Applications, Wuhan, China. IEEE.
  • Das S, Sarkar S, Kanungo DP. 2022. GIS-based landslide susceptibility zonation mapping using the analytic hierarchy process (AHP) method in parts of Kalimpong Region of Darjeeling Himalaya. Environ Monit Assess. 194(4):1–28.
  • Dong VD, Jaafari A, Bayat M, Gholami DM, Pham BT. 2020. A spatially explicit deep learning neural network model for the prediction of landslide susceptibility. Catena. 188:104451.
  • Eidsvig UMK, McLean A, Vangelsten BV, Kalsnes B, Ciurean RL, Argyroudis S, Winter MG, Mavrouli OC, Fotopoulou S, Pitilakis K, et al. 2014. Assessment of socioeconomic vulnerability to landslides using an indicator-based approach: methodology and case studies. Bull Eng Geol Environ. 73(2):307–324.
  • Fanos AM, Pradhan B. 2019. A Spatial Ensemble Model for Rockfall Source Identification from High Resolution LiDAR Data and GIS. IEEE Access. 7:74570–74585.
  • Feizizadeh B, Roodposhti MS, Jankowski P, Blaschke T. 2014. A GIS-based extended fuzzy multi-criteria evaluation for landslide susceptibility mapping. Comput Geosci. 73:208–221.
  • Guo Z, Chen W, Zhang J, Ye F, Liang X, He F, Guo Q. 2017. Hazard assessment of potentially dangerous bodies within a cliff based on the Fuzzy-AHP method: a case study of the Mogao Grottoes, China. Bull Eng Geol Environ. 76(3):1009–1020.
  • Hearn GJ, Pongpanya P. 2021. Developing a landslide vulnerability assessment for the national road network in Laos. QJEGH. 54(3).
  • Hong H, Pourghasemi HR, Pourtaghi ZS. 2016. Landslide susceptibility assessment in Lianhua County (China): a comparison between a random forest data mining technique and bivariate and multivariate statistical models. Geomorphology. 259:105–118.
  • Hosseini FS, Sigaroodi SK, Salajegheh A, Moghaddamnia A, Choubin B. 2021. Towards a flood vulnerability assessment of watershed using integration of decision-making trial and evaluation laboratory, analytical network process, and fuzzy theories. Environ Sci Pollut Res. 28(44):62487–62498.
  • Hoque MA, Tasfia S, Ahmed N, Pradhan B. 2019. Assessing spatial flood vulnerability at Kalapara Upazila in Bangladesh using an analytic hierarchy process. Sensors-Basel. 19(6):1302.
  • Jordan MI, Mitchell TM. 2015. Machine learning: trends, perspectives, and prospects. Science. 349(6245):255–260.
  • Kadi F, Yildirim F, Saralioglu E. 2021. Risk analysis of forest roads using landslide susceptibility maps and generation of the optimum forest road route: a case study in Macka, Turkey. Geocarto Int. 36(14):1612–1629.
  • Kahraman C, Cebeci U, Ruan D. 2004. Multi-attribute comparison of catering service companies using fuzzy AHP: the case of Turkey. Int J Prod Econ. 87(2):171–184.
  • Kahraman C, Cebeci U, Ulukan Z. 2003. Multi-criteria supplier selection using fuzzy AHP. Logistics information Management. 16(6):382–394.
  • Kakar SA, Sheikh N, Naseem A, Iqbal S, Rehman A, Ullah Kakar A, Kakar BA, Kakar HA, Khan B. 2018. Artificial neural network based weather prediction using Back Propagation Technique. IJACSA. 9(8):462–470.
  • Kim J, Lee S, Jung H, Lee S. 2018. Landslide susceptibility mapping using random forest and boosted tree models in Pyeong-Chang, Korea. Geocarto Int. 33(9):1000–1015.
  • Li CH, Kuo BC, Lin CT, Huang CS. 2012. A Spatial-Contextual Support Vector Machine for Remotely Sensed Image Classification. IEEE Trans Geosci Remote Sens. 50(3):784–799.
  • Lixin Y, Ke C, Xiaoying C, Yueling S, Xiaoqing C, Ye H. 2017. Analysis of social vulnerability of residential community to hazards in Tianjin. Nat Hazards. 87(2):1223–1243.
  • Lv L, Chen T, Dou J, Plaza A. 2022. A hybrid ensemble-based deep-learning framework for landslide susceptibility mapping. Int J Appl Earth Obs. 108:102713.
  • Maletta R, Mendicino G. 2022. A methodological approach to assess the territorial vulnerability in terms of people and road characteristics. Georisk. 16:301–314.
  • Mandal K, Saha S, Mandal S. 2021. Applying deep learning and benchmark machine learning algorithms for landslide susceptibility modelling in Rorachu river basin of Sikkim Himalaya, India. Geosci Front. 12(5):101203.
  • Marjanovi M, K, Evi M, B, Jat B, Enílek VV. 2011. Landslide susceptibility assessment using SVM machine learning algorithm. Eng Geol. 123:225–234.
  • MartínezCarvajal HE, de Moraes Guimarães Silva MT, García Aristizábal EF, Aristizábal-Giraldo EV, Larios Benavides MA. 2018. A mathematical approach for assessing landslide vulnerability. Earth Sci Res J. 22:251–273.
  • Petrucci O, Gullà G. 2010. A simplified method for assessing landslide damage indices. Nat Hazards. 52(3):539–560.
  • Pradhan B, Lee S. 2010a. Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Environ Modell Softw. 25(6):747–759.
  • Pradhan B, Lee S. 2010b. Regional landslide susceptibility analysis using back-propagation neural network model at Cameron Highland, Malaysia. Landslides. 20107(1):13–30.
  • Pradhan B, Lee S, Buchroithner MF. 2010. A GIS-based back-propagation neural network model and its cross-application and validation for landslide susceptibility analyses. Comput Environ Urban. 34(3):216–235.
  • Qiu X, Zhang L, Suganthan PN, Amaratunga GA. 2017. Oblique random forest ensemble via least square estimation for time series forecasting. Inform Sci. 420:249–262.
  • Rajabi AM, Khodaparast M, Mohammadi M. 2022. Earthquake-induced landslide prediction using back-propagation type artificial neural network: case study in northern Iran. Nat Hazards. 110(1):679–694.
  • Rumelhart DE, Hinton GE, Williams RJ. 1986. Learning representations by back-propagating errors. NATURE. 323(6088):533–536.
  • Saleh Y, Abolfazl J, Aleksandar V, Christopher G, Saskia K. 2022. Vulnerability assessment of road networks to landslide hazards in a dry-mountainous region. Environ Earth Sci. 81(22):1–17.
  • Saravanan S, Istijono B, Jennifer JJ, Abijith D, Sivaranjani S. 2021. Landslide susceptibility assessment using frequency ratio technique–A case study of NH67 road corridor in the Nilgiris district. Tamilnadu, India, Paper presented at IOP Conference Series: earth and Environmental Science, IOP Publishing, p. 012017.
  • Sitti AH, Hamizah AA, Nor EA, Mariyana AA, Nur SAS. 2022. Vulnerability of road transportation networks under natural hazards: a bibliometric analysis and review. Int J Disaster Risk Reduct. 83:103393.
  • Tehrany MS, Pradhan B, Jebur MN. 2014. Flood susceptibility mapping using a novel ensemble weights-of-evidence and support vector machine models in GIS. J Hydrol. 512:332–343.
  • Umar Z, Pradhan B, Ahmad A, Jebur MN, Tehrany MS. 2014. Earthquake induced landslide susceptibility mapping using an integrated ensemble frequency ratio and logistic regression models in West Sumatera Province, Indonesia. Catena. 118:124–135.
  • Wang Y, Sha A, Li X, Hao W. 2021. Prediction of the mechanical properties of titanium alloy castings based on a back-propagation neural network. J Mater Eng Perform. 30(11):8040–8047.
  • Xiong J, Sun M, Zhang H, Cheng W, Yang Y, Sun M, Cao Y, Wang J. 2019. Application of the Levenburg-Marquardt back propagation neural network approach for landslide risk assessments. Nat Hazards Earth Syst Sci. 19(3):629–653.
  • Xu K, Guo Q, Li Z, Xiao J, Qin Y, Chen D, Kong C. 2015. Landslide susceptibility evaluation based on BPNN and GIS: a case of Guojiaba in the Three Gorges Reservoir Area. Int J Geogr Inf Sci. 29(7):1111–1124.
  • Zayed T, Amer M, Pan J. 2008. Assessing risk and uncertainty inherent in Chinese highway projects using AHP. Int J Proj Manag. 26(4):408–419.
  • Zhang Q, Yu H, Li Z, Zhang G, Ma DT. 2020. Assessing potential likelihood and impacts of landslides on transportation network vulnerability. Transport Res Part D 82:102304.
  • Zhang S, Zhang LM. 2014. Human vulnerability to quick shallow landslides along road: fleeing process and modeling. Landslides. 11(6):1115–1129.
  • Zhou C, Yin K, Cao Y, Ahmed B, Li Y, Catani F, Pourghasemi HR. 2018. Landslide susceptibility modeling applying machine learning methods: A case study from Longju in the Three Gorges Reservoir area, China. Comput Geosci 112:23–37.