Application of a hybrid model of neural networks and genetic algorithms to evaluate landslide susceptibility
© The Author(s). 2017
Received: 19 September 2016
Accepted: 23 March 2017
Published: 14 April 2017
In the last few decades, the development of Geographical Information Systems (GIS) technology has provided a method for the evaluation of landslide susceptibility and hazard. Slope units were found to be appropriate for the fundamental morphological elements in landslide susceptibility evaluation.
Following the DEM construction in a loess area susceptible to landslides, the direct-reverse DEM technology was employed to generate 216 slope units in the studied area. Of the 216 slope units, 123 involved landslides. To analyze the mechanism of these landslides, six environmental factors were selected to evaluate landslide occurrence: slope angle, aspect, the height and shape of the slope, distance to river and human activities. The spatial analysis demonstrates that most of the landslides are located on convex slopes at an elevation of 100-150 m with slope angles from 135°-225° and 40°-60°. Landslide occurrence was then checked according to these environmental factors using an artificial neural network with back propagation, optimized by genetic algorithms. A dataset of 120 slope units was chosen for training the neural network model, and the parameters of genetic algorithms and neural networks were set.
After training on the datasets, the susceptibility of landslides was mapped for the land-use plan and hazard mitigation. Comparing the susceptibility map with landslide inventory, the verification shows satisfactory agreement with an accuracy of 86.46% between the susceptibility map and the landslide locations. In the landslide susceptibility assessment, ten new slopes were predicted to show potential for failure, which can be confirmed by the engineering geological conditions of these slopes.
KeywordsLandslide Geographical Information Systems Genetic algorithms Back propagation neural networks Susceptibility evaluation
Landslides are local phenomena occurring in different geomorphic contexts; they can be triggered by a variety of mechanisms, such as earthquakes or rainfall, and some of the causes are not yet well known. Landslides cannot be predicted accurately; however, the susceptibility of a given area to landslides can be determined and depicted using hazard zonation. Various methods have been proposed to partition landscape for the purpose of landslide hazard assessment and zonation mapping, including grid cells, terrain units, unique-condition units, slope units and topographic units (Carrara et al. 1991; van Westen, 1994; Guzzetti et al. 1999; Chung and Fabbri, 2003). With respect to the landscape partitioning methods mentioned above, slope units can be resized according to the prevailing failure type and dimension, thereby partitioning a river basin into nested subdivisions: coarser for larger landslides and finer for smaller failures. Because a clear physical relationship exists between landslides and the fundamental morphological elements of a hilly or mountain region, namely, drainage and divided lines, the slope-unit technique seems appropriate for landslide susceptibility assessment.
A careful review of the concepts, principles, techniques and methodologies for landslide susceptibility evaluation reveals that the most commonly used methods are geomorphological hazard mapping, analysis of landslide inventories, heuristic or index-based methods, functional, statistically based models and geotechnical or physically based models (Guzzetti et al. 1999; Wang et al. 2005; Fell et al., 2008). Recently, probabilistic models such as frequency ratio and logistic regression methods have been applied to evaluate landslide susceptibility and have been integrated with Geographical Information Systems (GIS) (Ayalew and Yamagishi, 2005; Chung, 2006; Dahal et al., 2008; Nefeslioglu, et al., 2008; Yilmaz, 2009; Bai et al. 2011; García-Rodríguez and Malpica, 2010; Hasekiogullar and Ercanoglu 2012; Martinovic et al., 2016). Due to geological complexity of slopes and self-organized system, however, many variables are involved in slope stability evaluation, and these variables display a highly nonlinear relationship with the evaluation results. Under the consideration of the nonlinear characteristics of the sliding process, artificial neural networks (ANNs) have thus been introduced to produce landslide susceptibility and hazard maps (Ercanoglu and Gokceoglu, 2002; Neaupane and Achet, 2004; Catani et al., 2005; Gómez and Kavzoglu, 2005; Kanungoa et al. 2006; Nefeslioglu, et al., 2008; Nefeslioglua et al., 2011).
The main characteristics of ANNs dealing with quantitative and qualitative indices include large-scale parallel distributed processing, continuously nonlinear dynamics, collective computation, high fault-tolerance, self-organization, self-learning and real-time treatment (Rumelhart and McClelland 1986). It is worth noting that a neural network system is a processing device, implemented as an algorithm or in hardware, whose design is inspired by the design and the function of mammalian brains; they react to training data input in such a way to alter their initial state, and they learn using unconventional algorithms. Neural networks integrated with GIS may be an effective approach when dealing with landslide hazard assessments where meaningful outcomes are difficult to achieve by means of standard mathematical models. Because artificial neural network models are adaptive and capable of generalization, they can handle imperfect or incomplete data and can capture nonlinear and complex interactions among the several variables of a system (Ermini et al. 2005; Melchiorre et al. 2008; Lee and Pradhan 2010; Conforti et al., 2014; Pham et al., 2017).
However, it was found that the slow training speed and difficulty in achieving a local minimum cannot be resolved in practical applications for the most commonly used back propagation neural networks (BPNN). To solve this problem, several methods were proposed to improve the training speed of networks, such as improving error functions and adjusting the studying rates. For this kind of solutions, the BPNN is likely to converge to a local solution, which may not be the global solution, with the random selection of initial weights. Thus, a global search algorithm was then introduced, e.g., evolutionary programming, simulated annealing or genetic algorithms (GAs) (Kavzoglu et al., 2015). Among them, GA has mainly been used to search for the optimal solution in BPNNs due to its excellent global search ability (Holland 1975; Sexton and Gupta, 2000; Kesign, 2004; Madaeni, et al., 2010; Nourani, et al., 2014). There are two main aspects of applying GA to BPNNs for finding global optima in complex problems: one is to optimize the weights of the network, and the other is to optimize the topological structure of the network.
After an overview on landslide susceptibility using ANNs, it was noted that the weights were randomly selected and that the optimization cannot be carried out for global searching. This paper thus proposes a hybrid model of a GA and BPNN to evaluate landslide susceptibility for the optimization of weights. In this study, landslide inventory was mapped after detailed field investigation and interpretation from high-resolution imagery. The outline of the watershed polygons was mapped as the ridge line using DEM, and reverse DEM data were used to detect the valley line. The combined DEM and reverse DEM, slope units were then mapped within ArcGIS. After the definition of slope units, environmental factors were analyzed for the presence of landslide occurrence, and a hybrid of GA and BPNN was developed to evaluate landslide susceptibility. In this method, the GA was used to search for the optimal or approximately optimal connection weights and thresholds for the networks, and then, using back-propagation learning rules and training algorithms, the final weights could be adjusted.
Historical records show that three earthquakes with Ms ≥ 8 occurred on Loess Plateau, which triggered numerous landslides in loess areas (Zhang et al. 1995; Derbyshire et al. 1999). However, only three earthquakes with around Ms 3.5 have been recorded in the study area.
The climate of the study area is notable for Asiatic monsoons and marked seasonal shifts in dominant winds. The mean annual temperature is 12.9 °C, and the maximum is up to 41.6 °C. The mean annual precipitation reaches 679.1 mm, and almost 50% of the precipitation falls in the period from July to September.
It was also observed that the landslides are concentrated at the confluence of two streams. As described in the previous section, these landslides are attributed to the undercutting of the slopes associated with gullying. Major factors affecting the initiation of slope failure and subsequent modes of movement appear to be dependent upon the morphology, the nature and degree of weathering of the underlying bedrock, and the moisture status of the loess deposits. Numerous ancillary factors include bedrock-loess interface, slope steepness, vegetation cover, and land utilization. As already noted, the upper surface of the Neogene bedrock is often weathered and contains variable amounts of smectites. In this zone, wetting-drying results in a progressive decrease in strength that may ultimately lead to slope failure. Undercutting of the slopes along the gullies is frequent, which increases the risk of slope failure. As a result of the high relative relief, the steep slopes and the relatively uniform geological and geomorphological conditions, the landslides tend to be very large and to occur in groups.
Defining slope units
Selection of environmental factors
Slope angleUsing the function of Mean in the ArcGIS Spatial Analyst model, an average slope angle was defined for each slope unit. In the study area, slope angles were categorized into six classes: 0–20°, 21°–30°, 31°–40°,41°–50°, 51°–60° and larger than 60° (Fig. 5).
Slope heightSlope height plays an important role in landslide occurrence, especially in loess areas. Considering only the slope height for slope failures, higher slopes can increase stress values in different areas of slope, making the slopes unstable. The slope height was classified into five classes: 0–50 m, 51–100 m, 101–150 m, 151–200 m and 201–300 m (Fig. 6). Using Min and Max functions in the model of ArcGIS Spatial Analyst, the slope height was obtained from the difference of the minimum and maximum elevation.
Slope aspectThe slope aspect information can be obtained from the Majority function within the ArcGIS. Aspect was categorized as 0–15°,16°–90°, 91°–135°, 136°–180°, 181°–225°, 226°–270°, 271°–315 ° and 316°–360° (Fig. 7).
Slope morphologyIt is known that water concentrates in concave topographic positions and makes the slope susceptible to (sub) surface flow as the main hydrological triggering mechanism. According to the analysis of the relationships between the landslide occurrence and slope morphology, however, most of landslides are located in the convex topographic positions (Fig. 8).
Human activities influence the susceptibility due to the land use. In the Changshou valley, the land-use is very homogenous. Forests are present closed to the boundary of the loess plateau, upper the slope of the valley. Areas covered by shrubby and herbaceous vegetation are scattered into the study area, while parts of space are open with more vegetation for local farmers. In the analysis of human activity, the influence was simply determined by whether the slope area had been used for vegetation or not.
Distance to river
The distance to the river is presented by the proximity of the Changshougou valley and various drainages. These data were obtained from the buffer of rivers and drainages using maps at the scale of 1:25000 in ArcGIS.
Normalization of data
where X represents the original data, X max and X min are the maximum and minimum of original data, respectively. T is the target data after normalization.
where mse represents mean square errors for BPNN, and k is training time.
Using BPNN modeling optimized by genetic algorithms, the parameters of GAs and neural networks are set in the present study. The population size is 100; crossover probability is 0.65; mutation probability is 0.01; momentum factor is 0.60; learning rate is 0.7; max learning number is 10000; and target error is 0.000001. From a database of 216 landslides in units of slope, 120 landslides were randomly used for training the neural network models, and 96 landslides were used for the validation of landslide susceptibility.
Results and discussion
Following the process of mapping the slope units, the environmental factors can be categorized into slope, aspect, slope height, and the shape of the slope, human activity and distance to river. These factors were statistically analyzed by slope units within the scope of GIS, after the construction of the geo-database. As mentioned above, the geological condition is almost the same in the study area, in which the strata in these units are mainly Neogene argillites, and the fluvial deposits consist of clayey silts and gravels and Quaternary loess. Due to human activity to effect the landslide susceptibility, it was simply determined by whether the slope area had been used for vegetation or not. The land use for vegetation needs more space and artificial excavation, even irrigation, then affecting the stability of landslides. With respect to the distance to river, the buffer of the Changshougou valley and drainages was selected for the analysis of susceptibility. A small number of landslides, which are closed to the river and drainage, can be in partial deformation, especially at the toe of slope due to erosion. The environmental factors are analyzed by focusing on the slope parameters, such as slope angle, aspect, height and the shape of the slope.
Ten slopes (in yellow in Fig. 19) were predicted to be prone to landslides. In view of the environmental factors, all of these slopes were toward the southeast and southeast, were higher than 100 m and had slope angles greater than 43°. Furthermore, seven of the slopes were in convex topographic conditions, whereas the other slopes were located in concave topographic conditions.
As a useful tool, which addresses a nonlinear system and is capable of response to inputs and adaptation to the environment, the most widely used BPNNs are capable of evaluating landslide susceptibility at both the regional and site-specific scales (Lee et al. 2003; Neaupane and Achet 2004). BPNNs can be applied better over a wide area using non-parametric variables with large extensions. However, BPNNs are prone to falling into local extremes, and their convergence is slow. To overcome these drawbacks, a GA-based BPNN was proposed to optimize the neural network weights for landslide susceptibility assessment, and the topology was subjectively kept in three layers. In the optimization of the GA-based BPNN, the number of neurons can be decided in the hidden layer by the Kolmogorov theorem. For the training of the BPNN, the weights and thresholds were represented as genes of a chromosome, and the global optimum was then searched for using the selection, crossover and mutation operators of the genetic algorithm. To sum up, the advantages of using genetic algorithms were based on the performance of neural networks on the testing datasets, instead of only on the minimal square error in the modeling datasets. In addition, the minimal improvement of the genetic algorithms in this study occurred in the ratios between the numbers of chromosomes of a generation.
In this study, through the use of direct-reverse DEM technology, the Changshougou valley was divided into 216 slope units, of which 123 units included landslides. According to the mechanism analyses of the landslides in the loess area, six environmental factors were selected to evaluate landslide occurrence, such as slope height, slope angle, aspect, shape of slope, distance to rivers, and human activities. The spatial analysis shows that most of landslides in the Changshougou valley are located at an elevation of 100–150 m, have a slope angle of 135°–225° and 40°–60° in the slope, and have convex slope conditions. After the spatial analysis of the environmental factors, a case study was presented for landslide susceptibility prediction using BPNN modeling optimized by genetic algorithms. From a database of 216 slopes, 120 units, including 80 with landslide presence and 40 without, were used for training the neural network models, and 96 slopes, i.e., 43 with landslide presence and 53 without landslide presence, were used for the validation of landslide susceptibility. Comparing landslide presence with a susceptibility map, it was noted that the prediction accuracy for landslide occurrence is 93.02%, whereas units without landslide occurrence could be predicted with an accuracy of 81.13%. It was also noted that 10 slopes were predicted to be prone to landslides. In view of the environmental factors, all of these slopes are toward southeast and southeast, are at an elevation greater than 100 m, and have slope angles greater than 43°. It was also noted that seven of the slopes are in convex topographic conditions, whereas the other slopes are located in concave topographic conditions. Furthermore, the prediction of 10 slopes can be used as a general planning tool but is not intended for individual site-specific evaluations.
This research was supported by funding from the National Natural Science Foundation of China (No.41372296 and 41672267). A special note of appreciation is extended to the Ministry of National Science and Technology for their funding support (2012BAK10B00). Two anonymous reviewers were greatly appreciated for their comments and suggestions.
HB carried out the improvement of GA-BPNN for landslide susceptibility, and drafted the manuscript as the corresponding author. JM carried out the data analysis for landslide susceptibility, and drafted parts of the manuscript. BZ carried out the GIS dataset preparation, and helped to draft the manuscript. YZ participated in the preparation of all figures. ZQ participated in the design of the methodology and performed the statistical analysis for landslide susceptibility. YP conceived of the study, and participated in its design and coordination and helped to draft the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
- Ayalew, L., and H. Yamagishi. 2005. The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, central Japan. Geomorphology 65: 15–31.View ArticleGoogle Scholar
- Bai, S., G. Lu, J. Wang, P. Zhou, and L. Ding. 2011. GIS-based rare events logistic regression for landslide-susceptibility mapping of Lianyungang, China. Environmental Earth Sciences 62: 139–149.View ArticleGoogle Scholar
- Carrara, A., M. Cardinali, R. Detti, F. Guzzetti, V. Pasqui, and P. Reichenbach. 1991. GIS techniques and statistical models in evaluating landslide hazard. Earth Surface Processes Landforms 16: 427–445.View ArticleGoogle Scholar
- Catani, F., N. Casagli, L. Ermini, G. Righini, and G. Menduni. 2005. Landslide hazard and risk mapping at catchment scale in the Arno River basin. Landslides 2: 329–342.View ArticleGoogle Scholar
- Chung, C.F., and A.G. Fabbri. 2003. Validation of spatial prediction models for landslide hazard mapping. Natural Hazards 30: 451–472.View ArticleGoogle Scholar
- Chung, C.J. 2006. Using likelihood ratio functions for modeling the conditional probability of occurrence of future landslides for risk assessment. Computer and Geosciences 32: 1052–1068.View ArticleGoogle Scholar
- Conforti, M., S. Pascale, G. Robustelli, and F. Sdao. 2014. Evaluation of prediction capability of the artificial neural networks for mapping landslide susceptibility in the Turbolo River catchment (northern Calabria, Italy). Catena 113: 236–250.View ArticleGoogle Scholar
- Dahal, R.K., S. Hasegawa, A. Nonomura, M. Yamanaka, S. Dhakal, and P. Paudyal. 2008. Predictive modelling of rainfall-induced landslide hazard in the Lesser Himalaya of Nepal based on weights-of-evidence. Geomorphology 102: 496–510.View ArticleGoogle Scholar
- Derbyshire, E., J.T. Wang, and X.M. Meng. 1999. A treacherous terrain: background to natural hazards in northern China, with special reference to the history of landslides in Gansu Province. In Landslides in the thick loess terrain of north-west China, ed. E. Derbyshire, X.M. Meng, and T.A. Dijkstra, 11–18.Google Scholar
- Ercanoglu, M., and C. Gokceoglu. 2002. Assessment of landslide susceptibility for a landslide-prone area (north of Yenice, NW Turkey) by fuzzy approach. Environ Geol 41: 720–730.View ArticleGoogle Scholar
- Ermini, L., F. Catani, and N. Casagli. 2005. Artificial Neural Networks applied to landslide susceptibility assessment. Geomorphology 66: 327–343.View ArticleGoogle Scholar
- Fell, R., J. Corominas, C. Bonnard, L. Cascini, E. Leroi, and W.Z. Savage. 2008. Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Engineering Geology 102: 85–98.View ArticleGoogle Scholar
- García-Rodríguez, M.J., and J.A. Malpica. 2010. Assessment of earthquake-triggered landslide susceptibility in El Salvador based on an Artificial Neural Network model. Nat. Hazards Earth Syst. Sci. 10: 1307–1315.View ArticleGoogle Scholar
- Gómez, H., and T. Kavzoglu. 2005. Assessment of shallow landslide susceptibility using artificial neural networks in Jabonosa River Basin, Venezuela. Engineering Geology 78: 11–27.View ArticleGoogle Scholar
- Guzzetti, F., A. Carrara, M. Cardinali, and P. Reichenbach. 1999. Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy. Geomorphology 31: 181–216.View ArticleGoogle Scholar
- Hasekiogullar, G.D., and M. Ercanoglu. 2012. A new approach to use AHP in landslide susceptibility mapping: a case study at Yenice (Karabuk, NW Turkey). Natural Hazards 63: 1157–1179.View ArticleGoogle Scholar
- Holland, J.H. Adaptation in natural and artificial systems. Ann Arbour: The University of Michigan Press, 1975.Google Scholar
- Kanungoa, D.P., M.K. Arorab, S. Sarkara, and R.P.A. Guptac. 2006. Comparative study of conventional, ANN black box, fuzzy and combined neural and fuzzy weighting procedures for landslide susceptibility zonation in Darjeeling Himalayas. Engineering Geology 85: 347–366.View ArticleGoogle Scholar
- Kavzoglu, T., E.K. Sahin, and I. Colkesen. 2015. Selecting optimal conditioning factors in shallow translational landslide susceptibility mapping using genetic algorithm. 192: 101–112.Google Scholar
- Kesign, U. 2004. Genetic algorithm and artificial neural network for engine optimisation of efficiency and NOx emission. Fuel 83: 885–895.View ArticleGoogle Scholar
- Lee, S., J.H. Ryu, K. Min, and J.S. Won. 2003. Landslide susceptibility analysis using GIS and artificial neural network. Earth Surface Processes and Landforms 23: 1361–1376.View ArticleGoogle Scholar
- Lee, S., and D. Pradhan. 2010. Regional landslide susceptibility analysis using back-propagation neural network model at Cameron Highland, Malaysia. Landslides 7(1): 13–30.View ArticleGoogle Scholar
- Madaeni, S.S., N.T. Hasankiadeh, A.R. Kurdian, and A. Rahimpour. 2010. Modeling and optimization of membrane fabrication using artificial neural network and genetic algorithm. Separation of Purification Technology 76: 33–43.View ArticleGoogle Scholar
- Maidment, D. 2002. Arc Hydro: GIS for water resources. ESRI 380, New York Street, Redland, California.
- Martinovic, K., K. Gavin, and C. Reale. 2016. Development of a landslide susceptibility assessment for a rail network. Engineering Geology 215: 1–9.View ArticleGoogle Scholar
- Melchiorre, C., M. Matteucci, A. Azzoni, and A. Zanchi. 2008. Artificial neural networks and cluster analysis in landslide susceptibility zonation. Geomorphology 94: 379–400.View ArticleGoogle Scholar
- Meng, X. M., Dijkstra, T. D., Derbyshire, E. 2000. Loess slope instability. In: Derbyshire E, Meng, X.M., Dijkstra, T.A. (eds). Landslides in the thick loess terrain of north-west China, 175–181. Chichester: John Wiley.
- Neaupane, K.M., and S.H. Achet. 2004. Use of back propagation neural network for landslide monitoring: a case study in the higher Himalaya. Engineering Geology 74: 213–226.View ArticleGoogle Scholar
- Nefeslioglu, H.A., C. Gokceoglu, and H. Sonmez. 2008. An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps. Eng. Geol. 97: 171–191.View ArticleGoogle Scholar
- Nefeslioglua, H.A., C. Gokceoglub, H. Sonmez, and T. Gorum. 2011. Medium-scale hazard mapping for shallow landslide initiation: the Buyukkoy catchment area (Cayeli, Rize, Turkey). Landslides 8(4): 459–483.View ArticleGoogle Scholar
- Nourani, V., B. Pradhan, H. Ghaffari, and S.S. Sharifi. 2014. Landslide susceptibility mapping at Zonouz Plain, Iran using genetic programming and comparison with frequency ratio, logistic regression and artificial neural network models. Nat. Hazards 71: 523–547.View ArticleGoogle Scholar
- Peng, S.Z., and Z.T. Guo. 2007. Clay mineral composition of the Tertiary red clay and the Quaternary loess-palaeosols as well as its environmental implication. Chinese J. Quaternary Science 27: 277–285.Google Scholar
- Pham, B.T., D.T. Bui, I. Prakash, and M.B. Dholakia. 2017. Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibilityassessment at Himalayan area (India) using GIS. CATENA 149: 52–63.View ArticleGoogle Scholar
- Rumelhart, D.E., and J.L. McClelland. 1986. Parallel Distributed processing: Exploration in the Microstructure of Cognition. Cambridge: MIT-Press.Google Scholar
- Sexton, R.S., and J.N.D. Gupta. 2000. Comparative evaluation of genetic algorithm and back propagation for training neural networks. Inform. Sciences 129: 45–59.View ArticleGoogle Scholar
- Van Westen, C.J. 1994. GIS in landslide hazard zonation: a review, with examples from Andes of Colombia. In Mountain Environments and Geographic Information Systems, ed. M. Price and I. Heywood, 135–165. Basingstoke: Taylor & Francis.Google Scholar
- Wang, H.B., G.J. Liu, W.Y. Xu, and G.H. Wang. 2005. GIS-based landslide hazard assessment: An overview. Progress in Physical Geography 29: 548–567.View ArticleGoogle Scholar
- Xie, M.W., T. Esaki, and G.Y. Zhou. 2004. GIS-based probabilistic mapping of landslide hazard using a three-dimensional deterministic model. Natural Hazards 33: 265–282.View ArticleGoogle Scholar
- Yilmaz, Y. 2009. An Agent Simulation Study on Conflict, Community Climate, and Innovation in Open Source Communities. International Journal of Open Source Software and Processes 1(4): 1–25.View ArticleGoogle Scholar
- Zhang, A.L., Z.T. Yang, J. Zhong, and F.S. Mi. 1995. Characteristics of late Quaternary activity along the southern border fault zone of Weihe graben basin. Quaternary International 25: 25–31.View ArticleGoogle Scholar