Skip to main content

Exploring time series models for landslide prediction: a literature review

Abstract

Introduction

Landslides pose significant geological hazards, necessitating advanced prediction techniques to protect vulnerable populations.

Research Gap

Reviewing landslide time series analysis predictions is found to be missing despite the availability of numerous reviews.

Methodology

Therefore, this paper systematically reviews time series analysis in landslide prediction, focusing on physically based causative models, highlighting data preparation, model selection, optimizations, and evaluations.

Key Findings

The review shows that deep learning, particularly the long-short-term memory (LSTM) model, outperforms traditional methods. However, the effectiveness of these models hinges on meticulous data preparation and model optimization.

Significance

While the existing literature offers valuable insights, we identify key areas for future research, including the impact of data frequency and the integration of subsurface characteristics in prediction models.

Introduction

Landslides represent prevalent geological hazards observed frequently across the globe (Froude and Petley 2018; Yang et al. 2022a), resulting in fatalities, infrastructure damage, and economic losses. Globally, nearly 60000 citizens have been killed in 12 years as a result of 4862 non-seismic landslides (Froude and Petley 2018). Landslides are triggered by several triggerings such as earthquakes, volcanoes, floods, and rainfall (Ebrahim et al. 2024a). Among these triggers, due to the severe climate change conditions, rainstorms can induce catastrophic landslides (Wu et al. 2020). This means that despite the extensive efforts by the governmental authorities to address the risk of landslides, it still needs to be eliminated (Song et al. 2017). Consequently, studying such environmental risks (i.e., rainfall-induced landslides) is paramount for the safigffety of the existing structures and the city's future development.

Several actions can be taken to prevent such risks, saving the civilians and the infrastructure from any possible damage. The simplest approach is to reinforce all slopes against the worst-case scenario of external triggering by employing stabilizing piles, soil nailing, drainage channels, etc. (Huang and He 2023). However, this methodology wastes resources and time, necessitating prioritization and planning of such resources. Predictions and Early warning systems can forecast future scenarios, assisting authorities in taking timely action to protect vulnerable areas and populations from the devastating impact of landslides, avoiding over 90% of these losses, and prioritizing stabilization planning (Guerrero-Rodriguez et al. 2024; Baum and Godt 2010; Guzzetti et al. 2020; Intrieri et al. 2012)

Early warning and prediction techniques exhibit considerable challenges as geohazard mitigation presents considerable complexity. To illustrate, the landslide triggering features, the corresponding landslide response, and the geological, geotechnical, and hydrological features that control landslide behavior all have complex, nonlinear, uncertain, and dynamic relationships (Dai et al. 2021; He et al. 2021; Tien Bui et al. 2019; Wu et al. 2022; Kang et al. 2017).

To address the aforementioned limitations, artificial intelligence (AI) has emerged as a powerful solution for dealing with geohazard complexities, effectively capturing their nonlinear attributes by mapping input features to output results (Ma et al. 2020; Jiang et al. 2022; Liu et al. 2024). Furthermore, deep learning, a subset of machine learning approaches (LeCun et al. 2015; Phoon and Zhang 2023), uses deep network topologies and complex nonlinear processes to extract different characteristics from data, resulting in exact representations of training datasets. For example, these models have demonstrated their ability to integrate complex nonlinear patterns seen in historical landslide displacement monitoring data (Meng et al. 2024). To clarify, various deep learning models, including convolutional neural networks (CNNs) (Pei et al. 2021), recurrent neural networks (RNNs) (Wang et al. 2021), long short-term memory (LSTM) (Yang et al. 2019), and gated recurrent unit (GRU) (Zhang et al. 2021d), have proven successful in landslide displacement prediction, for example. LSTM and GRU models outperform traditional methods like support vector machine (SVM) (Cai et al. 2016). Deep learning's advantages lie in its versatility across domains, ease of feature extraction, parameter optimization, and scalability (Zhu et al. 2020). The methods mentioned above are commonly used with physically-based causative thresholds (Ebrahim et al. 2024b).

The powerful capabilities of deep learning techniques are dependent on a thorough understanding of the underlying physical responses of landslides, the quality of monitoring data, and the empirical hypertuning of the model parameters (Yuan et al. 2019; Ebrahim et al. 2024a, 2024c). Regarding the physical response, one of the key physical responses is the time delay between triggerings and landslide response (i.e., infiltration and wetting front process dynamics) that deep learning should understand (Bednarczyk 2018; Chen et al. 2018; Sasahara 2017; Zhang et al. 2016; Li et al. 2021; Zhang et al. 2011). As for the data quality, missing data is a common occurrence because of harsh environmental circumstances (Ebrahim et al. 2024c). Concerning the model parameters, deep learning is a random process, hence hypertunning is required to properly choose the model parameters. These challenges can be addressed by time series analysis for accurate predictions, better utilization of deep learning capabilities, and data-related issues management. The time series relationship between triggerings and landslide response can be identified in one of the following scenarios: univariate, multivariate, trends, seasonality, randomness, or autocorrelation.

To demonstrate, a time series denotes a sequential organization of data, typically evenly spread across time intervals. It can be classified into two main types: univariate, which involves a single value at each time, and multivariate, where multiple values are recorded per time step. Time series analysis has a wide range of applications, including landslides. Its applications include prediction, handling missing data, and anomaly detection (Chatfield 2013; Shumway et al. 2000) (Fig. 1).

Fig. 1
figure 1

Time series analysis visualization

Time series analysis is classified into four categories: trend, seasonality, randomness, and autocorrelation. Trends (Fig. 2a) illustrate directional movements, either upward or downward, and seasonality (Fig. 2b) represents patterns that repeat at regular intervals. Randomness (Fig. 2c) emerges as white noise with no recognizable patterns. Autocorrelation (Fig. 2d) represents correlations with delayed duplicates of themselves, marked by unexpected spikes known as "innovations." A time series may exhibit all these trends simultaneously yielding a complex time series. Time series can be stationary and have constant statistical features, but non-stationary series undergo structural changes in response to major events, modifying their behavior (Fig. 2e). The interaction of these variables changes the dynamics of time series data, altering prediction accuracy and analytical methodologies (Chatfield 2013; Shumway et al. 2000).

Fig. 2
figure 2

Types of time series analysis

After reviewing the literature on landslides and their mitigation strategies, it becomes evident that various systematic and methodological approaches have been employed to address the multifaceted challenges posed by these geological hazards. A comprehensive review of existing studies, as presented in Table 1, showcases a diverse range of topics covered, including the management of landslide risks, the utilization of advanced monitoring technologies, the assessment of susceptibility factors, and the development of predictive models. These reviews offer valuable insights into the current state of research and provide essential foundations for further exploration into landslide mitigation strategies.

Table 1 Available literature review that addresses landslides-related studies (modified after (Ebrahim et al. 2024b, 2024c)

Despite the breadth of topics covered in these reviews (Table 1), it is notable that time series analysis, a crucial aspect in landslide prediction and early warning systems, is largely absent. This omission is significant for several reasons. Firstly, time series analysis allows for the examination of temporal correlations between various factors influencing landslides, which is essential for accurate prediction and proactive risk management. Secondly, selecting appropriate hyperparameters in statistical models used for landslide prediction often relies on empirical methods, highlighting the need for a systematic exploration of time series techniques to enhance predictive accuracy.

To this end, this paper aims to bridge this gap by presenting a comprehensive review of time series applications for physically based causative threshold techniques (i.e., based on machine learning and geotechnically and environmentally monitored data), structured to cover various key aspects outlined in Fig. 3. Beginning with data preparation, it progresses through model selection, optimizations, model evaluations, and, ultimately, prediction generation. The paper then delves into identifying gaps and offering future recommendations, culminating in a conclusive summary. This study seeks to provide valuable insights for researchers and practitioners seeking to enhance landslide mitigation efforts through advanced predictive techniques.

Fig. 3
figure 3

Time series modeling process

Research methodology

In this study, a qualitative systematic review methodology is applied. The systematic review process, outlined in Fig. 4, involves three main stages: a) defining research questions, clarifying goals, conducting preliminary investigations, and validating concepts, establishing inclusion and exclusion criteria, and devising a research plan along with selecting appropriate research databases; b) assessing and evaluating the screened and retrieved studies; and c) determining eligibility, extracting, and refining data.

Fig. 4
figure 4

Systematic review process

Identification process

In the initial phase of identification, the research methodology commences by sourcing significant studies related to time series applications in landslide analysis. This section employs keywords, search databases, and predefined inclusion and exclusion criteria to filter the acquired papers. It is recommended to utilize multiple databases in a systematic review to ensure a comprehensive retrieval and assessment of relevant literature. While Scopus, Web of Science, and Google Scholar are commonly utilized databases in engineering research, this study primarily relies on Scopus for the preliminary search, supplemented by the snowballing approach involving Google Scholar and Web of Science. Following the selection of the search database, relevant keywords such as "landslide or landslides" and "machine learning or artificial intelligence or deep learning" are chosen to encompass all available datasets concerning time series applications of landslides.

In any systematic review, the criteria for inclusion and exclusion are pivotal in refining search results and focusing on the most pertinent studies. This research adhered to specific inclusion criteria: 1) studies focusing on time series applications of landslides utilizing machine learning techniques; 2) articles published in peer-reviewed journals; 3) studies published as articles or review submissions; and 4) papers that underwent final publication. Exclusion criteria encompassed: 1) papers published in languages other than English; 2) studies lacking accessible full texts; and 3) publications not originating from journal sources. The Scopus search with inclusion and exclusion were: “(TITLE-ABS-KEY ( "landslide" ) OR TITLE-ABS-KEY ( landslides ) AND TITLE-ABS-KEY ( machine AND learning OR artificial AND intelligence OR deep AND learning ) ) AND ( LIMIT-TO ( SUBJAREA , "eart" ) OR LIMIT-TO ( SUBJAREA , "engi" ) ) AND ( LIMIT-TO ( DOCTYPE , "ar" ) OR LIMIT-TO ( DOCTYPE , "re" ) ) AND ( LIMIT-TO ( EXACTKEYWORD , "machine learning" ) OR LIMIT-TO ( EXACTKEYWORD , "landslide" ) OR LIMIT-TO ( EXACTKEYWORD , "landslides" ) OR LIMIT-TO ( EXACTKEYWORD , "deep learning" ) ) AND ( LIMIT-TO ( LANGUAGE , "english" ) ) AND ( LIMIT-TO ( SRCTYPE , "j" )”.

Screening and evaluation of collected articles

By July 2024, a search of the Scopus database yielded a total of 273 articles. These publications underwent evaluation and assessment using the systematic reviews and meta-analyses (PRISMA) process, as outlined by Moher et al. (2009) (see Fig. 5). Following this approach, 232 papers were excluded due to duplication, irrelevance, or unavailability of complete texts. To demonstrate, landslide prediction systems are classified into four types: a) empirical thresholds, b) physically based causative thresholds, c) deterministic models, and d) susceptibility maps (Ebrahim et al. 2024a, 2024b). This study exclusively includes time series models with physically based causative thresholds, ignoring other studies with different prediction methods or different triggerings. Upon thorough examination of the full texts of each included article, 41 papers met the established inclusion criteria. To broaden the scope of the search, the backward and forward snowballing method (Wohlin 2014) was employed, resulting in the discovery of additional relevant articles beyond those identified through the Scopus search. To demonstrate the snowballing approach, for each article that matched the inclusion criteria, we searched for related studies in the reference lists as well as the article's citation; this procedure is known as backward and forward snowballing. This procedure helps to consider papers that were not included in the search dataset. In combination with manual searches, a total of 159 articles were deemed suitable for inclusion in the study. The manual search is done to consider research related to the methodological and relevant topics.

Fig. 5
figure 5

PRISMA: Screening and selection process diagram

Systematic review discussion

Data preparation

Time step interval

The time series data can be collected in several frequencies, such as minutes, hours, days, and even months. It is a function of the data size, power consumption optimization of the monitoring system, and the required accuracy. Fig. 6a reviews the bibliometric data according to the retrieved studies where two frequencies are utilized: Monthly time steps (Dai et al. 2022; Huang et al. 2022a2023a; Li et al. 2020a; Lian et al. 2013; Liu et al. 2020; Wang et al. 2022, 2023a, b; Xing et al. 2019); and daily time steps (Dassanayake et al. 2023; Filipović et al. 2022; Granata et al. 2022; Han et al. 2021; Nava et al. 2023; Togneri et al. 2022; Xi et al. 2023; Xu et al. 2023; Zhang et al. 2021b). This chart demonstrates that around 59% of the literature that is now accessible uses monthly time steps, whereas 41% integrates daily time steps. In these studies, the frequency is chosen based only on the availability of monitoring data, ignoring the physical and computational backgrounds of the process.

Physically, Ebrahim et al. (2024a), Bontemps et al. (2020), Ng et al. (2001), and Rahimi et al. (2011) concluded that the temporal prediction of the landslides mainly relies on rainfall pattern (i.e., data frequency as illustrated in Fig. 6b). Computationally, Fig. 6b shows that the time series pattern completely differs between monthly and daily time steps. The monthly time steps (line red chart) are a relatively smoothed series in which model performance was adequate even for basic models. In comparison, the daily time steps (blue column chart) are more biased and random necessitating advanced modeling. Till now, investigating how data frequency affects the model performance is lacking, necessitating more focus in future research. In other words, comparative studies should consider this factor to balance data collection and sensor power consumption and prediction accuracies.

Fig. 6
figure 6

a data frequency employed in literature as a percentage of retrieved studies; b illustrative example of different rainfall patterns (i.e., frequencies) as an example of time series

Splitting ratio

Employing physically based threshold models is a random procedure that needs to be well-trained. The training process is then evaluated using additional untrained data, known as validation or testing sets. This method is also random and requires consideration of two factors: a) the temporal sequence of splitting the dataset, and b) the ratio of the training to validation sets. Time-independent applications ignore the temporal ordering while analyzing the data and the data is split using randomly folding techniques. In contrast, time series applications necessitate keeping the temporal ordering while analyzing the data in which the standard holdout strategy should be adopted in which the validation and test sets are at the end of the series (Fig. 7a) (Roberts et al. 2017; Togneri et al. 2022).

Fig. 7
figure 7

a Illustrative view of the testing and training sets; b Training ratio utilized in the literature

The training-to-testing ratio should be carefully chosen since a higher ratio results in better model training. However, larger training-to-testing ratios may affect the evaluation process as the testing set will be small. On the other hand, a smaller ratio will not be sufficient for model training. Du et al. (2013) concluded that a 50% to 90% ratio can achieve reasonable results. Ebrahim et al. (2024a) reviewed related landslide applications and concluded that a ratio of 70% is widely considered in the literature. According to Bergmeir and Benítez (2012), the last 10 to 15% of the time series is typically used as a validation and testing set for better generalization and prediction accuracy.

Figure 7b shows the bibliometric data for the reviewed literature. This figure represents the number of manuscripts versus the training set ratio adopted. It is seen that the ratio of 80% is utilized, followed by the ratios of 90%, 85%, and 70%, respectively. The ratios of 65% and 75% were rarely used. It was found that the dataset itself (i.e., the nature of the collected data) is why a training ratio greater than 80% was employed. To illustrate, generally, it was seen that the time series monitored was in monthly steps and ranged between 48 to 357 steps (Han et al. 2021; Xing et al. 2019). This temporal length is relatively small to train the model (Wang et al. 2023b). As a result, the testing set was selected to be a minimum of 12 steps to be able to evaluate at least the last year. (Huang et al. 2022a; Liu et al. 2020; Wang et al. 2023b; Xing et al. 2019). In other words, the ratio of the validation set should be selected to represent the temporal response of the training set.

Decomposition

The retrieved studies were mainly about landslide surface displacement predictions. As stated earlier, the response to landslides is quite complex. As a result, various attempts have been made in literature to simplify such a procedure. For studies that utilize monthly time series, Fig. 8 depicts how the complicated response of landslide displacement may be decomposed into residual, trend, and seasonal or periodic components, which could help facilitate the analytical process (Han et al. 2021; Huang et al. 2022a, 2023a; Li et al. 2020a; Lian et al. 2013; Liu et al. 2020; Meng et al. 2024; Nava et al. 2023; Wang et al. 2022; Xing et al. 2019; Yang et al. 2019; Zhang et al. 2021b). The analysis and predictions at this time are employed for each pattern individually, then the final prediction is the summation of the trend and periodic terms.

Fig. 8
figure 8

Time series decomposition (modified from (Ebrahim et al. 2024b)

Additionally, the prediction of such an application can be performed for the original data set without any decomposition (Dai et al. 2022; Wang et al. 2023a; Wei et al. 2019; Xi et al. 2023; Xu et al. 2023). Figure 9a) depicts the bibliometric data of the literature, taking into account the number of studies that use decomposition and non-decomposition. Decomposition was found to be commonly employed, with a percentage of 63% compared to 37% for non-decompositions. However, there is still a gap in the literature comparing and highlighting how the decomposition or non-decomposition affects the modeling process and the prediction accuracy.

Fig. 9
figure 9

a Bibliometric data for decomposition and non-decomposition studies; b decomposition techniques

Several decomposition techniques can be used (Fig. 9b) such as ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm (Meng et al. 2024), modified ensemble empirical mode decomposition (EEMD) (Lian et al. 2013), variational mode decomposition (VMD) (Huang et al. 2022a; Wang et al. 2022), double moving average (DMA) (Xing et al. 2019), density-based spatial clustering of applications with noise (DBSCAN) (Huang et al. 2023a), support vector classifier (SVC) (Han et al. 2021), continuous wavelet analysis (Li et al. 2020a), and differencing (Nava et al. 2023).

These methods assume that the trend term depends on the creep behavior and is not affected by external triggering. In contrast, the seasonal term is the only term triggered by seasonal triggering. This assumption has not yet been proved; further proof and research are required. However, differencing methods can overcome this assumption, where the current value is calculated as the difference between two successive steps. Pouzols and Lendasse (2010) revealed that differencing effectively removes the trend, minimizes uncertainties, and offers high prediction accuracy.

Lagged sequence

Rainfall-induced landslides are complex mechanisms where the slope response lagged with a temporal period with the triggering (Chang et al. 2023). Physically, this process is illustrated by the infiltration and surface-runoff mechanisms (Zhang et al. 2011). As for the physically based models such as time series applications, it is paramount to consider such mechanisms by empirically tuning the model and selecting the optimum hyperparameters. This can be accomplished by considering an antecedent period.

To illustrate, Uwihirwe et al. (2020) and Zhao et al. (2019) proved that considering the antecedent period of rainfall improves the model performance in terms of higher accuracies and lower false positive rates while studying empirical models. Similarly, the same concept could be integrated with time series analysis by considering a lagged sequence as an input instead of one single input time step.

Figure 10 illustrates the rainfall variation over time, where such rainfall can be divided into effective and non-effective rainfall events. Effective rainfall is the amount of rainfall that still affects the hydrological response of the slope. In contrast, non-effective rainfall is the amount of water that the slope has drained off already and does not affect the current response of the slope. Thus, the antecedent effect can be simplified to represent the effective rainfall period. The lagged sequence is affected by the recovery time which is a function of the triggering and mechanical and hydrological characteristics of the slope. It is paramount to consider this effect as Han et al. (2021) concluded that neglecting the lagged period has a more significant deviation from the predicted results than those that consider the lagged period.

Fig. 10
figure 10

Effective and non-effective rainfall patterns

The lagged period (i.e., antecedent period) varies in the literature as this period is a function of the slope hydraulic conductivity and other mechanical and hydrological characteristics. Dai et al. (2022), Huang et al. (2023a), Liu et al. (2020), Nava et al. (2023), and Xu et al. (2023) utilized a lagged period of 12-time steps. Zhang et al. (2021b) integrated set pair analysis (SPA) to optimize the antecedent period, and it was found that 18 days provides reasonable accuracy. Li et al. (2020a) utilized different lagges for each input feature, such as two months for rainfall, ten months for reservoir levels, and one month for displacement. According to a related study of irrigation application, Filipović et al. (2022) sensitively investigated several lagged intervals and found that a lagged period of 60 days offers the best prediction accuracy. In this regard, Granata et al. (2022) applied a grid search to select the optimum lagged interval, which was found to be 7 days. However, the research has not given enough consideration to the window size (sequence length) and how the antecedent value influences the prediction model's performance. The reason is that widely the available literature considers smoothed monthly time steps in predicting surface displacement, neglecting to study such factors with spatially varied subsurface dynamic responses.

Feature selection

Intelligence models are governed by their controlling features, such as the high dependency between the model and the feature-controlling factors. Outdated or non-related features can negatively affect the model, necessitating (Cao et al. 2016). Numerous factors affect landslides, including creep and triggering features. Creep features can be represented by geology, geomorphology, soil, hydraulic, and land use features, while triggering features are rainfall, earthquakes, human activities, blasting, reservoir fluctuation, and others, as reviewed by (Ebrahim et al. 2024a) (Fig. 11).

Fig. 11
figure 11

Features affecting landslides (modified from (Ebrahim et al. 2024a))

According to the reviewed studies, which were generally about displacement predictions due to the availability of the surface displacement monitoring, generally rainfall and reservoir level fluctuation are considered to be the external triggering features (Han et al. 2021; Huang et al. 2022a, 2023a; Li et al. 2020a; Liu et al. 2020; Meng et al. 2024; Miao et al. 2018; Wang et al. 2023b; Xing et al. 2019; Xu et al. 2023). Additionally, Selby (1988) concluded that including the state evolution of the landslides improves the prediction process. Similarly, displacement features such as historical values, displacement velocity, displacement increment, displacement change, and displacement evolution state is included as input features for improving the model performance (Huang et al. 2022a, 2023a; Li et al. 2020a; Miao et al. 2018; Nava et al. 2023; Wang et al. 2023b; Xing et al. 2019; Zhang et al. 2021b). The displacement time series was solely considered in the study of (Xi et al. 2023). The study of Xu et al. (2023) employed the groundwater level, surface displacement, and deep displacement features besides the rainfall and reservoir water level variation.

Several statistical methods can be used for feature selection such as gray relation analysis (GRA) (Jiang et al. 2022; Meng et al. 2024; Zhang et al. 2021b), partial autocorrelation function (PACF) algorithms (Meng et al. 2024), the maximal information coefficient (MIC) Huang et al. 2022a), kernel sHAP (Ge et al. 2023), Pearson correlation (Jiang et al. 2022; Wei et al. 2019), R2-adj (Togneri et al. 2022), akaike information criterion (AIC) (Togneri et al. 2022), and the least absolute shrinkage and selection operator (LASSO) (Granata et al. 2022). However, such models face a significant challenge because they neglect temporal dependencies in landslide responses, resulting in the selection of unrelated features. As a result, it is recommended to incorporate knowledge-based methods and sensitivity analysis to consider several temporal dependencies and select the best-related features (Ebrahim et al. 2024a) (refer to Section "Statistical Correlations" for more details).

Statistical correlations

Section "Feature selection" mentions that statistical correlation can be employed to examine the linear and non-linear relation between the input features and the target output (Li et al. 2020b; Reshef et al. 2011). Figure 12 shows the most commonly used models in the literature. Among these models, Pearson's correlation coefficient is commonly used in feature selection algorithms due to its simple yet practical nature. Table 2 shows the Pearson coefficient values, which range from 0.0 to 0.2 (extremely weak correlation) to 0.8-10 (robust correlation). According to Li et al. (2020b), the maximal information coefficient (MIC) outperforms Pearson's correlation coefficient because it extracts both linear and non-linear correlations, as well as complex correlations, whereas Pearson cannot capture such non-linear behavior.

Fig. 12
figure 12

Statistical models employed for feature selection

Table 2 Pearson coefficient values (modified from (Jiang et al. 2022))

Another technique called the Shapley additive explanations (SHAP) algorithm is developed based on game theory to illustrate the output of any machine learning model (Lundberg and Lee, 2017). SHAP evaluates the contribution of each feature and indicates how positive or negative its role is in the prediction process. Many approximation methods have been developed to overcome the challenges in calculating SHAP, such as Tree SHAP, Deep SHAP, and Kernel SHAP (Baptista et al. 2022).

Gray relational analysis theory, a key component of gray system theory, is a statistical approach that examines the relationships between multiple factors by assessing the degree of correlation, known as Gray correlation, using sample data from each factor. This correlation assesses how closely the geometric shapes of data curves align, with closer shapes indicating stronger correlations (Liu et al. 2022). In the study by Zhang et al. (2021b), a gray correlation was employed to identify primary influencing factors, with a correlation coefficient exceeding 0.6, indicating a significant association with periodic displacement (Wang et al. 2004).

The stacked prediction models were developed using the elastic net (EN) algorithm as the meta-classifier, as outlined by (Granata et al. 2022). The EN algorithm, introduced by Zou and Hastie (2005), combines two widely used regularized variants of linear regression: the least absolute shrinkage and selection operator (LASSO) method and the ridge method. The LASSO method identifies the most influential variables by introducing an absolute penalty in ordinary least squares (OLS) regression. Meanwhile, ridge regularization applies a penalty in the OLS formulation by penalizing the square weights rather than the absolute weights. Consequently, this approach penalizes large weights significantly while distributing many small weights across the feature spectrum.

Various techniques can also be incorporated, such as the information gain ratio (Tien Bui et al. 2016), the least support vector machine (Pham et al. 2018), and the Gini information gain (Quinlan 1993). Liu et al. (2020) utilized the Gini information gain method, employing the random forest (RF) approach proposed by Zhang et al. (2020) to evaluate the relative significance of each key factor. Information gain serves the purpose of identifying which feature provides the most helpful information for predicting outcomes. The akaike information criterion (AIC) serves as a measure for evaluating the effectiveness of a statistical model, taking into account both its accuracy in predicting data and its simplicity by penalizing complex models. It offers a means to compare and choose among different models based on their performance (McElreath 2018).

As stated in Section "Feature selection", combining knowledge-based models with sensitivity analysis can enhance performance when selecting controlling features. To illustrate, even though rainfall is one of the main triggers, Hemalatha et al. (2019) observed that rainfall has a low correlation when considered alone. The dynamic effect of rainfall is well explained by the infiltration process, which involves rainwater seeping through the surface and into the landslide body. This process is influenced by a variety of factors, including surface loss, scour, evapotranspiration, plant transpiration, air temperature, net solar radiation, soil temperature, humidity, and wind speed (Suk et al. 2022; Ahmed et al. 2021; Granata et al. 2022). To have a strong correlation between rainfall and the other responses, the elements listed above must be considered.

Another factor that should be sensitively considered is the effective antecedent period which can be achieved using statistical correlations. In the study of Zhang et al. (2021b), Zhang employed the set pair analysis (SPA) method to determine the optimal lag time, which they identified as 18 days. Set pair analysis, initially introduced by Zhao (1989), is a statistical approach that deals with deterministic-uncertain problems through quantitative analysis of identical and contrary sets. The antecedent period can be assessed using the Pearson correlation method, as suggested by Dai and Lee (2001). A higher Pearson correlation coefficient indicates a stronger relationship between variables. According to Han et al. (2017), a Pearson correlation exceeding 0.6 indicates a high correlation among independent variables (refer to Table 2 for further details). However, combining these models with knowledge-based approaches is rarely discussed in the literature, necessitating additional research since these methods can effectively capture and incorporate temporal dependencies, improving feature selection robustness and accuracy.

Model selection

According to the reviewed literature, some research relied on such static models, with some do not account for time series dependencies, such as (Dassanayake et al. 2023; Filipović et al. 2022; Ge et al. 2023; Huang et al. 2023a; Meng et al. 2024; Togneri et al. 2022; Wang et al. 2023a; Xi et al. 2023; Gong et al. 2021; Guo et al. 2019; Han et al. 2021; Li et al. 2018; Xu et al. 2023). However, such models can be further improved as it was found that deep learning models offer higher performance than static and shallow models. Varangaonkar and Rode (2023) proved that the long short-term memory (LSTM) model outperforms support vector machine (SVM) and artificial neural network (ANN) models. Ge et al. (2023) showed that gated recurrent unit (GRU) surpasses particle swarm optimization - support vector regression (PSO-SVR) and bidirectional recurrent neural network (BRNN). Wang et al. (2023b) indicated that the LSTM model can surpass the primary recurrent neural network (RNN) model. Dai et al. (2022) revealed that the LSTM has higher accuracy compared to the classical back propagation neural network (BPNN). Liu et al. (2020) showed that LSTM is more accurate than GRU and random forest (RF) models. Filipović et al. (2022) figured that LSTM offered better prediction results than RF and the auto-regressive integrated moving average (ARIMA). Conventional neural networks (CNN) (Wang et al. 2023a), (CNN- BiGRU- Attention) (Meng et al. 2024) can offer higher prediction results. Huang et al. (2022a) showed that the LSTM and the salp-swarm-algorithm-optimized temporal convolutional network (SSA-TCN) have almost similar performance. Based on the reviewed literature, the LSTM model generally has better prediction results than other models (Huang et al. 2022a; Xi et al. 2023; Xing et al. 2019).

To illustrate, artificial intelligence can be divided into two terms: a) ANI (artificial narrow intelligence) and b) AGI artificial general intelligence (Goertzel 2014). AGI is still under development, while ANI is widely adopted for numerous applications. The learning algorithms of ANI have been revolutionized as a result of the advancement of computational devices (Semmler and Rose 2017). The advanced learning algorithms include neural networks, deep learning, and decision trees. The architecture of the neural network is presented in Fig. 13. The architecture of the layer consists of input features, hidden layers, activation functions, and output layers. The hidden layer consists of several neurons, and each neuron applies a function f(x) in which the input is (x) and the output is the probability of y=1. Each hidden layer aims to convert the input features to new features that fit well with the labeled output y (Wilamowski 2009). Each problem necessitates a different design of the network in terms of the number of hidden layers, neurons in each layer, activation function utilized, and the number of output neurons (single or multiple).

Fig. 13
figure 13

Neural networks architecture (modified from (Alekseev et al. 2023))

Sequences and time series applications typically involve inputs and outputs that vary over time. Unlike basic neural networks, recurrent neural networks (RNNs) are preferred for such tasks. This preference stems from the fact that basic models do not share features learned at different positions within the sequence. Figure 14 illustrates the architecture of the recurrent neural network. RNN is affected by vanishing gradients and exploding gradients. To overcome this issue, a gradient clipping is assigned with a maximum cap or threshold for exploding gradients. However, vanishing gradients are challenging to overcome. To illustrate vanishing gradients, for deep RNN, the first layer's effect vanishes with time steps. For this reason, gated recurrent unit (GRU) and long-short-term memory (LSTM) models were developed to overcome such issues (Wang et al. 2020).

Fig. 14
figure 14

Recurrent neural network (RNN) network (modified from (Yang et al. 2021)). U, V, and W represent the weights from the input layer to the hidden layer, from the hidden layer to the output layer, and for self-recursion, respectively. Tx and Ty represent the input (x) and the output (y) sequence length

As illustrated earlier, landslide response lagged with the rainfall triggering, necessitating selecting the appropriate features while using static models to account for such temporal relation (Meng et al. 2024). Static models cannot build a temporal connection between the input features and the external triggering (Zhang et al. 2022a). On the other hand, dynamic and deep learning models can extract the non-linear correlation between the triggering and the landslide response (Zhang et al. 2022b). For more illustration, under the same triggering conditions, the slope may present different responses based on the antecedent characteristics of the slope. Under an extensive rainfall event, the slope may still be stable, but it may fail if small rainfall events trigger it after that due to its antecedent status (Crozier and Glade 2005; Li et al. 2018; Yang et al. 2019; Zhang et al. 2021a). Consequently, the temporal effect and the time series are also paramount factors to consider necessitating advanced deep models.

Optimizations

Hypertuning

Prediction using machine learning models is an empirical process that necessitates selecting the appropriate parameters. This can be accomplished using optimization techniques. Two optimization processes are required: the first is to optimize the selected model by Hypertuning its parameters to achieve the best accuracy, and the second is to optimize the training process to achieve fast convergence. Model structure Hypertuning can be achieved through several optimization techniques as outlined in Fig. 15. Optimization techniques (Fig. 15) include grid search, random search, and other optimization techniques such as Bayesian optimization, particle swarm optimization (PSO), genetic algorithms (GA), successive halving (SH), sparrow search algorithm (SSA), etc (Bergstra and Bengio 2012; Jiang et al. 2022; Ma et al. 2023; Snoek et al. 2012; Xu et al. 2023).

Fig. 15
figure 15

Utilized optimization techniques in the retrieved studies

Grid search is a heuristic search method within a predetermined subset of a learning algorithm's hyperparameter space, as outlined by (Granata et al. 2022). This algorithm uses a specific performance metric, such as R2, MAE, or RMSE, to guide its exploration and evaluation of potential hyperparameter combinations. Grid search, while comprehensive, is often time-consuming due to its requirement to evaluate and try all possible combinations of hyperparameters (Liu et al. 2020). On the other hand, random search employs a randomized sampling approach, which, although faster, may not always yield the optimal hyperparameter combination. According to Xu et al. (2023), Bayesian optimization demonstrates similar performance to random search, particularly in high-dimensional searching space.

PSO has emerged in recent years (Ni et al. 2013; Parsopoulos and Vrahatis 2002; Poli et al. 2007). It operates by iteratively refining solutions, starting from random initial solutions and evaluating their quality based on fitness. PSO stands out for its simplicity, high precision, and rapid convergence compared to alternative algorithms. The genetic algorithm (GA), belonging to the family of evolutionary algorithms, stands as a meta-heuristic search technique. Renowned for its robust global search capability, the GA effectively navigates solution spaces even without gradient information from error functions, making it a potent tool across optimization, search, and machine learning domains. Wei et al. (2019) employed the GA to optimize the connection weights of neural networks, addressing the common challenge of local minimum entrapment. Genetic algorithms have effectively enhanced learning efficiency and computational accuracy when tuning model hyperparameters (Ma et al. 2023).

The SH model, as utilized by Xu et al. (2023), operates on the principle of dynamic resource allocation. It aims to optimize hyperparameters by efficiently allocating computational resources (Jamieson and Talwalkar, 2015). If specific hyperparameter configurations prove less effective, their evaluation is halted, and resources are redirected to more promising configurations. Inspired by the non-stochastic 'Best-Arm Problem,' SH prioritizes allocating resources to the most promising methods. By halving the available resources successively over multiple rounds, SH selects the optimal configuration from a set of hyperparameter configurations. The sparrow search algorithm (SSA), as mentioned by Jiang et al. (2022) and Xue and Shen (2020), is an innovative method inspired by the foraging and anti-predatory behaviors observed in sparrows. This approach offers several advantages, including its robustness, fast convergence rate, and effectiveness in seeking optimal solutions.

Given that machine learning models are entirely empirical processes, hyperparameters play a critical role in shaping a model's structure by selecting the appropriate model dimensions. They must be established before the learning process starts. The primary goal of the aforementioned techniques is to a) accurately capture the best dimensions that help the model achieve better prediction performance, and b) to converge faster with fewer computational demands. The main concept for the above-mentioned models can be simplified as shown in Fig. 16, where the model searches from coarse to fine scales to accurately capture the model's best structure while also converging faster.

Fig. 16
figure 16

Random searching with course to fine method for effective hyperparameter searching

Training optimizer

As for optimizing the training process, choosing the appropriate number of iterations, learning rate, and monitoring metrics is challenging. To clarify, a small number of iterations causes high bias issues while, on the contrary, long iterations may cause high variance issues, as illustrated in Fig. 17a). Figure 17a depicts the typical relationship between iteration of the training and validation sets and their corresponding loss, highlighting two issues: a) underfitting (high bias), and b) overfitting (high variance). Furthermore, selecting the optimum learning rate is challenging to avoid the local minima issues, as illustrated in Fig. 17b. To illustrate, the learning rate should be adjusted to reach the optimum values of the model weights (w) and the Baias (b) within a reasonable computational time. In other words, a too-small value of learning rate (α) requires huge computational time, and large learning (α) rate value will be misleading (refer to Fig. 17b). Thus, choosing an appropriate learning rate is essential as a larger value may not converge.

Ebrahim et al. (2024d) investigated several optimization techniques, and it was found that Adam optimizers offer the best prediction accuracy. The Adam algorithm, widely employed for loss function optimization, outperforms traditional gradient descent methods (Togneri et al. 2022) by ensuring swift convergence and mitigating the risk of getting trapped in local minima. Its effectiveness lies in its ability to dynamically adapt learning rates for each parameter, leading to improved convergence speed and more efficient exploration of the solution space (Chiang et al. 2022; Huang et al. 2023a; Wang et al. 2023a; Xing et al. 2019).

Loss functions

Quantifying the loss during the training and validation of the training process is a vital step in monitoring the accuracy of the model performance. Section "Model evaluations" provides equations for several evaluation metrics that can be used to monitor model performance during the training phase. For instance, the Huber loss function (Equation 1) (Meng et al. 2024; Nava et al. 2023), mean squared error (MSE) (Chiang et al. 2022), and root mean squared error (RMSE) (Granata et al. 2022; Togneri et al. 2022). Ebrahim et al. (2024d) investigated several loss functions and found that MSE and Huber provide better performance. The Huber loss function, as utilized in the study by Holland and Welsch (1977) and Huang and Wu (2021), integrates and refines both the mean square error (MSE) and mean absolute error (MAE), offering a balanced optimization approach. Monitoring and quantifying training and validation loss can assist improve performance by overcoming underfitting and overfitting challenges. This can be achieved by terminating the model training at the time when the loss is minimal for both training and validation losses (refer to Fig. 17a).

$$\left\{ {\begin{array}{*{20}c} {\frac{{a^{2} }}{2}\quad \quad\quad\quad\;{\text{ if}}\;\left| a \right| \le \delta } \\ {\delta \left( {\left| a \right| - \frac{a}{2}} \right)\;{\text{ otherwise}}} \\ \end{array} } \right.$$
(1)

where \(a = y_{i} - \widehat{y}_{i}\) is the difference between the true value \(\widehat{y}_{i}\) and the predicted value \(y_{i}\) and \(\delta\) is a threshold parameter (James et al. 2023).

Fig. 17
figure 17

a Loss versus iterations for cross-validation (CV) and training sets; b Loss versus iterations for different learning rates (modified from (Khang Pham, 2023))

Normalization

Since the triggering and landslide responses vary quantitatively, training process convergence may be difficult, as illustrated in Fig. 18. Fig. 18 depicts two scenarios: a) two features with different scales, and b) two features that have comparable scales. The first case is computationally challenging because the gradient descent during training proceeds in small steps, whereas the second case converges more quickly. Consequently, normalizing and scaling the data is paramount (Varangaonkar and Rode 2023). Several techniques can be used, such as minimum and maximum normalization (Equation 2) (Granata et al. 2022), mean normalization (Equation 3), Z score normalization (Equation 4) (Togneri et al. 2022), etc.

$$X_{j - scaled}^{(i)} = \frac{{X_{j}^{(i)} }}{{X_{j - \max } }},0 \le X_{j - scaled}^{(i)} \le 1$$
(2)
$$X_{j - scaled}^{(i)} = \frac{{X_{j}^{(i)} - \mu_{j} }}{{X_{j - \max } - X_{j - \min } }}$$
(3)
$$X_{j - scaled}^{(i)} = \frac{{X_{j}^{(i)} - \mu_{j} }}{{\sigma_{j} }}$$
(4)
$$\mu_{j} = \frac{1}{m}\sum\limits_{i = 1}^{m} {X_{j}^{(i)} }$$
(5)
$$\sigma_{j}^{2} = \frac{1}{m}\sum\limits_{i = 1}^{m} {\left( {X_{j}^{(i)} - \mu_{j} } \right)}^{2}$$
(6)

where Xj(i) is the input feature or variable, Xj-max(i) is the maximum value of Xj(i), Xj-min(i) is the minimum value of Xj(i), μ refers to the mean, and σ indicates the standard deviation.

Fig. 18
figure 18

Feature scaling visualization

Overfitting

Although model training monitoring indicates model performance, this indication may be misleading because the model may be overfitting. The overfitting issue can be illustrated as visualized in Fig. 19, in which the model behaves well on the training set; however, the performance largely deviates from the validation or testing sets. This issue can be solved by four different techniques: a) by getting more data; b) by not using more unrelated features; c) by monitoring the training process; and d) by reducing the size of the training weights (regularization - R).

Fig. 19
figure 19

Comparison of different regularization configurations: a high bias (underfit); b just proper (generalization); c overfit (high variance) (modified from (Singh 2018))

The first two proposed solutions to the overfitting problem are discussed in Sections "Splitting ratio" (splitting ratio and dataset volume) and "Feature Selection" (proper feature selection). The third technique, as illustrated in Section "Training optimizer" and Fig. (17a), is to monitor the training process to select the best iteration and stop training the model at the point of best performance with no overfitting.

Besides the aforementioned techniques, some additional layers can be added to the model to overcome this issue, such as L2 regularization (Xing et al. 2019) or dropout layers (Chiang et al. 2022; Huang et al. 2023a). These layers are designed to reduce the size of the training weights (w) while maintaining the same model output. The generalization term can be assigned a value R that is greater than zero. Figure 19 depicts three cases with various R values: a) A high R-value that reduces the value of w, resulting in underfitting; b) An appropriate R-value that improves performance while overcoming overfitting and underfitting issues; and c) A low R-value that eliminates the generalization term. Thus, selecting and hypertunning the appropriate value of R is critical to overcoming the problem of overfitting.

Activation functions

The activation function is a higher-level feature that aims to convert the input features to new features that fit well with the labeled output Y. It should be noted that in simple models such as linear regression, choosing the appropriate feature is performed manually. In contrast, in neural networks, this process is performed automatically using the activation functions. As a result, activation functions (refer to Fig. 20) play a pivotal role in shaping the behavior and performance of neural networks. Several activations can be used, such as linear, ReLU, Leaky ReLU, tanh, sigmoid, etc. Each activation function presents distinct characteristics, requiring careful consideration based on the nature of the data and the objectives of the neural network model (Dubey et al. 2022). Table 3 summarizes the activation function characteristics. Among all these functions ReLU and Leaky ReLU remain more popular for hidden layer activations due to more straightforward gradient calculation (Wang et al. 2023a; Xi et al. 2023; Togneri et al. 2022).

Fig. 20
figure 20

Different activation functions: a ReLU; b sigmoid; c Leaky ReLU; d linear; e and tanh

Table 3 Activation functions characteristics

Model evaluations

The time series analysis in landslide applications is limited, and generally, it is about landslide displacement prediction. This section reviews the widely adopted metrics among these studies, which may help future studies select the appropriate metrics (Scikit-Learn 2024a). Such metrics evaluate the model performance during the training, validation, and testing sets. Model evaluation can be performed using two techniques: a) unweighted method and b) weighted method. Most current research uses the first (unweighted) method, in which all the datasets are assigned the same error weight. On the other hand, the weighted method is rarely used. To clarify, the weighted method assigns different error weights for the creep and the mutual points in which the critical points receive a high error weight (Togneri et al. 2022).

Figure 21 shows the number of manuscripts versus the metric employed among the selected literature retrieved for this aim. Among all metrics, the RMSE (Scikit-Learn 2024b) is the most employed among all metrics, followed by the MAE (Scikit-Learn 2024c) and MAPE (Scikit-Learn 2024d), recording a ratio of 86.5%, 37.8%, and 35.1%, respectively. Some studies utilized R2 (Scikit-Learn 2024e) and absolute error as metrics, while these studies are around 32.4% and 13.5%, respectively. The remaining metrics such as MSE (Liu et al. 2016), R2 adj (Togneri et al. 2022), R (Huang et al. 2022a), EF (Huang et al. 2023a), PICP (Ge et al. 2023), RI (Wei et al. 2019), MASE, and SMAPE (Filipović et al. 2022) were utilized rarely in literature according to the objective of the study. Equations 7-11 illustrate the widely used metrics: RMSE, MAE, MAPE, R2, and Absolute error. For the RMSE, MAE, MAPE, and absolute error, the greater the value of these parameters, the worse the prediction performance of the model whereas when these values are closer to 0 indicates higher accuracy. For the R2, the greater the factor, the better the model behaves.

$$RMSE = \sqrt {\frac{1}{N}\sum\limits_{i = 1}^{N} {\left( {X_{i} - Y_{i} } \right)^{2} } }$$
(7)
$$MAE = \frac{1}{N}\sum\limits_{i = 1}^{N} {\left| {X_{i} - Y_{i} } \right|}$$
(8)
$$MAPE = \frac{1}{N}\sum\limits_{i = 1}^{N} {\frac{{\left| {X_{i} - Y_{i} } \right|}}{{X_{i} }}}$$
(9)
$$R^{2} = 1 - \left( {\sum\limits_{i = 1}^{N} {\left( {X_{i} - Y_{i} } \right)^{2} } /\sum\limits_{i = 1}^{N} {\left( {Y_{i} - \overline{Y} } \right)^{2} } } \right)$$
(10)
$$Absolute{\text{ error}} = \left| {X_{i} - Y_{i} } \right|$$
(11)

where Yi is the specific value of the ith real data; \(\overline{Y}\) is the average value of the real data; Xi is the specific value of the ith predicted data.

Fig. 21
figure 21

Metrics employed in literature

Predictions

Predicted target

This study reviews the physically based causative thresholds time series applications of landslides. Among the reviewed studies, the surface displacement predictions are the most predicted targets (Huang et al. 2022a, 2023a; Lian et al. 2013; Liu et al. 2020; Nava et al. 2023; Wang et al. 2022, 2023a, b; Xi et al. 2023). The reason for such several studies is the availability of the GPS or GNSS-monitored surface displacement through long intervals of up to 12 years in some studies (Wang et al. 2023b). Such studies may not represent the physical response of the slope as they only rely on the surface response. Recently, some studies considered the deep displacement predictions (Han et al. 2021; Xu et al. 2023; Zhang et al. 2021b) in which information about the sliding surface initiation and also the surface commutative values can be predicted (Fig. 22a). However, predictions of the hydrological response, such as the volumetric water content predictions, matric suctions, and the groundwater level variation of the landslides, are rarely considered.

Fig. 22
figure 22

a predicted target; b landslide type versus the number of studies

Reservoir landslides received significant attention among scholars (Han et al. 2021; Huang et al. 2022a, 2023a; Li et al. 2020a; Meng et al. 2024; Nava et al. 2023; Wang et al. 2023a, b; Xu et al. 2023) while other types of landslides such as deep-seated landslides (Wei et al. 2019), rock slope type (Jiang et al. 2022) received a little attention. On the other hand, rainfall-induced shallow landslides are missing from the retrieved literature (Fig. 22b).

The above two paragraphs suggest that more research should be employed to account for the inner response of several types of landslides. Some key differences may arise in three aspects: a) data preparation; b) feature selection; and c) physical response. As for the data preparation, current available studies were mainly about surface displacement that showed trends and periodic terms. In contrast, the inner response may have no trend and cannot be decomposed into a trend or periodic terms, necessitating investigating its prediction. Regarding feature selection, the features that control reservoir landslides are not necessarily to be the same that control other types of landslides, highlighting the need for further investigation. Concerning the physical response, the spatio-temporal responses are completely different for the surface and inner responses, highlighting the need for further investigations.

Single and interval predictions

Predictions are associated with unavoidable uncertainties that arise from the model assumptions. Therefore, it is vital to quantify such uncertainties that may help in understanding the predicted values well. According to the reviewed studies, widely single predictions were utilized where a single time step ahead is predicted neglecting to quantify the corresponding uncertainties (Dai et al. 2022; Dassanayake et al. 2023; Filipović et al. 2022; Granata et al. 2022; Han et al. 2021; Huang et al. 2022a, 2023a; Li et al. 2020a; Lian et al. 2013; Liu et al. 2020; Nava et al. 2023; Togneri et al. 2022; Wang et al. 2023a, b; Wang et al. 2022; Xi et al. 2023; Xu et al. 2023; Zhang et al. 2021b).

Figure 23 a shows the bibliometric data of the retrieved studies where the studies that account for the uncertainties are around 9% only. These minor studies quantified the uncertainties through interval predictions (Ge et al. 2023; Xing et al. 2019). To illustrate the interval predictions, Figure 23b shows a schematic view of the single and interval predictions where the X represents the single predictions, and the dashed line shows the upper and lower boundaries of the interval predictions.

Fig. 23
figure 23

a prediction status versus the number of studies; b Single and interval prediction schematic view

Xing et al. (2019) provided a detailed derivation of calculating the interval predictions: Based on the assumption that the random variable ζ has a zero mean Gauss distribution and is independent of the input variable x and for a given confidence level of po, the output interval of the model is presented in Equation 12.

$$\left[ {\widehat{y} - \left| {\widehat{\sigma }\ln \left( {1 - p_{0} } \right)} \right|,\widehat{y} + \left| {\widehat{\sigma }\ln \left( {1 - p_{0} } \right)} \right|} \right]$$
(12)

where y is the output variable, σ is a parameter of the density function of the Laplace distribution and \(\widehat{\sigma }\) is the estimated value of σ.

Discussions

Time series analysis of physically-based causative thresholds is a key aspect of landslide prediction and early warning systems. These models are simpler than deterministic models since they do not require extensive geotechnical datasets. Furthermore, these models surpass empirical thresholds because they account for the physical subsurface response.

Table 1 lists the available reviews in the literature which cover the following topics: Management of landslide risks, Utilization of advanced monitoring technologies, Assessment of susceptibility factors, and Development of predictive models. However, to the best of the author's knowledge, there is no literature addressing the time series application for rainfall-induced landslides in the accessible literature, highlighting the novelty of this study.

The main modeling processes were discussed: a) data preparation; b) model selection; c) optimizations; d) model evaluation; and e) predictions. This process is completely empirical where the AI model is affected by numerous factors. As a result, for each modeling step, several modeling concepts were discussed to highlight the physical meaning, illuminating how to select the appropriate parameter dimensions. For instance, the data preparation process is controlled by selecting the appropriate data frequency, splitting ratio, decomposition technique, window size, and the best-related features. Similarly, Table 4 highlights the main modeling process and the findings summary of the retrieved studies.

Table 4 Summary of the review's main findings

As discussed in the preceding sections, model performance is determined by a variety of modeling factors, including data preparation, model selection, optimizations, evaluations, and predictions which are also affected by the inventory data accuracy. This process is completely empirical and necessitates the use of an appropriate optimization technique. Moreover, this process also necessitates integrating knowledge-based techniques for selecting the best-related features. To illustrate, the theoretical relationship between input and output parameters is already known. Thus, theoretically, if the model's input parameters represent physical features, the output will be identical to the physical model (i.e., analytical models), emphasizing the importance of knowledge-based techniques. Table 5 compares some of the related literature considering surface displacement as a case study for the comparison. This table highlights some of the aforementioned factors, as well as a comparison of all models based on the final accuracy and performance of the provided models. The arrangement of the discussion is intended to provide knowledge of controlling features and initial conditions. However, the theoretical background of these AI models is discussed by Merghadi et al. (2020), who thoroughly examined the theoretical background for machine learning algorithms.

Table 5 Comparison among several models considering surface displacement as a case study

In general, there is no superior model, but there is a superior prediction based on considering the aforementioned modeling process. Regarding the data preparation and model selection process, it is seen that investigating the affecting features while taking the actual initial condition into account, greatly improves the model's accuracy (Han et al. 2021). As a result, outdated or unrelated features can have a negative impact on the model, so it is best to remove them (Cao et al. 2016). For example, BPNN (Liu et al. 2016) and MLR (Krkač et al. 2020) offer reasonable accuracy, while they are limited in their ability to account for dynamic and non-linear relationships. This is because the well-established dataset accurately depicts the physical mechanism. Marrapu et al. (2021) concluded that ANNs with a large dataset are more accurate than ANNs with a small dataset. However, if the dataset lacks useful information, the model must be improved to better fit these complex relationships (Zhang et al. 2021c). The improvement of the modeling can be achieved by data decomposition, for example, Wang et al. (2023d) decomposed the displacement into trend and periodic datasets, making it easier to select a suitable model. Moreover, optimization techniques can improve such processes where Li et al. (2020c) and Zhang et al. (2021d) showed that optimizations can enhance the accuracy of a model by selecting the most suitable parameter. Other controlling parameters are clearly illustrated in Tables 4 and 5.

These models are built on the assumptions listed in the following lines: a) The relationship between the landslide and the feature factor that controls this phenomenon will not change significantly in the future; b) These models cannot predict sudden failure because such events are rarely present in the training set. Furthermore, these models have a few limitations: a) Regression models, unlike landslide susceptibility maps, are only appropriate for small areas due to their reliance on field monitoring data; b) causative thresholds take into account only one limited feature, such as displacement or groundwater level, ignoring all other characteristics, such as spatial variation in land cover, soil type, topography, and geotechnical and hydrological parameters. These models have the following advantages: a) artificial intelligence models can predict any causative features using available monitoring data; thus, these models can provide an accurate warning than empirical-statistical thresholds; and b) these models are a cost-effective method for landslide prediction because they do not necessitate extensive geotechnical investigation.

Gaps and future directions

The literature presents valuable insights into landslide prediction by utilizing deep learning models, which have demonstrated notable accuracy. However, there are notable gaps, particularly in the realm of time series physically based causative thresholds models, which integrate mechanical and hydrological characteristics of landslides. These gaps primarily pertain to the accessibility of monitoring data and the prediction methodology. The current literature is constrained by the availability of monitoring data, emphasizing the necessity for expanding subsurface monitoring systems. Additionally, some literature lacks a comprehensive understanding of landslide mechanisms, thus highlighting the need for a more knowledge-based approach. Refer to Table 6 for a summary of these gaps and recommendations for future directions as follows:

  1. 1.

    Assessment of Subsurface Characteristics: Prediction models predominantly rely on surface measurements, neglecting subsurface mechanical and hydrological characteristics. Prioritizing the assessment of these characteristics offers a more accurate depiction of landslide mechanisms, directly considering the underlying factors driving landslides.

  2. 2.

    Spatiotemporal dynamics in prediction methods: Single-point prediction methods often overlook spatiotemporal dynamics, limiting their effectiveness. To address this, integrating information from diverse monitored data sources is recommended, emphasizing the need for comprehensive monitoring to enhance predictive accuracy.

  3. 3.

    Impact of Data Frequency: The influence of data frequency on prediction accuracy is frequently disregarded. It is suggested that incorporating field-monitored data into prediction techniques is essential for understanding the impact of data frequency on accuracy and operational costs. Analyzing data frequency comprehensively can lead to more accurate predictions and optimize operational efficiency.

  4. 4.

    Effect of Time Series Decomposition: The impact of time series decomposition on prediction accuracy remains largely unexplored. Future research should conduct comparative analyses between decomposed and non-decomposed methodologies to gain deeper insights into their mechanisms.

  5. 5.

    Temporal Correlations in Feature Selection: Statistical techniques commonly employed for feature selection often overlook temporal correlations in the data. Leveraging deep learning and knowledge-based approaches is recommended to capture and incorporate temporal dependencies, thus enhancing feature selection robustness and accuracy.

  6. 6.

    Weighted Evaluation Methodologies: Many studies assign equal weight to all datasets, potentially resulting in misleading conclusions, particularly for lengthy datasets with numerous non-critical points. Weighted evaluation methodologies are proposed to accurately capture critical points in landslide applications, prioritizing their detection to mitigate risks effectively.

Table 6 The current gaps and future recommendations

Conclusions

The systematic review of time series models for landslide prediction yields valuable insights into the current research landscape in this domain. Analysis of diverse studies reveals several key findings, implications, and avenues for future exploration as follows:

Firstly, the review underscores the significance of data frequency in landslide prediction models. It highlights substantial pattern disparities between monthly and daily time steps, underscoring the need to further explore how data frequency influences model efficacy. Temporal ordering considerations in splitting training, validation, and testing sets are emphasized. Notably, an 80% training ratio is widely adopted. Furthermore, challenges associated with time series data decomposition are discussed, particularly in discerning trend and seasonal components. Monthly time series, for instance, often exhibit seasonal, trend, and residual terms, necessitating meticulous decomposition to trend and periodic components. Data frequency variations, such as the absence of seasonality in daily time steps, require tailored methodologies. The review also addresses the variability in lagged periods across literature, influenced by factors like slope hydraulic conductivity. Feature selection methods, such as statistical models, cannot extract temporal correlations, necessitating integrating knowledge-based considerations.

Regarding modeling approaches, dynamic and deep learning models like LSTM are found to outperform static models such as artificial neural networks (ANN), support vector machine (SVM), random forest (RF), and statistical models such as autoregressive integrated moving average (ARIMA). Deep learning models extract the temporal and non-linear correlation between the triggering and the landslide response. However, meticulous data preparation, including data frequency, sampling ratio, decomposition, temporal correlations, and feature selection, is emphasized to ensure model effectiveness.

In machine learning, hyperparameter optimization strategies are crucial for model performance. Notably, using the random search method and the preference for the Adam optimizer and loss functions like MSE and Huber provide better performance and help save time and converge faster. Overfitting mitigation strategies such as getting more data, not using more unrelated features, monitoring the training process, reducing the training weights' size, and carefully selecting activation functions are underscored. Additionally, the review notes the prevalent use of unweighted RMSE, MAE, and MAPE metrics for evaluation. According to the reviewed studies, widely single predictions for the surface displacement of reservoir landslides were utilized where a single time step ahead is predicted, urging exploration of interval predictions and a broader scope encompassing diverse landslide typologies.

Addressing gaps and future recommendations, the review advocates for integrating diverse data sources, exploring the impact of diverse monitored data for spatiotemporal predictions, data frequency on prediction accuracy for balancing between monitoring systems cost and prediction accuracy, adopting weighted evaluation methodologies to account for the critical points, especially for such catastrophic events (i.e., landslides), investigating time series decomposition effects, integrating knowledge-based models with statistical models to account for the temporal correlations, and developing further research related to the subsurface response of landslides instead of relying on the shallow responses.

In conclusion, the review is a comprehensive guide for scholars and practitioners in advancing landslide prediction techniques. By addressing identified gaps and implementing recommended future directions, researchers can enhance the accuracy and efficacy of time series models, contributing to improved disaster mitigation efforts.

Availability of data and materials

No datasets were generated or analysed during the current study.

References

  • Ahmed FS, Bryson LS, Crawford MM (2021) Prediction of seasonal variation of in-situ hydrologic behavior using an analytical transient infiltration model. Eng Geol 294:106383

    Article  Google Scholar 

  • Alekseev A, Kozhemyakin L, Nikitin V, Bolshakova J (2023) Data preprocessing and neural network architecture selection algorithms in cases of limited training sets—on an example of diagnosing alzheimer’s disease. Algorithms 16(5):219

    Article  Google Scholar 

  • Angeli MG, Pasuto A, Silvano S (2000) A critical review of landslide monitoring experiences. Eng Geol 55(3):133–147

    Article  Google Scholar 

  • Auflič MJ, Herrera G, Mateos RM, Poyiadji E, Quental L, Severine B, Marturia J (2023) Landslide monitoring techniques in the Geological Surveys of Europe. Landslides 20(5):951–965

    Article  Google Scholar 

  • Baptista ML, Goebel K, Henriques EM (2022) Relation between prognostics predictor evaluation metrics and local interpretability SHAP values. Artif Intell 306:103667

    Article  Google Scholar 

  • Barra A, Solari L, Béjar-Pizarro M, Monserrat O, Bianchini S, Herrera G, Moretti S (2017) A methodology to detect and update active deformation areas based on Sentinel-1 SAR images. Remote Sens 9(10):1002

    Article  Google Scholar 

  • Baum RL, Godt JW (2010) Early warning of rainfall-induced shallow landslides and debris flows in the USA. Landslides 7:259–272

    Article  Google Scholar 

  • Bednarczyk Z (2018) Identification of flysch landslide triggers using conventional and ‘nearly real-time’ monitoring methods – An example from the Carpathian Mountains, Poland. Eng Geol 244:41–56

    Article  Google Scholar 

  • Bergmeir C, Benítez JM (2012) On the use of cross-validation for time series predictor evaluation. Inf Sci 191:192–213

    Article  Google Scholar 

  • Bergstra J, Bengio Y (2012) Random Search for Hyper-Parameter Optimization. J Mach Learn Res 13:281–305

    Google Scholar 

  • Bontemps N, Lacroix P, Larose E, Jara J, Taipe E (2020) Rain and small earthquakes maintain a slow-moving landslide in a persistent critical state. Nat Commun 11(1):780

    Article  CAS  Google Scholar 

  • Breglio G, Bernini R, Berruti GM, Bruno FA, Buontempo S, Campopiano S, Cusano A (2023) Innovative photonic sensors for safety and security, part III: environment, agriculture and soil monitoring. Sensors 23(6):3187

    Article  CAS  Google Scholar 

  • Cai Z, Xu W, Meng Y, Shi C, Wang R (2016) Prediction of landslide displacement based on GA-LSSVM with multiple factors. Bull Eng Geol Env 75:637–646

    Article  Google Scholar 

  • Cao Y, Yin K, Alexander DE, Zhou C (2016) Using an extreme learning machine to predict the displacement of step-like landslides in relation to controlling factors. Landslides 13(4):725–736

    Article  Google Scholar 

  • Chae BG, Park HJ, Catani F, Simoni A, Berti M (2017) Landslide prediction, monitoring and early warning: a concise review of state-of-the-art. Geosci J 21:1033–1070

    Article  Google Scholar 

  • Chang Z, Huang F, Huang J, Jiang S-H, Liu Y, Meena SR, Catani F (2023) An updating of landslide susceptibility prediction from the perspective of space and time. Geosci Front 14(5):101619

    Article  Google Scholar 

  • Chatfield C (2013) The analysis of time series: theory and practice, 1st edn. Springer, Cham

    Google Scholar 

  • Chen G, Zhang G, Lu S, Wang X (2018) An attempt to quantify the lag time of hydrodynamic action based on the long-term monitoring of a typical landslide, Three Gorges China. Math Probl Eng 2018:5958436. https://doi.org/10.1155/2018/5958436

    Article  CAS  Google Scholar 

  • Chiang JL, Kuo CM, Fazeldehkordi L (2022) Using deep learning to formulate the landslide rainfall threshold of the potential large-scale landslide. Water 14(20):3320

    Article  Google Scholar 

  • Crozier, M. J., & Glade, T. (2005). Landslide hazard and risk: issues, concepts and approach. Landslide hazard and risk (eds T. Glade, M. Anderson and M.J. Crozier), 1–40.

  • Dai FC, Lee CF (2001) Frequency–volume relation and prediction of rainfall-induced landslides. Eng Geol 59(3–4):253–266

    Article  Google Scholar 

  • Dai C, Li W, Wang D, Lu H, Xu Q, Jian J (2021) Active landslide detection based on Sentinel-1 data and InSAR technology in Zhouqu county, Gansu province, Northwest China. J Earth Sci 32:1092–1103

    Article  Google Scholar 

  • Dai Y, Dai W, Yu W, Bai D (2022) Determination of landslide displacement warning thresholds by applying DBA- LSTM and numerical simulation algorithms. Appl Sci 12(13):6690

    Article  CAS  Google Scholar 

  • Dassanayake SM, Mousa A, Fowmes GJ, Susilawati S, Zamara K (2023) Forecasting the moisture dynamics of a landfill capping system comprising different geosynthetics: A NARX neural network approach. Geotext Geomembr 51(1):282–292

    Article  Google Scholar 

  • De Graff JV (2011) Perspectives for systematic landslide monitoring. Environ Eng Geosci 17(1):67–76

    Article  Google Scholar 

  • Du J, Yin K, Lacasse S (2013) Displacement prediction in colluvial landslides, Three Gorges Reservoir China. Landslides 10(2):203–218. https://doi.org/10.1007/s10346-012-0326-8

    Article  Google Scholar 

  • Dubey SR, Singh SK, Chaudhuri BB (2022) Activation functions in deep learning: A comprehensive survey and benchmark. Neurocomputing 503:92–108

    Article  Google Scholar 

  • Ebrahim KMP, Zayed T, Meguid MA (2024d) Enhancing landslide prediction with deep learning: insights into soil moisture dynamics. Faculty of Construction and Environment, The Hong Kong Polytechnic University, Department of Building and Real Estate

    Google Scholar 

  • Ebrahim KMP, Gomaa SMMH, Zayed T, Alfalah G (2024a) Rainfall-induced landslide prediction models, part ii: deterministic physical and phenomenologically models. Bull Eng Geol Env 83(3):1–30

    Article  Google Scholar 

  • Ebrahim KMP, Gomaa SMMH, Zayed T, Alfalah G (2024b) Landslide prediction models, Part I: Empirical statistical and physically based causative thresholds. Faculty of Construction and Environment, The Hong Kong Polytechnic University, Department of Building and Real Estate

    Google Scholar 

  • Ebrahim KMP, Gomaa SMMH, Zayed T, Alfalah G (2024c) Recent phenomenal and investigational subsurface landslide monitoring techniques: a mixed review. Remote Sens 16(2):385

    Article  Google Scholar 

  • Eyo EE, Musa TA, Omar KM, Idris M, K., Bayrak, T., Onuigbo, I. C., & Opaluwa, Y. D. (2014) Application of low-cost GPS tools and techniques for landslide monitoring: A review. Jurnal Teknologi 71(4):71–78

    Article  Google Scholar 

  • Filipović N, Brdar S, Mimić G, Marko O, Crnojević V (2022) Regional soil moisture prediction system based on Long Short-Term Memory network. Biosys Eng 213:30–38

    Article  Google Scholar 

  • Froude MJ, Petley DN (2018) Global fatal landslide occurrence from 2004 to 2016. Nat Hazard 18(8):2161–2181

    Article  Google Scholar 

  • Gao W, Dai S, Chen X (2020) Landslide prediction based on a combination intelligent method using the GM and ENN: two cases of landslides in the Three Gorges Reservoir. China Landslides 17(1):111–126

    Article  Google Scholar 

  • Ge Q, Sun H, Liu Z, Wang X (2023) A data-driven intelligent model for landslide displacement prediction. Geol J 58(6):2211–2230

    Article  Google Scholar 

  • Goertzel B (2014) Artificial general intelligence: concept, state of the art, and future prospects. J Artif General Intell 5(1):1–48

    Article  Google Scholar 

  • Gong W, Juang CH, Wasowski J (2021) Geohazards and human settlements: Lessons learned from multiple relocation events in Badong. China-Eng Geol Perspect Eng Geol 285:106051

    Google Scholar 

  • Granata F, Di Nunno F, Najafzadeh M, Demir I (2022) A stacked machine learning algorithm for multi-step ahead prediction of soil moisture. Hydrology 10(1):1

    Article  Google Scholar 

  • Guerrero-Rodriguez B, Garcia-Rodriguez J, Salvador J, Mejia-Escobar C, Cadena S, Cepeda J, Mulero-Perez D (2024) Improving landslide prediction by computer vision and deep learning. Integr Comput-Aided Eng 31(1):77–94

    Article  Google Scholar 

  • Guo Y, Wu W, Du M, Liu X, Wang J, Bryant CR (2019) Modeling climate change impacts on rice growth and yield under global warming of 1.5 and 2.0 C in the Pearl River Delta China. Atmosphere 10(10):567

    Article  Google Scholar 

  • Guzzetti F, Gariano SL, Peruccacci S, Brunetti MT, Marchesini I, Rossi M, Melillo M (2020) Geographical landslide early warning systems. Earth Sci Rev 200:102973

    Article  Google Scholar 

  • Han Y, Zheng FL, Xu XM (2017) Effects of rainfall regime and its character indices on soil loss at loessial hillslope with ephemeral gully. J Mt Sci 14:527–538

    Article  Google Scholar 

  • Han H, Shi B, Zhang L (2021) Prediction of landslide sharp increase displacement by SVM with considering hysteresis of groundwater change. Eng Geol 280:105876

    Article  Google Scholar 

  • He X, Xu C, Qi W, Huang Y, Cheng J, Xu X, Dai B (2021) Landslides triggered by the 2020 Qiaojia M w5. 1 earthquake, Yunnan, China: distribution, influence factors and tectonic significance. J Earth Sci 32(5):1056–1068

    Article  Google Scholar 

  • Hemalatha T, Ramesh MV, Rangan VP (2019) Effective and accelerated forewarning of landslides using wireless sensor networks and machine learning. IEEE Sens J 19(21):9964–9975

    Article  Google Scholar 

  • Holland PW, Welsch RE (1977) Robust regression using iteratively reweighted least-squares. Commun Stat-Theory Methods 6(9):813–827

    Article  Google Scholar 

  • Huang Y, He Z (2023) Rainfall-oriented resilient design for slope system: Resilience-enhancing strategies. Soils Found 63(2):101297

    Article  Google Scholar 

  • Huang S, Wu Q (2021) Robust pairwise learning with Huber loss. J Complex 66:101570

    Article  Google Scholar 

  • Huang D, He J, Song Y, Guo Z, Huang X, Guo Y (2022a) Displacement prediction of the Muyubao landslide based on a GPS time-series analysis and temporal convolutional network model. Remote Sens 14(11):2656

    Article  Google Scholar 

  • Huang J, Wu X, Ling S, Li X, Wu Y, Peng L, He Z (2022b) A bibliometric and content analysis of research trends on GIS-based landslide susceptibility from 2001 to 2020. Environ Sci Pollut Res 29(58):86954–86993

    Article  Google Scholar 

  • Huang F, Xiong H, Chen S, Lv Z, Huang J, Chang Z, Catani F (2023a) Slope stability prediction based on a long short-term memory neural network: Comparisons with convolutional neural networks, support vector machines and random forest models. Int J Coal Sci Technol 10(1):18

    Article  CAS  Google Scholar 

  • Huang G, Du S, Wang D (2023b) GNSS techniques for real-time monitoring of landslides: a review. Satell Navigat 4(1):5

    Article  Google Scholar 

  • Intrieri E, Gigli G, Mugnai F, Fanti R, Casagli N (2012) Design and implementation of a landslide early warning system. Eng Geol 147:124–136

    Article  Google Scholar 

  • James G, Witten D, Hastie T, Tibshirani R, Taylor J (2023) An introduction to statistical learning: with applications in Python, 1st edn. Springer, Cham

    Book  Google Scholar 

  • Jamieson, K., & Talwalkar, A. (2015). Non-stochastic Best Arm Identification and Hyperparameter Optimization. ArXiv. /abs/1502.07943

  • Jiang S, Liu H, Lian M, Lu C, Zhang S, Li J, Li P (2022) Rock slope displacement prediction based on multi- source information fusion and SSA-DELM model. Front Environ Sci 10:982069

    Article  Google Scholar 

  • Kang F, Xu B, Li J, Zhao S (2017) Slope stability evaluation using Gaussian processes with various covariance functions. Appl Soft Comput 60:387–396

    Article  Google Scholar 

  • KhangPham.(2023).Overfitting,Generalization&theBias-VarianceTradeoff.Retrievedfrom https://medium.com/@khang.pham.exxact/overfitting-generalization-the-bias-variance-tradeoff-5800f8c2200

  • Krkac M, Spoljaric D, Bernat S, Arbanas SM (2017) Method for prediction of landslide movements based on random forests. Landslides 14(3):947–960

    Article  Google Scholar 

  • Krkač M, Bernat Gazibara S, Arbanas Ž, Sečanj M, Mihalić Arbanas S (2020) A comparative study of random forests and multiple linear regression in the prediction of landslide velocity. Landslides 17(11):2515–2531

    Article  Google Scholar 

  • Lapenna V, Perrone A (2022) Time-lapse electrical resistivity tomography (TL-ERT) for landslide monitoring: recent advances and future directions. Appl Sci 12(3):1425

    Article  CAS  Google Scholar 

  • LeCun Y, Bengio Y, Hinton G (2015) Deep Learning. Nature 521(7553):436–444

    Article  CAS  Google Scholar 

  • Li H, Xu Q, He Y, Deng J (2018) Prediction of landslide displacement with an ensemble-based extreme learning machine and copula models. Landslides 15:2047–2059

    Article  Google Scholar 

  • Li H, Xu Q, He Y, Fan X, Li S (2020a) Modeling and predicting reservoir landslide displacement with deep belief network and EWMA control charts: a case study in Three Gorges Reservoir. Landslides 17(3):693–707

    Article  Google Scholar 

  • Li W, Fang H, Qin G, Tan X, Huang Z, Zeng F, Li S (2020b) Concentration estimation of dissolved oxygen in Pearl River Basin using input variable selection and machine learning techniques. Sci Total Environ 731:139099

    Article  CAS  Google Scholar 

  • Li SH, Wu LZ, Chen JJ, Huang RQ (2020c) Multiple data-driven approach for predicting landslide deformation. Landslides 17(3):709–718

    Article  Google Scholar 

  • Li Z, Cheng P, Zheng J (2021) Prediction of time to slope failure based on a new model. Bull Eng Geol Env 80(7):5279–5291. https://doi.org/10.1007/s10064-021-02234-1

    Article  Google Scholar 

  • Lian C, Zeng Z, Yao W, Tang H (2013) Displacement prediction model of landslide based on a modified ensemble empirical mode decomposition and extreme learning machine. Nat Hazards 66:759–771

    Article  Google Scholar 

  • Lian C, Zeng Z, Yao W, Tang H (2014) Extreme learning machine for the displacement prediction of landslide under rainfall and reservoir level. Stoch Env Res Risk Assess 28(8):1957–1972

    Article  Google Scholar 

  • Liu Y, Liu D, Qin Z, Liu F, Liu L (2016) Rainfall data feature extraction and its verification in displacement prediction of Baishuihe landslide in China. Bull Eng Geol Env 75(3):897–907

    Article  CAS  Google Scholar 

  • Liu ZQ, Guo D, Lacasse S, Li JH, Yang BB, Choi JC (2020) Algorithms for intelligent prediction of landslide displacements. J Zhejiang Univ-Sci A 21(6):412–429

    Article  Google Scholar 

  • Liu G, Ye L, Chen Q, Chen G, Fan W (2022) Abnormal event detection of city slope monitoring data based on multi-sensor information fusion. Bull Geol Sci Technol 41(2):13–25

    Google Scholar 

  • Liu S, Wang L, Zhang W, Sun W, Wang Y, Liu J (2024) Physics-informed optimization for a data-driven approach in landslide susceptibility evaluation. J Rock Mech Geotech Eng. https://doi.org/10.1016/j.jrmge.2023.11.039

    Article  Google Scholar 

  • Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in neural information processing systems, 30.

  • Ma J, Niu X, Tang H, Wang Y, Wen T, Zhang J (2020) Displacement prediction of a complex landslide in the Three Gorges Reservoir Area (China) using a hybrid computational intelligence approach. Complexity 2020:1–15

    Google Scholar 

  • Ma Y, Li H, Wang L, Zhang W, Zhu Z, Yang H, Yuan X (2022) Machine learning algorithms and techniques for landslide susceptibility investigation: a literature review. Tumu Yu Huanjing Gongcheng Xuebao/j Civ Environ Eng 44:53–67

    Google Scholar 

  • Ma HS, Wang HL, Wang RB, Meng QX, Yang LL (2023) Automatic back analysis of mechanical parameters using block discrete element method and PSO algorithm. Eur J Environ Civ Eng 27(7):2576–2586

    Article  Google Scholar 

  • Marrapu BM, Kukunuri A, Jakka RS (2021) Improvement in prediction of slope stability & relative importance factors using ANN. Geotech Geol Eng 39(8):5879–5894

    Article  Google Scholar 

  • McElreath R (2018) Statistical rethinking: A Bayesian course with examples in R and Stan. Chapman and Hall/CRC, London

    Book  Google Scholar 

  • Meng S, Shi Z, Peng M, Li G, Zheng H, Liu L, Zhang L (2024) Landslide displacement prediction with step- like curve based on convolutional neural network coupled with bi-directional gated recurrent unit optimized by attention mechanism. Eng Appl Artif Intell 133:108078

    Article  Google Scholar 

  • Merghadi A, Yunus AP, Dou J, Whiteley J, ThaiPham B, Bui DT, Abderrahmane B (2020) Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth-Sci Rev 207:103225

    Article  Google Scholar 

  • Miao F, Wu Y, Xie Y, Li Y (2018) Prediction of landslide displacement with step-like behavior based on multialgorithm optimization and a support vector regression model. Landslides 15:475–488

    Article  Google Scholar 

  • Moher D, Liberati A, Tetzlaff J, Altman DG (2009) Preferred reporting items for systematic reviews and meta- analyses: the PRISMA statement. J Clin Epidemiol 62(10):1006–1012

    Article  Google Scholar 

  • Nava L, Carraro E, Reyes-Carmona C, Puliero S, Bhuyan K, Rosi A, Catani F (2023) Landslide displacement forecasting using deep learning and monitoring data across selected sites. Landslides 20(10):2111–2129

    Article  Google Scholar 

  • Ng CW, Wang B, Tung YK (2001) Three-dimensional numerical investigations of groundwater responses in an unsaturated slope subjected to various rainfall patterns. Can Geotech J 38(5):1049–1062

    Article  Google Scholar 

  • Ni L, Jiang J, Pan Y (2013) Leak location of pipelines based on transient model and PSO-SVM. J Loss Prev Process Ind 26(6):1085–1093

    Article  Google Scholar 

  • Niu X, Ma J, Wang Y, Zhang J, Chen H, Tang H (2021) A novel decomposition-ensemble learning model based on ensemble empirical mode decomposition and recurrent neural network for landslide displacement prediction. Appl Sci (switzerland) 11(10):4684

    CAS  Google Scholar 

  • Parsopoulos KE, Vrahatis MN (2002) Recent approaches to global optimization problems through particle swarm optimization. Nat Comput 1:235–306

    Article  Google Scholar 

  • Pei H, Meng F, Zhu H (2021) Landslide displacement prediction based on a novel hybrid model and convolutional neural network considering time-varying factors. Bull Eng Geol Env 80(10):7403–7422

    Article  Google Scholar 

  • Petrucci O (2022) Landslide fatality occurrence: a systematic review of research published between January 2010 and March 2022. Sustainability 14(15):9346

    Article  Google Scholar 

  • Pham BT, Prakash I, Bui DT (2018) Spatial prediction of landslides using a hybrid machine learning approach based on random subspace and classification and regression trees. Geomorphology 303:256–270

    Article  Google Scholar 

  • Phoon KK, Zhang W (2023) Future of machine learning in geotechnics. Georisk: Assess Manag Risk Eng Syst Geohazards 17(1):7–22

    Google Scholar 

  • Poli R, Kennedy J, Blackwell T (2007) Particle swarm optimization: an overview. Swarm Intell 1:33–57

    Article  Google Scholar 

  • Pouzols, F. M., & Lendasse, A. (2010). Effect of different detrending approaches on computational intelligence models of time series. In The 2010 international joint conference on neural networks (IJCNN) (pp. 1–8). IEEE.

  • Quinlan, J. R. (1993). Combining instance-based and model-based learning. In Proceedings of the tenth international conference on machine learning (pp. 236–243).

  • Rahimi A, Rahardjo H, Leong EC (2011) Effect of antecedent rainfall patterns on rainfall-induced slope failure. J Geotech Geoenviron Eng 137(5):483–491

    Article  Google Scholar 

  • Reshef DN, Reshef YA, Finucane HK, Grossman SR, McVean G, Turnbaugh PJ, Sabeti PC (2011) Detecting novel associations in large data sets. Science 334(6062):1518–1524

    Article  CAS  Google Scholar 

  • Roberts DR, Bahn V, Ciuti S, Boyce MS, Elith J, Guillera-Arroita G, Dormann CF (2017) Cross- validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography 40(8):913–929

    Article  Google Scholar 

  • Sasahara K (2017) Prediction of the shear deformation of a sandy model slope generated by rainfall based on the monitoring of the shear strain and the pore pressure in the slope. Eng Geol 224:75–86

    Article  Google Scholar 

  • Scikit-Learn. (2024a). Metrics and scoring: quantifying the quality of predictions. Retrieved 4, 2024, from https://scikit- learn.org/stable/modules/model_evaluation.html.

  • Scikit-Learn. (2024b). sklearn.metrics.mean_squared_error. Retrieved 4, 2024, from Scikit- Learn: https://scikit- learn.org/stable/modules/generated/sklearn.metrics.mean _squared_error.html.

  • Scikit-Learn.(2024c).sklearn.metrics.mean_absolute_error.Retrieved4,2024,from https://scikit- learn.org/stable/modules/generated/sklearn.metrics.mean_absolute_error.html#sklearn.metricsmean_absolute_error.

  • Scikit-Learn. (2024d). sklearn.metrics.mean_absolute_percentage_error. Retrieved 4, 2024, from https://scikit- learn.org/stable/modules/generated/sklearn.metrics.mean_absolute_percentage_error.html#sklearn.metrics.mean_absolut e_percent age_error.

  • Scikit-Learn.(2024e).sklearn.metrics.r2_score.Retrieved4,2024,from https://scikit- learn.org/stable/modules/generated/sklearn.metrics.r2_score.html?high light=r2#sklearn.metrics.r2_score.

  • Segoni S, Piciullo L, Gariano SL (2018) A review of the recent literature on rainfall thresholds for landslide occurrence. Landslides 15(8):1483–1501

    Article  Google Scholar 

  • Selby MJ (1988) Landslides: causes, consequences and environment. J R Soc N Z 18(3):343–343

    Article  Google Scholar 

  • Semmler S, Rose Z (2017) Artificial Intelligence: Application today and implications tomorrow. Duke l & Tech Rev 16:85

    Google Scholar 

  • Shamshi MA (2004) Technologies convergence in recent instrumentation for natural disaster monitoring and mitigation. IETE Tech Rev 21(4):277–290

    Article  Google Scholar 

  • Shano L, Raghuvanshi TK, Meten M (2020) Landslide susceptibility evaluation and hazard zonation techniques– a review. Geoenvironmental Disasters 7:1–19

    Article  Google Scholar 

  • Shumway RH, Stoffer DS, Stoffer DS (2000) Time series analysis and its applications, 4th edn. Springer, New York

    Book  Google Scholar 

  • Singh, S. (2018). Understanding the Bias-Variance Tradeoff. Published in Towards Data Science. Retrieved from https://towardsdatascience.com/understanding-the-bias-variance-tradeoff-165e6942b229

  • Snoek, J., Larochelle, H., & Adams, R. P. (2012). Practical Bayesian Optimization of Machine Learning Algorithms. ArXiv. /abs/1206.2944

  • Song D, Choi C, Ng CWW, Zhou G (2017) Geophysical flows impacting a flexible barrier: effects of solid-fluid interaction. Landslides 15:99–110

    Article  Google Scholar 

  • Suk JW, Jeong HS, Jung MS, Kang HS, Kim HJ, Choi SG (2022) Prediction of Shallow Failure on a Slope Using Volumetric Water Content Gradient Characteristics. Appl Sci 12(11):5308

    Article  CAS  Google Scholar 

  • Tien Bui D, Tuan TA, Klempe H, Pradhan B, Revhaug I (2016) Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 13:361–378

    Article  Google Scholar 

  • Tien Bui D, Moayedi H, Gör M, Jaafari A, Foong LK (2019) Predicting slope stability failure through machine learning paradigms. ISPRS Int J Geo Inf 8(9):395

    Article  Google Scholar 

  • Tofani V, Raspini F, Catani F, Casagli N (2013) Persistent Scatterer Interferometry (PSI) technique for landslide characterization and monitoring. Remote Sensing 5(3):1045–1065

    Article  Google Scholar 

  • Togneri R, dos Santos DF, Camponogara G, Nagano H, Custodio G, Prati R, Kamienski C (2022) Soil moisture forecast for smart irrigation: The primetime for machine learning. Expert Syst Appl 207:117653

    Article  Google Scholar 

  • Uwihirwe J, Hrachowitz M, Bogaard TA (2020) Landslide precipitation thresholds in Rwanda. Landslides 17(10):2469–2481

    Article  Google Scholar 

  • Varangaonkar P, Rode SV (2023) Lightweight deep learning model for automatic landslide prediction and localization. Multimed Tools Appl 82(21):33245–33266

    Article  Google Scholar 

  • Wang Y, Fang Z, Wang M, Peng L, Hong H (2020) Comparative study of landslide susceptibility mapping with different recurrent neural networks. Comput Geosci 138:104445

    Article  Google Scholar 

  • Wang J, Nie G, Gao S, Wu S, Li H, Ren X (2021) Landslide deformation prediction based on a GNSS time series analysis and recurrent neural network model. Remote Sens 13(6):1055

    Article  Google Scholar 

  • Wang R, Zhang K, Qi J, Xu W, Long Y, Huang H (2022) A prediction model of hydrodynamic landslide evolution process based on deep learning supported by monitoring big data. Front Earth Sci 10:829221

    Article  Google Scholar 

  • Wang L, Wu C, Yang Z, Wang L (2023a) Deep learning methods for time-dependent reliability analysis of reservoir slopes in spatially variable soils. Comput Geotech 159:105413

    Article  Google Scholar 

  • Wang L, Xiao T, Liu S, Zhang W, Yang B, Chen L (2023b) Quantification of model uncertainty and variability for landslide displacement prediction based on Monte Carlo simulation. Gondwana Res 123:27–40

    Article  Google Scholar 

  • Wang H, Long G, Shao P, Lv Y, Gan F, Liao J (2023c) A DES-BDNN based probabilistic forecasting approach for step-like landslide displacement. J Cleaner Prod 394:136281

    Article  Google Scholar 

  • Wang R, Zhang K, Wang W, Meng Y, Yang L, Huang H (2023d) Hydrodynamic landslide displacement prediction using combined extreme learning machine and random search support vector regression model. Eur J Environ Civ Eng 27(6):2345–2357

    Article  Google Scholar 

  • Wang, Y., Yin, K. L., & An, G. F. (2004). Grey correlation analysis of sensitive factors of landslide. ROCK AND SOIL MECHANICS-WUHAN-, 25(1; ISSU 90), 91–93.

  • Wei ZL, Lü Q, Sun HY, Shang YQ (2019) Estimating the rainfall threshold of a deep-seated landslide by integrating models for predicting the groundwater level and stability analysis of the slope. Eng Geol 253:14–26

    Article  Google Scholar 

  • Wilamowski BM (2009) Neural network architectures and learning algorithms. IEEE Ind Electron Mag 3(4):56–63

    Article  Google Scholar 

  • Wohlin, C. (2014). Guidelines for snowballing in systematic literature studies and a replication in software engineering. ACM International Conference Proceeding Series, 1–10.

  • Wu L, Huang R, Li X (2020) Hydro-mechanical analysis of rainfall-induced landslides. Springer, Singapore, pp 1–235

    Google Scholar 

  • Wu H, Chen Y, Lv H, Xie Q, Chen Y, Gu J (2022) Stability analysis of rib pillars in highwall mining under dynamic and static loads in open-pit coal mine. Int J Coal Sci Technol 9(1):38

    Article  Google Scholar 

  • Xi N, Zang M, Lin R, Sun Y, Mei G (2023) Spatiotemporal prediction of landslide displacement using deep learning approaches based on monitored time-series displacement data: A case in the Huanglianshu landslide. Georisk: Assess Manag Risk Eng Syst Geohazards 17(1):98–113

    Google Scholar 

  • Xing Y, Yue J, Chen C (2019) Interval estimation of landslide displacement prediction based on time series decomposition and long short-term memory network. IEEE Access 8:3187–3196

    Article  Google Scholar 

  • Xu W, Kang Y, Chen L, Wang L, Qin C, Zhang L, Zhang W (2023) Dynamic assessment of slope stability based on multi-source monitoring data and ensemble learning approaches: A case study of Jiuxianping landslide. Geol J 58(6):2353–2371

    Article  Google Scholar 

  • Xue J, Shen B (2020) A novel swarm intelligence optimization approach: sparrow search algorithm. Syst Sci Control Eng 8(1):22–34

    Article  Google Scholar 

  • Yang B, Yin K, Lacasse S, Liu Z (2019) Time series analysis and long short-term memory neural network to predict landslide displacement. Landslides 16:677–694

    Article  Google Scholar 

  • Yang H, Jiang J, Chen G, Mohamed MS, Lu F (2021) A recurrent neural network-based method for dynamic load identification of beam structures. Materials 14(24):7846

    Article  CAS  Google Scholar 

  • Yang HQ, Zhang L, Gao L, Phoon KK, Wei X (2022a) On the importance of landslide management: Insights from a 32-year database of landslide consequences and rainfall in Hong Kong. Eng Geol 299:106578

    Article  Google Scholar 

  • Yang S, Jin A, Nie W, Liu C, Li Y (2022b) Research on SSA-LSTM-Based Slope Monitoring and Early Warning Model. Sustainability 14(16):10246

    Article  Google Scholar 

  • Yao W, Zeng Z, Lian C, Tang H (2014) Training enhanced reservoir computing predictor for landslide displacement. Eng Geol 188:101–109

    Article  Google Scholar 

  • Yuan X, Ou C, Wang Y, Yang C, Gui W (2019) A layer-wise data augmentation strategy for deep learning networks and its soft sensor application in an industrial hydrocracking process. IEEE Trans Neural Netw Learn Syst 32(8):3296–3305

    Article  Google Scholar 

  • Zhang LL, Zhang J, Zhang LM, Tang WH (2011) Stability analysis of rainfall-induced slope failure: a review. Proc Inst Civ Eng-Geotech Eng 164(5):299–316

    Article  Google Scholar 

  • Zhang W, Goh ATC, Zhang Y (2016) Multivariate adaptive regression splines application for multivariate geotechnical problems with big data. Geotech Geol Eng 34(1):193–204

    Article  Google Scholar 

  • Zhang P, Yin ZY, Jin YF, Chan TH (2020) A novel hybrid surrogate intelligent model for creep index prediction based on particle swarm optimization and random forest. Eng Geol 265:105328

    Article  Google Scholar 

  • Zhang J, Tang H, Tannant DD, Lin C, Xia D, Liu X, Ma J (2021a) Combined forecasting model with CEEMD-LCSS reconstruction and the ABC-SVR method for landslide displacement prediction. J Cleaner Prod 293:126205

    Article  Google Scholar 

  • Zhang L, Shi B, Zhu H, Yu XB, Han H, Fan X (2021b) PSO-SVM-based deep displacement prediction of Majiagou landslide considering the deformation hysteresis effect. Landslides 18:179–193

    Article  Google Scholar 

  • Zhang YG, Tang J, He ZY, Tan J, Li C (2021c) A novel displacement prediction method using gated recurrent unit model with time series analysis in the Erdaohe landslide. Nat Hazards 105:783–813

    Article  Google Scholar 

  • Zhang YG, Tang J, Liao RP, Zhang MF, Zhang Y, Wang XM, Su ZY (2021d) Application of an enhanced BP neural network model with water cycle algorithm on landslide prediction. Stoch Env Res Risk Assess 35(6):1273–1291

    Article  Google Scholar 

  • Zhang W, Li H, Tang L, Gu X, Wang L, Wang L (2022a) Displacement prediction of Jiuxianping landslide using gated recurrent unit (GRU) networks. Acta Geotech 17(4):1367–1382

    Article  Google Scholar 

  • Zhang Y, Tang J, Cheng Y, Huang L, Guo F, Yin X, Li N (2022b) Prediction of landslide displacement with dynamic features using intelligent approaches. Int J Min Sci Technol 32(3):539–549

    Article  Google Scholar 

  • Zhao, K. Q. (1989). Theory and analysis of set pair ea new concept and system analysis method. In Conference thesis of system theory and regional planning (pp. 87–91).

  • Zhao B, Dai Q, Han D, Dai H, Mao J, Zhuo L, Rong G (2019) Estimation of soil moisture using modified antecedent precipitation index with application in landslide predictions. Landslides 16(12):2381–2393

    Article  Google Scholar 

  • Zhu ZW, Liu DY, Yuan QY, Liu B, Liu JC (2011) A novel distributed optic fiber transduser for landslides monitoring. Opt Lasers Eng 49(7):1019–1024

    Article  Google Scholar 

  • Zhu L, Huang L, Fan L, Huang J, Huang F, Chen J, Wang Y (2020) Landslide susceptibility prediction modeling based on remote sensing and a novel deep learning algorithm of a cascade-parallel recurrent neural network. Sensors 20(6):1576

    Article  Google Scholar 

  • Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Ser B Stat Methodol 67(2):301–320

    Article  Google Scholar 

  • Zou Y, Zheng C (2022) A Scientometric analysis of predicting methods for identifying the environmental risks caused by landslides. Appl Sci 12(9):4333

    Article  CAS  Google Scholar 

Download references

Acknowledgements

The authors gratefully acknowledge the fund provided by the Hong Kong Polytechnic University.

Funding

There was no funding provided for this research.

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization, K.M.P.E., and T.Z.; methodology, K.M.P.E., and T.Z.; formal analysis, K.M.P.E., and T.Z.; investigation, K.M.P.E., and T.Z.; resources, T.Z.; data curation, K.M.P.E., F.A., F.N., and T.Z.; writing— original draft preparation, K.M.P.E.; writing—review and editing, F.A., F.N., and T.Z.; supervision, T.Z. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Kyrillos M. P. Ebrahim.

Ethics declarations

Ethics approval and consent to participate

Not applicable, as this study is a review of existing literature and does not involve human or animal subjects.

During the preparation of this work, the author(s) used [GPT 3.5] to [rephrase, check grammar and spelling]. After using this tool/service, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ebrahim, K.M.P., Fares, A., Faris, N. et al. Exploring time series models for landslide prediction: a literature review. Geoenviron Disasters 11, 25 (2024). https://doi.org/10.1186/s40677-024-00288-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40677-024-00288-3

Keywords