A Comparative Analysis of Landslide Susceptibility Assessment by Using Global and Spatial Regression Methods in Inje Area, Korea

Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography. 2015. Dec, 33(6): 579-588

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

- Received : December 01, 2015
- Accepted : December 26, 2015
- Published : December 31, 2015


Landslides are major natural geological hazards that cause extensive property damage each year, with both direct and indirect costs. Many researchers have produced landslide susceptibility maps using various techniques over the last few decades. This paper presents landslide susceptibility results for the Inje area of South Korea obtained from a geographically weighted regression model using remote sensing and geographic information system data. Landslide locations were identified from aerial photographs. Eleven landslide-related factors were calculated and extracted from the spatial database and used to analyze landslide susceptibility. Compared with the global logistic regression model, the Akaike Information Criterion improved by 109.12, the adjusted R-squared improved from 0.165 to 0.304, and the Moran’s I index improved from 0.4258 to 0.0553. The comparison of susceptibility obtained from the two models shows that geographically weighted regression has higher predictive performance.
Fig. 1. Study area
The dataset consisted of 232 rows × 370 columns, for a total of 85,840 cells, with landslides represented in 446 of the cells. These 446 cells were divided randomly into two groups, a training set and a validation set. For the training set, 70% of the positive events (landslide-affected cells) were randomly selected, and the same number of negative events (cells not affected by landslides) were collected, giving 624 cells in total. The remaining positive events were used as the validation set.
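The sampling scheme described here can be sketched in a few lines. This is a minimal illustration and not the authors' code: the function name `split_landslide_cells`, the random seed, and the placement of landslide cells in the toy label array are assumptions made only for the example.

```python
import numpy as np

def split_landslide_cells(labels, train_fraction=0.7, seed=0):
    """Split landslide cells into training/validation and draw matching negatives.

    labels: 1-D array of 0/1 flags, one per raster cell (1 = landslide).
    Returns (train_idx, valid_pos_idx) as arrays of cell indices.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    pos = rng.permutation(np.flatnonzero(labels == 1))  # shuffled landslide cells
    neg = np.flatnonzero(labels == 0)                   # non-landslide cells

    n_train_pos = int(train_fraction * pos.size)
    train_pos, valid_pos = pos[:n_train_pos], pos[n_train_pos:]
    # Draw as many negative cells as there are positive training cells.
    train_neg = rng.choice(neg, size=n_train_pos, replace=False)
    return np.concatenate([train_pos, train_neg]), valid_pos

# Toy raster with the dimensions quoted in the text (232 x 370 cells, 446 landslide cells).
labels = np.zeros(232 * 370, dtype=int)
labels[:446] = 1  # hypothetical positions; the real positions come from the inventory
train_idx, valid_pos = split_landslide_cells(labels)
print(train_idx.size, valid_pos.size)
```

With 446 landslide cells, 70% rounds down to 312 positives, so the balanced training set holds 624 cells.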
$$P = \frac{1}{1 + e^{-z}} \tag{1}$$
where $P$ is the probability of landslide occurrence, ranging between 0 and 1 on an S-shaped curve, and $z$ represents a linear combination of the variables through Eq. (2).
$$z = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_n x_n \tag{2}$$
where $\beta_0$ is the intercept of the model, $\beta_i$ ($i = 1, 2, 3, \cdots, n$) are the regression coefficients, and $x_i$ ($i = 1, 2, 3, \cdots, n$) are the explanatory variables (Youssef et al., 2015). The value $z$ varies from $-\infty$ to $+\infty$.
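The relationship between the linear combination z and the probability P can be illustrated numerically. The sketch below is only an example: the two factors and the coefficient values are hypothetical and are not taken from the fitted model in this study.

```python
import numpy as np

def landslide_probability(x, beta0, beta):
    """Linear predictor z = beta0 + sum(beta_i * x_i), then P = 1 / (1 + exp(-z))."""
    z = beta0 + np.dot(x, beta)
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical coefficients for two illustrative factors (not from Table 2).
beta0 = -1.0
beta = np.array([0.08, -0.5])

# Two hypothetical cells that differ only in the first factor.
cells = np.array([
    [10.0, 0.2],
    [35.0, 0.2],
])
probs = landslide_probability(cells, beta0, beta)
print(probs)
```

Because the first coefficient is positive, the cell with the larger first factor receives the higher probability, while both values stay strictly between 0 and 1.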
A positive coefficient indicates that the explanatory variable increases the probability of change, and a negative coefficient indicates the opposite effect. The regression coefficients are obtained by maximizing the likelihood function. A coefficient is significant if the null hypothesis that the coefficient is zero can be rejected at the 0.05 significance level (Hosmer and Lemeshow, 2000; Kleinbaum and Klein, 2002; Van Den Eeckhaut et al., 2006).
In addition, multicollinearity among the independent variables is tested using the TOL (tolerance) and the VIF (Variance Inflation Factor) to improve the model fitting. Variables with VIF > 10 and TOL < 0.1 indicate serious multicollinearity between explanatory variables and are excluded from the logistic analysis (Hosmer and Lemeshow, 2000; Menard, 2002; Zhu and Huang, 2006).
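As a rough illustration of this screening step, the sketch below computes TOL and VIF with plain NumPy, using the standard definitions TOL_k = 1 - R_k^2 and VIF_k = 1 / TOL_k, where R_k^2 comes from regressing variable k on the other explanatory variables. The function name and the synthetic data are assumptions for the example, not part of the study.

```python
import numpy as np

def vif_and_tolerance(X):
    """Return (vif, tol) for each column of X (rows = observations)."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    vif = np.empty(p)
    for k in range(p):
        y = X[:, k]
        others = np.delete(X, k, axis=1)
        A = np.column_stack([np.ones(n), others])      # intercept + other variables
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)   # auxiliary OLS regression
        resid = y - A @ coef
        r2 = 1.0 - (resid @ resid) / ((y - y.mean()) ** 2).sum()
        vif[k] = 1.0 / (1.0 - r2)
    return vif, 1.0 / vif

# Synthetic example: the third column nearly duplicates the first.
rng = np.random.default_rng(0)
x1 = rng.normal(size=200)
x2 = rng.normal(size=200)
x3 = x1 + 0.05 * rng.normal(size=200)
vif, tol = vif_and_tolerance(np.column_stack([x1, x2, x3]))
flagged = (vif > 10) & (tol < 0.1)   # the exclusion rule quoted in the text
print(vif, tol, flagged)
```

In this synthetic case the first and third columns are flagged, while the independent second column is not.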
GWR also reports t-test values for the local parameter estimates (Fotheringham et al., 2002). The GWR model extends OLS (Ordinary Least Squares) regression by allowing regression coefficients to be estimated locally (Feuillet et al., 2014).
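The GWR model can be expressed as:
$$y_j = \beta_0(u_j, v_j) + \sum_{i=1}^{n} \beta_i(u_j, v_j)\, x_{ij} + \varepsilon_j \tag{3}$$
where $u_j$ and $v_j$ are the spatial position of location $j$, $\beta_0(u_j, v_j)$ acts as the intercept, and $\beta_i(u_j, v_j)$ is the local estimated coefficient for explanatory variable $x_{ij}$ (Su et al., 2012); $y_j$ is the dependent variable and $\varepsilon_j$ is the error term at location $j$.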
The GWR uses a kernel bandwidth to determine the spatial scope of spatial dependence and then employs a distance decay function to weight all observations within that scope, on the assumption that observations near point $i$ have more influence on the estimation of $\beta_i(u_j, v_j)$ than observations located farther from $i$ (Feuillet et al., 2014; Tu and Xia, 2008). The distance decay can be modeled with a Gaussian or a bi-square function (Brunsdon et al., 1998; Fotheringham et al., 2002). In this research, the Gaussian distance decay is used as the weight function, in which $w_{ij}$ is the weight for observation $j$ within the neighborhood of observation $i$, $d_{ij}$ represents the distance between observations $i$ and $j$, and $h$ denotes the kernel bandwidth (Su et al., 2012). CV (Cross-Validation) and the AICc (corrected Akaike Information Criterion) are used to select the optimum bandwidth. The AICc is generally more widely applicable than CV and can also be applied in non-Gaussian GWR (Fotheringham et al., 2002; Lloyd, 2010).
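The weighting idea can be sketched as follows. Two assumptions should be noted: the Gaussian kernel is written here as w_ij = exp(-(d_ij / h)^2), which is one common convention (others include a factor of 1/2 in the exponent), and the local fit shown is a weighted least-squares fit for a continuous response rather than the logistic variant. Function names and data are hypothetical.

```python
import numpy as np

def gaussian_weights(coords, i, h):
    """Weights of all observations relative to observation i (assumed kernel form)."""
    d = np.linalg.norm(coords - coords[i], axis=1)   # distances d_ij
    return np.exp(-(d / h) ** 2)

def local_coefficients(coords, X, y, i, h):
    """Weighted least-squares estimate of [intercept, slopes] at observation i."""
    w = gaussian_weights(coords, i, h)
    A = np.column_stack([np.ones(len(y)), X])        # design matrix with intercept
    Aw = A * w[:, None]                              # rows scaled by their weights
    return np.linalg.solve(A.T @ Aw, Aw.T @ y)       # solves (A'WA) b = A'Wy

# Synthetic check: an exactly linear response is recovered at any location.
rng = np.random.default_rng(1)
coords = rng.uniform(0, 100, size=(50, 2))
X = rng.normal(size=(50, 1))
y = 2.0 + 3.0 * X[:, 0]
w0 = gaussian_weights(coords, 0, h=30.0)
beta_local = local_coefficients(coords, X, y, 0, h=30.0)
print(beta_local)
```

A smaller bandwidth h makes the weights fall off faster, so each local fit relies on fewer effective neighbors; this is the trade-off that CV or AICc bandwidth selection is meant to balance.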
Three goodness-of-fit criteria, namely deviance, the AICc, and the BIC (Bayesian Information Criterion, also known as the Schwarz criterion), are used to consider both the fit and the complexity of the model. Lower values of these criteria indicate a more efficient model (Feuillet et al., 2014).
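For reference, the three criteria can be computed from a model's log-likelihood as sketched below. The formulas are the standard textbook definitions; treating the deviance as -2 times the log-likelihood assumes ungrouped binary data, and in GWR the parameter count k is usually an effective number of parameters rather than an integer. The input values are made up for the example.

```python
import math

def information_criteria(log_likelihood, k, n):
    """Return (deviance, AIC, AICc, BIC) for k parameters and n observations."""
    deviance = -2.0 * log_likelihood                   # assumes ungrouped binary data
    aic = deviance + 2.0 * k
    aicc = aic + (2.0 * k * (k + 1.0)) / (n - k - 1.0)  # small-sample correction
    bic = deviance + k * math.log(n)
    return deviance, aic, aicc, bic

# Hypothetical fit: 12 parameters (11 factors + intercept), 714 observations.
deviance, aic, aicc, bic = information_criteria(log_likelihood=-100.0, k=12, n=714)
print(deviance, aic, aicc, bic)
```

All three penalize complexity on top of the deviance, with BIC applying the heaviest penalty once n exceeds about 8 observations.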
Fig. 2. GWR coefficient surfaces of elevation (a), aspect cosine (b), aspect sine (c), slope degree (d), slope length (e), curvature (f), topographic wetness index (g), soil drainage (h), soil thickness (i), timber diameter (j), and timber density (k)

Keywords: Landslide; Landslide Susceptibility Map; Geographically Weighted Regression; Logistic Regression Model

1. Introduction

Over the last decade, natural disasters such as hurricanes, earthquakes, extreme erosion, tsunamis, and landslides have increased sharply. Because of increasing threats from these phenomena, national and local government agencies have expressed concern for human injuries and economic loss (Yilmaz, 2009). Landslides, which account for 4.4% of natural disasters around the world, have increased rapidly in frequency and cause significant damage (1990-2009) (Akgun et al., 2008; Vos et al., 2010; Park et al., 2013). This trend will continue in the coming decades, as regional precipitation, deforestation, urbanization, and development increase (Schuster, 1996).
Under these circumstances, interest in landslide assessment has grown significantly among experts in various fields, such as engineers, geologists, planners, local administrators, and decision makers (Ercanoglu and Gokceoglu, 2004). Assessment and management of landslide damage can be aided by thematic mapping, with the following steps: 1. Landslide inventory maps; 2. Landslide susceptibility maps; 3. Landslide hazard maps; and 4. Landslide risk maps (Kamp et al., 2008). Among these maps, the production of a landslide susceptibility map in the early stage of the assessment process is of crucial importance.
Landslide susceptibility maps have been drawn using various methods across numerous research studies. The methods are divided into qualitative and quantitative. Currently, quantitative techniques are widely used, aided by the technological development of GIS (Geographical Information Systems), which provide a powerful tool for managing and manipulating spatial data. Quantitative techniques are based on numerical expressions of the relationships between controlling factors and landslides (Aleotti and Chowdhury, 1999). Quantitative techniques are divided into deterministic and statistical methods (either bivariate or multivariate), and the majority of researchers prefer the GLR (Global Logistic Regression) model, a statistical method (Ayalew and Yamagishi, 2005; Bai et al., 2010; Bui et al., 2011; Chauhan et al., 2010; Chen and Wang, 2007; Falaschi et al., 2009; Xu et al., 2013).
However, the GLR model cannot take into account the spatial dependence or autocorrelation characteristics of observational data (Erener and Düzgün, 2010). This reduces the efficiency of estimated parameters when evaluating landslide susceptibility. Therefore, the GWR (Geographically Weighted Regression) has been introduced as a method that incorporates spatial variation (Feuillet et al., 2014). Because the GWR model uses a regression model, the advantages of existing models can be applied, and different factors can be estimated for respective regions. This makes it possible to confirm a spatially heterogeneous pattern that is difficult to grasp with existing models. Additionally, it enables the visualization of spatial interactions among data by mapping the results of the GWR analysis using GIS (Ercanoglu and Gokceoglu, 2004).
The goal of our study is to analyze and quantify the improvement in accuracy and explanatory power obtained when landslide susceptibility is analyzed with the GWR model rather than with the previously used GLR model. To accomplish this, the Inje region was selected as the research area, as it was subjected to severe landslide damage in 2006. A spatial database of landslide-related factors was compiled using the DEM (Digital Elevation Model) and various thematic maps. The GWR results were then compared with the GLR results using conformity-measured values and various diagnostic indices.
2. Study Area

Approximately 81% of the total area of Gangwon-do in the central eastern region of Korea is composed of mountains. Most of these mountains have steep and rough terrain with 2 m or less of effective soil depth, conditions that are conducive to landslides (Im, 2009). Three episodes of localized heavy rainfall occurred in the Gangwon-do area in 2006 (July 11–13, July 14–20, July 25–29), including Ewiniar, a category 3 typhoon. These rains were regionally concentrated in the Inje, Yangyang, and Pyeongchang areas, with the heaviest rainfall in about 500 years lasting for about 1–6 hours (Lee and Talib, 2005). This caused approximately 160 billion won in property damage and resulted in 40 or more deaths. According to Kim et al. (2012), landslides occurred in about 400 locations around Inje-eup, Girin-myun, and Nam-myun in Inje-gun. Among these, a survey revealed that Inje-eup experienced the most landslides and the most damage. Therefore, the entire area of Inje-eup was selected as the study area for this analysis of landslide susceptibility (Fig. 1).

3. Data set and methodology

- 3.1. Landslide identification

Accurate identification of landslide locations is critical to analyses of landslide hazards. Field surveys are the most accurate way to identify landslide locations, but terrain and environmental conditions can make access difficult and costly, which limits their use as an initial identification method. Remote sensing methods using data such as aerial photos and satellite imagery are more effective due to their lower cost, and are widely used to identify landslide locations (Liangjie et al., 2012).
Landslide locations for this study were identified using aerial photos taken soon after landslide occurrences. Aerial photos taken on 2 August 2006 using the PKNU (Pukyong National University) IV system were used to identify the locations of landslides that had occurred in the Inje area in July 2006. The collected aerial photos were geometrically corrected using a 1:5,000 digital topographic map and then used to produce orthophotos by creating a mosaic using a DTM (Digital Terrain Model). Landslide locations were digitized by visual interpretation using orthorectified aerial photos.
- 3.2. Spatial dataset

Because landslides result from a combination of various factors such as topography, soil, and forest, these landslide-related factors need to be built into a spatial database for landslide susceptibility analysis. The relevant thematic maps acquired from the government were used to construct a spatial database (Table 1). A total of eleven landslide-related factors were compiled into a spatial database with 10×10-m cells covering the research area using ArcGIS 10.2 software.
Data type and scale of data used in the study


- 3.3. GLR

Regression approaches including linear regression, log-linear regression, and logistic regression can be considered a process to extract the coefficients of empirical relationships from observations (Ozdemir and Altural, 2013). The goal of GLR is to find the best-fitting model to describe the relationship between a dichotomous dependent variable (the presence or absence of landslides) and several explanatory variables. The explanatory variables may be continuous or discrete (with dummy variables) and do not need a normal frequency distribution (Ayalew and Yamagishi, 2005; Van Den Eeckhaut et al., 2006). Quantitatively, the relationship between the dependent variable and the explanatory variables can be expressed in Eq. (1).

- 3.4. GWR

GWR, which is a local modeling technique, aims to capture spatial non-stationarity in the influence of factors on the occurrence of a landslide (Feuillet et al., 2014). The spatial non-stationarity is identified by generating a set of location-specific outputs, including local R-squared, local model residuals, and local parameter estimates.

4. Results and Discussion

- 4.1. Logistic regression

The results of the logistic regression model are presented in Table 2. All explanatory variables had VIF < 10 and TOL > 0.1, which indicates that there was no serious multicollinearity between explanatory variables. In addition, the significance probability value was less than 0.01 for all variables except aspect sine, topographic wetness index, and soil thickness, so all variables other than these three had statistically significant effects on landslides at the 5% significance level. According to the analysis results, aspect sine, slope degree, slope length, soil drainage, soil thickness, timber diameter, and timber density had positive effects on landslide occurrence and were associated with a higher possibility of landslides. The remaining variables had less effect on landslide occurrence. Timber diameter was the most influential of the landslide-related factors, whereas aspect cosine contributed least to landslide occurrence.
Result of logistic regression


- 4.2. GWR

Table 3 summarizes the spatially varying coefficients for 714 sample points. All eleven explanatory variables have both positive and negative coefficient values, although in different proportions. This indicates that the constant coefficient estimates in the logistic regression tend to mask the spatially non-stationary process of landslide occurrence. For slope degree, slope length, and timber density, over 80% of the coefficients were positive, whereas for elevation and aspect cosine, over 80% were negative. The coefficients of aspect sine, curvature, topographic wetness index, soil drainage, soil thickness, and timber diameter were clearly divided between positive and negative values. Such spatially varying coefficients are mostly ignored in orthodox logistic models.
Summary of spatially varying coefficients


- 4.3. Comparison of model performances

The performance of the GLR and GWR models was compared using statistical parameters (Table 4). The GWR model showed significant improvement over the GLR model. First, the GWR model had a much better goodness-of-fit than the GLR model, as shown by the significant decrease in -2 log likelihood. Second, the AICc index was 1148.2641 in the GLR model and 1039.1441 in the GWR model. If the difference in the AICc index between two models is greater than 4, the model is considered to be improved (Charlton and Fotheringham, 2009). The difference between the two models in this study was 109.12, indicating that the conformity of the GWR model was significantly improved. Third, the adjusted R-squared value was 0.165 in the GLR model and 0.304 in the GWR model, which shows that the general explanatory power of the model improved. Fourth, spatial autocorrelation can be examined more quantitatively using Moran’s I index. Moran’s I index of the GLR model was 0.3018 (p<0.01), indicating the existence of spatial autocorrelation. Moran’s I index of the GWR model was lower, at 0.1765 (p<0.01), indicating that the spatial dependence evident in the standardized residuals was reduced through geographical weighting.
Comparison of GLR and GWR results
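To make the residual autocorrelation check concrete, the sketch below computes global Moran's I with NumPy. The spatial weights used in the study are not stated, so a binary distance-band weight matrix (1 for pairs within `bandwidth`, 0 otherwise) is assumed here purely for illustration; the grid data are synthetic.

```python
import numpy as np

def morans_i(values, coords, bandwidth):
    """Global Moran's I with binary distance-band weights (an assumed scheme)."""
    x = np.asarray(values, dtype=float)
    coords = np.asarray(coords, dtype=float)
    n = x.size
    d = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=2)
    W = ((d > 0) & (d <= bandwidth)).astype(float)   # neighbors, excluding self
    z = x - x.mean()
    return (n / W.sum()) * (z @ W @ z) / (z @ z)

# Synthetic 6 x 6 grid: a smooth gradient versus a checkerboard pattern.
gx, gy = np.meshgrid(np.arange(6), np.arange(6))
coords = np.column_stack([gx.ravel(), gy.ravel()])
gradient = gx.ravel().astype(float)
checker = ((gx + gy) % 2).ravel().astype(float)

i_gradient = morans_i(gradient, coords, bandwidth=1.0)
i_checker = morans_i(checker, coords, bandwidth=1.0)
print(i_gradient, i_checker)
```

Clustered values such as the gradient give a positive index, and alternating values give a negative one; residuals with an index closer to zero show less spatial dependence.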


- 4.4. Spatial varying relationships

The GWR model generates a set of coefficient estimates of explanatory variables for each landslide sample point. A set of coefficient surfaces based on the sample points with coefficient estimates was generated to reveal the spatially non-stationary relationship between landslide occurrence and explanatory variables. An IDW (Inverse Distance Weighted) interpolation was employed to generate the coefficient surfaces. Fig. 2 presents the coefficient surface of each explanatory variable. As an example, the coefficient of aspect cosine was negative in the GLR model, and the local GWR coefficients were also mostly negative across the entire study area (Fig. 2-b). However, although the coefficient of slope degree obtained from the GLR model had a positive effect on landslide occurrence, this did not hold true for the entire study area. According to the GWR model, slope degree had a stronger negative influence in the east of the study area than in the west (Fig. 2-d).
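The interpolation step can be sketched as follows. This is a generic inverse-distance-weighted estimator, not the ArcGIS implementation; the function name, the power of 2, and the use of all sample points (no search radius) are assumptions made for the example.

```python
import numpy as np

def idw_interpolate(sample_xy, sample_values, query_xy, power=2.0):
    """Inverse-distance-weighted estimate of a coefficient at each query point."""
    sample_xy = np.asarray(sample_xy, dtype=float)
    sample_values = np.asarray(sample_values, dtype=float)
    query_xy = np.asarray(query_xy, dtype=float)
    d = np.linalg.norm(query_xy[:, None, :] - sample_xy[None, :, :], axis=2)
    out = np.empty(len(query_xy))
    for q in range(len(query_xy)):
        exact = d[q] == 0
        if exact.any():
            # Query coincides with a sample point: return its value directly.
            out[q] = sample_values[exact].mean()
        else:
            w = 1.0 / d[q] ** power                  # closer samples weigh more
            out[q] = (w @ sample_values) / w.sum()
    return out

# Two hypothetical sample points carrying local coefficient estimates.
sample_xy = [[0.0, 0.0], [2.0, 0.0]]
sample_coef = [1.0, 3.0]
surface = idw_interpolate(sample_xy, sample_coef, [[0.0, 0.0], [1.0, 0.0], [0.5, 0.0]])
print(surface)
```

Each interpolated value is a weighted average of the sample coefficients, so the surface never leaves the range of the estimated local coefficients.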

5. Summary and Conclusions

This study analyzed landslide susceptibility in the Inje region. A spatial database was compiled using landslide-related factors derived from aerial photographs and various thematic maps produced by the government. Landslide susceptibility was analyzed using a GWR model, and the results were compared with those of the GLR model to determine how much the model improved. The adjusted R-squared value improved from 0.165 to 0.304, and the AICc, a conformity-measured value of a model, was 1148.2641 in the GLR model and 1039.1441 in the GWR model, for a difference of 109.12. In addition, Moran’s I index, which measures spatial dependence, was 0.1765 for the GWR model compared to 0.3018 for the GLR model. These results show that the GWR model significantly improved on the GLR model, with a better goodness-of-fit. It also reduced the spatial dependence of residuals.
Therefore, the GWR model was more powerful and effective in interpreting relationships between landslide-related factors and landslide occurrence. In particular, the character and strength of the relationships identified by the GWR model showed great spatial non-stationarity and scale-dependence. However, the GWR model still presents some disadvantages. The lack of independence among local estimates may lead to invalid inferences for the local estimates. In addition, when the number of samples is quite small, the estimated local coefficients can be ineffective or invalid (Su et al., 2012).
References

Brunsdon C., Fotheringham A.S., Charlton M. (1998) Geographically weighted regression-modelling spatial non-stationarity. Statistician 47(3): 431-443

Bui D.T., Lofman O., Revhaug I., Dick O. (2011) Landslide susceptibility analysis in the Hoa Binh province of Vietnam using statistical index and logistic regression. Natural Hazards 59(3): 1413-1444

Charlton M., Fotheringham A.S. (2009) Geographically weighted regression: white paper. National Centre for Geocomputation, Ireland. http://gwr.nuim.ie/downloads/GWR_WhitePaper.pdf

Vos F., Rodriguez J., Below R., Guha-Sapir D. (2010) Annual Disaster Statistical Review 2009, The Numbers and Trend. Centre for Research on the Epidemiology of Disasters (CRED), Brussels, 38 -

Feuillet T., Coquin J., Mercier D., Cossart E., Decaulne A., Jónsson H.P. (2014) Focusing on the spatial non-stationarity of landslide predisposing factors in northern Iceland: Do paraglacial factors vary over space? Progress in Physical Geography 0309133314528944

Fotheringham A.S., Brunsdon C., Charlton M.E. (2002) Geographically Weighted Regression: The Analysis of Spatially Varying Relationships. Wiley, New York

Hosmer D.W., Lemeshow S. (2000) Applied Logistic Regression. Wiley, New York

Im O. (2009) A Study on Characteristics of Landslides and Restoration Works in Damaged Area by Heavy Rainfall-focused on Hongcheon Area in Gangwondo, Master’s thesis. Kangwon University, Korea (in Korean with English abstract), 77 -

Kim K., Gyoo G., Yoon H., Lee C., Won M., Lee B., Woo C., Kim S., Lee M. (2012) 2011 Forest Disaster White Paper. Korea Forest Research Institute, Seoul, 29 -

Kleinbaum D.G., Klein M. (2002) Logistic Regression, a Self-learning Text, 2nd ed. Springer, New York

Liangjie W., Kazuhide S., Shuji M. (2012) GIS-based landslide susceptibility mapping using logistic regression method with LiDAR data in natural slopes. Disaster Advances 5(4): 258-263

Lloyd C.D. (2010) Local models for spatial analysis. CRC Press

Menard S. (2002) Applied Logistic Regression Analysis, SAGE University Paper, 111 -

Schuster R., Turner A.K., Schuster R.L. (1996) Landslides: Investigation and Mitigation, Special Report 247. National Academy Press, Washington, DC: 12-36

Youssef A.M., Pradhan B., Pourghasemi H.R., Abdullahi S. (2015) Landslide susceptibility assessment at Wadi Jawrah Basin, Jizan region, Saudi Arabia using two bivariate models in GIS. Geosciences Journal: 1-21

Citing 'A Comparative Analysis of Landslide Susceptibility Assessment by Using Global and Spatial Regression Methods in Inje Area, Korea'

@article{GCRHBD_2015_v33n6_579,
  title={A Comparative Analysis of Landslide Susceptibility Assessment by Using Global and Spatial Regression Methods in Inje Area, Korea},
  author={Park, Soyoung and Kim, Jinsoo},
  journal={Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography},
  publisher={Korean Society of Surveying, Geodesy, Photogrammetry and Cartography},
  volume={33},
  number={6},
  pages={579--588},
  year={2015},
  month={Dec},
  url={http://dx.doi.org/10.7848/ksgpc.2015.33.6.579},
  DOI={10.7848/ksgpc.2015.33.6.579}
}