Improved landslide assessment using support vector machine with bagging, boosting, and stacking ensemble machine learning framework in a mountainous watershed, Japan (original) (raw)

Abstract

Heavy rainfall in mountainous terrain can trigger numerous landslides in hill slopes. These landslides can be deadly to the community living downslope with their fast pace, turning failures into catastrophic debris flows and avalanches. Active tectonics coupled with rugged topography in a complex geoenvironment multiplies this likelihood. The available hazard maps are usually helpful in mitigating disasters. However, fool-proof predicting landslide susceptibility identification remains a challenge in landslide discipline. Recently, ensemble machine learning (ML) techniques have proved the potential to provide a more accurate and efficient solution in spatial modeling. The main purposes of the current study are to examine and evaluate the predictive capability of support vector machine hybrid ensemble ML algorithms, i.e., the bagging, boosting, and stacking for modeling the catastrophic rainfall-induced landslide occurrences in the Northern parts of Kyushu Island, at the watershed scale in Japan. In this study, a landslide inventory map containing 265 landslide polygons was first interpreted from the aerial photographs and fieldwork after the September 2017 rainfall event. The raw data were randomly separated into two parts using a 70/30 sampling strategy for training and validating the landslide models. Then, 13 predisposing factors were prepared as predictors and dependent variable. The landslide susceptibility maps (LSM) were validated by the area under the receiver operating characteristic curve (AUC). The results of validation showed that the AUC values of the four models (SVM-Stacking, SVM, SVM-Bagging, and SVM-Boosting) varied from 0.74 to 0.91. The SVM-boosting model outperformed the other models, while SVM-stacking model has found to be the lowest performance. The outcome suggests that an ensemble ML model does not necessarily mean good performance. It is always preferable to select an appropriate model, such as the one proposed the hybrid novel ensemble SVM-boosting model, which could significantly improve the accuracies of LSM. Also, from Information Gain Ratio (IGR) we found that the rainfall factor mainly affects the results, that agrees with the analogy of present study.

Access this article

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime View plans

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

References

Alves A (2017) Stacking machine learning classifiers to identify Higgs bosons at the LHC. J Instrum. https://doi.org/10.1088/1748-0221/12/05/T05005
Article Google Scholar
Beasley TM, Zumbo BD (2003) Comparison of aligned Friedman rank and parametric methods for testing interactions in split-plot designs. Comput Stat Data Anal. https://doi.org/10.1016/S0167-9473(02)00147-0
Article Google Scholar
Bengio Y (2009) Learning deep architectures for AI. Found Trends Mach Learn 2(1):1–127. https://doi.org/10.1561/2200000006
Article Google Scholar
Brabb EE (1984) Innovative approaches to landslide hazard mapping. Proc. 4th Int. Symp. Landslides 1:307–324
Google Scholar
Breiman L (1996) Bagging predictors. Mach Learn. https://doi.org/10.1007/BF00058655
Google Scholar
Bui DT, Pradhan B, Lofman O, Revhaug I (2012) Landslide susceptibility assessment in Vietnam using support vector machines, decision tree, and Naïve Bayes models. https://doi.org/10.1155/2012/974638
Article Google Scholar
Camilo DC, Lombardo L, Mai PM et al (2017) Handling high predictor dimensionality in slope-unit-based landslide susceptibility models through LASSO-penalized generalized linear model. Environ Model Softw 97:145–156. https://doi.org/10.1016/j.envsoft.2017.08.003
Article Google Scholar
Chang KT, Dou J, Chang Y et al (2016) Spatial resolution effects of digital terrain models on landslide susceptibility analysis. ISPRS Int Arch Photogramm Remote Sens Spat Inf Sci XLI-B8:33–36. https://doi.org/10.5194/isprs-archives-XLI-B8-33-2016
Article Google Scholar
Chang K-T, Merghadi A, Yunus AP, et al (2019) Evaluating scale effects of topographic variables in landslide susceptibility models using GIS-based machine learning techniques. Sci Rep 9:12296. https://doi.org/10.1038/s41598-019-48773-2
Chen W, Panahi M, Pourghasemi HR (2017a) Performance evaluation of GIS-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (ANFIS) with genetic algorithm (GA), differential evolution (DE), and particle swarm optimization (PSO) for landslide spatial modelling. Catena 157:310–324. https://doi.org/10.1016/j.catena.2017.05.034
Article Google Scholar
Chen W, Xie X, Peng J et al (2017b) GIS-based landslide susceptibility modelling: a comparative assessment of kernel logistic regression, Naïve-Bayes tree, and alternating decision tree models. Geomat Nat Hazards Risk 8:950–973. https://doi.org/10.1080/19475705.2017.1289250
Article Google Scholar
Choubin B, Moradi E, Golshan M et al (2019) An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. Sci Total Environ 651:2087–2096. https://doi.org/10.1016/j.scitotenv.2018.10.064
Article Google Scholar
Dietterich TG (2000) Ensemble methods in machine learning, pp 1–15
Google Scholar
Dormann CF, Elith J, Bacher S et al (2012) Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography (Cop) 36:27–46. https://doi.org/10.1111/j.1600-0587.2012.07348.x
Article Google Scholar
Dou J, Bui DT, Yunus AP et al (2015a) Optimization of causative factors for landslide susceptibility evaluation using remote sensing and GIS data in parts of Niigata, Japan. PLoS One 10:e0133262. https://doi.org/10.1371/journal.pone.0133262
Article Google Scholar
Dou J, Chang KT, Chen S et al (2015b) Automatic case-based reasoning approach for landslide detection: integration of object-oriented image analysis and a genetic algorithm. Remote Sens 7:4318–4342. https://doi.org/10.3390/rs70404318
Article Google Scholar
Dou J, Li X, Yunus AP, Paudel U, Chang KT, Zhu Z, Pourghasemi HR (2015c) Automatic detection of sinkhole collapses at finer resolutions using a multi-component remote sensing approach. Nat Hazards 78:1021–1044. https://doi.org/10.1007/s11069-015-1756-0
Article Google Scholar
Dou J, Paudel U, Oguchi T et al (2015d) Shallow and deep-seated landslide differentiation using support vector machines: a case study of the Chuetsu Area, Japan. Terr Atmos Ocean Sci 26:227. https://doi.org/10.3319/TAO.2014.12.02.07(EOSI)
Article Google Scholar
Dou J, Yamagishi H, Pourghasemi HR et al (2015e) An integrated artificial neural network model for the landslide susceptibility assessment of Osado Island, Japan. Nat Hazards 78:1749–1776. https://doi.org/10.1007/s11069-015-1799-2
Article Google Scholar
Dou J, Yamagishi H, Xu Y et al (2017) Characteristics of the torrential rainfall-induced shallow landslides by Typhoon Bilis, in July 2006, using remote sensing and GIS. In: Yamagishi H, Bhandary NP (eds) GIS landslide. Springer, Japan, pp 221–230
Chapter Google Scholar
Dou J, Yamagishi H, Zhu Z et al (2018) TXT-tool 1.081-6.1 A comparative study of the binary logistic regression (BLR) and artificial neural network (ANN) models for GIS-based spatial predicting landslides at a regional scale. In: Landslide dynamics: ISDR-ICL landslide interactive teaching tools. Springer, Cham, pp 139–151
Chapter Google Scholar
Dou J, Yunus AP, Tien Bui D et al (2019a) Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan. Sci Total Environ 662:332–346. https://doi.org/10.1016/j.scitotenv.2019.01.221
Article Google Scholar
Dou J, Yunus AP, Tien Bui D et al (2019b) Evaluating GIS-based multiple statistical models and data mining for earthquake and rainfall-induced landslide susceptibility using the LiDAR DEM. Remote Sens 11:638. https://doi.org/10.3390/rs11060638
Article Google Scholar
Fagerland MW, Sandvik L (2009) The Wilcoxon-Mann-Whitney test under scrutiny. Stat Med. https://doi.org/10.1002/sim.3561
Article Google Scholar
Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc. https://doi.org/10.1080/01621459.1937.10503522
Article Google Scholar
Ganjisaffar Y, Caruana R, Lopes CV (2011) Bagging gradient-boosted trees for high precision, low variance ranking models. In: Proceedings of the 34th International ACM SIGIR Conference on Research and development in Information - SIGIR ‘11
Guzzetti F (1999) Landslide cartography, hazard assessment and risk evaluation: overview, limits and prospective, p 1–12
Healey SP, Cohen WB, Yang Z et al (2018) Mapping forest change using stacked generalization: an ensemble approach. Remote Sens Environ. https://doi.org/10.1016/j.rse.2017.09.029
Article Google Scholar
Hengl T, Mendes de Jesus J, Heuvelink GBM et al (2017) SoilGrids250m: global gridded soil information based on machine learning. PLoS One 12:e0169748. https://doi.org/10.1371/journal.pone.0169748
Article Google Scholar
Holec J, Bednarik M, Sabo M et al (2013) A small-scale landslide susceptibility assessment for the territory of Western Carpathians. Nat Hazards 69:1081–1107
Article Google Scholar
Hong H, Pradhan B, Xu C, Tien Bui D (2015) Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines. Catena 133:266–281. https://doi.org/10.1016/j.catena.2015.05.019
Article Google Scholar
Jenness J (2006) Topographic position index (tpi_jen.avx) extension for ArcView3.x,version 1.3a. In: Jenness Enterp
Kanda T, Takata Y, Kohyama K, et al (2018) New soil maps of Japan based on the comprehensive soil classification system of Japan - first approximation and its application to the world reference base for soil resources 2006. Jpn Agric Res Q
Kannan SS, Ramaraj N (2010) A novel hybrid feature selection via symmetrical uncertainty ranking based local memetic search algorithm. Knowl Based Syst. https://doi.org/10.1016/j.knosys.2010.03.016
Article Google Scholar
Khosravi K, Shahabi H, Thai B et al (2019) A comparative assessment of flood susceptibility modeling using multi-criteria decision-making analysis and machine learning methods. J Hydrol 573:311–323. https://doi.org/10.1016/j.jhydrol.2019.03.073
Article Google Scholar
Kokusaki Kogyo (2007) Aerial photo interpretation of earthquake damage from the 2007 Niigata Chuetsu–oki Earthquake consultant report, Kokusai Kogyo Co., Ltd, Japan. http://www.kkc.co.jp/social/disaster/200707_nigata/parts_tuika/gaikyozu.pdf. (in Japanese), Accessed 1 May 2008
Le LT, Nguyen H, Dou J, Zhou J (2019a) A comparative study of PSO-ANN, GA-ANN, ICA-ANN, and ABC-ANN in estimating the heating load of buildings’ energy efficiency for smart city planning. Appl Sci 9:2630. https://doi.org/10.3390/app9132630
Article Google Scholar
Le LT, Nguyen H, Jian Z, Dou J (2019b) Estimating the heating load of energy efficiency of buildings for smart city planning using a novel artificial intelligence technique PSO-XGBoost. Appl Sci
Lee S, Min K (2001) Statistical analysis of landslide susceptibility at Yongin, Korea. Environ Geol 40:1095–1113. https://doi.org/10.1007/s002540100310
Article Google Scholar
Merghadi A, Abderrahmane B, Tien Bui D (2018) Landslide susceptibility assessment at Mila Basin (Algeria): a comparative assessment of prediction capability of advanced machine learning methods. ISPRS Int J Geo-Inf 7. https://doi.org/10.3390/ijgi7070268
Article Google Scholar
Michel GP, Kobiyama M, Goerl RF (2014) Comparative analysis of SHALSTAB and SINMAP for landslide susceptibility mapping in the Cunha River basin, southern Brazil. J Soils Sediments 14:1266–1277. https://doi.org/10.1007/s11368-014-0886-4
Article Google Scholar
National Research Institute for Earth Science and Disaster Resilience (NIED). Available online: http://www.bosai.go.jp/mizu/dosha.html. (accessed on 10 Jan 2019)
Nguyen QK, Tien Bui D, Hoang ND, Trinh P, Nguyen VH, Yilmaz I (2017) A novel hybrid approach based on instance based learning classifier and rotation forest ensemble for spatial prediction of rainfall-induced shallow landslides using GIS. Sustainability 9(5):813. https://doi.org/10.3390/su9050813
Article Google Scholar
O’Brien RM (2007) A caution regarding rules of thumb for variance inflation factors. Qual Quant 41:673–690. https://doi.org/10.1007/s11135-006-9018-6
Article Google Scholar
Oliveira SC, Zêzere JL, Lajas S, Melo R (2017) Combination of statistical and physically based methods to assess shallow slide susceptibility at the basin scale. Nat Hazards Earth Syst Sci 17:1091–1109. https://doi.org/10.5194/nhess-17-1091-2017
Article Google Scholar
Peters A, Hothorn T, Lausen B (2002) ipred: improved predictors. R News
Pham BT, Pradhan B, Tien Bui D et al (2016) A comparative study of different machine learning methods for landslide susceptibility assessment: a case study of Uttarakhand area (India). Environ Model Softw 84:240–250. https://doi.org/10.1016/j.envsoft.2016.07.005
Article Google Scholar
Pham BT, Tien Bui D, Prakash I, Dholakia MB (2017) Hybrid integration of multilayer perceptron neural networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS. Catena. https://doi.org/10.1016/j.catena.2016.09.007
Article Google Scholar
Pham TB, Prakash I, Dou J et al (2018) A novel hybrid approach of landslide susceptibility modeling using rotation forest ensemble and different base classifiers. Geocarto Int 0:1–38. https://doi.org/10.1080/10106049.2018.1559885
Pourghasemi HR, Mohammady M, Pradhan B (2012) Landslide susceptibility mapping using index of entropy and conditional probability models in GIS: Safarood Basin, Iran. Catena 97:71–84. https://doi.org/10.1016/j.catena.2012.05.005
Article Google Scholar
Pourghasemi HR, Jirandeh AG, Biswajeet P et al (2013) Landslide susceptibility mapping using support vector machine and GIS at the Golestan Province, Iran. J Earth Syst Sci 122:349–369
Article Google Scholar
Pradhan B, Sezer EA, Gokceoglu C, Buchroithner MF (2010) Landslide susceptibility mapping by neuro-fuzzy approach in a landslide-prone area (Cameron Highlands, Malaysia). IEEE Trans Geosci Remote Sens 48:4164–4177. https://doi.org/10.1109/TGRS.2010.2050328
Article Google Scholar
Quinlan JR (1996) Bagging, boosting, and C4. 5. In: AAAI/IAAI, Vol. 1
Ray RL, Jacobs JM, de Alba P (2010) Impacts of unsaturated zone soil moisture and groundwater table on slope instability. J Geotech Geoenviron Eng. https://doi.org/10.1061/(ASCE)GT.1943-5606.0000357
Article Google Scholar
Saito H, Nakayama D, Matsuyama H (2010) Two types of rainfall conditions associated with shallow landslide initiation in Japan as revealed by normalized soil water Index. Sola. https://doi.org/10.2151/sola.2010-015
Article Google Scholar
Saito H, Uchiyama S, Hayakawa YS, Obanawa H (2018) Landslides triggered by an earthquake and heavy rainfalls at Aso volcano, Japan, detected by UAS and SfM-MVS photogrammetry. Prog Earth Planet Sci 5:1–10. https://doi.org/10.1186/s40645-018-0169-6
Article Google Scholar
Sekiguchi T, Sato HP (2006) Feature and distribution of landslides induced by the Mid Niigata Prefecture Earthquake in 2004, Japan. J Jpn Landslide Soc 43:142–154. https://doi.org/10.3313/jls.43.142
Article Google Scholar
Shaikhina T, Lowe D, Daga S et al (2017) Decision tree and random forest models for outcome prediction in antibody incompatible kidney transplantation. Biomed Signal Process Control:1–7. https://doi.org/10.1016/j.bspc.2017.01.012
Article Google Scholar
Shirzadi A, Soliamani K, Habibnejhad M et al (2018) Novel GIS based machine learning algorithms for shallow landslide susceptibility mapping. Sensors 18:3777. https://doi.org/10.3390/s18113777
Article Google Scholar
Tien Bui D, Pradhan B, Lofman O, Revhaug I (2012, 2012) Landslide susceptibility assessment in vietnam using support vector machines, decision tree, and nave bayes models. Math Probl Eng. https://doi.org/10.1155/2012/974638
Article Google Scholar
Tien Bui D, Ho T-C, Pradhan B et al (2016) GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks. Environ Earth Sci 75:1101. https://doi.org/10.1007/s12665-016-5919-4
Article Google Scholar
van Asch TWJ, Malet J-P, van Beek LPH, Amitrano D (2007) Techniques, issues and advances in numerical modelling of landslide hazard. Bull Soc Geol Fr 178:65–88. https://doi.org/10.2113/gssgfbull.178.2.65
Article Google Scholar
Vapnik VN (1998) Statistical learning theory (Adaptive and learning systems for signal processing, communications and control series). Wiley-Interscience
Wartman J, Dunham L, Tiwari B, Pradel D (2013). Landslides in eastern Honshu induced by the 2011 Tohoku earthquake. Bulletin of the Seismological Society of America, 103(2B), 1503–1521
Article Google Scholar
Wolpert D (1992) Stacked generalization (stacking). Neural Netw. https://doi.org/10.1016/S0893-6080(05)80023-1
Article Google Scholar
Yagi H, Sato G, Higaki D et al (2009) Distribution and characteristics of landslides induced by the Iwate-Miyagi Nairiku Earthquake in 2008 in Tohoku District, Northeast Japan. Landslides. https://doi.org/10.1007/s10346-009-0182-3
Article Google Scholar
Yalcin A (2008) GIS-based landslide susceptibility mapping using analytical hierarchy process and bivariate statistics in Ardesen (Turkey): comparisons of results and confirmations. Catena 72:1–12
Article Google Scholar
Yamagishi H, Iwahashi J (2007) Comparison between the two triggered landslides in Mid-Niigata, Japan by July 13 heavy rainfall and October 23 intensive earthquakes in 2004. Landslides 4:389–397. https://doi.org/10.1007/s10346-007-0093-0
Article Google Scholar
Yamagishi H, Yamazaki F (2018) Landslides by the 2018 Hokkaido Iburi-Tobu Earthquake on September 6. Landslides 15:2521–2524. https://doi.org/10.1007/s10346-018-1092-z
Article Google Scholar
Yan J, Han S (2018) Classifying imbalanced data sets by a novel RE-sample and cost-sensitive stacked generalization method. Math Probl Eng. https://doi.org/10.1155/2018/5036710
Google Scholar
Youssef AM, Pourghasemi HR, Pourtaghi ZS, Al-Katheeri MM (2015) Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia. Landslides. https://doi.org/10.1007/s10346-015-0614-1
Article Google Scholar
Yunus AP, Dou J, Song X, Avtar R (2019) Improved bathymetric mapping of coastal and lake environments using Sentinel-2 and Landsat-8 images. Sensors 19:2788. https://doi.org/10.3390/s19122788
Article Google Scholar
Zêzere JL, Pereira S, Melo R et al (2017) Science of the total environment mapping landslide susceptibility using data-driven methods. Sci Total Environ 589:250–267. https://doi.org/10.1016/j.scitotenv.2017.02.188
Article Google Scholar
Zhou Z-H (2014) Ensemble methods. In: Combining pattern classifiers. Wiley, Hoboken, pp 186–229
Google Scholar

Download references

Acknowledgments

We would like to thank GSI, JMA, and GSJ for providing the research data. We sincerely thank the anonymous reviewers and editor for improving the quality of our manuscript.

Funding

This study was financially supported by CAS Pioneer Hundred Talents Program, the National Key R&D Program of China (Grant No. 2018YFC1505401, Z. Han), the National Natural Science Foundation of China (Grant No. 41702310, Z. Han), and the Natural Science Foundation of Hunan (Grant No. 2018JJ3644, Z. Han). Also, this work was supported by the National Nature Science Foundation of China (Grant Nos. 51679127 and 51439003) and the National Key R&D Program of China (ID: 2018YFC1504803). This research is partially supported by Japan Society for the Promotion of Science.

Author information

Authors and Affiliations

Department of Civil and Environmental Engineering, Nagaoka University of Technology, 1603-1, Kami-Tomioka, Nagaoka, Niigata, 940-2188, Japan
Jie Dou
State Key Laboratory of Geo-hazard Prevention and Geo-environment Protection, Chengdu University of Technology, Chengdu, 610059, China
Ali P. Yunus
GIS Group, Department of Business and IT, University of South-Eastern Norway, Gullbringvegen 36, 3800, Bø i Telemark, Norway
Dieu Tien Bui
Research Laboratory of Sedimentary Environment, Mineral and Water resources of Eastern Algeria, Larbi Tebessi University, Tebessa, Algeria
Abdelaziz Merghadi
Department of Geography, Faculty of Natural Science, Jamia Millia Islamia, New Delhi, 110025, India
Mehebub Sahana
College of Water Sciences, Beijing Normal University, Xinjiekouwai Street 19, Beijing, 100875, China
Zhongfan Zhu
National Science and Technology Center for Disaster Reduction, No. 200, Sec. 3, Beixin Road, Xindian District, New Taipei City, Taiwan
Chi-Wen Chen
School of Civil Engineering, Central South University, Changsha, 410075, China
Zheng Han
State Key Laboratory of Geohazard Prevention and Geo-environment Protection, Chengdu University of Technology, Chengdu, 610059, China
Zheng Han
Institute of Research and Development, Duy Tan University, Da Nang, 550000, Vietnam
Binh Thai Pham

Authors

Jie Dou
Ali P. Yunus
Dieu Tien Bui
Abdelaziz Merghadi
Mehebub Sahana
Zhongfan Zhu
Chi-Wen Chen
Zheng Han
Binh Thai Pham

Corresponding authors

Correspondence toJie Dou, Zheng Han or Binh Thai Pham.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

About this article

Cite this article

Dou, J., Yunus, A.P., Bui, D.T. et al. Improved landslide assessment using support vector machine with bagging, boosting, and stacking ensemble machine learning framework in a mountainous watershed, Japan.Landslides 17, 641–658 (2020). https://doi.org/10.1007/s10346-019-01286-5

Download citation

Received: 11 February 2019
Accepted: 13 September 2019
Published: 25 October 2019
Version of record: 25 October 2019
Issue date: March 2020
DOI: https://doi.org/10.1007/s10346-019-01286-5

Keywords

Profiles

Jie Dou View author profile
Abdelaziz Merghadi View author profile
Mehebub Sahana View author profile