Spatial datasets for benchmarking machine learning-based landslide susceptibility models

Published: 10 September 2024| Version 1 | DOI: 10.17632/vrtx3w6mjd.1
Contributor:
Guruh Samodra

Description

The spatial dataset consists of 743 landslide polygons, landslide centroid points, randomly non-landslide points, and 11 landslide-controlling factors. Landslide polygons were delineated through manual interpretation of high-resolution satellite imagery. The landslide-controlling factor data were extracted from topographic maps and Indonesia’s national digital elevation model (DEMNAS). The landslide-event dataset was mapped by comparing pre- and post-event (Tropical cyclone (TC) Cempaka, which occurred on 27–29 November 2017) high-resolution satellite imageries and conducting field surveys. The landslide polygons indicate areas with confirmed landslide occurrences, while the landslide-controlling factors data includes slope aspect, distance to river, distance to road, elevation, lithology, landuse, plan curvature, profile curvature, slope, stream power index, and terrain wetness index. The landslide polygons and points are stored in gpkg format, while the landslide controlling factors are stored in tif format. Files with xml and tfw extensions are text files used to store metadata and georeference of a tif raster file. All data can be opened using GIS software such as QGIS. The datasets can also be accessed and opened using R or Python using specified geospatial libraries such as SF and Terra.

Files

Steps to reproduce

GIS software such as QGIS can open the datasets. The datasets can also be accessed and opened using R or Python using specified geospatial libraries such as SF and Terra. The landslide dataset was compiled in a GIS format (.gpkg). The centroid point of landslides and random point of landslides were generated using the centroid tool and random points in QGIS. The landslide-controlling factors were generated from DEM, topographic maps, and geological maps. Slope aspect, elevation, plan curvature, slope curvature, stream power index, and terrain wetness index were derived from DEMNAS (https://tanahair.indonesia.go.id/demnas/#/) using SAGA GIS software. Raster of landuse and lithology were converted from a landuse polygon from a digital topographic map (1:25.000) and geological map (1:100.000) using the rasterize tool in QGIS. Distance to the river and distance to the road were generated from the river line and the road line from the digital topographic map (1:25.000) using the proximity buffer tool in SAGA GIS.

Institutions

Universitas Gadjah Mada

Categories

Artificial Intelligence, Earth-Surface Processes, Remote Sensing, Geomorphology, Geographic Information System, Machine Learning, Landslide, Environmental Modeling

Funding

Universitas Gadjah Mada

431/UN1.P1/KPT/HUKOR/2024

Licence