Spatiotemporal Dynamics and Driving Mechanisms of Water Erosion in the Shiyang River Basin, Northwest China (2001–2020)

Published: 14 May 2025| Version 1 | DOI: 10.17632/gp4n7xwwsw.1
Contributor:
leyao pan

Description

This dataset was developed to investigate the spatiotemporal patterns and driving factors of water erosion in the Shiyang River Basin from 2001 to 2020. Water erosion intensity in this region is influenced not only by natural factors such as precipitation and topography, but also significantly by land use change and vegetation dynamics. Moreover, these driving factors may interact to amplify erosion risks. The dataset includes the following components: (1)Simulated Water Erosion Data: Annual raster layers of potential soil erosion, transport capacity, and net water erosion rate, generated using an integrated RUSLE and TLSD model. These data capture both the potential for soil detachment and the actual sediment transport across the basin landscape. In addition, multi-year spatial trend data and inter-annual change maps are provided to illustrate erosion dynamics over time. (2)Erosion Classification and Transition Data: Net water erosion intensity was categorized into seven classes. The spatial variation in erosion severity and the area transition matrix across different years were compiled to assess temporal changes and spatial dynamics in erosion patterns. (3)Driving Factor Analysis (Random Forest): Output from the Random Forest model quantifies the relative importance of various natural and anthropogenic drivers of erosion. Summary tables and maps illustrate erosion responses under different environmental gradients, including topography, LUCC, and NDVI. (4)Supplementary Data: Raster datasets for all RUSLE input factors (rainfall erosivity, soil erodibility, slope and slope length, cover management, and support practice), NDVI, topsoil texture, and LUCC data are included. The dataset also provides statistics on net soil erosion rate, sediment deposition, and NDVI values across different slope classes and under various land use change scenarios. Data Collection and Processing: Climate and topographic data were obtained from national and international geospatial databases. Slope and slope length were derived from DEM data, and rainfall data were used to calculate rainfall erosivity. NDVI and land use/land cover (LUCC) data were extracted from MODIS and Landsat imagery and reclassified based on regional characteristics. Soil erosion was simulated using a calibrated RUSLE-TLSD model. Model inputs include rainfall erosivity, soil erodibility, topographic factors, vegetation cover, and land management practices. Sediment yield observations were used for model validation. Random forest analysis was conducted in R using the “randomForest” and “rfPermute” packages. The model was trained with 500 trees to evaluate the importance of environmental variables in driving net water erosion. All raster datasets were resampled to a uniform 90 m resolution for spatial analysis.

Files

Steps to reproduce

Data Acquisition and Reproducibility Statement The dataset was developed using a combination of remote sensing data, geospatial analysis, and process-based modeling. Climatic and topographic inputs were obtained from national meteorological records and high-resolution DEMs. NDVI and land use/land cover data were derived from MODIS and Landsat imagery and preprocessed through classification and resampling based on regional conditions. Soil erosion modeling was conducted using a calibrated RUSLE-TLSD model framework, incorporating rainfall erosivity, soil erodibility, slope length and steepness, vegetation cover, and land management factors. Sediment yield data were used to validate model outputs. To assess erosion drivers, Random Forest analysis was performed in R using the“randomForest”and“rfPermute”packages. Environmental variables were standardized, and model performance was evaluated using 500 classification trees. All spatial layers were resampled to a 90 m resolution and projected to the WGS_1984_UTM_Zone_48N coordinate system. The entire workflow, including data preprocessing, modeling, and analysis, can be reproduced using standard geospatial tools (e.g., ArcGIS) and statistical software (e.g., R).

Institutions

  • Lanzhou University

Categories

Erosion, River Basin, Erosional Process, River Basin Management, Random Decision Forest

Licence