Data for: Predicting the spatial distribution of invasive species richness using the combination of machine learning and geostatistical algorithms

Published: 13 June 2025| Version 2 | DOI: 10.17632/y2t3cgwnr2.2
Contributors:
,
,

Description

This dataset is the original data of the paper “Predicting the spatial distribution of invasive species richness using the combination of machine learning and geostatistical algorithms” .The dataset includes forest survey data, eco-climatic data, and topographic and geomorphological data. Among them, the forest survey data comes from the US Forest Inventory and Analysis (FIA) program, which collects information on the occurrence and distribution of invasive plants in all public and private forests in the United States. The ecological and climatic data includes 31 climatic variables, extracted from the WorldClim Global Climate Data (Version 2.1, https://www.worldclim.org/data/worldclim21.html). The topographic and geomorphological data includes three variables: elevation, soil carbon content, and aridity index. Among these, the elevation data comes from WorldClim Global Climate Data, soil carbon content data is extracted from the International Soil Reference and Information Centre (ISRIC-World Soil Information, https://www.isric.org/), and the aridity index is extracted from the Global Aridity Index (http://www.cgiar-csi.org/data). The definitions of each variable are as follows: prov_ID: Eco-region code; LAT/LON: Decimal latitude/longitude; Seasonability: SD of mean annual temperature; Alt: Altitude (m); PLT_TPA/Tpha: Trees per acre/hectare; RelDen: Successional development proportion (0–1); prpfor: Forested plot proportion; plt_drybio_adj/ha: Native tree biomass (English tons/acre/hectare); native_spp: Native tree species richness; PD_all: Phylogenetic diversity of tree species; PSV_all/var: Phylogenetic variability and variance; PSR_all/var: Phylogenetic richness and variance; PSE_all/PSC_all: Phylogenetic evenness/clustering; InvSpRichness: Invasive species richness; soilcarbon: 0–20 cm soil carbon content; aridity: Precipitation/evapotranspiration ratio; BIO1–BIO19: Standard climatic metrics (e.g., temperature, precipitation); vaprmin/max/range/avg: Water vapor pressure metrics (kPa); sradmin/max/range/avg: Solar radiation metrics (KJ/m²/day); windmin/max/range/avg: Wind speed metrics (m/s).

Files

Categories

Meteorology, Algorithms, Machine Learning, Spatial Auto-Correlation, Invasive Species Management

Licence