Recorded and predicted occurrence of Eumycetozoa in Poland from Central and Eastern European data
Description
This dataset contains supplementary tables and maps supporting a study on recorded and predicted occurrence of slime moulds (Eumycetozoa) in Poland. All files are machine‑readable and derived from a harmonised Darwin Core occurrence resource for Central and Eastern Europe combined with gridded climatic predictors. Supplementary Material 1.1 – full Polish checklist Species‑level checklist of Eumycetozoa recorded from Poland. For each taxon (order, genus, species, scientificName), the table includes semicolon‑separated habitat and substrate summaries and a collated list of Polish references. It represents the full, annotated national checklist underlying the shortened version in the main text. Supplementary Material 1.2 – candidate species table List of 365 candidate species that occur in Central and Eastern Europe but are not yet recorded from Poland. For each taxon, the table provides taxonomic placement, countries of occurrence, aggregated habitat and substrate descriptors, collated bibliography and, where available, species distribution model (SDM) metrics, Polish suitability indicators and probability class, together with top Polish regions. Supplementary Material 1.3 – model performance metrics Evaluation results for MaxEnt‑type SDMs fitted to candidate species with sufficient data. For each modelled taxon, the table reports sample size, number of cross‑validation folds and AUC statistics, together with attempted Boyce indices. It allows users to assess the strength and uncertainty of individual models. Supplementary Material 1.4 – top Polish regions per species Region‑level summary indicating, for each modelled candidate species, up to three Polish voivodeships with the largest proportion of grid cells classified as climatically suitable. Voivodeships are listed by their Polish names and can be used to prioritise areas for targeted surveys and validation. Supplementary Material 1.5 – map collection (Supplementary Figures 1–101) Set of 101 raster maps showing predicted potential distributions in Poland for modelled candidate species. Each figure displays continuous climatic suitability and the subset of grid cells exceeding a 10th‑percentile training‑presence threshold. Supplementary Material 1.6 – bibliographic list Alphabetically ordered list of all bibliographic sources used in the Polish checklist (Supplementary Material 1.1). It contains the full citations in Harvard style corresponding to the condensed reference strings in the checklist table. Supplementary Material 1.7 – raw R code Raw R code used to model the potential distribution of Eumycetozoa in Poland.
Files
Steps to reproduce
- Obtain the harmonised Darwin Core Eumycetozoa dataset for Central and Eastern Europe and load it into R, ensuring correct encodings, trimming of strings, numeric coercion and coordinate validity checks. - Create the Polish subset by filtering to country = "Poland", extract unique combinations of order, genus, species and scientificName, and aggregate habitat, substrate and reference fields per species to generate the full checklist (Supplementary Material 1.1). - Build a regional species index and compare it with the Polish checklist to define candidate taxa. For these candidates, aggregate countries of occurrence, habitat and substrate descriptors and bibliographic references, and count unique occurrences; then fit MaxEnt‑type SDMs for species with sufficient data and summarise model metrics and Polish suitability indicators (Supplementary Material 1.2). - From the fitted SDMs, extract cross‑validated AUC (and attempted Boyce) statistics together with sample sizes and fold numbers, compile them into a single table and export as model performance metrics (Supplementary Material 1.3). - Project final SDMs onto the climatic grid covering Poland, apply the 10th‑percentile training‑presence threshold, overlay binary suitability maps with voivodeship polygons and identify, for each species, up to three regions with the largest proportion of suitable area (Supplementary Material 1.4). - Use the continuous suitability outputs for each modelled species, cropped and masked to the Polish boundary, together with the species‑specific thresholds, to produce publication‑ready raster maps of predicted potential distributions (Supplementary Material 1.5).
Institutions
- Bialystok University of TechnologyPodlasie, Bialystok