Datasets used to train and test prediction model to predict scores in terms of SDG 7 realization

Published: 5 March 2025| Version 1 | DOI: 10.17632/6c8fm7s4y2.1
Contributors:
,
,

Description

The datasets used in this research work refer to the aims of Sustainable Development Goal 7. These datasets were used to train and test machine learning model based on artificial neural network and other machine learning regression models for solving the problem of prediction scores in terms of SDG 7 aims realization. Train dataset was created based on data from 2013 to 2021 and includes 261 samples. Test dataset includes 29 samples. Sources data from 2013 to 2022 are available in 10 XLSX and CSV files. Train and test datasets are available in XLSX and CSV files. Detailed description of data is available in PDF file.

Files

Steps to reproduce

First data related to SDG 7 realization was collected from https://ec.europa.eu/eurostat/data/database in form of XLSX files for each criterion. Then we created XLSX files for each year 2013-2022 with decision matrices for 29 countries placed in rows and 10 criteria placed in columns. Then we evaluated each decision matrix using multi-criteria TOPSIS method and we received score value for each country in each year. The we created train dataset including ten features and score (denoted by Pref), which is target variable for each country for each year from 2013 to 2021 and analogous test dataset for 2022.

Institutions

Uniwersytet Szczecinski

Categories

Machine Learning, Decision Support System, Regression Model, Multiple-Criteria Decision Analysis

Funding

National Science Center

2022/45/B/HS4/02960

Licence