Datasets used to train and test prediction model to predict scores in terms of SDG 7 realization for 2010-2020

Published: 6 March 2025| Version 1 | DOI: 10.17632/86n5bcpm46.1
Contributors:
,
,
,

Description

The datasets used in this research work refer to the aims of Sustainable Development Goal 7. These datasets were used to train and test machine learning model based on artificial neural network and other machine learning regression models for solving the problem of prediction scores in terms of SDG 7 aims realization. Train dataset was created based on data from 2010 to 2019 and includes 300 samples. Test dataset includes 30 samples. Sources data from 2010 to 2020 are available in 11 XLSX and CSV files. Train and test datasets are available in XLSX and CSV files. Detailed description of data is available in PDF file.

Files

Steps to reproduce

First data related to SDG 7 realization was collected from https://ec.europa.eu/eurostat/data/database in form of XLSX files for each criterion. Then we created XLSX files for each year 2010-2020 with decision matrices for 30 countries placed in rows and 11 criteria placed in columns. Then we evaluated each decision matrix using multi-criteria TOPSIS method and we received score value for each country in each year. The we created train dataset including ten features and score (denoted by Pref), which is target variable for each country for each year from 2010 to 2019 and analogous test dataset for 2020.

Institutions

Uniwersytet Szczecinski

Categories

Machine Learning, Decision Support System, Regression Model, Multiple-Criteria Decision Analysis

Funding

National Science Center

2022/45/B/HS4/02960

Licence