A study of Data-Driven Forecasting of PISA Reading Literacy Performance of Türkiye: Artificial Intelligence Based Time Series Estimation. Yıldırım et al.

Published: 1 June 2026| Version 1 | DOI: 10.17632/yy9wwkg2ww.1
Contributors:
Ahmet YILDIRIM,

Description

This repository contains the datasets, preprocessing procedures, forecasting outputs, and Python scripts used in the study investigating Türkiye’s future PISA Reading Literacy performance using deep learning-based time series forecasting models. The study compares GRU, LSTM, ARIMA, and Prophet models using recursive multi-step forecasting and sequential variable inclusion strategies. The repository includes raw and processed datasets, missing data handling procedures, evaluation metric outputs (MAE, MSE, RMSE, DTW), forecasting results for the 2025, 2028, and 2031 PISA cycles, reproducible Python scripts, and supplementary methodological materials. Independent variables used in the forecasting framework include parental education (PARED), cultural possessions (CULTPOS), highest parental occupational status (HISEI), and sense of belonging at school (BELONG). Missing values were completed using interpolation and extrapolation techniques. All variables were standardized using Z-score normalization. The package was prepared to support transparency, reproducibility, and future research in educational data mining, artificial intelligence in education, and educational forecasting studies.

Files

Institutions

Categories

Artificial Intelligence, Educational Measurement, Deep Learning

Licence