Multi-Dimensional Cognitive dataset for Educational Datamining

Published: 6 April 2026| Version 1 | DOI: 10.17632/2hkrbgv5kb.1
Contributors:
C P Pavan Kumar Hota,

Description

This dataset is developed to support research in Educational Data Mining (EDM), Learning Analytics, and Cognitive Intelligence Modeling. It provides a comprehensive representation of student learning by capturing multiple dimensions of cognition, behavior, emotion, and academic performance, enabling deeper insights into learning processes and outcomes. Each record in the dataset corresponds to an individual learner and reflects their interaction with the learning environment. The dataset integrates both intrinsic cognitive abilities and extrinsic behavioral patterns, making it suitable for advanced analytical and predictive modeling tasks. The dataset includes the following key dimensions: Cognitive Abilities: Features related to logical reasoning, problem-solving skills, memory retention, and analytical thinking, which influence a student’s ability to process and apply knowledge. Behavioral Patterns: Indicators such as study habits, time spent on tasks, consistency, engagement levels, and frequency of interaction with learning platforms. Emotional and Psychological Factors: Attributes capturing stress levels, motivation, attention span, and emotional stability, which play a crucial role in learning effectiveness. Academic Performance Metrics: Quantitative measures including scores, grades, assessment results, and performance trends across time. Learning Environment Factors: External conditions such as access to resources, type of learning (online/offline), and environmental influences affecting academic performance. The multi-dimensional nature of the dataset allows researchers to explore complex relationships between cognitive traits, behavior, and academic success. It supports analysis of how learning patterns evolve and how different factors contribute to performance variations among students. This dataset can be applied to a variety of machine learning and data mining tasks, including classification (e.g., predicting student performance), regression (e.g., estimating scores), clustering (e.g., grouping students based on learning behavior), anomaly detection (e.g., identifying at-risk learners), and recommendation systems (e.g., personalized learning pathways). Additionally, the dataset is well-suited for developing Explainable AI (XAI) models, enabling interpretation of how different cognitive and behavioral features influence predictions. This makes it valuable not only for technical research but also for practical educational decision-making. Overall, this dataset serves as a foundation for building intelligent educational systems that can enhance teaching strategies, support personalized learning, and improve student outcomes through data-driven insights.

Files

Steps to reproduce

This dataset was synthetically generated and curated to simulate multi-dimensional cognitive and behavioral characteristics of students for educational data mining research. Step 1: Define cognitive, behavioral, emotional, and academic dimensions based on educational psychology and learning analytics literature. Step 2: Generate structured student profiles by assigning realistic value ranges to features such as logical reasoning, memory retention, study time, engagement level, stress, and academic scores. Step 3: Introduce variability and correlations between features (e.g., higher study time positively influencing performance, higher stress negatively affecting scores) to reflect real-world learning scenarios. Step 4: Normalize and preprocess the dataset to ensure consistency, remove noise, and maintain balanced distributions across categories. Step 5: Validate the dataset by testing with basic machine learning models (classification and regression) to ensure meaningful patterns and predictive capability. This dataset is intended for simulation, research, and educational purposes and does not contain real personal data.

Categories

Artificial Intelligence, Education, Machine Learning, Academic Learning

Licence