Dataset of Computer Science Course Queries from Students: Categorized and Scored According to Bloom's Taxonomy
Description
This dataset consists of three .csv files: (1) Data_Structure.csv, (2) Introduction_to_Computers_and_Research.csv, and (3) Irrelevant_Questions.csv. Each file contains questions asked by students of Independent University, Bangladesh (IUB) in Computer Science courses during the Summer 2023 semester. The questions were manually pre-processed and categorized by course and topic. Each question was also scored according to the six levels of Bloom's taxonomy: remember (5 points), understand (10 points), apply (15 points), analyze (20 points), evaluate (20 points), and create (30 points). File 1 contains the scored and categorized questions from the "Data Structure" course. File 2 contains the scored and categorized questions from the "Introduction to Computers and Research" course. File 3 contains irrelevant questions, which were asked by students in those two courses but do not relate to either course.
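The scoring scheme described above can be written as a simple lookup table. The following is a minimal sketch, not the authors' tooling: the name `BLOOM_SCORES`, the helper `bloom_score`, and the lowercase level labels are assumptions made for illustration, since the description does not state how levels are labelled inside the .csv files. The point values are taken directly from the description.

```python
# Points per Bloom's taxonomy level, as listed in the dataset description.
# Note that "analyze" and "evaluate" carry the same score (20) in that list.
BLOOM_SCORES = {
    "remember": 5,
    "understand": 10,
    "apply": 15,
    "analyze": 20,
    "evaluate": 20,
    "create": 30,
}


def bloom_score(level: str) -> int:
    """Return the score for a Bloom's level name (hypothetical helper).

    The lookup ignores case and surrounding whitespace; an unknown
    level raises KeyError.
    """
    return BLOOM_SCORES[level.strip().lower()]
```

For example, `bloom_score("Apply")` looks up the "apply" entry in the table.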
Steps to reproduce
The data was gathered using an online application, which students at Independent University, Bangladesh (IUB) used in the classroom during the Summer 2023 semester. The application was trialed in the "Data Structure" and "Introduction to Computers and Research" courses. The questions were pre-processed by correcting grammar, punctuation, spelling, and capitalization. After pre-processing, similar questions were removed using Levenshtein distance with an 80% similarity threshold. Each remaining question was then evaluated against Bloom's taxonomy and assigned a score. Finally, the questions were manually categorized by topic and course.
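The near-duplicate removal step can be sketched as follows. This is an illustration under stated assumptions, not the authors' actual procedure: the description does not say how similarity was normalized or which duplicate was kept. Here similarity is assumed to be 1 minus the edit distance divided by the longer string's length, comparison is case-insensitive, and the first occurrence of each group of similar questions is retained. All function names are hypothetical.

```python
from typing import List


def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Assumed normalization: 1 - distance / length of the longer string."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def drop_near_duplicates(questions: List[str], threshold: float = 0.8) -> List[str]:
    """Keep a question only if it is below the threshold against every kept one."""
    kept: List[str] = []
    for question in questions:
        if all(similarity(question.lower(), k.lower()) < threshold for k in kept):
            kept.append(question)
    return kept
```

Each question is compared against every question already kept, so the cost grows quadratically with the number of questions. That is acceptable for a classroom-sized collection; a larger one would call for a dedicated library or a blocking strategy.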