A Dataset of Inertial Measurement Units for Handwritten Punjabi Alphabets
Description
The dataset consists of Inertial Measurement Unit (IMU) data corresponding to the 41 characters of the Punjabi alphabet. The data was collected using an IMU 6050 sensor, which was attached to a marker held by the participant during the handwriting process. The IMU sensor records accelerations along three axes (X, Y, Z) and rotational velocities along the same three axes, providing a comprehensive view of the motion involved in writing each character. Data Collection Process: Twenty students participated in the data collection process for this study. Each student was tasked with writing all 41 Punjabi characters twice, once with an IMU sensor attached to the upper part of a marker and once with the sensor attached to the lower part. This dual sensor positioning allowed us to examine whether the location of the sensor affects the distinctiveness of the motion patterns recorded for each character. As a result, each student contributed 82 samples (41 characters × 2 sensor positions), creating a comprehensive dataset that captures a diverse array of motion patterns. The data collection experiment was conducted over four months, ensuring that a substantial volume of data was gathered. During each session, the students wrote the characters on a whiteboard, repeating each character 250 times. To accurately capture the timing and duration of each writing instance, students were provided with a button that they would press before beginning to write a character and release upon completion. The IMU sensor recorded data for all 250 instances of each character written by each student. This extensive data collection approach ensured that multiple repetitions of each character were captured, resulting in a rich dataset that is ideal for analysis and modeling purposes. Labeling The data is labeled according to the sequence of the Punjabi alphabet, with each character assigned a unique label. The first character is labeled '1,' the second character '2,' and so on, up to '41.' This labeling allows for easy identification and classification of the characters within the dataset. '1' represents first letter of the Punjabi letter "ਅ" (Ura), '2' represents the second letter "ਆ" (Aira) and so on. This dataset can be used to develop and train machine learning models, particularly those focused on pattern recognition and handwriting recognition. Researchers and developers can use this data to: Data Interpretation and Usage: Character Recognition: Train models to recognize and classify Punjabi characters based on the IMU data. Sensor Analysis: Study the effect of sensor positioning on the accuracy of character recognition and explore methods to compensate for these variations. Handwriting Dynamics: Analyze the dynamics of handwriting, such as speed, pressure, and motion trajectories, as recorded by the IMU sensor. This dataset provides a valuable resource for exploring the use of inertial sensors in handwriting recognition, specifically for the Punjabi alphabet.