Grantha Data Set

Published: 16 May 2024 | Version 1 | DOI: 10.17632/j89cdpxwmw.1


The dataset contains handwritten samples of Grantha numbers and vowels, covering 44 distinct characters (10 numbers and 34 vowels). The samples were written on standard A4 sheets by a diverse group of 150 subjects across various age groups and scanned using a mobile phone. The scans were segmented and preprocessed, and the resulting images were stored in a publicly accessible repository. After obscured images and scribbles were removed, the final dataset comprises 5,852 digitized images: 1,330 Grantha numbers (133 samples for each of the 10 numbers) and 4,522 vowels (133 samples for each of the 34 vowels). The images are organized into their respective folders. The dataset is also available as 44 CSV (comma-separated values) files, one per character, each with the corresponding labels attached.
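The counts above can be checked against a downloaded copy with a short script. The following is a minimal Python sketch, not part of the dataset itself. It assumes a layout of one subfolder per character directly under a root directory, and it assumes the images use common extensions such as .png or .jpg; the function names and the folder layout are hypothetical and may need adjusting to the actual archive.

```python
from collections import Counter
from pathlib import Path

# Figures taken from the dataset description.
NUM_NUMBER_CLASSES = 10
NUM_VOWEL_CLASSES = 34
SAMPLES_PER_CLASS = 133


def expected_totals():
    """Return (number images, vowel images, all images) implied by the description."""
    numbers = NUM_NUMBER_CLASSES * SAMPLES_PER_CLASS
    vowels = NUM_VOWEL_CLASSES * SAMPLES_PER_CLASS
    return numbers, vowels, numbers + vowels


def count_images_per_class(root, extensions=(".png", ".jpg", ".jpeg")):
    """Count image files in each immediate subfolder of `root`.

    Assumption: each subfolder holds the samples of exactly one character.
    The real archive may nest folders differently.
    """
    counts = Counter()
    for class_dir in sorted(Path(root).iterdir()):
        if class_dir.is_dir():
            counts[class_dir.name] = sum(
                1
                for f in class_dir.iterdir()
                if f.is_file() and f.suffix.lower() in extensions
            )
    return counts
```

Calling expected_totals() returns (1330, 4522, 5852), which matches the figures in the description. Comparing each value returned by count_images_per_class against SAMPLES_PER_CLASS is one way to spot a folder with missing or extra samples.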



VIT-AP Campus


Computer Vision, Optical Character Recognition, Machine Learning, Deep Learning