test
Description
Gujarati is the formalized communication language in the state of Gujarat and united territory of Dadra and Nagar Haveli and Div and Daman in India predominantly spoken by the Gujarati. Gujarati language contained a wealthy set of characters that includes vowels, consonants, digits, various signs. This dataset contains 75,000 grayscale isolated handwritten character images with the size of 28 X 28 pixels. This dataset contains sample images for 34 consonants, 12 vowels, 12 vowel signs and 5 various signs. This dataset could be used for the Gujarati Handwritten Character recognition in the field of Natural Language Processing (NLP) and Deep Learning .
Files
Steps to reproduce
Characters were collected on blank paper from different people. scanned all physical paper with the scanner at a resolution of 1024 pixels. With the help of tri-level segmentation, isolate each handwritten character into a respective image in .png format with a size of 28 X 28 pixels.