Handwritten Devanagari Characters - Vowels and Numerals (38,750 Isolated images + 9,200 Isolated images)

Published: 1 February 2023 | Version 4 | DOI: 10.17632/pxrnvp4yy8.4
Contributors:
Duddela Sai Prashanth,

Description

This article presents handwritten isolated characters of the Devanagari script. The script contains 10 numerals, 13 vowels, and 33 consonants; this dataset covers the 23 numeral and vowel characters. For dataset_1, 2,400 handwritten samples were collected for each numeral and 1,400 for each vowel. The collected samples were digitized and pre-processed, and images with noise were removed during pre-processing. The final dataset_1 contains 38,750 images: 2,250 samples per numeral and 1,250 samples per vowel. For dataset_2, an additional 400 samples of each character were collected in a different environment for testing purposes, giving 9,200 images. The data is available as images and as comma-separated values (CSV), with labels attached. The dataset can be used for Optical Character Recognition and deep learning research. In India, Devanagari is the base script from which 120+ languages have evolved; hence this dataset can serve as a base for machine learning research in these languages.
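The totals quoted above follow directly from the per-class counts. The short Python sketch below only re-derives them as a sanity check. The function and variable names are illustrative and not part of the dataset, and the "removed" figure is simply the difference between the collected and final counts implied by the description, not a number the authors report.

```python
# Class counts stated in the description.
NUM_NUMERALS = 10
NUM_VOWELS = 13


def total_images(per_numeral: int, per_vowel: int) -> int:
    """Total images given the number of samples per numeral and per vowel."""
    return NUM_NUMERALS * per_numeral + NUM_VOWELS * per_vowel


# Samples collected before pre-processing (2,400 per numeral, 1,400 per vowel).
collected = total_images(2400, 1400)

# dataset_1 after noisy images were dropped (2,250 per numeral, 1,250 per vowel).
dataset_1 = total_images(2250, 1250)

# dataset_2: 400 test samples for every one of the 23 characters.
dataset_2 = total_images(400, 400)

# Assumption: the gap between collected and final counts is what was removed.
removed = collected - dataset_1

print(f"collected={collected}, dataset_1={dataset_1}, "
      f"dataset_2={dataset_2}, removed={removed}")
```

Checking the loaded data against these totals is a quick way to confirm that a download or extraction is complete.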

Institutions

Sri Chandrasekharendra Swaraswathi Viswa Maha Vidyalaya, Sahyadri College of Engineering and Management

Categories

Computer Vision, Optical Character Recognition, Machine Learning

Licence