Bangla_MorseCode_Images_Dataset

Published: 15 January 2025| Version 1 | DOI: 10.17632/zrh83nfrzv.1
Contributor:
Shuvo Kumar Basak Shuvo

Description

The Bangla_MorseCode_Images_Dataset is a collection of images designed for training machine learning models, specifically aimed at recognizing and processing Morse code for the Bengali language. This dataset contains visual representations of the Morse code symbols corresponding to Bengali letters and numbers. Each image in the dataset depicts the Morse code of a Bengali character, with its visual output stored as high-resolution (256x256 pixel) images. These images are generated by encoding each Bengali character's Morse code and displaying it in a sequential format. The dataset contains 4000 images per character, which are stored in separate subfolders for each individual Bengali character. This dataset serves as a useful resource for applications involving character recognition, language processing, and image classification tasks related to Bengali text and Morse code. The high volume of images (4000 per character) ensures a robust dataset for model training, enabling the model to generalize well for tasks like Bengali Morse code translation, recognition, and automated decoding. Note for Researchers Using the dataset This dataset was created by Shuvo Kumar Basak. If you use this dataset for your research or academic purposes, please ensure to cite this dataset appropriately. If you have published your research using this dataset, please share a link to your paper. Good Luck.

Files

Steps to reproduce

Text Conversion to Morse Code: Each Bengali character (letter or digit) is mapped to its corresponding Morse code symbol. This mapping is stored in the Bengali Morse Code Dictionary. For example, the letter "অ" corresponds to the Morse code . -.-. and the letter "১" corresponds to the Morse code .----. Image Generation: A blank 256x256 pixel image is created for each character. The Morse code for the character is rendered in the center of the image using a specific font (size 64 by default, adjustable). The Morse code is wrapped to fit within the image dimensions, and the text is rendered in black on a white background. Folder Organization: A separate folder is created for each Bengali character (e.g., "অ", "আ", "১", "২"). Each folder contains 4000 images, representing 4000 different instances of the character’s Morse code. These images are sequentially numbered (e.g., 1.jpg, 2.jpg, ..., 4000.jpg).

Institutions

Jahangirnagar University

Categories

Artificial Intelligence, Cybersecurity, Machine Learning

Licence