Bengali Speech Recognition - Bangla Real Number Audio Dataset

Published: 28 March 2018| Version 6 | DOI: 10.17632/t33byr6cpt.6
Md Mahadi Hasan Nahid,


======================================================== This dataset is developed by Md Ashraful Islam, SUST CSE'2010 Md Mahadi Hasan Nahid, SUST CSE'2010 ( Md Saiful Islam, ( Department of Computer Science and Engineering (CSE) Shahjalal University of Science and Technology (SUST), Special Thanks To Mohammad Al-Amin, SUST CSE'2011 Md Mazharul Islam Midhat, SUST CSE'2010 Md Mahedi Hasan Nayem, SUST CSE'2010 Avro Key Board, Omicron lab, ========================================================= It is a Audio Text Parallel Corpus. This dataset contains Some Recording Audio of Bangla Real Number and Its Coresponding Text. Specially designed for Bangla Speech recognition. There are five speakers(alamin, ashraful, midhat, nahid, nayem) in this dataset. Vocabulary Contains only bangla real numbers (shunno-ekshoto, hazar, loksho, koti, doshomic etc.) Total Number of Audio file : 175 (35 from each speaker) Age range of the speakers : 20-23 Total Size: 32.4MB ========================================================== TextData.txt file contains the text of the audio set. Each line starts with <s> tag and ends with </s> tag. The file name is added after each line using parenthesis, in this audio file you will get its recorder Audio Data. This text data actually generated using Avro (Free Opensourse Writting Software). ========================================================== For Full Data: please contact



Shahjalal University of Science and Technology


Natural Language Processing, Speech Recognition, Corpus Linguistics, Bengali Language