Development of Annotated Bangla Speech Corpora

Published: 8 September 2018 | Version 1 | DOI: 10.17632/c79z6gz9rm.1
Firoj Alam,
S. M. Murtoza Habib,
Dil Afroza Sultana,
Mumit Khan


This dataset contains Bangla read-speech corpora that can be used for phonetic research and for developing speech applications. Several criteria were maintained during corpus development, including consideration of phonetic and prosodic features during text selection. A specification was also followed in the recording phase, because speaking style is a vital part of speech applications. We also concentrated on proper text normalization, pronunciation, alignment, and labeling. The labeling was done manually. In the present work, sentence-level labeling (annotation) was completed according to a specification so that it can be extended in the future.


Steps to reproduce

Please see the readme for instructions on using the data. The dataset contains text and XML files, which are aligned with the WAV files in the wav directory. Relevant link: For further contact, please drop us an email at
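
As a rough illustration of working with aligned annotation and audio files, the Python sketch below pairs each WAV file with an annotation file. It rests on assumptions that are not confirmed by this page: that an annotation file shares its base name with the corresponding WAV file, and that the annotations sit in their own directory. The function name `pair_recordings` and the directory arguments are hypothetical; the readme remains the authoritative description of the layout.

```python
from pathlib import Path


def pair_recordings(wav_dir, annotation_dir, annotation_suffix=".xml"):
    """Pair WAV files with annotation files that share the same base name.

    ASSUMPTION: matching base names (e.g. 0001.wav <-> 0001.xml) is a guess
    about the corpus layout; check the readme for the real convention.
    """
    # Index annotation files by base name (file name without extension).
    annotations = {
        path.stem: path
        for path in Path(annotation_dir).glob(f"*{annotation_suffix}")
    }

    pairs = []      # (wav_path, annotation_path) tuples
    unmatched = []  # WAV files with no annotation of the same base name
    for wav_path in sorted(Path(wav_dir).glob("*.wav")):
        annotation_path = annotations.get(wav_path.stem)
        if annotation_path is None:
            unmatched.append(wav_path)
        else:
            pairs.append((wav_path, annotation_path))
    return pairs, unmatched
```

Calling `pair_recordings("wav", "xml")` would return the matched pairs together with any WAV files left without an annotation, which gives a quick check that a download is complete before further processing. The `"xml"` directory name here is likewise only a placeholder.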


Speech Processing, Speech Recognition, Audio Synthesis, Applied Computer Science