UFTBESD: University of Frontier Technology, Bangladesh-Bangla Emotional Speech Dataset
Description
UFTBESD (University of Frontier Technology, Bangladesh - Bangla Emotional Speech Dataset) is a Bangla language-based speech emotion recognition dataset developed to capture realistic emotional speech in everyday acoustic conditions. The dataset consists of 1400 speech-audio recordings collected from 100 native Bangla speakers aged between 19 and 60 years, with a balanced gender distribution (50 male and 50 female). Each participant uttered two selected Bangla sentences, with each sentence spoken once (single trial) in seven emotional states: angry, disgust, fear, happy, neutral, sad, and surprise. Thus, the dataset contains 2 sentences × 7 emotions × 100 speakers = 1400 audio clips. All recordings were collected using smartphone microphones in both indoor and outdoor environments. A significant portion of the data includes natural background noise, making the dataset suitable for real-world speech emotion recognition research. The audio files are stored in WAV format (44.1 kHz, 16-bit, mono). This release focuses on providing raw audio data, and detailed metadata will be added in future versions. UFTBESD is intended to support the development and evaluation of Bangla speech emotion recognition systems and can be used with common machine learning and deep learning architectures such as CNN, LSTM, BiLSTM, and transformer-based models.
Files
Institutions
- Bangabandhu Sheikh Mujibur Rahman Digital UniversityDhaka, Gazipur