BANSpEmo: A Bangla Language Emotional Speech Recognition Dataset
BANSpEmo is the third audio dataset for speech emotion recognition (SER) in Bangla, a low-resource language. It consists of 792 utterance recordings covering six basic emotional states, spoken over two sets of sentences with six sentences in each set. The emotional states were explained to the speakers beforehand, so the utterances were recorded in a more realistic way than by simply reading the sentences aloud. The six emotional states are Disgust (বিতৃষ্ণা), Happy (খুশি), Sad (দুঃখজনক), Surprised (বিস্মিত), Anger (রাগ), and Fear (ভয়). The corpus includes voice recordings from 22 non-professional speakers, 11 male and 11 female.
Center for Research Innovation and Transformation (CRIT), Green University of Bangladesh