A Multimodal Bangla Text–Audio Dataset for Sentiment Analysis
Description
• Bangla, a language spoken by more than 230 million people worldwide, is significantly underrepresented in speech and sentiment analysis research compared to high-resource languages. This dataset addresses that gap.
• Researchers and developers working on low-resource language technologies, such as sentiment analysis, speech recognition, and multimodal learning frameworks, should find this resource valuable.
• The dataset supports the development and evaluation of a wide range of applications, including sentiment-aware speech recognition, speech-based emotion detection, emotionally expressive text-to-speech systems, multimodal sentiment classification, and speaker-independent recognition models.
• Its modular structure supports ongoing research expansion, enabling contributors to add new regional vocabularies, dialectal variations, or additional sentiment classes over time.
• The dataset is precisely balanced: 4,000 audio recordings produced by four native speakers (two male and two female), with 500 samples per sentiment category. The sentences capture natural, everyday use of Bangla, spanning a wide range of topics that include events, emotions, personal experiences, and general statements.
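The class balance described above (an equal number of recordings per sentiment category) can be sanity-checked programmatically before training. A minimal sketch, assuming metadata is available as (filename, speaker, sentiment) tuples; the file names, speaker IDs, and label names below are hypothetical placeholders, not the dataset's actual values:

```python
from collections import Counter

# Hypothetical metadata rows: (audio_filename, speaker_id, sentiment_label).
# Adapt the loading step to the dataset's real metadata file.
metadata = [
    ("rec_0001.wav", "F1", "positive"),
    ("rec_0002.wav", "M1", "negative"),
    ("rec_0003.wav", "F2", "positive"),
    ("rec_0004.wav", "M2", "negative"),
]

def class_counts(rows):
    """Count recordings per sentiment label."""
    return Counter(label for _, _, label in rows)

def is_balanced(rows):
    """True when every sentiment label has the same number of recordings."""
    counts = class_counts(rows)
    return len(set(counts.values())) == 1

print(class_counts(metadata))  # toy sample: 2 recordings per label
print(is_balanced(metadata))   # True
```

Running the same check over the full metadata should report 500 recordings per sentiment category, matching the balance stated above.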