Bangla Social media comments Dataset (BanglaMedia)

Published: 4 September 2025| Version 1 | DOI: 10.17632/xyxb5kryx3.1
Contributors:
,

Description

This Dataset contains 7,725 comments collected from public YouTube comments. Collected data was labelled into 10 topic class (others, abusive, political, religious, international, education, food, sports, technology, threat) and 4 sentiment class (positive, negative, neutral, hate . The dataset were annotated by native Bengali speakers. The dataset supports multiple downstream tasks such as topic classification, sentiment analysis, and the development of real-world automatic censorship models.

Files

Institutions

Daffodil International University

Categories

Natural Language Processing, Machine Learning

Licence