Bangla Social media comments Dataset (BanglaMedia)
Published: 4 September 2025 | Version 1 | DOI: 10.17632/xyxb5kryx3.1
Contributors:
Description
This dataset contains 7,725 public YouTube comments. The collected data was labelled into 10 topic classes (others, abusive, political, religious, international, education, food, sports, technology, threat) and 4 sentiment classes (positive, negative, neutral, hate). The dataset was annotated by native Bengali speakers. It supports multiple downstream tasks such as topic classification, sentiment analysis, and the development of real-world automatic censorship models.
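The label scheme described above can be written down as a small Python sketch that validates one annotated comment and maps its labels to integer ids for a classifier. The record keys "comment", "topic", and "sentiment" are hypothetical, since the file layout is not documented here, and the sketch assumes each comment carries exactly one topic label and one sentiment label. Only the label names are taken from the description.

```python
# Label sets as listed in the dataset description.
TOPIC_LABELS = (
    "others", "abusive", "political", "religious", "international",
    "education", "food", "sports", "technology", "threat",
)
SENTIMENT_LABELS = ("positive", "negative", "neutral", "hate")

# Map each label to an integer id for use as a classification target.
TOPIC_TO_ID = {label: i for i, label in enumerate(TOPIC_LABELS)}
SENTIMENT_TO_ID = {label: i for i, label in enumerate(SENTIMENT_LABELS)}


def encode_record(record):
    """Validate one annotated comment and return (text, topic_id, sentiment_id).

    The keys "comment", "topic", and "sentiment" are assumed names, not
    confirmed column names from the published files.
    """
    text = record["comment"].strip()
    topic = record["topic"].strip().lower()
    sentiment = record["sentiment"].strip().lower()
    if not text:
        raise ValueError("empty comment")
    if topic not in TOPIC_TO_ID:
        raise ValueError(f"unknown topic label: {topic!r}")
    if sentiment not in SENTIMENT_TO_ID:
        raise ValueError(f"unknown sentiment label: {sentiment!r}")
    return text, TOPIC_TO_ID[topic], SENTIMENT_TO_ID[sentiment]
```

Rejecting unknown labels early, rather than silently mapping them to "others", keeps annotation typos visible before any model is trained.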
Institutions
Daffodil International University
Categories
Natural Language Processing, Machine Learning