Bangla Social media comments Dataset (BanglaMedia)
Published: 4 September 2025| Version 1 | DOI: 10.17632/xyxb5kryx3.1
Contributors:
Rayhan Rafin, Mohammad Sohaib Islam ShiblyDescription
This Dataset contains 7,725 comments collected from public YouTube comments. Collected data was labelled into 10 topic class (others, abusive, political, religious, international, education, food, sports, technology, threat) and 4 sentiment class (positive, negative, neutral, hate . The dataset were annotated by native Bengali speakers. The dataset supports multiple downstream tasks such as topic classification, sentiment analysis, and the development of real-world automatic censorship models.
Files
Institutions
- Daffodil International University
Categories
Natural Language Processing, Machine Learning