Bengali YouTube News Opinion Data for Temporal Sentiment Analysis

Published: 5 October 2023| Version 1 | DOI: 10.17632/3c3j3bkxvn.1
Contributors:
Lomat Haider Chowdhury,
,

Description

The dataset presents the news articles published in a renowned Bengali YouTube news channel along with the public comments, replies, and other corresponding information. There are 7,62,678 samples of data with 15 features. The features include video URL, title of the news, likes in the video, video views, publishing date, hashtags, video description, comments with corresponding likes, and replies with likes. To ensure the privacy of the commentators, their names have been encoded.

Files

Steps to reproduce

The data was collected using NodeJS and Puppeteer framework in both sequential and keyword-based approaches. Initially collected data was in JSON format. Later, the data format was updated to Excel after some preprocessing.

Institutions

Stamford University Bangladesh, United International University

Categories

Public Opinion, Natural Language Processing, Bengali Language, Temporal Analysis, Sentiment Analysis

Licence