MADTRAS (Dataset for Aspect-based Sentiment Analysis of Movie Reviews in Tamil)

Published: 10 January 2025 | Version 1 | DOI: 10.17632/p59cfx4vx6.1
Contributors:
Arunmozhi Mourougappane

Description

The dataset is a curated collection of film reviews in Tamil, intended to advance NLP research in text classification, sentiment analysis, and aspect-based sentiment analysis. We invited users to review twenty-five films through a Google form and collected additional reviews from websites such as IMDb and YouTube. A review was included only if it mentioned at least one of the selected target aspects: cinematography, acting, screenplay, story, director, songs, background music, and editing. The dataset comprises about 1,390 reviews, tagged with positive and negative sentiment across these eight aspect categories.
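
The labelling scheme described above can be sketched in Python as follows. This is only an illustration: the aspect and polarity names come from the description, but the record layout (a mapping from aspect to polarity) and the helper names `validate_review` and `count_labels` are assumptions, not the dataset's actual file format.

```python
from collections import Counter

# The eight target aspects named in the description.
ASPECTS = (
    "cinematography", "acting", "screenplay", "story",
    "director", "songs", "background music", "editing",
)
POLARITIES = ("positive", "negative")


def validate_review(labels):
    """Check a hypothetical {aspect: polarity} mapping for one review."""
    # Reviews were collected only if they mention at least one target aspect.
    if not labels:
        raise ValueError("a review must mention at least one target aspect")
    for aspect, polarity in labels.items():
        if aspect not in ASPECTS:
            raise ValueError(f"unknown aspect: {aspect!r}")
        if polarity not in POLARITIES:
            raise ValueError(f"unknown polarity: {polarity!r}")
    return True


def count_labels(reviews):
    """Count (aspect, polarity) pairs over an iterable of label mappings."""
    counts = Counter()
    for labels in reviews:
        validate_review(labels)
        counts.update(labels.items())
    return counts
```

A tally such as `count_labels(...)` is one way to inspect how evenly positive and negative labels are spread across the eight aspects before training a classifier; the mapping would first need to be built from whatever columns the released files actually use.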

Institutions

Pondicherry University

Categories

Natural Language Processing, Deep Learning, Sentiment Analysis

Licence