ID-SMSA: Indonesian Stock Market Dataset for Sentiment Analysis

Published: 20 January 2025| Version 3 | DOI: 10.17632/tn4vzs8tdw.3
Contributors:
Jason Hartanto,
,

Description

The ID-SMSA Dataset is a collection of stock market-related Indonesian tweets that were collected via X (formerly known as Twitter). The dataset contains tweets in the Indonesian language, each labeled with sentiment categories: positive, negative, or neutral. A team of annotators completes the annotations using annotation guidelines that a clinical psychology specialist has reviewed. To facilitate future studies in sentiment analysis and financial market studies, other variables are also incorporated, such as the tweet's date and user engagement metrics (Quote Count, Reply Count, Retweet Count, and Favorite Count).

Files

Institutions

Bina Nusantara University

Categories

Bahasa Indonesia, Stock Exchange, Sentiment Analysis

Licence