Halal Tourism Tweets: Dataset for sentiment Analysis

Published: 9 February 2023| Version 1 | DOI: 10.17632/cdbhzm5twf.1
Mohammad Abdullah,


This dataset contains 1,587,808 Halal tourism related tweets from 2006 to 2021. User identification information are removed to maintain data privacy. The data was collected using the Twitter Academic API. The data is stored in as a RDS file (R Programing Language file), where each line represents a tweet. The RDS file provides information on the following fields: lang: language of the tweet text: the full content of the tweet created_at: the publication date of the tweet location: the country of the author of the tweet


Steps to reproduce

This dataset is generated by Twitter Academic API. Dataset is in raw form, data cleaning can be performed by the end user. Dataset is in RDS format which can be used in R or Python by using following codes. R code: df <- readRDS("Halal_toruism_rawdata.rds") View(df) Python code: import pyreadr import pandas as pd data = pyreadr.read_r('Halal_toruism_rawdata.rds') df= data[None] df


Universiti Sultan Zainal Abidin


Tourism, Sentiment Analysis