Sentiment Analysis of Multilingual Dataset of Bahraini Dialects, Arabic, and English
Published: 15 February 2023 | Version 1 | DOI: 10.17632/5rhw2srzjj.1
Contributors:
Thuraya Omran, Baraa Sharif, Crina Grosan, Yongmin Li
Description
This dataset was built in two cascading translation stages: machine translation followed by manual translation. First, English Amazon product reviews were translated into Modern Standard Arabic (MSA) using Google Translate. Qualified native speakers then manually translated the resulting MSA reviews into Bahraini dialects, using custom-built forms. The resulting parallel dataset of English, MSA, and Bahraini dialects is called English_Modern Standard Arabic_Bahraini Dialects product reviews for sentiment analysis, “E_MSA_BDs-PR-SA”. The dataset is balanced, comprising 2,500 positive and 2,500 negative reviews.
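As a rough illustration of how a user might sanity-check a parallel, balanced dataset like the one described above, the sketch below counts sentiment labels and flags rows where any of the three parallel texts is missing. The field names (`english`, `msa`, `bahraini`, `label`) and the helper functions are hypothetical and are not taken from the published files, whose layout is not described here. The sample rows are placeholders, not real data.

```python
from collections import Counter

# Hypothetical field names; the published files may use different ones.
LANG_FIELDS = ("english", "msa", "bahraini")


def summarize(records):
    """Count sentiment labels and list indices of rows missing a parallel text."""
    label_counts = Counter(r["label"] for r in records)
    incomplete = [
        i
        for i, r in enumerate(records)
        if any(not r.get(field, "").strip() for field in LANG_FIELDS)
    ]
    return label_counts, incomplete


def is_balanced(label_counts):
    """Return True when every label occurs the same number of times."""
    return len(set(label_counts.values())) <= 1


# Placeholder rows for illustration only (not taken from the dataset).
toy = [
    {"english": "Great product", "msa": "<MSA text>",
     "bahraini": "<Bahraini text>", "label": "positive"},
    {"english": "Broke after a day", "msa": "<MSA text>",
     "bahraini": "", "label": "negative"},
]

counts, incomplete = summarize(toy)
print(dict(counts), incomplete, is_balanced(counts))
```

Applied to the full dataset, a check along these lines would be expected to report 2,500 positive and 2,500 negative reviews, given the balance stated in the description.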
Institutions
Brunel University
Categories
Dialect, Bahrain, Textual Analysis, Sentiment Analysis