Published: 21 March 2022 | Version 1 | DOI: 10.17632/w88vrhrfp7.1
This dataset is a collection of Indonesian-language comments about COVID-19 gathered from Twitter, YouTube, Facebook, and Instagram. The data has been pre-processed in five stages: (1) cleansing, (2) case folding, (3) text normalization, (4) stopword removal, and (5) stemming with Sastrawi. The archive contains two folders, one holding the data as CSV and one as JSON. Each dataset has been split into training and test data at an 80:20 ratio.
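For readers who want to apply comparable processing to new comments, the five stages and the 80:20 split might be sketched in Python roughly as follows. This is an illustration under stated assumptions, not the authors' actual pipeline: the `SLANG` and `STOPWORDS` tables, the regular expressions, the function names, and the random seed are all hypothetical. The stemming step is left as an optional callable so that the sketch runs without third-party packages.

```python
import random
import re

# Hypothetical, tiny lookup tables for illustration only; the dataset's
# actual normalization dictionary and stopword list are not documented here.
SLANG = {"yg": "yang", "gak": "tidak", "dgn": "dengan"}
STOPWORDS = {"yang", "dan", "di", "ke", "dengan"}


def preprocess(text, stem=None):
    """Apply the five stages to one comment and return the cleaned string."""
    # 1. Cleansing: drop URLs, mentions, and hashtags, then any non-letters.
    text = re.sub(r"https?://\S+|[@#]\w+", " ", text)
    text = re.sub(r"[^A-Za-z\s]", " ", text)
    # 2. Case folding: lowercase everything, then split on whitespace.
    tokens = text.lower().split()
    # 3. Text normalization: map informal spellings to standard forms.
    tokens = [SLANG.get(token, token) for token in tokens]
    # 4. Stopword removal.
    tokens = [token for token in tokens if token not in STOPWORDS]
    # 5. Stemming: optional callable. With the Sastrawi package installed,
    #    StemmerFactory().create_stemmer().stem could be passed in here.
    if stem is not None:
        tokens = [stem(token) for token in tokens]
    return " ".join(tokens)


def split_train_test(rows, test_ratio=0.2, seed=42):
    """Shuffle rows reproducibly and split them into (train, test)."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    cut = int(round(len(rows) * (1 - test_ratio)))
    return rows[:cut], rows[cut:]


if __name__ == "__main__":
    sample = "Vaksin COVID-19 yg aman dgn dosis 2! https://t.co/x @user #covid"
    print(preprocess(sample))
    train, test = split_train_test(range(10))
    print(len(train), len(test))
```

Passing the stemmer as a parameter keeps the first four stages testable on their own; whether the published files were shuffled before splitting is not stated, so the shuffle here is an assumption as well.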
Universitas Udayana Fakultas Teknik
Machine Learning, Text Mining, Deep Learning