COVID-19 Dataset

Published: 21 March 2022 | Version 1 | DOI: 10.17632/w88vrhrfp7.1
Contributor:
Gung Mayun

Description

This is a collection of COVID-19 comments in Indonesian, gathered from Twitter, YouTube, Facebook, and Instagram. The dataset has been pre-processed in five stages: 1. Cleansing 2. Case folding 3. Text normalization 4. Stopword removal 5. Stemming with Sastrawi. The file contains two folders, one holding the data as CSV and one as JSON. Each dataset has been split into training and test data at an 80:20 ratio.
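The stages listed above, together with the 80:20 split, can be sketched roughly as follows. This is only an illustration under stated assumptions, not the contributor's actual pipeline: the cleansing rules, the `SLANG_MAP` normalization dictionary, the `STOPWORDS` list, the shuffle seed, and the function names are all hypothetical, since the description does not specify them. Stemming is left as an optional callable so that the sketch runs without third-party packages.

```python
import random
import re

# Hypothetical, tiny resources for illustration only. The dataset's real
# normalization dictionary and stopword list are not given in the description.
SLANG_MAP = {"gk": "tidak", "yg": "yang", "bgt": "banget"}
STOPWORDS = {"yang", "di", "dan", "itu", "ini"}


def cleanse(text):
    """Stage 1 (assumed rules): drop URLs, mentions, hashtags, non-letters."""
    text = re.sub(r"https?://\S+|[@#]\w+", " ", text)
    text = re.sub(r"[^A-Za-z\s]", " ", text)
    return re.sub(r"\s+", " ", text).strip()


def preprocess(text, stem=None):
    """Apply stages 1-4; stage 5 runs only if a stemming function is passed."""
    text = cleanse(text).lower()                          # 1. cleansing, 2. case folding
    tokens = [SLANG_MAP.get(t, t) for t in text.split()]  # 3. text normalization
    tokens = [t for t in tokens if t not in STOPWORDS]    # 4. stopword removal
    if stem is not None:                                  # 5. stemming (optional)
        tokens = stem(" ".join(tokens)).split()
    return " ".join(tokens)


def train_test_split(rows, test_ratio=0.2, seed=42):
    """Shuffle a copy of the rows and cut it at an 80:20 ratio by default."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    cut = round(len(rows) * (1 - test_ratio))
    return rows[:cut], rows[cut:]
```

To include the stemming stage, a Sastrawi stemmer could be passed in as `stem`, for example `StemmerFactory().create_stemmer().stem` from `Sastrawi.Stemmer.StemmerFactory`, assuming the PySastrawi package is installed.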

Institutions

Universitas Udayana Fakultas Teknik

Categories

Machine Learning, Text Mining, Deep Learning

License