COVID-19 Dataset

Name: COVID-19 Dataset
Creator: gungmayun kukuh
Published: 2022-03-21T18:58:06.688Z
Keywords: Machine Learning, Text Mining, Deep Learning

kukuh, gungmayun

doi:10.17632/w88vrhrfp7.1

COVID-19 Dataset

Published: 21 March 2022| Version 1 | DOI: 10.17632/w88vrhrfp7.1

Contributor:

gungmayun kukuh

Description

This is some collections of COVID-19 comment from Twitter, YouTube, Facebook, and Instagram in Indonesian language. This dataset has been pre-processing with various stages : 1. Cleansing 2. Case folding 3. Text normalization 4. Stopword removal, and 5. Stemming by Sastrawi There are two folders in the file, in the form of csv and json. Each of the datasets has been split into train and test data with an 80:20 ratio.

Files

Institutions

Universitas Udayana Fakultas Teknik

COVID-19 Dataset

Description

Files

Institutions

Categories

Licence