COVID-19 Feature selection datasets
Published: 30 August 2022| Version 1 | DOI: 10.17632/wbjjz9bzxf.1
Contributors:
Gabriel Pena, , Description
Datasets used to perform feature selection routines on COVID-19 peaks of cases and deaths. To be published soon. Data is from Our World in Data and Institute of Health Metrics and Evaluation (mobility and mask use attributes only).
Files
Steps to reproduce
1. Apply a low pass filter on the reported data to eliminate noise (example: MA or Gaussian filters). 2. Choose real peaks and discard fake ones. 3. Normalize dependant attributes by country population. 4. Fill missing values by some imputation algorithm (example: kNN imputation). 5. Fix a number of output classes. (example: by bin number estimation). 6. Classify peaks (country + date) into the output classes. (example: by K-Means clustering)
Institutions
Universidad Nacional de Tres de Febrero
Categories
Epidemiology, COVID-19