Tabular dataset for AI-based vector-borne disease prediction
Description
This dataset gathers clinical information about patients diagnosed with malaria, dengue, yellow fever, and typhoid fever, as well as patients excluded from these diseases. The data was collected from the health districts of DO and DAFRA, located in the Hauts-Bassins region of Burkina Faso. The dataset includes 300 records. Among the consultations recorded, 150 are from DAFRA and 150 from DO. The data was preprocessed, and sensitive information such as name, surname, or place of birth was removed. The original form contained 115 questions, but only 109 were retained. The data includes two CSV files: the first, "data.csv," contains the data, and the second, "description.xls," contains the attribute descriptions. Some attributes are present in the description but not in the data, as they were removed due to lack of information.
Files
Steps to reproduce
To collect the data, authorization was requested from the Ministry of Health of Burkina Faso. The data collection was carried out using a Kobotoolbox form, provided to the doctors. This form was filled out in real-time during consultations in the DO district and based on archives for the DAFRA district. The form was printed on paper because the doctors found this method faster due to the large number of patients to be consulted. For data from the DAFRA district, recording was done in two steps: first during the initial consultation, and then upon the availability of laboratory results. The data collection lasted for three months, from September to November 2024.
Institutions
Categories
Funding
International Development Research Centre