Clinically Validated CBC Dataset of 7,196 Samples for Hematological Disorder Analysis
Description
This dataset contains 7,196 clinically validated complete blood count (CBC) samples collected from patients at Noakhali 3814, Bangladesh. Each sample records standard hematological parameters: hemoglobin, red blood cells, white blood cells, platelets, lymphocyte percentage, monocyte percentage, hematocrit percentage, mean corpuscular volume, mean corpuscular hemoglobin, mean corpuscular hemoglobin concentration, and red cell distribution width percentage, along with patient age and gender. All samples have been anonymized to protect patient privacy. The dataset is intended for research on hematological disorders, including machine learning classification and prediction and other computational or biomedical data analysis in hematology.
Ethical approval: The data collection and use were approved by the Ethics Committee of Noakhali Science and Technology University (Reference no. NSTU/SCI/EC/2025/420).
Steps to reproduce
1. Download the CSV file from Mendeley Data.
2. Load the data into any statistical or machine learning software (Python, R, MATLAB, etc.).
3. Perform preprocessing as needed (e.g., handle missing values, normalization).
4. Use hematological parameters (RBC, WBC, Hb, HCT, Platelets, etc.) for classification, prediction, or other analyses.
5. Refer to the corresponding paper for model details and methodology.
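As an illustration of steps 2 and 3, the following is a minimal Python sketch that uses only the standard library. It is not the method from the corresponding paper. The column names (`Hb`, `RBC`, `WBC`, `Age`, `Gender`) and the three toy rows are assumptions made up for demonstration and do not come from the dataset; check the actual CSV header before adapting it. Median imputation and z-score standardization are one common choice among many.

```python
import csv
import io
import math
import statistics

# Hypothetical column names; the real CSV header may differ.
FEATURES = ["Hb", "RBC", "WBC"]

# Made-up toy rows standing in for the downloaded file (NOT real dataset values).
TOY_CSV = (
    "Age,Gender,Hb,RBC,WBC\n"
    "34,F,12.1,4.3,6800\n"
    "51,M,,5.0,7400\n"
    "27,F,10.4,3.9,\n"
)


def load_rows(handle):
    """Read a CSV file handle into a list of dicts keyed by column name."""
    return list(csv.DictReader(handle))


def to_float(value):
    """Return value as a float, or None if it is blank, non-numeric, or NaN."""
    try:
        number = float(value)
    except (TypeError, ValueError):
        return None
    return None if math.isnan(number) else number


def preprocess(rows, feature_cols):
    """Median-impute missing values, then z-score each feature column.

    Returns new row dicts; the input rows are left unchanged.
    Assumes every feature column has at least one observed value.
    """
    cleaned = [dict(row) for row in rows]
    for col in feature_cols:
        values = [to_float(row.get(col)) for row in rows]
        observed = [v for v in values if v is not None]
        median = statistics.median(observed)
        filled = [median if v is None else v for v in values]
        mean = statistics.fmean(filled)
        # Guard against constant columns, whose standard deviation is zero.
        std = statistics.pstdev(filled) or 1.0
        for row, v in zip(cleaned, filled):
            row[col] = (v - mean) / std
    return cleaned


rows = load_rows(io.StringIO(TOY_CSV))
clean = preprocess(rows, FEATURES)
```

To run this on the downloaded file, replace `io.StringIO(TOY_CSV)` with `open(path, newline="")`, where `path` is wherever the CSV was saved. For model training, the imputation and scaling statistics should be computed on the training split only and then applied to the test split, to avoid leakage.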