Clinically Validated CBC Dataset of 7,196 Samples for Hematological Disorder Analysis

Published: 24 November 2025| Version 1 | DOI: 10.17632/rnfzzy4wz6.1
Contributors:
Shahadath Hossen,
,
,
,
,
,

Description

This dataset contains 7,196 clinically validated complete blood count (CBC) samples collected from patients at Noakhali 3814, Bangladesh. The data include standard hematological parameters such as hemoglobin, red blood cells, white blood cells, platelets, lymphocyte percentage, monocyte percentage, hematocrit percentage, mean corpuscular volume, mean corpuscular hemoglobin, mean corpuscular hemoglobin concentration, and red cell distribution width percentage, along with patient age and gender. All samples have been anonymized to protect patient privacy. The dataset is intended for research on hematological disorders, machine learning modeling, and biomedical data analysis. It can be used for classification, prediction, and other computational studies in hematology. Ethical approval: The data collection and use were approved by the Ethics Committee of Noakhali Science and Technology University (Reference no. NSTU/SCI/EC/2025/420).

Files

Steps to reproduce

1. Download the CSV file from Mendeley Data. 2. Load the data into any statistical or machine learning software (Python, R, MATLAB, etc.). 3. Perform preprocessing as needed (e.g., handle missing values, normalization). 4. Use hematological parameters (RBC, WBC, Hb, HCT, Platelets, etc.) for classification, prediction, or other analyses. 5. Refer to the corresponding paper for model details and methodology.

Institutions

Noakhali Science and Technology University

Categories

Hematology, Medical Education, Disease, Machine Learning, Blood Disorder, Healthcare Research, Blood Analysis, Biomedical Research

Licence