CAIR-CVD-2025: An Extensive Cardiovascular Disease Risk Assessment Dataset from Bangladesh
Description
This dataset comprises 1,529 patient samples collected from Jamalpur Medical College Hospital, Jamalpur, Bangladesh, from January 20, 2024, to January 1, 2025. The data were gathered following ethical guidelines, ensuring patient confidentiality and informed consent. The dataset provides a comprehensive collection of demographic, anthropometric, clinical, biochemical, and lifestyle parameters essential for assessing cardiovascular disease (CVD) risk and overall patient health. The dataset includes a wide range of variables critical for CVD risk estimation, including basic demographic information, anthropometric measurements, clinical values, biochemical markers, and lifestyle factors. These variables are crucial for identifying risk factors, understanding disease progression, and developing preventive health strategies. Clinical Parameters Included: Sex: Male or Female. Age: Patient’s age (in years). Weight (kg): Patient’s weight in kilograms. Height (m): Patient’s height in meters. BMI: Body Mass Index, calculated from weight and height. Abdominal Circumference (cm): Measurement of abdominal girth. BP: Blood pressure readings. Total Cholesterol: Total cholesterol levels in the blood. HDL: High-density lipoprotein levels. Fasting Blood Sugar: Blood glucose levels after fasting. Smoking Status: Indicates whether the patient is a smoker. Diabetes Status: Indicates whether the patient has diabetes. Physical Activity Level: Level of physical activity. Family History of CVD: Indicates if there is a family history of cardiovascular diseases. CVD Risk Level: Classification of the patient’s CVD risk level. Height (cm): Patient’s height in centimeters. Waist-to-Height Ratio: Ratio of waist circumference to height. Systolic BP: Systolic blood pressure. Diastolic BP: Diastolic blood pressure. Blood Pressure Category: Classification of blood pressure levels. Estimated LDL: Estimated low-density lipoprotein levels. CVD Risk Score: Numerical score representing the patient’s CVD risk. Dataset Structure Format: CSV Rows: 1,529 (individual patient records) Columns: 22 (including demographic, clinical, and lifestyle characteristics) N.B. The data collection process was supported by the Cognitive AI & Informatics Research Lab (CAIR Lab), ensuring rigorous quality assessment and validation of the dataset.