Imbalanced Data-based Prediction and Risk Factor Analysis of Stroke
Description
In this study, a total of 4603 subjects who fulfilled the Inclusion and exclusion criteria were included all from NHANES. Of these, 362 individuals (7.86%) were diagnosed as stroke patients, while 4241 individuals (92.14%) were identified as non-stroke patients. The first column of the dataset is the outcome variable of stroke, and the other columns are the predictors, including gender, age, Race,Marital status, alcohol, smoke, sleep disorder, Health Insurance, General health condition, depression, sleep time, diabetes, hypertension, high cholesterol, Minutes sedentary activity, Coronary Heart Disease, Body Mass Index, Waist Circumference, Systolic blood pressure, Diastolic blood pressure, High-density lipoprotein, Triglyceride, Low-density lipoprotein, Fasting Glucose, Glycohemoglobin, energy, protein, Carbohydrate, Dietary fiber, Total fat, Total saturated fatty acids, Total monounsaturated fatty acids, Total polyunsaturated fatty acids, Potassium, Sodium.