Joint ensemble learning-based risk prediction of Alzheimer's disease among mild cognitive impairment patients

Published: 20 November 2024| Version 1 | DOI: 10.17632/fswb4z7y76.1
Contributor:
Tianyuan Guan

Description

This study used the UDS version 3.0 from the National Alzheimer’s Coordinating Center (NACC) (https://www.naccdata.org),This study used data from 27,418 patients from the UDS 3.0. We developed a risk prediction model for AD among MCI patients using NACC data. Data from MCI patients were obtained from the NACC. Importance ranking and the SHapley Additive exPlanations (SHAP) method for the Random Survival Forest (RSF) and Extreme Gradient Boosting (XGBoost) algorithms in ensemble learning were adopted to select the predictors, and hierarchical clustering analysis was used to mitigate multicollinearity. A total of 3674 subjects with MCI were included. Thirteen predictors were ultimately identified.This model can be easily understood by nonstatisticians. Furthermore, it can be used in primary medical institutions to screen high-risk populations for further examination and treatment, to reduce the incidence of AD, and to prolong the life of AD patients.

Files

Steps to reproduce

In this study, we developed a risk prediction model for AD among MCI patients using NACC data. We have adopted a relatively novel screening approach for predictive factors. Importance ranking and the SHapley Additive exPlanations method for the Random Survival Forest (RSF) and Extreme Gradient Boosting (XGBoost) algorithms in ensemble learning were adopted to select the predictors, and hierarchical clustering analysis was used to mitigate multicollinearity.

Institutions

Fourth Military Medical University

Categories

Public Health, Medical Statistics, Cognitive Neuroscience

Funding

National Natural Science Foundation of China

82073662

Licence