Using machine learning and electronic health record (EHR) data for the early prediction of Alzheimer's Disease and Related Dementias - 24/07/25

Doi : 10.1016/j.tjpad.2025.100169 
Sonia Akter a, #, Zhandi Liu b, #, Eduardo J. Simoes c, Praveen Rao a, b,
a Institute for Data Science and Informatics, University of Missouri, USA 
b Department of Electrical Engineering and Computer Science, University of Missouri, USA 
c Department of Biomedical Informatics, Biostatics and Medical Epidemiology, University of Missouri, USA 

Corresponding author at: Department of Electrical Engineering & Computer Science, University of Missouri, Columbia, USA.Department of Electrical Engineering & Computer ScienceUniversity of MissouriColumbiaUSA

Benvenuto su EM|consulte, il riferimento dei professionisti della salute.
Articolo gratuito.

Si connetta per beneficiarne

Highlights

Developed ML models for early ADRD prediction using de-identified EHR data.
GBT achieved AUC-ROC scores of 0.809–0.833 over 1–5-year prediction windows.
SHAP identified depression, age, heart disease, sleep apnea, and headache as risks.
Sleep apnea and headache were highlighted as novel risk factors for ADRD.
Early detection helps delay ADRD progression and improve patient outcomes.

Il testo completo di questo articolo è disponibile in PDF.

Abstract

Background

Over 6 million patients in the United States are affected by Alzheimer's Disease and Related Dementias (ADRD). Early detection of ADRD can significantly improve patient outcomes through timely treatment.

Objective

To develop and validate machine learning (ML) models for early ADRD diagnosis and prediction using de-identified EHR data from the University of Missouri (MU) Healthcare.

Design

Retrospective case-control study.

Setting

The study used de-identified EHR data provided by the MU NextGen Biomedical Informatics, modeled with the PCORnet Common Data Model (CDM).

Participants

An initial cohort of 380,269 patients aged 40 or older with at least two healthcare encounters was narrowed to a final dataset of 4,012 ADRD cases and 119,723 controls.

Methods

Six ML classifier models: Gradient-Boosted Trees (GBT), Light Gradient-Boosting Machine (LightGBM), Random Forest (RF), eXtreme Gradient-Boosting (XGBoost), Logistic Regression (LR), and Adaptive Boosting (AdaBoost) were evaluated using Area Under the Receiver Operating Characteristic Curve (AUC-ROC), accuracy, sensitivity, specificity, and F1 score. SHAP (SHapley Additive exPlanations) analysis was applied to interpret predictions.

Results

The GBT model achieved the best AUC-ROC scores of 0.809–0.833 across 1- to 5-year prediction windows. SHAP analysis identified depressive disorder, age groups 80–90 yrs and 70–80 yrs, heart disease, anxiety, and the novel risk factors of sleep apnea, and headache.

Conclusion

This study underscores the potential of ML models for leveraging EHR data to enable early ADRD prediction, supporting timely interventions, and improving patient outcomes. By identifying both established and novel risk factors, these findings offer new opportunities for personalized screening and management strategies, advancing both clinical and informatics science.

Il testo completo di questo articolo è disponibile in PDF.

Keywords : Alzheimer's disease, Dementias, Machine learning (ML), Electronic health record data, Early prediction


Mappa


© 2025  Pubblicato da Elsevier Masson SAS.
Aggiungere alla mia biblioteca Togliere dalla mia biblioteca Stampare
Esportazione

    Citazioni Export

  • File

  • Contenuto

Vol 12 - N° 7

Articolo 100169- agosto 2025 Ritorno al numero
Articolo precedente Articolo precedente
  • Estimation of the value-based price of a blood test for Alzheimer’s disease pathology in primary and specialty care in the U.S.
  • Soeren Mattke, Jiahe Chen, Mark Hanson, Kim G. Johnson, Cara Leahy, David A. Merrill, Victoria Shada, Jorge G. Ruiz
| Articolo seguente Articolo seguente
  • Comparative efficacy of cognitive training modalities in cognitive impairment: A systematic review and network meta-analysis
  • Li-bing Liang, Shan Wang, Kun-peng Li, Cai-qin Wu

Benvenuto su EM|consulte, il riferimento dei professionisti della salute.

@@150455@@ Voir plus

Il mio account


Dichiarazione CNIL

EM-CONSULTE.COM è registrato presso la CNIL, dichiarazione n. 1286925.

Ai sensi della legge n. 78-17 del 6 gennaio 1978 sull'informatica, sui file e sulle libertà, Lei puo' esercitare i diritti di opposizione (art.26 della legge), di accesso (art.34 a 38 Legge), e di rettifica (art.36 della legge) per i dati che La riguardano. Lei puo' cosi chiedere che siano rettificati, compeltati, chiariti, aggiornati o cancellati i suoi dati personali inesati, incompleti, equivoci, obsoleti o la cui raccolta o di uso o di conservazione sono vietati.
Le informazioni relative ai visitatori del nostro sito, compresa la loro identità, sono confidenziali.
Il responsabile del sito si impegna sull'onore a rispettare le condizioni legali di confidenzialità applicabili in Francia e a non divulgare tali informazioni a terzi.


Tutto il contenuto di questo sito: Copyright © 2026 Elsevier, i suoi licenziatari e contributori. Tutti i diritti sono riservati. Inclusi diritti per estrazione di testo e di dati, addestramento dell’intelligenza artificiale, e tecnologie simili. Per tutto il contenuto ‘open access’ sono applicati i termini della licenza Creative Commons.