Abstract
Background: Cognitive impairment, indicative of Alzheimer disease and other forms of dementia, significantly deteriorates the quality of life of older adult populations and imposes considerable burdens on families and health care systems worldwide. The early identification of individuals at risk for cognitive impairment through a convenient and rapid method is crucial for the timely implementation of interventions.
Objective: The objective of this study was to explore the application of machine learning (ML) to integrate blood biomarkers, life behaviors, and disease history to predict the decline in cognitive function.
Methods: This approach uses data from the Chinese Longitudinal Healthy Longevity Survey. A total of 2688 participants aged 65 years or older from the 2008‐2009, 2011‐2012, and 2014 Chinese Longitudinal Healthy Longevity Survey waves were included, with cognitive impairment defined as a Mini-Mental State Examination (MMSE) score below 18. The dataset was divided into a training set (n=1331), an internal test set (n=333), and a prospective validation set (n=1024). Participants with a baseline MMSE score of less than 18 were excluded from the cohort to ensure a more accurate assessment of cognitive function. We developed ML models that integrate demographic information, health behaviors, disease history, and blood biomarkers to predict cognitive function at the 3-year follow-up point, specifically identifying individuals who are at risk of experiencing significant declines in cognitive function by that time. Specifically, the models aimed to identify individuals who would experience a significant decline in their MMSE scores (less than 18) by the end of the follow-up period. The performance of these models was evaluated using metrics including accuracy, sensitivity, and the area under the receiver operating characteristic curve.
Results: All ML models outperformed the MMSE alone. The balanced random forest achieved the highest accuracy (88.5% in the internal test set and 88.7% in the prospective validation set), albeit with a lower sensitivity, while logistic regression recorded the highest sensitivity. SHAP (Shapley Additive Explanations) analysis identified instrumental activities of daily living, age, and baseline MMSE scores as the most influential predictors for cognitive impairment.
Conclusions: The incorporation of blood biomarkers, along with demographic, life behavior, and disease history into ML models offers a convenient, rapid, and accurate approach for the early identification of older adult individuals at risk of cognitive impairment. This method presents a valuable tool for health care professionals to facilitate timely interventions and underscores the importance of integrating diverse data types in predictive health models.
doi:10.2196/67437
Keywords
Introduction
Alzheimer disease (AD), the most prevalent form of dementia, is a progressive condition primarily characterized by memory loss [
]. It is estimated that approximately 50 million people worldwide are currently living with AD [ ]. This condition not only deteriorates the quality of life for older adult individuals but also imposes significant burdens on families and health care systems, especially as the global population continues to age [ - ]. Mild cognitive impairment (MCI) is recognized as an intermediary stage between normal aging and the more severe cognitive decline observed in dementia. Early detection of MCI through diagnostic tools, such as magnetic resonance imaging (MRI), can facilitate timely interventions aimed at reducing the risk of progression to AD [ ]. However, cognitive impairment, which encompasses a broader spectrum of decline, including MCI, is a critical concern that requires early identification. Predicting cognitive impairment, including MCI and other forms of decline, is essential for implementing preventive strategies and improving long-term health outcomes. Therefore, identifying individuals at potential risk for cognitive impairment is crucial.While MRI and cerebrospinal fluid biomarkers, such as amyloid β, are significant indicators of AD [
], we aim to develop an algorithm that allows for the precise identification of individuals at risk using a more convenient and rapid approach, without relying on complex analyses like MRI or cerebrospinal fluid biomarkers. Recent studies have established a correlation between blood biomarkers and various factors, including diseases such as hypertension, diabetes, heart disease, and cerebrovascular disease, and lifestyle behaviors such as smoking, physical activity, and living conditions, with cognitive impairment [ , ]. High-dimensional data analysis has proven effective in capturing features that are crucial for identifying health issues [ , ]. Hence, the incorporation of these relevant factors into a sophisticated model allows for a practical application both in clinical settings and at home, enabling doctors or families to use existing data to train and monitor patients or older adult individuals, thereby identifying high-risk groups for further treatment and intervention.In recent years, machine learning (ML), a branch of artificial intelligence, has been increasingly used for the prediction of disease outcome [
, ]. Unlike traditional methods that rely heavily on statistical significance, ML leverages algorithms to process existing factors and develop optimized models [ ]. While there has been significant research into understanding the pathogenesis and influencing factors of MCI through ML, most of these studies have primarily focused on imaging techniques such as MRI [ ]. Although MRI is a powerful diagnostic tool, its high cost and the inconvenience it poses limit its practicality for widespread use. In contrast, cognitive function assessments, such as the Mini-Mental State Examination (MMSE), offer a more accessible and feasible option for large-scale population screenings [ , ].In this study, we used follow-up data from the Chinese Longitudinal Healthy Longevity Survey (CLHLS) collected during the years 2008‐2009, 2011‐2012 and 2014, encompassing a total of 2688 participants. Cognitive impairment was classified based on the MMSE scores, with a cutoff point set at 18, to determine cognitive status after 3 years. We included routine blood indices, lifestyle behaviors, and disease history from the baseline data in the ML model for training, aiming to predict the occurrence of cognitive impairment.
Methods
Study Participants
The cohort for this study is selected from CLHLS, which is a comprehensive longitudinal survey co-orchestrated by Peking University and the China Aging Science Research Center [
, ]. The survey targets Chinese seniors aged 65 years or older and includes detailed information about their living conditions, socioeconomic status, and health profiles [ ]. The CLHLS initiates its baseline survey in 1998 and subsequently conducts follow-up surveys at regular intervals, with the relevant cohorts for this study being those from 2008‐2009, 2011‐2012, and 2014. The CLHLS study maintains ethical standards with approval from the Research Ethics Committee of Peking University (IRB00001052-13074), and all participants or their legal proxies provide written informed consent.The initial samples from these specified years consist of 8418, 6066, and 3441 participants, respectively. Baseline subjects were screened by MMSE, and cognitive impairment was defined as an MMSE score <18 points [
]. The participants without missing data on MMSE scores at baseline and follow-up while with biomarker data were included in this study. This results in a narrowed-down research sample, with 602 participants from the 2008‐2009 cohort, 1263 from 2011‐2012, and 1116 from 2014, leaving a final sample size of 2688 subjects. Among the remaining participants, those from the 2008‐2009 and 2011‐2012 waves (n=1664) were further divided into a training set (n=1331) and an internal test set (n=333). The 1024 participants from the 2014 wave were used as a prospective validation set.Predictors
Demographic predictors include age, gender, and BMI, calculated as weight in kilograms divided by height in meters squared (kg/m2). Data on life behaviors and disease history are collected from the CLHLS questionnaire. Life behaviors account for living status (living alone or not), current smoking and drinking habits, exercise practices, marital status, and overall activity ability and mental health. Activity ability is assessed through activities of daily living (ADL) and instrumental activities of daily living (IADL). ADL was assessed by 6 indicators, including bathing, dressing, toileting, indoor transfer, continence, and eating. If all 6 items can be completely self-care, it means that daily life activities can take care of themselves. If one or more items cannot be completely self-care, it means the daily life activities cannot take care of themselves completely (0=normal, 1=disability) [
, ]. IADL was assessed by 8 indicators, including visiting neighbors, going shopping, cooking a meal, washing clothing, walking continuously for 1 km at a time, lifting a weight of 5 kg, continuously crouching and standing up 3 times, and taking public transportation. If 8 items can be completed independently, such as the evaluation method of ADL, it means the instrumental daily life activities can be completed by themselves (0=normal or 1=disability) [ , , ].The development of the model in this study encompasses 3 categories of predictors derived from the baseline survey data: biomarkers, life behaviors, and disease history. The set of biomarkers comprises both routine blood examination indices and plasma biochemical examination indices. Routine blood indices include white blood cell count, red blood cell count, hemoglobin, erythrocyte hematocrit, erythrocyte mean corpuscular volume, erythrocyte mean corpuscular hemoglobin, erythrocyte mean corpuscular hemoglobin concentration, platelet count, plateletcrit, mean platelet volume, lymphocyte count, percentage of lymphocytes, and platelet distribution width. The plasma biochemical indices include high-density lipoprotein cholesterol (HDL), uric acid, plasma creatinine, glucose, triglyceride, total cholesterol (CHO), high-sensitivity c-reactive protein, malondialdehyde (MDA), and superoxide dismutase activity.
Mental health evaluation incorporates 7 questions, with 4 positively framed inquiries (regarding optimism, neatness, decision-making, and happiness relative to youth) and 3 negatively framed ones (concerning fear, loneliness, and feelings of decreased self-worth with age). Responses are scored on a scale, with higher scores correlating with poorer mental health. Disease history captures the presence or absence of hypertension, diabetes, heart disease, stroke, cancer, and arthritis. These multifaceted predictors collectively contribute to the ML model, providing a comprehensive profile for the assessment of cognitive impairment risk.
Data Preprocessing and Model Configuration
The data preprocessing phase involved addressing missing values and mitigating class imbalance to ensure a robust foundation for model training. Missing values in the dataset were imputed using the mean of the respective columns, which ensured completeness and preserved the statistical properties of the data [
]. In order to address the issue of class imbalance, the SMOTE (Synthetic Minority Over-Sampling Technique) was applied to the training set. SMOTE effectively generated synthetic samples for the minority class, enhancing the model’s ability to learn from the imbalanced data distribution [ ].The selection of ML models was driven by a desire to compare different algorithms’ performance and assess their robustness in dealing with imbalanced datasets. We chose 5 widely used algorithms: random forest (RF) [
], Extreme Gradient Boosting (XGBoost) [ ], logistic regression [ ], support vector machines (SVM) [ ], and balanced random forest (BRF) [ ]. These models were selected for their varied approaches to classification and their effectiveness in handling different types of data. RF and XGBoost are ensemble models that excel in handling high-dimensional data and capturing nonlinear relationships. Logistic regression and SVM are classic algorithms for binary classification, with SVM known for its ability to handle high-dimensional spaces effectively. The addition of BRF was made specifically to address the class imbalance issue in the RF model. The main difference between RF and BRF is how they handle class imbalance.Each model was configured as follows:
- XGBoost (XGBClassifier): a learning rate of 0.03 and a maximum depth of 4 were chosen for the XGBoost model, with class weights incorporated to address class imbalance.
- Logistic regression: configured with a maximum of 1000 iterations and class weights set to “balanced” to account for class imbalance.
- SVM: probability estimation was enabled and class weights were balanced to ensure fairness across experiments.
- BRF: class weights were set to “balanced” to address the class imbalance within the dataset.
Outcomes
The assessment of cognitive function among participants was conducted using the MMSE, administered at both baseline and during follow-up sessions. The MMSE encompasses evaluations across various cognitive domains including orientation, registration, attention and calculation, recall, and language abilities, with a maximum achievable score of 30 points. In this study, the primary outcome for the ML model is the determination of cognitive impairment, defined as an MMSE score of less than 18 points at follow-up.
Model Construction, Evaluation, and Interpretation
The construction of each model followed a systematic approach to ensure rigorous evaluation and validation. Initially, the CLHLS 2008‐2009 and CLHLS 2011‐2012 datasets were merged. From this combined dataset, 1331/1664 (80%) of the data was randomly allocated as the training set, while the remaining 333/1664 (20%) was reserved as an internal test set for model evaluation. And then, the CLHLS 2014 dataset was used as a prospective validation set, with the model training based exclusively on the merged data from CLHLS 2008‐2009 and CLHLS 2011‐2012. Standard performance metrics—including accuracy, sensitivity, specificity, and the area under the receiver operating characteristic curve—were calculated to assess model effectiveness. All statistical analyses are conducted using Python (version 3.8; Python Software Foundation)
SHAP (Shapley Additive Explanations) [
] is a method for interpreting the output of ML models by assigning a contribution value to each feature, allowing us to understand the impact of individual predictors on a model’s decision. In our study, SHAP was used to explain the predictive model for cognitive impairment in older adult individuals. SHAP values decompose the model’s prediction into individual contributions from each feature, making it possible to attribute the output to the various risk factors in a transparent and interpretable manner.Ethical Considerations
The CLHLS was approved by the Duke University Institutional Review Board (Pro00062871) and the Peking University Biomedical Ethics Committee (IRB00001052–13074). All participants provided written informed consent. The data used in this study were deidentified to protect participant privacy and confidentiality. No compensation wasprovided to participants.
Results
Participant Characteristics
summarizes the baseline characteristics of the 2688 participants included in this study, stratified into 3 subsets: the training set (n=1331), the internal test set (n=333), and the prospective validation set (n=1024). Overall, the mean age was 79.73 (SD 10.97) years and slightly over half of the participants were male (53.24%). The mean BMI was 21.81 (SD 3.8 kg/m2). Most participants lived with others (79.49%), did not smoke (77.79%), or drink (80.96%) at present, and had normal ADL (95.02%). Additionally, more than half (55.28%) showed abnormal mental health status, while 51.45% were married and living with a spouse. Several biochemical indicators were measured. The mean white blood cell count was 5.85 (SD 1.74×109)/L, red blood cell count was 4.64 (SD 1.68×1012)/L, and hemoglobin was 131.56 (SD 30.20) g/L. Participants had a mean HDL level of 1.31 (SD 0.38) mmol/L, and other biomarkers (eg, uric acid, creatinine, glucose, triglycerides, CHO, high-sensitivity c-reactive protein, MDA, and superoxide dismutase) were also assessed. Regarding disease history, hypertension was most prevalent (27.18%), followed by arthritis (11.31%), heart disease (8.01%), stroke (6.29%), and diabetes (2.75%). Cancer was reported by 0.42% of participants. Detailed distributions of these characteristics across the training, internal test, and prospective validation sets are presented in . A detailed flow diagram of participant inclusion and exclusion is provided in .
Predictors | Overall (n=2688) | Train set (n=1331) | Internal test set (n=333) | Prospective validation set (n=1024) | ||
Demographics and life behaviors | ||||||
Age, year, mean (SD) | 79.73 (10.97) | 80.24 (11.43) | 79.69 (11.6) | 79.73 (10.07) | ||
Gender, n (%) | ||||||
Male | 1431 (53.24) | 685 (51.47) | 173 (51.95) | 573 (55.96) | ||
Female | 1257 (46.76) | 646 (48.53) | 160 (48.05) | 451 (44.04) | ||
BMI, kg/m2, mean (SD) | 21.81 (3.8) | 21.51 (3.86) | 21.32 (3.67) | 22.31 (3.64) | ||
Living alone, n (%) | ||||||
Yes | 543 (20.51) | 267 (20.37) | 62 (18.9) | 214 (21.21) | ||
No | 2105 (79.49) | 1044 (79.63) | 266 (81.1) | 795 (78.79) | ||
Smoke at present, n (%) | ||||||
Yes | 593 (22.21) | 317 (23.94) | 72 (21.82) | 204 (20.08) | ||
No | 2077 (77.79) | 1007 (76.06) | 258 (78.18) | 812 (79.92) | ||
Drink at present, n (%) | ||||||
Yes | 509 (19.04) | 265 (19.98) | 57 (17.17) | 187 (18.42) | ||
No | 2164 (80.96) | 1061 (80.02) | 275 (82.83) | 828 (81.58) | ||
Exercise at present, n (%) | ||||||
Yes | 2134 (81.08) | 1056 (81.23) | 267 (81.65) | 811 (80.7) | ||
No | 498 (18.92) | 244 (18.77) | 60 (18.35) | 194 (19.3) | ||
ADL | , n (%)||||||
Normal | 2479 (95.02) | 1232 (93.97) | 313 (95.72) | 934 (96.19) | ||
Disability | 130 (4.98) | 79 (6.03) | 14 (4.28) | 37 (3.81) | ||
IADL | , n (%)||||||
Normal | 1593 (59.53) | 742 (55.87) | 198 (59.64) | 653 (64.27) | ||
Disability | 1083 (40.47) | 586 (44.13) | 134 (40.36) | 363 (35.73) | ||
Marital status, n (%) | ||||||
Married and living with spouse | 1370 (51.45) | 654 (49.32) | 165 (49.55) | 551 (54.88) | ||
Others | 1293 (48.55) | 672 (50.68) | 168 (20.45) | 453 (45.12) | ||
Mental health, n (%) | ||||||
Normal | 1122 (44.72) | 542 (43.78) | 139 (44.84) | 441 (45.89) | ||
Abnormal | 1387 (55.28) | 696 (56.22) | 171 (55.16) | 520 (54.11) | ||
Biomarkers, mean (SD) | ||||||
WBC | , 109/L5.85 (1.74) | 5.63 (1.76) | 5.64 (1.89) | 6.19 (1.61) | ||
RBC | , 1012/L4.64 (1.68) | 4.82 (2.11) | 4.98 (1.9) | 4.3 (0.56) | ||
HGB | , g/L131.56 (30.2) | 129.98 (26.56) | 132.23 (22.69) | 133.36 (36) | ||
HCT | , %36.31 (15.05) | 33.77 (18.01) | 34.25 (18.1) | 40.07 (7.23) | ||
MCV | , fL93.95 (10.24) | 93.51 (12.24) | 92.94 (9.76) | 94.81 (7.03) | ||
MCH | , pg29.46 (7.24) | 28.38 (5.25) | 28.16 (5.75) | 31.24 (9.22) | ||
MCHC | , g/L312.18 (42.25) | 304.34 (48.89) | 301.38 (54.41) | 325.58 (18.74) | ||
PLT | , 109/L207.32 (99.24) | 216.44 (109.67) | 213.75 (111.53) | 193.65 (77.22) | ||
PCT | , %0.27 (3.29) | 0.33 (4.42) | 0.2 (0.1) | 0.18 (0.06) | ||
MPV | , fL9.54 (5.14) | 9.43 (2.87) | 9.97 (6.53) | 9.55 (6.65) | ||
LYMPH | , 109/L14.56 (15.81) | 23.13 (15.58) | 21.28 (15.48) | 2.03 (1.25) | ||
LYM% | , %19.55 (16.81) | 10.49 (14.48) | 11.59 (15.01) | 33.06 (9.32) | ||
PDW | , fL15.47 (4.37) | 14.86 (2.38) | 14.87 (1.99) | 16.44 (6.27) | ||
HDL | , mmol/L1.31 (0.38) | 1.26 (0.37) | 1.24 (0.38) | 1.41 (0.37) | ||
UA | , umol/L291 (85.39) | 285.55 (85.47) | 288.51 (87.31) | 298.83 (84.13) | ||
CRE | , mmol/L81.36 (24.19) | 82.49 (25.97) | 81.68 (22.2) | 79.81 (22.28) | ||
GLU | , mmol/L5.06 (2.05) | 4.83 (2.29) | 4.85 (1.87) | 5.42 (1.71) | ||
TGu, mmol/L | 1.29 (0.95) | 1.22 (0.95) | 1.44 (1.24) | 1.34 (0.82) | ||
CHO | , mmol/L4.34 (1.17) | 4.06 (1.18) | 3.94 (1.22) | 4.84 (0.95) | ||
CRPHS | , mg/L4.17 (18.46) | 5.46 (23.21) | 6.15 (29.35) | 2.49 (5.29) | ||
MDA | , nmol/ml5.64 (3.4) | 5.43 (2.76) | 5.2 (2.63) | 5.93 (4.04) | ||
SOD | , IU/mL55.53 (10.33) | 55.53 (11.9) | 53.34 (11.61) | 56.04 (8.13) | ||
Disease history, n (%) | ||||||
Hypertension | ||||||
Yes | 716 (27.18) | 316 (24.09) | 88 (26.91) | 312 (31.36) | ||
No | 1918 (72.82) | 996 (75.91) | 239 (73.09) | 683 (68.64) | ||
Diabetes | ||||||
Yes | 73 (2.75) | 31 (2.34) | 11 (3.33) | 31 (3.11) | ||
No | 2577 (97.25) | 1292 (97.66) | 319 (96.67) | 966 (96.89) | ||
Heart disease | ||||||
Yes | 212 (8.01) | 91 (6.9) | 28 (8.54) | 93 (9.29) | ||
No | 2436 (91.99) | 1228 (93.1) | 300 (91.46) | 908 (90.71) | ||
Stroke | ||||||
Yes | 167 (6.29) | 77 (5.82) | 20 (6.08) | 70 (6.97) | ||
No | 2490 (93.71) | 1247 (94.18) | 309 (93.92) | 934 (93.03) | ||
Cancer | ||||||
Yes | 11 (0.42) | 3 (0.23) | 1 (0.3) | 7 (0.73) | ||
No | 2588 (99.58) | 1301 (99.77) | 329 (99.7) | 958 (99.27) | ||
Arthritis | ||||||
Yes | 300 (11.31) | 183 (13.86) | 55 (16.72) | 62 (6.18) | ||
No | 2352 (88.69) | 1137 (86.14) | 274 (83.28) | 941 (93.82) |
aADL: activities of daily living.
bIADL: instrumental activities of daily living.
cWBC: white blood cell count.
dRBC: red blood cell count.
eHGB: hemoglobin.
fHCT: erythrocyte hematocrit.
gMCV: erythrocyte mean corpuscular volume.
hMCH: erythrocyte mean corpuscular hemoglobin.
iMCHC: erythrocyte mean corpuscular hemoglobin concentration.
jPLT: platelet count.
kPCT: plateletocrit.
lMPV: mean platelet volume.
mLYMPH: lymphocyte count.
nLYM%: percentage of lymphocytes.
oPDW: platelet distribution width.
pHDL: high-density lipoprotein cholesterol.
qUA: urea acid.
rCRE: plasma creatine.
sGLU: glucose.
tTG: triglyceride.
uCHO: total cholesterol.
vCRPHS: high-sensitivity c-reactive protein.
wMDA: malondialdehyde.
xSOD: superoxide dismutase activity.

Model Performance
The performance of the ML models in predicting follow-up cognitive impairment was evaluated using 5 algorithms—RF, XGBoost, logistic regression, SVM, and BRF—on both the internal test set and prospective validation set. Each model’s accuracy, sensitivity, and area under the receiver operating characteristic curve were assessed. The detailed results are presented in
and visually depicted in . Additionally, we used the MMSE as an input for ML prediction. After testing several ML models, the overall performance remained suboptimal. For illustration, we selected 1 representative result, as shown in .Internal test | Prospective validation | |||||
Accuracy (95% CI) | Sensitivity (95% CI) | AUC | (95% CI)Accuracy (95% CI) | Sensitivity (95% CI) | AUC (95% CI) | |
RF | 0.828 (0.806-0.854) | 0.688 (0.633-0.742) | 0.81 (0.769-0.852) | 0.813 (0.79-0.84) | 0.684 (0.63-0.739) | 0.806 (0.767-0.848) |
XGBoost | 0.849 (0.825-0.87) | 0.674 (0.621-0.728) | 0.811 (0.769-0.851) | 0.836 (0.812-0.857) | 0.682 (0.629-0.733) | 0.808 (0.768-0.848) |
Logistic regression | 0.778 (0.751-0.803) | 0.715 (0.66-0.761) | 0.803 (0.753-0.847) | 0.771 (0.744-0.795) | 0.745 (0.696-0.789) | 0.804 (0.753-0.847) |
SVM | 0.69 (0.661-0.719) | 0.681 (0.629-0.731) | 0.777 (0.725-0.827) | 0.672 (0.643-0.701) | 0.676 (0.623-0.725) | 0.777 (0.726-0.826) |
Balanced RF classifier | 0.885 (0.866-0.903) | 0.58 (0.54-0.626) | 0.809 (0.765-0.849) | 0.887 (0.867-0.905) | 0.616 (0.572-0.664) | 0.811 (0.767-0.849) |
MMSE | N/A | N/A | 0.571 (0.485-0.653) | N/A | N/A | 0.558 (0.494-0.62) |
aML: machine learning.
bAUC: area under the curve.
cRF: random forest.
dXGBoost: Extreme Gradient Boosting.
eSVM: support vector machines.
fMMSE: Mini-Mental State Examination.
gN/A: not applicable.

In the internal test set, the BRF achieved the highest accuracy at 88.5% but had lower sensitivity at 58%. RF and XGBoost provided balanced results with accuracies of 82.8% and 84.9% and sensitivities of 68.8% and 67.4%, respectively. Logistic regression yielded a moderate accuracy of 77.8% but the highest sensitivity at 71.5%, while SVM had the lowest performance with 69% accuracy and 68.1% sensitivity. Similar patterns were observed in the prospective validation set, with BRF at 88.7% accuracy, logistic regression reaching 74.5% sensitivity, and SVM again showing the lowest overall performance. Additionally, the performance of all ML models was superior to that of the MMSE.
offers a detailed exposition of the XGBoost for cognitive impairment in older adult individuals, featuring the 20 most influential risk predictors. The model prioritizes IADL, age, and baseline MMSE scores as the top determinants, with marital status, living alone, and hypertension also providing significant predictive value. Other important factors include biological markers such as MDA and HDL, alongside lifestyle factors such as current exercise, smoking, and drinking habits.
illustrates the distribution of SHAP values for these predictors, indicating their impact on the XGBoost’s output. In this plot, each dot represents a single instance of a feature in the dataset, and the horizontal axis shows the SHAP value of that feature. The SHAP value indicates the contribution of each feature to the model’s prediction, with positive values suggesting that the feature increases the likelihood of cognitive impairment, and negative values indicating a decrease. The color of the dots, ranging from blue to red, represents the feature’s value, with blue corresponding to lower values and red indicating higher values of the predictor. This color scheme helps highlight how different values of each predictor influence the model’s outcome. A high SHAP value for a given feature corresponds to a high level of importance in the predictive model. features a bar plot that quantifies the average impact of each predictor, measured by the mean absolute SHAP value. The length of each bar represents the average contribution of a feature to the model’s output across all data points. Here, the most important features in the model—such as IADL, age, and MMSE score at baseline—are easily identified, as they have the longest bars, indicating that they have the highest average impact on the model’s predictions.
illustrates a specific case study, showing how the SHAP values for a particular individual (in this case, an adult aged 98 years) contribute to the XGBoost’s prediction of cognitive impairment. Each feature is shown with its value (eg, hypertension=1), and the arrows indicate how these values shift the model’s prediction. Features such as hypertension and living alone appear to have a red color, indicating they push the prediction toward a higher risk of cognitive impairment. Similarly, age (with a value of 98) and IADL further emphasize the risk in this individual’s profile. The interaction of these predictors is visualized through their SHAP values, which collectively guide the prediction model’s decision, offering an individualized risk profile for this person.

Discussion
Principal Findings
This study uses the capabilities of ML to integrate a diverse set of 39 predictors for forecasting the decline in cognitive function over a 3-year period, yielding significant findings that resonate with the existing body of literature. Our ML model underscores the importance of IADL, age, baseline MMSE scores, marital status, living arrangements, hypertension, arthritis, and general lifestyle habits as pivotal factors influencing cognitive function. These determinants are consistent with findings from prior research, thereby affirming the reliability and relevance of our analytical approach [
, ].Consistent with prior studies, advanced age has been identified as a significant risk factor for cognitive impairment [
, ]. The risk of MCI in the older adults aged 65 years or older is as high as 10%‐20% [ ]. The limitation of older adult individuals’ ability to perform activities, as assessed by ADL and IADL, restricts their range of activities, diminishes social interactions, and consequently reduces the cerebral stimulation necessary for maintaining cognitive functions [ , ]. Furthermore, the impact of living conditions on cognitive health is evidenced by the predictive value of marital status and living alone [ , - ]. Older adult individuals residing with a spouse typically exhibit healthier brain functions due to increased communication and maintenance of a normal life.Additionally, this study underscores the detrimental effects of unhealthy lifestyle behaviors such as smoking and drinking on cognitive function [
- ], as well as the risk of cognitive impairment caused by pre-existing health conditions [ - ]. For instance, an Indian study highlighted that older smokers were 24% more likely to experience cognitive impairment compared to nonsmokers [ ]. Similarly, Sabia et al [ ] reported that abstaining from alcohol or consuming more than 14 units per week in middle age escalates the risk of AD. Moreover, recent research suggests that up to 3% of dementia cases could be averted by enhancing physical activity levels [ , ]. Concerning health conditions, a community-based cohort study illustrated that hypertension is associated with an increased risk of both all-cause MCI and nonamnestic MCI, with hazard ratios of 1.4 and 1.7, respectively, after age and sex adjustments [ ]. Additionally, Appenzeller et al [ ] found that patients with rheumatoid arthritis exhibited a significantly higher incidence of cognitive impairment compared to healthy controls.In this study, in addition to age, lifestyle behaviors, and disease history—which can be evaluated through questionnaires or scales—biomarkers were specifically included in the ML model to enhance the predictive accuracy of clinical risk assessments for cognitive decline. Prior research has indicated that count elevated levels of specific biomarkers, including MDA, HDL, platelets, mean platelet volume, platelet distribution width, mean corpuscular hemoglobin, CHO, lymphocyte percentage, and plasma creatinine, are associated with an increased risk of cognitive decline [
- ]. For clinicians, integrating a patient’s lifestyle behaviors with blood biochemical markers can aid in the assessment of cognitive function. For communities, these indicators can help identify residents who may be at high risk. For family members, this model enables the evaluation of older adult relatives who may be reluctant to acknowledge their cognitive decline, thereby facilitating timely medical intervention. The model developed in this study is versatile and offers valuable insights for the identification of cognitive impairment across various settings.The ML model developed in this study has the potential to significantly improve clinical practice and primary care by providing a rapid, efficient, and accessible tool for identifying individuals at risk of cognitive decline. Traditional methods for cognitive function assessment, such as imaging techniques such as MRI, can be time-consuming and resource-intensive, especially in resource-limited settings. By leveraging routine data such as blood biomarkers, demographic information, and lifestyle factors, this model offers a cost-effective approach to identify individuals who may require further clinical evaluation or early intervention. In primary care settings, where health care professionals often manage large volumes of patients, the model can serve as a valuable screening tool to detect early cognitive decline and facilitate referrals for specialized care. Furthermore, by integrating this model into electronic health records, health care providers can make timely and informed decisions, improving patient outcomes through proactive management. In essence, the model has the potential to transform early detection and intervention strategies, shifting the focus toward preventative care and better allocation of health care resources.
Compared to previous studies using ML for cognitive impairment prediction, our study offers several distinct contributions. For example, while studies such as those by Hu et al [
] and Gao et al [ ] have successfully developed ML models to predict cognitive impairment among Chinese community-dwelling older adult individuals, they often focused on a more limited set of predictors—typically emphasizing demographic factors and neuropsychological assessments. In contrast, our study integrates a comprehensive set of 39 predictors, including both routine blood biomarkers (eg, MDA and HDL) and detailed lifestyle and disease history data. This broader approach not only enhances predictive accuracy but also provides a rapid, cost-effective tool that can be easily applied in community and clinical settings. Moreover, while some previous work [ , ] has primarily relied on imaging data or traditional statistical methods, our use of advanced ensemble ML techniques (such as BRF and XGBoost) combined with SHAP-based interpretability offers a clearer understanding of individual risk factors. This interpretability is crucial for clinicians to tailor early intervention strategies. In summary, our study advances the field by delivering a more inclusive and interpretable model that effectively tracks cognitive decline over 3 years, thereby addressing gaps in existing research and offering tangible benefits for early detection and intervention.Nevertheless, this study presents certain limitations that warrant consideration. Primarily, the dependence on MMSE scores as the sole measure of cognitive impairment may not fully represent the broad spectrum of cognitive health, as the MMSE mainly evaluates specific cognitive domains and does not address emotional or psychological aspects. Furthermore, there is potential bias due to the generally younger age and better overall function of this study’s participants compared to those who were lost to follow-up, which could skew the results. However, the representation of the dataset at a national level does provide a measure of balance, helping to partially offset these biases.
Conclusions
In conclusion, this study validates the efficacy of a ML model integrating demographic data, lifestyle factors, and biomarkers to predict cognitive impairment in older adults. It underscores the significance of traditional risk factors such as age and daily functional abilities while highlighting the role of solitary living conditions and unhealthy habits in cognitive decline. By including a broad spectrum of biomarkers, the model enriches the predictive framework, offering clinicians, communities, and families a valuable tool for early identification and intervention in cognitive impairment, which could have far-reaching implications for public health and the well-being of the aging population.
Limitations
While the CLHLS is a large-scale longitudinal study that primarily focuses on individuals aged 65 years and older, to assess the health status and longevity of the older adult population in China, it is important to acknowledge the limitations of the dataset concerning its representativeness. The CLHLS sample is designed to represent the health conditions of the older adult population in China, but it may not fully capture the global demographics of AD or other forms of dementia. Specifically, older adult populations in other countries or regions may differ by genetic background, lifestyle factors, and health risks, which could influence the development and progression of cognitive impairment. Future studies incorporating diverse, multinational cohorts would be beneficial in enhancing the generalizability and robustness of cognitive decline prediction models.
Acknowledgments
We thank CLHLS for providing data, and all participants for supporting this study. This study was supported by Guangzhou Science and Technology Plan Project (2024A03J1074 and 2024A03J1067) and General Project of Guangdong Provincial Natural Science Foundation (2025A1515012013).
Data Availability
The data used in this study are stored at Peking University and available upon request.
Authors' Contributions
YZ and CL contributed equally to the work. YZ, CL, HR, QW, ZL, D Li, D Liang, WT, and LL performed the data collection and analysis. YZ, CL, HR, and QW contributed to visualization. YZ and HR were responsible for data validation. YZ, CL and HR wrote this paper, and review editing was performed by HR and WC. All the authors read and approved the final paper.
Conflicts of Interest
None declared.
References
- Hane FT, Robinson M, Lee BY, Bai O, Leonenko Z, Albert MS. Recent progress in Alzheimer’s disease research, part 3: diagnosis and treatment. J Alzheimers Dis. 2017;57(3):645-665. [CrossRef] [Medline]
- Scheltens P, De Strooper B, Kivipelto M, et al. Alzheimer’s disease. Lancet. Apr 24, 2021;397(10284):1577-1590. [CrossRef] [Medline]
- 2023 Alzheimer’s disease facts and figures. Alzheimers Dement. Apr 2023;19(4):1598-1695. [CrossRef]
- Prince M, Wimo A, Ali GC, Wu YT, Prina M. World Alzheimer report 2015: the global impact of dementia: an analysis of prevalence, incidence, cost and trends. Alzheimer’s Disease International; 2015.
- Takizawa C, Thompson PL, van Walsem A, Faure C, Maier WC. Epidemiological and economic burden of Alzheimer’s disease: a systematic literature review of data across Europe and the United States of America. JAD. 2014;43(4):1271-1284. [CrossRef]
- Jia J, Wei C, Chen S, et al. The cost of Alzheimer’s disease in China and re-estimation of costs worldwide. Alzheimers Dement. Apr 2018;14(4):483-491. [CrossRef] [Medline]
- Anderson ND. State of the science on mild cognitive impairment (MCI). CNS Spectr. Feb 2019;24(1):78-87. [CrossRef] [Medline]
- Tan CC, Yu JT, Tan L. Biomarkers for preclinical Alzheimer’s disease. J Alzheimers Dis. 2014;42(4):1051-1069. [CrossRef] [Medline]
- Dong X, Nao J, Shi J, Zheng D. Predictive value of routine peripheral blood biomarkers in Alzheimer’s disease. Front Aging Neurosci. 2019;11:332. [CrossRef] [Medline]
- Jia L, Du Y, Chu L, et al. Prevalence, risk factors, and management of dementia and mild cognitive impairment in adults aged 60 years or older in China: a cross-sectional study. Lancet Public Health. Dec 2020;5(12):e661-e671. [CrossRef] [Medline]
- Jing F, Cheng M, Li J, et al. Social, lifestyle, and health status characteristics as a proxy for occupational burnout identification: a network approach analysis. Front Psychiatry. 2023;14:1119421. [CrossRef] [Medline]
- Tan WY, Hargreaves C, Chen C, Hilal S. A machine learning approach for early diagnosis of cognitive impairment using population-based data. J Alzheimers Dis. 2023;91(1):449-461. [CrossRef] [Medline]
- Aguayo GA, Zhang L, Vaillant M, et al. Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study. BMC Med Res Methodol. Jan 11, 2023;23(1):8. [CrossRef] [Medline]
- Tan M, Xiao Y, Jing F, et al. Evaluating machine learning-enabled and multimodal data-driven exercise prescriptions for mental health: a randomized controlled trial protocol. Front Psychiatry. 2024;15:1352420. [CrossRef] [Medline]
- Navarro CLA, Damen JAA, Takada T, et al. Completeness of reporting of clinical prediction models developed using supervised machine learning: a systematic review. BMC Med Res Methodol. Jan 13, 2022;22(1):12. [CrossRef] [Medline]
- Frizzell TO, Glashutter M, Liu CC, et al. Artificial intelligence in brain MRI analysis of Alzheimer’s disease over the past 12 years: a systematic review. Ageing Res Rev. May 2022;77:101614. [CrossRef] [Medline]
- Tombaugh TN, McIntyre NJ. The Mini-Mental State Examination: a comprehensive review. J Am Geriatr Soc. Sep 1992;40(9):922-935. [CrossRef] [Medline]
- Folstein MF, Folstein SE, McHugh PR. “Mini-mental state”. J Psychiatr Res. Nov 1975;12(3):189-198. [CrossRef]
- The Chinese Longitudinal Healthy Longevity Survey (CLHLS)—Longitudinal Data (1998–2018). Peking University Open Research Data Platform. 1998. URL: https://opendata.pku.edu.cn/dataset.xhtml?persistentId=doi:10.18170/DVN/WBO7LK [Accessed 2025-04-19]
- Zeng Y, Vaupel JW. Chinese Longitudinal Healthy Longevity Survey (CLHLS), biomarkers datasets, 2009, 2012, 2014 (ICPSR 37226). National Archive of Computerized Data on Aging. 2009. URL: https://www.icpsr.umich.edu/web/NACDA/studies/37226/versions/V1 [Accessed 2025-04-19]
- Wu J, Li J. The impact of social participation on older people’s death risk: an analysis from CLHLS. China Popul Dev Stud. Jun 2018;2(2):173-185. [CrossRef]
- Hu M, Shu X, Yu G, Wu X, Välimäki M, Feng H. A risk prediction model based on machine learning for cognitive impairment among chinese community-dwelling elderly people with normal cognition: development and validation study. J Med Internet Res. Feb 24, 2021;23(2):e20298. [CrossRef] [Medline]
- Mlinac ME, Feng MC. Assessment of activities of daily living, self-care, and independence. Arch Clin Neuropsychol. Sep 2016;31(6):506-516. [CrossRef] [Medline]
- Farias ST, Mungas D, Reed BR, Harvey D, Cahn-Weiner D, Decarli C. MCI is associated with deficits in everyday functioning. Alzheimer Dis Assoc Disord. 2006;20(4):217-223. [CrossRef] [Medline]
- Kiosses DN, Alexopoulos GS. IADL functions, cognitive deficits, and severity of depression: a preliminary study. Am J Geriatr Psychiatry. Mar 2005;13(3):244-249. [CrossRef] [Medline]
- Graf C. The Lawton instrumental activities of daily living scale. AJN. 2008;108(4):52-62. [CrossRef]
- Little RJA, Rubin D. Statistical Analysis with Missing Data. John Wiley & Sons; 2019.
- Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: Synthetic Minority Over-sampling Technique. JAIR. 2002;16:321-357. [CrossRef]
- Breiman L. Random forests. Mach Learn. 2001;45(1):5-32. [CrossRef]
- Chen T, Guestrin C. XGBoost. Presented at: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Aug 13-17, 2016; San Francisco, CA, United States.
- Sturdivant RX, Hosmer DW, Lemeshow S. Applied Logistic Regression. John Wiley & Sons; 2013. [CrossRef]
- Cortes C, Vapnik V. Support-vector networks. Mach Learn. Sep 1995;20(3):273-297. [CrossRef]
- Khalilia M, Chakraborty S, Popescu M. Predicting disease risks from highly imbalanced data using random forest. BMC Med Inform Decis Mak. Jul 29, 2011;11:1-13. [CrossRef] [Medline]
- Mangalathu S, Hwang SH, Jeon JS. Failure mode and effects analysis of RC members based on machine-learning-based Shapley Additive Explanations (SHAP) approach. Eng Struct. Sep 2020;219:110927. [CrossRef]
- Cong L, Ren Y, Wang Y, et al. Mild cognitive impairment among rural-dwelling older adults in China: a community-based study. Alzheimers Dement. Jan 2023;19(1):56-66. [CrossRef] [Medline]
- Grueso S, Viejo-Sobera R. Machine learning methods for predicting progression from mild cognitive impairment to Alzheimer’s disease dementia: a systematic review. Alzheimers Res Ther. Sep 28, 2021;13(1):162. [CrossRef] [Medline]
- Langa KM, Levine DA. The diagnosis and management of mild cognitive impairment: a clinical review. JAMA. Dec 17, 2014;312(23):2551-2561. [CrossRef] [Medline]
- Njegovan V, Hing MM, Mitchell SL, Molnar FJ. The hierarchy of functional loss associated with cognitive decline in older persons. J Gerontol A Biol Sci Med Sci. Oct 2001;56(10):M638-M643. [CrossRef] [Medline]
- Jekel K, Damian M, Wattmo C, et al. Mild cognitive impairment and deficits in instrumental activities of daily living: a systematic review. Alzheimers Res Ther. 2015;7(1):17. [CrossRef] [Medline]
- Sundström A, Westerlund O, Kotyrlo E. Marital status and risk of dementia: a nationwide population-based prospective study from Sweden. BMJ Open. Jan 4, 2016;6(1):e008565. [CrossRef] [Medline]
- Sommerlad A, Ruegger J, Singh-Manoux A, Lewis G, Livingston G. Marriage and risk of dementia: systematic review and meta-analysis of observational studies. J Neurol Neurosurg Psychiatry. Mar 2018;89(3):231-238. [CrossRef] [Medline]
- Ahn IS, Kim JH, Kim S, et al. Impairment of instrumental activities of daily living in patients with mild cognitive impairment. Psychiatry Investig. Sep 2009;6(3):180-184. [CrossRef] [Medline]
- Tyas SL, White LR, Petrovitch H, et al. Mid-life smoking and late-life dementia: the Honolulu-Asia aging study. Neurobiol Aging. 2003;24(4):589-596. [CrossRef] [Medline]
- Xu W, Wang H, Wan Y, et al. Alcohol consumption and dementia risk: a dose-response meta-analysis of prospective studies. Eur J Epidemiol. Jan 2017;32(1):31-42. [CrossRef] [Medline]
- Sabia S, Fayosse A, Dumurgier J, et al. Alcohol consumption and risk of dementia: 23 year follow-up of Whitehall II cohort study. BMJ. Aug 1, 2018;362:k2927. [CrossRef] [Medline]
- Vitturi BK, Nascimento BAC, Alves BR, de Campos FSC, Torigoe DY. Cognitive impairment in patients with rheumatoid arthritis. J Clin Neurosci. Nov 2019;69:81-87. [CrossRef] [Medline]
- Meade T, Manolios N, Cumming SR, Conaghan PG, Katz P. Cognitive impairment in rheumatoid arthritis: a systematic review. Arthritis Care Res (Hoboken). Jan 2018;70(1):39-52. [CrossRef] [Medline]
- Aronow WS. Hypertension and cognitive impairment. Ann Transl Med. Jun 2017;5(12):259. [CrossRef] [Medline]
- Reitz C, Tang MX, Manly J, Mayeux R, Luchsinger JA. Hypertension and the risk of mild cognitive impairment. Arch Neurol. Dec 2007;64(12):1734-1740. [CrossRef] [Medline]
- Muhammad T, Govindu M, Srivastava S. Relationship between chewing tobacco, smoking, consuming alcohol and cognitive impairment among older adults in India: a cross-sectional study. BMC Geriatr. Jan 29, 2021;21(1):85. [CrossRef] [Medline]
- Liang JH, Lu L, Li JY, et al. Contributions of modifiable risk factors to dementia incidence: a Bayesian network analysis. J Am Med Dir Assoc. Nov 2020;21(11):1592-1599. [CrossRef] [Medline]
- Dominguez LJ, Veronese N, Vernuccio L, et al. Nutrition, physical activity, and other lifestyle factors in the prevention of cognitive decline and dementia. Nutrients. Nov 15, 2021;13(11):4080. [CrossRef] [Medline]
- Appenzeller S, Bertolo MB, Costallat LTL. Cognitive impairment in rheumatoid arthritis. Methods Find Exp Clin Pharmacol. Jun 2004;26(5):339-343. [CrossRef] [Medline]
- Keller JN, Schmitt FA, Scheff SW, et al. Evidence of increased oxidative damage in subjects with mild cognitive impairment. Neurology (ECronicon). Apr 12, 2005;64(7):1152-1156. [CrossRef] [Medline]
- Liu Z, Liu Y, Tu X, et al. High serum levels of malondialdehyde and 8-OHdG are both associated with early cognitive impairment in patients with acute ischaemic stroke. Sci Rep. Aug 25, 2017;7(1):9493. [CrossRef] [Medline]
- van Exel E, de Craen AJM, Gussekloo J, et al. Association between high-density lipoprotein and cognitive impairment in the oldest old. Ann Neurol. Jun 2002;51(6):716-721. [CrossRef] [Medline]
- Atzmon G, Gabriely I, Greiner W, Davidson D, Schechter C, Barzilai N. Plasma HDL levels highly correlate with cognitive function in exceptional longevity. J Gerontol A Biol Sci Med Sci. Nov 2002;57(11):M712-M715. [CrossRef] [Medline]
- Crichton GE, Elias MF, Davey A, Sullivan KJ, Robbins MA. Higher HDL cholesterol is associated with better cognitive function: the Maine-Syracuse study. J Int Neuropsychol Soc. Nov 2014;20(10):961-970. [CrossRef] [Medline]
- Zhang Y, Liu J, Wei Z, et al. Elevated serum platelet count inhibits the effects of brain functional changes on cognitive function in patients with mild cognitive impairment: A resting-state functional magnetic resonance imaging study. Front Aging Neurosci. 2023;15:1088095. [CrossRef] [Medline]
- Sun D, Wang Q, Kang J, et al. Correlation between serum platelet count and cognitive function in patients with atrial fibrillation: a cross-sectional study. Cardiol Res Pract. 2021;2021:9039610. [CrossRef] [Medline]
- Wang RT, Jin D, Li Y, Liang QC. Decreased mean platelet volume and platelet distribution width are associated with mild cognitive impairment and Alzheimer’s disease. J Psychiatr Res. May 2013;47(5):644-649. [CrossRef] [Medline]
- Winchester LM, Powell J, Lovestone S, Nevado-Holgado AJ. Red blood cell indices and anaemia as causative factors for cognitive function deficits and for Alzheimer’s disease. Genome Med. Jun 28, 2018;10(1):51. [CrossRef] [Medline]
- Guo Y, Li P, Ma X, et al. Association of circulating cholesterol level with cognitive function and mild cognitive impairment in the elderly: a community-based population study. Curr Alzheimer Res. 2020;17(6):556-565. [CrossRef] [Medline]
- Magaki S, Yellon SM, Mueller C, Kirsch WM. Immunophenotypes in the circulation of patients with mild cognitive impairment. J Psychiatr Res. Feb 2008;42(3):240-246. [CrossRef] [Medline]
- Yoshida M, Higashi K, Kuni K, et al. Distinguishing mild cognitive impairment from Alzheimer’s disease with acrolein metabolites and creatinine in urine. Clinica Chimica Acta. Feb 2015;441:115-121. [CrossRef]
- Gao H, Schneider S, Hernandez R, et al. Early identification of cognitive impairment in community environments through modeling subtle inconsistencies in questionnaire responses: machine learning model development and validation. JMIR Form Res. Nov 13, 2024;8:e54335. [CrossRef] [Medline]
- Graham SA, Lee EE, Jeste DV, et al. Artificial intelligence approaches to predicting and detecting cognitive decline in older adults: a conceptual review. Psychiatry Res. Feb 2020;284:112732. [CrossRef] [Medline]
- Zhou Y, Song Z, Han X, Li H, Tang X. Prediction of Alzheimer’s disease progression based on magnetic resonance imaging. ACS Chem Neurosci. Nov 17, 2021;12(22):4209-4223. [CrossRef] [Medline]
Abbreviations
AD: Alzheimer disease |
ADL: activities of daily living |
BRF: balanced random forest |
CHO: total cholesterol |
CLHLS: Chinese Longitudinal Healthy Longevity Survey |
HDL: high-density lipoprotein cholesterol |
IADL: instrumental activities of daily living |
MCI: mild cognitive impairment |
MDA: malondialdehyde |
ML: machine learning |
MMSE: Mini-Mental State Examination |
MRI: magnetic resonance imaging |
RF: random forest |
SHAP: Shapley Additive Explanations |
SMOTE: Synthetic Minority Over-Sampling Technique |
SVM: support vector machines |
XGBoost: Extreme Gradient Boosting |
Edited by Hanna Köttl; submitted 19.10.24; peer-reviewed by Anurag Garikipati, Yafei Wu; final revised version received 05.03.25; accepted 01.04.25; published 30.04.25.
Copyright© Hao Ren, Yiying Zheng, Changjin Li, Fengshi Jing, Qiting Wang, Zeyu Luo, Dongxiao Li, Deyi Liang, Weiming Tang, Li Liu, Weibin Cheng. Originally published in JMIR Aging (https://aging.jmir.org), 30.4.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Aging, is properly cited. The complete bibliographic information, a link to the original publication on https://aging.jmir.org, as well as this copyright and license information must be included.