Original Paper
- Shayan Nejadshamsi1,2,3, PhD ;
- Vania Karami1,2,3, PhD ;
- Negar Ghourchian4, PhD ;
- Narges Armanfard1,5, PhD ;
- Howard Bergman2, MD, FRCP ;
- Roland Grad2, MD, CM, MSc ;
- Machelle Wilchesky2,3,6, PhD ;
- Vladimir Khanassov2, MD ;
- Isabelle Vedel2, MD, PhD ;
- Samira Abbasgholizadeh Rahimi1,2,3,7, BEng, PhD
1Mila—Quebec Artificial Intelligence Institute, Montreal, QC, Canada
2Family Medicine Department, Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
3Lady Davis Institute for Medical Research, Jewish General Hospital, Montreal, QC, Canada
4Aerial Technologies, Montreal, QC, Canada
5Department of Electrical and Computer Engineering, Faculty of Engineering, McGill University, Montreal, QC, Canada
6Donald Berman Maimonides Centre for Research in Aging, Montreal, QC, Canada
7Faculty of Dental Medicine and Oral Health Sciences, McGill University, Montreal, Canada
Corresponding Author:
Samira Abbasgholizadeh Rahimi, BEng, PhD
Family Medicine Department
Faculty of Medicine and Health Sciences
McGill University
5858 Côte des Negies
Montreal, QC, H3S 1Z1
Canada
Phone: 1 5143987375
Email: samira.rahimi@mcgill.ca
Abstract
Background: Depression, characterized by persistent sadness and loss of interest in daily activities, greatly reduces quality of life. Early detection is vital for effective treatment and intervention. While many studies use wearable devices to classify depression based on physical activity, these often rely on intrusive methods. Additionally, most depression classification studies involve large participant groups and use single-stage classifiers without explainability.
Objective: This study aims to assess the feasibility of classifying depression using nonintrusive Wi-Fi–based motion sensor data using a novel machine learning model on a limited number of participants. We also conduct an explainability analysis to interpret the model’s predictions and identify key features associated with depression classification.
Methods: In this study, we recruited adults aged 65 years and older through web-based and in-person methods, supported by a McGill University health care facility directory. Participants provided consent, and we collected 6 months of activity and sleep data via nonintrusive Wi-Fi–based sensors, along with Edmonton Frailty Scale and Geriatric Depression Scale data. For depression classification, we proposed a HOPE (Home-Based Older Adults’ Depression Prediction) machine learning model with feature selection, dimensionality reduction, and classification stages, evaluating various model combinations using accuracy, sensitivity, precision, and F1-score. Shapely addictive explanations and local interpretable model-agnostic explanations were used to explain the model’s predictions.
Results: A total of 6 participants were enrolled in this study; however, 2 participants withdrew later due to internet connectivity issues. Among the 4 remaining participants, 3 participants were classified as not having depression, while 1 participant was identified as having depression. The most accurate classification model, which combined sequential forward selection for feature selection, principal component analysis for dimensionality reduction, and a decision tree for classification, achieved an accuracy of 87.5%, sensitivity of 90%, and precision of 88.3%, effectively distinguishing individuals with and those without depression. The explainability analysis revealed that the most influential features in depression classification, in order of importance, were “average sleep duration,” “total number of sleep interruptions,” “percentage of nights with sleep interruptions,” “average duration of sleep interruptions,” and “Edmonton Frailty Scale.”
Conclusions: The findings from this preliminary study demonstrate the feasibility of using Wi-Fi–based motion sensors for depression classification and highlight the effectiveness of our proposed HOPE machine learning model, even with a small sample size. These results suggest the potential for further research with a larger cohort for more comprehensive validation. Additionally, the nonintrusive data collection method and model architecture proposed in this study offer promising applications in remote health monitoring, particularly for older adults who may face challenges in using wearable devices. Furthermore, the importance of sleep patterns identified in our explainability analysis aligns with findings from previous research, emphasizing the need for more in-depth studies on the role of sleep in mental health, as suggested in the explainable machine learning study.
doi:10.2196/67715
Keywords
Introduction
Depression is a prevalent mental health disorder characterized by emotional dysregulation, leading to persistent sadness, loss of interest, and anhedonia [Abd-Alrazaq A, AlSaad R, Shuweihdi F, Ahmed A, Aziz S, Sheikh J. Systematic review and meta-analysis of performance of wearable artificial intelligence in detecting and predicting depression. NPJ Digit Med. 2023;6(1):84. [FREE Full text] [CrossRef] [Medline]1-Parra F, Benezeth Y, Yang F. Automatic assessment of emotion dysregulation in American, French, and Tunisian adults and new developments in deep multimodal fusion: cross-sectional study. JMIR Ment Health. 2022;9(1):e34333. [FREE Full text] [CrossRef] [Medline]3]. The rising incidence of depression among older adults has become a significant public health issue [Choi J, Lee S, Kim S, Kim D, Kim H. Depressed mood prediction of elderly people with a wearable band. Sensors. 2022;22(11):4174. [FREE Full text] [CrossRef] [Medline]4-Song S, Seo Y, Hwang S, Kim HY, Kim J. Digital phenotyping of geriatric depression using a community-based digital mental health monitoring platform for socially vulnerable older adults and their community caregivers: 6-week living lab single-arm pilot study. JMIR Mhealth Uhealth. 2024;12:e55842. [FREE Full text] [CrossRef] [Medline]6]. Early detection of depression and corresponding intervention are vital for improving mental health outcomes and reducing the overall burden on individuals and health care systems [Colizzi M, Lasalvia A, Ruggeri M. Prevention and early intervention in youth mental health: Is it time for a multidisciplinary and trans-diagnostic model for care? Int J Ment Health Syst. 2020;14(1):23. [FREE Full text] [CrossRef] [Medline]7-Huang S, Wang Y, Li G, Hall BJ, Nyman TJ. Digital mental health interventions for alleviating depression and anxiety during psychotherapy waiting lists: systematic review. JMIR Ment Health. 2024;11:e56650. [FREE Full text] [CrossRef] [Medline]9]. Traditional methods for assessing depression include various approaches that typically require in-person evaluations, specialized training in comprehensive geriatric assessments, and reliance on clinical judgment and questionnaires, which can be challenging and resource-intensive [Smith KM, Renshaw PF, Bilello J. The diagnosis of depression: current and emerging methods. Compr Psychiatry. 2013;54(1):1-6. [FREE Full text] [CrossRef] [Medline]10-Glavin D, Grua EM, Nakamura CA, Scazufca M, Ribeiro Dos Santos E, Wong GHY, et al. Patient health questionnaire-9 item pairing predictiveness for prescreening depressive symptomatology: machine learning analysis. JMIR Ment Health. 2023;10:e48444. [FREE Full text] [CrossRef] [Medline]12]. These methods require older adults to visit clinical settings frequently, increasing strain on health care facilities and reducing data collection opportunities. Additionally, many older adults prefer to remain in their homes and be remotely monitored in that environment, highlighting the need for remote care solutions in this demographic [Camp N, Johnston J, Lewis MGC, Zecca M, Di Nuovo A, Hunter K, et al. Perceptions of in-home monitoring technology for activities of daily living: semistructured interview study with community-dwelling older adults. JMIR Aging. 2022;5(2):e33714. [FREE Full text] [CrossRef] [Medline]13,Chan A, Cai J, Qian L, Coutts B, Phan S, Gregson G, et al. In-home positioning for remote home health monitoring in older adults: systematic review. JMIR Aging. 2024;7:e57320. [FREE Full text] [CrossRef] [Medline]14].
Physical activity and mobility are among the important factors in evaluating depression, with strong correlations established between these parameters and depression assessments [Abd-Alrazaq A, AlSaad R, Shuweihdi F, Ahmed A, Aziz S, Sheikh J. Systematic review and meta-analysis of performance of wearable artificial intelligence in detecting and predicting depression. NPJ Digit Med. 2023;6(1):84. [FREE Full text] [CrossRef] [Medline]1,Gianfredi V, Blandi L, Cacitti S, Minelli M, Signorelli C, Amerio A, et al. Depression and objectively measured physical activity: a systematic review and meta-analysis. Int J Environ Res Public Health. 2020;17(10):3738. [FREE Full text] [CrossRef] [Medline]15]. The advent of the Internet of Things has enabled continuous and remote monitoring of physical activity. Several studies have used statistical methods to analyze the relationship between physical activity, as measured by wearable devices, and depression [Arrieta H, Rezola-Pardo C, Echeverria I, Iturburu M, Gil SM, Yanguas JJ, et al. Physical activity and fitness are associated with verbal memory, quality of life and depression among nursing home residents: preliminary data of a randomized controlled trial. BMC Geriatr. 2018;18(1):80. [CrossRef] [Medline]16-Raudsepp L, Riso EM. Longitudinal association between objectively measured walking and depressive symptoms among Estonian older adults. J Aging Phys Act. 2017;25(4):639-645. [CrossRef] [Medline]21]. As the field of artificial intelligence (AI) advances, machine learning models have emerged as promising tools for depression classification using physical activity data [Kumar P, Garg S, Garg A. Assessment of anxiety, depression and stress using machine learning models. Procedia Comput Sci. 2020;171:1989-1998. [CrossRef]22]. For instance, Adamczyk and Malawski [Adamczyk J, Malawski F. Comparison of manual and automated feature engineering for daily activity classification in mental disorder diagnosis. Comput Inf. 2021;40(4):850-879. [CrossRef]23] used data from wearable actigraph watches in 3 classification models: logistic regression (LR), support vector machine (SVM), and random forest (RF) comparing automatic and manual feature engineering for depression classification. Bai et al [Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. 2021;9(3):e24365. [FREE Full text] [CrossRef] [Medline]24] used phone use, sleep data, and step counts from 334 participants, using 2 feature selection methods (L1-based feature selection) and 6 machine learning models (decision tree [DT], k-nearest neighbors, naive Bayes, LR, SVM, and RF) for mood classification. Chikersal et al [Chikersal P, Doryab A, Tumminia M, Villalba DK, Dutcher JM, Liu X, et al. Detecting depression and predicting its onset using longitudinal symptoms captured by passive sensing. ACM Trans Comput Hum Interact. 2021;28(1):1-41. [CrossRef]25] analyzed data from smartphones and fitness trackers of 138 college students to identify those experiencing depressive symptoms, using nested randomized LR for feature selection and AdaBoost with gradient boosting classifier. Dai et al [Dai R, Kannampallil T, Zhang J, Lv N, Ma J, Lu C. Multi-task learning for randomized controlled trials. Proc ACM Interact Mob Wearable Ubiquitous Technol. 2022;6(2):1-23. [CrossRef]26] used heart rate, energy expenditure, sleep, and other activity data from wearable Fitbit devices for depression remission detection in 106 participants within 2 intervention and control groups, using a multitask learning algorithm comprising 2 dense layers with shared parameters. Similarly, Griffiths et al [Griffiths C, da Silva KM, Leathlean C, Jiang H, Ang CS, Searle R. Investigation of physical activity, sleep, and mental health recovery in treatment resistant depression (TRD) patients receiving repetitive transcranial magnetic stimulation (rTMS) treatment. J Affect Disord Rep. 2022;8:100337. [FREE Full text] [CrossRef] [Medline]27] classified depression using activity and sleep data from Fitbit devices of 24 participants through an RF model. Espino-Salinas et al [Espino-Salinas CH, Galván-Tejada CE, Luna-García H, Gamboa-Rosales H, Celaya-Padilla JM, Zanella-Calzada LA, et al. Two-dimensional convolutional neural network for depression episodes detection in real time using motor activity time series of depresjon dataset. Bioengineering. 2022;9(9):458. [FREE Full text] [CrossRef] [Medline]28] used wrist-worn accelerometers to measure physical activity in 55 participants, applying a 2D-convolutional neural network (CNN) and a deep neural network for depression classification. Jakobsen et al [Jakobsen P, Garcia-Ceja E, Riegler M, Stabell LA, Nordgreen T, Torresen J, et al. Applying machine learning in motor activity time series of depressed bipolar and unipolar patients compared to healthy controls. PLoS One. 2020;15(8):e0231995. [FREE Full text] [CrossRef] [Medline]29] used RF, deep neural network, and CNN algorithms for depression classification with wrist-worn actigraph data from 55 participants. Jung et al [Jung D, Kim J, Mun KR. Identifying depression in the elderly using gait accelerometry. Annu Int Conf IEEE Eng Med Biol Soc. 2022;2022:4946-4949. [CrossRef] [Medline]30] used gait accelerometry data and a bidirectional long short-term memory network–based classifier to assess depression in 45 older adults. Other studies have also explored the use of wearable device data combined with various classification methods, such as 1D-CNN [Kulam S. Time-series classification with uni-dimensional convolutional neural networks: An experimental comparison with long short-term memory networks [dissertation]. University of Oslo; 2019. URL: https://www.duo.uio.no/handle/10852/73250 [accessed 2024-12-07] 31], deep convolutional neuro-fuzzy [Kumar A, Sangwan SR, Arora A, Menon VG. Depress-DCNF: A deep convolutional neuro-fuzzy model for detection of depression episodes using IoMT. Appl Soft Comput. 2022;122:108863. [CrossRef]32], Ensemble models [Nguyen DK, Chan CL, Adams Li AH, Phan DV. Deep stacked generalization ensemble learning models in early diagnosis of depression illness from wearable devices data. 2021. Presented at: ICMHI 2021: 2021 5th International Conference on Medical and Health Informatics; May 14-16, 2021; Kyoto, Japan. [CrossRef]33], and extreme gradient boosting [Opoku Asare K, Moshe I, Terhorst Y, Vega J, Hosio S, Baumeister H, et al. Mood ratings and digital biomarkers from smartphone and wearable data differentiates and predicts depression status: a longitudinal data analysis. Pervasive Mobile Comput. 2022;83:101621. [CrossRef]34].
These recent studies highlight the integration of wearable devices with machine learning algorithms as a promising approach for continuous, remote, home-based monitoring and early detection of depression. However, wearable device-based approaches face challenges due to their intrusive nature [Park C, Mishra R, Golledge J, Najafi B. Digital biomarkers of physical frailty and frailty phenotypes using sensor-based physical activity and machine learning. Sensors. 2021;21(16):5289. [FREE Full text] [CrossRef] [Medline]35]. Participants are required to wear sensors or devices, which may lead to issues with compliance, comfort, and data accuracy, especially over extended periods [Akbari G, Nikkhoo M, Wang L, Chen CPC, Han DS, Lin YH, et al. Frailty level classification of the community elderly using Microsoft Kinect–based skeleton pose: a machine learning approach. Sensors (Basel). 2021;21(12):4017. [FREE Full text] [CrossRef] [Medline]36], particularly among older adults [Utsumi T, Hashimoto M. A development of an early detection system of pre-frailty in senior citizens living inside. 2023. Presented at: 17th International Conference on Ubiquitous Information Management and Communication (IMCOM); Seoul, Korea; January 03-05, 2023. [CrossRef]37]. These challenges can result in inaccurate data collection [Piau A, Wild K, Mattek N, Kaye J. Current state of digital biomarker technologies for real-life, home-based monitoring of cognitive function for mild cognitive impairment to mild Alzheimer disease and implications for clinical care: systematic review. J Med Internet Res. 2019;21(8):e12785. [FREE Full text] [CrossRef] [Medline]38]. Nonwearable methods, such as remote monitoring through ambient sensors, offer potential solutions to these issues [Utsumi T, Hashimoto M. A development of an early detection system of pre-frailty in senior citizens living inside. 2023. Presented at: 17th International Conference on Ubiquitous Information Management and Communication (IMCOM); Seoul, Korea; January 03-05, 2023. [CrossRef]37,Tegou T, Kalamaras I, Tsipouras M, Giannakeas N, Votis K, Tzovaras D. A low-cost indoor activity monitoring system for detecting frailty in older adults. Sensors. 2019;19(3):452. [FREE Full text] [CrossRef] [Medline]39,Goonawardene N, Tan HP, Tan LB. Unobtrusive detection of frailty in older adults. In: Zhou J, Salvendy G, editors. Human Aspects of IT for the Aged Population Applications in Health, Assistance, and Entertainment. Cham, Switzerland. Springer International Publishing; 2018. 40]. These approaches can alleviate concerns related to device adherence and physical discomfort, providing a more seamless integration into daily life. Additionally, our literature review indicates that most studies on depression classification use a relatively large number of participants and primarily use single-stage classifiers. Many studies also focus on detailed aspects of physical activity data (eg, body displacement, acceleration) using intrusive wearable sensors, which, while effective, present challenges related to user comfort and compliance. This reliance on wearable technology underscores the need for exploring alternative, less intrusive methods.
Recently, Wi-Fi–based sensing in smart homes has emerged as an alternative method for detecting and monitoring contextual human activity and movement [Mohtadifar M, Cheffena M, Pourafzal A. Acoustic- and radio-frequency–based human activity recognition. Sensors. 2022;22(9):3125. [FREE Full text] [CrossRef] [Medline]41]. Wi-Fi–based technologies are increasingly adopted due to their existing infrastructure in homes and minimal additional setup costs [Wang S, Zhou G. A review on radio based activity recognition. Digit Commun Netw. 2015;1(1):20-29. [CrossRef]42]. These technologies use signal metrics such as received signal strength indicator (RSSI) and channel state information (CSI) to analyze Wi-Fi signal characteristics, offering human activity identification compared to invasive wearable sensors, image analysis, or video-based systems [Showmik IA, Sanam TF, Imtiaz H. Human activity recognition from Wi-Fi CSI data using principal component-based wavelet CNN. Digit Signal Process. 2023;138:104056. [CrossRef]43]. Leveraging Wi-Fi–based activity data with machine learning presents a viable approach for different diseases, specifically depression classification, which is the focus of this study.
Technological and AI-based methods for depression classification predominantly rely on wearable sensors and are often conducted on large participant groups and typically use statistical techniques or single-stage machine learning classifiers. Furthermore, these approaches often overlook the explainability analysis of the models, which causes a lack of understanding of the underlying decision-making processes and the contribution of individual features to the model’s predictions. To address these challenges, this feasibility study introduces a novel 3-stage machine learning model that incorporates feature selection, dimensionality reduction, and classification for depression classification. This model is specifically designed for a limited number of participants using low cost, easily installable Wi-Fi data, which provides continuous insights into human indoor activities. In addition to creating a model that functions effectively with small sample sizes, this study integrates explainable AI techniques to enhance the interpretability of the model’s predictions. This approach ensures that the insights derived from the model are transparent and comprehensible, providing clarity on how specific features contribute to the classification outcomes. Figure 1 illustrates a schematic overview of our proposed framework.

Methods
Data Acquisition
We begin by outlining the data acquisition process for the participants, followed by an in-depth explanation of our proposed HOPE model.
Study Cohort
Our recruitment approach used both digital and in-person strategies. Digital outreach was conducted through email campaigns, social media platforms, and digital posters. Prospective participants were provided with detailed information about the study, and those who expressed interest received a consent form. A member of the research team then coordinated the setup of the monitoring equipment. Participants were compensated for their time and involvement with an e-gift card.
Participants in this study were required to be aged 65 years or older, capable of communicating in English or French, and have access to an internet connection at home. Exclusion criteria were as follows: (1) individuals with mental or physical conditions that would impede their ability to participate in the study and its 6-month follow-up, such as gait or balance disorders, active mental health issues, or the use of mobility aids such as canes; and (2) individuals with current substance use disorder, including alcohol or drugs, due to their potential impact on physical mobility. However, individuals with a history of substance use disorder who were no longer consuming were considered eligible.
Experimental Protocol
This study used a nonintrusive Wi-Fi–based motion sensor system to facilitate remote monitoring of human activity. Initially, we collected demographic information (ie, age and gender) from the participants. Subsequently, Wi-Fi data were acquired through our remote monitoring technology [Pulle V, Rozon M, Zandifar A, Yadavalli SH, Asghari VR, Minayi Jalil A, et al. Systems and methods for remote monitoring care using wireless sensing technologies. Justia. URL: https://patents.justia.com/patent/20240027564 [accessed 2024-10-29] 44]. This device, installed at the network access point, detects Wi-Fi signals in environments conducive to passive sensing, including private residences and public spaces where Wi-Fi is prevalent [Allegue M, Ghourchian N, Rozon M, Carignan E, Hopper M, Gonzalez JF, et al. WiFi motion intelligence: The fundamentals. Aerial Technologies Inc. URL: https://aerial.ai/files/common/The_fundamentals_Aerial_Telefonica_PCP-final_aligned.pdf [accessed 2025-01-14] 45]. In such indoor settings, Wi-Fi signals exhibit stability in the absence of individuals but fluctuate significantly with the presence and movement of people [Allegue M, Ghourchian N, Rozon M, Carignan E, Hopper M, Gonzalez JF, et al. WiFi motion intelligence: The fundamentals. Aerial Technologies Inc. URL: https://aerial.ai/files/common/The_fundamentals_Aerial_Telefonica_PCP-final_aligned.pdf [accessed 2025-01-14] 45]. These signal variations correspond to distinct patterns associated with human movements and activities, thus providing valuable data for activity monitoring [Vitkovic J, Le C, Lee SL, Clark RA. The contribution of hearing and hearing loss to balance control. Audiol Neurootol. 2016;21(4):195-202. [CrossRef] [Medline]46]. The collected data were subsequently transferred to secure cloud storage via an internet connection. Our team has developed advanced signal processing and AI-based algorithms to process raw Wi-Fi RSSI and CSI measurements. These algorithms standardize signal variations and translate RSSI and CSI fluctuations into a detailed set of contextual information related to human activity [Allegue M, Ghourchian N, Rozon M, Carignan E, Hopper M, Gonzalez JF, et al. WiFi motion intelligence: The fundamentals. Aerial Technologies Inc. URL: https://aerial.ai/files/common/The_fundamentals_Aerial_Telefonica_PCP-final_aligned.pdf [accessed 2025-01-14] 45]. This information encompasses daily activity duration, bedtime, wake-up time, total sleep duration, and sleep interruption information. In addition to using Wi-Fi–based activity data, frailty and depression statuses were also assessed using validated assessment tools. The Edmonton Frailty Scale (EFS), which measures multiple dimensions of frailty [Dent E, Kowal P, Hoogendijk EO. Frailty measurement in research and clinical practice: a review. Eur J Intern Med. 2016;31:3-10. [CrossRef] [Medline]47-Perna S, Francis MD, Bologna C, Moncaglieri F, Riva A, Morazzoni P, et al. Performance of Edmonton Frail Scale on frailty assessment: its association with multi-dimensional geriatric conditions assessed with specific screening tools. BMC Geriatr. 2017;17(1):2. [FREE Full text] [CrossRef] [Medline]49], and the 15-item Geriatric Depression Scale (GDS) [Yesavage JA. The use of self-rating depression scales in the elderly. In: Poon LW, Crook T, Davis KL, Eisdorfer C, Gurland BJ, Kaszniak AW, et al, editors. Handbook for Clinical Memory Assessment of Older Adults. Washington, DC. American Psychological Association; 1986. 50] were administered at the end of the experiment.
Analytical Framework
The methodical steps of our proposed model are detailed in the following subsections. During the development, implementation, and reporting, we adhered to the Minimum Information About Clinical Artificial Intelligence Modeling (MI-CLAIM) guidelines [Norgeot B, Quer G, Beaulieu-Jones BK, Torkamani A, Dias R, Gianfrancesco M, et al. Minimum information about clinical artificial intelligence modeling: the MI-CLAIM checklist. Nat Med. 2020;26(9):1320-1324. [FREE Full text] [CrossRef] [Medline]51], following best practices designed to promote transparency and reproducibility of our AI model.
Data Preparation and Feature Extraction
To prepare the contextual human activity data as input for depression classification, obtained from our Wi-Fi signal analysis software, we designed a preprocessing stage. This process involves handling missing values and outliers to ensure the integrity of the data [Heo YJ. Population aging and house prices: Who are we calling old? J Econ Ageing. 2022;23:100417. [FREE Full text] [CrossRef]52,Sarawagi S, Bhamidipaty A. Interactive deduplication using active learning. 2002. Presented at: KDD02: The Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; July 23-26, 2002; Edmonton, Canada. [CrossRef]53]. To enhance analytical depth and improve model performance, we implemented feature engineering on contextual human activity data. This process involves extracting a variety of new features, such as the mean and SD of bedtime and wake-up times, mean and SD of sleep duration (in hours), total count and mean of sleep interruptions, total duration and mean duration of sleep interruptions (in hours), longest continuous sleep duration (in hours), percentage of nights with sleep disturbances, and metrics related to daily activity, including the mean and SD of total daily activity and hourly activity durations, as well as peak activity hour. These derived features, combined with EFS data, were incorporated into our 3-stage machine learning classification model. The depression status, determined using the GDS, was used to label the samples for classification purposes.
HOPE Model Development
The proposed HOPE Model was designed for depression classification in older adults using nonintrusive Wi-Fi–based motion sensor data. Due to the limited number of participants, it is necessary to provide an efficient pipeline for preprocessing, feature extraction, and classification. The limited number of participants and high dimensionality of features required a tailored multistage machine learning pipeline to maximize classification accuracy. Furthermore, ensuring that our depression classification model is explainable to clinicians is important, as highlighted in our previous works [Ghadiri P, Yaffe MJ, Adams AM, Abbasgholizadeh-Rahimi S. Primary care physicians' perceptions of artificial intelligence systems in the care of adolescents' mental health. BMC Prim Care. 2024;25(1):215. [FREE Full text] [CrossRef] [Medline]54,Rahimi SA, Légaré F, Sharma G, Archambault P, Zomahoun HTV, Chandavong S, et al. Application of artificial intelligence in community-based primary health care: systematic scoping review and critical appraisal. J Med Internet Res. 2021;23(9):e29839. [FREE Full text] [CrossRef] [Medline]55]. To address this, we incorporated explainable machine learning techniques such as Shapley additive explanations (SHAP) and local interpretable model-agnostic explanations (LIME). To achieve these goals, our proposed HOPE model was structured into 3 stages of machine learning architecture: feature selection, dimensionality reduction, and classification followed by post hoc explainability analysis using SHAP and LIME. Each stage plays a critical role in refining the data and ensuring that the final classification is both accurate and interpretable. Table 1 provides details on the various techniques used and evaluated at each stage.
Feature selection | Dimensionality reduction | Classification | Explainability analysis |
CFSa [Hall MA. Correlation-Based Feature Selection for Machine Learning [dissertation]. Hamilton, New Zealand. University of Waikato; 1999. URL: https://www.lri.fr/~pierres/donn%E9es/save/these/articles/lpr-queue/hall99correlationbased.pdf [accessed 2025-01-14] 56] | PCAb [Abdi H, Williams LJ. Principal component analysis. WIREs Computational Stats. 2010;2(4):433-459. [CrossRef]57] | NBc [Rish I. An empirical study of the naive Bayes classifier. 2001. Presented at: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence; August 4, 2001; Seattle, WA. URL: https://www.semanticscholar.org/paper/An-empirical-study-of-the-naive-Bayes-classifier-Watson/2825733f97124013e8841b3f8a0f5bd4ee4af88a58] | SHAPd [Lundberg SM, Lee SI. A unified approach to interpreting model predictions. 2017. Presented at: NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems; December 4-9, 2017; Long Beach, CA. URL: https://papers.nips.cc/paper_files/paper/2017/hash/8a20a8621978632d76c43dfd28b67767-Abstract.html59] |
SFSe [Marcano-Cedeño A, Quintanilla-Domínguez J, Cortina-Januchs MG, Andina D. Feature selection using sequential forward selection and classification applying artificial metaplasticity neural network. 2010. Presented at: IECON 2010—36th Annual Conference on IEEE Industrial Electronics Society; November 7-10, 2010; Glendale, AZ. [CrossRef]60] | FAf [Fodor IK. A survey of dimension reduction techniques. Office of Scientific and Technical Information. 2002. URL: https://www.osti.gov/biblio/15002155 [accessed 2022-05-09] 61] | LRg [Jr DWH, Lemeshow S. Applied Logistic Regression. Hoboken, NJ. John Wiley & Sons; 2004. 62] | LIMEh [Peltola T. Local interpretable model-agnostic explanations of Bayesian predictive models via Kullback-Leibler projections. ArXiv. Preprint posted online on October 5, 2018. 2018. [CrossRef]63] |
MIi [Vergara JR, Estévez PA. A review of feature selection methods based on mutual information. Neural Comput Appl. 2013;24(1):175-186. [CrossRef]64] | LDAj [Balakrishnama S, Ganapathiraju A. Linear Discriminant Analysis—A Brief Tutorial [dissertation]. Mississippi State University; 1995. URL: https://www.semanticscholar.org/paper/LINEAR-DISCRIMINANT-ANALYSIS-A-BRIEF-TUTORIAL-Balakrishnama-Ganapathiraju/163bb761cd7c5ca8c786bf4816a10308f1396ffb [accessed 2024-10-29] 65] | kNNk [Peterson LE. K-nearest neighbor. Scholarpedia. 2009;10(2):1883. [CrossRef]66] | —l |
SelectKBest [Tariq MA. A study on comparative analysis of feature selection algorithms for students grades prediction. J Inf Organ Sci. 2024;48(1):133-147. [CrossRef]67] | kPCAm [Cao LJ, Chua KS, Chong WK, Lee HP, Gu QM. A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine. Neurocomputing. 2003;55(1-2):321-336. [CrossRef]68] | SVMn [Hearst MA, Dumais ST, Osuna E, Platt J, Scholkopf B. Support vector machines. IEEE Intell Syst Appl. 1998;13(4):18-28. [CrossRef] [Medline]69] | — |
RFEo [Granitto PM, Furlanello C, Biasioli F, Gasperi F. Recursive feature elimination with random forest for PTR-MS analysis of agroindustrial products. Chemometrics Intell Lab Syst. 2006;83(2):83-90. [CrossRef]70] | — | Decision tree [Song Y, Lu Y. Decision tree methods: applications for classification and prediction. Shanghai Arch Psychiatry. 2015;27(2):130-135. [FREE Full text] [CrossRef] [Medline]71] | — |
— | — | RFp [Breiman L. Random forests. Machine Learning. 2001;45(1):5-32. [CrossRef]72] | — |
— | — | GBMq [Friedman JH. Greedy function approximation: A gradient boosting machine. Ann. Statist. 2001;29(5):1189-1232. [CrossRef]73] | — |
— | — | XGboost [Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H, et al. xgboost: Extreme Gradient Boosting. R Foundation for Statistical Computing. 2014. URL: https://cran.r-project.org/web/packages/xgboost/index.html [accessed 2024-07-24] 74] | — |
— | — | LightGBM [Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, et al. LightGBM: A highly efficient gradient boosting decision tree. 2017. Presented at: NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems; December 4-9, 2017; Long Beach, CA.75] | — |
— | — | Voting classifier [Ruta D, Gabrys B. Classifier selection for majority voting. Inf Fusion. 2005;6(1):63-81. [CrossRef]76] | — |
— | — | Bagging classifier [Zareapoor M, Shamsolmoali P. Application of credit card fraud detection: based on bagging ensemble classifier. Procedia Comput Sci. 2015;48:679-685. [CrossRef]77] | — |
— | — | AdaBoost [Hastie T, Rosset S, Zhu J, Zou H. Multi-class AdaBoost. Stat Interface. 2009;2(3):349-360. [CrossRef]78] | — |
aCFS: correlation-based selection.
bPCA: principal component analysis.
cNB: naive Bayes.
dSHAP: Shapley addictive explanations.
eSFS: sequential forward selection.
fFA: factor analysis.
gLR: logistic regression.
hLIME: local interpretable model-agnostic explanations.
iMI: mutual information.
jLDA: linear discriminant analysis.
kkNN: k-nearest neighbor.
lNot applicable.
mkPCA: kernel principal component analysis.
nSVM: support vector machine.
oRFE: recursive feature elimination.
pRF: random forest.
qGBM: gradient boosting machine.
Feature selection is performed to reduce the dimensionality of the dataset by identifying the most relevant features for depression classification, enhancing both the speed and accuracy of the classification model [Khalid S, Khalil T, Nasreen S. A survey of feature selection and feature extraction techniques in machine learning. Sci Inf Conf. 2014:024-032. [FREE Full text] [CrossRef]79]. The reduced subset of features serves as input for the subsequent dimensionality reduction stage. The validity of the chosen selected features is investigated using correlation analysis in the Results section. Dimensionality reduction techniques are applied to further refine the feature set compared to the initial feature selection stage [Tsai CF, Sung YT. Ensemble feature selection in high dimension, low sample size datasets: parallel and serial combination approaches. Knowl Based Syst. 2020;203:106097. [CrossRef]80] and to minimize overfitting. The dimensionally reduced features from the second stage were then processed in the third stage, which focused on classification. In this stage, the classification model processes the features derived from the earlier stages to categorize samples into 2 target classes: “participants with depression” and “participants without depression.” The classification algorithm leverages the patterns identified in the features, such as sleep duration and interruptions, to make predictions. The classification task involved assigning a probability score to each sample, determining the likelihood of belonging to either class based on the relationships in the feature set. A decision boundary was then established to assign the final class label for each sample. The classification process was systematically evaluated to ensure robustness and reliability, focusing on separating the 2 groups effectively even with the small dataset. The machine learning classification pipeline was designed to minimize the risk of overfitting by using techniques such as feature selection and dimensionality reduction, ensuring that the most informative and relevant features were used for prediction. To conclude, we used explainable AI techniques to interpret the model’s predictions, focusing on identifying the most influential features and their impact on classification outcomes.
Following the training and evaluation of all potential combinations for each stage, the architecture using sequential forward selection (SFS) for feature selection, principal component analysis (PCA) for dimensionality reduction, and DT for classification emerged as the most effective configuration. This SFS-PCA-DT framework (Figure 1) demonstrated superior performance compared to other combinations.
Our proposed model is supported by different considerations. The SFS algorithm incrementally selects features to improve classification performance and is particularly suited for datasets with a smaller number of participants [Soares I, Dias J, Rocha H, Do Carmo Lopes M, Ferreira B. Feature selection in small databases: a medical-case study. 2016. Presented at: XIV Mediterranean Conference on Medical and Biological Engineering and Computing 2016; March 31-April 2, 2016; Paphos, Cyprus. [CrossRef]81]. Unlike filter methods, SFS acts as a wrapper technique that pairs with a machine learning classification algorithm, providing greater stability in performance [Soares I, Dias J, Rocha H, Do Carmo Lopes M, Ferreira B. Feature selection in small databases: a medical-case study. 2016. Presented at: XIV Mediterranean Conference on Medical and Biological Engineering and Computing 2016; March 31-April 2, 2016; Paphos, Cyprus. [CrossRef]81]. The method starts with no selected features and progressively adds them based on their ability to enhance cross-validation outcomes. In the second stage, PCA serves as a highly effective tool for reducing the dimensionality of data in an unsupervised manner [Lever J, Krzywinski M, Altman N. Principal component analysis. Nature Methods. 2017;14(7):641-642. [CrossRef]82]. It converts the initial set of features into a reduced number of uncorrelated components, maintaining the bulk of the data’s variance. This reduction is important for preventing overfitting, especially with limited sample sizes. As a classification algorithm, DT algorithm, known for its strength in binary classification tasks, effectively uses the streamlined feature set generated in earlier stages. It models data by learning simple decision rules inferred from the input features, creating a tree-like structure. Each internal node in the tree represents a decision based on a feature, each branch represents an outcome of the decision, and each leaf node represents a class label [Saed AAA, Jaharadak AA. Implementation with performance evaluation of decision tree classifier for uncertain data. Lit Rev. 2022;5(5):125-132. [CrossRef]83]. The integration of SFS, PCA, and DT results in an efficient model that aligns with established methodologies and theoretical principles in the field. In this study, comprising only 4 participants, the training and validation procedure were carefully designed to minimize overfitting and to achieve reliable model generalization. To this end, we used a 4-fold cross-validation strategy. Each fold consisted of 3 participants for training and 1 participant for testing, ensuring that every participant contributed to both training and testing in separate iterations. This approach was repeated 10 times with different random seeds to account for variations in the training process, further enhancing the robustness of the performance metrics. During the training phase, a range of hyperparameter optimization techniques, including random search [Bergstra J, Bengio Y. Random search for hyper-parameter optimization. J Mach Learn Res. 2012;13(1):281-305. [FREE Full text]84], Bayesian optimization [Frazier P. A tutorial on Bayesian optimization. ArXiv. Preprint posted online July 8, 2018. 2018. [FREE Full text] [CrossRef]85], and Hyperband [Li L, Jamieson KG, DeSalvo G, Rostamizadeh A, Talwalkar A. Hyperband: A novel bandit-based approach to hyperparameter optimization. J Mach Learn Res. 2016:6765-6816. [FREE Full text]86] was performed for each component of the 3-stage pipeline. For example, the DT classifier’s maximum depth, minimum sample split, and criterion parameters are tuned using Bayesian optimization within predefined search spaces. Similarly, for PCA, the optimal number of components is optimized to maximize variance retention while preventing overfitting. The SFS algorithm was guided by internal cross-validation within the training set to identify the most predictive subset of features.
Evaluation Metrics
To validate the effectiveness of our depression classification method, we used 4 evaluation metrics: accuracy [Shreffler J, Huecker MR. Diagnostic Testing Accuracy: Sensitivity, Specificity, Predictive Values and Likelihood Ratios. Treasure Island, FL. StatPearls Publishing; 2024. 87], sensitivity [Shreffler J, Huecker MR. Diagnostic Testing Accuracy: Sensitivity, Specificity, Predictive Values and Likelihood Ratios. Treasure Island, FL. StatPearls Publishing; 2024. 87], precision [Shreffler J, Huecker MR. Diagnostic Testing Accuracy: Sensitivity, Specificity, Predictive Values and Likelihood Ratios. Treasure Island, FL. StatPearls Publishing; 2024. 87], and F1-score. Accuracy provides a comprehensive measure of the model’s overall performance. Sensitivity helps ensure that the model accurately identifies as many true cases of depression as possible, minimizing the risk of missing individuals who actually have the condition [Liu W, Jiang Y, Zhang J, Ma Z. RFARN: Retinal vessel segmentation based on reverse fusion attention residual network. PLoS One. 2021;16(12):e0257256. [FREE Full text] [CrossRef] [Medline]88]. To further validate the stability of the model, we present the training and test accuracies against the hyperparameter variations, demonstrating the model convergence.
Ethical Considerations
The study received approval from McGill University’s Institutional Ethics Committee (A06-B18-21A), allowing data collection and analysis for this project. Written informed consent was collected from all participants prior to their involvement in the study. All collected data were anonymized immediately after collection, with no personally identifiable information retained to ensure participant confidentiality. Participants received a $20 e-gift card as compensation for their time. At every stage of the research, we adhered to the ethical principles outlined in the Declaration of Helsinki [Goodyear MDE, Krleza-Jeric K, Lemmens T. The Declaration of Helsinki. BMJ. 2007;335(7621):624-625. [FREE Full text] [CrossRef] [Medline]89] and the Tri-Council Policy Statement [Tri-Council policy statement: ethical conduct for research involving humans—TCPS 2 (2022). Government of Canada. URL: https://ethics.gc.ca/eng/policy-politique_tcps2-eptc2_2022.html [accessed 2024-10-29] 90].
Results
Clinical Study Insights
Six community-dwelling older adults residing in Montreal, Canada, were recruited for this study between May 2022 and September 2022. However, 2 participants withdrew due to internet connectivity issues, resulting in the use of data from the remaining 4 participants for the analysis. The EFS results indicated that 2 participants exhibited moderate frailty (scores ranging from 6 to 11), while the other 2 participants were classified as nonfrail (scores of 5 or below). GDS results suggested that 1 participant had depressive symptoms (score=10), while the other 3 participants did not (score range=0-4).
Some of the participants’ demographic and clinical characteristics are detailed in Table 2. Over a 6-month period, the activities of each participant were continuously monitored at 15-minute intervals using Wi-Fi motion sensors. Following the identification of potential input features derived from Wi-Fi signals and questionnaire data, we designed and developed a 3-stage architecture as outlined in the methodology section. To support future research and ensure the reproducibility of our findings, the code for our model is openly accessible on our lab’s GitHub repository [Shayan Nejadshamsi. AID project - Depression Github Repository. URL: https://github.com/rahimi-s-lab/HOPE-model [accessed 2025-02-07]
91].
Participant without depression (n=3) | Participant with depression (n=1) | |
Sex (female/male), n/n | 1/2 | 1/0 |
Age (years), mean (SD) | 67.05 (3.70) | 65.50 (0.00) |
Edmonton frailty scale, mean (SD) | 3.34 (3.21) | 7.00 (0.00) |
Geriatric depression scale, mean (SD) | 4.00 (1.00) | 10.00 (0.00) |
We assessed a total of 240 model configurations, which resulted from the combination of 5 feature selection methods, 4 dimensionality reduction techniques, and 12 classifiers. Each configuration was trained 10 times to account for variations in performance metrics, ensuring the robustness of our findings. To mitigate the overfitting risk, we used k-fold cross-validation [Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2015;12(85):2825-2830. [CrossRef]92]. We experimented with different initial feature sets to optimize model performance, and among the various hyperparameter tuning methods, Bayesian optimization consistently yielded superior results.
Model Performance and Validation
This section presents the outcomes of our classification model evaluation. Table 3 summarizes the top-performing configurations among the 240 model variations that we tested. The SFS-PCA-DT model, which integrates SFS, PCA, and DT, emerged as the leading performer across multiple metrics. Its relatively high accuracy indicates the model’s ability to distinguish between individuals with and without depression. The model’s high sensitivity ensures that individuals with depression are correctly identified. This is critical in clinical settings, where depression, particularly among older adults, often goes unrecognized despite its severe impact on cognitive function [Akyol Y, Durmus D, Dogan C, Bek Y, Canturk F. Quality of life and level of depressive symptoms in the geriatric population. Turk J Rheumatol. 2010;25(4):165-173. [CrossRef]93-Ordu Gokkaya NK, Gokce-Kutsal Y, Borman P, Ceceli E, Dogan A, Eyigor S, et al. Pain and quality of life (QoL) in elderly: the Turkish experience. Arch Gerontol Geriatr. 2012;55(2):357-362. [CrossRef] [Medline]96], quality of life [Hsu HC. Physical function trajectories, depressive symptoms, and life satisfaction among the elderly in Taiwan. Aging Ment Health. 2009;13(2):202-212. [CrossRef] [Medline]97,Lue BH, Chen LJ, Wu SC. Health, financial stresses, and life satisfaction affecting late-life depression among older adults: a nationwide, longitudinal survey in taiwan. Arch Gerontol Geriatr. 2010;50:S34-S38. [CrossRef] [Medline]98], and mortality risk [Feng L, Yap KB, Ng TP. Depressive symptoms in older adults with chronic kidney disease: mortality, quality of life outcomes, and correlates. Am J Geriatr Psychiatry. 2013;21(6):570-579. [CrossRef] [Medline]99,Van der Weele GM, Gussekloo J, De Waal MWM, De Craen AJM, Van der Mast RC. Co-occurrence of depression and anxiety in elderly subjects aged 90 years and its relationship with functional status, quality of life and mortality. Int J Geriatr Psychiatry. 2009;24(6):595-601. [CrossRef] [Medline]100]. Ensuring high sensitivity reduces the likelihood of missed diagnoses, which is important for timely and effective treatment. However, the relatively high standard deviation in both accuracy and sensitivity suggests that the model’s performance may vary, indicating occasional instances of less reliable predictions.
Model | Accuracy (%) | Sensitivity (%) | Precision (%) | F1-score (%) |
SFSa – PCAb – DTc | 87.50 (12.50) | 90.00 (20.00) | 88.34 (18.34) | 86.00 (14.74) |
SFS + FAd + DT | 85.00 (16.58) | 83.34 (25.82) | 90.00 (20.00) | 81.67 (18.93) |
SFS + PCA + LRe | 82.50 (19.53) | 85.00 (22.91) | 83.34 (25.82) | 80.00 (20.82) |
SFS + PCA + SVMf | 82.50 (19.53) | 90.00 (20.00) | 75.00 (22.42) | 80.00 (20.82) |
MIg + PCA + LR | 80.00 (10.00) | 85.00 (22.91) | 78.34 (22.42) | 76.00 (13.06) |
aSFS: sequential forward selection.
bPCA: principal component analysis.
cDT: decision tree.
dFA: factor analysis.
eLR: logistic regression.
fSVM: support vector machine.
gMI: mutual information.
Table 3 illustrates that SFS is frequently featured among the highest-performing models, including the top model with 87.50% accuracy, and 3 other strong contenders. Mutual information also proves effective, appearing in 1 model with 80.00% accuracy, indicating that both SFS and mutual information are potent feature selection techniques for depression classification with limited samples. PCA seems to be the preferred method for dimensionality reduction, being used in 4 of the top 5 models. Among classifiers, DT stands out, featuring in the top 2 models with 87.50% and 85.00% accuracy. Other classifiers, such as LR and SVM, also perform well, each appearing in models with an accuracy exceeding 80%. The results highlight interesting tradeoffs, such as the SFS + PCA + SVM model, which, while slightly lower in accuracy (82.50%), maintains a high sensitivity (90.00%). This supports the practice of evaluating models using multiple metrics, especially in situations where the application involves clinical diagnoses, where accurately identifying true positives is crucial.
Figure 2 displays the averaged confusion matrix for the top-performing SFS – PCA – DT model used to classify depression status. Due to the constraint of having only 4 samples, we used a 4-fold cross-validation strategy, with each fold being tested 10 times to ensure a thorough evaluation. The model showed promising results, accurately classifying 86% of individuals without depression as not having depression and 89% of individuals with depression as having depression. The average false positive rate, where individuals without depression were incorrectly classified as having it, was 14%, while the average false negative rate, where individuals with depression were incorrectly classified as not having it, was 11%.

To validate the convergence of the proposed algorithm, we analyzed the relationship between tree depth and accuracy on both the training and test datasets. Figure 3 demonstrates the training and test accuracy of the best performing model (SFS – PCA – DT) as a function of tree depth. The training accuracy increases consistently with tree depth, stabilizing at its maximum, reflecting that the model can fully capture the training data as depth increases. The test accuracy improves initially with increasing tree depth but stabilizes beyond a depth of 3. These observations confirm that the proposed algorithm achieves convergence in terms of performance tradeoffs between model complexity and generalization.

Feature Selection and Analysis
In our 3-stage classification model, we implement a combination of feature selection and dimensionality reduction techniques to improve the efficacy of our machine learning approach [Tsai CF, Sung YT. Ensemble feature selection in high dimension, low sample size datasets: parallel and serial combination approaches. Knowl Based Syst. 2020;203:106097. [CrossRef]80]. The features selected by SFS included “average sleep duration,” “total number of sleep interruptions,” “percentage of nights with sleep interruptions,” “average duration of sleep interruptions,” and “EFS.” Correlation analysis of these features revealed notable associations with depression status. Such analysis helps identify how variations in these features might be related to changes in depression, providing valuable insights for clinicians and researchers to develop more effective diagnostic tools and treatments.
Figure 4 illustrates a strong negative correlation between depression and average sleep duration. Conversely, depression was positively correlated with the total number of sleep interruptions, percentage of nights with sleep interruptions, average duration of sleep interruptions, and EFS. Blue cells indicate negative correlation values, while red cells represent positive correlations. Darker colors signify stronger correlations. The high correlation values highlight the significance of these factors in understanding and potentially classifying older adults with depression, aligning with findings from previous studies [Ben-Zeev D, Scherer EA, Wang R, Xie H, Campbell AT. Next-generation psychiatric assessment: using smartphone sensors to monitor behavior and mental health. Psychiatr Rehabil J. 2015;38(3):218-226. [FREE Full text] [CrossRef] [Medline]101-Jacobson NC, Weingarden H, Wilhelm S. Using digital phenotyping to accurately detect depression severity. J Nerv Ment Dis. 2019;207(10):893-896. [CrossRef] [Medline]105]. For example, Vallance et al [Vallance JK, Eurich D, Lavallee C, Johnson ST. Daily pedometer steps among older men: associations with health-related quality of life and psychosocial health. Am J Health Promot. 2013;27(5):294-298. [CrossRef] [Medline]106] demonstrated that engaging in daily activities can alleviate the adverse effects of depression among older adults. Furthermore, several studies have highlighted a connection between depression and frailty [Aprahamian I, Borges MK, Hanssen DJC, Jeuring HW, Oude Voshaar RC. The frail depressed patient: a narrative review on treatment challenges. Clin Interv Aging. 2022;17:979-990. [FREE Full text] [CrossRef] [Medline]107,Lohman MC, Mezuk B. Frailty and depression: comorbidity in the context of imperfect measurement. J Am Geriatr Soc. 2013;61(3):474. [FREE Full text] [CrossRef] [Medline]108]. Vaughan et al [Vaughan L, Corbin AL, Goveas JS. Depression and frailty in later life: a systematic review. Clin Interv Aging. 2015;10:1947-1958. [FREE Full text] [CrossRef] [Medline]109] showed that the prevalence of both depression and frailty among individuals aged 55 years and older exceeds 10%. These findings confirm the association between depression and the features incorporated in our model.

Comparative Analysis With Baseline Models
In this section, we evaluate the performance of our proposed 3-stage architecture for depression classification, which leverages Wi-Fi–based contextual human activity data, against baseline models previously outlined in the introduction. Although direct comparisons are inherently difficult due to differences in data acquisition methods (wearable devices), feature sets, and sample sizes, this analysis serves to contextualize the effectiveness of our approach for our case study with a limited number of participants.
For the baseline models, we selected the most current classification architectures used in the literature for depression classification. To ensure a fair comparison, each baseline model was trained and tested using the same feature set applied in our experiment. The resulting performance metrics for each baseline model are presented in Table 4. Despite the challenges associated with our smaller sample size, the comparison offers valuable insights into the relative efficacy of our method.
Model architecture | Accuracy (%) | Sensitivity (%) | Precision (%) | F1-score (%) |
RFa [Adamczyk J, Malawski F. Comparison of manual and automated feature engineering for daily activity classification in mental disorder diagnosis. Comput Inf. 2021;40(4):850-879. [CrossRef]23,Griffiths C, da Silva KM, Leathlean C, Jiang H, Ang CS, Searle R. Investigation of physical activity, sleep, and mental health recovery in treatment resistant depression (TRD) patients receiving repetitive transcranial magnetic stimulation (rTMS) treatment. J Affect Disord Rep. 2022;8:100337. [FREE Full text] [CrossRef] [Medline]27] | 12.50 | 5.00 | 5.00 | N/Ab |
SVMc [Adamczyk J, Malawski F. Comparison of manual and automated feature engineering for daily activity classification in mental disorder diagnosis. Comput Inf. 2021;40(4):850-879. [CrossRef]23] | 15.00 | 10.00 | 10.00 | N/A |
LRd [Adamczyk J, Malawski F. Comparison of manual and automated feature engineering for daily activity classification in mental disorder diagnosis. Comput Inf. 2021;40(4):850-879. [CrossRef]23] | 22.50 | 15.00 | 13.34 | N/A |
XGBoost [Opoku Asare K, Moshe I, Terhorst Y, Vega J, Hosio S, Baumeister H, et al. Mood ratings and digital biomarkers from smartphone and wearable data differentiates and predicts depression status: a longitudinal data analysis. Pervasive Mobile Comput. 2022;83:101621. [CrossRef]34] | 25.00 | 10.00 | 10.00 | N/A |
L1-based feature selection + DTe [Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. 2021;9(3):e24365. [FREE Full text] [CrossRef] [Medline]24] | 32.50 | 35.00 | 18.34 | N/A |
L1-based feature selection + RF [Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. 2021;9(3):e24365. [FREE Full text] [CrossRef] [Medline]24] | 22.50 | 15.00 | 13.34 | N/A |
L1-based feature selection + kNNf [Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. 2021;9(3):e24365. [FREE Full text] [CrossRef] [Medline]24] | 22.50 | 15.00 | 13.34 | N/A |
L1-based feature selection + NBg [Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. 2021;9(3):e24365. [FREE Full text] [CrossRef] [Medline]24] | 30.00 | 25.00 | 15.00 | N/A |
L1-based feature selection + LR [Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. 2021;9(3):e24365. [FREE Full text] [CrossRef] [Medline]24] | 37.50 | 45.00 | 25.00 | N/A |
L1-based feature selection + SVM [Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. 2021;9(3):e24365. [FREE Full text] [CrossRef] [Medline]24] | 25.00 | 10.00 | 10.00 | N/A |
Randomized LR + AdaBoost [Chikersal P, Doryab A, Tumminia M, Villalba DK, Dutcher JM, Liu X, et al. Detecting depression and predicting its onset using longitudinal symptoms captured by passive sensing. ACM Trans Comput Hum Interact. 2021;28(1):1-41. [CrossRef]25] | 55.00 | 73.33 | 55.00 | N/A |
HOPE modelh | 87.50 | 90.00 | 88.34 | 86.00 |
aRF: random forest.
bN/A: data not applicable.
cSVM: support vector machine.
dLR: logistic regression.
eDT: decision tree.
fkNN: k-nearest neighbor.
gNB: naive Bayes.
hBest performed proposed model.
As shown in Table 4, traditional single-stage machine learning classifiers such as RF, LR, and SVM demonstrate relatively lower performance, with accuracy ranging from 12.50% to 22.50%. Among these, LR achieves relatively higher accuracy. XGBoost exhibits better performance than the traditional models. Incorporating feature selection techniques further improves the performance of these models. Specifically, combining L1-based feature selection with various classifiers results in modest performance gains, while the randomized LR combined with AdaBoost achieves a significant improvement, reaching 55.00% accuracy and 73.33% sensitivity. Our proposed 3-stage architectures significantly surpass all other baseline models across all metrics. While most baseline models struggle with sensitivity and precision, often scoring below 15%, our best proposed model demonstrates substantial enhancements in these metrics with 90.00% sensitivity and 88.34% precision, indicating a superior capability to correctly identify positive cases and reduce false positives.
Model Explainability
To enhance our understanding of the decision-making processes within our proposed model, we used SHAP [Yesavage JA. The use of self-rating depression scales in the elderly. In: Poon LW, Crook T, Davis KL, Eisdorfer C, Gurland BJ, Kaszniak AW, et al, editors. Handbook for Clinical Memory Assessment of Older Adults. Washington, DC. American Psychological Association; 1986. 50] and LIME [Ghadiri P, Yaffe MJ, Adams AM, Abbasgholizadeh-Rahimi S. Primary care physicians' perceptions of artificial intelligence systems in the care of adolescents' mental health. BMC Prim Care. 2024;25(1):215. [FREE Full text] [CrossRef] [Medline]54] for model interpretability analysis. These model-agnostic methods can be applied across various machine learning models, providing valuable insights into our model’s predictive behavior. By integrating these interpretability techniques, we aim to improve the transparency and potential clinical relevance of our depression classification framework. These methods help us identify which features most significantly influence the model’s predictions, particularly in the context of depression classification.
SHAP Analysis
The SHAP waterfall plot (Figure 5) illustrates the relative importance of features for depression classification, with red and blue colors representing positive and negative contributions, respectively. Among the features, “the percentage of nights with sleep interruptions” is the most impactful, positively correlating with depression risk, indicating that frequent sleep disturbances are a strong predictor of depression. Conversely, the average sleep duration exhibits a substantial negative impact on depression prediction, suggesting that longer sleep durations are associated with a reduced likelihood of depression. Sleep-related variables continue to play a pivotal role in the model’s predictions; both the total number of sleep interruptions and the average duration of these interruptions contribute positively to depression risk, further underscoring the importance of uninterrupted sleep in depression diagnosis. Although the frailty scale is included in the model, its influence is relatively minor compared to sleep-related features.

LIME Analysis
The LIME plot (Figure 6) provides a complementary view of feature importance, with green and red colors indicating positive and negative influences, respectively. Consistent with the SHAP results, LIME identifies “the percentage of nights with sleep interruptions” as the most critical feature in the classification of depression of our proposed model. Similarly, the average sleep duration is shown to have a significant negative impact on depression classification, in line with SHAP findings. The total number of sleep interruptions also ranks highly with a positive influence on depression risk, again aligning with SHAP results. A notable difference between the 2 methods is the relatively lower impact of the frailty scale in the LIME analysis, which requires further investigations.

Discussion
Our research uses Wi-Fi–based motion sensors to extract daily activities, which are then used in our proposed machine learning method for depression classification.
Our study findings confirm the feasibility of using Wi-Fi–based motion sensors for depression classification among older adults. Our proposed HOPE (Home-Based Older Adults’ Depression Prediction) model achieved an accuracy of 87.5%, sensitivity of 90%, and precision of 88.3%. The most influential features identified were sleep-related metrics, such as average sleep duration and sleep interruptions, highlighting the importance of sleep patterns in depression classification. These findings suggest that Wi-Fi–based monitoring offers a nonintrusive and effective alternative to conventional wearable technologies for depression assessment. These conventional methods, while effective, often present challenges in terms of participant compliance, particularly among older adults, due to their burdensome and sometimes uncomfortable nature. In contrast, our Wi-Fi–based approach is nonintrusive and allows for continuous monitoring without requiring participants to wear or interact with any devices. This can significantly enhance participant compliance and the integrity of the data collected over extended periods. Compared to other nonintrusive monitoring technologies, such as camera-based methods, our Wi-Fi–based approach has distinct advantages. Wi-Fi infrastructure is prevalent in most homes and does not pose privacy risks, making it a cost-effective and scalable solution for continuous health monitoring. Furthermore, unlike previous studies that rely on microlevel body displacement and accelerometer data, our study emphasizes macrolevel physical activity features like sleep patterns and overall activity levels shift is crucial as it highlights the potential of using broader, more easily obtainable metrics to assess depression status. Our findings demonstrate that these macrolevel features are not only feasible but also effective measures for depression classification, broadening the scope of nonintrusive monitoring technologies in mental health research. The next steps can be extracting more detailed types of human activity using nonintrusive Wi-Fi data and expanding more on using this type of data acquisition for depression classification.
Additionally, our proposed model demonstrates relatively high performance compared to other classification models presented in existing depression classification studies, even with a limited sample size. Many studies using physical activity data from wearable devices often benefit from larger datasets and frequently use single-stage classifiers or deep neural networks. These models generally show strong performance with abundant data; however, their effectiveness diminishes when applied to smaller datasets, such as the one in this feasibility study. To address the limitations imposed by our smaller sample size, we designed a 3-stage machine learning classification architecture, which combines feature selection, dimensionality reduction, and classification into a multistep process. This approach allows for the extraction of the most relevant features while minimizing noise, thereby improving classification performance. Despite the small sample size, our model consistently outperformed conventional single-stage classifiers, highlighting the strength of both the machine learning architecture and the selected human activity features—particularly sleep patterns and activity levels—used for depression classification. This also underscores the adaptability of our model to different data scales, making it a more versatile option for future research where data availability might be limited. While this model shows promising results, however, caution is needed in interpreting these results. Future work should aim to enhance its robustness and generalizability by expanding the dataset. Collecting Wi-Fi–based physical activity data from a larger and more diverse sample would not only improve the model’s statistical power but also allow for a more comprehensive evaluation of its performance across different population groups, such as varying age ranges and health conditions. This would be particularly valuable in developing a scalable solution for real-world applications. Additionally, the integration of advanced machine learning techniques, such as deep neural networks or hybrid models combining traditional classifiers with deep learning components, could further enhance classification accuracy.
Our study is distinctive not only in its methodological approach but also in its emphasis on model explainability, a crucial aspect often overlooked in prior research on depression classification. Explainability is essential in health care applications, where understanding the factors driving a model’s decision is critical for clinical adoption and trust. By using SHAP and LIME, we were able to dissect the decision-making process of our model and pinpoint the most influential features for classifying depression. Both explainability analyses converge on the identification of sleep interruption features as key predictors in the depression classification of our proposed model. Among these, the “percentage of nights with sleep interruptions,” “average sleep duration,” and “total number of sleep interruptions” emerged as the primary driving factors. These findings align with existing literature that highlights the strong correlation between sleep disturbances and depression. However, our approach goes a step further by quantifying the impact of these features on the classification outcomes, providing a more nuanced understanding of their role. These findings suggest that future tools for depression assessment may benefit from a stronger focus on sleep quality and patterns and further investigations are required in this regard.
The integration of sensors and AI is transforming health care, yet the application of these technologies in depression classification remains underdeveloped and lacks extensive investigation. This study aimed to create an automated machine learning system for the continuous, remote monitoring and assessment of daily physical activity among older adults in a home setting, with the goal of distinguishing between individuals with and without depression.
In summary, although there were some challenges, our results suggest that using Wi-Fi–based data to capture contextual human activities is a promising and efficient method for classifying depression. The model we developed, leveraging data from Wi-Fi motion sensors, showed strong potential in accurately identifying early signs of depression and paving the way for more advanced and accessible mental health monitoring technologies among community dwelling older adults..
Acknowledgments
This study was funded by the Ministry of Innovation of Quebec, McGill University, Aerial Tech, and Mila—Quebec AI Institute. The authors acknowledge support from these institutions. SAR is Canada Research Chair (Tier II) in Artificial Intelligence and Advanced Digital Primary Health Care, received salary support from a Research Scholar Junior 1 Career Development Award from the Fonds de Recherche du Québec-Santé during part of this project, and her research program is supported by the Natural Sciences Research Council Discovery (grant 2020-05246).
Data Availability
The data analyzed during this study are not publicly available due to restrictions placed by the McGill University Institutional Ethics Committee, but the deidentified data are available from the corresponding author on reasonable request.
Authors' Contributions
Conceptualization was led by SAR, who established the study’s goals, design, and research questions and obtained the funding for the project. The methodology was developed by SAR, and SN. The data collection was done by SAR and VK. Data curation was managed by SAR, SN, and VK. Formal analysis was conducted by SAR and SN. The original draft was written by SAR, and SN. Reviewing and editing were a collaborative effort with all authors. Supervision and overall project leadership were provided by SAR.
Conflicts of Interest
None declared.
References
- Abd-Alrazaq A, AlSaad R, Shuweihdi F, Ahmed A, Aziz S, Sheikh J. Systematic review and meta-analysis of performance of wearable artificial intelligence in detecting and predicting depression. NPJ Digit Med. 2023;6(1):84. [FREE Full text] [CrossRef] [Medline]
- Su YA, Si T. Progress and challenges in research of the mechanisms of anhedonia in major depressive disorder. Gen Psychiatr. 2022;35(1):e100724. [FREE Full text] [CrossRef] [Medline]
- Parra F, Benezeth Y, Yang F. Automatic assessment of emotion dysregulation in American, French, and Tunisian adults and new developments in deep multimodal fusion: cross-sectional study. JMIR Ment Health. 2022;9(1):e34333. [FREE Full text] [CrossRef] [Medline]
- Choi J, Lee S, Kim S, Kim D, Kim H. Depressed mood prediction of elderly people with a wearable band. Sensors. 2022;22(11):4174. [FREE Full text] [CrossRef] [Medline]
- Gao Y, Jia Z, Zhao L, Han S. The effect of activity participation in middle-aged and older people on the trajectory of depression in later life: national cohort study. JMIR Public Health Surveill. 2023;9:e44682. [FREE Full text] [CrossRef] [Medline]
- Song S, Seo Y, Hwang S, Kim HY, Kim J. Digital phenotyping of geriatric depression using a community-based digital mental health monitoring platform for socially vulnerable older adults and their community caregivers: 6-week living lab single-arm pilot study. JMIR Mhealth Uhealth. 2024;12:e55842. [FREE Full text] [CrossRef] [Medline]
- Colizzi M, Lasalvia A, Ruggeri M. Prevention and early intervention in youth mental health: Is it time for a multidisciplinary and trans-diagnostic model for care? Int J Ment Health Syst. 2020;14(1):23. [FREE Full text] [CrossRef] [Medline]
- Owen D, Antypas D, Hassoulas A, Pardiñas AF, Espinosa-Anke L, Collados JC. Enabling early health care intervention by detecting depression in users of web-based forums using language models: longitudinal analysis and evaluation. JMIR AI. 2023;2:e41205. [FREE Full text] [CrossRef] [Medline]
- Huang S, Wang Y, Li G, Hall BJ, Nyman TJ. Digital mental health interventions for alleviating depression and anxiety during psychotherapy waiting lists: systematic review. JMIR Ment Health. 2024;11:e56650. [FREE Full text] [CrossRef] [Medline]
- Smith KM, Renshaw PF, Bilello J. The diagnosis of depression: current and emerging methods. Compr Psychiatry. 2013;54(1):1-6. [FREE Full text] [CrossRef] [Medline]
- Minaeva O, Riese H, Lamers F, Antypa N, Wichers M, Booij SH. Screening for depression in daily life: development and external validation of a prediction model based on actigraphy and experience sampling method. J Med Internet Res. 2020;22(12):e22634. [FREE Full text] [CrossRef] [Medline]
- Glavin D, Grua EM, Nakamura CA, Scazufca M, Ribeiro Dos Santos E, Wong GHY, et al. Patient health questionnaire-9 item pairing predictiveness for prescreening depressive symptomatology: machine learning analysis. JMIR Ment Health. 2023;10:e48444. [FREE Full text] [CrossRef] [Medline]
- Camp N, Johnston J, Lewis MGC, Zecca M, Di Nuovo A, Hunter K, et al. Perceptions of in-home monitoring technology for activities of daily living: semistructured interview study with community-dwelling older adults. JMIR Aging. 2022;5(2):e33714. [FREE Full text] [CrossRef] [Medline]
- Chan A, Cai J, Qian L, Coutts B, Phan S, Gregson G, et al. In-home positioning for remote home health monitoring in older adults: systematic review. JMIR Aging. 2024;7:e57320. [FREE Full text] [CrossRef] [Medline]
- Gianfredi V, Blandi L, Cacitti S, Minelli M, Signorelli C, Amerio A, et al. Depression and objectively measured physical activity: a systematic review and meta-analysis. Int J Environ Res Public Health. 2020;17(10):3738. [FREE Full text] [CrossRef] [Medline]
- Arrieta H, Rezola-Pardo C, Echeverria I, Iturburu M, Gil SM, Yanguas JJ, et al. Physical activity and fitness are associated with verbal memory, quality of life and depression among nursing home residents: preliminary data of a randomized controlled trial. BMC Geriatr. 2018;18(1):80. [CrossRef] [Medline]
- Dillon CB, McMahon E, O'Regan G, Perry IJ. Associations between physical behaviour patterns and levels of depressive symptoms, anxiety and well-being in middle-aged adults: a cross-sectional study using isotemporal substitution models. BMJ Open. 2018;8(1):e018978. [FREE Full text] [CrossRef] [Medline]
- Jung S, Lee S, Lee S, Bae S, Imaoka M, Harada K, et al. Relationship between physical activity levels and depressive symptoms in community-dwelling older Japanese adults. Geriatr Gerontol Int. 2018;18(3):421-427. [CrossRef] [Medline]
- Ludwig VM, Bayley A, Cook DG, Stahl D, Treasure JL, Asthworth M, et al. Association between depressive symptoms and objectively measured daily step count in individuals at high risk of cardiovascular disease in South London, UK: a cross-sectional study. BMJ Open. 2018;8(4):e020942. [FREE Full text] [CrossRef] [Medline]
- Ku PW, Steptoe A, Liao Y, Sun WJ, Chen LJ. Prospective relationship between objectively measured light physical activity and depressive symptoms in later life. Int J Geriatr Psychiatry. 2018;33(1):58-65. [CrossRef] [Medline]
- Raudsepp L, Riso EM. Longitudinal association between objectively measured walking and depressive symptoms among Estonian older adults. J Aging Phys Act. 2017;25(4):639-645. [CrossRef] [Medline]
- Kumar P, Garg S, Garg A. Assessment of anxiety, depression and stress using machine learning models. Procedia Comput Sci. 2020;171:1989-1998. [CrossRef]
- Adamczyk J, Malawski F. Comparison of manual and automated feature engineering for daily activity classification in mental disorder diagnosis. Comput Inf. 2021;40(4):850-879. [CrossRef]
- Bai R, Xiao L, Guo Y, Zhu X, Li N, Wang Y, et al. Tracking and monitoring mood stability of patients with major depressive disorder by machine learning models using passive digital data: prospective naturalistic multicenter study. JMIR Mhealth Uhealth. 2021;9(3):e24365. [FREE Full text] [CrossRef] [Medline]
- Chikersal P, Doryab A, Tumminia M, Villalba DK, Dutcher JM, Liu X, et al. Detecting depression and predicting its onset using longitudinal symptoms captured by passive sensing. ACM Trans Comput Hum Interact. 2021;28(1):1-41. [CrossRef]
- Dai R, Kannampallil T, Zhang J, Lv N, Ma J, Lu C. Multi-task learning for randomized controlled trials. Proc ACM Interact Mob Wearable Ubiquitous Technol. 2022;6(2):1-23. [CrossRef]
- Griffiths C, da Silva KM, Leathlean C, Jiang H, Ang CS, Searle R. Investigation of physical activity, sleep, and mental health recovery in treatment resistant depression (TRD) patients receiving repetitive transcranial magnetic stimulation (rTMS) treatment. J Affect Disord Rep. 2022;8:100337. [FREE Full text] [CrossRef] [Medline]
- Espino-Salinas CH, Galván-Tejada CE, Luna-García H, Gamboa-Rosales H, Celaya-Padilla JM, Zanella-Calzada LA, et al. Two-dimensional convolutional neural network for depression episodes detection in real time using motor activity time series of depresjon dataset. Bioengineering. 2022;9(9):458. [FREE Full text] [CrossRef] [Medline]
- Jakobsen P, Garcia-Ceja E, Riegler M, Stabell LA, Nordgreen T, Torresen J, et al. Applying machine learning in motor activity time series of depressed bipolar and unipolar patients compared to healthy controls. PLoS One. 2020;15(8):e0231995. [FREE Full text] [CrossRef] [Medline]
- Jung D, Kim J, Mun KR. Identifying depression in the elderly using gait accelerometry. Annu Int Conf IEEE Eng Med Biol Soc. 2022;2022:4946-4949. [CrossRef] [Medline]
- Kulam S. Time-series classification with uni-dimensional convolutional neural networks: An experimental comparison with long short-term memory networks [dissertation]. University of Oslo; 2019. URL: https://www.duo.uio.no/handle/10852/73250 [accessed 2024-12-07]
- Kumar A, Sangwan SR, Arora A, Menon VG. Depress-DCNF: A deep convolutional neuro-fuzzy model for detection of depression episodes using IoMT. Appl Soft Comput. 2022;122:108863. [CrossRef]
- Nguyen DK, Chan CL, Adams Li AH, Phan DV. Deep stacked generalization ensemble learning models in early diagnosis of depression illness from wearable devices data. 2021. Presented at: ICMHI 2021: 2021 5th International Conference on Medical and Health Informatics; May 14-16, 2021; Kyoto, Japan. [CrossRef]
- Opoku Asare K, Moshe I, Terhorst Y, Vega J, Hosio S, Baumeister H, et al. Mood ratings and digital biomarkers from smartphone and wearable data differentiates and predicts depression status: a longitudinal data analysis. Pervasive Mobile Comput. 2022;83:101621. [CrossRef]
- Park C, Mishra R, Golledge J, Najafi B. Digital biomarkers of physical frailty and frailty phenotypes using sensor-based physical activity and machine learning. Sensors. 2021;21(16):5289. [FREE Full text] [CrossRef] [Medline]
- Akbari G, Nikkhoo M, Wang L, Chen CPC, Han DS, Lin YH, et al. Frailty level classification of the community elderly using Microsoft Kinect–based skeleton pose: a machine learning approach. Sensors (Basel). 2021;21(12):4017. [FREE Full text] [CrossRef] [Medline]
- Utsumi T, Hashimoto M. A development of an early detection system of pre-frailty in senior citizens living inside. 2023. Presented at: 17th International Conference on Ubiquitous Information Management and Communication (IMCOM); Seoul, Korea; January 03-05, 2023. [CrossRef]
- Piau A, Wild K, Mattek N, Kaye J. Current state of digital biomarker technologies for real-life, home-based monitoring of cognitive function for mild cognitive impairment to mild Alzheimer disease and implications for clinical care: systematic review. J Med Internet Res. 2019;21(8):e12785. [FREE Full text] [CrossRef] [Medline]
- Tegou T, Kalamaras I, Tsipouras M, Giannakeas N, Votis K, Tzovaras D. A low-cost indoor activity monitoring system for detecting frailty in older adults. Sensors. 2019;19(3):452. [FREE Full text] [CrossRef] [Medline]
- Goonawardene N, Tan HP, Tan LB. Unobtrusive detection of frailty in older adults. In: Zhou J, Salvendy G, editors. Human Aspects of IT for the Aged Population Applications in Health, Assistance, and Entertainment. Cham, Switzerland. Springer International Publishing; 2018.
- Mohtadifar M, Cheffena M, Pourafzal A. Acoustic- and radio-frequency–based human activity recognition. Sensors. 2022;22(9):3125. [FREE Full text] [CrossRef] [Medline]
- Wang S, Zhou G. A review on radio based activity recognition. Digit Commun Netw. 2015;1(1):20-29. [CrossRef]
- Showmik IA, Sanam TF, Imtiaz H. Human activity recognition from Wi-Fi CSI data using principal component-based wavelet CNN. Digit Signal Process. 2023;138:104056. [CrossRef]
- Pulle V, Rozon M, Zandifar A, Yadavalli SH, Asghari VR, Minayi Jalil A, et al. Systems and methods for remote monitoring care using wireless sensing technologies. Justia. URL: https://patents.justia.com/patent/20240027564 [accessed 2024-10-29]
- Allegue M, Ghourchian N, Rozon M, Carignan E, Hopper M, Gonzalez JF, et al. WiFi motion intelligence: The fundamentals. Aerial Technologies Inc. URL: https://aerial.ai/files/common/The_fundamentals_Aerial_Telefonica_PCP-final_aligned.pdf [accessed 2025-01-14]
- Vitkovic J, Le C, Lee SL, Clark RA. The contribution of hearing and hearing loss to balance control. Audiol Neurootol. 2016;21(4):195-202. [CrossRef] [Medline]
- Dent E, Kowal P, Hoogendijk EO. Frailty measurement in research and clinical practice: a review. Eur J Intern Med. 2016;31:3-10. [CrossRef] [Medline]
- Rolfson DB, Majumdar SR, Tsuyuki RT, Tahir A, Rockwood K. Validity and reliability of the Edmonton Frail Scale. Age Ageing. 2006;35(5):526-529. [FREE Full text] [CrossRef] [Medline]
- Perna S, Francis MD, Bologna C, Moncaglieri F, Riva A, Morazzoni P, et al. Performance of Edmonton Frail Scale on frailty assessment: its association with multi-dimensional geriatric conditions assessed with specific screening tools. BMC Geriatr. 2017;17(1):2. [FREE Full text] [CrossRef] [Medline]
- Yesavage JA. The use of self-rating depression scales in the elderly. In: Poon LW, Crook T, Davis KL, Eisdorfer C, Gurland BJ, Kaszniak AW, et al, editors. Handbook for Clinical Memory Assessment of Older Adults. Washington, DC. American Psychological Association; 1986.
- Norgeot B, Quer G, Beaulieu-Jones BK, Torkamani A, Dias R, Gianfrancesco M, et al. Minimum information about clinical artificial intelligence modeling: the MI-CLAIM checklist. Nat Med. 2020;26(9):1320-1324. [FREE Full text] [CrossRef] [Medline]
- Heo YJ. Population aging and house prices: Who are we calling old? J Econ Ageing. 2022;23:100417. [FREE Full text] [CrossRef]
- Sarawagi S, Bhamidipaty A. Interactive deduplication using active learning. 2002. Presented at: KDD02: The Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; July 23-26, 2002; Edmonton, Canada. [CrossRef]
- Ghadiri P, Yaffe MJ, Adams AM, Abbasgholizadeh-Rahimi S. Primary care physicians' perceptions of artificial intelligence systems in the care of adolescents' mental health. BMC Prim Care. 2024;25(1):215. [FREE Full text] [CrossRef] [Medline]
- Rahimi SA, Légaré F, Sharma G, Archambault P, Zomahoun HTV, Chandavong S, et al. Application of artificial intelligence in community-based primary health care: systematic scoping review and critical appraisal. J Med Internet Res. 2021;23(9):e29839. [FREE Full text] [CrossRef] [Medline]
- Hall MA. Correlation-Based Feature Selection for Machine Learning [dissertation]. Hamilton, New Zealand. University of Waikato; 1999. URL: https://www.lri.fr/~pierres/donn%E9es/save/these/articles/lpr-queue/hall99correlationbased.pdf [accessed 2025-01-14]
- Abdi H, Williams LJ. Principal component analysis. WIREs Computational Stats. 2010;2(4):433-459. [CrossRef]
- Rish I. An empirical study of the naive Bayes classifier. 2001. Presented at: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence; August 4, 2001; Seattle, WA. URL: https://www.semanticscholar.org/paper/An-empirical-study-of-the-naive-Bayes-classifier-Watson/2825733f97124013e8841b3f8a0f5bd4ee4af88a
- Lundberg SM, Lee SI. A unified approach to interpreting model predictions. 2017. Presented at: NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems; December 4-9, 2017; Long Beach, CA. URL: https://papers.nips.cc/paper_files/paper/2017/hash/8a20a8621978632d76c43dfd28b67767-Abstract.html
- Marcano-Cedeño A, Quintanilla-Domínguez J, Cortina-Januchs MG, Andina D. Feature selection using sequential forward selection and classification applying artificial metaplasticity neural network. 2010. Presented at: IECON 2010—36th Annual Conference on IEEE Industrial Electronics Society; November 7-10, 2010; Glendale, AZ. [CrossRef]
- Fodor IK. A survey of dimension reduction techniques. Office of Scientific and Technical Information. 2002. URL: https://www.osti.gov/biblio/15002155 [accessed 2022-05-09]
- Jr DWH, Lemeshow S. Applied Logistic Regression. Hoboken, NJ. John Wiley & Sons; 2004.
- Peltola T. Local interpretable model-agnostic explanations of Bayesian predictive models via Kullback-Leibler projections. ArXiv. Preprint posted online on October 5, 2018. 2018. [CrossRef]
- Vergara JR, Estévez PA. A review of feature selection methods based on mutual information. Neural Comput Appl. 2013;24(1):175-186. [CrossRef]
- Balakrishnama S, Ganapathiraju A. Linear Discriminant Analysis—A Brief Tutorial [dissertation]. Mississippi State University; 1995. URL: https://www.semanticscholar.org/paper/LINEAR-DISCRIMINANT-ANALYSIS-A-BRIEF-TUTORIAL-Balakrishnama-Ganapathiraju/163bb761cd7c5ca8c786bf4816a10308f1396ffb [accessed 2024-10-29]
- Peterson LE. K-nearest neighbor. Scholarpedia. 2009;10(2):1883. [CrossRef]
- Tariq MA. A study on comparative analysis of feature selection algorithms for students grades prediction. J Inf Organ Sci. 2024;48(1):133-147. [CrossRef]
- Cao LJ, Chua KS, Chong WK, Lee HP, Gu QM. A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine. Neurocomputing. 2003;55(1-2):321-336. [CrossRef]
- Hearst MA, Dumais ST, Osuna E, Platt J, Scholkopf B. Support vector machines. IEEE Intell Syst Appl. 1998;13(4):18-28. [CrossRef] [Medline]
- Granitto PM, Furlanello C, Biasioli F, Gasperi F. Recursive feature elimination with random forest for PTR-MS analysis of agroindustrial products. Chemometrics Intell Lab Syst. 2006;83(2):83-90. [CrossRef]
- Song Y, Lu Y. Decision tree methods: applications for classification and prediction. Shanghai Arch Psychiatry. 2015;27(2):130-135. [FREE Full text] [CrossRef] [Medline]
- Breiman L. Random forests. Machine Learning. 2001;45(1):5-32. [CrossRef]
- Friedman JH. Greedy function approximation: A gradient boosting machine. Ann. Statist. 2001;29(5):1189-1232. [CrossRef]
- Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H, et al. xgboost: Extreme Gradient Boosting. R Foundation for Statistical Computing. 2014. URL: https://cran.r-project.org/web/packages/xgboost/index.html [accessed 2024-07-24]
- Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, et al. LightGBM: A highly efficient gradient boosting decision tree. 2017. Presented at: NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems; December 4-9, 2017; Long Beach, CA.
- Ruta D, Gabrys B. Classifier selection for majority voting. Inf Fusion. 2005;6(1):63-81. [CrossRef]
- Zareapoor M, Shamsolmoali P. Application of credit card fraud detection: based on bagging ensemble classifier. Procedia Comput Sci. 2015;48:679-685. [CrossRef]
- Hastie T, Rosset S, Zhu J, Zou H. Multi-class AdaBoost. Stat Interface. 2009;2(3):349-360. [CrossRef]
- Khalid S, Khalil T, Nasreen S. A survey of feature selection and feature extraction techniques in machine learning. Sci Inf Conf. 2014:024-032. [FREE Full text] [CrossRef]
- Tsai CF, Sung YT. Ensemble feature selection in high dimension, low sample size datasets: parallel and serial combination approaches. Knowl Based Syst. 2020;203:106097. [CrossRef]
- Soares I, Dias J, Rocha H, Do Carmo Lopes M, Ferreira B. Feature selection in small databases: a medical-case study. 2016. Presented at: XIV Mediterranean Conference on Medical and Biological Engineering and Computing 2016; March 31-April 2, 2016; Paphos, Cyprus. [CrossRef]
- Lever J, Krzywinski M, Altman N. Principal component analysis. Nature Methods. 2017;14(7):641-642. [CrossRef]
- Saed AAA, Jaharadak AA. Implementation with performance evaluation of decision tree classifier for uncertain data. Lit Rev. 2022;5(5):125-132. [CrossRef]
- Bergstra J, Bengio Y. Random search for hyper-parameter optimization. J Mach Learn Res. 2012;13(1):281-305. [FREE Full text]
- Frazier P. A tutorial on Bayesian optimization. ArXiv. Preprint posted online July 8, 2018. 2018. [FREE Full text] [CrossRef]
- Li L, Jamieson KG, DeSalvo G, Rostamizadeh A, Talwalkar A. Hyperband: A novel bandit-based approach to hyperparameter optimization. J Mach Learn Res. 2016:6765-6816. [FREE Full text]
- Shreffler J, Huecker MR. Diagnostic Testing Accuracy: Sensitivity, Specificity, Predictive Values and Likelihood Ratios. Treasure Island, FL. StatPearls Publishing; 2024.
- Liu W, Jiang Y, Zhang J, Ma Z. RFARN: Retinal vessel segmentation based on reverse fusion attention residual network. PLoS One. 2021;16(12):e0257256. [FREE Full text] [CrossRef] [Medline]
- Goodyear MDE, Krleza-Jeric K, Lemmens T. The Declaration of Helsinki. BMJ. 2007;335(7621):624-625. [FREE Full text] [CrossRef] [Medline]
- Tri-Council policy statement: ethical conduct for research involving humans—TCPS 2 (2022). Government of Canada. URL: https://ethics.gc.ca/eng/policy-politique_tcps2-eptc2_2022.html [accessed 2024-10-29]
- Shayan Nejadshamsi. AID project - Depression Github Repository. URL: https://github.com/rahimi-s-lab/HOPE-model [accessed 2025-02-07]
- Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2015;12(85):2825-2830. [CrossRef]
- Akyol Y, Durmus D, Dogan C, Bek Y, Canturk F. Quality of life and level of depressive symptoms in the geriatric population. Turk J Rheumatol. 2010;25(4):165-173. [CrossRef]
- de Jonge P, Ormel J, Slaets JPJ, Kempen GIJM, Ranchor AV, van Jaarsveld CHM, et al. Depressive symptoms in elderly patients predict poor adjustment after somatic events. Am J Geriatr Psychiatry. 2004;12(1):57-64. [Medline]
- Helvik AS, Engedal K, Krokstad S, Stordal E, Selbæk G. A comparison of depressive symptoms in elderly medical inpatients and the elderly in a population-based health study (the Nord-Trøndelag Health Study 3). Nord J Psychiatry. 2012;66(3):189-197. [CrossRef] [Medline]
- Ordu Gokkaya NK, Gokce-Kutsal Y, Borman P, Ceceli E, Dogan A, Eyigor S, et al. Pain and quality of life (QoL) in elderly: the Turkish experience. Arch Gerontol Geriatr. 2012;55(2):357-362. [CrossRef] [Medline]
- Hsu HC. Physical function trajectories, depressive symptoms, and life satisfaction among the elderly in Taiwan. Aging Ment Health. 2009;13(2):202-212. [CrossRef] [Medline]
- Lue BH, Chen LJ, Wu SC. Health, financial stresses, and life satisfaction affecting late-life depression among older adults: a nationwide, longitudinal survey in taiwan. Arch Gerontol Geriatr. 2010;50:S34-S38. [CrossRef] [Medline]
- Feng L, Yap KB, Ng TP. Depressive symptoms in older adults with chronic kidney disease: mortality, quality of life outcomes, and correlates. Am J Geriatr Psychiatry. 2013;21(6):570-579. [CrossRef] [Medline]
- Van der Weele GM, Gussekloo J, De Waal MWM, De Craen AJM, Van der Mast RC. Co-occurrence of depression and anxiety in elderly subjects aged 90 years and its relationship with functional status, quality of life and mortality. Int J Geriatr Psychiatry. 2009;24(6):595-601. [CrossRef] [Medline]
- Ben-Zeev D, Scherer EA, Wang R, Xie H, Campbell AT. Next-generation psychiatric assessment: using smartphone sensors to monitor behavior and mental health. Psychiatr Rehabil J. 2015;38(3):218-226. [FREE Full text] [CrossRef] [Medline]
- Byrne JEM, Bullock B, Brydon A, Murray G. A psychometric investigation of the sleep, circadian rhythms, and mood (SCRAM) questionnaire. Chronobiol Int. 2019;36(2):265-275. [CrossRef] [Medline]
- Difrancesco S, Lamers F, Riese H, Merikangas KR, Beekman ATF, van Hemert AM, et al. Sleep, circadian rhythm, and physical activity patterns in depressive and anxiety disorders: a 2-week ambulatory assessment study. Depress Anxiety. 2019;36(10):975-986. [FREE Full text] [CrossRef] [Medline]
- Ghandeharioun A, Fedor S, Sangermano L, Ionescu D, Alpert J, Dale C, et al. Objective assessment of depressive symptoms with machine learning and wearable sensors data. 2017. Presented at: 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII); October 23-26, 2017; San Antonio, TX. [CrossRef]
- Jacobson NC, Weingarden H, Wilhelm S. Using digital phenotyping to accurately detect depression severity. J Nerv Ment Dis. 2019;207(10):893-896. [CrossRef] [Medline]
- Vallance JK, Eurich D, Lavallee C, Johnson ST. Daily pedometer steps among older men: associations with health-related quality of life and psychosocial health. Am J Health Promot. 2013;27(5):294-298. [CrossRef] [Medline]
- Aprahamian I, Borges MK, Hanssen DJC, Jeuring HW, Oude Voshaar RC. The frail depressed patient: a narrative review on treatment challenges. Clin Interv Aging. 2022;17:979-990. [FREE Full text] [CrossRef] [Medline]
- Lohman MC, Mezuk B. Frailty and depression: comorbidity in the context of imperfect measurement. J Am Geriatr Soc. 2013;61(3):474. [FREE Full text] [CrossRef] [Medline]
- Vaughan L, Corbin AL, Goveas JS. Depression and frailty in later life: a systematic review. Clin Interv Aging. 2015;10:1947-1958. [FREE Full text] [CrossRef] [Medline]
Abbreviations
AI: artificial intelligence |
CNN: convolutional neural network |
CSI: channel state information |
DT: decision tree |
EFS: Edmonton Frailty Scale |
GDS: Geriatric Depression Scale |
LIME: local interpretable model-agnostic explanations |
LR: logistic regression |
MI-CLAIM: Minimum Information About Clinical Artificial Intelligence Modeling |
PCA: principal component analysis |
RF: random forest |
RSSI: received signal strength indicator |
SFS: sequential forward selection |
SHAP: Shapely addictive explanations |
SVM: support vector machine |
Edited by Y Du; submitted 18.10.24; peer-reviewed by P-H Liao, C Silva; comments to author 05.12.24; revised version received 12.12.24; accepted 19.12.24; published 03.03.25.
Copyright©Shayan Nejadshamsi, Vania Karami, Negar Ghourchian, Narges Armanfard, Howard Bergman, Roland Grad, Machelle Wilchesky, Vladimir Khanassov, Isabelle Vedel, Samira Abbasgholizadeh Rahimi. Originally published in JMIR Aging (https://aging.jmir.org), 03.03.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Aging, is properly cited. The complete bibliographic information, a link to the original publication on https://aging.jmir.org, as well as this copyright and license information must be included.