%0 Journal Article %@ 2561-7605 %I JMIR Publications %V 8 %N %P e65195 %T Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis %A Qian,Xing Xing %A Chau,Pui Hing %A Fong,Daniel Y T %A Ho,Mandy %A Woo,Jean %K fall-related admissions %K electronic medical records %K text mining %K case detection %K natural language processing %D 2025 %7 8.7.2025 %9 %J JMIR Aging %G English %X Background: In order to address fall underestimation by the International Classification of Diseases (ICD) in clinical settings, information from clinical notes could be incorporated via natural language processing (NLP) as a possible solution. However, its application to inpatient notes has not been fully investigated. Objective: This study aims to develop and validate a rule-based NLP algorithm to identify falls based on inpatient admission notes from older patients. Methods: This retrospective study used 12-year electronic inpatient records of patients aged ≥65 years from public hospitals in Hong Kong. A random sample of 1000 patients was drawn to develop the NLP algorithm. Manual review was the gold standard for assessing the algorithm’s performance, with sensitivity, specificity, precision, and F1-score calculated at the record, episode, and patient levels. In addition, the study compared the number of falls identified by ICD codes and clinical notes independently and combined. Results: Our rule-based NLP algorithm showed excellent performance, with a sensitivity, specificity, precision, and F1-score of 93.3%, 99.0%, 87.5%, and 0.903 at the record and episode levels, and 92.9%, 98.3%, 89.7%, and 0.912 at the patient level. The combined identification strategy using ICD codes and the NLP method provided the most comprehensive capture of fall-related episodes and fallers. Conclusions: The NLP method proved efficient and accurate in detecting falls from clinical notes in inpatient episodes. For comprehensive capture of fall episodes and fallers, we recommend the combined use of the NLP algorithm and ICD codes, which should be applied in future fall epidemiology studies and clinical practice for identifying high-risk groups of fall interventions. %R 10.2196/65195 %U https://aging.jmir.org/2025/1/e65195 %U https://doi.org/10.2196/65195