Screening for frequent hospitalization risk among community-dwelling older adult between 2016 and 2023: machine learning-driven item selection, scoring system development, and prospective validation

Eman Leung; Jingjing Guan; Qingpeng Zhang; Chun Cheung Ching; Hiliary Yee; Yilin Liu; Hang Sau Ng; Richard Xu; Hector Wing Hong Tsang; Albert Lee; Frank Youhua Chen

doi:10.3389/fpubh.2024.1413529

Screening for frequent hospitalization risk among community-dwelling older adult between 2016 and 2023: machine learning-driven item selection, scoring system development, and prospective validation

Front Public Health. 2024 Nov 27:12:1413529. doi: 10.3389/fpubh.2024.1413529. eCollection 2024.

Authors

Eman Leung^#^{1

2}, Jingjing Guan^#^{2

3}, Qingpeng Zhang⁴, Chun Cheung Ching², Hiliary Yee⁵, Yilin Liu², Hang Sau Ng⁶, Richard Xu⁵, Hector Wing Hong Tsang^{5

7}, Albert Lee^{2

5

7}, Frank Youhua Chen¹

Affiliations

¹ Department of Management Sciences, City University of Hong Kong, Kowloon, Hong Kong SAR, China.
² JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, China.
³ Epitelligence, Hong Kong, Hong Kong SAR, China.
⁴ Department of Pharmacology and Pharmacy, HKU Musketeers Foundation Institute of Data Science, The University of Hong Kong, Pokfulam, China.
⁵ Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong SAR, China.
⁶ People Service Centre, Kowloon, Hong Kong SAR, China.
⁷ Mental Health Research Centre, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong SAR, China.

^# Contributed equally.

Abstract

Background: Screening for frequent hospitalizations in the community can help prevent super-utilizers from growing in the inpatient population. However, the determinants of frequent hospitalizations have not been systematically examined, their operational definitions have been inconsistent, and screening among community members lacks tools. Nor do we know if what determined frequent hospitalizations before COVID-19 continued to be the determinant of frequent hospitalizations at the height of the pandemic. Hence, the current study aims to identify determinants of frequent hospitalization and their screening items developed from the Comprehensive Geriatric Assessment (CGA), as our 273-item CGA is too lengthy to administer in full in community or primary care settings. The stability of the identified determinants will be examined in terms of the prospective validity of pre-COVID-selected items administered at the height of the pandemic.

Methods: Comprehensive Geriatric Assessments (CGAs) were administered between 2016 and 2018 in the homes of 1,611 older adults aged 65+ years. Learning models were deployed to select CGA items to maximize the classification of different operational definitions of frequent hospitalizations, ranging from the most inclusive definition, wherein two or more hospitalizations over 2 years, to the most exclusive, wherein two or more hospitalizations must appear during year two, reflecting different care needs. In addition, the CGA items selected by the best-performing learning model were then developed into a random-forest-based scoring system for assessing frequent hospitalization risk, the validity of which was tested during 2018 and again prospectively between 2022 and 2023 in a sample of 329 older adults recruited from a district adjacent to where the CGAs were initially performed.

Results: Seventeen items were selected from the CGA by our best-performing algorithm (DeepBoost), achieving 0.90 AUC in classifying operational definitions of frequent hospitalizations differing in temporal distributions and care needs. The number of medications prescribed and the need for assistance with emptying the bowel, housekeeping, transportation, and laundry were selected using the DeepBoost algorithm under the supervision of all operational definitions of frequent hospitalizations. On the other hand, reliance on walking aids, ability to balance on one's own, history of chronic obstructive pulmonary disease (COPD), and usage of social services were selected in the top 10 by all but the operational definitions that reflect the greatest care needs. The prospective validation of the original risk-scoring system using a sample recruited from a different district during the COVID-19 pandemic achieved an AUC of 0.82 in differentiating those rehospitalized twice or more over 2 years from those who were not.

Conclusion: A small subset of CGA items representing one's independence in aspects of (instrumental) activities of daily living, mobility, history of COPD, and social service utilization are sufficient for community members at risk of frequent hospitalization. The determinants of frequent hospitalization represented by the subset of CGA items remain relevant over the course of COVID-19 pandemic and across sociogeography.

Keywords: COVID-19; artificial intelligence: machine learning and deep learning; data science; health risk assessment; patient readmission; public health: preventive medicine.

MeSH terms

Aged
Aged, 80 and over
COVID-19* / epidemiology
Female
Geriatric Assessment* / methods
Hospitalization* / statistics & numerical data
Humans
Independent Living*
Machine Learning*
Male
Mass Screening / statistics & numerical data
Prospective Studies
Risk Assessment

Grants and funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Strategic Public Policy Research Funding Scheme (project number S2019.A4.015.19S) awarded to Professors Albert Lee and Eman Leung, Community Involvement Fund, Home Affairs Department of HKSAR awarded to Professors Eman Leung and Albert Lee, and the Sino International Industrial Limited’s charitable donation awarded to Professor Frank Chen. The funders only provided financial support and had not interfered with how the research is being conducted or presented.