Prediction of general medical admission length of stay with natural language processing and deep learning: a pilot study

Stephen Bacchi; Samuel Gluck; Yiran Tan; Ivana Chim; Joy Cheng; Toby Gilbert; David K Menon; Jim Jannes; Timothy Kleinig; Simon Koblar

doi:10.1007/s11739-019-02265-3

Intern Emerg Med. 2020 Sep;15(6):989-995. doi: 10.1007/s11739-019-02265-3. Epub 2020 Jan 2.

Authors

Stephen Bacchi¹ ², Samuel Gluck³ ⁴, Yiran Tan³ ⁴, Ivana Chim³, Joy Cheng³, Toby Gilbert³ ⁴, David K Menon⁵, Jim Jannes³ ⁴, Timothy Kleinig³ ⁴, Simon Koblar³ ⁴

Affiliations

¹ Neurology Department, Royal Adelaide Hospital, Port Road, Adelaide, SA, 5000, Australia.
² University of Adelaide, Adelaide, SA, 5005, Australia.
³ Neurology Department, Royal Adelaide Hospital, Port Road, Adelaide, SA, 5000, Australia.
⁴ University of Adelaide, Adelaide, SA, 5005, Australia.
⁵ Division of Anaesthesia, University of Cambridge, Cambridge, CB2 0QQ, UK.

PMID: 31898204
DOI: 10.1007/s11739-019-02265-3

Abstract

Length of stay (LOS) and discharge destination predictions are key parts of the discharge planning process for general medical hospital inpatients. Machine learning using natural language processing may be able to assist with accurate LOS and discharge destination prediction for this patient group. Emergency department triage and doctor notes were retrospectively collected on consecutive general medical and acute medical unit admissions to a single tertiary hospital over a 2-month period in 2019. These data were used to assess the feasibility of predicting LOS and discharge destination using natural language processing and a variety of machine learning models. A total of 313 patients were included in the study. The artificial neural network achieved the highest accuracy on the primary outcome of predicting whether a patient would remain in hospital for > 2 days (accuracy 0.82, area under the receiver operating characteristic curve 0.75, sensitivity 0.47 and specificity 0.97). When predicting LOS as an exact number of days, the artificial neural network achieved a mean absolute error of 2.9 days and a mean squared error of 16.8 on the test set. For the prediction of home as a discharge destination (vs any non-home alternative), all models performed similarly, with an accuracy of approximately 0.74. This study supports the feasibility of using natural language processing to predict general medical inpatient LOS and discharge destination. Further research with larger, more detailed datasets from multiple centres is indicated to optimise and examine the accuracy that may be achieved with such predictions.
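For readers less familiar with the metrics reported above, the following is a minimal Python sketch of how accuracy, sensitivity, specificity, mean absolute error and mean squared error are conventionally computed. It is not the authors' code. The function names and the toy labels are hypothetical and chosen only for illustration; they are not data from the study.

```python
# Illustrative sketch only: function names and toy values are hypothetical,
# not taken from the study.

def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity and specificity for 0/1 labels.

    Assumes both classes occur in y_true, so no denominator is zero.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),  # share of long stays detected
        "specificity": tn / (tn + fp),  # share of short stays detected
    }


def regression_errors(days_true, days_pred):
    """Mean absolute error and mean squared error for LOS in days."""
    errors = [p - t for t, p in zip(days_true, days_pred)]
    mae = sum(abs(e) for e in errors) / len(errors)
    mse = sum(e * e for e in errors) / len(errors)
    return mae, mse


# Hypothetical toy labels: 1 means the stay exceeded 2 days.
stay_over_2_days = [1, 1, 0, 0, 0, 0, 1, 0]
predicted_over_2 = [1, 0, 0, 0, 0, 0, 0, 0]
metrics = binary_metrics(stay_over_2_days, predicted_over_2)

# Hypothetical toy LOS values in days.
mae, mse = regression_errors([1, 3, 5, 2], [2, 3, 2, 4])

print(metrics)
print(mae, mse)
```

In the toy example the classifier rarely predicts a long stay, so specificity is high while sensitivity is low. The same qualitative pattern appears in the reported results (sensitivity 0.47, specificity 0.97), which is why accuracy alone can look favourable when short stays are the majority class.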

Keywords: Artificial intelligence; Deep learning; Machine learning; Natural language processing; Neural network; Prognostication.

MeSH terms

Aged
Aged, 80 and over
Deep Learning
Female
Forecasting / methods*
Hospitalization / statistics & numerical data*
Humans
Length of Stay / statistics & numerical data*
Length of Stay / trends
Male
Middle Aged
Natural Language Processing*
Patients' Rooms / organization & administration
Patients' Rooms / statistics & numerical data
Pilot Projects
Retrospective Studies