PARSE: A personalized clinical time-series representation learning framework via abnormal offsets analysis

Comput Methods Programs Biomed. 2023 Dec:242:107838. doi: 10.1016/j.cmpb.2023.107838. Epub 2023 Oct 5.

Abstract

Background and objective: Clinical risk prediction of patients is an important research issue in the field of healthcare, which is of great significance for the diagnosis, treatment and prevention of diseases. In recent years, a large number of deep learning-based methods have been proposed for clinical prediction by mining relevant features of patients' health condition from historical Electronic Health Records (EHRs) data. However, most of these existing methods only focus on discovering the time series characteristics of physiological indexes such as laboratory tests and physical examinations, and fail to comprehensively consider the deviation degree of these physiological indexes from the normal range and their stability, thus greatly limiting the prediction performance.

Methods: We propose a personalized clinical time-series representation learning framework via abnormal offsets analysis named PARSE for clinical risk prediction. In PARSE, while extracting relevant temporal features from the original EHR data, we further capture relevant features of abnormal condition as complementary information from the absolute offset of each physiological index's observed values from its normal value and the relative offset between each physiological index's observed values in two adjacent time steps. Finally, an adaptive fusion module is introduced to effectively integrate the above features to obtain the personalized patient's representations for clinical risk prediction.

Results: We conduct an in-hospital mortality prediction task on two public real-world datasets. PARSE achieves the highest F1 scores of 48.1% and 40.3%, outperforming the state-of-the-art methods with a boost of 2.4% and 6.2% on two datasets respectively. Furthermore, the results of ablation experiments demonstrate that the two abnormal offsets and the proposed adaptive fusion method are contributing.

Conclusions: PARSE can better extract the risk-related information from the EHRs data and improve the personalization of the patients' representations. Each part of PARSE improves the final prediction performance independently.

Keywords: Clinical risk prediction; Deep learning; Electronic health records; Representation learning; Time-series data.

MeSH terms

  • Electronic Health Records*
  • Humans
  • Time Factors