Temporal electronic phenotyping by mining careflows of breast cancer patients

J Biomed Inform. 2017 Feb:66:136-147. doi: 10.1016/j.jbi.2016.12.012. Epub 2017 Jan 3.

Abstract

In this work we present a careflow mining approach designed to analyze heterogeneous longitudinal data and to identify phenotypes in a patient cohort. The main idea underlying our approach is to combine methods derived from sequential pattern mining and temporal data mining to derive frequent healthcare histories (careflows) in a population of patients. This approach was applied to an integrated data repository containing clinical and administrative data of more than 4000 breast cancer patients. We used the mined histories to identify sub-cohorts of patients grouped according to healthcare activities pathways, then we characterized these sub-cohorts with clinical data. In this way, we were able to perform temporal electronic phenotyping of electronic health records (EHR) data.

Keywords: Careflow mining; Electronic phenotyping; Heterogeneous data sets; Temporal data mining.

MeSH terms

  • Breast Neoplasms / diagnosis
  • Breast Neoplasms / therapy*
  • Data Mining*
  • Delivery of Health Care
  • Electronic Health Records*
  • Electronics
  • Female
  • Humans
  • Patient Care / statistics & numerical data*