Multi-modal Patient Cohort Identification from EEG Report and Signal Data

AMIA Annu Symp Proc. 2017 Feb 10:2016:1794-1803. eCollection 2016.

Abstract

Clinical electroencephalography (EEG) is the most important investigation in the diagnosis and management of epilepsies. An EEG records the electrical activity along the scalp and measures spontaneous electrical activity of the brain. Because the EEG signal is complex, its interpretation is known to produce moderate inter-observer agreement among neurologists. This problem can be addressed by providing clinical experts with the ability to automatically retrieve similar EEG signals and EEG reports through a patient cohort retrieval system operating on a vast archive of EEG data. In this paper, we present a multi-modal EEG patient cohort retrieval system called MERCuRY which leverages the heterogeneous nature of EEG data by processing both the clinical narratives from EEG reports as well as the raw electrode potentials derived from the recorded EEG signal data. At the core of MERCuRY is a novel multimodal clinical indexing scheme which relies on EEG data representations obtained through deep learning. The index is used by two clinical relevance models that we have generated for identifying patient cohorts satisfying the inclusion and exclusion criteria expressed in natural language queries. Evaluations of the MERCuRY system measured the relevance of the patient cohorts, obtaining MAP scores of 69.87% and a NDCG of 83.21%.

MeSH terms

  • Abstracting and Indexing*
  • Electrodes
  • Electroencephalography*
  • Epilepsy / physiopathology*
  • Humans
  • Information Storage and Retrieval*
  • Information Systems*
  • Machine Learning
  • Natural Language Processing
  • Neural Networks, Computer