Improving Patient Similarity Using Different Modalities of Phenotypes Extracted from Clinical Narratives

Stud Health Technol Inform. 2023 May 18:302:1037-1041. doi: 10.3233/SHTI230342.

Abstract

In the context of medical concept extraction, it is critical to determine if clinical signs or symptoms mentioned in the text were present or absent, experienced by the patient or their relatives. Previous studies have focused on the NLP aspect but not on how to leverage this supplemental information for clinical applications. In this paper, we aim to use the patient similarity networks framework to aggregate different phenotyping modalities. NLP techniques were applied to extract phenotypes and predict their modalities from 5470 narrative reports of 148 patients with ciliopathies (a group of rare diseases). Patient similarities were computed using each modality separately for aggregation and clustering. We found that aggregating negated phenotypes improved patient similarity, but further aggregating relatives' phenotypes worsened the result. We suggest that different modalities of phenotypes can contribute to patient similarity, but they should be aggregated carefully and with appropriate similarity metrics and aggregation models.

Keywords: deep phenotyping; experiencer; negated phenotype; patient similarity.

MeSH terms

  • Electronic Health Records*
  • Humans
  • Narration*
  • Natural Language Processing
  • Phenotype
  • Rare Diseases