Improving Patient Similarity Using Different Modalities of Phenotypes Extracted from Clinical Narratives

Xiaoyi Chen; Carole Faviez; Marc Vincent; Sophie Saunier; Nicolas Garcelon; Anita Burgun

doi:10.3233/SHTI230342

Stud Health Technol Inform. 2023 May 18;302:1037-1041. doi: 10.3233/SHTI230342.

Authors

Xiaoyi Chen¹ ² ³, Carole Faviez² ³, Marc Vincent¹, Sophie Saunier⁴, Nicolas Garcelon¹ ² ³, Anita Burgun² ³ ⁵ ⁶

Affiliations

¹ Data Science Platform, Imagine Institute, Université de Paris Cité, Inserm UMR 1163, Paris, France.
² Inserm, Centre de Recherche des Cordeliers, Sorbonne Université, Université de Paris Cité, Paris, France.
³ HeKA, Inria Paris, Paris, France.
⁴ Laboratory of Renal Hereditary Diseases, Imagine Institute, Université de Paris Cité, Inserm UMR 1163, Paris, France.
⁵ Hôpital Necker-Enfants Malades, Département d'informatique médicale, Assistance Publique-Hôpitaux de Paris (AP-HP), Paris, France.
⁶ PaRis Artificial Intelligence Research InstitutE (PRAIRIE), France.

PMID: 37203576
DOI: 10.3233/SHTI230342

Abstract

In the context of medical concept extraction, it is critical to determine whether clinical signs or symptoms mentioned in the text were present or absent, and whether they were experienced by the patient or by their relatives. Previous studies have focused on the NLP aspect but not on how to leverage this supplemental information for clinical applications. In this paper, we aim to use the patient similarity networks framework to aggregate different phenotyping modalities. NLP techniques were applied to extract phenotypes and predict their modalities from 5470 narrative reports of 148 patients with ciliopathies (a group of rare diseases). Patient similarities were computed for each modality separately, then aggregated and used for clustering. We found that aggregating negated phenotypes improved patient similarity, but further aggregating relatives' phenotypes worsened the result. We suggest that different modalities of phenotypes can contribute to patient similarity, but they should be aggregated carefully and with appropriate similarity metrics and aggregation models.
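
The idea of computing one similarity per modality and then aggregating can be illustrated with a minimal sketch. The abstract does not state which similarity metric or aggregation model was used, so everything here is an assumption made for illustration: Jaccard similarity on phenotype sets, a weighted average as the aggregation step, and hypothetical patient identifiers and phenotype labels. Clustering is left out.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity_matrix(phenotypes):
    """Pairwise similarity for one modality.

    phenotypes maps a patient id to the set of phenotype labels
    extracted for that modality (assumed input shape).
    """
    patients = sorted(phenotypes)
    return {
        (p, q): jaccard(phenotypes[p], phenotypes[q])
        for p, q in combinations(patients, 2)
    }


def aggregate(matrices, weights=None):
    """Weighted average of per-modality similarities (assumed aggregation)."""
    weights = weights or [1.0] * len(matrices)
    total = sum(weights)
    return {
        pair: sum(w * m[pair] for w, m in zip(weights, matrices)) / total
        for pair in matrices[0]
    }


# Hypothetical toy data: one dict per modality, same patients in each.
present = {"p1": {"A", "B"}, "p2": {"A", "B", "C"}, "p3": {"D"}}
negated = {"p1": {"X"}, "p2": {"X"}, "p3": {"Y"}}

sim_present = similarity_matrix(present)
sim_negated = similarity_matrix(negated)
fused = aggregate([sim_present, sim_negated])
```

In this toy case, sharing a negated phenotype raises the fused similarity of p1 and p2 above their similarity on present phenotypes alone. A further modality, such as relatives' phenotypes, would be passed as a third matrix, and the `weights` argument is one simple place to down-weight a modality that degrades the result.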

Keywords: deep phenotyping; experiencer; negated phenotype; patient similarity.

MeSH terms

Electronic Health Records*
Humans
Narration*
Natural Language Processing
Phenotype
Rare Diseases