Characterizing multimorbidity in ALIVE: comparing single and ensemble clustering methods

Jacqueline E Rudolph; Bryan Lau; Becky L Genberg; Jing Sun; Gregory D Kirk; Shruti H Mehta

doi:10.1093/aje/kwae031

Characterizing multimorbidity in ALIVE: comparing single and ensemble clustering methods

Am J Epidemiol. 2024 Aug 5;193(8):1146-1154. doi: 10.1093/aje/kwae031.

Authors

Jacqueline E Rudolph¹, Bryan Lau¹, Becky L Genberg¹, Jing Sun¹, Gregory D Kirk^{1

2}, Shruti H Mehta¹

Affiliations

¹ Department of Epidemiology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD 21205, United States.
² Division of Infectious Diseases, Johns Hopkins School of Medicine, Baltimore, MD 21205, United States.

PMID: 38576181
PMCID: PMC11299029 (available on 2025-04-03)
DOI: 10.1093/aje/kwae031

Abstract

Multimorbidity, defined as having 2 or more chronic conditions, is a growing public health concern, but research in this area is complicated by the fact that multimorbidity is a highly heterogenous outcome. Individuals in a sample may have a differing number and varied combinations of conditions. Clustering methods, such as unsupervised machine learning algorithms, may allow us to tease out the unique multimorbidity phenotypes. However, many clustering methods exist, and choosing which to use is challenging because we do not know the true underlying clusters. Here, we demonstrate the use of 3 individual algorithms (partition around medoids, hierarchical clustering, and probabilistic clustering) and a clustering ensemble approach (which pools different clustering approaches) to identify multimorbidity clusters in the AIDS Linked to the Intravenous Experience cohort study. We show how the clusters can be compared based on cluster quality, interpretability, and predictive ability. In practice, it is critical to compare the clustering results from multiple algorithms and to choose the approach that performs best in the domain(s) that aligns with plans to use the clusters in future analyses.

Keywords: clustering; ensemble clustering; hierarchical clustering; multimorbidity; partition around medoids; probabilistic clustering; unsupervised machine learning.

Publication types

Comparative Study

MeSH terms

Adult
Algorithms*
Cluster Analysis
Female
Humans
Male
Middle Aged
Multimorbidity*
Unsupervised Machine Learning

Abstract

Publication types

MeSH terms

Grants and funding