Learning site-invariant features of connectomes to harmonize complex network measures

Nancy R Newlin; Praitayini Kanakaraj; Thomas Li; Kimberly Pechman; Derek Archer; Angela Jefferson; BIOCARD Study Team; Bennett Landman; Daniel Moyer

doi:10.1117/12.3009645

Learning site-invariant features of connectomes to harmonize complex network measures

Proc SPIE Int Soc Opt Eng. 2024 Feb:12930:129302E. doi: 10.1117/12.3009645. Epub 2024 Apr 2.

Authors

Nancy R Newlin¹, Praitayini Kanakaraj¹, Thomas Li², Kimberly Pechman³, Derek Archer^{3

4

5}, Angela Jefferson^{3

4

6}; BIOCARD Study Team; Bennett Landman^{1

2

7

8}, Daniel Moyer¹

Affiliations

¹ Department of Computer Science, Vanderbilt University, Nashville, TN, USA.
² Department of Biomedical Engineering, Vanderbilt University, Nashville, Tennessee, USA.
³ Vanderbilt Memory and Alzheimer's Center, Vanderbilt University Medical Center, Nashville, TN, USA.
⁴ Department of Neurology, Vanderbilt University Medical Center, Nashville, TN, USA.
⁵ Vanderbilt Genetics Institute, Vanderbilt University School of Medicine, Nashville, TN, USA.
⁶ Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
⁷ Vanderbilt University Institute of Imaging Science, Vanderbilt University, Nashville, TN, USA.
⁸ Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, USA.

Abstract

Multi-site diffusion MRI data is often acquired on different scanners and with distinct protocols. Differences in hardware and acquisition result in data that contains site dependent information, which confounds connectome analyses aiming to combine such multi-site data. We propose a data-driven solution that isolates site-invariant information whilst maintaining relevant features of the connectome. We construct a latent space that is uncorrelated with the imaging site and highly correlated with patient age and a connectome summary measure. Here, we focus on network modularity. The proposed model is a conditional, variational autoencoder with three additional prediction tasks: one for patient age, and two for modularity trained exclusively on data from each site. This model enables us to 1) isolate site-invariant biological features, 2) learn site context, and 3) re-inject site context and project biological features to desired site domains. We tested these hypotheses by projecting 77 connectomes from two studies and protocols (Vanderbilt Memory and Aging Project (VMAP) and Biomarkers of Cognitive Decline Among Normal Individuals (BIOCARD) to a common site. We find that the resulting dataset of modularity has statistically similar means (p-value <0.05) across sites. In addition, we fit a linear model to the joint dataset and find that positive correlations between age and modularity were preserved.

Keywords: Diffusion MRI; complex network measures; connectome; multi-site analysis; site-invariance.

Abstract

Grants and funding