Statistical harmonization of versions of measures across studies using external data: self-rated health and self-rated memory

Yingyan Wu; Eleanor Hayes-Larson; Yixuan Zhou; Vincent Bouteloup; Scott C Zimmerman; Anna M Pederson; Vincent Planche; Marissa J Seamans; Daniel Westreich; M Maria Glymour; Laura E Gibbons; Carole Dufouil; Elizabeth Rose Mayeda

doi:10.1016/j.annepidem.2025.01.002

Statistical harmonization of versions of measures across studies using external data: self-rated health and self-rated memory

Ann Epidemiol. 2025 Jan 10:S1047-2797(25)00008-0. doi: 10.1016/j.annepidem.2025.01.002. Online ahead of print.

Affiliations

¹ Department of Epidemiology, University of California, Los Angeles Fielding School of Public Health, CA, USA.
² Department of Epidemiology, University of California, Los Angeles Fielding School of Public Health, CA, USA; Department of Biostatistics, University of California, Los Angeles Fielding School of Public Health, CA, USA.
³ Univ. Bordeaux, Inserm, Bordeaux Population Health, UMR 1219, Bordeaux, France; CIC 1401 EC, Pôle Santé Publique; CHU de Bordeaux, France.
⁴ Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA.
⁵ Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA; Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA.
⁶ Univ. Bordeaux, CNRS, Institut des Maladies Neurodégénératives, UMR 5293, Bordeaux, France; Pôle de Neurosciences Cliniques, Centre Mémoire de Ressources et de Recherche, CHU de Bordeaux, France.
⁷ Department of Epidemiology, University of North Carolina Gillings School of Global Public Health, Chapel Hill, North Carolina, USA.
⁸ Department of Medicine, School of Medicine, University of Washington, Seattle, WA, USA.
⁹ Department of Epidemiology, University of California, Los Angeles Fielding School of Public Health, CA, USA. Electronic address: [email protected].

PMID: 39800088
DOI: 10.1016/j.annepidem.2025.01.002

Abstract

Purpose: Harmonizing variables for constructs measured differently across studies is essential for comparing, combining, and generalizing results. We developed and fielded a brief survey to harmonize Likert and continuous versions of measures for two constructs, self-rated health and self-rated memory, for use in studies of French older adults.

Methods: We recruited 300 participants from a French memory clinic in 2023 to answer both the Likert and continuous versions of self-rated health and self-rated memory questions. For each construct, we predicted responses to the Likert version with multinomial and ordinal logistic models, varying specifications of continuous version responses (linear or spline) and covariate sets (question order, age, sex/gender, and interactions between the continuous version and covariates). We also implemented a percentiles-based crosswalk sensitivity analysis. We compared Cohen's weighted kappa values to identify the best statistical harmonization approach.

Results: In the final models [multinomial models with continuous version spline, question order (self-rated memory model only), age, sex/gender, and interactions between the continuous version and covariates], weighted kappa values were 0.61 for self-rated health and 0.60 for self-rated memory, reflecting moderate agreement.

Conclusions: Primary data collection feasibly facilitates statistical harmonization of variables for constructs measured differently across studies.

Keywords: Measurement; Primary data collection; Statistical harmonization.