Obtaining personalized predictions from a randomized controlled trial on Alzheimer's disease

Sci Rep. 2025 Jan 11;15(1):1671. doi: 10.1038/s41598-024-84687-4.

Abstract

The purpose of this article is to infer patient level outcomes from population level randomized control trials (RCTs). In this pursuit, we utilize the recently proposed synthetic nearest neighbors (SNN) estimator. At its core, SNN leverages information across patients to impute missing data associated with each patient of interest. We focus on two types of missing data: (i) unrecorded outcomes from discontinuing the assigned treatments and (ii) unobserved outcomes associated with unassigned treatments. Data imputation in the former powers and de-biases RCTs, while data imputation in the latter simulates "synthetic RCTs" to predict the outcomes for each patient under every treatment. The SNN estimator is interpretable, transparent, and causally justified under a broad class of missing data scenarios. Relative to several standard methods, we empirically find that SNN performs well for the above two applications using Phase 3 clinical trial data on patients with Alzheimer's Disease. Our findings directly suggest that SNN can tackle a current pain point within the clinical trial workflow on patient dropouts and serve as a new tool towards the development of precision medicine. Building on our insights, we discuss how SNN can further generalize to real-world applications.

Keywords: Dropouts; Precision medicine; Real-world data; Real-world evidence; Synthetic trials.

MeSH terms

  • Alzheimer Disease* / therapy
  • Clinical Trials, Phase III as Topic
  • Humans
  • Precision Medicine* / methods
  • Randomized Controlled Trials as Topic*