Super learning to hedge against incorrect inference from arbitrary parametric assumptions in marginal structural modeling

J Clin Epidemiol. 2013 Aug;66(8 Suppl):S99-109. doi: 10.1016/j.jclinepi.2013.01.016.

Abstract

Objective: Clinical trials are unlikely to ever be launched for many comparative effectiveness research (CER) questions. Inferences from hypothetical randomized trials may however be emulated with marginal structural modeling (MSM) using observational data, but success in adjusting for time-dependent confounding and selection bias typically relies on parametric modeling assumptions. If these assumptions are violated, inferences from MSM may be inaccurate. In this article, we motivate the application of a data-adaptive estimation approach called super learning (SL) to avoid reliance on arbitrary parametric assumptions in CER.

Study design and setting: Using the electronic health records data from adults with new-onset type 2 diabetes, we implemented MSM with inverse probability weighting (IPW) estimation to evaluate the effect of three oral antidiabetic therapies on the worsening of glomerular filtration rate.

Results: Inferences from IPW estimation were noticeably sensitive to the parametric assumptions about the associations between both the exposure and censoring processes and the main suspected source of confounding, that is, time-dependent measurements of hemoglobin A1c. SL was successfully implemented to harness flexible confounding and selection bias adjustment from existing machine learning algorithms.

Conclusion: Erroneous IPW inference about clinical effectiveness because of arbitrary and incorrect modeling decisions may be avoided with SL.

Keywords: Comparative effectiveness research; Inverse probability weighting; Marginal structural model; Selection bias; Super learning; Time-dependent confounding.

MeSH terms

  • Adult
  • Aged
  • Cohort Studies
  • Comparative Effectiveness Research / statistics & numerical data
  • Confounding Factors, Epidemiologic
  • Data Interpretation, Statistical*
  • Diabetes Mellitus, Type 2 / drug therapy
  • Diabetes Mellitus, Type 2 / physiopathology
  • Disease Progression
  • Drug Therapy, Combination
  • Electronic Health Records / statistics & numerical data*
  • Glomerular Filtration Rate / drug effects
  • Humans
  • Hypoglycemic Agents / administration & dosage
  • Hypoglycemic Agents / therapeutic use
  • Metformin / administration & dosage
  • Metformin / therapeutic use
  • Middle Aged
  • Models, Statistical*
  • Randomized Controlled Trials as Topic / statistics & numerical data*
  • Selection Bias
  • Sulfonylurea Compounds / administration & dosage
  • Sulfonylurea Compounds / therapeutic use
  • Survival Analysis
  • Time Factors
  • Treatment Outcome
  • Young Adult

Substances

  • Hypoglycemic Agents
  • Sulfonylurea Compounds
  • Metformin