Collaborative targeted maximum likelihood estimation for variable importance measure: Illustration for functional outcome prediction in mild traumatic brain injuries

Romain Pirracchio; John K Yue; Geoffrey T Manley; Mark J van der Laan; Alan E Hubbard; TRACK-TBI Investigators including Wayne A Gordon, Hester F Lingsma, Andrew IR Maas, Pratik Mukherjee, David O Okonkwo, David M Schnyer, Alex B Valadka and Esther L Yuh

doi:10.1177/0962280215627335


Stat Methods Med Res. 2018 Jan;27(1):286-297. doi: 10.1177/0962280215627335. Epub 2016 Jun 29.

Authors

Romain Pirracchio¹, John K Yue²,³, Geoffrey T Manley²,³, Mark J van der Laan⁴, Alan E Hubbard⁴; TRACK-TBI Investigators including Wayne A Gordon, Hester F Lingsma, Andrew IR Maas, Pratik Mukherjee, David O Okonkwo, David M Schnyer, Alex B Valadka and Esther L Yuh

Affiliations

¹ Department of Anesthesia and Perioperative Care, UCSF, San Francisco General Hospital, San Francisco, CA, USA.
² Brain and Spinal Injury Center, San Francisco, CA, USA.
³ Department of Neurosurgery, University of California San Francisco, San Francisco, CA, USA.
⁴ Division of Biostatistics, School of Public Health, University of California Berkeley, Berkeley, CA, USA.

Abstract

Standard statistical practice for determining the relative importance of competing causes of disease typically relies on ad hoc methods, often byproducts of machine learning procedures (stepwise regression, random forest, etc.). A causal inference framework and data-adaptive methods may help tailor parameters to the clinical question and free the analyst from arbitrary modeling assumptions. Our focus is on implementations of such semiparametric methods for a variable importance measure (VIM). We propose a fully automated procedure for VIM based on collaborative targeted maximum likelihood estimation (cTMLE), a method that optimizes the estimate of an association in the presence of potentially numerous competing causes. We applied the approach to data collected from traumatic brain injury patients, specifically from a prospective observational study conducted at three US Level-1 trauma centers. The primary outcome was a disability score, the Glasgow Outcome Scale-Extended (GOSE), collected three months post-injury. We identified clinically important predictors among a set of risk factors using a variable importance analysis based on targeted maximum likelihood estimators (TMLE) and on cTMLE. Via a parametric bootstrap, we demonstrate that the latter procedure has the potential for robust automated estimation of variable importance measures based upon machine-learning algorithms. The cTMLE estimator was associated with substantially less positivity bias than TMLE and with higher coverage of the 95% confidence interval (CI). This study confirms the power of an automated cTMLE procedure that can target model selection via machine learning to estimate VIMs in complicated, high-dimensional data.

Keywords: Variable importance measure; causal inference; collaborative targeted maximum likelihood; high-dimensional data; positivity; semi-parametric.
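
To make the targeting idea in the abstract concrete, here is a minimal sketch of plain TMLE (not the collaborative variant, and not the authors' implementation) for the marginal risk difference of one binary risk factor. Everything in it is an assumption made for illustration: the simulated data with a single binary confounder `W`, the function names `simulate` and `tmle_ate`, the deliberately crude initial outcome fit that ignores `W`, and the stratum-proportion estimate of the exposure mechanism. The abstract's procedure instead relies on machine learning for these fits and on cTMLE to select the exposure model.

```python
import math
import random


def expit(x):
    return 1.0 / (1.0 + math.exp(-x))


def logit(p):
    return math.log(p / (1.0 - p))


def simulate(n, seed=0):
    """Hypothetical data: W confounds the effect of exposure A on outcome Y."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        w = 1 if rng.random() < 0.5 else 0
        a = 1 if rng.random() < 0.2 + 0.6 * w else 0
        y = 1 if rng.random() < expit(-1.0 + a + 1.5 * w) else 0
        data.append((w, a, y))
    return data


def tmle_ate(data):
    """Return (targeted estimate, naive estimate) of E[Y(1)] - E[Y(0)]."""
    n = len(data)

    # Step 1: initial outcome fit Q(A, W). It ignores W on purpose, so the
    # plug-in based on it is the confounded difference in means.
    q_init = {}
    for a in (0, 1):
        ys = [y for _, ai, y in data if ai == a]
        q_init[a] = sum(ys) / len(ys)

    # Step 2: exposure mechanism g(W) = P(A = 1 | W), by stratum proportions.
    g = {}
    for w in (0, 1):
        exposures = [a for wi, a, _ in data if wi == w]
        g[w] = sum(exposures) / len(exposures)

    def clever(a, w):
        # "Clever covariate" of the risk-difference parameter.
        return a / g[w] - (1 - a) / (1.0 - g[w])

    # Step 3: one-parameter logistic fluctuation with offset logit(Q),
    # fitted by Newton's method on the Bernoulli log-likelihood.
    eps = 0.0
    for _ in range(50):
        score, info = 0.0, 0.0
        for w, a, y in data:
            h = clever(a, w)
            p = expit(logit(q_init[a]) + eps * h)
            score += h * (y - p)
            info += h * h * p * (1.0 - p)
        step = score / info
        eps += step
        if abs(step) < 1e-10:
            break

    # Step 4: plug the updated predictions into the target parameter.
    total = 0.0
    for w, _, _ in data:
        q1 = expit(logit(q_init[1]) + eps * clever(1, w))
        q0 = expit(logit(q_init[0]) + eps * clever(0, w))
        total += q1 - q0
    return total / n, q_init[1] - q_init[0]


if __name__ == "__main__":
    targeted, naive = tmle_ate(simulate(5000, seed=1))
    print("naive difference in means:", round(naive, 3))
    print("targeted (TMLE) estimate: ", round(targeted, 3))
```

In this toy setting the naive difference in means is confounded by `W`, while the fluctuation step pulls the plug-in estimate toward the simulated truth because the exposure mechanism is estimated consistently. The clever covariate contains `1/g` and `1/(1-g)`, so exposure probabilities close to 0 or 1 inflate it; that is the positivity problem the abstract reports cTMLE handling better than TMLE.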

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Brain Injuries, Traumatic / physiopathology*
Cooperative Behavior
Forecasting
Glasgow Outcome Scale
Humans
Likelihood Functions*
Machine Learning
Outcome Assessment, Health Care / methods
Recovery of Function*
