Methodological variations in lagged regression for detecting physiologic drug effects in EHR data

Matthew E Levine; David J Albers; George Hripcsak

doi:10.1016/j.jbi.2018.08.014

Methodological variations in lagged regression for detecting physiologic drug effects in EHR data

J Biomed Inform. 2018 Oct:86:149-159. doi: 10.1016/j.jbi.2018.08.014. Epub 2018 Aug 30.

Authors

Matthew E Levine¹, David J Albers², George Hripcsak³

Affiliations

¹ Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY 10032, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States. Electronic address: [email protected].
² Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY 10032, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States.
³ Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY 10032, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; NewYork-Presbyterian Hospital, 622 W. 168th Street, New York, NY 10032, United States.

Abstract

We studied how lagged linear regression can be used to detect the physiologic effects of drugs from data in the electronic health record (EHR). We systematically examined the effect of methodological variations ((i) time series construction, (ii) temporal parameterization, (iii) intra-subject normalization, (iv) differencing (lagged rates of change achieved by taking differences between consecutive measurements), (v) explanatory variables, and (vi) regression models) on performance of lagged linear methods in this context. We generated two gold standards (one knowledge-base derived, one expert-curated) for expected pairwise relationships between 7 drugs and 4 labs, and evaluated how the 64 unique combinations of methodological perturbations reproduce the gold standards. Our 28 cohorts included patients in the Columbia University Medical Center/NewYork-Presbyterian Hospital clinical database, and ranged from 2820 to 79,514 patients with between 8 and 209 average time points per patient. The most accurate methods achieved AUROC of 0.794 for knowledge-base derived gold standard (95%CI [0.741, 0.847]) and 0.705 for expert-curated gold standard (95% CI [0.629, 0.781]). We observed a mean AUROC of 0.633 (95%CI [0.610, 0.657], expert-curated gold standard) across all methods that re-parameterize time according to sequence and use either a joint autoregressive model with time-series differencing or an independent lag model without differencing. The complement of this set of methods achieved a mean AUROC close to 0.5, indicating the importance of these choices. We conclude that time-series analysis of EHR data will likely rely on some of the beneficial pre-processing and modeling methodologies identified, and will certainly benefit from continued careful analysis of methodological perturbations. This study found that methodological variations, such as pre-processing and representations, have a large effect on results, exposing the importance of thoroughly evaluating these components when comparing machine-learning methods.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Academic Medical Centers
Area Under Curve
Data Collection
Databases, Factual
Drug Therapy / methods*
Electronic Health Records*
Humans
Linear Models
Machine Learning*
New York City
Pharmaceutical Preparations*
ROC Curve
Regression Analysis
Reproducibility of Results
Time Factors

Substances

Pharmaceutical Preparations

Grants and funding

R01 LM006910/LM/NLM NIH HHS/United States