Predictive Models for Sustained, Uncontrolled Hypertension and Hypertensive Crisis Based on Electronic Health Record Data: Algorithm Development and Validation

Hieu Minh Nguyen; William Anderson; Shih-Hsiung Chou; Andrew McWilliams; Jing Zhao; Nicholas Pajewski; Yhenneko Taylor

doi:10.2196/58732

Predictive Models for Sustained, Uncontrolled Hypertension and Hypertensive Crisis Based on Electronic Health Record Data: Algorithm Development and Validation

JMIR Med Inform. 2024 Oct 28:12:e58732. doi: 10.2196/58732.

Authors

Hieu Minh Nguyen¹, William Anderson², Shih-Hsiung Chou³, Andrew McWilliams^{4

5}, Jing Zhao⁶, Nicholas Pajewski^{1

7}, Yhenneko Taylor^{1

8}

Affiliations

¹ Center for Health System Sciences (CHASSIS), Atrium Health, Charlotte, NC, United States.
² Statistics and Data Management, Elanco, Greenfield, IN, United States.
³ Enterprise Data Management, Atrium Health, Charlotte, NC, United States.
⁴ Information Technology, Atrium Health, Charlotte, NC, United States.
⁵ Department of Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, United States.
⁶ GSCO Market Access Analytics and Real World Evidence, Johnson & Johnson, Raritan, NJ, United States.
⁷ Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC, United States.
⁸ Department of Social Sciences and Health Policy, Wake Forest University School of Medicine, Winston-Salem, NC, United States.

PMID: 39466045
PMCID: PMC11533385
DOI: 10.2196/58732

Abstract

Background: Assessing disease progression among patients with uncontrolled hypertension is important for identifying opportunities for intervention.

Objective: We aim to develop and validate 2 models, one to predict sustained, uncontrolled hypertension (≥2 blood pressure [BP] readings ≥140/90 mm Hg or ≥1 BP reading ≥180/120 mm Hg) and one to predict hypertensive crisis (≥1 BP reading ≥180/120 mm Hg) within 1 year of an index visit (outpatient or ambulatory encounter in which an uncontrolled BP reading was recorded).

Methods: Data from 142,897 patients with uncontrolled hypertension within Atrium Health Greater Charlotte in 2018 were used. Electronic health record-based predictors were based on the 1-year period before a patient's index visit. The dataset was randomly split (80:20) into a training set and a validation set. In total, 4 machine learning frameworks were considered: L2-regularized logistic regression, multilayer perceptron, gradient boosting machines, and random forest. Model selection was performed with 10-fold cross-validation. The final models were assessed on discrimination (C-statistic), calibration (eg, integrated calibration index), and net benefit (with decision curve analysis). Additionally, internal-external cross-validation was performed at the county level to assess performance with new populations and summarized using random-effect meta-analyses.

Results: In internal validation, the C-statistic and integrated calibration index were 0.72 (95% CI 0.71-0.72) and 0.015 (95% CI 0.012-0.020) for the sustained, uncontrolled hypertension model, and 0.81 (95% CI 0.79-0.82) and 0.009 (95% CI 0.007-0.011) for the hypertensive crisis model. The models had higher net benefit than the default policies (ie, treat-all and treat-none) across different decision thresholds. In internal-external cross-validation, the pooled performance was consistent with internal validation results; in particular, the pooled C-statistics were 0.70 (95% CI 0.69-0.71) and 0.79 (95% CI 0.78-0.81) for the sustained, uncontrolled hypertension model and hypertensive crisis model, respectively.

Conclusions: An electronic health record-based model predicted hypertensive crisis reasonably well in internal and internal-external validations. The model can potentially be used to support population health surveillance and hypertension management. Further studies are needed to improve the ability to predict sustained, uncontrolled hypertension.

Keywords: blood pressure; cardiovascular; decision support; electronic health record; machine learning; predictive model; risk prediction.

Publication types

Validation Study

MeSH terms

Aged
Algorithms*
Disease Progression
Electronic Health Records* / statistics & numerical data
Female
Humans
Hypertension* / diagnosis
Hypertension* / epidemiology
Hypertensive Crisis
Machine Learning
Male
Middle Aged