Monitoring of the trough concentration of valproic acid in pediatric epilepsy patients: a machine learning-based ensemble model

Front Pharmacol. 2024 Dec 18:15:1521932. doi: 10.3389/fphar.2024.1521932. eCollection 2024.

Abstract

Aims: Few personalized monitoring models for valproic acid (VPA) in pediatric epilepsy patients (PEPs) incorporate machine learning (ML) algorithms. This study aimed to develop an ensemble ML model for VPA monitoring to enhance the clinical precision of VPA use.

Methods: A dataset comprising 366 VPA trough concentrations from 252 PEPs, with 19 covariates and the target variable (VPA trough concentration), was refined by Spearman correlation and multicollinearity testing, yielding a 366 × 11 dataset. The dataset was split into a training set (n = 292) and a testing set (n = 74) at a ratio of 8:2. An ensemble model was built from Gradient Boosting Regression Trees (GBRT), Random Forest Regression (RFR), and Support Vector Regression (SVR), and SHapley Additive exPlanations (SHAP) analysis was used to assess covariate importance. The model was optimized for R², relative accuracy, and absolute accuracy, and validated against two independent external datasets (an in-hospital dataset, n = 32, and an out-of-hospital dataset, n = 28).
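As an illustration of the 8:2 split described above, the following minimal sketch shows how 366 records divide into 292 training and 74 testing records. The function name `split_indices`, the seed, and the shuffling strategy are assumptions made for illustration; the abstract does not state how the split was randomized or whether repeated measurements from the same patient were kept together.

```python
import random


def split_indices(n_samples, train_fraction=0.8, seed=0):
    """Shuffle record indices and split them into training and testing parts.

    Hypothetical helper: a plain sample-level random split, which may differ
    from the procedure the authors used.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # reproducible shuffle
    n_train = int(n_samples * train_fraction)  # floor of 0.8 * n_samples
    return indices[:n_train], indices[n_train:]


# 366 trough concentrations, as reported in the Methods.
train_idx, test_idx = split_indices(366)
print(len(train_idx), len(test_idx))  # 292 74
```

Because 366 concentrations come from 252 patients, some patients contribute more than one record; a split grouped by patient would be one way to keep a patient's records on the same side, but whether that was done here is not stated.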

Results: With the R²-based weight ratio of GBRT, RFR, and SVR optimized at 5:2:3, the ensemble model demonstrated superior performance in terms of relative accuracy (87.8%), absolute accuracy (78.4%), and R² (0.50), while also exhibiting a lower Mean Absolute Error (9.87) and Root Mean Squared Error (12.24), as validated by the external datasets. Platelet count (PLT) and VPA daily dose were identified as pivotal covariates.
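One plausible reading of the 5:2:3 weight ratio is a weighted average of the three base models' predictions. The sketch below shows that combination together with the standard definitions of MAE, RMSE, and R². All function names and the toy numbers are hypothetical and are not taken from the study. Relative and absolute accuracy are omitted because the abstract does not define their thresholds.

```python
import math


def ensemble_predict(gbrt_pred, rfr_pred, svr_pred, weights=(5, 2, 3)):
    """Weighted average of three models' predictions (assumed combination rule)."""
    w_gbrt, w_rfr, w_svr = weights
    total = w_gbrt + w_rfr + w_svr
    return [
        (w_gbrt * g + w_rfr * r + w_svr * s) / total
        for g, r, s in zip(gbrt_pred, rfr_pred, svr_pred)
    ]


def mae(y_true, y_pred):
    """Mean Absolute Error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root Mean Squared Error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Invented toy predictions for two samples (not study data).
gbrt = [60.0, 80.0]
rfr = [50.0, 90.0]
svr = [70.0, 100.0]
observed = [65.0, 84.0]

combined = ensemble_predict(gbrt, rfr, svr)
print(combined, mae(observed, combined), rmse(observed, combined))
```

In a scikit-learn workflow, the same weighting could be expressed by passing per-model weights to a voting-style regressor; the hand-written version is used here only to keep the arithmetic visible.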

Conclusion: The proposed ensemble model effectively monitors VPA trough concentrations in PEPs. By integrating covariates across various ML algorithms, it delivers results closely aligned with clinical practice, offering substantial clinical value for the guided use of VPA.

Keywords: SHAP; VPA trough concentration; ensemble model; machine learning; pediatric epilepsy patients.

Grants and funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Joint Funds for the Innovation of Science and Technology, Fujian Province, China (2021Y9090 to Z-JL) and the Education and Teaching Research Program of Fujian Medical University, Fujian Province, China (J22038 to SC).