Background: Adherence to self-administered biologic therapies is important to induce remission and prevent adverse clinical outcomes in Inflammatory bowel disease (IBD). This study aimed to use administrative claims data and machine learning methods to predict nonadherence in an academic medical center test population.
Methods: A model-training dataset of beneficiaries with IBD and the first unique dispense of a self-administered biologic between June 30, 2016 and June 30, 2019 was extracted from the Commercial Claims and Encounters and Medicare Supplemental Administrative Claims Database. Known correlates of medication nonadherence were identified in the dataset. Nonadherence to biologic therapies was defined as a proportion of days covered ratio <80% at 1 year. A similar dataset was obtained from a tertiary academic medical center's electronic medical record data for use in model testing. A total of 48 machine learning models were trained and assessed utilizing the area under the receiver operating characteristic curve as the primary measure of predictive validity.
Results: The training dataset included 6998 beneficiaries (n = 2680 nonadherent, 38.3%) while the testing dataset included 285 patients (n = 134 nonadherent, 47.0%). When applied to test data, the highest performing models had an area under the receiver operating characteristic curve of 0.55, indicating poor predictive performance. The majority of models trained had low sensitivity and high specificity.
Conclusions: Administrative claims-trained models were unable to predict biologic medication nonadherence in patients with IBD. Future research may benefit from datasets with enriched demographic and clinical data in training predictive models.
Keywords: biologics; machine learning; medication adherence.
© The Author(s) 2024. Published by Oxford University Press on behalf of Crohn's & Colitis Foundation.