Objective: To develop and test predictive models of discontinuation of behavioral health service use within 12 months in transitional age youth with recent behavioral health service use.
Data sources: Administrative claims for Medicaid beneficiaries aged 15-26 years in Connecticut.
Study design: We compared the performance of a decision tree, random forest, and gradient boosting machine learning algorithms to logistic regression in predicting service discontinuation within 12 months among beneficiaries using behavioral health services.
Data extraction: We identified 33,532 transitional age youth with ≥1 claim for a primary behavioral health diagnosis in 2016 and Medicaid enrollment of ≥11 months in 2016 and ≥11 months in 2017.
Principal findings: Classification accuracy for identifying youth who discontinued behavioral health service use was highest for gradient boosting (80%, AUC = 0.86), decision tree (79%, AUC = 0.84), and random forest (79%, AUC = 0.86), as compared with logistic regression (71%, AUC = 0.71).
Conclusions: Predictive models based on Medicaid claims can assist in identifying transitional age youth who are at risk of discontinuing from behavioral health care within 12 months, thus allowing for proactive assessment and outreach to promote continuity of care for younger persons who have behavioral health needs.
Keywords: emerging adults; machine learning; mental health services.
© 2021 Health Research and Educational Trust.