Using Structural Equation Modelling in Routine Clinical Data on Diabetes and Depression: Observational Cohort Study

JMIRx Med. 2022 Apr 27;3(2):e22912. doi: 10.2196/22912.

Abstract

Background: Large data sets comprising routine clinical data are becoming increasingly available for use in health research. These data sets contain many clinical variables that might not lend themselves to use in research. Structural equation modelling (SEM) is a statistical technique that might allow for the creation of "research-friendly" clinical constructs from these routine clinical variables and therefore could be an appropriate analytic method to apply more widely to routine clinical data.

Objective: SEM was applied to a large data set of routine clinical data developed in East London to model well-established clinical associations. Depression is common among patients with type 2 diabetes, and is associated with poor diabetic control, increased diabetic complications, increased health service utilization, and increased health care costs. Evidence from trial data suggests that integrating psychological treatment into diabetes care can improve health status and reduce costs. Attempting to model these known associations using SEM will test the utility of this technique in routine clinical data sets.

Methods: Data were cleaned extensively prior to analysis. SEM was used to investigate associations between depression, diabetic control, diabetic care, mental health treatment, and Accident & Emergency (A&E) use in patients with type 2 diabetes. The creation of the latent variables and the direction of association between latent variables in the model was based upon established clinical knowledge.

Results: The results provided partial support for the application of SEM to routine clinical data. Overall, 19% (3106/16,353) of patients with type 2 diabetes had received a diagnosis of depression. In line with known clinical associations, depression was associated with worse diabetic control (β=.034, P<.001) and increased A&E use (β=.071, P<.001). However, contrary to expectation, worse diabetic control was associated with lower A&E use (β=-.055, P<.001) and receipt of mental health treatment did not impact upon diabetic control (P=.39). Receipt of diabetes care was associated with better diabetic control (β=-.072, P<.001), having depression (β=.018, P=.007), and receiving mental health treatment (β=.046, P<.001), which might suggest that comprehensive integrated care packages are being delivered in East London.

Conclusions: Some established clinical associations were successfully modelled in a sample of patients with type 2 diabetes in a way that made clinical sense, providing partial evidence for the utility of SEM in routine clinical data. Several issues relating to data quality emerged. Data improvement would have likely enhanced the utility of SEM in this data set.

Keywords: PLS-SEM; accident; acute care; clinical data; depression; diabetes; electronic health records; emergency; emergency care; equation modelling; path analysis; structural equation modelling.