Background: Data science approaches have increasingly been used in behavioral health research and may be useful for addressing social factors contributing to disparities in health status. This study evaluated the importance of cultural stress-related factors in classifying depression and post-traumatic stress disorder (PTSD) among adult survivors (N = 319) of Hurricane Maria who migrated from Puerto Rico to the United States mainland.
Methods: We evaluated the performance of random forests (RF) and logistic regression (LR) for classifying PTSD and depression. Models included demographic, hurricane exposure, and migration-related cultural stress variables. We inspected area under the receiver operating characteristic curve (AUC), accuracy, balanced accuracy, F1 score, precision, recall, and specificity.
Results: Negative context of reception and language-related stressors were moderately important for accurately classifying depression and PTSD. For classifying depression, RF showed higher accuracy, balanced accuracy, specificity, precision, and F1. For classifying PTSD, RF showed higher accuracy, specificity, precision, and F1.
Limitations: A more thorough classification model would also include biomarkers (e.g., of allostatic load), family, community, or neighborhood-level attributes. Findings may not generalize to other groups who have experienced crisis-related migration.
Conclusions: Findings underscore the importance of culturally and linguistically appropriate and trauma-informed clinical services for recent migrants. Use of assessments to identify pre-migration and post-migration stressors could inform clinical practice with migrants presenting with behavioral health-related difficulties.
Keywords: Data science; Depression; Ethnic discrimination; Post-traumatic stress disorder.
Copyright © 2023 Elsevier B.V. All rights reserved.