Using categorical data analyses in suicide research: Considering clinical utility and practicality

Suicide Life Threat Behav. 2021 Feb;51(1):76-87. doi: 10.1111/sltb.12670.

Abstract

Objective: Categorical data analysis is relevant to suicide risk and prevention research that focuses on discrete outcomes (e.g., suicide attempt status). Unfortunately, results from these analyses are often misinterpreted and not presented in a clinically tangible manner. We aimed to address these issues and highlight the relevance and utility of categorical methods in suicide research and clinical assessment. Additionally, we introduce relevant basic machine learning methods concepts and address the distinct utility of the current methods.

Method: We review relevant background concepts and pertinent issues with references to helpful resources. We also provide non-technical descriptions and tutorials of how to convey categorical statistical results (logistic regression, receiver operating characteristic [ROC] curves, area under the curve [AUC] statistics, clinical cutoff scores) for clinical context and more intuitive use.

Results: We provide comprehensive examples, using simulated data, and interpret results. We also note important considerations for conducting and interpreting these analyses. We provide a walk-through demonstrating how to convert logistic regression estimates into predicted probability values, which is accompanied by Appendices demonstrating how to produce publication-ready figures in R and Microsoft Excel.

Conclusion: Improving the translation of statistical estimates to practical, clinically tangible information may narrow the divide between research and clinical practice.

Keywords: Categorical methods; clinical utility; logistic regression; probability; tutorial.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Area Under Curve
  • Data Analysis*
  • Humans
  • Logistic Models
  • ROC Curve
  • Suicide, Attempted*