Explainable artificial intelligence models for predicting risk of suicide using health administrative data in Quebec

Fatemeh Gholi Zadeh Kharrat; Christian Gagne; Alain Lesage; Geneviève Gariépy; Jean-François Pelletier; Camille Brousseau-Paradis; Louis Rochette; Eric Pelletier; Pascale Lévesque; Mada Mohammed; JianLi Wang

doi:10.1371/journal.pone.0301117

Explainable artificial intelligence models for predicting risk of suicide using health administrative data in Quebec

PLoS One. 2024 Apr 3;19(4):e0301117. doi: 10.1371/journal.pone.0301117. eCollection 2024.

Authors

Fatemeh Gholi Zadeh Kharrat^{1

2}, Christian Gagne¹, Alain Lesage^{3

4}, Geneviève Gariépy^{5

6

7}, Jean-François Pelletier^{3

4}, Camille Brousseau-Paradis^{3

4}, Louis Rochette², Eric Pelletier², Pascale Lévesque², Mada Mohammed⁸, JianLi Wang⁸

Affiliations

¹ Institut Intelligence et Données (IID), Université Laval, Québec, Québec, Canada.
² Institut National de Santé Publique du Québec (INSPQ), Québec, Québec, Canada.
³ Department of Psychiatry and Addiction, Université de Montréal, Montreal, QC, Canada.
⁴ Centre de Recherche de l'Institut Universitaire en Santé Mentale de Montréal, Québec, Canada.
⁵ Centre for Surveillance and Applied Research, Health Promotion and Chronic Disease Prevention Branch, Public Health Agency of Canada, Ottawa, Canada.
⁶ Department of Social and Preventive Medicine, School of Public Health, University of Montreal, Montreal, Canada.
⁷ Montreal Mental Health University Institute Research Center, Montreal, Canada.
⁸ Department of Community Health and Epidemiology, Faculty of Medicine, Dalhousie University, Halifax, Canada.

Abstract

Suicide is a complex, multidimensional event, and a significant challenge for prevention globally. Artificial intelligence (AI) and machine learning (ML) have emerged to harness large-scale datasets to enhance risk detection. In order to trust and act upon the predictions made with ML, more intuitive user interfaces must be validated. Thus, Interpretable AI is one of the crucial directions which could allow policy and decision makers to make reasonable and data-driven decisions that can ultimately lead to better mental health services planning and suicide prevention. This research aimed to develop sex-specific ML models for predicting the population risk of suicide and to interpret the models. Data were from the Quebec Integrated Chronic Disease Surveillance System (QICDSS), covering up to 98% of the population in the province of Quebec and containing data for over 20,000 suicides between 2002 and 2019. We employed a case-control study design. Individuals were considered cases if they were aged 15+ and had died from suicide between January 1st, 2002, and December 31st, 2019 (n = 18339). Controls were a random sample of 1% of the Quebec population aged 15+ of each year, who were alive on December 31st of each year, from 2002 to 2019 (n = 1,307,370). We included 103 features, including individual, programmatic, systemic, and community factors, measured up to five years prior to the suicide events. We trained and then validated the sex-specific predictive risk model using supervised ML algorithms, including Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Multilayer perceptron (MLP). We computed operating characteristics, including sensitivity, specificity, and Positive Predictive Value (PPV). We then generated receiver operating characteristic (ROC) curves to predict suicides and calibration measures. For interpretability, Shapley Additive Explanations (SHAP) was used with the global explanation to determine how much the input features contribute to the models' output and the largest absolute coefficients. The best sensitivity was 0.38 with logistic regression for males and 0.47 with MLP for females; the XGBoost Classifier with 0.25 for males and 0.19 for females had the best precision (PPV). This study demonstrated the useful potential of explainable AI models as tools for decision-making and population-level suicide prevention actions. The ML models included individual, programmatic, systemic, and community levels variables available routinely to decision makers and planners in a public managed care system. Caution shall be exercised in the interpretation of variables associated in a predictive model since they are not causal, and other designs are required to establish the value of individual treatments. The next steps are to produce an intuitive user interface for decision makers, planners and other stakeholders like clinicians or representatives of families and people with live experience of suicidal behaviors or death by suicide. For example, how variations in the quality of local area primary care programs for depression or substance use disorders or increased in regional mental health and addiction budgets would lower suicide rates.

Copyright: © 2024 Gholi Zadeh Kharrat et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

MeSH terms

Artificial Intelligence*
Case-Control Studies
Female
Humans
Male
Quebec / epidemiology
Routinely Collected Health Data
Suicide*

Grants and funding

This research was financially supported through a grant from the Tri-Agency Institutional Programs Secretariat, Government of Canada, specifically under the New Frontiers for Research Funds (Confirmation Gran#: NFRFE -2019-00471). Furthermore, JLW is supported by a Tier 1 Canada Research Chair in Health Data Science and Innovation and assumed the role of the principal investigator for this project. JLW played a substantial role in various aspects of the study, including design, conceptualization, decision-making regarding publication, and the review and editing of the manuscript. FGZK was the recipient of a post-doctoral award from the Quebec Suicide, Mood, and Associated Disorders Research Network, funded by Quebec's Health and Social Research Funds (Confirmation Grant#: 268065). In her capacity, FGZK actively contributed to the study's design, data collection, analysis of results, and the composition of the original draft. CG, holding a Canada-CIFAR AI chair associated with Mila, played a pivotal role in the study's design, providing supervision, contributing to conceptualization, decision-making regarding publication, and undertaking the review and editing of the manuscript.