TY - JOUR
T1 - Predicting future suicidal behaviour in young adults, with different machine learning techniques
T2 - A population-based longitudinal study
AU - van Mens, Kasper
AU - de Schepper, C. W.M.
AU - Wijnen, Ben
AU - Koldijk, Saskia J.
AU - Schnack, Hugo
AU - de Looff, Peter
AU - Lokkerbol, Joran
AU - Wetherall, Karen
AU - Cleare, Seonaid
AU - C O'Connor, Rory
AU - de Beurs, Derek
N1 - Copyright © 2020. Published by Elsevier B.V.
PY - 2020/6/15
Y1 - 2020/6/15
N2 - Background: The predictive accuracy of suicidal behaviour has not improved over the last decades. We aimed to explore the potential of machine learning to predict future suicidal behaviour using population-based longitudinal data. Method: Baseline risk data assessed within the Scottish wellbeing study, in which 3508 young adults (18–34 years) completed a battery of psychological measures, were used to predict both suicide ideation and suicide attempts at one-year follow-up. The performance of the following algorithms was compared: regular logistic regression, K-nearest neighbors, classification tree, random forests, gradient boosting and support vector machine. Results: At one year follow up, 2428 respondents (71%) finished the second assessment. 336 respondents (14%) reported suicide ideation between baseline and follow up, and 50 (2%) reported a suicide attempt. All performance metrics were highly similar across methods. The random forest algorithm was the best algorithm to predict suicide ideation (AUC 0.83, PPV 0.52, BA 0.74) and the gradient boosting to predict suicide attempt (AUC 0.80, PPV 0.10, BA 0.69). Limitations: The number of respondents with suicidal behaviour at follow up was small. We only had data on psychological risk factors, limiting the potential of the more complex machine learning algorithms to outperform regular logistical regression. Conclusions: When applied to population-based longitudinal data containing multiple psychological measurements, machine learning techniques did not significantly improve the predictive accuracy of suicidal behaviour. Adding more detailed data on for example employment, education or previous health care uptake, might result in better performance of machine learning over regular logistical regression.
AB - Background: The predictive accuracy of suicidal behaviour has not improved over the last decades. We aimed to explore the potential of machine learning to predict future suicidal behaviour using population-based longitudinal data. Method: Baseline risk data assessed within the Scottish wellbeing study, in which 3508 young adults (18–34 years) completed a battery of psychological measures, were used to predict both suicide ideation and suicide attempts at one-year follow-up. The performance of the following algorithms was compared: regular logistic regression, K-nearest neighbors, classification tree, random forests, gradient boosting and support vector machine. Results: At one year follow up, 2428 respondents (71%) finished the second assessment. 336 respondents (14%) reported suicide ideation between baseline and follow up, and 50 (2%) reported a suicide attempt. All performance metrics were highly similar across methods. The random forest algorithm was the best algorithm to predict suicide ideation (AUC 0.83, PPV 0.52, BA 0.74) and the gradient boosting to predict suicide attempt (AUC 0.80, PPV 0.10, BA 0.69). Limitations: The number of respondents with suicidal behaviour at follow up was small. We only had data on psychological risk factors, limiting the potential of the more complex machine learning algorithms to outperform regular logistical regression. Conclusions: When applied to population-based longitudinal data containing multiple psychological measurements, machine learning techniques did not significantly improve the predictive accuracy of suicidal behaviour. Adding more detailed data on for example employment, education or previous health care uptake, might result in better performance of machine learning over regular logistical regression.
KW - Humans
KW - Logistic Models
KW - Longitudinal Studies
KW - Machine Learning
KW - Suicidal Ideation
KW - Suicide, Attempted
KW - Young Adult
UR - http://www.scopus.com/inward/record.url?scp=85083634461&partnerID=8YFLogxK
U2 - 10.1016/j.jad.2020.03.081
DO - 10.1016/j.jad.2020.03.081
M3 - Article
C2 - 32479313
AN - SCOPUS:85083634461
SN - 0165-0327
VL - 271
SP - 169
EP - 177
JO - Journal of Affective Disorders
JF - Journal of Affective Disorders
ER -