Machine learning based identification of suicidal ideation using non-suicidal predictors in a university mental health clinic

Ballı, Muhammed; Dogan, Asli Ercan; Senol, Sevin Hun; Eser, Hale Yapici

doi:10.1038/s41598-025-97387-4

Download PDF

Article
Open access
Published: 22 April 2025

Machine learning based identification of suicidal ideation using non-suicidal predictors in a university mental health clinic

Muhammed Ballı¹,
Asli Ercan Dogan²,
Sevin Hun Senol³ &
…
Hale Yapici Eser⁴

Scientific Reports volume 15, Article number: 13843 (2025) Cite this article

6761 Accesses
5 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Suicide causes over 700,000 deaths annually worldwide. Mental disorders are closely linked to suicidal ideation, but predicting suicide remains complex due to the multifaceted nature of contributing factors. Traditional assessment tools often fail to capture the interactions that drive suicidal thoughts, underscoring the need for more sophisticated predictive approaches. This study aimed to predict suicidal and self-harm ideation among university students using machine learning models without relying on suicidal behavior related predictors. The goal was to uncover less obvious risk factors and provide deeper insights into the complex relationships between psychiatric symptoms and suicidal ideation. Data from 924 university students seeking mental health services were analyzed using seven machine learning algorithms. Suicidal ideation was assessed through the 9th item of the Patient Health Questionnaire-9. Three predictive models were developed, with the final model utilizing only subdomains from the DSM-5 Level 1 Self Rated Cross-Cutting Symptom Measure. Feature importance was assessed using SHAP and Integrated Gradients techniques. To ensure model generalizability, the best-performing model was externally validated on a separate dataset of 361 individuals. Machine learning models achieved strong predictive accuracy, with logistic regression and neural networks reaching AUC values of 0.80. The final model achieved an AUC of 0.80 on the training data and 0.79 on external validation data. Key predictors of suicidal ideation included personality functioning and depressed mood (both increasing the likelihood), while anxiety and repetitive thoughts were associated with a decreased likelihood. The use of non-suicidal predictors across datasets highlighted psychiatric dimensions relevant to early intervention. This study demonstrates the effectiveness of machine learning in predicting suicidal ideation without relying on suicide-specific inputs. The findings emphasize the critical roles of personality functioning, mood, and anxiety in shaping suicidal ideation. These insights can enhance early detection and personalized interventions, especially in individuals reluctant to disclose suicidal thoughts.

A machine learning approach for predicting suicidal thoughts and behaviours among college students

Article Open access 15 June 2021

Suicide risk prediction for Korean adolescents based on machine learning

Article Open access 28 April 2025

Machine learning and the prediction of suicide in psychiatric populations: a systematic review

Article Open access 09 March 2024

Introduction

Suicide causes over 700,000 deaths annually, representing 1.3% of total deaths, with 58% occurring in individuals under 50¹. According to recent reports by the World Health Organization (WHO), suicide, which previously ranked as the second leading cause of death among individuals aged 15–29, has dropped to fourth place, following road traffic accidents, tuberculosis, and interpersonal violence. These reports also indicate a decrease of approximately 100,000 in the total number of suicide deaths^1,2. This trend may be seen as a positive outcome of better access to psychiatric services; studies show that suicide-related deaths rise when access to mental health care decreases, while they decrease with improved access^3,4. Similarly in terms of age group, among the 4061 suicide cases reported in Turkiye in 2023, the highest number of suicides occurred in the 20–29 age group, which accounted for 27% of total suicide deaths⁵, still underlining the importance of ongoing research to explain suicide.

Efforts to explain suicide began with Durkheim’s sociological approach (1897)⁶, yet no single explanation has prevailed⁷. Biological Suicide Theory emphasizes altered stress responses and serotonergic neurotransmission, whereas Hopelessness Theory links depression-induced hopelessness to suicide^8,9. Klonsky and May’s (2015) Three-Step Theory suggests suicidal desire stems from pain and hopelessness, it increases when pain outweighs connectedness, and escalates to attempts with sufficient capability¹⁰. On the other hand, mental disorders strongly correlate with suicide¹¹; 87% of suicide decedents has prior psychiatric diagnoses¹², and some specific disorders significantly elevate suicide risk—psychotic disorders by 13.2-fold and mood disorders by 12.3-fold¹³. According to a recent review and meta-analysis study, depression, borderline personality disorder, and schizophrenia spectrum disorders represent the most substantial risk factors for suicide among all mental disorders¹⁴. However, mental disorders alone poorly predict attempts due to additional factors¹⁵. A psychological autopsy showed 37% of suicide cases lacked diagnosed mental disorders, underscoring subthreshold disorders and social influences¹⁶. Subthreshold eating disorders, psychosis, anxiety, post-traumatic stress disorder, and depression also significantly elevate suicidal ideation or attempt^{17,18,19,20,21}. Further a comprehensive meta-analysis identified previous psychiatric hospitalization, prior suicide attempts, and past suicidal ideation as prominent risk factors⁷. Suicidal ideation involves thinking and planning of self-harm with intent to die²². Non-suicidal self-injury refers to self-harm without intent to die²³. Suicidal ideation is a critical intervention point in preventing suicide attempt²⁴. However, individuals may conceal suicidal thoughts due to factors like fear of stigmatization, hindering timely support^25,26. Scale based screening tools help assess suicide risk^27,28,29,30, but despite the established link between mental disorders and suicide^11,12, existing tools like ASQ and C-SSRS^27,28 offer only momentary assessments, failing to reveal underlying causes and these tools may provide only a snapshot of the current situation without elucidating underlying causes.

This study focuses on predicting suicidal ideation and self-harm ideation using symptoms of mental disorders from self-report scales and questionnaires, employing Machine Learning (ML) techniques. ML algorithms, particularly supervised learning with labeled data, enhance prediction accuracy in psychiatric studies and suicide prediction^31,32. A meta-analysis found ML methods superior to traditional statistics in predicting suicide-related outcomes due to ML’s effective classification of large datasets, whereas statistical methods focus more on causal mechanisms³³. A recent systematic review reported that 87% of studies achieved an AUC value of 70% or higher, with past suicide attempts and psychiatric diagnoses being the most predictive features³⁴. However, many studies do not have an external validation sample, which is critical to evaluate the generalization performance of the ML model^34,35,36.

In this study, routine psychometric and demographic data from graduate and undergraduate students who applied to Koç University Psychological Counseling and Therapy Center (KUPTEM) at two different intervals were used as ML model predictors. The primary goal was to identify psychiatric dimensions that most accurately explain suicidal ideation, without relying on data directly related to suicide (past attempts or thoughts). In most ML studies, past suicide attempts and thoughts are prominent predictors³⁴, which avoids to explain the other psychiatric dimensions related to suicidal ideation^25,26. The aim was to determine other dimensions predicting suicidal ideation hierarchically and directionally, using as few predictors as possible. Three ML models were created: the first with 32 predictors, the second reduced to 11, and the third using 13 variables directly related to psychiatric symptoms and disorders, utilizing only the subdomains of the DSM-5-TR Self-Rating Level 1 Cross-Cutting Symptom Scale (DSM-5 SRL1). Suicidal/self-harm ideation was assessed using the ninth question of the PHQ-9 questionnaire. Seven different classification algorithms capable of solving linear and non-linear relationships were employed. Models were trained on the first dataset and in order to address the previous gaps, externally validated machine learning models were developed for demonstrating reliability and generalizability. Using DSM-5 SRL1 subdomains, hierarchical importance and directional effects of psychiatric symptoms on suicidal ideation were identified.

Method

Participants

This study involved data from participants who had applied to the Koç University Psychological Counseling and Psychotherapy Center (KUPTEM), which is a comprehensive mental health clinic that provides various clinical services including individual psychotherapy, psychiatric consultation and treatment. It is not a regular counseling center, but more of a mental health clinic. When students initially contact KUPTEM either by email or by visiting the office, they are scheduled for an intake appointment by a clinical psychologist. Before the interview, a series of routine screening questions that help to determine the severity and immediacy of students’ concerns are used for a comprehensive evaluation to collaboratively determine the best management options for students. KUPTEM is committed to delivering culturally competent services that are responsive to the needs, strengths, and identities of Koç University’s diverse community of students. The services are free of charge. The psychopathology of the students that apply, may vary from mild adjustment disorder symptoms to borderline personality disorder with severe suicidal behavior and to schizophrenia. All students give an informed consent for the use of the anonymized scale data for academic purposes before filling the surveys. All applicants who were older than 18 years of age and who filled all the survey data fully and reliably were included in the analysis (Fig. 1). For the building of the machine learning model, a first group of participants’ data was collected from 924 students who applied between September 2019 and October 2022 ( Mean age: 23.54 ± 3.4. age range: 18–40, 66% women). Data from another group of participants composed of 361 students (Mean age: 22.25 ± 3.56, age range: 18–44, 61.5% women) who applied to KUPTEM between January 2023 and January 2024 was used as an external validation sample to use for Methods Sect. 2.8. The primary study and use of the validation sample study was approved by Koç University ethics committee (2022.478.IRB2.075). All procedures have been in accordance with the Declaration of Helsinki.

Suicidal ideation/self-harm definition

Suicidal ideation/self-harm was monitored with the answer to question 9 of the PHQ-9 scale. (“Thoughts that you would be better off dead, or of hurting yourself in some way?“). This question has answers in the range of 0 to 3. Responses other than 0 were accepted as 1 regardless of their severity and the target variable was converted into binary³⁷. The dataset was naturally well-balanced, with 45% of participants reporting suicidal ideation/self-harm in the first dataset and 51.8% in the external validation dataset. Given this relatively balanced distribution between the classes, no additional balancing techniques (such as oversampling or undersampling) were required for the machine learning analysis. This natural balance in the data helps ensure that the models’ performance metrics are not biased by class imbalance issues, as research has shown that class imbalance only becomes problematic when the minority class represents less than approximately 20–25% of the majority class³⁸.

While both the PHQ-9 and DSM-5 SRL1 assess suicidal/self-harm thoughts, question 9 of PHQ-9 was chosen to use as target variable for several reasons. The PHQ-9 question asks about ‘Thoughts that you would be better off dead, or of hurting yourself in some way,’ while the DSM-5 SRL1 item specifically asks about ‘Thoughts of actually hurting yourself.’ Despite their slightly different wording - the DSM-5 SRL1 focuses on active self-harm, while the PHQ-9 captures both thoughts of death and self-harm thought. On the other hand, the two items showed a strong correlation (Spearman’s ρ = 0.68, p < 0.001), which was computed with the scipy library³⁹. Using the same binarization approach as question 9 of PHQ-9, the DSM-5 SRL1 item 11 responses showed 339 positive (presence of self harm ideation) and 585 negative (absence of self harm ideation) cases, which was also used for the further analysis to validate the model.

Psychometric scales and measures used in the study

DSM-5 self-rated level 1 cross-cutting symptom measure

This 23-item assessment tool evaluates major domains of psychopathology⁴⁰. It covers thirteen domains including depression, anger, mania, anxiety, somatization, self-harm, psychosis, sleep disturbances, memory, repetitive thoughts, dissociation, personality functioning and substance use. The anhedonia (item 1) and mood (item 2) components of depression were used separately, and a total of 13 subdomains were used, excluding the self-harm subdomain (item 11). Turkish validation study was conducted by Çökmüş et al.⁴¹.

Patient health questionnaire (PHQ-9)

A nine-item instrument for screening and measuring depression severity²⁹, with Turkish validity established in 2016⁴².

Other assessment tools

Additional validated measures included the Mood Disorder Questionnaire^43,44, SCOFF (Sick-Control-One-Fat-Food) Questionnaire for eating disorders^45,46, Adult ADHD Self-Report Scale^47,48, CAGE (Cut, Annoyed, Guilty, and Eye) Questionnaire for alcohol dependence^49,50, General Anxiety Disorder-7 (GAD-7) scale for anxiety^51,52, and Short Dissociative Experiences Scale for dissociative measures^53,54,55. Scale details are summarized in Table 1.

Table 1 Mental health assessment tools used in the study.

Full size table

Data preparation and preprocessing

Prior to model training, the predictors underwent a preprocessing step where they were scaled using a Min-Max Scaler to ensure that all variables were on the same scale, thus facilitating better performance across all machine learning algorithms. This step is critical, especially for models that are sensitive to the scale of input features, such as K-Nearest Neighbors (KNN) and Support Vector Machines (SVM).

Machine learning algorithms

Seven different ML algorithms were used in the study: KNN, Naive Bayes, Logistic Regression, Decision Trees, SVM, Random Forest, and Artificial Neural Networks (Fig. 2). First of all, the algorithms were used in the Python programming language with default hyperparameters in the scikit-learn library⁵⁶. In order to reach the optimum performance of ML models, the hyperparameters were optimized by the Bayesian optimization method, due to its ability to efficiently find the optimum parameters in a high-dimensional and complex search space⁵⁷. The MATLAB Classification Learner Toolbox was used to perform the optimization of algorithms other than logistic regression, as it is efficient with its parallel computation capability⁵⁸. Logistic regression was optimized using the 0.10.1 version of the skopt, which integrates well with scikit-learn, due to the limitations of MATLAB’s optimization capabilities. In all machine learning models, the mean Receiver Operating Characteristic (ROC) Area Under the Curve (AUC) values were evaluated using a 5-fold cross-validation approach, both with default hyperparameters and during hyperparameter optimization. Additionally, a neural network with Multi-Layer Perceptron (MLP) architecture was developed for the prediction task using Keras deep learning framework⁵⁹. The results of the machine learning models were visualized using the matplotlib library⁶⁰.

Machine learning prediction models

In the first model of the study, suicidal/self-harm ideation was tried to be predicted by using 32 features belonging to 924 individuals. The initial machine learning model was developed using a comprehensive set of predictors which encompassed a range of demographic, psychometric, and clinical variables thought to be relevant to suicidal ideation.

In the second model, it was aimed to simplify the predictive model by reducing the number of predictors. Specifically, the DSM-5 SRL1 scale items were replaced by the DSM-5 Self-Rating Level-1 scale total score, which reduced the number of predictors from 32 to 11.

Finally, the third model was developed using subdomains of the DSM-5 SRL1. These subdomains represent different psychiatric symptoms and disorders. The subdomains were derived from the original scale by summing the scores of relevant items and included anhedonia (item 1), depressed mood (item 2), anger (item 3), mania (item 4 + item 5), anxiety (item 6 + item 7 + item 8), somatization (item 9 + item 10), psychosis (item 12 + item 13), sleep (item 14), memory (item 15), repetitive thoughts (item 16 + item 17), dissociation (item 18), personality functioning (item 19 + item 20), and substance use (item 21 + item 22 + item 23)⁴⁰. This model focused on understanding the contribution of specific psychiatric symptoms and disorders to suicidal ideation. Table 2 summarizes the predictors used across the models.

Table 2 Predictors used in machine learning models.

Full size table

Neural network analysis

For the model using DSM-5 SRL1 subdomains, it was decided to build a more qualified neural network algorithm because, after adjusting the hyper-parameters of the algorithms, the neural networks showed the best performance in the third model. This neural network model with MLP architecture consisted of an input layer with the number of neurons corresponding to the number of features in the dataset (DSM-5 SRL1 subdomains) and two hidden layers with 64 and 32 neurons, respectively. The neural network was trained with the first data set of 924 participants. Non-linearity was ensured by using the Relu activation function between layers⁶¹. For binary classification, a sigmoid activation function was used in the output layer consisting of only one neuron⁶². The model was compiled using the Adam optimizer, which is highly capable in adaptive learning rate optimization, and the binary cross-entropy loss function⁶³. The early stopping method was used to prevent overfitting⁶⁴. If no increase in the validation AUC was observed for 10 epochs, training was stopped.

Two different methods were used to observe the hierarchical and directional contributions of the 13 subdomains of DSM-5 SRL1 to suicidal ideation. These methods are SHapley Additive exPlanations (SHAP) and Integrated Gradients, respectively. The SHAP method assigns feature importance values (SHAP values) based on game theory by comparing model predictions with and without each feature⁶⁵. Integrated Gradients was also used to validate the SHAP results by accumulating gradients from a baseline to the actual inputs, providing a complementary and reliable assessment of feature importance⁶⁶.

External validation of the neural network model

This model was validated using data from a second group of 361 individuals. This second group included 361 students who applied between January 2023 and January 2024. Mean age: 22.25 ± 3.56, age range: 18–44, 61.5% women. Applying weights and biases obtained from the training of initial dataset to a separate external validation dataset tested the model’s generalizability and accuracy in predicting suicidal ideation across diverse cohorts. Data underwent Min-Max scaling, with model performance evaluated by AUC.

Additionally, the final MLP model was assessed for further predicting the DSM-5 SRL1 item 11 responses using the same methodology.

Results

Machine learning models results

In the analysis for the first model, results indicated that logistic regression reached the highest mean AUC value of 0.78, while random forest followed closely behind with a mean score of 0.77. In contrast, the decision tree model had the lowest mean AUC value of 0.64. Other models such as neural networks and SVM exhibited mean AUC performances of 0.72 and 0.76, respectively (Fig. 3A). Post-tuning, all models showed significant improvements in terms of mean AUC value: KNN attained 0.76, logistic regression improved to 0.79, neural networks also hit 0.79, SVMs achieved 0.80, decision tree increased to 0.71, Ensemble method improved to 0.79, and Naive Bayes scored 0.75.

In the second model results revealed that logistic regression reached an AUC value of 0.80, neural networks and SVM showed mean AUC values of 0.78 and 0.77, respectively. In contrast, the decision tree model recorded the lowest mean AUC value at 0.64 (Fig. 3B). The mean AUC values obtained after optimization of the hyperparameters are as follows: KNN 0.75, logistic regression 0.80, neural networks 0.80, SVM 0.80, decision tree 0.75, ensemble methods 0.79 and naive bayes 0.75.

In the third (last) model logistic regression reached a mean AUC value of 0.75, followed by random forest and naive bayes with mean AUC values of 0.74. Decision tree model recorded the lowest mean AUC value of 0.553 (Fig. 3C). After hyperparameter optimization in the last model, the neural networks reached 0.76 mean AUC value. Table 3 shows the AUC values of the algorithms across the models after hyperparameter optimization. The optimized parameters are presented in Supplementary Table 1.

Table 3 Post hyperparameters tuning AUC results of models.

Full size table

In the MLP based neural network model created with keras with the predictors used in the third model, the training AUC increased from around 0.450 to 0.850, while also the validation AUC increased with significant improvements, stabilizing around 0.800. This MLP model was evaluated on the external validation dataset. The model achieved an AUC value of 0.79 on the external validation data. Figure 4 illustrates the change in the loss and AUC performances of the MLP model built with Keras according to the epochs and external validation data set ROC curve respectively. Furthermore, when this MLP model was applied to predict DSM-5 SRL1 item 11 responses using the same binarization methodology as before, it achieved an AUC of 0.78.(Supplementary Fig. 1).

Feature contribution analysis results

In the SHAP summary plot for the first dataset used to train the MLP model several features stand out as significant contributors to the model’s predictions of suicidal ideation (Fig. 5A). Personality functioning showed major positive impact, with higher values increasing predicted suicidal ideation. Anxiety exhibited strong negative influence, as increased SHAP values corresponded with decreased suicidal ideation probability. Similar to personality functioning, an increase in depressed mood score had an effect on increasing the probability of the model predicting suicidal ideation. The increase in repetitive thoughts decreased the model’s prediction probability of suicidal ideation.

In external validation (Fig. 5B), anxiety maintained its negative contribution to suicidal ideation probability. Personality functioning remained a robust positive predictor consistent with the first dataset. Notably, anger emerged as an influential feature in the external dataset. Depressed mood and repetitive thoughts maintained their respective positive and negative relationships with suicidal ideation probability across both datasets.

The Integrated Gradients analysis of the datasets revealed the relative contributions of the DSM-5 SRL1 subdomains to suicidal ideation prediction. Anxiety, anhedonia, depressed mood, repetitive thoughts, and personality functioning were among the top six predictors for both datasets. Anxiety and repetitive thoughts again showed negative contributions, while depressed mood, personality functioning, and anhedonia demonstrated positive contributions to the model’s predictions (Supplementary Table 2). Supplementary Fig. 2 and Supplementary Fig. 3 show the results of this analysis for the first and external dataset respectively.

Discussion

This study demonstrated that suicidal/self-harm ideation can be predicted by various ML algorithms without relying on suicide-specific predictors like past attempts or explicit thoughts. It is one of the first to predict suicidal ideation in a large university student sample using machine learning (ML) without depending on such predictors. Predicting suicidal ideation through routine psychometric assessments offers a discreet yet powerful tool for early intervention. Clinicians could integrate ML-based screening tools from models like this study into routine evaluations to identify high-risk students earlier, facilitating preventive interventions. These tools enable mental health professionals to focus on students who may not openly report suicidal ideation due to stigma or fear of consequences²⁵. Furthermore, the results demonstrate potential for personalized mental health care. University services could tailor interventions based on specific psychiatric dimensions contributing most to suicidal ideation, such as depressive mood and personality functioning.

An ML model’s performance is considered moderate with accuracy of 0.70–0.79, good at 0.80–0.89, and excellent above 0.90⁶⁷. In the three models created for this study with varying predictors, both linear and non-linear algorithms achieved good AUC value, especially after hyperparameter tuning. Logistic regression performed consistently well, and algorithms like SVM, neural networks, and random forest, which capture complex relationships, were also successful. This suggests suicidal ideation has both linear and intricate relationships with input features. These algorithms achieved above 0.90 AUC value in an adolescent suicide risk study⁶⁸. However, their use of oversampling to address data imbalance may not generalize to real-world data. In another college study, an AUC of 0.80 was obtained using random forest, with 12-month suicidal ideation as the most important variable⁶⁹. Similarly, a UK Biobank study achieved an AUC around 0.90, with past suicide attempt as a key feature⁷⁰. It was emphasized that suicide-specific characteristics could significantly improve prediction accuracy but were excluded in this study as they may overshadow other contributing factors.

Notably, despite fewer predictors in the second model, the AUC value increased to 80%, in line with the aim of the study. In model 3, predicting suicidal ideation using only DSM-5 SRL1 subdomains was most successful with logistic regression, achieving an AUC of 0.755 using default parameters. Although initially underperforming, neural network performance improved after hyperparameter optimization, leading to a sophisticated MLP architecture that achieved an AUC of 0.80 on 924 participants and 0.79 on an external validation set of 361 participants. The model’s ability to predict outcomes on an independent dataset collected at a different time—without exposure during training—with similar accuracy underscores its reliability and generalizability³⁵. Additionally, using the trained model to predict item 11 of the DSM-5 SRL1 (self-harm ideation), which was previously excluded during training, an AUC of 0.78 was obtained, further confirming its generalization capacity.

A notable feature of the final model is its reliance on psychiatric symptom predictors. Interpretability analyses using SHAP consistently highlighted personality functioning, anxiety, depressed mood, and repetitive thoughts among the top predictors, with uniform directional effects across datasets. Integrated Gradients supported these findings, similarly identifying anxiety, depressed mood, personality functioning, anhedonia, and repetitive thoughts as leading predictors. While feature importance rankings differed slightly due to methodological variations^65,66, the consistent directionality and overlap reinforce the significance of these symptoms as key drivers of model predictions. This application of interpretable machine learning techniques aligns with recent recommendations emphasizing explainable artificial intelligence in clinical mental health research⁷¹.

Personality functioning was consistently among the top contributors in both data sets. The DSM-5 SRL1 assesses this with two basic questions: the first deals with the individual’s inability to recognize who they are and to know what they want from life, and the second deals with the inability to form close and enjoyable relationships. These questions explore the person’s identity, sense of purpose, and relationships with others. Deficits in these areas, such as difficulty connecting with others and lacking purpose, correlate with increased suicide risk, while having life purpose is protective⁷². Furthermore, the personality functioning domain is significantly negatively associated with self-esteem in college populations, and low self-esteem has been shown to significantly predict suicidal ideation^73,74. Further supporting its importance, Lear and Pepper⁷⁵ found that self-concept clarity, closely related to identity, is significantly associated with non-suicidal self-injury in college students⁷⁵, and similar identity and relationship disturbances characterize borderline personality disorder, frequently associated with suicidal behaviors^76,77.

While anxiety has a significant positive relationship with suicidal ideation in many studies^78,79, our results showed increased anxiety associated with decreased likelihood of suicidal/self-harm ideation. This unexpected inverse relationship aligns with de Klerk et al.‘s analysis, revealing that somatic anxiety symptoms might protect against self-harm and suicidal ideation when controlling for other variables, highlighting the importance of differentiating anxiety subtypes in risk assessment⁸⁰. Capron et al. also found that elevated anxiety sensitivity related to physical concerns may be protective against suicide risk among individuals with higher cognitive anxiety concerns⁸¹. These findings suggest that rather than anxiety directly decreasing suicidal ideation, the neural network model may capture complex interactions integrating increased anxiety with decreased ideation in its predictions.

The repetitive thoughts domain assesses symptoms related to obsessive-compulsive disorder (OCD). SHAP and Integrated Gradients analyses both indicated that increased repetitive thoughts were associated with decreased likelihood of suicidal ideation. While counterintuitive, this aligns with some of the existing findings. Fernández de la Cruz et al. found that comorbid anxiety disorders acted as a protective factor against suicide in a large OCD cohort⁸². Pellegrini et al. also found that lifetime comorbid anxiety disorder had a protective effect against suicidal ideation in OCD patients⁸³.

Another point is that our ML models’ accuracy did not exceed 0.80. The type of data affects accuracy; a recent review showed that models using administrative data achieved an average AUC of 0.91, while survey data averaged 0.80⁸⁴. Since this study also uses survey-based data, it is noteworthy to achieve this average. The mental disorder-related predictors used may also explain this result. Although mental disorders are highly associated with suicidal behavior, other factors predict it. O’Connor and Nock (2014) review evidence showing that interpersonal stressors and adverse life events are prominent risk factors for suicidal ideation and behavior, with findings demonstrating this relationship can persist even after controlling for mental disorders⁸⁵. That underscores the need for future research to examine factors beyond psychiatric diagnoses, including adverse life events, interpersonal stressors, coping mechanisms, and social factors, to develop a more comprehensive understanding of suicide risk prediction. Biological samples and neuroimaging data may be other predictors to increase accuracy. Neuroimaging studies have revealed structural differences in brain regions linked to suicidal tendencies^86,87,88. Potential biomarkers include abnormalities in serotonin signaling and inflammatory markers like IL-1β and IL-6 levels^89,90. These markers may enhance prediction accuracy when used alongside neuroimaging data. While suicide prediction offers important benefits, such as early intervention and personalised prevention strategies, it should be considered that it also raises ethical concerns, such as privacy issues and false positive or negative risk.

The clinical relevance of this work is significant given that the 20–29 age group, which comprises 27% of suicide deaths in Türkiye⁵. In university mental health settings, stigma may prevent students from openly discussing suicidal thoughts^25,26, making indirect assessment methods clinically valuable. Our machine learning approach surpasses traditional assessments by identifying suicide risk through routine psychiatric evaluations without explicit suicide-related questions, facilitating standardized screening across large student populations, and pinpointing psychiatric domains requiring clinical attention. Rather than replacing clinical judgment, these models help clinicians prioritize high-risk cases and guide targeted interventions. To mitigate overfitting³⁶, external validation and model simplification strategies were used and achieved consistent performance across datasets. Predicting DSM-5 SLR1 item 11 responses further validated the model’s generalizability. Table 4 summarizes 10 studies predicting suicidal ideation via machine learning; notably, most studies employed internal validation methods, and 8 included suicide-specific predictors.

Table 4 Summary of methodologies and performance metrics of machine learning studies predicting suicidal ideation.

Full size table

Several limitations of this study should be noted. The cross-sectional nature of our data means that while our models can identify concurrent associations between psychiatric symptoms and suicidal ideation, they cannot predict future suicidal behavior. The term “prediction” is used here in the machine learning sense, referring to the models’ ability to classify cases based on learned patterns, rather than temporal prediction of future events. Furthermore, our data do not include information on actual suicide attempts or completed suicides; the analysis is based solely on self-reported suicidal/self-harm ideation, which may not always translate to suicidal behavior. Additionally, the study was conducted exclusively within a college population, limiting generalizability. The unique characteristics of college students may influence psychiatric symptoms and their relationship to suicidal ideation differently from the general population. Therefore, caution should be exercised when applying these results to broader populations, and further research is needed across diverse settings. Our models were defined based on previous theoretical knowledge about psychiatric dimensions. Dimensionality reduction by principal component analysis (PCA) was not preferred due to its tendency to compromise interpretability. While PCA effectively reduces dimensionality, it transforms variables into principal components, making it difficult to interpret each predictor’s contribution⁹⁹. Given the importance of interpretability and preserving the direct influence of original variables, PCA was not appropriate. Additionally, one main goal is to identify a single scale that clinicians can use to predict suicide risk. The direct relationship between original variables and the outcome is critical, supporting the decision to avoid PCA.

In conclusion, this study demonstrates that machine learning algorithms can effectively predict suicidal/self-harm ideation even without suicide-specific predictors such as past attempts or explicit suicidal thoughts. Findings emphasize the nuanced roles of cognitive and psychiatric factors, including personality functioning, depressed mood, anxiety, repetitive thoughts and anhedonia. Notably, inverse relationships between anxiety or repetitive thoughts and suicidal ideation challenge conventional assumptions, highlighting the complexity of these interactions. Limitations include reliance on survey-based data and restricted generalizability beyond university populations. Future research could incorporate biological markers like neuroimaging or inflammation biomarkers to enhance prediction accuracy. Overall, machine learning offers significant potential for improved early detection and targeted interventions in mental health care.

Data availability

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

World Health Organization. Suicide Worldwide in 2019: Global Health Estimates. (2021).
World Health Organization. Preventing Suicide: A Global Imperative (World Health Organization, 2014).
Hung, P., Busch, S. H., Shih, Y. W., McGregor, A. J. & Wang, S. Changes in community mental health services availability and suicide mortality in the US: A retrospective study. BMC Psychiatry. 20, 1–12 (2020).
Article Google Scholar
Pirkola, S., Sund, R., Sailas, E. & Wahlbeck, K. Community mental-health services and suicide rate in Finland: A nationwide small-area analysis. Lancet 373, 147–153 (2009).
Article PubMed Google Scholar
Türkiye İstatistik Kurumu (TÜİK). Ölüm ve Ölüm Nedeni İstatistikleri. (2024). Available at: https://data.tuik.gov.tr/Bulten/Index?p=Olum-ve-Olum-Nedeni-Istatistikleri–2024–53709
Durkheim, E. Suicide: A Study in Sociology (Routledge, 2005).
Franklin, J. C. et al. Risk factors for suicidal thoughts and behaviors: A meta-analysis of 50 years of research. Psychol. Bull. 143, 187–232 (2017).
Article PubMed Google Scholar
Oquendo, M. A. et al. Toward a biosignature for suicide. Am. J. Psychiatry. 171, 1259–1277 (2014).
Article PubMed PubMed Central Google Scholar
Beck, A. T. Hopelessness as a predictor of eventual suicide. Ann. N Y Acad. Sci. 487, 90–96 (1986).
Article CAS ADS PubMed Google Scholar
Klonsky, E. D. & May, A. M. The three-step theory (3ST): A new theory of suicide rooted in the ideation-to-action framework. Int. J. Cogn. Ther. 8, 114–129 (2015).
Article Google Scholar
Ferrari, A. J. et al. The burden attributable to mental and substance use disorders as risk factors for suicide: findings from the global burden of disease study 2010. PLoS One. 9, e91936 (2014).
Article ADS PubMed PubMed Central Google Scholar
Arsenault-Lapierre, G., Kim, C. & Turecki, G. Psychiatric diagnoses in 3275 suicides: A meta-analysis. BMC Psychiatry. 4, 1–11 (2004).
Article Google Scholar
Too, L. S. et al. The association between mental disorders and suicide: A systematic review and meta-analysis of record linkage studies. J. Affect. Disord. 259, 302–313 (2019).
Article PubMed Google Scholar
Favril, L., Yu, R., Uyar, A., Sharpe, M. & Fazel, S. Risk factors for suicide in adults: systematic review and meta-analysis of psychological autopsy studies. BMJ Ment Health. 25, 148–155 (2022).
Google Scholar
Nock, M. K. Why do people hurt themselves? New insights into the nature and functions of self-injury. Curr. Dir. Psychol. Sci. 18, 78–83 (2009).
Article PubMed PubMed Central Google Scholar
Milner, A., Sveticic, J. & De Leo, D. Suicide in the absence of mental disorder? A review of psychological autopsy studies across countries. Int. J. Soc. Psychiatry. 59, 545–554 (2013).
Article PubMed Google Scholar
Lipson, S. K. & Sonneville, K. R. Understanding suicide risk and eating disorders in college student populations: results from a National study. Int. J. Eat. Disord. 53, 229–238 (2020).
Article PubMed Google Scholar
Yates, K. et al. Association of psychotic experiences with subsequent risk of suicidal ideation, suicide attempts, and suicide deaths: A systematic review and meta-analysis. JAMA Psychiatry. 76, 180–189 (2019).
Article PubMed Google Scholar
Gilmour, H. Threshold and subthreshold generalized anxiety disorder (GAD) and suicide ideation. Stat. Can., 13–21 (2016).
Kim, H. Y. et al. Prevalence of lifetime psychiatric disorders and suicidality in adults with subthreshold posttraumatic stress disorder: A population-based nationwide study in Korea. Psychol. Trauma. (2021).
Balázs, J. et al. Adolescent subthreshold depression and anxiety: psychopathology, functional impairment and increased suicide risk. J. Child. Psychol. Psychiatry. 54, 670–677 (2013).
Article ADS PubMed Google Scholar
Centers for Disease Control and Prevention & Definitions Self-directed violence. Injury Prev. Control: Div. Violence Prev. (2015).
Crosby, A., Ortega, L. & Melanson, C. Self-directed violence surveillance: uniform definitions and recommended data elements. Centers Disease Control Prev. (2011).
Jobes, D. A. & Joiner, T. E. Reflections on suicidal ideation. Crisis 40, 227–230 (2019).
Article PubMed Google Scholar
Hom, M. A., Stanley, I. H., Podlogar, M. C. & Joiner, T. E. Are you having thoughts of suicide? Examining experiences with disclosing and denying suicidal ideation. J. Clin. Psychol. 73, 1382–1392 (2017).
Article PubMed Google Scholar
Hallford, D. J. et al. Disclosure of suicidal ideation and behaviours: A systematic review and meta-analysis of prevalence. Clin. Psychol. Rev. 101, 102272 (2023).
Article CAS PubMed Google Scholar
Horowitz, L. M. et al. Ask Suicide-Screening questions (ASQ): A brief instrument for the pediatric emergency department. Arch. Pediatr. Adolesc. Med. 166, 1170–1176 (2012).
Article PubMed PubMed Central Google Scholar
Posner, K. et al. The Columbia–Suicide severity rating scale: initial validity and internal consistency findings from three multisite studies with adolescents and adults. Am. J. Psychiatry. 168, 1266–1277 (2011).
Article PubMed PubMed Central Google Scholar
Kroenke, K., Spitzer, R. L. & Williams, J. B. The PHQ–9: validity of a brief depression severity measure. J. Gen. Intern. Med. 16, 606–613 (2001).
Article CAS PubMed PubMed Central Google Scholar
Osman, A. et al. The suicidal behaviors Questionnaire-Revised (SBQ-R): validation with clinical and nonclinical samples. Assessment 8, 443–454 (2001).
Article CAS PubMed Google Scholar
Chekroud, A. M. et al. The promise of machine learning in predicting treatment outcomes in psychiatry. World Psychiatry. 20, 154–170 (2021).
Article PubMed PubMed Central Google Scholar
Grendas, L. N., Chiapella, L., Rodante, D. E. & Daray, F. M. Comparison of traditional model-based statistical methods with machine learning for the prediction of suicide behaviour. J. Psychiatr Res. 145, 85–91 (2022).
Article Google Scholar
Schafer, K. M., Kennedy, G., Gallyer, A. & Resnik, P. A direct comparison of theory-driven and machine learning prediction of suicide: A meta-analysis. PLoS One. 16, e0249833 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pigoni, A. et al. Machine learning and the prediction of suicide in psychiatric populations: A systematic review. Transl Psychiatry. 14, 140 (2024).
Article PubMed PubMed Central Google Scholar
Ramspek, C. L. et al. Lessons learnt when accounting for competing events in the external validation of time-to-event prognostic models. Int. J. Epidemiol. 51, 615–625 (2022).
Article PubMed Google Scholar
Dwyer, D. B., Falkai, P. & Koutsouleris, N. Machine learning approaches for clinical psychology and psychiatry. Annu. Rev. Clin. Psychol. 14, 91–118 (2018).
Article PubMed Google Scholar
Jordan, P., Shedden-Mora, M. C. & Löwe, B. Predicting suicidal ideation in primary care: an approach to identify easily assessable key variables. Gen. Hosp. Psychiatry. 51, 106–111 (2018).
Article PubMed Google Scholar
Silva, E. J. & Zanchettin, C. On the existence of a threshold in class imbalance problems. In 2015 IEEE international conference on systems, man, and cybernetics (pp. 2714–2719). IEEE. (2015), October.
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in python. Nat. Methods. 17 (3), 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar
Narrow, W. E. et al. DSM–5 field trials in the united States and Canada, part III: development and reliability testing of a cross-cutting symptom assessment for DSM–5. Am. J. Psychiatry. 170, 71–82 (2013).
Article PubMed Google Scholar
Çökmüş, F. P., Balıkçı, K. & Aydemir, Ö. Reliability and validity of Turkish form of DSM–5 Self-Rated level 1 Cross-Cutting symptom Scale-Adult version. Anadolu Psikiyatri Derg. 18, 5–13 (2017).
Article Google Scholar
Sarı, Y. E. et al. Turkish reliability of the patient health Questionnaire–9. Biomed. Res. India. 27, S460–S462 (2016).
Google Scholar
Hirschfeld, R. M. et al. Development and validation of a screening instrument for bipolar spectrum disorder: the mood disorder questionnaire. Am. J. Psychiatry. 157, 1873–1875 (2000).
Article CAS PubMed Google Scholar
Konuk, N. et al. Validation of the Turkish version of the mood disorder questionnaire for screening bipolar disorders. Turk. Psikiyatri Derg. 18, 147–154 (2007).
PubMed Google Scholar
Morgan, J. F., Reid, F. & Lacey, J. H. The SCOFF questionnaire: assessment of a new screening tool for eating disorders. BMJ 319, 1467–1468 (1999).
Article CAS PubMed PubMed Central Google Scholar
Aydemir, Ö., Köksal, B., Sapmaz, S. Y. & Yüceyar, H. Kadın Üniversite öğrencilerinde REZZY Yeme Bozuklukları ölçeği Türkçe formunun Güvenilirlik ve geçerliliği. Anadolu Psikiyatri Derg. 16, 31–37 (2015).
Article Google Scholar
Kessler, R. C. et al. The world health organization adult ADHD Self-Report scale (ASRS): A short screening scale for use in the general population. Psychol. Med. 35, 245–256 (2005).
Article PubMed Google Scholar
Doğan, S., Öncü Çetinkaya, B., Saraçoğlu Varol, G. & Küçükgöncü, S. Erişkin Dikkat Eksikliği Hiperaktivite Bozukluğu Kendi bildirim ölçeği (ASRS-v1.1): Türkçe formunun geçerlilik ve güvenilirliği. Anadolu Psikiyatri Derg. 10, 77–87 (2009).
Google Scholar
Mayfield, D., McLeod, G. & Hall, P. The CAGE questionnaire: validation of a new alcoholism screening instrument. Am. J. Psychiatry. 131, 1121–1123 (1974).
Article CAS PubMed Google Scholar
Arıkan, Z., Özdemir, Y. D., Candansayar, S. & Işık, E. SMAST ve CAGE tarama testlerinin alkolizmin tanısal önemi üzerine bir çalışma. In 27. Ulusal Psikiyatri Kongresi, Antalya (1991).
Spitzer, R. L., Kroenke, K., Williams, J. B. & Löwe, B. A brief measure for assessing generalized anxiety disorder: the GAD–7. Arch. Intern. Med. 166, 1092–1097 (2006).
Article PubMed Google Scholar
Konkan, R., Şenormancı, Ö., Güçlü, O., Aydin, E. & Sungur, M. Z. Validity and reliability study for the Turkish adaptation of the generalized anxiety Disorder–7 (GAD–7) scale. Arch. Neuropsychiatry. 50, 53–58 (2013).
Article Google Scholar
Dalenberg, C. & Carlson, E. Brief Dissociative Experiences Scale (DES-B) (2010).
Bernstein, E. M. & Putnam, F. W. Development, reliability, and validity of a dissociation scale. J. Nerv. Ment Dis. 174, 727–735 (1986).
Article CAS PubMed Google Scholar
Şar, V. et al. The reliability and validity of the Turkish version of the Dissociative Experiences Scale (DES-II). In Proceedings of the 33rd National Congress of Psychiatry, 55–64 (1997).
Pedregosa, F. et al. Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet Google Scholar
Snoek, J., Larochelle, H. & Adams, R. P. Practical bayesian optimization of machine learning algorithms. Adv. Neural Inf. Process. Syst. 25, (2012).
The MathWorks Inc. MATLAB Version: 9.13.0 (R2022b) (The MathWorks Inc., 2022).
Chollet, F. Keras [Online Resource]. (2015).
Hunter, J. D. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 9 (03), 90–95 (2007).
Article Google Scholar
Glorot, X. & Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 249–256 (2010).
Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst. 2, 303–314 (1989).
Article MathSciNet Google Scholar
Kingma, D. P., Ba, J. & Adam A method for stochastic optimization. ArXiv Preprint. arXiv, 14126980 (2014).
Google Scholar
Prechelt, L. Early stopping—but when? In Neural Networks: Tricks of the Trade, 55–69 (Springer, (2002).
Scott, M. & Su-In, L. A unified approach to interpreting model predictions. Adv. Neural. Inf. Process. Syst. 30, 4765–4774 (2017).
Google Scholar
Sundararajan, M., Taly, A. & Yan, Q. Axiomatic attribution for deep networks. In International Conference on Machine Learning, 3319–3328 (2017).
Šimundić, A. M. Measures of diagnostic accuracy: basic definitions. EJIFCC 19, 203–211 (2009).
PubMed PubMed Central Google Scholar
Kim, K. W., Lim, J. S., Yang, C. M., Jang, S. H. & Lee, S. Y. Classification of adolescent psychiatric patients at high risk of suicide using the personality assessment inventory by machine learning. Psychiatry Investig. 18, 1137–1144 (2021).
Article PubMed PubMed Central Google Scholar
Macalli, M. et al. A machine learning approach for predicting suicidal thoughts and behaviours among college students. Sci. Rep. 11, 11363 (2021).
Article CAS ADS PubMed PubMed Central Google Scholar
Wang, J. et al. Prediction of suicidal behaviors in the middle-aged population: machine learning analyses of UK biobank. JMIR Public. Health Surveill. 9, e43419 (2023).
Article PubMed PubMed Central Google Scholar
Hussain, S. Challenges and future research directions in automated detection of mental illness using machine learning. Acad. Bull. Mental Health. 2 (2), 56–60 (2024).
Article Google Scholar
Li, W., Dorstyn, D. S. & Jarmon, E. Identifying suicide risk among college students: A systematic review. Death Stud. 44, 450–458 (2020).
Article PubMed Google Scholar
Bravo, A. J., Villarosa-Hurlocker, M. C. & Pearson, M. R. College student mental health: an evaluation of the DSM–5 self-rated level 1 cross-cutting symptom measure. Psychol. Assess. 30, 1382–1389 (2018).
Article PubMed PubMed Central Google Scholar
Wilburn, V. R. & Smith, D. E. Stress, self-esteem, and suicidal ideation in late adolescents. Adolescence 40, 33–45 (2005).
PubMed Google Scholar
Lear, M. K. & Pepper, C. M. Self-concept clarity and emotion dysregulation in nonsuicidal self-injury. J. Pers. Disord. 30, 813–827 (2016).
Article PubMed Google Scholar
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders: DSM-5 5th edn (American Psychiatric Association, 2013).
Pompili, M., Girardi, P., Ruberto, A. & Tatarelli, R. Suicide in borderline personality disorder: A meta-analysis. Nord J. Psychiatry. 59, 319–324 (2005).
Article PubMed Google Scholar
Sareen, J. et al. Anxiety disorders and risk for suicidal ideation and suicide attempts: A population-based longitudinal study of adults. Arch. Gen. Psychiatry. 62, 1249–1257 (2005).
Article PubMed Google Scholar
Choi, H. Y. et al. A study on correlation between anxiety symptoms and suicidal ideation. Psychiatry Investig. 8, 320–326 (2011).
Article PubMed PubMed Central Google Scholar
de Klerk, S. et al. Prevalence and correlates of lifetime deliberate self-harm and suicidal ideation in naturalistic outpatients: the Leiden routine outcome monitoring study. J. Affect. Disord. 133, 257–264 (2011).
Article PubMed Google Scholar
Capron, D. W., Cougle, J. R., Ribeiro, J. D., Joiner, T. E. & Schmidt, N. B. An interactive model of anxiety sensitivity relevant to suicide attempt history and future suicidal ideation. J. Psychiatr Res. 46, 174–180 (2012).
Article PubMed Google Scholar
de la Fernández, L. et al. Suicide in obsessive–compulsive disorder: A population-based study of 36,788 Swedish patients. Mol. Psychiatry. 22, 1626–1632 (2017).
Article Google Scholar
Pellegrini, L. et al. Suicide attempts and suicidal ideation in patients with obsessive-compulsive disorder: A systematic review and meta-analysis. J. Affect. Disord. 276, 1001–1021 (2020).
Article PubMed Google Scholar
Somé, N. H., Noormohammadpour, P. & Lange, S. The use of machine learning on administrative and survey data to predict suicidal thoughts and behaviors: A systematic review. Front. Psychiatry. 15, 1291362 (2024).
Article PubMed PubMed Central Google Scholar
O’Connor, R. C. & Nock, M. K. The psychology of suicidal behaviour. Lancet Psychiatry. 1 (1), 73–85 (2014).
Article PubMed Google Scholar
Wagner, G. et al. Prefrontal cortical thickness in depressed patients with high-risk for suicidal behavior. J. Psychiatr Res. 46, 1449–1455 (2012).
Article PubMed Google Scholar
Campos, A. I. et al. Brain correlates of suicide attempt in 18,925 participants across 18 international cohorts. Biol. Psychiatry. 90, 243–252 (2021).
Article PubMed PubMed Central Google Scholar
Gifuni, A. J. et al. Brain cortical and subcortical morphology in adolescents with depression and a history of suicide attempt. J. Psychiatry Neurosci. 46, E347–E357 (2021).
Article PubMed PubMed Central Google Scholar
Sullivan, G. M. et al. Positron emission tomography quantification of serotonin1A receptor binding in suicide attempters with major depressive disorder. JAMA Psychiatry. 72, 169–178 (2015).
Article PubMed PubMed Central Google Scholar
Black, C. & Miller, B. J. Meta-analysis of cytokines and chemokines in suicidality: distinguishing suicidal versus nonsuicidal patients. Biol. Psychiatry. 78, 28–37 (2015).
Article CAS PubMed Google Scholar
Morales, S. et al. Acute mental discomfort associated with suicide behavior in a clinical sample of patients with affective disorders: ascertaining critical variables using artificial intelligence tools. Front. Psychiatry. 8, 7 (2017).
Article PubMed PubMed Central Google Scholar
Jung, J. S. et al. Prediction models for high risk of suicide in Korean adolescents using machine learning techniques. PLoS One, 14(6), e0217639. (2019).
Ge, F., Jiang, J., Wang, Y., Yuan, C. & Zhang, W. Identifying suicidal ideation among Chinese patients with major depressive disorder: evidence from a real-world hospital-based study in China. Neuropsychiatr. Dis. Treat., 665–672. (2020).
Kim, S., Lee, H. K. & Lee, K. Detecting suicidal risk using MMPI–2 based on machine learning algorithm. Sci. Rep. 11 (1), 15310 (2021).
Article CAS PubMed PubMed Central Google Scholar
Song, S. I., Hong, H. T., Lee, C. & Lee, S. B. A machine learning approach for predicting suicidal ideation in post stroke patients. Sci. Rep. 12 (1), 15906 (2022).
Article CAS ADS PubMed PubMed Central Google Scholar
Huang, Y. et al. Comparison of three machine learning models to predict suicidal ideation and depression among Chinese adolescents: A cross-sectional study. J. Affect. Disord. 319, 221–228 (2022).
Article PubMed Google Scholar
Meda, N., Pardini, S., Rigobello, P., Visioli, F. & Novara, C. Frequency and machine learning predictors of severe depressive symptoms and suicidal ideation among university students. Epidemiol. Psychiatric Sci., 32, e42. (2023).
Sara, S. S., Rahman, M. A., Rahman, R. & Talukder, A. Prediction of suicidal ideation with associated risk factors among university students in the Southern part of Bangladesh: machine learning approach. J. Affect. Disord. 349, 502–508 (2024).
Article PubMed Google Scholar
Kim, S. B. & Rattakorn, P. Unsupervised feature selection using weighted principal components. Expert Syst. Appl. 38 (5), 5704–5710 (2011).
Article Google Scholar

Download references

Acknowledgements

We would like to thank all the clinical psychologists who worked at the KUPTEM service throughout the data collection periods.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Neuroscience PhD Program, Koç University Graduate School of Health Sciences, Koç University , Istanbul, Türkiye
Muhammed Ballı
Department of Psychiatry, Koç University School of Medicine, Istanbul, Türkiye
Asli Ercan Dogan
Department of Psychiatry, Koç University Hospital, Istanbul, Türkiye
Sevin Hun Senol
Neuroscience PhD Program, Graduate School of Health Sciences, Koç University School of Medicine, Koç University, Istanbul, Türkiye
Hale Yapici Eser

Authors

Muhammed Ballı
View author publications
Search author on:PubMed Google Scholar
Asli Ercan Dogan
View author publications
Search author on:PubMed Google Scholar
Sevin Hun Senol
View author publications
Search author on:PubMed Google Scholar
Hale Yapici Eser
View author publications
Search author on:PubMed Google Scholar

Contributions

M.B. and H.Y.E.conducted the analysis and wrote the main manuscript body . All authors contributed to the design, conceptualization, and writing of the manuscript. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Hale Yapici Eser.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Ballı, M., Dogan, A.E., Senol, S.H. et al. Machine learning based identification of suicidal ideation using non-suicidal predictors in a university mental health clinic. Sci Rep 15, 13843 (2025). https://doi.org/10.1038/s41598-025-97387-4

Download citation

Received: 25 October 2024
Accepted: 04 April 2025
Published: 22 April 2025
Version of record: 22 April 2025
DOI: https://doi.org/10.1038/s41598-025-97387-4

Subjects

Abstract

Similar content being viewed by others

A machine learning approach for predicting suicidal thoughts and behaviours among college students

Suicide risk prediction for Korean adolescents based on machine learning

Machine learning and the prediction of suicide in psychiatric populations: a systematic review

Introduction

Method

Participants

Suicidal ideation/self-harm definition

Psychometric scales and measures used in the study

DSM-5 self-rated level 1 cross-cutting symptom measure

Patient health questionnaire (PHQ-9)

Other assessment tools

Data preparation and preprocessing

Machine learning algorithms

Machine learning prediction models

Neural network analysis

External validation of the neural network model

Results

Machine learning models results

Feature contribution analysis results

Discussion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links