Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder

Abstract Background Anhedonia and suicidal ideation are symptoms of major depressive disorder (MDD) that are not regularly captured in structured scales but may be captured in unstructured clinical notes. Natural language processing (NLP) techniques may be used to extract longitudinal data on suicid...

Full description

Saved in:
Bibliographic Details
Main Authors: L. Alexander Vance, Leslie Way, Deepali Kulkarni, Emily O. C. Palmer, Abhijit Ghosh, Melissa Unruh, Kelly M. Y. Chan, Amey Girdhari, Joydeep Sarkar
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-025-02851-w
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832594719502237696
author L. Alexander Vance
Leslie Way
Deepali Kulkarni
Emily O. C. Palmer
Abhijit Ghosh
Melissa Unruh
Kelly M. Y. Chan
Amey Girdhari
Joydeep Sarkar
author_facet L. Alexander Vance
Leslie Way
Deepali Kulkarni
Emily O. C. Palmer
Abhijit Ghosh
Melissa Unruh
Kelly M. Y. Chan
Amey Girdhari
Joydeep Sarkar
author_sort L. Alexander Vance
collection DOAJ
description Abstract Background Anhedonia and suicidal ideation are symptoms of major depressive disorder (MDD) that are not regularly captured in structured scales but may be captured in unstructured clinical notes. Natural language processing (NLP) techniques may be used to extract longitudinal data on suicidal behaviors and anhedonia within unstructured clinical notes. This study assessed the accuracy of using NLP techniques on electronic health records (EHRs) to identify these symptoms among patients with MDD. Methods EHR-derived, de-identified data were used from the NeuroBlu Database (version 23R1), a longitudinal behavioral health real-world database. Mental health clinicians annotated instances of anhedonia and suicidal symptoms in clinical notes creating a ground truth. Interrater reliability (IRR) was calculated using Krippendorff’s alpha. A novel transformer architecture-based NLP model was trained on clinical notes to recognize linguistic patterns and contextual cues. Each sentence was categorized into one of four labels: (1) anhedonia; (2) suicidal ideation without intent or plan; (3) suicidal ideation with intent or plan; (4) absence of suicidal ideation or anhedonia. The model was assessed using positive predictive values (PPV), negative predictive values, sensitivity, specificity, F1-score, and AUROC. Results The model was trained, tested, and validated on 2,198, 1,247, and 1,016 distinct clinical notes, respectively. IRR was 0.80. For anhedonia, suicidal ideation with intent or plan, and suicidal ideation without intent or plan the model achieved a PPV of 0.98, 0.93, and 0.87, an F1-score of 0.98, 0.91, and 0.89 during training and a PPV of 0.99, 0.95, and 0.87 and F1-score of 0.99, 0.95, and 0.89 during validation. Conclusions NLP techniques can leverage contextual information in EHRs to identify anhedonia and suicidal symptoms in patients with MDD. Integrating structured and unstructured data offers a comprehensive view of MDD’s trajectory, helping healthcare providers deliver timely, effective interventions. Addressing current limitations will further enhance NLP models, enabling more accurate extraction of critical clinical features and supporting personalized, proactive mental health care.
format Article
id doaj-art-c7e9e0ddc57b45a1b0198660dee52ef3
institution Kabale University
issn 1472-6947
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series BMC Medical Informatics and Decision Making
spelling doaj-art-c7e9e0ddc57b45a1b0198660dee52ef32025-01-19T12:26:06ZengBMCBMC Medical Informatics and Decision Making1472-69472025-01-012511910.1186/s12911-025-02851-wNatural language processing to identify suicidal ideation and anhedonia in major depressive disorderL. Alexander Vance0Leslie Way1Deepali Kulkarni2Emily O. C. Palmer3Abhijit Ghosh4Melissa Unruh5Kelly M. Y. Chan6Amey Girdhari7Joydeep Sarkar8Holmusk Technologies, IncHolmusk Technologies, IncKKT Technologies, Pte. LtdHolmusk Europe, LtdKKT Technologies, Pte. LtdHolmusk Technologies, IncKKT Technologies, Pte. LtdKKT Technologies, Pte. LtdKKT Technologies, Pte. LtdAbstract Background Anhedonia and suicidal ideation are symptoms of major depressive disorder (MDD) that are not regularly captured in structured scales but may be captured in unstructured clinical notes. Natural language processing (NLP) techniques may be used to extract longitudinal data on suicidal behaviors and anhedonia within unstructured clinical notes. This study assessed the accuracy of using NLP techniques on electronic health records (EHRs) to identify these symptoms among patients with MDD. Methods EHR-derived, de-identified data were used from the NeuroBlu Database (version 23R1), a longitudinal behavioral health real-world database. Mental health clinicians annotated instances of anhedonia and suicidal symptoms in clinical notes creating a ground truth. Interrater reliability (IRR) was calculated using Krippendorff’s alpha. A novel transformer architecture-based NLP model was trained on clinical notes to recognize linguistic patterns and contextual cues. Each sentence was categorized into one of four labels: (1) anhedonia; (2) suicidal ideation without intent or plan; (3) suicidal ideation with intent or plan; (4) absence of suicidal ideation or anhedonia. The model was assessed using positive predictive values (PPV), negative predictive values, sensitivity, specificity, F1-score, and AUROC. Results The model was trained, tested, and validated on 2,198, 1,247, and 1,016 distinct clinical notes, respectively. IRR was 0.80. For anhedonia, suicidal ideation with intent or plan, and suicidal ideation without intent or plan the model achieved a PPV of 0.98, 0.93, and 0.87, an F1-score of 0.98, 0.91, and 0.89 during training and a PPV of 0.99, 0.95, and 0.87 and F1-score of 0.99, 0.95, and 0.89 during validation. Conclusions NLP techniques can leverage contextual information in EHRs to identify anhedonia and suicidal symptoms in patients with MDD. Integrating structured and unstructured data offers a comprehensive view of MDD’s trajectory, helping healthcare providers deliver timely, effective interventions. Addressing current limitations will further enhance NLP models, enabling more accurate extraction of critical clinical features and supporting personalized, proactive mental health care.https://doi.org/10.1186/s12911-025-02851-wNatural language processingElectronic health recordsSuicidal ideationAnhedoniaMajor depressive disorder
spellingShingle L. Alexander Vance
Leslie Way
Deepali Kulkarni
Emily O. C. Palmer
Abhijit Ghosh
Melissa Unruh
Kelly M. Y. Chan
Amey Girdhari
Joydeep Sarkar
Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder
BMC Medical Informatics and Decision Making
Natural language processing
Electronic health records
Suicidal ideation
Anhedonia
Major depressive disorder
title Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder
title_full Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder
title_fullStr Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder
title_full_unstemmed Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder
title_short Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder
title_sort natural language processing to identify suicidal ideation and anhedonia in major depressive disorder
topic Natural language processing
Electronic health records
Suicidal ideation
Anhedonia
Major depressive disorder
url https://doi.org/10.1186/s12911-025-02851-w
work_keys_str_mv AT lalexandervance naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder
AT leslieway naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder
AT deepalikulkarni naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder
AT emilyocpalmer naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder
AT abhijitghosh naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder
AT melissaunruh naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder
AT kellymychan naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder
AT ameygirdhari naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder
AT joydeepsarkar naturallanguageprocessingtoidentifysuicidalideationandanhedoniainmajordepressivedisorder