The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study

Abstract Background Improving media adherence to World Health Organization (WHO) guidelines is crucial for preventing suicidal behaviors in the general population. However, there is currently no valid, rapid, and effective method to evaluate the adherence to these guidelines. Methods This compar...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zohar Elyospeh, Bénédicte Nobile, Inbar Levkovich, Raphael Chancel, Philippe Courtet, Yossi Levi-Belz
Format:	Article
Language:	English
Published:	Cambridge University Press 2025-01-01
Series:	European Psychiatry
Subjects:	artificial intelligence media natural language processing suicide suicide prevention
Online Access:	https://www.cambridge.org/core/product/identifier/S0924933825100370/type/journal_article
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850166345071591424
author	Zohar Elyospeh Bénédicte Nobile Inbar Levkovich Raphael Chancel Philippe Courtet Yossi Levi-Belz
author_facet	Zohar Elyospeh Bénédicte Nobile Inbar Levkovich Raphael Chancel Philippe Courtet Yossi Levi-Belz
author_sort	Zohar Elyospeh
collection	DOAJ
description	Abstract Background Improving media adherence to World Health Organization (WHO) guidelines is crucial for preventing suicidal behaviors in the general population. However, there is currently no valid, rapid, and effective method to evaluate the adherence to these guidelines. Methods This comparative effectiveness study (January–August 2024) evaluated the ability of two artificial intelligence (AI) models (Claude Opus 3 and GPT-4O) to assess the adherence of media reports to WHO suicide-reporting guidelines. A total of 120 suicide-related articles (40 in English, 40 in Hebrew, and 40 in French) published within the past 5 years were sourced from prominent newspapers. Six trained human raters (two per language) independently evaluated articles based on a WHO guideline-based questionnaire addressing aspects, such as prominence, sensationalism, and prevention. The same articles were also processed using AI models. Intraclass correlation coefficients (ICCs) and Spearman correlations were calculated to assess agreement between human raters and AI models. Results Overall adherence to WHO guidelines was ~50% across all languages. Both AI models demonstrated strong agreement with human raters, with GPT-4O showing the highest agreement (ICC = 0.793 [0.702; 0.855]). The combined evaluations of GPT-4O and Claude Opus 3 yielded the highest reliability (ICC = 0.812 [0.731; 0.869]). Conclusions AI models can replicate human judgment in evaluating media adherence to WHO guidelines. However, they have limitations and should be used alongside human oversight. These findings may suggest that AI tools have the potential to enhance and promote responsible reporting practices among journalists and, thus, may support suicide prevention efforts globally.
format	Article
id	doaj-art-ebef37b46f0d43f1bab44050afeae279
institution	OA Journals
issn	0924-9338 1778-3585
language	English
publishDate	2025-01-01
publisher	Cambridge University Press
record_format	Article
series	European Psychiatry
spelling	doaj-art-ebef37b46f0d43f1bab44050afeae2792025-08-20T02:21:29ZengCambridge University PressEuropean Psychiatry0924-93381778-35852025-01-016810.1192/j.eurpsy.2025.10037The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language studyZohar Elyospeh0Bénédicte Nobile1https://orcid.org/0000-0002-2570-6996Inbar Levkovich2https://orcid.org/0000-0003-1582-3889Raphael Chancel3https://orcid.org/0009-0009-3741-7508Philippe Courtet4https://orcid.org/0000-0002-6519-8586Yossi Levi-Belz5https://ror.org/02f009v59University of Haifa, Mount Carmel, Haifa, IsraelDepartment of Emergency Psychiatry and Acute Care, CHU Montpellier, Montpellier, France IGF, Univ. Montpellier, CNRS, INSERM, Montpellier, FranceFaculty of Education, https://ror.org/009st3569Tel Hai College, Upper Galilee, IsraelDepartment of Emergency Psychiatry and Acute Care, CHU Montpellier, Montpellier, FranceDepartment of Emergency Psychiatry and Acute Care, CHU Montpellier, Montpellier, France IGF, Univ. Montpellier, CNRS, INSERM, Montpellier, FranceLior Tsfaty Center for Suicide and Mental Pain Studies, https://ror.org/0361c8163Ruppin Academic Center, Emek Hefer, Israel Abstract Background Improving media adherence to World Health Organization (WHO) guidelines is crucial for preventing suicidal behaviors in the general population. However, there is currently no valid, rapid, and effective method to evaluate the adherence to these guidelines. Methods This comparative effectiveness study (January–August 2024) evaluated the ability of two artificial intelligence (AI) models (Claude Opus 3 and GPT-4O) to assess the adherence of media reports to WHO suicide-reporting guidelines. A total of 120 suicide-related articles (40 in English, 40 in Hebrew, and 40 in French) published within the past 5 years were sourced from prominent newspapers. Six trained human raters (two per language) independently evaluated articles based on a WHO guideline-based questionnaire addressing aspects, such as prominence, sensationalism, and prevention. The same articles were also processed using AI models. Intraclass correlation coefficients (ICCs) and Spearman correlations were calculated to assess agreement between human raters and AI models. Results Overall adherence to WHO guidelines was ~50% across all languages. Both AI models demonstrated strong agreement with human raters, with GPT-4O showing the highest agreement (ICC = 0.793 [0.702; 0.855]). The combined evaluations of GPT-4O and Claude Opus 3 yielded the highest reliability (ICC = 0.812 [0.731; 0.869]). Conclusions AI models can replicate human judgment in evaluating media adherence to WHO guidelines. However, they have limitations and should be used alongside human oversight. These findings may suggest that AI tools have the potential to enhance and promote responsible reporting practices among journalists and, thus, may support suicide prevention efforts globally. https://www.cambridge.org/core/product/identifier/S0924933825100370/type/journal_articleartificial intelligencemedianatural language processingsuicidesuicide prevention
spellingShingle	Zohar Elyospeh Bénédicte Nobile Inbar Levkovich Raphael Chancel Philippe Courtet Yossi Levi-Belz The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study European Psychiatry artificial intelligence media natural language processing suicide suicide prevention
title	The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study
title_full	The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study
title_fullStr	The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study
title_full_unstemmed	The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study
title_short	The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study
title_sort	role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide a multisite three language study
topic	artificial intelligence media natural language processing suicide suicide prevention
url	https://www.cambridge.org/core/product/identifier/S0924933825100370/type/journal_article
work_keys_str_mv	AT zoharelyospeh theroleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT benedictenobile theroleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT inbarlevkovich theroleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT raphaelchancel theroleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT philippecourtet theroleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT yossilevibelz theroleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT zoharelyospeh roleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT benedictenobile roleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT inbarlevkovich roleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT raphaelchancel roleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT philippecourtet roleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy AT yossilevibelz roleofgenerativeartificialintelligenceinevaluatingadherencetoresponsiblepressmediareportsonsuicideamultisitethreelanguagestudy

The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study

Similar Items