Assessing the Accuracy and Reliability of Large Language Models in Psychiatry Using Standardized Multiple-Choice Questions: Cross-Sectional Study

BackgroundLarge language models (LLMs), such as OpenAI’s GPT-3.5, GPT-4, and GPT-4o, have garnered early and significant enthusiasm for their potential applications within mental health, ranging from documentation support to chat-bot therapy. Understanding the accuracy and re...

Full description

Saved in:

Bibliographic Details
Main Authors:	Kaitlin Hanss, Karthik V Sarma, Anne L Glowinski, Andrew Krystal, Ramotse Saunders, Andrew Halls, Sasha Gorrell, Erin Reilly
Format:	Article
Language:	English
Published:	JMIR Publications 2025-05-01
Series:	Journal of Medical Internet Research
Online Access:	https://www.jmir.org/2025/1/e69910
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.jmir.org/2025/1/e69910

Assessing the Accuracy and Reliability of Large Language Models in Psychiatry Using Standardized Multiple-Choice Questions: Cross-Sectional Study

Internet

Similar Items