Evaluating ChatGPT-4's performance on oral and maxillofacial queries: Chain of Thought and standard method

Objectives: Oral and maxillofacial diseases affect approximately 3.5 billion people worldwide. With the continuous advancement of Artificial Intelligence technologies, particularly the application of generative pre-trained transformers like ChatGPT-4, there is potential to enhance public awareness of...

Bibliographic Details
Main Authors:	Kaiyuan Ji, Zhihan Wu, Jing Han, Guangtao Zhai, Jiannan Liu
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2025-02-01
Series:	Frontiers in Oral Health
Subjects:	Artificial Intelligence Chain of Thought education tool ChatGPT-4 oral and maxillofacial
Online Access:	https://www.frontiersin.org/articles/10.3389/froh.2025.1541976/full

_version_	1823856589734412288
author	Kaiyuan Ji; Zhihan Wu; Jing Han; Guangtao Zhai; Jiannan Liu
author_facet	Kaiyuan Ji; Zhihan Wu; Jing Han; Guangtao Zhai; Jiannan Liu
author_sort	Kaiyuan Ji
collection	DOAJ
description	Objectives: Oral and maxillofacial diseases affect approximately 3.5 billion people worldwide. With the continuous advancement of Artificial Intelligence technologies, particularly the application of generative pre-trained transformers like ChatGPT-4, there is potential to enhance public awareness of the prevention and early detection of these diseases. This study evaluated the performance of ChatGPT-4 in addressing oral and maxillofacial disease questions using standard approaches and the Chain of Thought (CoT) method, aiming to gain a deeper understanding of its capabilities, potential, and limitations. Materials and methods: Three experts, drawing from their extensive experience and the most common questions in clinical settings, selected 130 open-ended questions and 1,805 multiple-choice questions from the national dental licensing examination. These questions encompass 12 areas of oral and maxillofacial surgery, including Prosthodontics, Pediatric Dentistry, Maxillofacial Tumors and Salivary Gland Diseases, and Maxillofacial Infections. Results: Using the CoT approach, ChatGPT-4 exhibited marked enhancements in accuracy, structure, completeness, professionalism, and overall impression for open-ended questions, revealing statistically significant differences compared to its performance on general oral and maxillofacial inquiries. For multiple-choice questions, applying the CoT method boosted ChatGPT-4's accuracy across all major subjects, achieving an overall accuracy increase of 3.1%. Conclusions: When employing ChatGPT-4 to address questions in oral and maxillofacial surgery, incorporating CoT as a querying method can enhance its performance and help the public improve their understanding and awareness of such issues. However, it is not advisable to consider it a substitute for doctors.
format	Article
id	doaj-art-6a05acc2a1754104ac309ff342c68da7
institution	Kabale University
issn	2673-4842
language	English
publishDate	2025-02-01
publisher	Frontiers Media S.A.
record_format	Article
series	Frontiers in Oral Health
spelling	doaj-art-6a05acc2a1754104ac309ff342c68da7 | 2025-02-12T07:25:31Z | eng | Frontiers Media S.A. | Frontiers in Oral Health | 2673-4842 | 2025-02-01 | 6 | 10.3389/froh.2025.1541976 | 1541976 | Evaluating ChatGPT-4's performance on oral and maxillofacial queries: Chain of Thought and standard method | Kaiyuan Ji (School of Communication and Electronic Engineering, East China Normal University, Shanghai, China) | Zhihan Wu (Department of Oral and Maxillofacial Head and Neck Oncology, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China) | Jing Han (Department of Oral and Maxillofacial Head and Neck Oncology, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China) | Guangtao Zhai (School of Communication and Electronic Engineering, East China Normal University, Shanghai, China; School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai, China) | Jiannan Liu (Department of Oral and Maxillofacial Head and Neck Oncology, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China) | https://www.frontiersin.org/articles/10.3389/froh.2025.1541976/full | Artificial Intelligence; Chain of Thought; education tool; ChatGPT-4; oral and maxillofacial
title	Evaluating ChatGPT-4's performance on oral and maxillofacial queries: Chain of Thought and standard method
topic	Artificial Intelligence Chain of Thought education tool ChatGPT-4 oral and maxillofacial
url	https://www.frontiersin.org/articles/10.3389/froh.2025.1541976/full

Similar Items