Neurological history both twinned and queried by generative artificial intelligence

Background and objectivesWe propose the use of GPT-4 to facilitate initial history-taking in neurology and other medical specialties. A large language model (LLM) could be utilized as a digital twin which could enhance queryable electronic medical record (EMR) systems and provide healthcare conversa...

Full description

Saved in:
Bibliographic Details
Main Authors: Jung-Hyun Lee, Eunhee Choi, Sergio L. Angulo, Robert A. McDougal, William W. Lytton
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Medicine
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmed.2024.1496866/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841525860119609344
author Jung-Hyun Lee
Jung-Hyun Lee
Jung-Hyun Lee
Eunhee Choi
Sergio L. Angulo
Sergio L. Angulo
Robert A. McDougal
Robert A. McDougal
Robert A. McDougal
Robert A. McDougal
William W. Lytton
William W. Lytton
William W. Lytton
author_facet Jung-Hyun Lee
Jung-Hyun Lee
Jung-Hyun Lee
Eunhee Choi
Sergio L. Angulo
Sergio L. Angulo
Robert A. McDougal
Robert A. McDougal
Robert A. McDougal
Robert A. McDougal
William W. Lytton
William W. Lytton
William W. Lytton
author_sort Jung-Hyun Lee
collection DOAJ
description Background and objectivesWe propose the use of GPT-4 to facilitate initial history-taking in neurology and other medical specialties. A large language model (LLM) could be utilized as a digital twin which could enhance queryable electronic medical record (EMR) systems and provide healthcare conversational agents (HCAs) to replace waiting-room questionnaires.MethodsIn this observational pilot study, we presented verbatim history of present illness (HPI) narratives from published case reports of headache, stroke, and neurodegenerative diseases. Three standard GPT-4 models were designated Models P: patient digital twin; N: neurologist to query Model P; and S: supervisor to synthesize the N-P dialogue into a derived HPI and formulate the differential diagnosis. Given the random variability of GPT-4 output, each case was presented five separate times to check consistency and reliability.ResultsThe study achieved an overall HPI content retrieval accuracy of 81%, with accuracies of 84% for headache, 82% for stroke, and 77% for neurodegenerative diseases. Retrieval accuracies for individual HPI components were as follows: 93% for chief complaints, 47% for associated symptoms and review of systems, 76% for relevant symptom details, and 94% for histories of past medical, surgical, allergies, social, and family factors. The ranking of case diagnoses in the differential diagnosis list averaged in the 89th percentile.DiscussionOur tripartite LLM model demonstrated accuracy in extracting essential information from published case reports. Further validation with EMR HPIs, and then with direct patient care will be needed to move toward adaptation of enhanced diagnostic digital twins that incorporate real-time data from health-monitoring devices and self-monitoring assessments.
format Article
id doaj-art-84c73795ba8e410e9b5086c77547da8f
institution Kabale University
issn 2296-858X
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Medicine
spelling doaj-art-84c73795ba8e410e9b5086c77547da8f2025-01-17T06:50:39ZengFrontiers Media S.A.Frontiers in Medicine2296-858X2025-01-011110.3389/fmed.2024.14968661496866Neurological history both twinned and queried by generative artificial intelligenceJung-Hyun Lee0Jung-Hyun Lee1Jung-Hyun Lee2Eunhee Choi3Sergio L. Angulo4Sergio L. Angulo5Robert A. McDougal6Robert A. McDougal7Robert A. McDougal8Robert A. McDougal9William W. Lytton10William W. Lytton11William W. Lytton12Department of Neurology, State University of New York Downstate Health Sciences University, Brooklyn, NY, United StatesDepartment of Neurology, Kings County Hospital, Brooklyn, NY, United StatesDepartment of Neurology, Maimonides Medical Center, Brooklyn, NY, United StatesDepartment of Internal Medicine, Lincoln Medical Center, Bronx, NY, United StatesDepartment of Neurology, State University of New York Downstate Health Sciences University, Brooklyn, NY, United StatesDepartment of Neurology, Kings County Hospital, Brooklyn, NY, United StatesDepartment of Biostatistics, Yale School of Public Health, Yale University, New Haven, CT, United StatesComputational Biology and Bioinformatics, Yale University, New Haven, CT, United StatesWu Tsai Institute, Yale University, New Haven, CT, United StatesBiomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, United StatesDepartment of Neurology, State University of New York Downstate Health Sciences University, Brooklyn, NY, United StatesDepartment of Neurology, Kings County Hospital, Brooklyn, NY, United StatesDepartment of Physiology and Pharmacology, State University of New York Downstate Health Sciences University, Brooklyn, NY, United StatesBackground and objectivesWe propose the use of GPT-4 to facilitate initial history-taking in neurology and other medical specialties. A large language model (LLM) could be utilized as a digital twin which could enhance queryable electronic medical record (EMR) systems and provide healthcare conversational agents (HCAs) to replace waiting-room questionnaires.MethodsIn this observational pilot study, we presented verbatim history of present illness (HPI) narratives from published case reports of headache, stroke, and neurodegenerative diseases. Three standard GPT-4 models were designated Models P: patient digital twin; N: neurologist to query Model P; and S: supervisor to synthesize the N-P dialogue into a derived HPI and formulate the differential diagnosis. Given the random variability of GPT-4 output, each case was presented five separate times to check consistency and reliability.ResultsThe study achieved an overall HPI content retrieval accuracy of 81%, with accuracies of 84% for headache, 82% for stroke, and 77% for neurodegenerative diseases. Retrieval accuracies for individual HPI components were as follows: 93% for chief complaints, 47% for associated symptoms and review of systems, 76% for relevant symptom details, and 94% for histories of past medical, surgical, allergies, social, and family factors. The ranking of case diagnoses in the differential diagnosis list averaged in the 89th percentile.DiscussionOur tripartite LLM model demonstrated accuracy in extracting essential information from published case reports. Further validation with EMR HPIs, and then with direct patient care will be needed to move toward adaptation of enhanced diagnostic digital twins that incorporate real-time data from health-monitoring devices and self-monitoring assessments.https://www.frontiersin.org/articles/10.3389/fmed.2024.1496866/fullneurology–clinicalstrokeheadacheneurodegenerative diseaselarge language model (LLM)history taking
spellingShingle Jung-Hyun Lee
Jung-Hyun Lee
Jung-Hyun Lee
Eunhee Choi
Sergio L. Angulo
Sergio L. Angulo
Robert A. McDougal
Robert A. McDougal
Robert A. McDougal
Robert A. McDougal
William W. Lytton
William W. Lytton
William W. Lytton
Neurological history both twinned and queried by generative artificial intelligence
Frontiers in Medicine
neurology–clinical
stroke
headache
neurodegenerative disease
large language model (LLM)
history taking
title Neurological history both twinned and queried by generative artificial intelligence
title_full Neurological history both twinned and queried by generative artificial intelligence
title_fullStr Neurological history both twinned and queried by generative artificial intelligence
title_full_unstemmed Neurological history both twinned and queried by generative artificial intelligence
title_short Neurological history both twinned and queried by generative artificial intelligence
title_sort neurological history both twinned and queried by generative artificial intelligence
topic neurology–clinical
stroke
headache
neurodegenerative disease
large language model (LLM)
history taking
url https://www.frontiersin.org/articles/10.3389/fmed.2024.1496866/full
work_keys_str_mv AT junghyunlee neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT junghyunlee neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT junghyunlee neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT eunheechoi neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT sergiolangulo neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT sergiolangulo neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT robertamcdougal neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT robertamcdougal neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT robertamcdougal neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT robertamcdougal neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT williamwlytton neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT williamwlytton neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence
AT williamwlytton neurologicalhistorybothtwinnedandqueriedbygenerativeartificialintelligence