Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project

AbstractThe increasing demand for population-wide genomic screening and the limited availability of genetic counseling resources have created a pressing need for innovative service delivery models. Chatbots powered by large language models (LLMs) have shown potential in genomic services,...

Full description

Saved in:

Bibliographic Details
Main Authors:	Emma Coen, Guilherme Del Fiol, Kimberly A Kaphingst, Emerson Borsato, Jackilen Shannon, Hadley Smith, Aaron Masino, Caitlin G Allen
Format:	Article
Language:	English
Published:	JMIR Publications 2025-06-01
Series:	JMIR Cancer
Online Access:	https://cancer.jmir.org/2025/1/e65848
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849334271685165056
author	Emma Coen Guilherme Del Fiol Kimberly A Kaphingst Emerson Borsato Jackilen Shannon Hadley Smith Aaron Masino Caitlin G Allen
author_facet	Emma Coen Guilherme Del Fiol Kimberly A Kaphingst Emerson Borsato Jackilen Shannon Hadley Smith Aaron Masino Caitlin G Allen
author_sort	Emma Coen
collection	DOAJ
description	AbstractThe increasing demand for population-wide genomic screening and the limited availability of genetic counseling resources have created a pressing need for innovative service delivery models. Chatbots powered by large language models (LLMs) have shown potential in genomic services, particularly in pretest counseling, but their application in returning positive population-wide genomic screening results remains underexplored. Leveraging advanced LLMs like GPT-4 offers an opportunity to address this gap by delivering accurate, contextual, and user-centered communication to individuals receiving positive genetic test results. This project aimed to design, implement, and evaluate a chatbot integrated with GPT-4, tailored to support the return of positive genomic screening results in the context of South Carolina’s In Our DNA SC program. This initiative offers free genetic screening to 100,000 individuals, with over 33,000 results returned and numerous positive findings for conditions such as Lynch syndrome, hereditary breast and ovarian cancer syndrome, and familial hypercholesterolemia. A 3-step prompt engineering process using retrieval-augmented generation and few-shot techniques was used to create the chatbot. Training materials included patient frequently asked questions, genetic counseling scripts, and patient-derived queries. The chatbot underwent iterative refinement based on 13 training questions, while performance was evaluated through expert ratings on responses to 2 hypothetical patient scenarios. The 2 scenarios were intended to represent common but distinct patient profiles in terms of gender, race, ethnicity, age, and background knowledge. Domain experts rated the chatbot using a 5-point Likert scale across 8 predefined criteria: tone, clarity, program accuracy, domain accuracy, robustness, efficiency, boundaries, and usability. The chatbot achieved an average score of 3.86 (SD 0.89) across all evaluation metrics. The highest-rated criteria were tone (mean 4.25, SD 0.71) and usability (mean 4.25, SD 0.58), reflecting the chatbot’s ability to communicate effectively and provide a seamless user experience. Boundary management (mean 4.0, SD 0.76) and efficiency (mean 3.88, SD 1.08) also scored well, while clarity and robustness received ratings of 3.81 (SD 1.05) and 3.81 (SD 0.66), respectively. Domain accuracy was rated 3.63 (SD 0.96), indicating satisfactory performance in delivering genetic information, whereas program accuracy received the lowest score of 3.25 (SD 1.39), highlighting the need for improvements in delivering program-specific details. This project demonstrates the feasibility of using LLM-powered chatbots to support the return of positive genomic screening results. The chatbot effectively handled open-ended patient queries, maintained conversational boundaries, and delivered user-friendly responses. However, enhancements in program-specific accuracy are essential to maximize its utility. Future research will explore hybrid chatbot designs that combine the strengths of LLMs with rule-based components to improve scalability, accuracy, and accessibility in genomic service delivery. The findings underscore the potential of generative artificial intelligence tools to address resource limitations and improve the accessibility of genomic health care services.
format	Article
id	doaj-art-c6b42ba0e7ef44c8b86e721b1d8f2066
institution	Kabale University
issn	2369-1999
language	English
publishDate	2025-06-01
publisher	JMIR Publications
record_format	Article
series	JMIR Cancer
spelling	doaj-art-c6b42ba0e7ef44c8b86e721b1d8f20662025-08-20T03:45:36ZengJMIR PublicationsJMIR Cancer2369-19992025-06-0111e65848e6584810.2196/65848Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering ProjectEmma Coenhttp://orcid.org/0009-0009-5724-1298Guilherme Del Fiolhttp://orcid.org/0000-0001-9954-6799Kimberly A Kaphingsthttp://orcid.org/0000-0003-2668-9080Emerson Borsatohttp://orcid.org/0000-0002-2786-6395Jackilen Shannonhttp://orcid.org/0000-0001-5377-9511Hadley Smithhttp://orcid.org/0000-0003-1247-6535Aaron Masinohttp://orcid.org/0000-0002-2684-0548Caitlin G Allenhttp://orcid.org/0000-0002-6288-3529 AbstractThe increasing demand for population-wide genomic screening and the limited availability of genetic counseling resources have created a pressing need for innovative service delivery models. Chatbots powered by large language models (LLMs) have shown potential in genomic services, particularly in pretest counseling, but their application in returning positive population-wide genomic screening results remains underexplored. Leveraging advanced LLMs like GPT-4 offers an opportunity to address this gap by delivering accurate, contextual, and user-centered communication to individuals receiving positive genetic test results. This project aimed to design, implement, and evaluate a chatbot integrated with GPT-4, tailored to support the return of positive genomic screening results in the context of South Carolina’s In Our DNA SC program. This initiative offers free genetic screening to 100,000 individuals, with over 33,000 results returned and numerous positive findings for conditions such as Lynch syndrome, hereditary breast and ovarian cancer syndrome, and familial hypercholesterolemia. A 3-step prompt engineering process using retrieval-augmented generation and few-shot techniques was used to create the chatbot. Training materials included patient frequently asked questions, genetic counseling scripts, and patient-derived queries. The chatbot underwent iterative refinement based on 13 training questions, while performance was evaluated through expert ratings on responses to 2 hypothetical patient scenarios. The 2 scenarios were intended to represent common but distinct patient profiles in terms of gender, race, ethnicity, age, and background knowledge. Domain experts rated the chatbot using a 5-point Likert scale across 8 predefined criteria: tone, clarity, program accuracy, domain accuracy, robustness, efficiency, boundaries, and usability. The chatbot achieved an average score of 3.86 (SD 0.89) across all evaluation metrics. The highest-rated criteria were tone (mean 4.25, SD 0.71) and usability (mean 4.25, SD 0.58), reflecting the chatbot’s ability to communicate effectively and provide a seamless user experience. Boundary management (mean 4.0, SD 0.76) and efficiency (mean 3.88, SD 1.08) also scored well, while clarity and robustness received ratings of 3.81 (SD 1.05) and 3.81 (SD 0.66), respectively. Domain accuracy was rated 3.63 (SD 0.96), indicating satisfactory performance in delivering genetic information, whereas program accuracy received the lowest score of 3.25 (SD 1.39), highlighting the need for improvements in delivering program-specific details. This project demonstrates the feasibility of using LLM-powered chatbots to support the return of positive genomic screening results. The chatbot effectively handled open-ended patient queries, maintained conversational boundaries, and delivered user-friendly responses. However, enhancements in program-specific accuracy are essential to maximize its utility. Future research will explore hybrid chatbot designs that combine the strengths of LLMs with rule-based components to improve scalability, accuracy, and accessibility in genomic service delivery. The findings underscore the potential of generative artificial intelligence tools to address resource limitations and improve the accessibility of genomic health care services.https://cancer.jmir.org/2025/1/e65848
spellingShingle	Emma Coen Guilherme Del Fiol Kimberly A Kaphingst Emerson Borsato Jackilen Shannon Hadley Smith Aaron Masino Caitlin G Allen Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project JMIR Cancer
title	Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_full	Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_fullStr	Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_full_unstemmed	Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_short	Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_sort	chatbot for the return of positive genetic screening results for hereditary cancer syndromes prompt engineering project
url	https://cancer.jmir.org/2025/1/e65848
work_keys_str_mv	AT emmacoen chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject AT guilhermedelfiol chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject AT kimberlyakaphingst chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject AT emersonborsato chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject AT jackilenshannon chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject AT hadleysmith chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject AT aaronmasino chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject AT caitlingallen chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject

Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project

Similar Items