Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project

AbstractThe increasing demand for population-wide genomic screening and the limited availability of genetic counseling resources have created a pressing need for innovative service delivery models. Chatbots powered by large language models (LLMs) have shown potential in genomic services,...

Full description

Saved in:
Bibliographic Details
Main Authors: Emma Coen, Guilherme Del Fiol, Kimberly A Kaphingst, Emerson Borsato, Jackilen Shannon, Hadley Smith, Aaron Masino, Caitlin G Allen
Format: Article
Language:English
Published: JMIR Publications 2025-06-01
Series:JMIR Cancer
Online Access:https://cancer.jmir.org/2025/1/e65848
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849334271685165056
author Emma Coen
Guilherme Del Fiol
Kimberly A Kaphingst
Emerson Borsato
Jackilen Shannon
Hadley Smith
Aaron Masino
Caitlin G Allen
author_facet Emma Coen
Guilherme Del Fiol
Kimberly A Kaphingst
Emerson Borsato
Jackilen Shannon
Hadley Smith
Aaron Masino
Caitlin G Allen
author_sort Emma Coen
collection DOAJ
description AbstractThe increasing demand for population-wide genomic screening and the limited availability of genetic counseling resources have created a pressing need for innovative service delivery models. Chatbots powered by large language models (LLMs) have shown potential in genomic services, particularly in pretest counseling, but their application in returning positive population-wide genomic screening results remains underexplored. Leveraging advanced LLMs like GPT-4 offers an opportunity to address this gap by delivering accurate, contextual, and user-centered communication to individuals receiving positive genetic test results. This project aimed to design, implement, and evaluate a chatbot integrated with GPT-4, tailored to support the return of positive genomic screening results in the context of South Carolina’s In Our DNA SC program. This initiative offers free genetic screening to 100,000 individuals, with over 33,000 results returned and numerous positive findings for conditions such as Lynch syndrome, hereditary breast and ovarian cancer syndrome, and familial hypercholesterolemia. A 3-step prompt engineering process using retrieval-augmented generation and few-shot techniques was used to create the chatbot. Training materials included patient frequently asked questions, genetic counseling scripts, and patient-derived queries. The chatbot underwent iterative refinement based on 13 training questions, while performance was evaluated through expert ratings on responses to 2 hypothetical patient scenarios. The 2 scenarios were intended to represent common but distinct patient profiles in terms of gender, race, ethnicity, age, and background knowledge. Domain experts rated the chatbot using a 5-point Likert scale across 8 predefined criteria: tone, clarity, program accuracy, domain accuracy, robustness, efficiency, boundaries, and usability. The chatbot achieved an average score of 3.86 (SD 0.89) across all evaluation metrics. The highest-rated criteria were tone (mean 4.25, SD 0.71) and usability (mean 4.25, SD 0.58), reflecting the chatbot’s ability to communicate effectively and provide a seamless user experience. Boundary management (mean 4.0, SD 0.76) and efficiency (mean 3.88, SD 1.08) also scored well, while clarity and robustness received ratings of 3.81 (SD 1.05) and 3.81 (SD 0.66), respectively. Domain accuracy was rated 3.63 (SD 0.96), indicating satisfactory performance in delivering genetic information, whereas program accuracy received the lowest score of 3.25 (SD 1.39), highlighting the need for improvements in delivering program-specific details. This project demonstrates the feasibility of using LLM-powered chatbots to support the return of positive genomic screening results. The chatbot effectively handled open-ended patient queries, maintained conversational boundaries, and delivered user-friendly responses. However, enhancements in program-specific accuracy are essential to maximize its utility. Future research will explore hybrid chatbot designs that combine the strengths of LLMs with rule-based components to improve scalability, accuracy, and accessibility in genomic service delivery. The findings underscore the potential of generative artificial intelligence tools to address resource limitations and improve the accessibility of genomic health care services.
format Article
id doaj-art-c6b42ba0e7ef44c8b86e721b1d8f2066
institution Kabale University
issn 2369-1999
language English
publishDate 2025-06-01
publisher JMIR Publications
record_format Article
series JMIR Cancer
spelling doaj-art-c6b42ba0e7ef44c8b86e721b1d8f20662025-08-20T03:45:36ZengJMIR PublicationsJMIR Cancer2369-19992025-06-0111e65848e6584810.2196/65848Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering ProjectEmma Coenhttp://orcid.org/0009-0009-5724-1298Guilherme Del Fiolhttp://orcid.org/0000-0001-9954-6799Kimberly A Kaphingsthttp://orcid.org/0000-0003-2668-9080Emerson Borsatohttp://orcid.org/0000-0002-2786-6395Jackilen Shannonhttp://orcid.org/0000-0001-5377-9511Hadley Smithhttp://orcid.org/0000-0003-1247-6535Aaron Masinohttp://orcid.org/0000-0002-2684-0548Caitlin G Allenhttp://orcid.org/0000-0002-6288-3529 AbstractThe increasing demand for population-wide genomic screening and the limited availability of genetic counseling resources have created a pressing need for innovative service delivery models. Chatbots powered by large language models (LLMs) have shown potential in genomic services, particularly in pretest counseling, but their application in returning positive population-wide genomic screening results remains underexplored. Leveraging advanced LLMs like GPT-4 offers an opportunity to address this gap by delivering accurate, contextual, and user-centered communication to individuals receiving positive genetic test results. This project aimed to design, implement, and evaluate a chatbot integrated with GPT-4, tailored to support the return of positive genomic screening results in the context of South Carolina’s In Our DNA SC program. This initiative offers free genetic screening to 100,000 individuals, with over 33,000 results returned and numerous positive findings for conditions such as Lynch syndrome, hereditary breast and ovarian cancer syndrome, and familial hypercholesterolemia. A 3-step prompt engineering process using retrieval-augmented generation and few-shot techniques was used to create the chatbot. Training materials included patient frequently asked questions, genetic counseling scripts, and patient-derived queries. The chatbot underwent iterative refinement based on 13 training questions, while performance was evaluated through expert ratings on responses to 2 hypothetical patient scenarios. The 2 scenarios were intended to represent common but distinct patient profiles in terms of gender, race, ethnicity, age, and background knowledge. Domain experts rated the chatbot using a 5-point Likert scale across 8 predefined criteria: tone, clarity, program accuracy, domain accuracy, robustness, efficiency, boundaries, and usability. The chatbot achieved an average score of 3.86 (SD 0.89) across all evaluation metrics. The highest-rated criteria were tone (mean 4.25, SD 0.71) and usability (mean 4.25, SD 0.58), reflecting the chatbot’s ability to communicate effectively and provide a seamless user experience. Boundary management (mean 4.0, SD 0.76) and efficiency (mean 3.88, SD 1.08) also scored well, while clarity and robustness received ratings of 3.81 (SD 1.05) and 3.81 (SD 0.66), respectively. Domain accuracy was rated 3.63 (SD 0.96), indicating satisfactory performance in delivering genetic information, whereas program accuracy received the lowest score of 3.25 (SD 1.39), highlighting the need for improvements in delivering program-specific details. This project demonstrates the feasibility of using LLM-powered chatbots to support the return of positive genomic screening results. The chatbot effectively handled open-ended patient queries, maintained conversational boundaries, and delivered user-friendly responses. However, enhancements in program-specific accuracy are essential to maximize its utility. Future research will explore hybrid chatbot designs that combine the strengths of LLMs with rule-based components to improve scalability, accuracy, and accessibility in genomic service delivery. The findings underscore the potential of generative artificial intelligence tools to address resource limitations and improve the accessibility of genomic health care services.https://cancer.jmir.org/2025/1/e65848
spellingShingle Emma Coen
Guilherme Del Fiol
Kimberly A Kaphingst
Emerson Borsato
Jackilen Shannon
Hadley Smith
Aaron Masino
Caitlin G Allen
Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
JMIR Cancer
title Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_full Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_fullStr Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_full_unstemmed Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_short Chatbot for the Return of Positive Genetic Screening Results for Hereditary Cancer Syndromes: Prompt Engineering Project
title_sort chatbot for the return of positive genetic screening results for hereditary cancer syndromes prompt engineering project
url https://cancer.jmir.org/2025/1/e65848
work_keys_str_mv AT emmacoen chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject
AT guilhermedelfiol chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject
AT kimberlyakaphingst chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject
AT emersonborsato chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject
AT jackilenshannon chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject
AT hadleysmith chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject
AT aaronmasino chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject
AT caitlingallen chatbotforthereturnofpositivegeneticscreeningresultsforhereditarycancersyndromespromptengineeringproject