An Empirical Evaluation of Large Language Models on Consumer Health Questions
**Background:** Large Language Models (LLMs) have demonstrated strong performance on clinical question-answering (QA) benchmarks, yet their effectiveness in addressing real-world consumer medical queries remains underexplored. This study evaluates the capabilities and limitations of...
| Main Authors: | Moaiz Abrar, Yusuf Sermet, Ibrahim Demir |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | MDPI AG, 2025-02-01 |
| Series: | BioMedInformatics |
| Online Access: | https://www.mdpi.com/2673-7426/5/1/12 |
Similar Items
- Cross-Encoder-Based Semantic Evaluation of Extractive and Generative Question Answering in Low-Resourced African Languages
  by: Funebi Francis Ijebu, et al.
  Published: (2025-03-01)
- Evaluating large language models as graders of medical short answer questions: a comparative analysis with expert human graders
  by: Olena Bolgova, et al.
  Published: (2025-12-01)
- Intelligent accounting question-answering robot based on a large language model and knowledge graph
  by: Shi Shengyun, et al.
  Published: (2025-04-01)
- Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion
  by: Junkai Zhang, et al.
  Published: (2025-04-01)
- Assessing the quality of automatic-generated short answers using GPT-4
  by: Luiz Rodrigues, et al.
  Published: (2024-12-01)