Suggested Topics within your search.
Suggested Topics within your search.
-
101
An Empirical Evaluation of Large Language Models on Consumer Health Questions
Published 2025-02-01“…<b>Conclusions:</b> Current small or medium sized LLMs struggle to provide accurate answers to consumer health questions and must be significantly improved.…”
Get full text
Article -
102
Assessing the performance of zero-shot visual question answering in multimodal large language models for 12-lead ECG image interpretation
Published 2025-02-01“…These findings suggest a need for improved control over image hallucination and indicate that performance evaluation using the percentage of correct answers to multiple-choice questions may not be sufficient for performance assessment in VQA tasks.…”
Get full text
Article -
103
-
104
The battle of question formats: a comparative study of retrieval practice using very short answer questions and multiple choice questions
Published 2024-12-01“…Results The VSAQs were answered incorrectly more frequently on the practice tests and final test. …”
Get full text
Article -
105
Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
Published 2025-01-01“…This approach could play a crucial role in modernizing TCM research and improving access to clinical insights. Future research may explore expanding the dataset and refining the query system for broader applications.…”
Get full text
Article -
106
Enhancing vaccine communication in social Q&A: identifying readily applicable factors for answer acceptance on medical sciences stack exchange
Published 2025-03-01“…This study investigates factors influencing the acceptance of answers to vaccine-related questions on social Q&A platforms, aiming to improve online vaccine communication. …”
Get full text
Article -
107
-
108
Analyzing Diagnostic Reasoning of Vision–Language Models via Zero-Shot Chain-of-Thought Prompting in Medical Visual Question Answering
Published 2025-07-01“…Medical Visual Question Answering (MedVQA) lies at the intersection of computer vision, natural language processing, and clinical decision-making, aiming to generate accurate responses from medical images paired with complex inquiries. …”
Get full text
Article -
109
From Questions to Answers: Teaching Evidence-Based Medicine Question Formulation and Literature Searching Skills to First-Year Medical Students
Published 2025-02-01“…After the workshop, students completed a posttest. Students showed improvement in differentiating background and foreground questions (p < .001), formulating answerable clinical questions (p < .001), and developing appropriate database searches (p < .001 and p = .002). …”
Get full text
Article -
110
The Use of the Cloze Test in Reading Comprehension Assessment in Brazil: Post-Pandemic Challenges
Published 2025-05-01“…The criteria for analyzing these answers are based on Taylor’s (1953) exact answers initial proposal (Brown 1980; 2013), added to other assessment instruments used in the Psychology field. …”
Get full text
Article -
111
Accuracy, appropriateness, and readability of ChatGPT-4 and ChatGPT-3.5 in answering pediatric emergency medicine post-discharge questions
Published 2025-04-01“…This study compared 2 versions of ChatGPT in answering post-discharge follow-up questions in the area of pediatric emergency medicine (PEM). …”
Get full text
Article -
112
How reliable are ChatGPT and Google’s answers to frequently asked questions about unicondylar knee arthroplasty from a scientific perspective?
Published 2025-06-01“…Results A total of 83.3% of ChatGPT’s responses were found to be consistent with academic sources, whereas this rate was 58.3% for Google. ChatGPT’s answers of 142.8 words, compared to Google’s 85.6-word average. …”
Get full text
Article -
113
The pearls and pitfalls of setting high-quality multiple choice questions for clinical medicine
Published 2023-05-01Get full text
Article -
114
Mother: a maternal online technology for health care dataset
Published 2025-04-01“…The answers to the questions were provided and validated by professional medical personnel. …”
Get full text
Article -
115
Arch-Eval benchmark for assessing chinese architectural domain knowledge in large language models
Published 2025-04-01“…The results reveal significant differences in the performance of these models in the domain of architectural knowledge question-answering. Our findings show that the average accuracy difference between Chain-of-Thought (COT) evaluation and Answer-Only (AO) evaluation is less than 3%, but the response time for COT is significantly longer, extending to 26 times that of AO (62.23 seconds per question vs. 2.38 seconds per question). …”
Get full text
Article -
116
Enhancing Chatbot Responses through Improved T5 Model Incorporating Aggregated Multi-Head Attention Mechanism and Bidirectional Long Short-Term Memory
Published 2025-07-01“…This research proposes an advanced transformer model, the Improved T5 (IT5), designed to address these issues. …”
Get full text
Article -
117
Enhancing responses from large language models with role-playing prompts: a comparative study on answering frequently asked questions about total knee arthroplasty
Published 2025-05-01“…This study aims to evaluate and compare the performance of these LLMs in answering frequently asked questions (FAQs) about Total Knee Arthroplasty (TKA), with a specific focus on the impact of role-playing prompts. …”
Get full text
Article -
118
Comparative performance of ChatGPT, Gemini, and final-year emergency medicine clerkship students in answering multiple-choice questions: implications for the use of AI in medical e...
Published 2025-08-01“…While these tools show promise for answering multiple-choice questions (MCQs), their efficacy in specialized domains, such as Emergency Medicine (EM) clerkship, remains underexplored. …”
Get full text
Article -
119
-
120