Assessing the performance of zero-shot visual question answering in multimodal large language models for 12-lead ECG image interpretation

Large Language Models (LLM) are increasingly multimodal, and Zero-Shot Visual Question Answering (VQA) shows promise for image interpretation. If zero-shot VQA can be applied to a 12-lead electrocardiogram (ECG), a prevalent diagnostic tool in the medical field, the potential benefits to the field w...

Full description

Saved in:

Bibliographic Details
Main Authors:	Tomohisa Seki, Yoshimasa Kawazoe, Hiromasa Ito, Yu Akagi, Toru Takiguchi, Kazuhiko Ohe
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2025-02-01
Series:	Frontiers in Cardiovascular Medicine
Subjects:	large language model electrocardiography visual question answering hallucination zero-shot learning
Online Access:	https://www.frontiersin.org/articles/10.3389/fcvm.2025.1458289/full
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.frontiersin.org/articles/10.3389/fcvm.2025.1458289/full

Assessing the performance of zero-shot visual question answering in multimodal large language models for 12-lead ECG image interpretation

Internet

Similar Items