Assessing the performance of zero-shot visual question answering in multimodal large language models for 12-lead ECG image interpretation

Large Language Models (LLM) are increasingly multimodal, and Zero-Shot Visual Question Answering (VQA) shows promise for image interpretation. If zero-shot VQA can be applied to a 12-lead electrocardiogram (ECG), a prevalent diagnostic tool in the medical field, the potential benefits to the field w...

Full description

Saved in:
Bibliographic Details
Main Authors: Tomohisa Seki, Yoshimasa Kawazoe, Hiromasa Ito, Yu Akagi, Toru Takiguchi, Kazuhiko Ohe
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-02-01
Series:Frontiers in Cardiovascular Medicine
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fcvm.2025.1458289/full
Tags: Add Tag
No Tags, Be the first to tag this record!