Assessing the performance of zero-shot visual question answering in multimodal large language models for 12-lead ECG image interpretation
Large Language Models (LLM) are increasingly multimodal, and Zero-Shot Visual Question Answering (VQA) shows promise for image interpretation. If zero-shot VQA can be applied to a 12-lead electrocardiogram (ECG), a prevalent diagnostic tool in the medical field, the potential benefits to the field w...
Saved in:
Main Authors: | Tomohisa Seki, Yoshimasa Kawazoe, Hiromasa Ito, Yu Akagi, Toru Takiguchi, Kazuhiko Ohe |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-02-01
|
Series: | Frontiers in Cardiovascular Medicine |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fcvm.2025.1458289/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Positional embeddings and zero-shot learning using BERT for molecular-property prediction
by: Medard Edmund Mswahili, et al.
Published: (2025-02-01) -
Comprehensive Study on Zero-Shot Text Classification Using Category Mapping
by: Kai Zhang, et al.
Published: (2025-01-01) -
Pocket Reference for ECGs Made Easy /
by: Aehlert, Barbara
Published: (2013) -
Enhancing zero-shot stance detection via multi-task fine-tuning with debate data and knowledge augmentation
by: Qinlong Fan, et al.
Published: (2025-01-01) -
One size fits all: Enhanced zero-shot text classification for patient listening on social media
by: Veton Matoshi, et al.
Published: (2025-02-01)