Assessing the performance of zero-shot visual question answering in multimodal large language models for 12-lead ECG image interpretation
Large Language Models (LLM) are increasingly multimodal, and Zero-Shot Visual Question Answering (VQA) shows promise for image interpretation. If zero-shot VQA can be applied to a 12-lead electrocardiogram (ECG), a prevalent diagnostic tool in the medical field, the potential benefits to the field w...
Saved in:
Main Authors: | Tomohisa Seki, Yoshimasa Kawazoe, Hiromasa Ito, Yu Akagi, Toru Takiguchi, Kazuhiko Ohe |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-02-01
|
Series: | Frontiers in Cardiovascular Medicine |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fcvm.2025.1458289/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering
by: Zhongjian Hu, et al.
Published: (2024-09-01) -
Enhancing students’ participation through question and answer on SMAN 2 Sungai Kakap Kubu Raya
by: Clarry Sada, et al.
Published: (2024-02-01) -
QAR (QUESTION ANSWER RELATIONSHIP) AS AN ALTERNATIVE STRATEGY TO TEACH READING
by: Sa’dulloh Muzammil
Published: (2017-01-01) -
cLegal-QA: a Chinese legal question answering with natural language generation methods
by: Yizhen Wang, et al.
Published: (2024-12-01) -
Multimodal Zero-Shot Shelf Deformation Detection Based on MEMS Sensors and Images
by: Hong Yan, et al.
Published: (2025-01-01)