A vision attention driven Language framework for medical report generation

Abstract This study introduces the Medical Vision Attention Generation (MedVAG) model, a novel framework designed to facilitate the automated generation of medical reports. MedVAG integrates Vision Transformer (ViT)-based visual feature extraction and GPT-2 language modeling, enhanced by graph-based...

Full description

Saved in:
Bibliographic Details
Main Authors: Merve Varol Arısoy, Ayhan Arısoy, İlhan Uysal
Format: Article
Language:English
Published: Nature Portfolio 2025-03-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-95666-8
Tags: Add Tag
No Tags, Be the first to tag this record!