A vision attention driven Language framework for medical report generation

Abstract: This study introduces the Medical Vision Attention Generation (MedVAG) model, a novel framework designed to facilitate the automated generation of medical reports. MedVAG integrates Vision Transformer (ViT)-based visual feature extraction and GPT-2 language modeling, enhanced by graph-based...

Bibliographic Details
Main Authors:	Merve Varol Arısoy, Ayhan Arısoy, İlhan Uysal
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-03-01
Series:	Scientific Reports
Subjects:	Medical report generation; Retrieval augmentation; Multi-view; Graph-based feature fusion; Vision transformer; Memory-Guided attention; Radiology reports
Online Access:	https://doi.org/10.1038/s41598-025-95666-8

Similar Items