Thangka image captioning model with Salient Attention and Local Interaction Aggregator

Abstract: Thangka image captioning aims to automatically generate accurate and complete sentences that describe the main content of Thangka images. However, existing methods fall short in capturing the features of the core deity regions and the surrounding background details of Thangka images, and th...

Bibliographic Details
Main Authors: Wenjin Hu, Fujun Zhang, Yinqiu Zhao
Format: Article
Language: English
Published: SpringerOpen 2024-11-01
Series: Heritage Science
Online Access: https://doi.org/10.1186/s40494-024-01518-5