Thangka image captioning model with Salient Attention and Local Interaction Aggregator

Abstract Thangka image captioning aims to automatically generate accurate and complete sentences that describe the main content of Thangka images. However, existing methods fall short in capturing the features of the core deity regions and the surrounding background details of Thangka images, and th...

Full description

Saved in:

Bibliographic Details
Main Authors:	Wenjin Hu, Fujun Zhang, Yinqiu Zhao
Format:	Article
Language:	English
Published:	SpringerOpen 2024-11-01
Series:	Heritage Science
Subjects:	Image captioning Thangka Dual-Branch Salient Attention Local Interaction Aggregator
Online Access:	https://doi.org/10.1186/s40494-024-01518-5
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://doi.org/10.1186/s40494-024-01518-5

Thangka image captioning model with Salient Attention and Local Interaction Aggregator

Internet

Similar Items