Thangka image captioning model with Salient Attention and Local Interaction Aggregator
Abstract Thangka image captioning aims to automatically generate accurate and complete sentences that describe the main content of Thangka images. However, existing methods fall short in capturing the features of the core deity regions and the surrounding background details of Thangka images, and th...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
SpringerOpen
2024-11-01
|
| Series: | Heritage Science |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s40494-024-01518-5 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|