Combining Region-Guided Attention and Attribute Prediction for Thangka Image Captioning Method
To enhance the understanding of the core regions in Thangka images and improve the richness of generated content during decoding, we propose a Thangka image captioning method based on Region-Guided Feature Enhancement and Attribute Prediction (RGFEAP). The image feature enhancement encoder, guided b...
Saved in:
| Main Authors: | Fujun Zhang, Wendong Kang, Wenjin Hu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10833628/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Thangka image captioning model with Salient Attention and Local Interaction Aggregator
by: Wenjin Hu, et al.
Published: (2024-11-01) -
Enhanced Object Detection in Thangka Images Using Gabor, Wavelet, and Color Feature Fusion
by: Yukai Xian, et al.
Published: (2025-06-01) -
Tiny TR-CAP: A novel small-scale benchmark dataset for general-purpose image captioning tasks
by: Abbas Memiş, et al.
Published: (2025-04-01) -
A novel image captioning model with visual-semantic similarities and visual representations re-weighting
by: Alaa Thobhani, et al.
Published: (2024-09-01) -
Few shot object detection for headdresses and seats in Thangka Yidam based on ResNet and deformable convolution
by: Hu Wenjin, et al.
Published: (2022-12-01)