Combining Region-Guided Attention and Attribute Prediction for Thangka Image Captioning Method
To enhance the understanding of the core regions in Thangka images and improve the richness of generated content during decoding, we propose a Thangka image captioning method based on Region-Guided Feature Enhancement and Attribute Prediction (RGFEAP). The image feature enhancement encoder, guided b...
Main Authors: | Fujun Zhang, Wendong Kang, Wenjin Hu |
---|---|
Format: | Article |
Language: | English |
Published: | IEEE, 2025-01-01 |
Series: | IEEE Access |
Online Access: | https://ieeexplore.ieee.org/document/10833628/ |
Similar Items
- CLIP-Based Grid Features and Masking for Remote Sensing Image Captioning
  by: Qiaoling Lin, et al.
  Published: (2025-01-01)
- KE-RSIC: Remote Sensing Image Captioning Based on Knowledge Embedding
  by: Kangda Cheng, et al.
  Published: (2025-01-01)
- Detailed Image Captioning and Hashtag Generation
  by: Nikshep Shetty, et al.
  Published: (2024-11-01)
- MIRA-CAP: Memory-Integrated Retrieval-Augmented Captioning for State-of-the-Art Image and Video Captioning
  by: Sabina Umirzakova, et al.
  Published: (2024-12-01)
- Preliminary Study on Image Captioning for Construction Hazards
  by: Wen-Ta Hsiao, et al.
  Published: (2024-08-01)