Combining Region-Guided Attention and Attribute Prediction for Thangka Image Captioning Method

To enhance the understanding of the core regions in Thangka images and improve the richness of generated content during decoding, we propose a Thangka image captioning method based on Region-Guided Feature Enhancement and Attribute Prediction (RGFEAP). The image feature enhancement encoder, guided b...

Bibliographic Details
Main Authors:	Fujun Zhang, Wendong Kang, Wenjin Hu
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Image captioning; Thangka images; region-guided; attribute prediction
Online Access:	https://ieeexplore.ieee.org/document/10833628/

Similar Items

Thangka image captioning model with Salient Attention and Local Interaction Aggregator
by: Wenjin Hu, et al.
Published: (2024-11-01)

Enhanced Object Detection in Thangka Images Using Gabor, Wavelet, and Color Feature Fusion
by: Yukai Xian, et al.
Published: (2025-06-01)

Tiny TR-CAP: A novel small-scale benchmark dataset for general-purpose image captioning tasks
by: Abbas Memiş, et al.
Published: (2025-04-01)

A novel image captioning model with visual-semantic similarities and visual representations re-weighting
by: Alaa Thobhani, et al.
Published: (2024-09-01)

Few shot object detection for headdresses and seats in Thangka Yidam based on ResNet and deformable convolution
by: Hu Wenjin, et al.
Published: (2022-12-01)

Affective Image Captioning for Visual Artworks Using Emotion-Based Cross-Attention Mechanisms
by: Shintaro Ishikawa, et al.
Published: (2023-01-01)

Remote Sensing Image Change Captioning Using Multi-Attentive Network with Diffusion Model
by: Yue Yang, et al.
Published: (2024-11-01)

Semantic-Guided Selective Representation for Image Captioning
by: Yinan Li, et al.
Published: (2023-01-01)

Integrating visual memory for image captioning
by: Jiahui Wei, et al.
Published: (2025-05-01)

Dual-Stream Spatially Aware Transformer for Remote Sensing Image Captioning
by: Haifeng Sima, et al.
Published: (2025-01-01)

Enhanced group relation learning via aligned attention masking for fashion product captioning
by: Yuhao Tang, et al.
Published: (2025-08-01)

Image Captioning Based on Semantic Scenes
by: Fengzhi Zhao, et al.
Published: (2024-10-01)

Attribute-Based Learning for Remote Sensing Image Captioning in Unseen Scenes
by: Zhang Guo, et al.
Published: (2025-03-01)

Visual Content Captioning and Audio Conversion using CNN-RNN with Attention Model
by: Aldy Agil Hermanto, et al.
Published: (2025-06-01)

A Study on Generating Maritime Image Captions Based on Transformer Dual Information Flow
by: Zhenqiang Zhao, et al.
Published: (2025-06-01)

News Image Captioning via Separate Attention on Entity Categories
by: Sonali Ajankar, et al.
Published: (2025-01-01)

Contrastive learning based remote sensing text-to-image generation for few-shot remote sensing image captioning
by: Haonan Zhou, et al.
Published: (2025-08-01)

Improving Visual Question Answering by Image Captioning
by: Xiangjun Shao, et al.
Published: (2025-01-01)

Feature refinement and rethinking attention for remote sensing image captioning
by: Yunpeng Li, et al.
Published: (2025-03-01)

MFEAM: Multi-View Feature Enhanced Attention Model for Image Captioning
by: Yang Cui, et al.
Published: (2025-07-01)

Detailed Image Captioning and Hashtag Generation
by: Nikshep Shetty, et al.
Published: (2024-11-01)

Improved IEC performance via emotional stimuli-aware captioning
by: Zibo Zhou, et al.
Published: (2025-07-01)

Preliminary Study on Image Captioning for Construction Hazards
by: Wen-Ta Hsiao, et al.
Published: (2024-08-01)

Chinese Image Captioning Based on Deep Fusion Feature and Multi-Layer Feature Filtering Block
by: Xi Yang, et al.
Published: (2025-01-01)

Enhanced CLIP-GPT Framework for Cross-Lingual Remote Sensing Image Captioning
by: Rui Song, et al.
Published: (2025-01-01)

NuCap: A Numerically Aware Captioning Framework for Improved Numerical Reasoning
by: Yuna Jeong, et al.
Published: (2025-05-01)

A Patch-Level Region-Aware Module with a Multi-Label Framework for Remote Sensing Image Captioning
by: Yunpeng Li, et al.
Published: (2024-10-01)

Auto-Scenario Generator for Autonomous Vehicle Safety: Multi-Modal Attention-Based Image Captioning Model Using Digital Twin Data
by: Hojun Lee, et al.
Published: (2024-01-01)

Image captioning using bidirectional LSTM neural network
by: Farnaz Hoseini, et al.
Published: (2025-05-01)

The CLIP - GPT Image Captioning Model Integrated with Global Semantics
by: TAO Rui, et al.
Published: (2024-04-01)

3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model
by: Chengxi Li, et al.
Published: (2021-04-01)

Integrating Abstract Meaning Representation to Enhance Transformer-Based Image Captioning
by: Nguyen Van Thinh, et al.
Published: (2025-01-01)

PBC-Transformer: Interpreting Poultry Behavior Classification Using Image Caption Generation Techniques
by: Jun Li, et al.
Published: (2025-05-01)

Offline visual aid system for the blind based on image captioning
by: Yue CHEN, et al.
Published: (2022-01-01)

Privacy-Preserving Image Captioning with Partial Encryption and Deep Learning
by: Antoinette Deborah Martin, et al.
Published: (2025-02-01)

Cap2Seg: leveraging caption generation for enhanced segmentation of COVID-19 medical images
by: Wanlong Zhao, et al.
Published: (2024-10-01)

Frequency–Spatial–Temporal Domain Fusion Network for Remote Sensing Image Change Captioning
by: Shiwei Zou, et al.
Published: (2025-04-01)

Training strategies for semi-supervised remote sensing image captioning
by: Qimin Cheng, et al.
Published: (2025-07-01)

Human Scene Understanding Mechanism-Based Image Captioning for Blind Assistance
by: Jong-Hoon Kim, et al.
Published: (2025-01-01)

Automated Medical Image Captioning Using the BLIP Model: Enhancing Diagnostic Support with AI-Driven Language Generation
by: Enas Abbas Abed, et al.
Published: (2025-06-01)