Feature refinement and rethinking attention for remote sensing image captioning
Abstract Effectively recognizing different regions of interest with attention mechanisms plays an important role in remote sensing image captioning task. However, these attention-driven models implicitly hypothesize that the focused region information is correct, which is too restrictive. Furthermor...
Saved in:
| Main Authors: | Yunpeng Li, Chengjin Tao, Meng Liu, Xiangrong Zhang, Guanchun Wang, Tianyang Zhang, Dong Zhao, Dabao Wang |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-03-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-93125-y |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
A Patch-Level Region-Aware Module with a Multi-Label Framework for Remote Sensing Image Captioning
by: Yunpeng Li, et al.
Published: (2024-10-01) -
Remote Sensing Image Change Captioning Using Multi-Attentive Network with Diffusion Model
by: Yue Yang, et al.
Published: (2024-11-01) -
Semantic–Spatial Feature Fusion With Dynamic Graph Refinement for Remote Sensing Image Captioning
by: Maofu Liu, et al.
Published: (2025-01-01) -
Dual-Stream Spatially Aware Transformer for Remote Sensing Image Captioning
by: Haifeng Sima, et al.
Published: (2025-01-01) -
Integrating visual memory for image captioning
by: Jiahui Wei, et al.
Published: (2025-05-01)