Dual-Stream Spatially Aware Transformer for Remote Sensing Image Captioning
Remote sensing image captioning (RSIC) aims to generate semantically rich and syntactically accurate descriptions for remote sensing images (RSIs). However, due to the complex spatial layouts, occlusions, and overlapping objects in such images, caption generation is often challenged by semantic ambi...
Saved in:
| Main Authors: | Haifeng Sima, Xiangtao Ding, JianLong Wang, Mingliang Xu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11104798/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Contrastive learning based remote sensing text-to-image generation for few-shot remote sensing image captioning
by: Haonan Zhou, et al.
Published: (2025-08-01) -
Remote Sensing Image Change Captioning Using Multi-Attentive Network with Diffusion Model
by: Yue Yang, et al.
Published: (2024-11-01) -
Tiny TR-CAP: A novel small-scale benchmark dataset for general-purpose image captioning tasks
by: Abbas Memiş, et al.
Published: (2025-04-01) -
Enhanced CLIP-GPT Framework for Cross-Lingual Remote Sensing Image Captioning
by: Rui Song, et al.
Published: (2025-01-01) -
Frequency–Spatial–Temporal Domain Fusion Network for Remote Sensing Image Change Captioning
by: Shiwei Zou, et al.
Published: (2025-04-01)