Dual-Stream Spatially Aware Transformer for Remote Sensing Image Captioning

Remote sensing image captioning (RSIC) aims to generate semantically rich and syntactically accurate descriptions for remote sensing images (RSIs). However, due to the complex spatial layouts, occlusions, and overlapping objects in such images, caption generation is often challenged by semantic ambi...

Full description

Saved in:
Bibliographic Details
Main Authors: Haifeng Sima, Xiangtao Ding, JianLong Wang, Mingliang Xu
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11104798/
Tags: Add Tag
No Tags, Be the first to tag this record!

Similar Items