CD4C: Change Detection for Remote Sensing Image Change Captioning

Bibliographic Details
Main Authors: Xiliang Li, Bin Sun, Zhenhua Wu, Shutao Li, Hu Guo
Format: Article
Language: English
Published: IEEE 2025-01-01
Series: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects: Change caption; change detection; multitemporal difference feature fusion (MDF); remote sensing
Online Access: https://ieeexplore.ieee.org/document/10938120/
collection DOAJ
description Remote sensing image change captioning is an important image interpretation technique that automatically generates captions describing the visual changes in multitemporal remote sensing images. However, the visual changes present in multitemporal images can be classified as foreground changes, which are captured in captions, and background changes, which interfere with traditional methods and complicate the effective capture of foreground changes. This ultimately limits the overall performance of the model. To address this issue, this study introduces change detection for remote sensing image change captioning (CD4C). Specifically, a change detection module generates binary masks that contain relevant visual change information from multitemporal images. Subsequently, based on whether changes are detected, samples are classified and processed through the C-Stream and N-Stream of the multitemporal difference feature fusion (MDF) module to extract visual change features. The C-Stream leverages the visual change information provided by the mask to enhance the ability of CD4C to capture foreground visual change features at both the image and feature levels. The N-Stream incorporates a pseudofeature generation module designed to mitigate the interference caused by poor change detection results. Finally, the caption generation module interprets the visual change features extracted by the MDF to produce accurate textual descriptions. Experiments on the LEVIR-CC and Dubai-CC datasets demonstrate that the proposed method outperforms other approaches.
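The routing step described in the abstract (samples are sent to the C-Stream or the N-Stream depending on whether the change detection mask reports a change) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `min_changed_pixels` threshold, and the elementwise masking used for the image-level step are all assumptions made for illustration, and the record does not describe how the real MDF module combines masks with features.

```python
from typing import List, Sequence

Mask = Sequence[Sequence[int]]      # binary change mask: 1 = changed pixel
Image = Sequence[Sequence[float]]   # single-band image, same shape as the mask


def has_change(mask: Mask, min_changed_pixels: int = 1) -> bool:
    """Return True when the mask flags at least `min_changed_pixels` pixels.

    The threshold is a hypothetical parameter, not taken from the paper.
    """
    changed = sum(1 for row in mask for value in row if value)
    return changed >= min_changed_pixels


def route_sample(mask: Mask, min_changed_pixels: int = 1) -> str:
    """Pick a stream for one sample: 'C' if a change was detected, else 'N'."""
    return "C" if has_change(mask, min_changed_pixels) else "N"


def apply_mask(image: Image, mask: Mask) -> List[List[float]]:
    """Assumed image-level masking: keep changed pixels and zero out the rest."""
    return [
        [pixel * (1 if flag else 0) for pixel, flag in zip(img_row, mask_row)]
        for img_row, mask_row in zip(image, mask)
    ]
```

For example, `route_sample([[0, 0], [0, 1]])` selects the C-Stream, while an all-zero mask selects the N-Stream, where the abstract says a pseudofeature generation module takes over.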
id doaj-art-41e4a45618c84f6db98e92127a8a5f57
institution OA Journals
issn 1939-1404
2151-1535
volume 18
pages 9181-9194
doi 10.1109/JSTARS.2025.3554385
ieee_document_number 10938120
record_timestamp 2025-08-20T02:17:28Z
author_affiliation Xiliang Li (https://orcid.org/0000-0003-0201-3975), College of Electrical and Information Engineering, Hunan University, Changsha, China
Bin Sun (https://orcid.org/0000-0002-7029-8784), College of Electrical and Information Engineering, Hunan University, Changsha, China
Zhenhua Wu (https://orcid.org/0009-0001-9692-2418), College of Electrical and Information Engineering, Hunan University, Changsha, China
Shutao Li (https://orcid.org/0000-0002-0585-9848), College of Electrical and Information Engineering, Hunan University, Changsha, China
Hu Guo (https://orcid.org/0009-0006-1416-0998), College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
topic Change caption
change detection
multitemporal difference feature fusion (MDF)
remote sensing