CD4C: Change Detection for Remote Sensing Image Change Captioning
Remote sensing image change captioning is an important image interpretation technique that automatically generates captions describing the visual changes in multitemporal remote sensing images. However, the visual changes present in multitemporal images can be classified as foreground changes, which...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10938120/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850182987477417984 |
|---|---|
| author | Xiliang Li Bin Sun Zhenhua Wu Shutao Li Hu Guo |
| author_facet | Xiliang Li Bin Sun Zhenhua Wu Shutao Li Hu Guo |
| author_sort | Xiliang Li |
| collection | DOAJ |
| description | Remote sensing image change captioning is an important image interpretation technique that automatically generates captions describing the visual changes in multitemporal remote sensing images. However, the visual changes present in multitemporal images can be classified as foreground changes, which are captured in captions, and background changes, which interfere with traditional methods and complicate the effective capture of foreground changes. This ultimately limits the overall performance of the model. To address this issue, this study introduces change detection for remote sensing image change captioning (CD4C). Specifically, a change detection module generates binary masks that contain relevant visual change information from multitemporal images. Subsequently, based on whether changes are detected, samples are classified and processed through the C-Stream and N-Stream of the multitemporal difference feature fusion (MDF) module to extract visual change features. The C-Stream leverages the visual change information provided by the mask to enhance the ability of CD4C to capture foreground visual change features at both the image and feature levels. The N-Stream incorporates a pseudofeature generation module designed to mitigate the interference caused by poor change detection results. Finally, the caption generation module interprets the visual change features extracted by the MDF to produce accurate textual descriptions. Experiments on the LEVIR-CC and Dubai-CC datasets demonstrate that the proposed method outperforms other approaches. |
| format | Article |
| id | doaj-art-41e4a45618c84f6db98e92127a8a5f57 |
| institution | OA Journals |
| issn | 1939-1404 2151-1535 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | IEEE |
| record_format | Article |
| series | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| spelling | doaj-art-41e4a45618c84f6db98e92127a8a5f572025-08-20T02:17:28ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-01189181919410.1109/JSTARS.2025.355438510938120CD4C: Change Detection for Remote Sensing Image Change CaptioningXiliang Li0https://orcid.org/0000-0003-0201-3975Bin Sun1https://orcid.org/0000-0002-7029-8784Zhenhua Wu2https://orcid.org/0009-0001-9692-2418Shutao Li3https://orcid.org/0000-0002-0585-9848Hu Guo4https://orcid.org/0009-0006-1416-0998College of Electrical and Information Engineering, Hunan University, Changsha, ChinaCollege of Electrical and Information Engineering, Hunan University, Changsha, ChinaCollege of Electrical and Information Engineering, Hunan University, Changsha, ChinaCollege of Electrical and Information Engineering, Hunan University, Changsha, ChinaCollege of Computer Science and Electronic Engineering, Hunan University, Changsha, ChinaRemote sensing image change captioning is an important image interpretation technique that automatically generates captions describing the visual changes in multitemporal remote sensing images. However, the visual changes present in multitemporal images can be classified as foreground changes, which are captured in captions, and background changes, which interfere with traditional methods and complicate the effective capture of foreground changes. This ultimately limits the overall performance of the model. To address this issue, this study introduces change detection for remote sensing image change captioning (CD4C). Specifically, a change detection module generates binary masks that contain relevant visual change information from multitemporal images. Subsequently, based on whether changes are detected, samples are classified and processed through the C-Stream and N-Stream of the multitemporal difference feature fusion (MDF) module to extract visual change features. The C-Stream leverages the visual change information provided by the mask to enhance the ability of CD4C to capture foreground visual change features at both the image and feature levels. The N-Stream incorporates a pseudofeature generation module designed to mitigate the interference caused by poor change detection results. Finally, the caption generation module interprets the visual change features extracted by the MDF to produce accurate textual descriptions. Experiments on the LEVIR-CC and Dubai-CC datasets demonstrate that the proposed method outperforms other approaches.https://ieeexplore.ieee.org/document/10938120/Change captionchange detectionmultitemporal difference feature fusion (MDF)remote sensing |
| spellingShingle | Xiliang Li Bin Sun Zhenhua Wu Shutao Li Hu Guo CD4C: Change Detection for Remote Sensing Image Change Captioning IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Change caption change detection multitemporal difference feature fusion (MDF) remote sensing |
| title | CD4C: Change Detection for Remote Sensing Image Change Captioning |
| title_full | CD4C: Change Detection for Remote Sensing Image Change Captioning |
| title_fullStr | CD4C: Change Detection for Remote Sensing Image Change Captioning |
| title_full_unstemmed | CD4C: Change Detection for Remote Sensing Image Change Captioning |
| title_short | CD4C: Change Detection for Remote Sensing Image Change Captioning |
| title_sort | cd4c change detection for remote sensing image change captioning |
| topic | Change caption change detection multitemporal difference feature fusion (MDF) remote sensing |
| url | https://ieeexplore.ieee.org/document/10938120/ |
| work_keys_str_mv | AT xiliangli cd4cchangedetectionforremotesensingimagechangecaptioning AT binsun cd4cchangedetectionforremotesensingimagechangecaptioning AT zhenhuawu cd4cchangedetectionforremotesensingimagechangecaptioning AT shutaoli cd4cchangedetectionforremotesensingimagechangecaptioning AT huguo cd4cchangedetectionforremotesensingimagechangecaptioning |