SVDDD: SAR Vehicle Target Detection Dataset Augmentation Based on Diffusion Model

In the field of target detection using synthetic aperture radar (SAR) images, deep learning-based supervised learning methods have demonstrated outstanding performance. However, the effectiveness of deep learning methods is largely influenced by the quantity and diversity of samples in the dataset....

Full description

Saved in:
Bibliographic Details
Main Authors: Keao Wang, Zongxu Pan, Zixiao Wen
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/17/2/286
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832587559204552704
author Keao Wang
Zongxu Pan
Zixiao Wen
author_facet Keao Wang
Zongxu Pan
Zixiao Wen
author_sort Keao Wang
collection DOAJ
description In the field of target detection using synthetic aperture radar (SAR) images, deep learning-based supervised learning methods have demonstrated outstanding performance. However, the effectiveness of deep learning methods is largely influenced by the quantity and diversity of samples in the dataset. Unfortunately, due to various constraints, the availability of labeled image data for training SAR vehicle detection networks is quite limited. This scarcity of data has become one of the main obstacles hindering the further development of SAR vehicle detection. In response to this issue, this paper collects SAR images of the Ka, Ku, and X bands to construct a labeled dataset for training Stable Diffusion and then propose a framework for data augmentation for SAR vehicle detection based on the Diffusion model, which consists of a fine-tuned Stable Diffusion model, a ControlNet, and a series of methods for processing and filtering images based on image clarity, histogram, and an influence function to enhance the diversity of the original dataset, thereby improving the performance of deep learning detection models. In the experiment, the samples we generated and screened achieved an average improvement of 2.32%, with a maximum of 6.6% in <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>m</mi><mi>A</mi><msub><mi>P</mi><mn>75</mn></msub></mrow></semantics></math></inline-formula> on five different strong baseline detectors.
format Article
id doaj-art-381bb547d0c445c082734deb6fd52c78
institution Kabale University
issn 2072-4292
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj-art-381bb547d0c445c082734deb6fd52c782025-01-24T13:48:00ZengMDPI AGRemote Sensing2072-42922025-01-0117228610.3390/rs17020286SVDDD: SAR Vehicle Target Detection Dataset Augmentation Based on Diffusion ModelKeao Wang0Zongxu Pan1Zixiao Wen2Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaIn the field of target detection using synthetic aperture radar (SAR) images, deep learning-based supervised learning methods have demonstrated outstanding performance. However, the effectiveness of deep learning methods is largely influenced by the quantity and diversity of samples in the dataset. Unfortunately, due to various constraints, the availability of labeled image data for training SAR vehicle detection networks is quite limited. This scarcity of data has become one of the main obstacles hindering the further development of SAR vehicle detection. In response to this issue, this paper collects SAR images of the Ka, Ku, and X bands to construct a labeled dataset for training Stable Diffusion and then propose a framework for data augmentation for SAR vehicle detection based on the Diffusion model, which consists of a fine-tuned Stable Diffusion model, a ControlNet, and a series of methods for processing and filtering images based on image clarity, histogram, and an influence function to enhance the diversity of the original dataset, thereby improving the performance of deep learning detection models. In the experiment, the samples we generated and screened achieved an average improvement of 2.32%, with a maximum of 6.6% in <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>m</mi><mi>A</mi><msub><mi>P</mi><mn>75</mn></msub></mrow></semantics></math></inline-formula> on five different strong baseline detectors.https://www.mdpi.com/2072-4292/17/2/286SARvehicletarget detectiondata augmentationdiffusion modelControlNet
spellingShingle Keao Wang
Zongxu Pan
Zixiao Wen
SVDDD: SAR Vehicle Target Detection Dataset Augmentation Based on Diffusion Model
Remote Sensing
SAR
vehicle
target detection
data augmentation
diffusion model
ControlNet
title SVDDD: SAR Vehicle Target Detection Dataset Augmentation Based on Diffusion Model
title_full SVDDD: SAR Vehicle Target Detection Dataset Augmentation Based on Diffusion Model
title_fullStr SVDDD: SAR Vehicle Target Detection Dataset Augmentation Based on Diffusion Model
title_full_unstemmed SVDDD: SAR Vehicle Target Detection Dataset Augmentation Based on Diffusion Model
title_short SVDDD: SAR Vehicle Target Detection Dataset Augmentation Based on Diffusion Model
title_sort svddd sar vehicle target detection dataset augmentation based on diffusion model
topic SAR
vehicle
target detection
data augmentation
diffusion model
ControlNet
url https://www.mdpi.com/2072-4292/17/2/286
work_keys_str_mv AT keaowang svdddsarvehicletargetdetectiondatasetaugmentationbasedondiffusionmodel
AT zongxupan svdddsarvehicletargetdetectiondatasetaugmentationbasedondiffusionmodel
AT zixiaowen svdddsarvehicletargetdetectiondatasetaugmentationbasedondiffusionmodel