SSMM-DS: A semantic segmentation model for mangroves based on Deeplabv3+ with swin transformer

Mangrove wetlands play a crucial role in maintaining species diversity. However, they face threats from habitat degradation, deforestation, pollution, and climate change. Detecting changes in mangrove wetlands is essential for understanding their ecological implications, but it remains a challenging...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhenhua Wang, Jinlong Yang, Chuansheng Dong, Xi Zhang, Congqin Yi, Jiuhu Sun
Format: Article
Language:English
Published: AIMS Press 2024-10-01
Series:Electronic Research Archive
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/era.2024260
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590743633395712
author Zhenhua Wang
Jinlong Yang
Chuansheng Dong
Xi Zhang
Congqin Yi
Jiuhu Sun
author_facet Zhenhua Wang
Jinlong Yang
Chuansheng Dong
Xi Zhang
Congqin Yi
Jiuhu Sun
author_sort Zhenhua Wang
collection DOAJ
description Mangrove wetlands play a crucial role in maintaining species diversity. However, they face threats from habitat degradation, deforestation, pollution, and climate change. Detecting changes in mangrove wetlands is essential for understanding their ecological implications, but it remains a challenging task. In this study, we propose a semantic segmentation model for mangroves based on Deeplabv3+ with Swin Transformer, abbreviated as SSMM-DS. Using Deeplabv3+ as the basic framework, we first constructed a data concatenation module to improve the contrast between mangroves and other vegetation or water. We then employed Swin Transformer as the backbone network, enhancing the capability of global information learning and detail feature extraction. Finally, we optimized the loss function by combining cross-entropy loss and dice loss, addressing the issue of sampling imbalance caused by the small areas of mangroves. Using GF-1 and GF-6 images, taking mean precision (mPrecision), mean intersection over union (mIoU), floating-point operations (FLOPs), and the number of parameters (Params) as evaluation metrics, we evaluate SSMM-DS against state-of-the-art models, including FCN, PSPNet, OCRNet, uPerNet, and SegFormer. The results demonstrate SSMM-DS's superiority in terms of mIoU, mPrecision, and parameter efficiency. SSMM-DS achieves a higher mIoU (95.11%) and mPrecision (97.79%) while using fewer parameters (17.48M) compared to others. Although its FLOPs are slightly higher than SegFormer's (15.11G vs. 9.9G), SSMM-DS offers a balance between performance and efficiency. Experimental results highlight SSMM-DS's effectiveness in extracting mangrove features, making it a valuable tool for monitoring and managing these critical ecosystems.
format Article
id doaj-art-56c0bd0978ac4ffe9526ac51a737a6f3
institution Kabale University
issn 2688-1594
language English
publishDate 2024-10-01
publisher AIMS Press
record_format Article
series Electronic Research Archive
spelling doaj-art-56c0bd0978ac4ffe9526ac51a737a6f32025-01-23T07:52:52ZengAIMS PressElectronic Research Archive2688-15942024-10-0132105615563210.3934/era.2024260SSMM-DS: A semantic segmentation model for mangroves based on Deeplabv3+ with swin transformerZhenhua Wang0Jinlong Yang1Chuansheng Dong2Xi Zhang3Congqin Yi4Jiuhu Sun5College of Information Technology, Shanghai Ocean University, Shanghai 201306, ChinaCollege of Information Technology, Shanghai Ocean University, Shanghai 201306, ChinaShandong Land Surveying and Mapping Institute, Jinan 250000, ChinaShandong provincial institute of land space data and remote sensing technology, Jinan 250000, ChinaCollege of Information Technology, Shanghai Ocean University, Shanghai 201306, ChinaShandong Land Surveying and Mapping Institute, Jinan 250000, ChinaMangrove wetlands play a crucial role in maintaining species diversity. However, they face threats from habitat degradation, deforestation, pollution, and climate change. Detecting changes in mangrove wetlands is essential for understanding their ecological implications, but it remains a challenging task. In this study, we propose a semantic segmentation model for mangroves based on Deeplabv3+ with Swin Transformer, abbreviated as SSMM-DS. Using Deeplabv3+ as the basic framework, we first constructed a data concatenation module to improve the contrast between mangroves and other vegetation or water. We then employed Swin Transformer as the backbone network, enhancing the capability of global information learning and detail feature extraction. Finally, we optimized the loss function by combining cross-entropy loss and dice loss, addressing the issue of sampling imbalance caused by the small areas of mangroves. Using GF-1 and GF-6 images, taking mean precision (mPrecision), mean intersection over union (mIoU), floating-point operations (FLOPs), and the number of parameters (Params) as evaluation metrics, we evaluate SSMM-DS against state-of-the-art models, including FCN, PSPNet, OCRNet, uPerNet, and SegFormer. The results demonstrate SSMM-DS's superiority in terms of mIoU, mPrecision, and parameter efficiency. SSMM-DS achieves a higher mIoU (95.11%) and mPrecision (97.79%) while using fewer parameters (17.48M) compared to others. Although its FLOPs are slightly higher than SegFormer's (15.11G vs. 9.9G), SSMM-DS offers a balance between performance and efficiency. Experimental results highlight SSMM-DS's effectiveness in extracting mangrove features, making it a valuable tool for monitoring and managing these critical ecosystems.https://www.aimspress.com/article/doi/10.3934/era.2024260remote sensing imagesemantic segmentationmangroveswin transformer
spellingShingle Zhenhua Wang
Jinlong Yang
Chuansheng Dong
Xi Zhang
Congqin Yi
Jiuhu Sun
SSMM-DS: A semantic segmentation model for mangroves based on Deeplabv3+ with swin transformer
Electronic Research Archive
remote sensing image
semantic segmentation
mangrove
swin transformer
title SSMM-DS: A semantic segmentation model for mangroves based on Deeplabv3+ with swin transformer
title_full SSMM-DS: A semantic segmentation model for mangroves based on Deeplabv3+ with swin transformer
title_fullStr SSMM-DS: A semantic segmentation model for mangroves based on Deeplabv3+ with swin transformer
title_full_unstemmed SSMM-DS: A semantic segmentation model for mangroves based on Deeplabv3+ with swin transformer
title_short SSMM-DS: A semantic segmentation model for mangroves based on Deeplabv3+ with swin transformer
title_sort ssmm ds a semantic segmentation model for mangroves based on deeplabv3 with swin transformer
topic remote sensing image
semantic segmentation
mangrove
swin transformer
url https://www.aimspress.com/article/doi/10.3934/era.2024260
work_keys_str_mv AT zhenhuawang ssmmdsasemanticsegmentationmodelformangrovesbasedondeeplabv3withswintransformer
AT jinlongyang ssmmdsasemanticsegmentationmodelformangrovesbasedondeeplabv3withswintransformer
AT chuanshengdong ssmmdsasemanticsegmentationmodelformangrovesbasedondeeplabv3withswintransformer
AT xizhang ssmmdsasemanticsegmentationmodelformangrovesbasedondeeplabv3withswintransformer
AT congqinyi ssmmdsasemanticsegmentationmodelformangrovesbasedondeeplabv3withswintransformer
AT jiuhusun ssmmdsasemanticsegmentationmodelformangrovesbasedondeeplabv3withswintransformer