ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNet

Abstract Background Deep learning has significantly advanced medical image analysis, particularly in semantic segmentation, which is essential for clinical decisions. However, existing 3D segmentation models, like the traditional 3D UNet, face challenges in balancing computational efficiency and acc...

Full description

Saved in:
Bibliographic Details
Main Authors: Minyoung Park, Seungtaek Oh, Junyoung Park, Taikyeong Jeong, Sungwook Yu
Format: Article
Language:English
Published: BMC 2025-08-01
Series:BMC Medical Imaging
Subjects:
Online Access:https://doi.org/10.1186/s12880-025-01857-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849332066675589120
author Minyoung Park
Seungtaek Oh
Junyoung Park
Taikyeong Jeong
Sungwook Yu
author_facet Minyoung Park
Seungtaek Oh
Junyoung Park
Taikyeong Jeong
Sungwook Yu
author_sort Minyoung Park
collection DOAJ
description Abstract Background Deep learning has significantly advanced medical image analysis, particularly in semantic segmentation, which is essential for clinical decisions. However, existing 3D segmentation models, like the traditional 3D UNet, face challenges in balancing computational efficiency and accuracy when processing volumetric medical data. This study aims to develop an improved architecture for 3D medical image segmentation with enhanced learning strategies to improve accuracy and address challenges related to limited training data. Methods We propose ES-UNet, a 3D segmentation architecture that achieves superior segmentation performance while offering competitive efficiency across multiple computational metrics, including memory usage, inference time, and parameter count. The model builds upon the full-scale skip connection design of UNet3+ by integrating channel attention modules into each encoder-to-decoder path and incorporating full-scale deep supervision to enhance multi-resolution feature learning. We further introduce Region Specific Scaling (RSS), a data augmentation method that adaptively applies geometric transformations to annotated regions, and a Dynamically Weighted Dice (DWD) loss to improve the balance between precision and recall. The model was evaluated on the MICCAI HECKTOR dataset, and additional validation was conducted on selected tasks from the Medical Segmentation Decathlon (MSD). Results On the HECKTOR dataset, ES-UNet achieved a Dice Similarity Coefficient (DSC) of 76.87%, outperforming baseline models including 3D UNet, 3D UNet 3+, nnUNet, and Swin UNETR. Ablation studies showed that RSS and DWD contributed up to 1.22% and 1.06% improvement in DSC, respectively. A sensitivity analysis demonstrated that the chosen scaling range in RSS offered a favorable trade-off between deformation and anatomical plausibility. Cross-dataset evaluation on MSD Heart and Spleen tasks also indicated strong generalization. Computational analysis revealed that ES-UNet achieves superior segmentation performance with moderate computational demands. Specifically, the enhanced skip connection design with lightweight channel attention modules integrated throughout the network architecture enables this favorable balance between high segmentation accuracy and computational efficiency. Conclusion ES-UNet integrates architectural and algorithmic improvements to achieve robust 3D medical image segmentation. While the framework incorporates established components, its core contributions lie in the optimized skip connection strategy and supporting techniques like RSS and DWD. Future work will explore adaptive scaling strategies and broader validation across diverse imaging modalities.
format Article
id doaj-art-c6f71aea8b0742758b76cedae95dffde
institution Kabale University
issn 1471-2342
language English
publishDate 2025-08-01
publisher BMC
record_format Article
series BMC Medical Imaging
spelling doaj-art-c6f71aea8b0742758b76cedae95dffde2025-08-20T03:46:20ZengBMCBMC Medical Imaging1471-23422025-08-0125112310.1186/s12880-025-01857-0ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNetMinyoung Park0Seungtaek Oh1Junyoung Park2Taikyeong Jeong3Sungwook Yu4School of Electrical and Electronics Engineering, Chung-Ang UniversitySchool of Electrical and Electronics Engineering, Chung-Ang UniversitySchool of Electrical and Electronics Engineering, Chung-Ang UniversitySchool of Artificial Intelligence Convergence, Hallym UniversitySchool of Electrical and Electronics Engineering, Chung-Ang UniversityAbstract Background Deep learning has significantly advanced medical image analysis, particularly in semantic segmentation, which is essential for clinical decisions. However, existing 3D segmentation models, like the traditional 3D UNet, face challenges in balancing computational efficiency and accuracy when processing volumetric medical data. This study aims to develop an improved architecture for 3D medical image segmentation with enhanced learning strategies to improve accuracy and address challenges related to limited training data. Methods We propose ES-UNet, a 3D segmentation architecture that achieves superior segmentation performance while offering competitive efficiency across multiple computational metrics, including memory usage, inference time, and parameter count. The model builds upon the full-scale skip connection design of UNet3+ by integrating channel attention modules into each encoder-to-decoder path and incorporating full-scale deep supervision to enhance multi-resolution feature learning. We further introduce Region Specific Scaling (RSS), a data augmentation method that adaptively applies geometric transformations to annotated regions, and a Dynamically Weighted Dice (DWD) loss to improve the balance between precision and recall. The model was evaluated on the MICCAI HECKTOR dataset, and additional validation was conducted on selected tasks from the Medical Segmentation Decathlon (MSD). Results On the HECKTOR dataset, ES-UNet achieved a Dice Similarity Coefficient (DSC) of 76.87%, outperforming baseline models including 3D UNet, 3D UNet 3+, nnUNet, and Swin UNETR. Ablation studies showed that RSS and DWD contributed up to 1.22% and 1.06% improvement in DSC, respectively. A sensitivity analysis demonstrated that the chosen scaling range in RSS offered a favorable trade-off between deformation and anatomical plausibility. Cross-dataset evaluation on MSD Heart and Spleen tasks also indicated strong generalization. Computational analysis revealed that ES-UNet achieves superior segmentation performance with moderate computational demands. Specifically, the enhanced skip connection design with lightweight channel attention modules integrated throughout the network architecture enables this favorable balance between high segmentation accuracy and computational efficiency. Conclusion ES-UNet integrates architectural and algorithmic improvements to achieve robust 3D medical image segmentation. While the framework incorporates established components, its core contributions lie in the optimized skip connection strategy and supporting techniques like RSS and DWD. Future work will explore adaptive scaling strategies and broader validation across diverse imaging modalities.https://doi.org/10.1186/s12880-025-01857-03D medical image segmentationTumor segmentationData augmentation in medical imagingUNet
spellingShingle Minyoung Park
Seungtaek Oh
Junyoung Park
Taikyeong Jeong
Sungwook Yu
ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNet
BMC Medical Imaging
3D medical image segmentation
Tumor segmentation
Data augmentation in medical imaging
UNet
title ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNet
title_full ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNet
title_fullStr ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNet
title_full_unstemmed ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNet
title_short ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNet
title_sort es unet efficient 3d medical image segmentation with enhanced skip connections in 3d unet
topic 3D medical image segmentation
Tumor segmentation
Data augmentation in medical imaging
UNet
url https://doi.org/10.1186/s12880-025-01857-0
work_keys_str_mv AT minyoungpark esunetefficient3dmedicalimagesegmentationwithenhancedskipconnectionsin3dunet
AT seungtaekoh esunetefficient3dmedicalimagesegmentationwithenhancedskipconnectionsin3dunet
AT junyoungpark esunetefficient3dmedicalimagesegmentationwithenhancedskipconnectionsin3dunet
AT taikyeongjeong esunetefficient3dmedicalimagesegmentationwithenhancedskipconnectionsin3dunet
AT sungwookyu esunetefficient3dmedicalimagesegmentationwithenhancedskipconnectionsin3dunet